Skip to content
View JacobKong's full-sized avatar

Block or report JacobKong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
44 stars written in Jupyter Notebook
Clear filter

A latent text-to-image diffusion model

Jupyter Notebook 68,192 10,151 Updated Jun 18, 2024

Google Research

Jupyter Notebook 34,183 7,893 Updated Oct 31, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,675 3,291 Updated Jul 23, 2024

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 13,192 2,596 Updated Oct 26, 2024

PRML algorithms implemented in Python

Jupyter Notebook 11,441 3,252 Updated Sep 27, 2024
Jupyter Notebook 10,358 1,285 Updated May 21, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,852 966 Updated Oct 11, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 5,211 337 Updated Jun 28, 2024

links to conference publications in graph-based deep learning

Jupyter Notebook 4,795 774 Updated Oct 9, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,777 637 Updated Aug 5, 2024

Single Shot MultiBox Detector in TensorFlow

Jupyter Notebook 4,113 1,887 Updated Aug 12, 2021

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

Jupyter Notebook 3,257 334 Updated Mar 3, 2024

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook 2,398 209 Updated Apr 15, 2024

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Jupyter Notebook 1,787 241 Updated Jan 24, 2024
Jupyter Notebook 1,671 161 Updated Sep 27, 2024

Text To Video Synthesis Colab

Jupyter Notebook 1,453 175 Updated Mar 28, 2024

Simple image captioning model

Jupyter Notebook 1,307 216 Updated Jun 9, 2024

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

Jupyter Notebook 1,290 88 Updated Oct 18, 2022

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

Jupyter Notebook 1,148 177 Updated Oct 27, 2023

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Jupyter Notebook 1,108 106 Updated Aug 14, 2023
Jupyter Notebook 1,085 485 Updated Oct 26, 2023

This repository is intended to host tools and demos for ActivityNet

Jupyter Notebook 941 330 Updated Mar 21, 2024
Jupyter Notebook 794 159 Updated Aug 17, 2024

OpenAI CLIP text encoders for multiple languages!

Jupyter Notebook 759 71 Updated May 15, 2023

Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".

Jupyter Notebook 738 110 Updated May 22, 2023
Jupyter Notebook 571 52 Updated Sep 17, 2024

The source code for paper "Deep Image Spatial Transformation for Person Image Generation"

Jupyter Notebook 566 84 Updated Sep 3, 2022

Code for CVPR'19 paper Linkage-based Face Clustering via GCN

Jupyter Notebook 360 86 Updated Dec 2, 2021

DiscoDiffusion Warp

Jupyter Notebook 326 42 Updated Aug 30, 2024

code for R-C3D

Jupyter Notebook 254 94 Updated Dec 22, 2019
Next