Stars
Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts!
a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini
Code associated with Tuning Language Models by Proxy (Liu et al., 2024)
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Simple, unified interface to multiple Generative AI providers
RS5M: a large-scale vision language dataset for remote sensing [TGRS]
Multimodal Large Language Models for Remote Sensing (RS-MLLMs): A Survey
Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)
Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.
utilities for decoding deep representations (like sentence embeddings) back to text
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
Fast computation of Krippendorff's alpha agreement measure in Python.
Official inference library for Mistral models
[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
Graph Neural Network Library for PyTorch
Large Language Models Are Reasoning Teachers (ACL 2023)
Code for fine-tuning Platypus fam LLMs using LoRA
A generative world for general-purpose robotics & embodied AI learning.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.