Skip to content
View sarisel's full-sized avatar

Block or report sarisel

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts!

Python 189 14 Updated Jul 23, 2024

a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini

Jupyter Notebook 347 101 Updated Dec 21, 2023

AllenAI's post-training codebase

Python 2,787 358 Updated Mar 12, 2025

Code associated with Tuning Language Models by Proxy (Liu et al., 2024)

Python 106 16 Updated Mar 30, 2024

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

Python 1,235 62 Updated Oct 18, 2022

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 5,136 442 Updated Mar 11, 2025

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 8,998 1,100 Updated Oct 9, 2024

Simple, unified interface to multiple Generative AI providers

Python 11,608 1,127 Updated Mar 6, 2025

RS5M: a large-scale vision language dataset for remote sensing [TGRS]

Python 239 10 Updated Sep 29, 2024

Multimodal Large Language Models for Remote Sensing (RS-MLLMs): A Survey

209 6 Updated Jan 23, 2025
Jupyter Notebook 217 32 Updated Feb 19, 2022

Python logging made (stupidly) simple

Python 21,026 720 Updated Mar 1, 2025

Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)

Python 155 9 Updated Jul 30, 2024

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

384 20 Updated Feb 18, 2023

utilities for decoding deep representations (like sentence embeddings) back to text

Python 773 88 Updated Jan 24, 2025

Document Ranking with Large Language Models.

Python 116 14 Updated Mar 11, 2025

[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training

Python 1,142 142 Updated Dec 29, 2024

Fast computation of Krippendorff's alpha agreement measure in Python.

Python 139 15 Updated Jan 15, 2025

Official inference library for Mistral models

Jupyter Notebook 10,073 901 Updated Nov 12, 2024

[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning

Python 232 12 Updated Oct 31, 2023

Graph Neural Network Library for PyTorch

Python 21,996 3,779 Updated Mar 9, 2025

Large Language Models Are Reasoning Teachers (ACL 2023)

Jupyter Notebook 326 20 Updated Mar 7, 2025

Bayesian optimization in PyTorch

Jupyter Notebook 3,193 415 Updated Mar 11, 2025

Code for fine-tuning Platypus fam LLMs using LoRA

Python 628 60 Updated Feb 4, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 24,320 2,109 Updated Mar 12, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 17,359 1,440 Updated Feb 25, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 13,747 974 Updated Mar 9, 2025
Next