Highlights
- Pro
Stars
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
[ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"
[ICLR 2024] Lemur: Open Foundation Models for Language Agents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Paper collection on building and evaluating language model agents via executable language grounding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"
[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"
Author implementation of the paper "Representing Schema Structure with Graph Neural Networks for Text-to-SQL Parsing"
Repository for "Generating Sentences by Editing Prototypes"
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Simple examples to introduce PyTorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Global-Locally Self-Attentive Dialogue State Tracker
Cool links & research papers related to Machine Learning applied to source code (MLonCode)
Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations"