Stars
CTF Archives: Collection of CTF Challenges.
Small, simple agent task environments for training and evaluation
🧬 RegMix: Data Mixture as Regression for Language Model Pre-training
BigCodeBench: Benchmarking Code Generation Towards AGI
XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
A framework for the evaluation of autoregressive code generation language models.
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text
🐙 OctoPack: Instruction Tuning Code Large Language Models
Source Code Data Augmentation for Deep Learning: A Survey.
Home of StarCoder: fine-tuning & inference!
[EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code
💩 State-of-the-art shitcode principles your project should follow to call it a proper shitcode
PyArmadillo: an alternative approach to linear algebra in Python
A corpus and code for understanding norms and subjectivity. 🤖
Stacked hierarchical attention for text-based games
🦾 A list of reported app support for Apple Silicon as well as Apple M4 and M3 Ultra Macs
A curated list of fellowships for graduate students in Computer Science and related fields.
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
BLEURT is a metric for Natural Language Generation based on transfer learning.
Code associated with the "Natural Language Rationales with Full-Stack Visual Reasoning" EMNLP Findings 2020 paper
Code for the ACL 2020 paper "You Impress Me: Dialogue Generation via Mutual Persona Perception"
Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)
⛵️ The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
Code for using and evaluating SpanBERT.