[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
-
Updated
Oct 20, 2024 - Jupyter Notebook
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
A Survey on Data Selection for Language Models
Code for Generative Deduplication For Socia Media Data Selection (Findings of EMNLP 2024)
This repo contains the code for "Privacy Preserving Data Selection for Bias Mitigation in Speech Models"
A Python package for studying neural learning
[ACL 2023] The code for our ACL'23 paper Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Propagation Approach
CORE: Mitigating Catastrophic Forgetting in Continual Learning through Cognitive Replay (CogSci 2024 Oral)
Enhancing Efficiency in Multidevice Federated Learning through Data Selection
DSIR large-scale data selection framework for language model training
⛔ [DEPRECATED] Adapt Transformer-based language models to new text domains
A quick-start project that helps you to perform different types of selection in Vue Grid and know about different modes of selection – Row, Cell and Both. This project contains code snippet about cell, checkbox and toggle selection, and the way to get row index of selected cells using row selection events.
A project to select only part of a PDF file. It's usefull when you want to extract informations with some python library like fitz.
Repository for the experiments in my paper accepted to the CLIN Journal: "Selecting Parallel In-domain Sentences for Neural Machine Translation Using Monolingual Texts"
InstructionGPT-4
Quilt: Robust Data Segment Selection against Concept Drifts (AAAI 2024)
This is an official repository for "Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources" (NeurIPS 2023).
Code for NeurIPS 2023 Paper (Imitation Learning from Imperfection: Theoretical Justifications and Algorithms)
NU Bootcamp Module 14
Enhanced spatio-temporal electric load forecasts with less data using active deep learning
Add a description, image, and links to the data-selection topic page so that developers can more easily learn about it.
To associate your repository with the data-selection topic, visit your repo's landing page and select "manage topics."