Skip to content
@BatsResearch

Bats Research

We are a machine learning research group at Brown University. We work on improving the processes by which humans teach and instruct computers.

Pinned Loading

  1. trove trove Public

    A Flexible Toolkit for Dense Retrieval

    Python 32 2

  2. bonito bonito Public

    A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

    Python 774 48

  3. alfred alfred Public

    A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.

    Python 53 7

  4. csp csp Public

    Learning to compose soft prompts for compositional zero-shot learning.

    Python 88 5

Repositories

Showing 10 of 32 repositories
  • crosslingual-test-time-scaling Public

    Crosslingual Reasoning through Test-Time Scaling

    BatsResearch/crosslingual-test-time-scaling’s past year of commit activity
    Python 14 1 1 0 Updated May 13, 2025
  • trove Public

    A Flexible Toolkit for Dense Retrieval

    BatsResearch/trove’s past year of commit activity
    Python 32 Apache-2.0 2 0 0 Updated Apr 19, 2025
  • sycl Public

    Beyond Contrastive Learning: Synthetic Data Enables List-wise Training with Multiple Levels of Relevance

    BatsResearch/sycl’s past year of commit activity
    Python 8 0 0 0 Updated Apr 19, 2025
  • alfred Public

    A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.

    BatsResearch/alfred’s past year of commit activity
    Python 53 BSD-3-Clause 7 1 0 Updated Apr 3, 2025
  • cross-lingual-detox Public

    Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024

    BatsResearch/cross-lingual-detox’s past year of commit activity
    Jupyter Notebook 17 BSD-3-Clause 0 0 0 Updated Mar 25, 2025
  • BatsResearch/mazzetto-aistats2025-code’s past year of commit activity
    Python 0 0 0 0 Updated Mar 10, 2025
  • bonito Public

    A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

    BatsResearch/bonito’s past year of commit activity
    Python 774 BSD-3-Clause 48 5 1 Updated Feb 28, 2025
  • menghini-neurips23-code Public

    Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.

    BatsResearch/menghini-neurips23-code’s past year of commit activity
    Python 50 3 1 0 Updated Nov 8, 2024
  • planetarium Public

    Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL

    BatsResearch/planetarium’s past year of commit activity
    Python 51 BSD-3-Clause 3 1 0 Updated Oct 16, 2024
  • LexC-Gen-Data-Archive Public

    Data Repository for LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons

    BatsResearch/LexC-Gen-Data-Archive’s past year of commit activity
    1 1 0 0 Updated Oct 3, 2024

Top languages

Loading…

Most used topics

Loading…