Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.5k 590

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.2k 133

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.3k 231

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 11k 743

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 703 62

Repositories

Showing 10 of 496 repositories
  • allenai/rslearn_projects’s past year of commit activity
    Python 7 Apache-2.0 2 6 3 Updated Apr 9, 2025
  • regmixer Public
    allenai/regmixer’s past year of commit activity
    Python 4 0 0 3 Updated Apr 9, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    allenai/olmo-cookbook’s past year of commit activity
    Python 21 Apache-2.0 5 1 6 Updated Apr 8, 2025
  • dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    allenai/dolma’s past year of commit activity
    Python 1,188 Apache-2.0 133 26 16 Updated Apr 9, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 188 Apache-2.0 34 1 19 Updated Apr 9, 2025
  • ccget Public

    Tools for an internal archive of some Common Crawl files

    allenai/ccget’s past year of commit activity
    Python 1 Apache-2.0 0 0 0 Updated Apr 9, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    allenai/olmocr’s past year of commit activity
    Python 11,001 Apache-2.0 743 71 17 Updated Apr 8, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    allenai/ai2thor’s past year of commit activity
    C# 1,332 Apache-2.0 231 248 4 Updated Apr 9, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 2,887 Apache-2.0 371 11 15 Updated Apr 8, 2025
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    allenai/rslearn’s past year of commit activity
    Python 31 Apache-2.0 2 8 4 Updated Apr 8, 2025