Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.4k 575

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.2k 130

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.3k 231

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 10.1k 667

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 684 56

Repositories

Showing 10 of 494 repositories
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    allenai/olmocr’s past year of commit activity
    Python 10,087 Apache-2.0 667 61 17 Updated Mar 18, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 2,808 Apache-2.0 362 16 11 Updated Mar 18, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 155 Apache-2.0 28 1 22 Updated Mar 18, 2025
  • OLMo Public

    Modeling, training, eval, and inference code for OLMo

    allenai/OLMo’s past year of commit activity
    Python 5,396 Apache-2.0 575 46 53 Updated Mar 18, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    allenai/olmo-cookbook’s past year of commit activity
    Python 14 Apache-2.0 5 1 6 Updated Mar 18, 2025
  • ai2-scholarqa-lib Public

    Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library

    allenai/ai2-scholarqa-lib’s past year of commit activity
    Python 128 Apache-2.0 16 0 0 Updated Mar 18, 2025
  • SciRIFF Public

    Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.

    allenai/SciRIFF’s past year of commit activity
    Python 36 Apache-2.0 5 2 0 Updated Mar 18, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    allenai/ai2thor’s past year of commit activity
    C# 1,302 Apache-2.0 231 245 4 Updated Mar 14, 2025
  • OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    allenai/OLMoE’s past year of commit activity
    Jupyter Notebook 684 Apache-2.0 56 13 0 Updated Mar 14, 2025
  • lighthouse Public
    allenai/lighthouse’s past year of commit activity
    Python 0 Apache-2.0 0 0 0 Updated Mar 13, 2025