lmarena

An Open Platform for Crowdsourced AI Benchmarking

Popular repositories

  1. arena-hard-auto (Public)

    Arena-Hard-Auto: An automatic LLM benchmark.

    Python, 939 stars, 128 forks

  2. copilot-arena (Public)

    TypeScript, 330 stars, 22 forks

  3. p2l (Public)

    Prompt-to-Leaderboard

    Python, 258 stars, 22 forks

  4. PPE (Public)

    Jupyter Notebook, 52 stars, 12 forks

  5. search-arena (Public)

    ⚔️ Official code of "Search Arena: Analyzing Search-Augmented LLMs".

    Jupyter Notebook, 33 stars, 5 forks

  6. lmarena.github.io (Public)

    HTML, 12 stars, 14 forks

Repositories

Showing 10 of 10 repositories
