JoshuaPurtell
💭 Working

Pinned

  1. Apropos Public

    A framework for rapidly building compound AI systems

    Python 2

  2. craftaxlm Public

    A wrapper around the Craftax agent benchmark, for evaluating digital agents over extremely long time horizons

    Python

  3. LRCBench Public

    Evals of language models' ability to reason over long contexts

    Python 8

  4. SmallBench Public

    Small, simple agent task environments for training and evaluation

    Python 14

  5. icl-bench Public

    Evaluating Language Models' Ability to Learn In Context

    Python

  6. jazyk Public

    Simple LM API for production

    Python