Skip to content
@AlignmentResearch

FAR.AI

Frontier alignment research to ensure the safe development and deployment of advanced AI systems.

Popular repositories Loading

  1. tuned-lens tuned-lens Public

    Tools for understanding how transformer predictions are built layer-by-layer

    Python 493 56

  2. go_attack go_attack Public

    Python 86 7

  3. vlmrm vlmrm Public

    Python 55 15

  4. gpt-4-novel-apis-attacks gpt-4-novel-apis-attacks Public

    20 1

  5. learned-planner learned-planner Public

    Interpretability tools for recurrent convolutional networks (DRC) that play Sokoban

    Python 13 4

  6. scaling-poisoning scaling-poisoning Public

    Python 8 2

Repositories

Showing 10 of 46 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…