@redwoodresearch

Redwood Research

Popular repositories

  1. Easy-Transformer (Public)

    Forked from TransformerLensOrg/TransformerLens

    Python, 122 stars, 18 forks

  2. mlab (Public)

    Machine Learning for Alignment Bootcamp

    Jupyter Notebook, 77 stars, 41 forks

  3. alignment_faking_public (Public)

    Forked from rgreenblatt/model_organism_public

    Python, 73 stars, 13 forks

  4. rust_circuit_public (Public)

    Rust, 63 stars, 2 forks

  5. Text-Steganography-Benchmark (Public)

    Code for "Preventing Language Models From Hiding Their Reasoning", which evaluates defenses against LLM steganography.

    Python, 21 stars, 5 forks

  6. remix_public (Public)

    Python, 18 stars, 3 forks

Repositories

Showing 10 of 20 repositories
