Research Scientist at Hugging Face.
- Hugging Face
- Lyon
- https://edbeeching.github.io/
- @edwardbeeching
Pinned
- huggingface/open-r1 (Public): Fully open reproduction of DeepSeek-R1
- huggingface/trl (Public): Train transformer language models with reinforcement learning.
- huggingface/alignment-handbook (Public): Robust recipes to align language models with human and AI preferences
- godot_rl_agents (Public): An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents
- alex-petrenko/sample-factory (Public): High throughput synchronous and asynchronous reinforcement learning





