🦙
Training
Reward Hacking @ Meta
- New York City
- https://www.threads.net/@zkwentz
- in/zkwentz
Pinned
- meta-pytorch/OpenEnv (Public): An interface library for RL post-training with environments.
- ralphy (Public, forked from michaelshimeles/ralphy): My Ralphy Wiggum setup, an autonomous bash script that runs Claude Code, Codex, OpenCode, Cursor agent, Qwen & Droid in a loop until your PRD is complete. Language: Shell.
- Netflix/pollyjs (Public): Record, replay, and stub HTTP interactions.




