Skip to content
View minqi's full-sized avatar

Organizations

@uclnlp @lucidalabs @ucl-dark @FLAIROx

Block or report minqi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. facebookresearch/llm-speedrunner facebookresearch/llm-speedrunner Public

    The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in language modeling.

    Jupyter Notebook 103 6

  2. facebookresearch/minimax facebookresearch/minimax Public

    Efficient baselines for autocurricula in JAX.

    Python 196 16

  3. facebookresearch/dcd facebookresearch/dcd Public archive

    Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.

    Python 135 29

  4. facebookresearch/level-replay facebookresearch/level-replay Public archive

    This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to le…

    Python 92 16

  5. learning-to-communicate-pytorch learning-to-communicate-pytorch Public

    Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

    Python 357 79

  6. facebookresearch/minihack facebookresearch/minihack Public archive

    MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

    Python 502 67