🥹
Meow
An interdisciplinary deep learning specialist and law school student from Taiwan. My ultimate goal is to create value-function-free reinforcement learning AI.
Pinned Loading
-
Reversal-Generative-Reinforcement-Learning
Reversal-Generative-Reinforcement-Learning PublicA simple model-free and value-function-free reinforcement learning model
Python 3
-
Cullama
Cullama PublicCustomize Your Own Llama Attention! No peft, no lora, only customizing your own llama attention to whatever you want!
Jupyter Notebook
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.