Skip to content

Open source implementation of InstructGPT (not finished)

License

Notifications You must be signed in to change notification settings

flippe3/chat-ltu

Repository files navigation

Update: This implementation is not finished and I will look to finish it once I have more time on my hand.

Chat-LTU

This is a chatbot project for the course D7058E at Luleå Univeristy of Technology. We try to implement something similar to Instruct-GPT or Chat-GPT mostly based on the papers and the rlhf blogpost from Huggingface.

Todo:

  • Implement PPO2 for faster RL fine-tuning.
  • Implement the website that is partially done to gather real human data.
  • Upload reward model and fine-tuned model to Huggingface for open source use.

About

Open source implementation of InstructGPT (not finished)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published