Skip to content
View Dev1nW's full-sized avatar

Block or report Dev1nW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Dev1nW/README.md

πŸ‘‹ Hi, I'm Devin!

I'm a Machine Learning Research Fellow with the Army Educational Outreach Program.

πŸ‘€ My Interests:

  • Reinforcement Learning from Human Feedback (RLHF) and Alignment: Specifically focusing on using ratings, as in Rating-Based Reinforcement Learning.
  • Natural Language Processing (NLP): I’m actively exploring new research, including the use of LLMs (Large Language Models) in playing Atari games, as in Atari-GPT.

πŸ“š Recent Work:

  • Atari-GPT: Investigating the capabilities of LLMs as low-level policies in Atari environments. Check out the research here.
  • Rating-Based Reinforcement Learning: An alternative to traditional preference based reinforcement learning in which users provide ratings to learn a reward model. Read more about it here.
  • Optimal Time-Constrained Intercept Guidance: A project applying deep reinforcement learning to guidance systems. You can explore the paper here.

πŸ“« Connect with me:

Google Scholar GitHub LinkedIn

Pinned Loading

  1. Rating-based-Reinforcement-Learning Rating-based-Reinforcement-Learning Public

    Official Codebase for Rating-Based Reinforcement Learning.

    Python 2

  2. Gemini_Chatbot Gemini_Chatbot Public

    A Chatbot complete with a GUI using the Gemini API.

    Python 1

  3. atari-gpt atari-gpt Public

    Forked from nwayt001/atari-gpt

    Official Codebase for Atari-GPT

    Python

  4. Sign_language_recognition_final_project Sign_language_recognition_final_project Public

    Sign Language Recognition code using GRU, LSTM and Simple RNN.

    Jupyter Notebook 1

  5. Adversarial-MARL Adversarial-MARL Public

    Study of Adversarial Multi-Agent Reinforcement Learning.

    Python 2

  6. Stock-Predictor-App Stock-Predictor-App Public

    Stock Predictor App using Tkinter and LSTM.

    Python