I am actively improving my knowledge through online resources. I am currently working on Multi-Agent Reinforcement Learning and Neural Network Optimization Using Genetic Algorithms. Here is a list of some of the projects I have completed.
This algorithm is an extension of actor-critic methods to continuous action spaces. These models are very powerful because they combine the best of value-based and policy-based methods in a single algorithm.
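Below is a minimal sketch of the two networks such an actor-critic agent typically uses for continuous control: a deterministic actor that maps states to actions and a critic that scores state-action pairs. The layer sizes and class names are illustrative assumptions, not the exact architecture from the repository.

```python
import torch
import torch.nn as nn

class Actor(nn.Module):
    """Maps a state to a deterministic continuous action in [-1, 1]."""
    def __init__(self, state_dim, action_dim, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, action_dim), nn.Tanh(),
        )

    def forward(self, state):
        return self.net(state)

class Critic(nn.Module):
    """Estimates Q(s, a) for a state-action pair."""
    def __init__(self, state_dim, action_dim, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, state, action):
        return self.net(torch.cat([state, action], dim=-1))
```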
Deep Q-Learning is one of the earliest and most powerful algorithms that combines the best of both worlds - Deep Learning and Reinforcement Learning. In this repository I have implemented the algorithm using PyTorch and tested it on an environment from Unity ML-Agents. If you want to train your own agent, you can follow the instructions in the repository.
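The core of DQN is the TD loss between the online network's Q-values and a bootstrapped target from a frozen target network. The sketch below shows that loss for one replay batch; the variable names and the batch layout are assumptions for illustration, not the repository's exact code.

```python
import torch
import torch.nn as nn

def dqn_loss(q_net, target_net, batch, gamma=0.99):
    """TD-error loss on a sampled replay batch.
    `batch` is assumed to hold tensors: states, actions, rewards,
    next_states, dones."""
    states, actions, rewards, next_states, dones = batch

    # Q(s, a) for the actions actually taken
    q_values = q_net(states).gather(1, actions.unsqueeze(1)).squeeze(1)

    # Bootstrapped target from the frozen target network
    with torch.no_grad():
        next_q = target_net(next_states).max(dim=1).values
        targets = rewards + gamma * next_q * (1 - dones)

    return nn.functional.mse_loss(q_values, targets)
```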
Temporal Difference methods are a model-free approach to learning from an environment; no prior knowledge of the environment's dynamics is required. This repository contains implementations of three TD methods - SARSA, Q-Learning, and Expected SARSA - tested on the 'CliffWalking-v0' environment from OpenAI Gym.
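For reference, here is a minimal sketch of the three tabular update rules, assuming `Q` is a NumPy array of shape `(n_states, n_actions)` (for CliffWalking-v0 that would be `np.zeros((48, 4))`). Hyperparameter values are illustrative.

```python
import numpy as np

def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=1.0):
    """Off-policy Q-Learning: bootstrap from the greedy action in s_next."""
    Q[s, a] += alpha * (r + gamma * np.max(Q[s_next]) - Q[s, a])

def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=1.0):
    """On-policy SARSA: bootstrap from the action actually taken in s_next."""
    Q[s, a] += alpha * (r + gamma * Q[s_next, a_next] - Q[s, a])

def expected_sarsa_update(Q, s, a, r, s_next, epsilon=0.1, alpha=0.1, gamma=1.0):
    """Expected SARSA: bootstrap from the expected value of Q[s_next]
    under an epsilon-greedy policy."""
    n_actions = Q.shape[1]
    probs = np.full(n_actions, epsilon / n_actions)
    probs[np.argmax(Q[s_next])] += 1 - epsilon
    expected = np.dot(probs, Q[s_next])
    Q[s, a] += alpha * (r + gamma * expected - Q[s, a])
```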
The emojifier model makes text more expressive. Rather than writing "Congratulations on the promotion! Let's get coffee and talk. Love you!", the emojifier can automatically turn it into "Congratulations on the promotion! 👍 Let's get coffee and talk. ☕️ Love you! ❤️". The code in this repository contains a simple Emojifier model trained with word embeddings and an LSTM network.
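A sketch of such a model is shown below: a sentence is tokenized, embedded, passed through an LSTM, and the final hidden state is classified into one of a small set of emoji labels. The dimensions and class name are assumptions; in practice the embedding layer would usually be initialized from pretrained word vectors such as GloVe.

```python
import torch
import torch.nn as nn

class Emojifier(nn.Module):
    """Embeds a tokenized sentence, runs it through an LSTM, and
    classifies it into one of `n_emojis` emoji labels."""
    def __init__(self, vocab_size, embed_dim=50, hidden_dim=128, n_emojis=5):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.classifier = nn.Linear(hidden_dim, n_emojis)

    def forward(self, token_ids):
        embedded = self.embedding(token_ids)   # (batch, seq_len, embed_dim)
        _, (h_n, _) = self.lstm(embedded)      # final hidden state
        return self.classifier(h_n[-1])        # logits over emoji classes
```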
The Deep Cross Entropy Method is a very simple yet very effective method. You generate episodes with your current agent, keep the observations and actions from the episodes that yielded the best returns, train the agent on that elite data, and repeat until convergence. That's how simple it is!
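The elite-selection step is the heart of the method. Here is a small sketch of it, assuming each episode is stored as a dict with `states`, `actions`, and a total `reward` (that data layout and the percentile value are illustrative assumptions).

```python
import numpy as np

def select_elites(episodes, percentile=70):
    """Keep the (state, action) pairs from episodes whose total reward
    is above the given percentile; the agent is then trained on them."""
    rewards = np.array([ep["reward"] for ep in episodes])
    threshold = np.percentile(rewards, percentile)
    elite_states, elite_actions = [], []
    for ep in episodes:
        if ep["reward"] >= threshold:
            elite_states.extend(ep["states"])
            elite_actions.extend(ep["actions"])
    return elite_states, elite_actions
```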
Monte Carlo methods search for optimal solutions through repeated random sampling of complete episodes. In this repository I have implemented four types of MC methods.
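As a taste of the general idea, here is a sketch of first-visit MC prediction, which estimates V(s) by averaging the return observed the first time each state appears in an episode. The `[(state, reward), ...]` trajectory format is an assumption for illustration; it is not necessarily one of the four variants in the repository.

```python
from collections import defaultdict

def first_visit_mc_prediction(episodes, gamma=1.0):
    """Estimate V(s) from a list of trajectories of (state, reward) pairs."""
    returns = defaultdict(list)
    for episode in episodes:
        G = 0.0
        first_visit_return = {}
        # Walk backwards so the last write for a state holds the return
        # from its earliest (first) visit in the episode.
        for state, reward in reversed(episode):
            G = gamma * G + reward
            first_visit_return[state] = G
        for state, G_first in first_visit_return.items():
            returns[state].append(G_first)
    return {s: sum(rs) / len(rs) for s, rs in returns.items()}
```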