A simple framework for experimenting with Reinforcement Learning in Python.
A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning (DRL) for Mobile Edge Computing (MEC) | This algorithm captures the dynamics of the MEC environment by integrating the Dueling Double Deep Q-Network (D3QN) model with Long Short-Term Memory (LSTM) networks.
Online Replanning in Belief Space for Partially Observable Task and Motion Problems
Agent Git: Agent Version Control, Open-Branching, and Reinforcement Learning MDP for Agentic AI. A Standalone Agentic AI Infrastructure Layer for LangGraph Ecosystems
Makes it easy to specify simple MDPs that are compatible with OpenAI Gym.
Feature selection for maximizing expected cumulative reward
Using reinforcement learning and genetic algorithms to improve traffic flow and reduce vehicle waiting times in a single-lane two-way junction simulator by coordinating traffic signal schedules.
Probabilistic planning in continuous state-action MDPs in TensorFlow.
Pathfinding Using Reinforcement Learning
MDP-ProbLog is a framework to represent and solve (infinite-horizon) MDPs specified by probabilistic logic programming.
Hosts RDDL domain and instance files covering problems from a wide range of disciplines, with integration into the pyRDDLGym ecosystem.
Course projects for Principles & Applications of Artificial Intelligence at Amirkabir University of Technology.
In-process and post-process methods for optimizing explanation paths based on newly defined quantitative explanation metrics.
Fundamentals of AI course projects focusing on search, multi-agent systems, MDPs, and reinforcement learning algorithms.
Sample projects to learn reinforcement learning and deep reinforcement learning in practice.
Implemented reinforcement learning algorithms, including Value-Iteration and Q-Learning, for a 2D grid world Markov Decision Process resembling a Pac-man game. Also applied the Mini-Max algorithm and common path-planning techniques such as A*, Dijkstra, and bidirectional search.
A grid world simulation environment
Mathematical implementations of robotics algorithms such as MDP, EKF, and RRT.
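Several of the projects above mention value iteration on grid-world MDPs. For readers new to the topic, here is a minimal sketch of that technique on a tiny deterministic corridor. The names `corridor` and `value_iteration` and the `(prob, next_state, reward)` layout are assumptions made for illustration only; they are not taken from any of the listed repositories.

```python
def corridor(n=4):
    """Hypothetical 1-D grid world: states 0..n-1, the last one is a terminal goal."""
    goal = n - 1
    transitions = {goal: {}}  # terminal state has no actions
    for s in range(goal):
        left, right = max(s - 1, 0), s + 1
        transitions[s] = {
            # each action maps to a list of (probability, next_state, reward)
            "left": [(1.0, left, 0.0)],
            "right": [(1.0, right, 1.0 if right == goal else 0.0)],
        }
    return transitions


def value_iteration(transitions, gamma=0.9, tol=1e-10, max_iter=10_000):
    """Repeat Bellman optimality backups until values change by less than tol."""
    values = {s: 0.0 for s in transitions}

    def q(s, a):
        # expected return of taking action a in state s, then acting greedily
        return sum(p * (r + gamma * values[s2]) for p, s2, r in transitions[s][a])

    for _ in range(max_iter):
        delta = 0.0
        for s, actions in transitions.items():
            if not actions:  # skip terminal states
                continue
            best = max(q(s, a) for a in actions)
            delta = max(delta, abs(best - values[s]))
            values[s] = best  # in-place (Gauss-Seidel style) update
        if delta < tol:
            break

    # greedy policy with respect to the converged values
    policy = {
        s: max(actions, key=lambda a: q(s, a))
        for s, actions in transitions.items()
        if actions
    }
    return values, policy


values, policy = value_iteration(corridor())
print(values)
print(policy)
```

With a discount of 0.9, the value of each state should shrink by a factor of 0.9 per step of distance from the goal, and the greedy policy should move right everywhere.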