You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Adaptive risk-aware reinforcement learning framework for time-constrained navigation tasks. This MSc thesis studies how an RL agent can adapt its risk attitude when mission time changes, using a hierarchical controller that switches from a risk-aware CVaR-constrained policy to a risk-seeking policy when time pressure makes switching benificial.
A Reinforcement Learning MVP (Minimum Viable Product) for Condition-Based Maintenance (CBM) using industrial equipment temperature sensor data. This project implements a sophisticated QR-DQN (Quantile Regression Deep Q-Network) agent to learn optimal maintenance policies balancing risk mitigation and cost minimization.