Robust Deep Monte Carlo Counterfactual Regret Minimization: Addressing Theoretical Risks in Neural Fictitious Self-Play
reinforcement-learning poker deep-learning monte-carlo solver neural-networks reinforcement-learning-algorithms clipping variance-reduction robustness target-network extensive-games counterfactual-regret-minimization regret-minimization poker-solver deep-cfr mccfr outcome-sampling deep-mccfr
Updated Sep 9, 2025 - Python