# Scenario-Aware Pareto Optimization for Limited-Round Unknown-Opponent Competition

**PDCAT 2025** | International Conference on Parallel and Distributed Computing, Applications and Technologies
This project implements a scenario-aware Pareto optimization method for limited-round competition against unknown opponents. The approach constructs a space of candidate opponent models and uses scenario-aware Pareto optimization to train agents, improving performance against previously unseen opponents.
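The selection idea behind Pareto optimization over scenarios can be illustrated with a minimal dominance filter. The sketch below is purely illustrative and is not the repository's code: the `payoffs` matrix and function name are assumptions, with each row a candidate policy and each column a scenario, and the non-dominated rows forming the Pareto front.

```python
import numpy as np

def pareto_front(payoffs: np.ndarray) -> np.ndarray:
    """Return indices of non-dominated rows.

    payoffs[i, j] is the (higher-is-better) payoff of candidate policy i
    in scenario j. A row is dominated if some other row is >= everywhere
    and > in at least one scenario.
    """
    n = payoffs.shape[0]
    keep = []
    for i in range(n):
        dominated = False
        for j in range(n):
            if j == i:
                continue
            if np.all(payoffs[j] >= payoffs[i]) and np.any(payoffs[j] > payoffs[i]):
                dominated = True
                break
        if not dominated:
            keep.append(i)
    return np.array(keep)

# Toy example: 4 candidate policies evaluated in 3 opponent scenarios.
payoffs = np.array([
    [0.8, 0.2, 0.5],
    [0.6, 0.6, 0.6],
    [0.7, 0.1, 0.4],   # dominated by the first row
    [0.3, 0.9, 0.2],
])
print(pareto_front(payoffs))  # -> [0 1 3]
```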
## Project Structure

```
DPOA/
├── get_opp.py        # Step 1: Generate true opponent models
├── get_candidate.py  # Step 2: Generate candidate opponent models
├── expand_model.py   # Step 3: Train agent models
├── condense.py       # Step 4: Condense the model space
├── ILVE.py           # Step 5: Run experiments
├── DQN_model.py      # DQN model implementation
├── Mnt.py            # Environment implementation
└── no_sensor/        # No-sensor related modules
    ├── belief_filter.py
    └── ...
```
## Installation

Install the dependencies:

```bash
pip install numpy torch tqdm
```

Clone the repository:

```bash
git clone https://github.com/your-username/DPOA.git
cd DPOA
```

## Usage

Execute the following steps in order.
### Step 1: Generate True Opponent Models

Generate the true opponent models used for testing:

```bash
python get_opp.py
```

Models will be saved to the `single_models/` directory.
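To sanity-check the generated files, a saved model can be loaded back for inspection. This is a minimal sketch assuming standard `pickle` serialization and the `single_models/{itera_num}/` layout documented below; `get_opp.py` may actually serialize models differently (e.g., via `torch.save`).

```python
import pickle
from pathlib import Path

# Assumed location, following the documented single_models/{itera_num}/ layout
# (here itera_num = 3, matching the example used in Step 2).
model_path = Path("single_models") / "3" / "sub_model_term.pkl"

with model_path.open("rb") as f:
    model = pickle.load(f)

print(type(model))  # inspect what kind of object was stored
```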
### Step 2: Generate Candidate Opponent Models

Generate a pool of candidate opponent models (possible opponent behavior models):

```bash
python get_candidate.py --num_candidate 10 --itera_num 3 --sub_num 1 --opp_num 1
```

Parameters:
| Parameter | Description | Default |
|---|---|---|
| `--num_candidate` | Number of candidate models | 1 |
| `--itera_num` | Number of iterations | 3 |
| `--sub_num` | Number of subject agents | 1 |
| `--opp_num` | Number of opponent agents | 1 |
| `--outset` | Starting model ID | 0 |
Models will be saved to the `candidate_model/` directory.
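If several disjoint batches of candidates are needed, the script can also be driven programmatically. The pattern below is an assumed usage example, not part of the repository: it uses the documented `--outset` flag to offset model IDs so that batches do not overwrite each other.

```python
import subprocess
import sys

NUM_PER_BATCH = 5  # assumed batch size, for illustration only

# Generate two batches of candidate models with non-overlapping IDs.
for batch in range(2):
    subprocess.run(
        [
            sys.executable, "get_candidate.py",
            "--num_candidate", str(NUM_PER_BATCH),
            "--itera_num", "3",
            "--sub_num", "1",
            "--opp_num", "1",
            "--outset", str(batch * NUM_PER_BATCH),
        ],
        check=True,
    )
```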
### Step 3: Train Agent Models

Train subject agents against the candidate opponent model set:

```bash
python expand_model.py
```

Configurable parameters in the file:

- `model_j_num`: Number of candidate models to use
- `max_frames`: Maximum number of training frames
- `explore_rate`: Exploration rate (see the epsilon-greedy sketch below)

Models will be saved to the `expand_model/` directory.
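The `explore_rate` parameter plays the role of the exploration probability in a standard epsilon-greedy policy, the usual scheme for DQN-style agents. The snippet below is a generic illustration, not the code in `expand_model.py` or `DQN_model.py`; the toy Q-network and dimensions are assumptions.

```python
import random

import torch
import torch.nn as nn

def select_action(q_network, state, explore_rate, num_actions):
    """Epsilon-greedy: explore with probability explore_rate,
    otherwise pick the action with the highest predicted Q-value."""
    if random.random() < explore_rate:
        return random.randrange(num_actions)
    with torch.no_grad():
        q_values = q_network(state.unsqueeze(0))  # add a batch dimension
        return int(q_values.argmax(dim=1).item())

# Toy usage with a dummy linear Q-network (4-dim state, 3 actions).
q_net = nn.Linear(4, 3)
state = torch.zeros(4)
action = select_action(q_net, state, explore_rate=0.1, num_actions=3)
print(action)
```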
### Step 4: Condense the Model Space

Condense the candidate model space using scenario-aware strategies:

```bash
python condense.py
```

This step selects models based on scenario similarity and outputs representative model sets at different scales; an illustrative selection procedure is sketched below.
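One common way to obtain representative subsets at several scales is greedy farthest-point selection over a pairwise distance matrix between model behaviors. The sketch below is an illustrative stand-in for the condensation logic in `condense.py`; the distance matrix and target sizes are assumptions.

```python
import numpy as np

def select_representatives(dist: np.ndarray, k: int) -> list[int]:
    """Greedy farthest-point selection of k representative models
    from a symmetric pairwise distance matrix."""
    chosen = [int(dist.sum(axis=1).argmax())]  # start with the most "spread out" model
    while len(chosen) < k:
        # Pick the model farthest from its nearest already-chosen representative.
        nearest = dist[:, chosen].min(axis=1)
        nearest[chosen] = -np.inf  # never re-pick an already chosen model
        chosen.append(int(nearest.argmax()))
    return chosen

# Toy example: 6 candidate models with random behavioural distances.
rng = np.random.default_rng(0)
pts = rng.random((6, 3))
dist = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
for k in (2, 3, 4):  # analogous to different compression levels
    print(k, select_representatives(dist, k))
```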
### Step 5: Run Experiments

Run the complete online learning and evaluation experiment:

```bash
python ILVE.py
```

The script supports multiple prediction modes (`predict_mode`):

- `0`: No opponent prediction
- `1`: Multi-model voting prediction (MTA)
- `2`: Bayesian belief filter (POMDP); see the belief-update sketch below
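For `predict_mode` 2, the belief over candidate opponent models is typically maintained with a Bayesian update: each candidate's posterior weight is its prior weight times the likelihood it assigned to the opponent action actually observed. The sketch below illustrates this generic update only; it is not the implementation in `no_sensor/belief_filter.py`, and the likelihood values are assumed stand-ins.

```python
import numpy as np

def belief_update(belief: np.ndarray, likelihoods: np.ndarray) -> np.ndarray:
    """One Bayesian filtering step.

    belief[i]      -- prior probability that candidate model i is the true opponent
    likelihoods[i] -- P(observed opponent action | candidate model i)
    """
    posterior = belief * likelihoods
    total = posterior.sum()
    if total == 0:  # no candidate explains the observation; fall back to uniform
        return np.full_like(belief, 1.0 / len(belief))
    return posterior / total

# Toy example with 3 candidate models.
belief = np.array([1 / 3, 1 / 3, 1 / 3])
likelihoods = np.array([0.7, 0.2, 0.1])  # how well each model predicted the last action
belief = belief_update(belief, likelihoods)
print(belief)  # probability mass shifts toward the first candidate
```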
## Configuration

Main configuration parameters are located in the respective script files:

| Script | Parameter | Description |
|---|---|---|
| `ILVE.py` | `max_trials` | Maximum number of trials |
| `ILVE.py` | `predict_mode` | Prediction mode (0/1/2) |
| `ILVE.py` | `predict_num` | Number of prediction models |
| `expand_model.py` | `max_frames` | Maximum number of training frames |
| `condense.py` | `q`, `w`, `e` | Target model counts for the different compression levels |
## Output Structure

```
DPOA/
├── single_models/      # True opponent models
│   └── {itera_num}/
│       ├── sub_model_term.pkl
│       └── opp_model_{frames}.pkl
├── candidate_model/    # Candidate opponent models
│   └── {itera_num}/
│       └── id_{m}.pkl
└── expand_model/       # Trained agent models
    └── {model_j_num}/
        └── sub_model.pkl
```
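For downstream analysis it can be convenient to enumerate the produced model files. The helper below is a small assumed utility that only mirrors the layout documented above; it is not part of the repository.

```python
from pathlib import Path

def list_output_models(root: str = ".") -> dict[str, list[Path]]:
    """Collect model files following the documented output layout."""
    base = Path(root)
    return {
        "true_opponents": sorted((base / "single_models").glob("*/opp_model_*.pkl")),
        "candidates": sorted((base / "candidate_model").glob("*/id_*.pkl")),
        "trained_agents": sorted((base / "expand_model").glob("*/sub_model.pkl")),
    }

for kind, paths in list_output_models().items():
    print(kind, len(paths))
```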
## Citation

If you use this project in your research, please cite:

```bibtex
@inproceedings{chen2025scenario,
  title={Scenario-Aware Pareto Optimization for Limited-Round Unknown-Opponent Competition},
  author={Chen, Fanke and Mei, JiaKang and Liu, Zongyang and Pan, Yinghui},
  booktitle={International Conference on Parallel and Distributed Computing, Applications and Technologies},
  year={2025},
  organization={Springer}
}
```

## License

This project is licensed under the MIT License; see the LICENSE file for details.
Made with ❤️ by SZU