Distrbuted policy learning for DMPC Project

This repository contains material related to the project on Distributed policy learning for DMPC.

DLPC_training: Implement the code to verify the convergence condition of actor-critic learning and the closed-stability condition under actor-critic learning in the receding horizon control framework. The code is implemented in Matlab.
DLPC_xtdrone: Deploy the control policy to control a number of multirotor drones in the Gazebo. This part is based on XTDrone, PX4, and MAVROS, containing materials related to XTDrone project. The code is implemented in Python.
- DLPC_xtdrone6: Control 6 multirotor drones to realize formation control and transformation.
- DLPC_xtdrone18: Control 18 multirotor drones to realize formation control and transformation.
DLPC_solving_one_robot_control: Implement the code to solve the centralized control problem of one robot distributedly and compare it with the centralized version. The code is implemented in Matlab.
dlpc_online_train_scales_to_10000.py: The Python code for online training of DLPC.
dlpc_online_train_scales_to_10000.m: The matlab code for online training of DLPC.

Dependencies

There is no dependency for DLPC_training within Matlab. As for DLPC_xtdrone, please follow the instructions in XTDrone project to complete the environment installation and basic configuration.

Run DLPC_xtdrone

To run the code in this repository, follow the instructions below.

Load worlds and drones.
```
roslaunch multi_vehicle.launch
```
Obtain the position information of drones. Replace 6 with the number in the name of the selected file folder.
```
python3 get_local_pose.py iris 6
```
Build the communication network among drones.
```
multi_vehicle_communication.sh
```
Keyboard control code.
```
python3 multirotor_keyboard_control_promotion.py
```
*Use the keyboard to control all drones to take off and press ‘s’ to hover after a desired height. Then press ‘g’ to enter leader control mode.
Run the DLPC code for formation control.
```
run_formation_promotion.sh
```
*Note: When the script is running, and the drones are stationary, switch to the keyboard control terminal to press ‘w’ to give the leader a specified velocity. After the drones achieve the specified formation, press ‘f’ or ‘h’ to turn or press numbers 0-9 to change the formation.
run the baseline controller for comparison in a straight-line formation scenario.
```
run_formation_baseline.sh
```
Run the following script for the figure plot.
```
python3 draw_figure.py
```

Reference

Please cite the following reference:

[1] Xinglong Zhang, et al. "Toward Scalable Multirobot Control: Fast Policy Learning in Distributed MPC." IEEE Transactions on Robotics, 41 (2025).

Name		Name	Last commit message	Last commit date
Latest commit History 148 Commits
.github/workflows		.github/workflows
DSLC_solving_one_robot_control		DSLC_solving_one_robot_control
DSLC_training		DSLC_training
DSLC_xtdrone18		DSLC_xtdrone18
DSLC_xtdrone6		DSLC_xtdrone6
.gitignore		.gitignore
CODEOWNERS		CODEOWNERS
LICENSE		LICENSE
README.md		README.md
dlpc_online_train_scales_to_10000.m		dlpc_online_train_scales_to_10000.m
dlpc_online_train_scales_to_10000.py		dlpc_online_train_scales_to_10000.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Distrbuted policy learning for DMPC Project

Table of Contents

Tutorials

Dependencies

Run DLPC_xtdrone

Reference

About

Uh oh!

Releases

Packages

Languages

License

xinglongzhangnudt/policy-learning-for-distributed-mpc

Folders and files

Latest commit

History

Repository files navigation

Distrbuted policy learning for DMPC Project

Table of Contents

Tutorials

Dependencies

Run DLPC_xtdrone

Reference

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages