Created by Andrii Maksai at CVLAB, EPFL. This is an approach for training sequence models for multiple object tracking.
This work is released under the MIT License (refer to the LICENSE file for details).
- DukeMTMC dataset. Should be placed in
DukeMTMC/
folder. - Motchallenge devkit for computation of IDF metric. Should be placed in
external/motchallenge-devkit
folder, together with files currently present there. - Open-reid to use the approach with appearance features. Should be placed in
external/open-reid
folder, together with files currently present there.
-
(Optional)
- Train a ReID model in DukeMTMC dataset by running
external/open-reid/train.sh
.
- Train a ReID model in DukeMTMC dataset by running
-
Start process that will answer to requests for computing the appearance model by running
external/open-reid/run.sh
. -
Modify
run.sh
according to your needs - see next section. -
Start tensorboard on
runs/<experiment_name>
folder to observe statistics related to the experiment. Experiment name can be set inrun.sh
.
- Start the dataset generation procedure as
run.sh gen_dataset <cam_id>
for cameras with numbers ranging 1 to 8. - Start the training procedure on the generated dataset as
run.sh train <cam_id>
. - Start the evaluation procedure that will pick best model from the checkpoints generated during training by running
run.sh eval <cam_id>
. This could be done in parallel with training. - Start the inference procedure by running
run.sh infer <cam_id>
. Output will be generated inruns/<experiment_name>/summaries/infer/tracks_*
file in the DukeMTMC benchmark-comparable format.
-
dp_freq, dp_size, dt_size (l.9-11) define frequency of frames sampling (0.33 refers to 3 per second), size of the batch for training, and maximum number of missed detections between two detections (to limit the number of pairs of detections that could possibly belong to the same trajectory).
-
gendata_step (l.89). During dataset generation multiple runners in parallel run the latest verion of the model on the parts of the dataset, while one trainer gets all of the combined data. This value describes how many frames are assigned to one runner and affects number of runners and training time.
-
label_config.features (l.115) List of features to be used for the model. When
appr
feature is provided, open-reid is required. -
model_config (l.150) - parameters of the model.
-
experiment_name (l.187) - name of the experiment. All data related to the run will be located in
runs/<experiment_name>
. -
nms_config.nms_option - how to select which hypotheses to keep in multiple hypotheses tracking.
start
corresponds to having at most one hypothesis of length X starting at each detection, andstart-0.3-ignore
additionally filters all hypotheses with IDF < 0.3 (speeds the inference, possibly why reducing accuracy, see paper appendix). -
final_solution_config (l.178) - how to select the final set of hypotheses. score_cutoff corresponds to minimum value of IDF to be considered for final solution, and the bounding box overlap of any two solutions should be below iou cutoff.
-
l.266 - size of the batch to use for inference. As mentioned in the paper, it was found beneficial to train with batch of size 6 and infer with size 12.
If you use the code or compare to the results obtained with it on MOT15, MOT17, or DukeMTMC dataset (available on MOTChallenge website), please consider citing our paper.
Please contact andrii dot maksai at epfl dot ch for any related queries.