- Description
- Model Architecture
- Dataset
- Features
- Environment Requirements
- Quick Start
- Script Description
- Model Description
- Description of Random Situation
- ModelZoo Homepage
There has been remarkable progress on object detection and re-identification in recent years, which are the core components of multi-object tracking. However, little attention has been paid to accomplishing the two tasks in a single network to improve inference speed. Initial attempts along this path ended up with degraded results, mainly because the re-identification branch is not appropriately learned. In this work, we study the essential reasons behind the failure and accordingly present a simple baseline that addresses the problems. It remarkably outperforms the state of the art on the MOT challenge datasets at 30 FPS. This baseline could inspire and help evaluate new ideas in this field. More details about this model can be found in:
Paper: Zhang Y, Wang C, Wang X, et al. FairMOT: On the Fairness of Detection and Re-Identification in Multiple Object Tracking. 2020.
This repository contains a MindSpore implementation of FairMOT based upon the original PyTorch implementation (https://github.com/ifzhang/FairMOT). The training and validation scripts are also included, and the evaluation results are shown in the Performance section.
The overall network architecture of FairMOT is shown below:
Note that you can run the scripts using the dataset mentioned in the original paper or datasets widely used in this domain. In the following sections, we introduce how to run the scripts using the datasets below.
Datasets used: ETH, CalTech, MOT17, CUHK-SYSU, PRW, CityPersons
The mixed precision training method accelerates the deep learning neural network training process by using both single-precision and half-precision data formats, while maintaining the network accuracy achieved by single-precision training. Mixed precision training can accelerate the computation, reduce memory usage, and enable a larger model or batch size to be trained on specific hardware. For FP16 operators, if the input data type is FP32, the MindSpore backend will automatically handle it with reduced precision. Users can check the reduced-precision operators by enabling the INFO log and then searching for 'reduce precision'.
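A minimal sketch of enabling automatic mixed precision through the MindSpore `Model` wrapper is shown below. It is for illustration only: the network, loss function and optimizer are placeholders, not components of this repository, whose training scripts configure precision on their own.

```python
import mindspore.nn as nn
from mindspore import context
from mindspore.train import Model

# Illustrative sketch only: `net`, `loss_fn` and `optimizer` are placeholders,
# not components of this repository.
context.set_context(mode=context.GRAPH_MODE, device_target="Ascend")

net = nn.Dense(16, 8)
loss_fn = nn.SoftmaxCrossEntropyWithLogits(sparse=True)
optimizer = nn.Adam(net.trainable_params(), learning_rate=1e-3)

# amp_level="O3" casts the whole network to FP16; "O2" keeps BatchNorm in FP32.
model = Model(net, loss_fn=loss_fn, optimizer=optimizer, amp_level="O3")
```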
To run the Python scripts in the repository, you need to prepare the environment as follows:
- Python and dependencies
- Cython 0.29.23
- opencv-python 4.5.1.4
- cython-bbox 0.1.3
- sympy 1.7.1
- yacs
- numba
- progress
- motmetrics 1.2.0
- matplotlib 3.4.1
- lap 0.4.0
- openpyxl 3.0.7
- Pillow 8.1.0
- tensorboardX 2.2
- python 3.7
- mindspore 1.2.0
- pycocotools 2.0
- For more information, please check the resources below:
Some packages in requirements.txt need the Cython package to be installed first. For this reason, you should use the following commands to install the dependencies:
pip install Cython && pip install -r requirements.txt
In this repository, the FairMOT model is trained and validated on a mixed dataset. We use the same training data as JDE and call it "MIX". Please refer to their DATA ZOO to download and prepare all the training data, including Caltech Pedestrian, CityPersons, CUHK-SYSU, PRW, ETHZ, MOT17 and MOT16.
Configure the path to the dataset root in the data/data.json file.
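For illustration, the expected structure of this data config resembles the original FairMOT data configuration. The dataset names and paths below are placeholders, and the exact keys should match the data/data.json file shipped with this repository; a quick way to generate a skeleton is:

```python
import json

# Illustrative skeleton of the data config; paths and dataset names are placeholders.
data_cfg = {
    "root": "/path/to/datasets",                # common root of the MIX datasets
    "train": {
        "mot17": "./data/mot17.train",          # per-dataset list files of image paths
        "caltech": "./data/caltech.train",
        "citypersons": "./data/citypersons.train",
        "cuhksysu": "./data/cuhksysu.train",
        "prw": "./data/prw.train",
        "eth": "./data/eth.train"
    },
    "test": {
        "mot20": "./data/mot20.train"           # evaluation sequences
    }
}

with open("data/data.json", "w") as f:
    json.dump(data_cfg, f, indent=4)
```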
The baseline FairMOT model (DLA-34 backbone) is pretrained on the CrowdHuman dataset for 60 epochs with a self-supervised learning approach before being trained on the MIX dataset for 30 epochs.
The baseline model can be downloaded here: crowdhuman_dla34.pth [Google] [Baidu, code: ggzx].
Then you need to convert this model from .pth to .ckpt using the script src/utils/pth2ckpt.py:
# in root fairmot directory
python src/utils/pth2ckpt.py
Install the torch package using the following command to run the model conversion script:
pip install torch
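For reference, the core of a pth-to-ckpt conversion typically looks like the sketch below; the actual src/utils/pth2ckpt.py may additionally rename parameters (for example BatchNorm statistics) to match the MindSpore backbone definition.

```python
import torch
from mindspore import Tensor, save_checkpoint

# Minimal sketch of a PyTorch -> MindSpore checkpoint conversion.
# File names are examples; parameter renaming is omitted here.
state_dict = torch.load("crowdhuman_dla34.pth", map_location="cpu")
if "state_dict" in state_dict:
    # FairMOT checkpoints keep the weights under the "state_dict" key
    state_dict = state_dict["state_dict"]

params = [{"name": name, "data": Tensor(value.numpy())}
          for name, value in state_dict.items()]
save_checkpoint(params, "crowdhuman_dla34_ms.ckpt")
```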
To train the model, run the shell script scripts/run_standalone_train_ascend.sh or scripts/run_standalone_train_gpu.sh with the format below:
# standalone training on Ascend
bash scripts/run_standalone_train_ascend.sh DEVICE_ID DATA_CFG(optional) LOAD_PRE_MODEL(optional)
# standalone training on GPU
bash scripts/run_standalone_train_gpu.sh [config_file] [pretrained_model]
# distributed training on Ascend
bash scripts/run_distribute_train_ascend.sh RANK_SIZE DATA_CFG(optional) LOAD_PRE_MODEL(optional)
# distributed training on GPU
bash scripts/run_distribute_train_gpu.sh [DEVICE_NUM] [VISIBLE_DEVICES(0,1,2,3,4,5,6,7)] [config_file] [pretrained_model]
To validate the model, run the shell script scripts/run_eval.sh with the format below:
bash scripts/run_eval.sh [device] [config] [load_ckpt] [dataset_dir]
The structure of the files in this repository is shown below.
└─fairmot
├─scripts
│ ├─run_eval.sh // launch ascend standalone evaluation
│ ├─run_distribute_train_ascend.sh // launch ascend distributed training
│ ├─run_distribute_train_gpu.sh // launch gpu distributed training
│ ├─run_standalone_train_ascend.sh // launch ascend standalone training
│ └─run_standalone_train_gpu.sh // launch gpu standalone training
├─src
│ ├─tracker
│ │ ├─basetrack.py // basic tracker
│ │ ├─matching.py // calculating box distance
│ │ └─multitracker.py // JDETracker
│ ├─tracking_utils
│ │ ├─evaluation.py // evaluate tracking results
│ │ ├─kalman_filter.py // Kalman filter for tracking bounding boxes
│ │ ├─log.py // logging tools
│ │ ├─io.py // I/O tools
│ │ ├─timer.py // timing utilities for evaluation
│ │ ├─utils.py // check that the folder exists
│ │ └─visualization.py // display image tool
│ ├─utils
│ │ ├─callback.py // custom callback functions
│ │ ├─image.py // image processing
│ │ ├─jde.py // LoadImage
│ │ ├─logger.py // summary writer for logging
│ │ ├─lr_schedule.py // learning rate schedule generator
│ │ └─pth2ckpt.py // pth-to-ckpt checkpoint converter
│ │ └─tools.py // image processing tool
│ ├─fairmot_poase.py // WithLossCell
│ ├─losses.py // loss
│ ├─config.py // total config
│ ├─util.py // common routine operations
│ ├─infer_net.py // infer net
│ └─backbone_dla_conv.py // dla34_conv net
├─eval.py // eval fairmot
├─fairmot_run.py // run fairmot
├─train.py // train fairmot
├─fairmot_export.py // export fairmot
├─requirements.txt // pip requirements
├─default_config.yaml // default model configuration
└─README.md // descriptions about this repository
Run scripts/run_standalone_train_<device>.sh to train the model in standalone mode. The usage of the script is:
bash scripts/run_standalone_train_ascend.sh DEVICE_ID DATA_CFG LOAD_PRE_MODEL
For example, you can run the shell command below to launch the training procedure.
bash scripts/run_standalone_train_ascend.sh 0 ./dataset/ ./crowdhuman_dla34_ms.ckpt
bash scripts/run_standalone_train_gpu.sh [config_file] [pretrained_model]
For example, you can run the shell command below to launch the training procedure:
bash scripts/run_standalone_train_gpu.sh ./default_config.yaml ./crowdhuman_dla34_ms.ckpt
The model checkpoint will be saved into ./train/ckpt.
Run scripts/run_distribute_train_<device>.sh to train the model in distributed mode. The usage of the script is:
bash scripts/run_distribute_train_ascend.sh RANK_SIZE DATA_CFG LOAD_PRE_MODEL
For example, you can run the shell command below to launch the distributed training procedure.
bash scripts/run_distribute_train_ascend.sh 8 ./data.json ./crowdhuman_dla34_ms.ckpt
bash scripts/run_distribute_train_gpu.sh [DEVICE_NUM] [VISIBLE_DEVICES(0,1,2,3,4,5,6,7)] [config_file] [pretrained_model]
For example, you can run the shell command below to launch the distributed training procedure:
bash scripts/run_distribute_train_gpu.sh 8 0,1,2,3,4,5,6,7 ./default_config.yaml ./crowdhuman_dla34_ms.ckpt
The above shell script will run distributed training in the background. You can view the results through the file train/tran.log.
The model checkpoint will be saved into train/ckpt.
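To inspect or reuse a saved checkpoint outside the provided scripts, the standard MindSpore serialization API can be used; the checkpoint name below is only an example:

```python
from mindspore import load_checkpoint

# List the parameter names and shapes stored in a trained checkpoint
# (replace the path with the file actually produced under train/ckpt).
param_dict = load_checkpoint("train/ckpt/fairmot-30.ckpt")
for name, param in param_dict.items():
    print(name, param.shape)
```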
The evaluation dataset is MOT20.
Run scripts/run_eval.sh to evaluate the model. The usage of the script is:
bash scripts/run_eval.sh [device] [config] [load_ckpt] [dataset_dir]
For example, you can run the shell command below to launch the validation procedure.
bash scripts/run_eval.sh GPU ./default_config.yaml ./fairmot-30.ckpt data_path
The evaluation results can be viewed in eval/eval.log.
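The MOTA and precision (Prcn) values reported below are standard CLEAR MOT metrics, computed with the motmetrics package (see src/tracking_utils/evaluation.py). A self-contained toy example of how such metrics are accumulated (the IDs and distances are made up) is:

```python
import numpy as np
import motmetrics as mm

# Toy accumulation over two frames: ground-truth object IDs, hypothesis IDs,
# and a cost matrix per frame (np.nan marks an impossible match).
acc = mm.MOTAccumulator(auto_id=True)
acc.update([1, 2], [1, 2], [[0.1, np.nan], [np.nan, 0.2]])  # frame 1
acc.update([1, 2], [1], [[0.2], [0.4]])                     # frame 2: one miss

mh = mm.metrics.create()
summary = mh.compute(acc, metrics=["mota", "precision", "recall"], name="toy")
print(summary)
```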
| Parameters | Ascend Standalone | Ascend Distributed | GPU Distributed |
| --- | --- | --- | --- |
| Model Version | FairMotNet | FairMotNet | FairMotNet |
| Resource | 1x Ascend 910 | 8x Ascend 910 | 8x RTX 3090 24GB |
| Uploaded Date | 25/06/2021 (day/month/year) | 25/06/2021 (day/month/year) | 21/02/2021 (day/month/year) |
| MindSpore Version | 1.2.0 | 1.2.0 | 1.5.0 |
| Training Dataset | MIX | MIX | MIX |
| Evaluation Dataset | MOT20 | MOT20 | MOT20 |
| Training Parameters | epoch=30, batch_size=4 | epoch=30, batch_size=4 | epoch=30, batch_size=12 |
| Optimizer | Adam | Adam | Adam |
| Loss Function | FocalLoss, RegLoss | FocalLoss, RegLoss | FocalLoss, RegLoss |
| Train Performance | MOTA: 43.8%, Prcn: 90.9% | MOTA: 42.5%, Prcn: 91.9% | MOTA: 41.2%, Prcn: 90.5% |
| Speed | 1 card: 380.528 ms/step | 8 cards: 700.371 ms/step | 8 cards: 1047 ms/step |
We also use a random seed in src/utils/backbone_dla_conv.py to initialize the network weights.
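If fully reproducible runs are needed, the global seeds can additionally be fixed before training; a minimal sketch (the seed values are arbitrary):

```python
import random
import numpy as np
import mindspore as ms

# Fix framework, NumPy and Python seeds for reproducibility.
ms.set_seed(1)
np.random.seed(1)
random.seed(1)
```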
Please check the official homepage.