Skip to content

Commit 1d2c3f6

Browse files
committed
Update README and Guidelines for BEVFusion
1 parent 8a64de5 commit 1d2c3f6

File tree

3 files changed

+49
-1
lines changed

3 files changed

+49
-1
lines changed

README.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,10 @@ It is also the official code release of [`[PointRCNN]`](https://arxiv.org/abs/18
2222

2323

2424
## Changelog
25+
[2023-05-xx] Added support for the multi-modal 3D object detection model [`BEVFusion`](https://arxiv.org/abs/2205.13542) on Nuscenes dataset, which fuses multi-modal information on BEV space and reaches 70.98% NDS on Nuscenes validation dataset. (see the [guideline](docs/guidelines_of_approaches/bevfusion.md) on how to train/test with BEVFusion).
26+
* Support multi-modal Nuscenes detection (See the [GETTING_STARTED.md](docs/GETTING_STARTED.md) to process data).
27+
* Support TransFusion-Lidar head, which ahcieves 69.43% NDS on Nuscenes validation dataset.
28+
2529
[2023-04-02] Added support for [`VoxelNeXt`](https://github.com/dvlab-research/VoxelNeXt) on Nuscenes, Waymo, and Argoverse2 datasets. It is a fully sparse 3D object detection network, which is a clean sparse CNNs network and predicts 3D objects directly upon voxels.
2630

2731
[2022-09-02] **NEW:** Update `OpenPCDet` to v0.6.0:
@@ -199,7 +203,7 @@ We could not provide the above pretrained models due to [Waymo Dataset License A
199203
but you could easily achieve similar performance by training with the default configs.
200204

201205
### NuScenes 3D Object Detection Baselines
202-
All models are trained with 8 GTX 1080Ti GPUs and are available for download.
206+
All models are trained with 8 GPUs and are available for download. For training BEVFusion, please refer to the [guideline](docs/guidelines_of_approaches/bevfusion.md).
203207

204208
| | mATE | mASE | mAOE | mAVE | mAAE | mAP | NDS | download |
205209
|----------------------------------------------------------------------------------------------------|-------:|:------:|:------:|:-----:|:-----:|:-----:|:------:|:--------------------------------------------------------------------------------------------------:|
@@ -209,6 +213,8 @@ All models are trained with 8 GTX 1080Ti GPUs and are available for download.
209213
| [CenterPoint (voxel_size=0.1)](tools/cfgs/nuscenes_models/cbgs_voxel01_res3d_centerpoint.yaml) | 30.11 | 25.55 | 38.28 | 21.94 | 18.87 | 56.03 | 64.54 | [model-34M](https://drive.google.com/file/d/1Cz-J1c3dw7JAWc25KRG1XQj8yCaOlexQ/view?usp=sharing) |
210214
| [CenterPoint (voxel_size=0.075)](tools/cfgs/nuscenes_models/cbgs_voxel0075_res3d_centerpoint.yaml) | 28.80 | 25.43 | 37.27 | 21.55 | 18.24 | 59.22 | 66.48 | [model-34M](https://drive.google.com/file/d/1XOHAWm1MPkCKr1gqmc3TWi5AYZgPsgxU/view?usp=sharing) |
211215
| [VoxelNeXt (voxel_size=0.075)](tools/cfgs/nuscenes_models/cbgs_voxel0075_voxelnext.yaml) | 30.11 | 25.23 | 40.57 | 21.69 | 18.56 | 60.53 | 66.65 | [model-31M](https://drive.google.com/file/d/1IV7e7G9X-61KXSjMGtQo579pzDNbhwvf/view?usp=share_link) |
216+
| [TransFusion-L](tools/cfgs/nuscenes_models/cbgs_transfusion_lidar.yaml) | 27.96 | 25.37 | 29.35 | 27.31 | 18.55 | 64.58 | 69.43 | [model-32M] |
217+
| [BEVFusion](tools/cfgs/nuscenes_models/cbgs_bevfusion.yaml) | 28.03 | 25.43 | 30.19 | 26.76 | 18.48 | 67.75 | 70.98 | [model-157M] |
212218

213219

214220
### ONCE 3D Object Detection Baselines

docs/GETTING_STARTED.md

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -53,9 +53,16 @@ pip install nuscenes-devkit==1.0.5
5353

5454
* Generate the data infos by running the following command (it may take several hours):
5555
```python
56+
# for lidar-only setting
5657
python -m pcdet.datasets.nuscenes.nuscenes_dataset --func create_nuscenes_infos \
5758
--cfg_file tools/cfgs/dataset_configs/nuscenes_dataset.yaml \
5859
--version v1.0-trainval
60+
61+
# for multi-modal setting
62+
python -m pcdet.datasets.nuscenes.nuscenes_dataset --func create_nuscenes_infos \
63+
--cfg_file tools/cfgs/dataset_configs/nuscenes_dataset.yaml \
64+
--version v1.0-trainval \
65+
--with_cam
5966
```
6067

6168
### Waymo Open Dataset
Lines changed: 35 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,35 @@
1+
2+
## Installation
3+
4+
Please refer to [INSTALL.md](../INSTALL.md) for the installation of `OpenPCDet`.
5+
* We recommend the users to check the version of pillow and use pillow==8.4.0 to avoid bug in bev pooling.
6+
7+
## Data Preparation
8+
Please refer to [GETTING_STARTED.md](../GETTING_STARTED.md) to process the multi-modal Nuscenes Dataset.
9+
10+
## Training
11+
12+
1. Train the lidar branch for BEVFusion:
13+
```shell
14+
bash scripts/dist_train.sh ${NUM_GPUS} --cfg_file cfgs/nuscenes_models/cbgs_transfusion_lidar.yaml \
15+
```
16+
The ckpt will be saved in ../output/nuscenes_models/cbgs_transfusion_lidar/default/ckpt.
17+
18+
1. To train BEVFusion, you need to download pretrained parameters for image backbone [here](www.google.com), and specify the path in [config](../../tools/cfgs/nuscenes_models/cbgs_bevfusion.yaml#L88). Then run the following command:
19+
```shell
20+
bash scripts/dist_train.sh ${NUM_GPUS} --cfg_file cfgs/nuscenes_models/cbgs_bevfusion.yaml \
21+
--pretrained_model ../output/nuscenes_models/cbgs_transfusion_lidar/default/ckpt/checkpoint_epoch_20.pth \
22+
```
23+
## Evaluation
24+
* Test with a pretrained model:
25+
```shell
26+
bash scripts/dist_test.sh ${NUM_GPUS} --cfg_file cfgs/nuscenes_models/cbgs_bevfusion.yaml \
27+
--ckpt ../output/cfgs/nuscenes_models/cbgs_bevfusion/default/ckpt/checkpoint_epoch_6.pth
28+
```
29+
30+
## Performance
31+
All models are trained with spconv 1.0, but you can directly load them for testing regardless of the spconv version.
32+
| | mATE | mASE | mAOE | mAVE | mAAE | mAP | NDS | download |
33+
|----------------------------------------------------------------------------------------------------|-------:|:------:|:------:|:-----:|:-----:|:-----:|:------:|:--------------------------------------------------------------------------------------------------:|
34+
| [TransFusion-L](../../tools/cfgs/nuscenes_models/cbgs_transfusion_lidar.yaml) | 27.96 | 25.37 | 29.35 | 27.31 | 18.55 | 64.58 | 69.43 | [model-32M] |
35+
| [BEVFusion](../../tools/cfgs/nuscenes_models/cbgs_bevfusion.yaml) | 28.03 | 25.43 | 30.19 | 26.76 | 18.48 | 67.75 | 70.98 | [model-157M] |

0 commit comments

Comments
 (0)