Add support for multi-modal NuScenes Detection #1339

chenshi3 · 2023-05-09T11:01:01Z

Hi,

I'm submitting a pull request to add support for multi-modal detection on the NuScenes dataset. This PR consists of four main components:
(1) Support for the TransFusion-Lidar head.
(2) Implementation of the fade strategy, which disables data augmentations in the last several epochs during training.
(3) Support for the multi-modal NuScenes dataset, including data processing and dataset usage.
(4) Introduction of the multi-modal detector BEVFusion, including support for image backbones, LSS Transform, and Bev pooling.

Best regards,
Chen Shi

jihanyang · 2023-05-11T12:44:21Z

pcdet/datasets/augmentor/data_augmentor.py

@@ -23,6 +24,18 @@ def __init__(self, root_path, augmentor_configs, class_names, logger=None):
            cur_augmentor = getattr(self, cur_cfg.NAME)(config=cur_cfg)
            self.data_augmentor_queue.append(cur_augmentor)

+    def disableAugmentation(self, augmentor_configs):


I will be better to use disable_augmentation as the function name to keep the naming style consistent along pcdet

Ok, I will adjust the function name.

jihanyang · 2023-05-11T13:01:32Z

pcdet/datasets/dataset.py

@@ -130,6 +131,30 @@ def __getitem__(self, index):
        """
        raise NotImplementedError

+    def set_lidar_aug_matrix(self, data_dict):


I think it will be better to move this function to data_augmentor.py and call it after forward all data augmentation.

The function set_lidar_aug_matrix needs to be called during testing and identity matrix is generated, so it may not be appropriate to move it data_augmentor.py.

jihanyang · 2023-05-11T13:05:18Z

pcdet/datasets/nuscenes/nuscenes_dataset.py

@@ -137,6 +181,60 @@ def __getitem__(self, index):
                'gt_names': info['gt_names'] if mask is None else info['gt_names'][mask],
                'gt_boxes': info['gt_boxes'] if mask is None else info['gt_boxes'][mask]
            })
+        if self.use_camera:


It will be better to add a new function to include all following added codes. I think this will make __getitem__ easier to understand.

I will define a function called load_camera_info to include these codes.

Get it. Thanks a lot for your contribution. I think after the following adjustment, we can merge this PR. I will make sure with shaoshuai @sshaoshuai.

jihanyang · 2023-05-13T12:31:58Z

Great. @sshaoshuai will have a final check and then merge it.

sshaoshuai

Great, thanks!

Galaxy-ZRX · 2023-07-24T14:34:10Z

Hi @chenshi3 , thank you very much for your work on supporting TransFusion! May I ask if it's possible to train TransFusion-Lidar model with earlier version of pcdet? Could you please give me some hints on how should I change the related codes? (I am not sure if it's enough to just add the transfusion-lidar head and other model-related codes, or should I also modify other parts to support it?)

lacie-life · 2024-11-28T08:27:27Z

Hi @chenshi3, thank you very much for your contribution. However, I have a question about the data augmentation. In the config file, rotation and scale augmentations are applied to the point cloud. However, this augmentation alone has nothing to do with the image being augmented in the next step. Can you explain this? Does it affect the efficiency of the model?

chenshi3 added 7 commits May 7, 2023 21:52

Add support for TransFusion-Lidar Head

4dc1849

Add multi-modal support for Nuscenes dataset

c5dfdd7

Add support for BEVFusion

8a64de5

Update README and Guidelines for BEVFusion

1d2c3f6

Fixed bug in Nuscenes dataset and Update configs

7cc9e07

Update checkpoints of TransFusion and BEVFusion

123d42e

Update configs

ed2bb81

jihanyang reviewed May 11, 2023

View reviewed changes

Adjust the hook function and nuscenes_dataset.py

0db30f4

sshaoshuai reviewed May 13, 2023

View reviewed changes

update README.md

fcfa077

sshaoshuai merged commit 02ac3e1 into open-mmlab:master May 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for multi-modal NuScenes Detection #1339

Add support for multi-modal NuScenes Detection #1339

chenshi3 commented May 9, 2023

jihanyang May 11, 2023

chenshi3 May 11, 2023

jihanyang May 11, 2023

chenshi3 May 11, 2023

jihanyang May 11, 2023

chenshi3 May 11, 2023

jihanyang May 12, 2023

jihanyang commented May 13, 2023

sshaoshuai left a comment

Galaxy-ZRX commented Jul 24, 2023

lacie-life commented Nov 28, 2024

Add support for multi-modal NuScenes Detection #1339

Add support for multi-modal NuScenes Detection #1339

Conversation

chenshi3 commented May 9, 2023

jihanyang May 11, 2023

Choose a reason for hiding this comment

chenshi3 May 11, 2023

Choose a reason for hiding this comment

jihanyang May 11, 2023

Choose a reason for hiding this comment

chenshi3 May 11, 2023

Choose a reason for hiding this comment

jihanyang May 11, 2023

Choose a reason for hiding this comment

chenshi3 May 11, 2023

Choose a reason for hiding this comment

jihanyang May 12, 2023

Choose a reason for hiding this comment

jihanyang commented May 13, 2023

sshaoshuai left a comment

Choose a reason for hiding this comment

Galaxy-ZRX commented Jul 24, 2023

lacie-life commented Nov 28, 2024