In this work, we explore an effective multi-sensor (LiDAR and RGB camera) fusion method for 3D LiDAR semantic segmentation. Existing fusion-based methods mainly conduct feature fusion in LiDAR coordinates, which discards the perceptual features (shape and texture) of the RGB images. In contrast, we fuse the information from the two modalities in camera coordinates and propose a Perception-aware Multi-sensor Fusion (PMF) scheme. More details can be found in our paper.
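To make the camera-coordinate fusion concrete, the sketch below projects LiDAR points onto the image plane with a KITTI-style extrinsic and intrinsic matrix. The helper and its variable names are illustrative assumptions, not the code from this repository:

```python
import numpy as np

def project_lidar_to_camera(points, T_cam_lidar, K, img_h, img_w):
    """Project LiDAR points (N, 3) into the camera image plane.

    points:      (N, 3) xyz coordinates in the LiDAR frame
    T_cam_lidar: (4, 4) extrinsic matrix (LiDAR -> camera)
    K:           (3, 3) camera intrinsic matrix
    Returns pixel coordinates (M, 2) and a mask of the visible points.
    """
    # Homogeneous coordinates, then transform into the camera frame
    pts_h = np.concatenate([points, np.ones((points.shape[0], 1))], axis=1)
    pts_cam = (T_cam_lidar @ pts_h.T).T[:, :3]

    # Keep only points in front of the camera
    in_front = pts_cam[:, 2] > 0.0

    # Perspective projection with the intrinsics
    uv = (K @ pts_cam[in_front].T).T
    uv = uv[:, :2] / uv[:, 2:3]

    # Keep points that land inside the image
    inside = (
        (uv[:, 0] >= 0) & (uv[:, 0] < img_w) &
        (uv[:, 1] >= 0) & (uv[:, 1] < img_h)
    )
    mask = np.zeros(points.shape[0], dtype=bool)
    mask[np.where(in_front)[0][inside]] = True
    return uv[inside], mask
```

Each projected point then has a pixel location where its features can meet the RGB features, so the two modalities can be fused on the image plane.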
In the extended version, EPMF, we improve the efficiency and effectiveness of PMF and provide more analysis results. We will release the pretrained models and updated code. (We show the performance of EPMF-ResNet34 below.)
Leaderboard of SensatUrban@ICCV2021
More details can be found in the corresponding file.
| Method | Dataset | mIoU (%) |
| --- | --- | --- |
| PMF-ResNet34 | SemanticKITTI Validation Set | 63.9 |
| PMF-ResNet34 | nuScenes Validation Set | 76.9 |
| PMF-ResNet50 | nuScenes Validation Set | 79.4 |
| PMF48-ResNet101 | SensatUrban Test Set (ICCV2021 Competition) | 66.2 (Rank 5) |
Note: please modify the dataset paths in the configuration files to match your environment before running the code.
```
|--- pc_processor/ # Python package for point cloud processing
    |--- checkpoint/ # generates logs for experiments
    |--- dataset/ # functions for dataset pre-processing
    |--- layers/ # network layers
    |--- loss/ # loss functions
    |--- metrices/ # evaluation metrics
    |--- models/ # network architectures
    |--- postproc/ # post-processing
    |--- utils/ # utility functions
|--- tasks/ # training and evaluation tasks
    |--- pmf/ # training code of PMF
    |--- pmf_eval_nuscenes/ # evaluation code of PMF on nuScenes
        |--- testset_eval/ # merge the results of SalsaNext and PMF to generate results for the test set
        |--- xxx.py # evaluation code of PMF on nuScenes
    |--- pmf_eval_semantickitti/ # evaluation code of PMF on SemanticKITTI
    |--- salsanext/ # training code of SalsaNext (modified from the official version)
    |--- salsanext_eval_nuscenes/ # evaluation code of SalsaNext on nuScenes
```
```
|--- pmf/
    |--- config_server_kitti.yaml # configuration file for SemanticKITTI
    |--- config_server_nus.yaml # configuration file for nuScenes
    |--- main.py # main function
    |--- trainer.py # training code
    |--- option.py # parser for the configuration file
    |--- run.sh # executable script
```
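For reference, here is a minimal sketch of loading such a YAML configuration in Python. The key names (`data_root`, `gpu`, `batch_size`) match those mentioned in the steps below, but the actual parsing is done by `option.py` and may differ:

```python
import yaml

# Minimal sketch: read the experiment configuration.
# option.py implements the real parser; this only illustrates the idea.
with open("tasks/pmf/config_server_kitti.yaml") as f:
    cfg = yaml.safe_load(f)

# Keys referenced in the steps below; their exact names in the file are assumptions.
print("data_root:", cfg.get("data_root"))
print("gpu:", cfg.get("gpu"))
print("batch_size:", cfg.get("batch_size"))
```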
- Enter `tasks/pmf` and modify the configuration file `config_server_kitti.yaml`. You need to set `data_root` to the path of your dataset. Additionally, you can set `gpu` and `batch_size` as you need.
- Modify `run.sh` and make sure the value of `nproc_per_node` is equal to the number of GPUs set in the configuration file.
- Run the following command:

```bash
./run.sh # or bash run.sh
```

- If you run the script successfully, you can find the log folder at `PMF/experiments/PMF-SemanticKitti`. The structure of the log folder:
```
|--- log_dataset_network_xxxx/
    |--- checkpoint/
    |--- code/
    |--- log/
        |--- events.out.tfevents.xxx
```
```
|--- pmf_eval_semantickitti/
    |--- config_server_kitti.yaml
    |--- infer.py
    |--- option.py
```
- Enter `tasks/pmf_eval_semantickitti` and modify the configuration file `config_server_kitti.yaml`. You need to set `data_root` to the path of your dataset and set `pretrained_path` to the log folder of the trained model.
- Run the following command:

```bash
python infer.py config_server_kitti.yaml
```
- If you run the script successfully, you will find the evaluation log folder inside the folder of your trained model:
```
|--- PMF/experiments/PMF-SemanticKitti/log_xxxx/
    |--- Eval_xxxxx/
        |--- code/
        |--- log/
        |--- pred/
```
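The reported metric is mIoU, the mean of the per-class intersection-over-union scores. Below is a minimal sketch of how mIoU can be computed from flat predicted and ground-truth label arrays (such as the per-scan predictions under `pred/`); the actual implementation lives in `pc_processor/metrices/` and may differ in details such as class mapping and the ignored label:

```python
import numpy as np

def miou(pred, gt, num_classes, ignore_index=0):
    """Compute mean IoU from flat label arrays (a sketch; see pc_processor/metrices/)."""
    # Confusion matrix over valid (non-ignored) labels
    valid = gt != ignore_index
    cm = np.bincount(
        gt[valid] * num_classes + pred[valid],
        minlength=num_classes * num_classes,
    ).reshape(num_classes, num_classes)

    # IoU per class: TP / (TP + FP + FN)
    tp = np.diag(cm).astype(np.float64)
    denom = cm.sum(axis=0) + cm.sum(axis=1) - tp
    iou = np.where(denom > 0, tp / np.maximum(denom, 1), np.nan)
    return np.nanmean(iou[1:] if ignore_index == 0 else iou)

# Example usage with random labels (20 classes, class 0 ignored)
pred = np.random.randint(0, 20, size=1000)
gt = np.random.randint(0, 20, size=1000)
print(f"mIoU: {miou(pred, gt, num_classes=20):.3f}")
```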
```
@InProceedings{Zhuang_2021_ICCV,
    author    = {Zhuang, Zhuangwei and Li, Rong and Jia, Kui and Wang, Qicheng and Li, Yuanqing and Tan, Mingkui},
    title     = {Perception-Aware Multi-Sensor Fusion for 3D LiDAR Semantic Segmentation},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {16280-16290}
}

@article{tan2024epmf,
    title     = {EPMF: Efficient Perception-Aware Multi-Sensor Fusion for 3D Semantic Segmentation},
    author    = {Tan, Mingkui and Zhuang, Zhuangwei and Chen, Sitao and Li, Rong and Jia, Kui and Wang, Qicheng and Li, Yuanqing},
    journal   = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
    year      = {2024},
    publisher = {IEEE}
}
```