This repository provides the official implementation for the following two papers:
For further information, please contact Fangzhou Hong.
- 2022-04 Uploaded the pre-train model for 4D-DS-Net here.
- 2022-03 [NEW:partying_face:] Checkout our new extention 4D-DS-Net for 4D panoptic segmentation! Codes are released. Checkout the instructions here.
- 2021-03 DS-Net is accepted to CVPR 2021!
- 2020-12-01 Code release!
- 2020-11-16 We achieve 1st place in SemanticKITTI Panoptic Segmentation leaderboard. See the screenshot.
- easydict
- hdbscan
- numba
- numpy
- pyyaml
- python=3.7
- scikit-learn
- scipy
- spconv=1.1
- tensorboard=2.3.0
- torch=1.5
- torchvision=0.6.0
- torch-cluster=1.5.7
- torch-scatter=1.3.2
- tqdm
Please download the SemanticKITTI dataset to the folder data
and the structure of the folder should look like:
./
├──
├── ...
└── data/
├──sequences
├── 00/
│ ├── velodyne/
| | ├── 000000.bin
| | ├── 000001.bin
| | └── ...
│ └── labels/
| ├── 000000.label
| ├── 000001.label
| └── ...
├── 08/ # for validation
├── 11/ # 11-21 for testing
└── 21/
└── ...
The training pipeline of our DS-Net consists of three steps: 1) semantic segmentation training; 2) center regression training; 3) dynamic shifting training. The first two steps give us the backbone model. The last step gives our DS-Net. We provide the corresponding pretrained model of each step. The inferencing and training details are further explained in this section.
Note that our implementation only supports parallel training for now. We fix the batch size of each GPUs to 1
. In the first line of each script, you could choose the number of GPUs ${ngpu}
you wish to use for training or inferencing. In the second line, you could set the folder name ${tag}
and all the generated files will be put into ./output/${tag}
. All the provided pytorch distributed version of scripts are not tested due to the lack of proper environment. All the slurm version of scripts are tested and should work well. Should there be any problem, feel free to open an issue.
If you wish to use our pretrained models, remember to create a new folder pretrained_weight
and put all the downloaded models there.
Step | Download Link |
---|---|
1 | sem_pretrain.pth |
2 | offset_pretrain_pq_0.564.pth |
3 | dsnet_pretrain_pq_0.577.pth |
This is the trained final model. Download Link: checkpoint_epoch_5_0.640_0.594_0.648.pth.
We provide inferencing scripts for the backbone and our DS-Net.
Our backbone consists of the semantic segmentation module, the center regression module, a heuristic clustering algorithm and the consensus-driven fusion module. You are welcomed to play around with different heuristic algorithms and their parameter settings in ./cfgs/release/backbone.yaml
since we provide several clustering algorithms in ./utils/clustering.py
.
The inferencing scripts of our backbone are ./scripts/release/backbone/val_*.sh
. Before using the scripts, please make sure you have downloaded the pretrained model (of step 2) or put the models trained by yourself (in step 2) to ./pretrained_weight
and make sure to pass the correct path of the model to --pretrained_ckpt
option.
The inferencing scripts of our DS-Net are in ./scripts/release/dsnet
. val_*.sh
are for inferencing on the validation set of SemanticKITTI. test_*.sh
are for inferencing on the test set of SemanticKITTI and will generate prediction files under the corresponding output folder. Before using the scripts, remember to download the pretrained model (of step 3) or put the model trained by yourself (in step 3) to ./pretrained_weight
and make sure you pass the right path to --pretrained_ckpt
option in the scripts.
The training codes and scripts for this step will be released soon. For now, please download the step 1 pretrained model using the above link. Please note that the cylinder backbone used in our implementation is the original version of Cylinder3D instead of the latest version.
The training scripts of this step could be found in ./scripts/release/backbone/train_*.sh
. Before using the training scripts, please download the pretrained model of step 1 to folder ./pretrained_weight
. Feel free to play around with different parameter settings in ./cfgs/release/backbone.yaml
.
The training scripts of step 3 could be found in ./scripts/release/dsnet/train_*.sh
. Before using the training scripts of this part, please download the pretrained model (of step 2) to folder ./pretrained_weight
or put the model trained (in step 2) to ./pretrained_weight
and change the --pretrained_ckpt
option to the correct path. You could experiment with different parameter settings in ./cfgs/release/dsnet.yaml
.
For the trained models, checkout here.
Please checkout ./scripts/release/4d-dsnet/train_backbone_multi_frames_2.sh
.
Please checkout ./scripts/release/4d-dsnet/train_dsnet_multi_frames_2.sh
.
Distributed under the MIT License. See LICENSE
for more information.
If you find our work useful in your research, please consider citing the following papers:
@InProceedings{Hong_2021_CVPR,
author = {Hong, Fangzhou and Zhou, Hui and Zhu, Xinge and Li, Hongsheng and Liu, Ziwei},
title = {LiDAR-Based Panoptic Segmentation via Dynamic Shifting Network},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2021},
pages = {13090-13099}
}
@article{hong20224ddsnet,
title={LiDAR-based 4D Panoptic Segmentation via Dynamic Shifting Network},
author={Hong, Fangzhou and Zhou, Hui and Zhu, Xinge and Li, Hongsheng and Liu, Ziwei},
journal={arXiv preprint arXiv:2203.07186},
year={2022}
}
In our implementation, we refer to the following open-source databases: