Zhe Liu 1,* ,
Jinghua Hou 1,* ,
Xinyu Wang 1,* ,
Xiaoqing Ye 3,
Jingdong Wang 3,
Hengshuang Zhao 2,
Xiang Bai 1,✉
1 Huazhong University of Science and Technology,
2 The University of Hong Kong,
3 Baidu Inc.
* Equal contribution, ✉ Corresponding author.
-
Strong performance. LION achieves state-of-the-art performance on Waymo, nuScenes, Argoverse V2, and ONCE datasets. 💪
-
Strong generalization. LION can support almost all linear RNN operators including Mamba, RWKV, RetNet, xLSTM, and TTT. Anyone is welcome to verify more linear RNN operators. 😀
-
More friendly. LION can train all models on less 24G GPU memory (i.e., RTX 3090, RTX4090, V100 and A100 are enough to train our LION). 😀
- 2024.09.26: LION has been accepted by NeurIPS 2024. 🎉
- 2024.07.25: LION paper released. 🔥
- 2024.07.02: Our new works OPEN and SEED have been accepted by ECCV 2024. 🎉
- Waymo Val Set~(100%)
Model | mAP/mAPH_L1 | mAP/mAPH_L2 | Vec_L1 | Vec_L2 | Ped_L1 | Ped_L2 | Cyc_L1 | Cyc_L2 | Config |
---|---|---|---|---|---|---|---|---|---|
LION-RetNet | 80.9/78.8 | 74.6/72.7 | 79.0/78.5 | 70.6/70.2 | 84.6/80.0 | 77.2/72.8 | 79.0/78.0 | 76.1/75.1 | config |
LION-RWKV | 81.0/79.0 | 74.7/72.8 | 79.7/79.3 | 71.3/71.0 | 84.6/80.0 | 77.1/72.7 | 78.7/77.7 | 75.8/74.8 | config |
LION-Mamba | 81.4/79.4 | 75.1/73.2 | 79.5/79.1 | 71.1/70.7 | 84.9/80.4 | 77.5/73.2 | 79.7/78.7 | 76.7/75.8 | config |
LION-Mamba-L | 82.1/80.1 | 75.9/74.0 | 80.3/79.9 | 72.0/71.6 | 85.8/81.4 | 78.5/74.3 | 80.1/79.0 | 77.2/76.2 | config |
Note: You could reduce the training epochs from 24 to 12~(the performance gap is within 1 mAP/mAPH) or reduce the 100% training to 20% training sets.
- nuScenes
Model | Split | Epoch | CBGS | NDS | mAP | Config | Download (Baidu Pan) | Download (Google Drive) |
---|---|---|---|---|---|---|---|---|
LION-RetNet | Val | 36 | False | 71.9 | 67.3 | config | nus_retnet.pth (ksmp) | nus_retnet.pth |
LION-RWKV | Val | 36 | False | 71.7 | 66.8 | config | ||
LION-Mamba | Val | 36 | False | 72.1 | 68.0 | config | nus_mamba.pth (2tvc) | nus_mamba.pth |
LION-Mamba | Val | 48 | False | 72.3 | 68.2 | config | ||
LION-Mamba | Test | 36 | False | 73.9 | 69.8 |
Note: Our model on nuScenes does not use CBGS for training more time and without any test-time augmentation or model ensembling! For obtaining more stable and better performance, you could try to train more time~(e.g., 48 epochs)
- Argoverse V2 Val Set
Model | mAP | Config | Download (Baidu Pan) | Download (Google Drive) |
---|---|---|---|---|
LION-RetNet | 40.7 | config | argov2_retnet.pth (yghm) | argov2_retnet.pth |
LION-RWKV | 41.1 | config | argov2_rwkv.pth (cr4e) | argov2_rwkv.pth |
LION-Mamba | 41.5 | config | argov2_mamba.pth (k63i) | argov2_mamba.pth |
- ONCE Val Set
Model | Vehicle | Pedestrian | Cyclist | mAP | Config | Download |
---|---|---|---|---|---|---|
LION-RetNet | 78.1 | 52.4 | 68.3 | 66.3 | config | |
LION-RWKV | 78.3 | 50.6 | 68.4 | 65.8 | config | |
LION-Mamba | 78.2 | 53.2 | 68.5 | 66.6 | config |
- We provide some examples of LION models on KITTI dataset for quick validation of any Linear RNN operators.
- Here, we provide the results of moderate difficulty for LION with RetNet, RWKV, Mamba, xLSTM, and TTT.
- Anyone is welcome to verify more linear RNN operators. 😀
Model | Car | Pedestrian | Cyclist | Config | Download |
---|---|---|---|---|---|
LION-TTT | 78.0 | 58.6 | 69.6 | config | |
LION-xLSTM | 77.9 | 59.3 | 67.4 | config | |
LION-RetNet | 77.9 | 60.2 | 69.6 | config | |
LION-Mamba | 78.3 | 60.2 | 68.6 | config | |
LION-RWKV | 78.3 | 62.2 | 71.2 | config |
Please refer to INSTALL.md for the installation of LION codebase.
We provide all training&evaluation scripts for training our LION, please refer to tools/
- Train all models of LION on nuScenes
bash run_train_lion_for_nus.sh
- Train all models of LION on Waymo
bash run_train_lion_for_waymo.sh
- Train all models of LION on Argoverse V2
bash run_train_lion_for_argov2.sh
- Train all models of LION on ONCE
bash run_train_lion_for_once.sh
- Train all models of LION on KITTI
bash run_train_lion_for_kitti.sh
For more details about LION, please refer to GETTING_STARTED.md to learn more usage about LION.
- Release the paper.
- Release the code of LION on the Waymo.
- Release the code of LION on the nuScenes.
- Release the code of LION on the Argoverse V2.
- Release the code of LION on the ONCE.
- Release the code of LION on the KITTI.
- Release some important checkpoints of LION (nuScenes and Argoverse v2).
- Support more linear RNNs.
@article{liu2024lion,
title={LION: Linear Group RNN for 3D Object Detection in Point Clouds},
author={Zhe Liu, Jinghua Hou, Xingyu Wang, Xiaoqing Ye, Jingdong Wang, Hengshuang Zhao, Xiang Bai},
journal={Advances in Neural Information Processing Systems},
year={2024}
}
We thank these great works and open-source repositories: OpenPCDet, DSVT, FlatFormer, HEDNet, Mamba, RWKV, Vision-RWKV, RMT, xLSTM, TTT, and flash-linear-attention.