ICCV2023结果陆续都出来了,收到了很多朋友中稿的消息,ICCV 2023今年一共收录 2100多篇,自动驾驶之心也第一时间进行了跟进,将已确定中稿的工作分享给大家,后面将会持续更新!
后面将会按照3D目标检测、BEV、协同感知、语义分割、点云、SLAM、大模型、NeRF、端到端、多模态融合等多个方向罗列!
如果您的工作也需要被收录,欢迎提交Issue,或者联系邮箱autodrivingtech@163.com,我们会及时收录!
本内容由自公众号【自动驾驶之心】 团队整理,自动驾驶之心建立了一系列技术交流群,面向自动驾驶与AI领域,包括:目标检测、语义分割、全景分割、实例分割、车道线、目标跟踪、3D目标检测、多模态感知、BEV感知、Occupancy、多传感器融合、多传感器标定、transformer、大模型、点云处理、端到端自动驾驶、SLAM、光流估计、深度估计、轨迹预测、高精地图、NeRF、规划控制、模型部署落地、自动驾驶仿真测试、产品经理、硬件配置、AI求职交流等!
如果您有需要,欢迎加入自动驾驶之心:技术交流群
SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving
OccNet: Scene as Occupancy
OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction
OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
VAD: Vectorized Scene Representation for Efficient Autonomous Driving
DriveAdapter: New Paradigm for End-to-End Autonomous Driving to Alleviate Causal Confusion
Among Us: Adversarially Robust Collaborative Perception by Consensus
HM-ViT: Hetero-modal Vehicle-to-Vehicle Cooperative perception with vision transformer
Optimizing the Placement of Roadside LiDARs for Autonomous Driving
待更新!
UMC: A Unified Bandwidth-efficient and Multi-resolution based Collaborative Perception Framework
ADAPT: Efficient Multi-Agent Trajectory Prediction with Adaptation
CORE: Cooperative Reconstruction for Multi-Agent Perception
PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
Cross Modal Transformer: Towards Fast and Robust 3D Object Detection
DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection
SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection
MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling
Learning from Noisy Data for Semi-Supervised 3D Object Detection
- Paper: 待更新!
- Code: https://github.com/zehuichen123/NoiseDet
SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection
PG-RCNN: Semantic Surface Point Generation for 3D Object Detection
Rethinking Range View Representation for LiDAR Segmentation
Paper:https://arxiv.org/pdf/2303.05367.pdf
UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase
已收录,arxiv上暂未放出!
Segment Anything
MARS: Model-agnostic Biased Object Removal without Additional Supervision for Weakly-Supervised Semantic Segmentation
Tube-Link: A Flexible Cross Tube Baseline for Universal Video Segmentation
CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic Segmentation
To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation
PointDC: Unsupervised Semantic Segmentation of 3D Point Clouds via Cross-modal Distillation and Super-Voxel Clustering
Contrastive Model Adaptation for Cross-Condition Robustness in Semantic Segmentation
PODA: Prompt-driven Zero-shot Domain Adaptation
Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation
- Paper: https://red-fairy.github.io/ZeroShotDayNightDA-Webpage/paper.pdf
- Code: https://github.com/Red-Fairy/ZeroShotDayNightDA
Robo3D: Towards Robust and Reliable 3D Perception against Corruptions
Implicit Autoencoder for Point Cloud Self-supervised Representation Learning
P2C: Self-Supervised Point Cloud Completion from Single Partial Clouds
- Paper:
- Code: https://github.com/CuiRuikai/Partial2Complete
CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training
SVDFormer: Complementing Point Cloud via Self-view Augmentation and Self-structure Dual-generator
AdaptPoint: Sample-adaptive Augmentation for Point Cloud Recognition Against Real-world Corruptions
- Paper: 待更新!
- Code: https://github.com/Roywangj/AdaptPoint/tree/main
RegFormer: An Efficient Projection-Aware Transformer Network for Large-Scale Point Cloud Registration
Point Cloud regression with new algebraical representation on ModelNet40 datasets
- Paper: 待更新!
- Code: https://github.com/flatironinstitute/PointCloud_Regression
Clustering based Point Cloud Representation Learning for 3D Analysis
Implicit Autoencoder for Point Cloud Self-supervised Representation Learning
PVT++: A Simple End-to-End Latency-Aware Visual Tracking Framework
Cross-modal Orthogonal High-rank Augmentation for RGB-Event Transformer-trackers
- Paper: 待更新!
- Code: https://github.com/ZHU-Zhiyu/High-Rank_RGB-Event_Tracker
ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking
- Paper: 待更新!
- Code: https://github.com/chengche6230/ReST
Multiple Planar Object Tracking
- Paper: 待更新!
- Code: https://github.com/nku-zhichengzhang/MPOT
3DMOTFormer: Graph Transformer for Online 3D Multi-Object Tracking
- Paper: 待更新!
- Code: https://github.com/dsx0511/3DMOTFormer
MBPTrack: Improving 3D Point Cloud Tracking with Memory Networks and Box Priors
EigenTrajectory: Low-Rank Descriptors for Multi-Modal Trajectory Forecasting
IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis
SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields
Single-Stage Diffusion NeRF
SemARFlow: Injecting Semantics into Unsupervised Optical Flow Estimation for Autonomous Driving
- Paper: https://arxiv.org/pdf/2303.06209.pdf
- Code: https://github.com/duke-vision/semantic-unsup-flow-release
ELFNet: Evidential Local-global Fusion for Stereo Matching
SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning
- Paper: 待更新
- Code: https://github.com/fh2019ustc/SimFIR