CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation

Zhao, Lingjun; Song, Jingyu; Skinner, Katherine A.

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2403

Computer Science > Computer Vision and Pattern Recognition

Title: CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation

Authors: Lingjun Zhao, Jingyu Song, Katherine A. Skinner

(Submitted on 28 Mar 2024)

Abstract: In the field of 3D object detection for autonomous driving, LiDAR-Camera (LC) fusion is the top-performing sensor configuration. Still, LiDAR is relatively high cost, which hinders adoption of this technology for consumer automobiles. Alternatively, camera and radar are commonly deployed on vehicles already on the road today, but performance of Camera-Radar (CR) fusion falls behind LC fusion. In this work, we propose Camera-Radar Knowledge Distillation (CRKD) to bridge the performance gap between LC and CR detectors with a novel cross-modality KD framework. We use the Bird's-Eye-View (BEV) representation as the shared feature space to enable effective knowledge distillation. To accommodate the unique cross-modality KD path, we propose four distillation losses to help the student learn crucial features from the teacher model. We present extensive evaluations on the nuScenes dataset to demonstrate the effectiveness of the proposed CRKD framework. The project page for CRKD is this https URL

Comments:	Accepted to CVPR 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2403.19104 [cs.CV]
	(or arXiv:2403.19104v1 [cs.CV] for this version)

Submission history

From: Jingyu Song [view email]
[v1] Thu, 28 Mar 2024 02:39:45 GMT (5404kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2403.19104

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation

Submission history