Computer Science > Computer Vision and Pattern Recognition

Title: GeoAuxNet: Towards Universal 3D Representation Learning for Multi-sensor Point Clouds

Abstract: Point clouds captured by different sensors such as RGB-D cameras and LiDAR possess non-negligible domain gaps. Most existing methods design different network architectures and train them separately on point clouds from various sensors. Typically, point-based methods achieve outstanding performance on evenly-distributed dense point clouds from RGB-D cameras, while voxel-based methods are more efficient for large-range sparse LiDAR point clouds. In this paper, we propose geometry-to-voxel auxiliary learning to enable voxel representations to access point-level geometric information, which supports better generalisation of the voxel-based backbone with additional interpretations of multi-sensor point clouds. Specifically, we construct hierarchical geometry pools generated by a voxel-guided dynamic point network, which efficiently provide auxiliary fine-grained geometric information adapted to different stages of voxel features. We conduct experiments on joint multi-sensor datasets to demonstrate the effectiveness of GeoAuxNet. Benefiting from elaborate geometric information, our method outperforms other models trained collectively on multi-sensor datasets and achieves competitive results with state-of-the-art experts on each single dataset.
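The paper itself should be consulted for the actual architecture, but the following minimal PyTorch-style sketch illustrates the general idea of geometry-to-voxel fusion described in the abstract: voxel features query a pool of fine-grained geometric features and residually fuse the retrieved geometry back in. All class, parameter, and tensor names here are illustrative assumptions rather than GeoAuxNet's API, and the pool is simplified to a single learnable bank instead of the paper's hierarchical, dynamically generated pools.

import torch
import torch.nn as nn

class GeoToVoxelFusion(nn.Module):
    """Illustrative geometry-to-voxel fusion (assumed names, not the
    paper's API): voxel features attend over a pool of geometric
    prototypes and absorb the retrieved geometry residually."""

    def __init__(self, voxel_dim: int, geo_dim: int, pool_size: int):
        super().__init__()
        # Simplified stand-in for a geometry pool: a learnable bank of
        # fine-grained geometric feature prototypes.
        self.geometry_pool = nn.Parameter(torch.randn(pool_size, geo_dim))
        self.query = nn.Linear(voxel_dim, geo_dim)
        self.out = nn.Linear(geo_dim, voxel_dim)

    def forward(self, voxel_feats: torch.Tensor) -> torch.Tensor:
        # voxel_feats: (N, voxel_dim) features of N occupied voxels.
        q = self.query(voxel_feats)                             # (N, geo_dim)
        attn = torch.softmax(q @ self.geometry_pool.T, dim=-1)  # (N, pool_size)
        geo = attn @ self.geometry_pool                         # (N, geo_dim)
        # Residual fusion: inject point-level geometric information
        # into the voxel representation.
        return voxel_feats + self.out(geo)

# Toy usage on random voxel features.
fusion = GeoToVoxelFusion(voxel_dim=64, geo_dim=32, pool_size=128)
feats = torch.randn(100, 64)
print(fusion(feats).shape)  # torch.Size([100, 64])

In the paper's setting, one such fusion step would be applied per backbone stage, with each stage's pool adapted to that stage's voxel features; the single-bank version above is only the smallest self-contained illustration of the mechanism.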
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2403.19220 [cs.CV]
  (or arXiv:2403.19220v1 [cs.CV] for this version)

Submission history

From: Shengjun Zhang
[v1] Thu, 28 Mar 2024 08:34:04 GMT (2654kb,D)
