Uncertainty Guided Depth Fusion for Spike Camera

Li, Jianing; Liu, Jiaming; Wei, Xiaobao; Zhang, Jiyuan; Lu, Ming; Ma, Lei; Du, Li; Huang, Tiejun; Zhang, Shanghang

(Submitted on 26 Aug 2022 (v1), last revised 29 Aug 2022 (this version, v2))

Abstract: Depth estimation is essential for many real-world applications such as autonomous driving. However, it suffers from severe performance degradation in high-velocity scenarios, since traditional cameras can only capture blurred images. To address this problem, the spike camera is designed to capture pixel-wise luminance intensity at a high frame rate. However, depth estimation with a spike camera remains very challenging using traditional monocular or stereo depth estimation algorithms, which are based on photometric consistency. In this paper, we propose a novel Uncertainty-Guided Depth Fusion (UGDF) framework to fuse the predictions of monocular and stereo depth estimation networks for the spike camera. Our framework is motivated by the fact that stereo spike depth estimation achieves better results at close range, while monocular spike depth estimation obtains better results at long range. Therefore, we introduce a dual-task depth estimation architecture with a joint training strategy and estimate the distributed uncertainty to fuse the monocular and stereo results. To demonstrate the advantage of spike depth estimation over traditional camera depth estimation, we contribute a spike-depth dataset named CitySpike20K, which contains 20K paired samples. UGDF achieves state-of-the-art results on CitySpike20K, surpassing all monocular and stereo spike depth estimation baselines. We conduct extensive experiments to evaluate the effectiveness and generalization of our method on CitySpike20K. To the best of our knowledge, our framework is the first dual-task fusion framework for spike camera depth estimation. Code and dataset will be released.
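
The fusion idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify the fusion rule, so the inverse-uncertainty weighting below is an assumption chosen for illustration and should not be read as the UGDF method itself. The function name `fuse_depth`, its parameters, and the toy values are all hypothetical.

```python
def fuse_depth(mono_depth, mono_unc, stereo_depth, stereo_unc, eps=1e-6):
    """Fuse two per-pixel depth maps by inverse-uncertainty weighting.

    Assumption: each input is a list of rows of floats with the same shape,
    and lower uncertainty means a more trustworthy prediction. This is an
    illustrative rule, not the fusion rule from the paper.
    """
    fused = []
    for md_row, mu_row, sd_row, su_row in zip(
        mono_depth, mono_unc, stereo_depth, stereo_unc
    ):
        row = []
        for md, mu, sd, su in zip(md_row, mu_row, sd_row, su_row):
            # The branch with lower uncertainty receives the larger weight.
            w_mono = 1.0 / (mu + eps)
            w_stereo = 1.0 / (su + eps)
            row.append((w_mono * md + w_stereo * sd) / (w_mono + w_stereo))
        fused.append(row)
    return fused


# Toy 1x2 depth maps (hypothetical values, in meters):
# pixel 0 is near (stereo is confident), pixel 1 is far (monocular is confident).
mono = [[2.0, 40.0]]
stereo = [[1.0, 60.0]]
mono_unc = [[0.9, 0.1]]
stereo_unc = [[0.1, 0.9]]

fused = fuse_depth(mono, mono_unc, stereo, stereo_unc)
print(fused)
```

In this toy case the fused value at the near pixel lands close to the stereo prediction and the fused value at the far pixel lands close to the monocular prediction, which mirrors the close-range and long-range motivation stated in the abstract.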

Comments:	18 pages, 11 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
ACM classes:	I.2.10
Cite as:	arXiv:2208.12653 [cs.CV]
	(or arXiv:2208.12653v2 [cs.CV] for this version)

Submission history

From: Jianing Li
[v1] Fri, 26 Aug 2022 13:04:01 GMT (27754kb,D)
[v2] Mon, 29 Aug 2022 06:48:58 GMT (27754kb,D)
