We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Tunable Hybrid Proposal Networks for the Open World

Abstract: Current state-of-the-art object proposal networks are trained with a closed-world assumption, meaning they learn to only detect objects of the training classes. These models fail to provide high recall in open-world environments where important novel objects may be encountered. While a handful of recent works attempt to tackle this problem, they fail to consider that the optimal behavior of a proposal network can vary significantly depending on the data and application. Our goal is to provide a flexible proposal solution that can be easily tuned to suit a variety of open-world settings. To this end, we design a Tunable Hybrid Proposal Network (THPN) that leverages an adjustable hybrid architecture, a novel self-training procedure, and dynamic loss components to optimize the tradeoff between known and unknown object detection performance. To thoroughly evaluate our method, we devise several new challenges which invoke varying degrees of label bias by altering known class diversity and label count. We find that in every task, THPN easily outperforms existing baselines (e.g., RPN, OLN). Our method is also highly data efficient, surpassing baseline recall with a fraction of the labeled data.
Comments: Published in WACV 2024. 22 pages, 9 figures, 12 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2208.11050 [cs.CV]
  (or arXiv:2208.11050v3 [cs.CV] for this version)

Submission history

From: Matthew Inkawhich [view email]
[v1] Tue, 23 Aug 2022 15:57:19 GMT (2151kb,D)
[v2] Fri, 20 Jan 2023 21:17:20 GMT (4855kb,D)
[v3] Tue, 16 Apr 2024 19:16:34 GMT (4841kb,D)

Link back to: arXiv, form interface, contact.