We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Unsupervised motion segmentation in one go: Smooth long-term model over a video

Abstract: Human beings have the ability to continuously analyze a video and immediately extract the main motion components. Motion segmentation methods often proceed frame by frame. We want to go beyond this classical paradigm, and perform the motion segmentation over a video sequence in one go. It will be a prominent added value for downstream computer vision tasks, and could provide a pretext criterion for unsupervised video representation learning. In this perspective, we propose a novel long-term spatio-temporal model operating in a totally unsupervised way. It takes as input the volume of consecutive optical flow (OF) fields, and delivers a volume of segments of coherent motion over the video. More specifically, we have designed a transformer-based network, where we leverage a mathematically well-founded framework, the Evidence Lower Bound (ELBO), to infer the loss function. The loss function combines a flow reconstruction term involving spatio-temporal parametric motion models combining, in a novel way, polynomial (quadratic) motion models for the $(x,y)$-spatial dimensions and B-splines for the time dimension of the video sequence, and a regularization term enforcing temporal consistency on the masks. We report experiments on four VOS benchmarks with convincing quantitative results. We also highlight through visual results the key contributions on temporal consistency brought by our method.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2310.01040 [cs.CV]
  (or arXiv:2310.01040v1 [cs.CV] for this version)

Submission history

From: Etienne Meunier [view email]
[v1] Mon, 2 Oct 2023 09:33:54 GMT (39996kb,D)
[v2] Sun, 28 Jan 2024 01:15:50 GMT (39984kb,D)
[v3] Wed, 17 Apr 2024 17:44:24 GMT (32395kb,D)

Link back to: arXiv, form interface, contact.