MVEB: Self-Supervised Learning with Multi-View Entropy Bottleneck

Wen, Liangjian; Wang, Xiasi; Liu, Jianzhuang; Xu, Zenglin

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2403

Computer Science > Computer Vision and Pattern Recognition

Title: MVEB: Self-Supervised Learning with Multi-View Entropy Bottleneck

Authors: Liangjian Wen, Xiasi Wang, Jianzhuang Liu, Zenglin Xu

(Submitted on 28 Mar 2024)

Abstract: Self-supervised learning aims to learn representation that can be effectively generalized to downstream tasks. Many self-supervised approaches regard two views of an image as both the input and the self-supervised signals, assuming that either view contains the same task-relevant information and the shared information is (approximately) sufficient for predicting downstream tasks. Recent studies show that discarding superfluous information not shared between the views can improve generalization. Hence, the ideal representation is sufficient for downstream tasks and contains minimal superfluous information, termed minimal sufficient representation. One can learn this representation by maximizing the mutual information between the representation and the supervised view while eliminating superfluous information. Nevertheless, the computation of mutual information is notoriously intractable. In this work, we propose an objective termed multi-view entropy bottleneck (MVEB) to learn minimal sufficient representation effectively. MVEB simplifies the minimal sufficient learning to maximizing both the agreement between the embeddings of two views and the differential entropy of the embedding distribution. Our experiments confirm that MVEB significantly improves performance. For example, it achieves top-1 accuracy of 76.9\% on ImageNet with a vanilla ResNet-50 backbone on linear evaluation. To the best of our knowledge, this is the new state-of-the-art result with ResNet-50.

Comments:	Accepted by TPAMI
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.19078 [cs.CV]
	(or arXiv:2403.19078v1 [cs.CV] for this version)

Submission history

From: Liangjian Wen PhD. [view email]
[v1] Thu, 28 Mar 2024 00:50:02 GMT (5494kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2403.19078

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: MVEB: Self-Supervised Learning with Multi-View Entropy Bottleneck

Submission history