We gratefully acknowledge support from
the Simons Foundation and member institutions.

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 125

[ total of 425 entries: 1-25 | ... | 51-75 | 76-100 | 101-125 | 126-150 | 151-175 | 176-200 | 201-225 | ... | 401-425 ]
[ showing 25 entries per page: fewer | more | all ]

Thu, 16 May 2024 (continued, showing last 22 of 57 entries)

[126]  arXiv:2405.09041 [pdf, other]
Title: Learning from Partial Label Proportions for Whole Slide Image Segmentation
Comments: Accepted at MICCAI2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127]  arXiv:2405.09032 [pdf, other]
Title: ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expression Recognition
Comments: Accept by ICDAR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128]  arXiv:2405.09024 [pdf, other]
Title: Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[129]  arXiv:2405.09006 [pdf, other]
Title: Spatial Semantic Recurrent Mining for Referring Image Segmentation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[130]  arXiv:2405.08996 [pdf, other]
Title: Learning Correspondence for Deformable Objects
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131]  arXiv:2405.08992 [pdf, other]
Title: Contextual Emotion Recognition using Large Vision Language Models
Comments: 8 pages, website: this https URL arXiv admin note: text overlap with arXiv:2310.19995
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[132]  arXiv:2405.08991 [pdf, other]
Title: Theoretical Analysis for Expectation-Maximization-Based Multi-Model 3D Registration
Comments: arXiv admin note: substantial text overlap with arXiv:2402.10865
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[133]  arXiv:2405.08961 [pdf, other]
Title: Bird's-Eye View to Street-View: A Survey
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[134]  arXiv:2405.08932 [pdf, other]
Title: Self-supervised vision-langage alignment of deep learning representations for bone X-rays analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[135]  arXiv:2405.08911 [pdf, other]
Title: CLIP with Quality Captions: A Strong Pretraining for Vision Tasks
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[136]  arXiv:2405.08909 [pdf, other]
Title: ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association
Comments: 14 pages, 3 figures, accepted by CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137]  arXiv:2405.08890 [pdf, other]
Title: Language-Guided Self-Supervised Video Summarization Using Text Semantic Matching Considering the Diversity of the Video
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[138]  arXiv:2405.09539 (cross-list from eess.IV) [pdf, ps, other]
Title: MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer
Comments: Early accepted to MICCAI 2024 (6/6/5)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[139]  arXiv:2405.09530 (cross-list from cs.CY) [pdf, other]
[140]  arXiv:2405.09472 (cross-list from eess.IV) [pdf, other]
Title: Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[141]  arXiv:2405.09353 (cross-list from eess.IV) [pdf, other]
Title: Large coordinate kernel attention network for lightweight image super-resolution
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[142]  arXiv:2405.09298 (cross-list from eess.IV) [pdf, ps, other]
Title: Deep Blur Multi-Model (DeepBlurMM) -- a strategy to mitigate the impact of image blur on deep learning model performance in histopathology image analysis
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[143]  arXiv:2405.09286 (cross-list from cs.MM) [pdf, other]
Title: MVBIND: Self-Supervised Music Recommendation For Videos Via Embedding Space Binding
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[144]  arXiv:2405.09077 (cross-list from eess.IV) [pdf, other]
Title: Compressive Feature Selection for Remote Visual Multi-Task Inference
Comments: 6 pages, 8 figures, IEEE ICME Workshop on Coding for Machines
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[145]  arXiv:2405.09049 (cross-list from cs.LG) [pdf, other]
Title: Perception Without Vision for Trajectory Prediction: Ego Vehicle Dynamics as Scene Representation for Efficient Active Learning in Autonomous Driving
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[146]  arXiv:2405.08981 (cross-list from cs.HC) [pdf, other]
Title: Impact of Design Decisions in Scanpath Modeling
Comments: 16 pages
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[147]  arXiv:2405.08920 (cross-list from cs.LG) [pdf, other]
Title: Neural Collapse Meets Differential Privacy: Curious Behaviors of NoisyGD with Near-perfect Representation Learning
Comments: To appear in ICML 2024
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)

Wed, 15 May 2024 (showing first 3 of 76 entries)

[148]  arXiv:2405.08816 [pdf, other]
[149]  arXiv:2405.08815 [pdf, other]
Title: Efficient Vision-Language Pre-training by Cluster Masking
Comments: CVPR 2024, Project page: this https URL , Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[150]  arXiv:2405.08813 [pdf, other]
Title: CinePile: A Long Video Question Answering Dataset and Benchmark
Comments: Project page with all the artifacts - this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[ total of 425 entries: 1-25 | ... | 51-75 | 76-100 | 101-125 | 126-150 | 151-175 | 176-200 | 201-225 | ... | 401-425 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help  (Access key information)