Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 471

[ total of 614 entries: 1-25 | ... | 397-421 | 422-446 | 447-471 | 472-496 | 497-521 | 522-546 | 547-571 | ... | 597-614 ]
[ showing 25 entries per page: fewer | more | all ]

Mon, 20 May 2024 (continued, showing 25 of 55 entries)

[472] arXiv:2405.10885 [pdf, other]: Title: FA-Depth: Toward Fast and Accurate Self-supervised Monocular Depth Estimation

Authors: Fei Wang, Jun Cheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[473] arXiv:2405.10879 [pdf, other]: Title: One registration is worth two segmentations

Authors: Shiqi Huang, Tingfa Xu, Ziyi Shen, Shaheer Ullah Saeed, Wen Yan, Dean Barratt, Yipeng Hu

Comments: Early Accepted by MICCAI2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[474] arXiv:2405.10871 [pdf, other]: Title: BraTS-Path Challenge: Assessing Heterogeneous Histopathologic Brain Tumor Sub-regions

Authors: Spyridon Bakas, Siddhesh P. Thakur, Shahriar Faghani, Mana Moassefi, Ujjwal Baid, Verena Chung, Sarthak Pati, Shubham Innani, Bhakti Baheti, Jake Albrecht, Alexandros Karargyris, Hasan Kassem, MacLean P. Nasrallah, Jared T. Ahrendsen, Valeria Barresi, Maria A. Gubbiotti, Giselle Y. López, Calixto-Hope G. Lucas, Michael L. Miller, Lee A. D. Cooper, Jason T. Huse, William R. Bell

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[475] arXiv:2405.10868 [pdf, other]: Title: Air Signing and Privacy-Preserving Signature Verification for Digital Documents

Authors: P. Sarveswarasarma, T. Sathulakjan, V. J. V. Godfrey, Thanuja D. Ambegoda

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[476] arXiv:2405.10864 [pdf, other]: Title: Improving face generation quality and prompt following with synthetic captions

Authors: Michail Tarasiou, Stylianos Moschoglou, Jiankang Deng, Stefanos Zafeiriou

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[477] arXiv:2405.10842 [pdf, ps, other]: Title: Automated Radiology Report Generation: A Review of Recent Advances

Authors: Phillip Sloan, Philip Clatworthy, Edwin Simpson, Majid Mirmehdi

Comments: 24 pages, 8 figures, 6 tables. Submitted to IEEE Reviews in Biomedical Engineering

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[478] arXiv:2405.10832 [pdf, other]: Title: Open-Vocabulary Spatio-Temporal Action Detection

Authors: Tao Wu, Shuqiu Ge, Jie Qin, Gangshan Wu, Limin Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[479] arXiv:2405.10802 [pdf, other]: Title: Reduced storage direct tensor ring decomposition for convolutional neural networks compression

Authors: Mateusz Gabor, Rafał Zdunek

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[480] arXiv:2405.10748 [pdf, other]: Title: Deep Data Consistency: a Fast and Robust Diffusion Model-based Solver for Inverse Problems

Authors: Hanyu Chen, Zhixiu Hao, Liying Xiao

Comments: Codes: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[481] arXiv:2405.10739 [pdf, other]: Title: Efficient Multimodal Large Language Models: A Survey

Authors: Yizhang Jin, Jian Li, Yexin Liu, Tianjun Gu, Kai Wu, Zhengkai Jiang, Muyang He, Bo Zhao, Xin Tan, Zhenye Gan, Yabiao Wang, Chengjie Wang, Lizhuang Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[482] arXiv:2405.10736 [pdf, other]: Title: StackOverflowVQA: Stack Overflow Visual Question Answering Dataset

Authors: Motahhare Mirzaei, Mohammad Javad Pirhadi, Sauleh Eetemadi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[483] arXiv:2405.10718 [pdf, other]: Title: SignLLM: Sign Languages Production Large Language Models

Authors: Sen Fang, Lei Wang, Ce Zheng, Yapeng Tian, Chen Chen

Comments: 33 pages, website at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[484] arXiv:2405.10707 [pdf, ps, other]: Title: HARIS: Human-Like Attention for Reference Image Segmentation

Authors: Mengxi Zhang, Heqing Lian, Yiming Liu, Jie Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[485] arXiv:2405.10696 [pdf, other]: Title: Autonomous AI-enabled Industrial Sorting Pipeline for Advanced Textile Recycling

Authors: Yannis Spyridis, Vasileios Argyriou, Antonios Sarigiannidis, Panagiotis Radoglou, Panagiotis Sarigiannidis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[486] arXiv:2405.10690 [pdf, other]: Title: CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing

Authors: Faegheh Sardari, Armin Mustafa, Philip J. B. Jackson, Adrian Hilton

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[487] arXiv:2405.10674 [pdf, other]: Title: From Sora What We Can See: A Survey of Text-to-Video Generation

Authors: Rui Sun, Yumin Zhang, Tejal Shah, Jiahao Sun, Shuoying Zhang, Wenqi Li, Haoran Duan, Bo Wei, Rajiv Ranjan

Comments: A comprehensive list of text-to-video generation studies in this survey is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[488] arXiv:2405.10612 [pdf, other]: Title: Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers

Authors: Sheng Yang, Jiawang Bai, Kuofeng Gao, Yong Yang, Yiming Li, Shu-tao Xia

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[489] arXiv:2405.10610 [pdf, other]: Title: Driving Referring Video Object Segmentation with Vision-Language Pre-trained Models

Authors: Zikun Zhou, Wentao Xiong, Li Zhou, Xin Li, Zhenyu He, Yaowei Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[490] arXiv:2405.10598 [pdf, other]: Title: Learning Object-Centric Representation via Reverse Hierarchy Guidance

Authors: Junhong Zou, Xiangyu Zhu, Zhaoxiang Zhang, Zhen Lei

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[491] arXiv:2405.10591 [pdf, other]: Title: GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision

Authors: Xin Tan, Wenbin Wu, Zhiwei Zhang, Chaojie Fan, Yong Peng, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[492] arXiv:2405.10589 [pdf, other]: Title: Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance

Authors: I-Hsiang Chen, Wei-Ting Chen, Yu-Wei Liu, Ming-Hsuan Yang, Sy-Yen Kuo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[493] arXiv:2405.10577 [pdf, other]: Title: DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection

Authors: Zhe Huang, Yizhe Zhao, Hao Xiao, Chenyan Wu, Lingting Ge

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[494] arXiv:2405.10575 [pdf, other]: Title: Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory

Authors: Jonas Kälble, Sascha Wirges, Maxim Tatarchenko, Eddy Ilg

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[495] arXiv:2405.10567 [pdf, other]: Title: Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track

Authors: Xiaoshuai Hao, Yifan Yang, Hui Zhang, Mengchuan Wei, Yi Zhou, Haimei Zhao, Jing Zhang

Comments: ICRA 2024 RoboDrive Challenge Robust Map Segmentation Track 3rd Place Technical Report. arXiv admin note: text overlap with arXiv:2205.09743 by other authors

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[496] arXiv:2405.10557 [pdf, other]: Title: Resolving Symmetry Ambiguity in Correspondence-based Methods for Instance-level Object Pose Estimation

Authors: Yongliang Lin, Yongzhi Su, Sandeep Inuganti, Yan Di, Naeem Ajilforoushan, Hanqing Yang, Yu Zhang, Jason Rambach

Comments: 8 pages,10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)

[ total of 614 entries: 1-25 | ... | 397-421 | 422-446 | 447-471 | 472-496 | 497-521 | 522-546 | 547-571 | ... | 597-614 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2405, contact, help (Access key information)

> cs > cs.CV

Computer Vision and Pattern Recognition

Authors and titles for recent submissions, skipping first 471

Mon, 20 May 2024 (continued, showing 25 of 55 entries)