Image and Video Processing

Authors and titles for recent submissions

[ total of 78 entries: 1-78 ]

Fri, 17 May 2024

[1]  arXiv:2405.10254 [pdf, other]
Title: PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2]  arXiv:2405.10246 [pdf, other]
Title: A Foundation Model for Brain Lesion Segmentation with Mixture of Modality Experts
Comments: The work has been early accepted by MICCAI 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[3]  arXiv:2405.10186 [pdf, other]
Title: Introducing Learning Rate Adaptation CMA-ES into Rigid 2D/3D Registration for Robotic Navigation in Spine Surgery
Comments: Technical Report
Subjects: Image and Video Processing (eess.IV)
[4]  arXiv:2405.10068 [pdf, other]
Title: MrRegNet: Multi-resolution Mask Guided Convolutional Neural Network for Medical Image Registration with Large Deformations
Comments: Accepted for publication at IEEE International Symposium on Biomedical Imaging (ISBI) 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[5]  arXiv:2405.10004 [pdf, other]
Title: ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset
Comments: Major revision Scientific Data
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[6]  arXiv:2405.09990 [pdf, other]
Title: Histopathology Foundation Models Enable Accurate Ovarian Cancer Subtype Classification
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[7]  arXiv:2405.09959 [pdf, other]
Title: Patient-Specific Real-Time Segmentation in Trackerless Brain Ultrasound
Comments: Early accept at MICCAI 2024; code available
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[8]  arXiv:2405.09896 [pdf, other]
Title: Confidence Estimation in Unsupervised Deep Change Vector Analysis
Authors: Sudipan Saha
Subjects: Image and Video Processing (eess.IV)
[9]  arXiv:2405.09851 [pdf, other]
Title: Region of Interest Detection in Melanocytic Skin Tumor Whole Slide Images -- Nevus & Melanoma
Comments: 5 figures, NeurIPS 2022 Workshop
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[10]  arXiv:2405.09787 [pdf, other]
[11]  arXiv:2405.09716 [pdf, other]
Title: Illumination Histogram Consistency Metric for Quantitative Assessment of Video Sequences
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[12]  arXiv:2405.09594 [pdf, other]
Title: Learning Generalized Medical Image Representations through Image-Graph Contrastive Pretraining
Comments: Accepted into Machine Learning for Health (ML4H) 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[13]  arXiv:2405.09586 [pdf, other]
Title: Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report Generation
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[14]  arXiv:2405.09553 [pdf, ps, other]
Title: Computer aided diagnosis system for Alzheimers disease using principal component analysis and machine learning based approaches
Authors: Lilia Lazli
Comments: Accepted for CIBB 2021: The 17th International Conference on Computational Intelligence Methods for Bioinformatics and Biostatistics
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI)
[15]  arXiv:2405.09552 [pdf, other]
Title: ODFormer: Semantic Fundus Image Segmentation Using Transformer for Optic Nerve Head Detection
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[16]  arXiv:2405.09549 [pdf, other]
Title: Deep-learning-based clustering of OCT images for biomarker discovery in age-related macular degeneration (Pinnacle study report 4)
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI)
[17]  arXiv:2405.10272 (cross-list from cs.CV) [pdf, other]
Title: Faces that Speak: Jointly Synthesising Talking Face and Speech from Text
Comments: CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
[18]  arXiv:2405.10014 (cross-list from cs.CV) [pdf, other]
Title: Frequency-Domain Refinement with Multiscale Diffusion for Super Resolution
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[19]  arXiv:2405.09923 (cross-list from cs.CV) [pdf, other]
Title: NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[20]  arXiv:2405.09873 (cross-list from cs.CV) [pdf, other]
Title: IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[21]  arXiv:2405.09582 (cross-list from cs.CV) [pdf, other]
Title: AD-Aligning: Emulating Human-like Generalization for Cognitive Domain Adaptation in Deep Learning
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)

Thu, 16 May 2024

[22]  arXiv:2405.09539 [pdf, ps, other]
Title: MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer
Comments: Early accepted to MICCAI 2024 (6/6/5)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[23]  arXiv:2405.09472 [pdf, other]
Title: Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment
Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[24]  arXiv:2405.09446 [pdf, other]
Title: M$^4$oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of Experts
Subjects: Image and Video Processing (eess.IV)
[25]  arXiv:2405.09353 [pdf, other]
Title: Large coordinate kernel attention network for lightweight image super-resolution
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[26]  arXiv:2405.09298 [pdf, ps, other]
Title: Deep Blur Multi-Model (DeepBlurMM) -- a strategy to mitigate the impact of image blur on deep learning model performance in histopathology image analysis
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[27]  arXiv:2405.09234 [pdf, other]
Title: Enhancing Image Privacy in Semantic Communication over Wiretap Channels leveraging Differential Privacy
Subjects: Image and Video Processing (eess.IV)
[28]  arXiv:2405.09077 [pdf, other]
Title: Compressive Feature Selection for Remote Visual Multi-Task Inference
Comments: 6 pages, 8 figures, IEEE ICME Workshop on Coding for Machines
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[29]  arXiv:2405.09291 (cross-list from cs.CV) [pdf, other]
Title: Sensitivity Decouple Learning for Image Compression Artifacts Reduction
Comments: Accepted by Transactions on Image Processing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)

Wed, 15 May 2024

[30]  arXiv:2405.08783 [pdf, other]
Title: The Developing Human Connectome Project: A Fast Deep Learning-based Pipeline for Neonatal Cortical Surface Reconstruction
Subjects: Image and Video Processing (eess.IV)
[31]  arXiv:2405.08745 [pdf, other]
Title: Enhancing Blind Video Quality Assessment with Rich Quality-aware Features
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[32]  arXiv:2405.08672 [pdf, other]
Title: EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera
Comments: early accepted by MICCAI 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[33]  arXiv:2405.08658 [pdf, other]
Title: Beyond the Black Box: Do More Complex Models Provide Superior XAI Explanations?
Comments: 15 pages, 9 figures, 5 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[34]  arXiv:2405.08657 [pdf, other]
Title: Self-supervised learning improves robustness of deep learning lung tumor segmentation to CT imaging differences
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[35]  arXiv:2405.08621 [pdf, other]
Title: RMT-BVQA: Recurrent Memory Transformer-based Blind Video Quality Assessment for Enhanced Video Content
Comments: 8 pages, 2 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[36]  arXiv:2405.08556 [pdf, other]
Title: Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentation
Comments: 14 pages, 7 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[37]  arXiv:2405.08530 [pdf, other]
Title: Parameter-Efficient Instance-Adaptive Neural Video Compression
Comments: 23 pages, 13 figures
Subjects: Image and Video Processing (eess.IV)
[38]  arXiv:2405.08431 [pdf, other]
Title: Similarity Metrics for MR Image-To-Image Translation
Comments: 29 pages, 6 figures, appendix with 5 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[39]  arXiv:2405.08423 [pdf, other]
Title: NAFRSSR: a Lightweight Recursive Network for Efficient Stereo Image Super-Resolution
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[40]  arXiv:2405.08282 [pdf, ps, other]
Title: Automatic Segmentation of the Kidneys and Cystic Renal Lesions on Non-Contrast CT Using a Convolutional Neural Network
Authors: Lucas Aronson (1), Ruben Ngnitewe Massaa (1), Syed Jamal Safdar Gardezi (1), Andrew L. Wentland (1,2,3) ((1) Department of Radiology, University of Wisconsin School of Medicine & Public Health, Madison, WI, USA, (2) Department of Medical Physics, University of Wisconsin School of Medicine & Public Health, Madison, WI, USA, (3) Department of Biomedical Engineering, University of Wisconsin School of Medicine & Public Health, Madison, WI, USA)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[41]  arXiv:2405.08247 [pdf, other]
Title: Automated classification of multi-parametric body MRI series
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI)
[42]  arXiv:2405.08179 [pdf, other]
Title: Do Bayesian imaging methods report trustworthy probabilities?
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Signal Processing (eess.SP); Applications (stat.AP); Machine Learning (stat.ML)
[43]  arXiv:2405.08169 [pdf, other]
Title: Rethinking Histology Slide Digitization Workflows for Low-Resource Settings
Comments: MICCAI 2024 Early Accept. First four authors contributed equally
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[44]  arXiv:2405.08049 [pdf, other]
Title: Optimizing Synthetic Correlated Diffusion Imaging for Breast Cancer Tumour Delineation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[45]  arXiv:2405.07994 [pdf, ps, other]
Title: BubbleID: A Deep Learning Framework for Bubble Interface Dynamics Analysis
Comments: 16 pages, 4 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[46]  arXiv:2405.08654 (cross-list from cs.LG) [pdf, other]
Title: Can we Defend Against the Unknown? An Empirical Study About Threshold Selection for Neural Network Monitoring
Comments: 13 pages, 5 figures, 6 tables. To appear in the proceedings of the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)

Tue, 14 May 2024

[47]  arXiv:2405.07905 [pdf, other]
[48]  arXiv:2405.07869 [pdf, other]
Title: Enhancing Clinically Significant Prostate Cancer Prediction in T2-weighted Images through Transfer Learning from Breast Cancer
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[49]  arXiv:2405.07861 [pdf, other]
Title: Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[50]  arXiv:2405.07854 [pdf, other]
Title: Using Multiparametric MRI with Optimized Synthetic Correlated Diffusion Imaging to Enhance Breast Cancer Pathologic Complete Response Prediction
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[51]  arXiv:2405.07762 [pdf, other]
Title: A method for supervoxel-wise association studies of age and other non-imaging variables from coronary computed tomography angiograms
Comments: 34 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[52]  arXiv:2405.07717 [pdf, other]
Title: On the Adversarial Robustness of Learning-based Image Compression Against Rate-Distortion Attacks
Subjects: Image and Video Processing (eess.IV)
[53]  arXiv:2405.07674 [pdf, other]
Title: CoVScreen: Pitfalls and recommendations for screening COVID-19 using Chest X-rays
Authors: Sonit Singh
Comments: 21 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[54]  arXiv:2405.07338 [pdf, other]
Title: Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[55]  arXiv:2405.07256 [pdf, other]
Title: Leveraging Fixed and Dynamic Pseudo-labels for Semi-supervised Medical Image Segmentation
Comments: Under Review
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[56]  arXiv:2405.07050 [pdf, ps, other]
Title: Neuromorphic Vision Data Coding: Classifying and Reviewing
Comments: This article has been submitted to IEEE Access
Subjects: Image and Video Processing (eess.IV)
[57]  arXiv:2405.07023 [pdf, other]
Title: Efficient Real-world Image Super-Resolution Via Adaptive Directional Gradient Convolution
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[58]  arXiv:2405.06880 [pdf, other]
Title: EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation
Comments: 14 pages, 5 figures, 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[59]  arXiv:2405.06838 [pdf, other]
Title: Merging Point Data for InSAR Deformation Processing
Comments: 9 pages, 5 figures, one table
Subjects: Image and Video Processing (eess.IV)
[60]  arXiv:2405.06789 [pdf, other]
Title: Self-Consistent Recursive Diffusion Bridge for Medical Image Translation
Comments: 11 pages, 6 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[61]  arXiv:2405.06786 [pdf, other]
Title: SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[62]  arXiv:2405.07777 (cross-list from cs.CV) [pdf, other]
Title: GMSR:Gradient-Guided Mamba for Spectral Reconstruction from RGB Images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[63]  arXiv:2405.07776 (cross-list from cs.CV) [pdf, other]
Title: SAR Image Synthesis with Diffusion Models
Comments: Published at IEEE Radar Conference 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[64]  arXiv:2405.07759 (cross-list from cs.MM) [pdf, other]
Title: MADRL-Based Rate Adaptation for 360$\degree$ Video Streaming with Multi-Viewpoint Prediction
Comments: Accepted by IEEE Internet of Things Journal
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Networking and Internet Architecture (cs.NI); Image and Video Processing (eess.IV)
[65]  arXiv:2405.07648 (cross-list from cs.CV) [pdf, other]
Title: CDFormer:When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[66]  arXiv:2405.07033 (cross-list from cs.NI) [pdf, ps, other]
Title: A Performance Analysis Modeling Framework for Extended Reality Applications in Edge-Assisted Wireless Networks
Comments: 12 pages, 4 figures; To appear in Proceedings of IEEE International Conference on Distributed Computing Systems (ICDCS), 2024
Subjects: Networking and Internet Architecture (cs.NI); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Image and Video Processing (eess.IV)

Mon, 13 May 2024

[67]  arXiv:2405.06463 [pdf, other]
Title: MRSegmentator: Robust Multi-Modality Segmentation of 40 Classes in MRI and CT Sequences
Comments: 13 pages, 6 figures; corrected co-author info
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[68]  arXiv:2405.06284 [pdf, other]
Title: Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale Attention
Comments: Accepted in Computer Vision and Pattern Recognition (CVPR) 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[69]  arXiv:2405.06230 [pdf, ps, other]
Title: Fire in SRRN: Next-Gen 3D Temperature Field Reconstruction Technology
Subjects: Image and Video Processing (eess.IV)
[70]  arXiv:2405.06188 [pdf, other]
Title: Multidimensional empirical wavelet transform
Subjects: Image and Video Processing (eess.IV)
[71]  arXiv:2405.06178 [pdf, other]
Title: ACTION: Augmentation and Computation Toolbox for Brain Network Analysis with Functional MRI
Comments: 14 pages, 5 figures, 5 tables
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC)
[72]  arXiv:2405.06175 [pdf, other]
Title: Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[73]  arXiv:2405.06166 [pdf, other]
Title: MDNet: Multi-Decoder Network for Abdominal CT Organs Segmentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[74]  arXiv:2405.05980 [pdf, ps, other]
Title: Overcoming challenges of translating deep-learning models for glioblastoma: the ZGBM consortium
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[75]  arXiv:2405.06434 (cross-list from physics.optics) [pdf, ps, other]
Title: Photonic Neuromorphic Accelerator for Convolutional Neural Networks based on an Integrated Reconfigurable Mesh
Comments: 18 pages, 10 figures, submitted to Optica Open
Subjects: Optics (physics.optics); Image and Video Processing (eess.IV)
[76]  arXiv:2405.06342 (cross-list from cs.CV) [pdf, other]
Title: Compression-Realized Deep Structural Network for Video Quality Enhancement
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[77]  arXiv:2405.06198 (cross-list from cs.CV) [pdf, ps, other]
Title: MAPL: Memory Augmentation and Pseudo-Labeling for Semi-Supervised Anomaly Detection
Authors: Junzhuo Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[78]  arXiv:2404.17736 (cross-list from eess.SP) [pdf, other]
Title: Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Image and Video Processing (eess.IV)