Multimedia

Authors and titles for recent submissions, skipping first 39

Tue, 21 May 2024

[17]  arXiv:2405.11742 [pdf, other]
Title: Universal Organizer of SAM for Unsupervised Semantic Segmentation
Comments: accepted by IEEE International Conference on Multimedia & Expo
Subjects: Multimedia (cs.MM)
[18]  arXiv:2405.12221 (cross-list from cs.CV) [pdf, other]
Title: Images that Sound: Composing Images and Sounds on a Single Canvas
Comments: Project site: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[19]  arXiv:2405.12126 (cross-list from cs.CV) [pdf, other]
Title: Alzheimer's Magnetic Resonance Imaging Classification Using Deep and Meta-Learning Models
Subjects: Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Machine Learning (cs.LG); Multimedia (cs.MM)
[20]  arXiv:2405.11295 (cross-list from eess.IV) [pdf, ps, other]
Title: Medical Image Analysis for Detection, Treatment and Planning of Disease using Artificial Intelligence Approaches
Comments: 10 pages, 3 figures
Journal-ref: International Journal of Microsystems and IoT, Vol. 1, Issue 5, pp. 278-287, 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[21]  arXiv:2405.11273 (cross-list from cs.AI) [pdf, other]
Title: Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
Comments: 22 pages, 13 figures. Project website: this https URL. Work in progress
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[22]  arXiv:2405.11145 (cross-list from cs.CV) [pdf, other]
Title: Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[23]  arXiv:2405.11093 (cross-list from eess.AS) [pdf, other]
Title: AudioSetMix: Enhancing Audio-Language Datasets with LLM-Assisted Augmentations
Authors: David Xu
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Multimedia (cs.MM); Sound (cs.SD)

Mon, 20 May 2024

[24]  arXiv:2405.10497 [pdf, other]
Title: SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge
Comments: ACM Multimedia. arXiv admin note: text overlap with arXiv:1910.01795
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Social and Information Networks (cs.SI)

Fri, 17 May 2024

[25]  arXiv:2405.10029 [pdf, other]
Title: AsCL: An Asymmetry-sensitive Contrastive Learning Method for Image-Text Retrieval with Cross-Modal Fusion
Comments: Accepted (strong accept) as an oral conference paper at the IEEE International Conference on Multimedia & Expo (ICME) 2024
Subjects: Multimedia (cs.MM)
[26]  arXiv:2405.10121 (cross-list from cs.CL) [pdf, other]
Title: Distilling Implicit Multimodal Knowledge into LLMs for Zero-Resource Dialogue Generation
Comments: Under Review
Subjects: Computation and Language (cs.CL); Multimedia (cs.MM)