We gratefully acknowledge support from
the Simons Foundation and member institutions.

Audio and Speech Processing

Authors and titles for recent submissions, skipping first 38

[ total of 51 entries: 1-25 | 14-38 | 39-51 ]
[ showing 25 entries per page: fewer | more | all ]

Tue, 23 Apr 2024 (continued, showing last 13 of 17 entries)

[39]  arXiv:2404.13789 (cross-list from cs.SD) [pdf, other]
Title: Anchor-aware Deep Metric Learning for Audio-visual Retrieval
Comments: 9 pages, 5 figures. Accepted by ACM ICMR 2024
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[40]  arXiv:2404.13569 (cross-list from cs.SD) [pdf, other]
Title: Musical Word Embedding for Music Tagging and Retrieval
Comments: Submitted to IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP)
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[41]  arXiv:2404.13568 (cross-list from cs.SD) [pdf, ps, other]
Title: Sparse Direction of Arrival Estimation Method Based on Vector Signal Reconstruction with a Single Vector Sensor
Authors: Jiabin Guo
Comments: 20 pages
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[42]  arXiv:2404.13551 (cross-list from cs.SD) [pdf, other]
Title: AudioRepInceptionNeXt: A lightweight single-stream architecture for efficient audio recognition
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[43]  arXiv:2404.13509 (cross-list from cs.SD) [pdf, ps, other]
Title: MFHCA: Enhancing Speech Emotion Recognition Via Multi-Spatial Fusion and Hierarchical Cooperative Attention
Comments: Main paper (5 pages). Accepted for publication by ICME 2024
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[44]  arXiv:2404.13428 (cross-list from cs.SD) [pdf, ps, other]
Title: Text-dependent Speaker Verification (TdSV) Challenge 2024: Challenge Evaluation Plan
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[45]  arXiv:2404.13418 (cross-list from cs.HC) [pdf, ps, other]
Title: Interactive tools for making temporally variable, multiple-attributes, and multiple-instances morphing accessible: Flexible manipulation of divergent speech instances for explorational research and education
Comments: 5 pages, 7 figures, submitted to Acoustical Science and Technology of Acoustical Society of Japan
Subjects: Human-Computer Interaction (cs.HC); Audio and Speech Processing (eess.AS)
[46]  arXiv:2404.13362 (cross-list from cs.CL) [pdf, other]
Title: Semantically Corrected Amharic Automatic Speech Recognition
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[47]  arXiv:2404.13358 (cross-list from cs.SD) [pdf, other]
Title: Music Consistency Models
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[48]  arXiv:2404.13289 (cross-list from cs.CL) [pdf, other]
Title: Double Mixture: Towards Continual Event Detection from Speech
Comments: The first two authors contributed equally to this work
Subjects: Computation and Language (cs.CL); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[49]  arXiv:2404.13286 (cross-list from cs.SD) [pdf, other]
Title: Track Role Prediction of Single-Instrumental Sequences
Comments: ISMIR LBD 2023
Subjects: Sound (cs.SD); Information Retrieval (cs.IR); Audio and Speech Processing (eess.AS)
[50]  arXiv:2404.13140 (cross-list from quant-ph) [pdf, ps, other]
Title: Intro to Quantum Harmony: Chords in Superposition
Subjects: Quantum Physics (quant-ph); Emerging Technologies (cs.ET); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[51]  arXiv:2404.13116 (cross-list from eess.SP) [pdf, other]
Title: On fusing active and passive acoustic sensing for simultaneous localization and mapping
Comments: 14 pages, 13 figures, 2 tables, journal submission
Subjects: Signal Processing (eess.SP); Audio and Speech Processing (eess.AS)
[ total of 51 entries: 1-25 | 14-38 | 39-51 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, new, 2404, contact, help  (Access key information)