We gratefully acknowledge support from
the Simons Foundation and member institutions.

Audio and Speech Processing

Authors and titles for eess.AS in Dec 2023

[ total of 233 entries: 1-10 | 11-20 | 21-30 | 31-40 | ... | 231-233 ]
[ showing 10 entries per page: fewer | more | all ]
[1]  arXiv:2312.00174 [pdf, other]
Title: Compression of end-to-end non-autoregressive image-to-speech system for low-resourced devices
Comments: 5 pages, 2 figures, 2 tables, presented at the 15th ITG Conference on Speech Communications, September 2023, Aachen
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[2]  arXiv:2312.00231 [pdf, other]
Title: Learning domain-invariant classifiers for infant cry sounds
Subjects: Audio and Speech Processing (eess.AS)
[3]  arXiv:2312.00249 [pdf, other]
Title: Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities
Subjects: Audio and Speech Processing (eess.AS)
[4]  arXiv:2312.00698 [pdf, other]
Title: SPIRE-SIES: A Spontaneous Indian English Speech Corpus
Comments: 6 pages, 7 plots, 3 tables, Accepted at O-COCOSDA 2023
Subjects: Audio and Speech Processing (eess.AS)
[5]  arXiv:2312.01744 [pdf, other]
Title: SEFGAN: Harvesting the Power of Normalizing Flows and GANs for Efficient High-Quality Speech Enhancement
Comments: Preprint. Accepted to IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2023
Subjects: Audio and Speech Processing (eess.AS)
[6]  arXiv:2312.01808 [pdf, ps, other]
Title: Head Orientation Estimation with Distributed Microphones Using Speech Radiation Patterns
Comments: 6 pages, submitted to 57th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 2023
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[7]  arXiv:2312.02581 [pdf, ps, other]
Title: Auralization based on multi-perspective ambisonic room impulse responses
Comments: 18 pages, published in Acta Acustica (Open Access), datasets are available via this https URL and this https URL
Journal-ref: Acta Acustica, Volume 4, Number 6, Article Number 25, 2020
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[8]  arXiv:2312.02683 [pdf, other]
Title: Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler
Comments: Accepted to ICASSP 2024
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[9]  arXiv:2312.03034 [pdf, other]
Title: Distributed Speech Dereverberation Using Weighted Prediction Error
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[10]  arXiv:2312.03129 [pdf, other]
Title: Leveraging Laryngograph Data for Robust Voicing Detection in Speech
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[ total of 233 entries: 1-10 | 11-20 | 21-30 | 31-40 | ... | 231-233 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, 2406, contact, help  (Access key information)