Audio and Speech Processing

Authors and titles for eess.AS in Dec 2023

[ total of 233 entries: 1-10 | 11-20 | 21-30 | 31-40 | ... | 231-233 ]
[ showing 10 entries per page: fewer | more | all ]

[1] arXiv:2312.00174 [pdf, other]: Title: Compression of end-to-end non-autoregressive image-to-speech system for low-resourced devices

Authors: Gokul Srinivasagan, Michael Deisher, Munir Georges

Comments: 5 pages, 2 figures, 2 tables, presented at the 15th ITG Conference on Speech Communications, September 2023, Aachen

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[2] arXiv:2312.00231 [pdf, other]: Title: Learning domain-invariant classifiers for infant cry sounds

Authors: Charles C. Onu, Hemanth K. Sheetha, Arsenii Gorin, Doina Precup

Subjects: Audio and Speech Processing (eess.AS)
[3] arXiv:2312.00249 [pdf, other]: Title: Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities

Authors: Jinhua Liang, Xubo Liu, Wenwu Wang, Mark D. Plumbley, Huy Phan, Emmanouil Benetos

Subjects: Audio and Speech Processing (eess.AS)
[4] arXiv:2312.00698 [pdf, other]: Title: SPIRE-SIES: A Spontaneous Indian English Speech Corpus

Authors: Abhayjeet Singh, Charu Shah, Rajashri Varadaraj, Sonakshi Chauhan, Prasanta Kumar Ghosh

Comments: 6 pages, 7 plots, 3 tables, Accepted at O-COCOSDA 2023

Subjects: Audio and Speech Processing (eess.AS)
[5] arXiv:2312.01744 [pdf, other]: Title: SEFGAN: Harvesting the Power of Normalizing Flows and GANs for Efficient High-Quality Speech Enhancement

Authors: Martin Strauss, Nicola Pia, Nagashree K. S. Rao, Bernd Edler

Comments: Preprint. Accepted to IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2023

Subjects: Audio and Speech Processing (eess.AS)
[6] arXiv:2312.01808 [pdf, ps, other]: Title: Head Orientation Estimation with Distributed Microphones Using Speech Radiation Patterns

Authors: Kaspar Müller, Bilgesu Çakmak, Paul Didier, Simon Doclo, Jan Østergaard, Tobias Wolff

Comments: 6 pages, submitted to 57th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 2023

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[7] arXiv:2312.02581 [pdf, ps, other]: Title: Auralization based on multi-perspective ambisonic room impulse responses

Authors: Kaspar Müller, Franz Zotter

Comments: 18 pages, published in Acta Acustica (Open Access), datasets are available via this https URL and this https URL

Journal-ref: Acta Acustica, Volume 4, Number 6, Article Number 25, 2020

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[8] arXiv:2312.02683 [pdf, other]: Title: Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler

Authors: Philippe Gonzalez, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen, Tommy Sonne Alstrøm, Tobias May

Comments: Accepted to ICASSP 2024

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[9] arXiv:2312.03034 [pdf, other]: Title: Distributed Speech Dereverberation Using Weighted Prediction Error

Authors: Ziye Yang, Mengfei Zhang, Jie Chen

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[10] arXiv:2312.03129 [pdf, other]: Title: Leveraging Laryngograph Data for Robust Voicing Detection in Speech

Authors: Yixuan Zhang, Heming Wang, DeLiang Wang

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)

[ total of 233 entries: 1-10 | 11-20 | 21-30 | 31-40 | ... | 231-233 ]
[ showing 10 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, eess, 2406, contact, help (Access key information)

> eess > eess.AS

Audio and Speech Processing

Authors and titles for eess.AS in Dec 2023