
Audio and Speech Processing

Authors and titles for eess.AS in Jun 2023

[ total of 377 entries: 1-10 | 11-20 | 21-30 | 31-40 | ... | 371-377 ]
[1]  arXiv:2306.00160 [pdf, other]
Title: Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model
Comments: Accepted by Interspeech 2023
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[2]  arXiv:2306.00203 [pdf, ps, other]
Title: Speaker-independent Speech Inversion for Estimation of Nasalance
Comments: Interspeech 2023
Subjects: Audio and Speech Processing (eess.AS)
[3]  arXiv:2306.00331 [pdf, other]
Title: A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models
Comments: Accepted to Interspeech 2023. Code will be released at this https URL
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Sound (cs.SD); Signal Processing (eess.SP); Systems and Control (eess.SY)
[4]  arXiv:2306.00426 [pdf, ps, other]
Title: Speaker verification using attentive multi-scale convolutional recurrent network
Comments: 21 pages, 6 figures, 8 tables. Accepted for publication in Applied Soft Computing
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[5]  arXiv:2306.00452 [pdf, ps, other]
Title: Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?
Comments: 6 pages
Journal-ref: INTERSPEECH 2023
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
[6]  arXiv:2306.00481 [pdf, other]
Title: Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations
Comments: 6 pages, INTERSPEECH 2023
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
[7]  arXiv:2306.00625 [pdf, other]
Title: Frame-wise and overlap-robust speaker embeddings for meeting diarization
Comments: ICASSP 2023
Subjects: Audio and Speech Processing (eess.AS)
[8]  arXiv:2306.00634 [pdf, other]
Title: A Teacher-Student approach for extracting informative speaker embeddings from speech mixtures
Comments: Proceedings of INTERSPEECH
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[9]  arXiv:2306.00736 [pdf, other]
Title: Spoken Language Identification System for English-Mandarin Code-Switching Child-Directed Speech
Comments: Accepted by Interspeech 2023, 5 pages, 1 figure, 4 tables
Journal-ref: Proc. INTERSPEECH 2023, 4114--4118
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[10]  arXiv:2306.00812 [pdf, other]
Title: Harmonic enhancement using learnable comb filter for light-weight full-band speech enhancement model
Comments: Accepted by Interspeech 2023
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
