Audio and Speech Processing

Authors and titles for eess.AS in Feb 2021, skipping first 75

[ showing entries 76-100 of 208 total ]
[76]  arXiv:2102.12394 [pdf, other]
Title: SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter
Comments: Accepted to ICASSP 2021
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[77]  arXiv:2102.12397 [pdf, other]
Title: Thoughts on the potential to compensate a hearing loss in noise
Comments: 26 pages, 22 figures, related code this https URL
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[78]  arXiv:2102.12624 [pdf, other]
Title: Meta-Learning for improving rare word recognition in end-to-end ASR
Comments: Revised version to be published in the proceedings of ICASSP 2021
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[79]  arXiv:2102.12829 [pdf, other]
Title: Automatic Classification of OSA related Snoring Signals from Nocturnal Audio Recordings
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[80]  arXiv:2102.13334 [pdf, ps, other]
Title: Integration of deep learning with expectation maximization for spatial cue based speech separation in reverberant conditions
Subjects: Audio and Speech Processing (eess.AS)
[81]  arXiv:2102.13397 [pdf, other]
Title: Underwater Acoustic Communication Receiver Using Deep Belief Network
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Signal Processing (eess.SP)
[82]  arXiv:2102.13468 [pdf, other]
Title: The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates
Comments: 5 pages
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[83]  arXiv:2102.04832 (cross-list from eess.SP) [pdf, other]
Title: Fast and Accurate Amplitude Demodulation of Wideband Signals
Comments: Accepted for publication in IEEE Transactions on Signal Processing
Subjects: Signal Processing (eess.SP); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[84]  arXiv:2102.06269 (cross-list from eess.IV) [pdf, other]
Title: Disentanglement for audio-visual emotion recognition using multitask setup
Comments: Accepted for ICASSP 2021, 5 pages
Subjects: Image and Video Processing (eess.IV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[85]  arXiv:2102.06393 (cross-list from eess.SP) [pdf, other]
Title: Mind the beat: detecting audio onsets from EEG recordings of music listening
Comments: To be published in ICASSP 2021. 4 figures, 5 pages (4 pages of content + 1 page of references)
Subjects: Signal Processing (eess.SP); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[86]  arXiv:2102.07896 (cross-list from eess.SP) [pdf, other]
Title: A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images
Comments: 27 pages, 6 figures, 5 tables, submitted to Nature Scientific Data
Subjects: Signal Processing (eess.SP); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
[87]  arXiv:2102.07990 (cross-list from eess.SP) [pdf, other]
Title: Through-the-Wall Radar under Electromagnetic Complex Wall: A Deep Learning Approach
Subjects: Signal Processing (eess.SP); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[88]  arXiv:2102.00151 (cross-list from cs.SD) [pdf, other]
Title: Expressive Neural Voice Cloning
Comments: 12 pages, 2 figures, 2 tables
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[89]  arXiv:2102.00201 (cross-list from cs.SD) [pdf, other]
Title: Melon Playlist Dataset: a public dataset for audio-based playlist generation and music tagging
Comments: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing
Subjects: Sound (cs.SD); Information Retrieval (cs.IR); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[90]  arXiv:2102.00247 (cross-list from cs.CL) [pdf, other]
Title: Triple M: A Practical Text-to-speech Synthesis System With Multi-guidance Attention And Multi-band Multi-time LPCNet
Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[91]  arXiv:2102.00291 (cross-list from cs.SD) [pdf, other]
Title: Speech Recognition by Simply Fine-tuning BERT
Comments: Accepted to ICASSP 2021
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[92]  arXiv:2102.00313 (cross-list from cs.SD) [pdf, other]
Title: Cortical Features for Defense Against Adversarial Audio Attacks
Comments: Co-author legal name changed
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[93]  arXiv:2102.00382 (cross-list from cs.SD) [pdf, other]
Title: Structure-Aware Audio-to-Score Alignment using Progressively Dilated Convolutional Neural Networks
Comments: ICASSP 2021 camera-ready version. Copyrights belong to IEEE
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[94]  arXiv:2102.00429 (cross-list from cs.SD) [pdf, other]
Title: High Fidelity Speech Regeneration with Application to Speech Enhancement
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[95]  arXiv:2102.00550 (cross-list from cs.SD) [pdf, other]
Title: Boosting the Predictive Accurary of Singer Identification Using Discrete Wavelet Transform For Feature Extraction
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[96]  arXiv:2102.00616 (cross-list from cs.SD) [pdf, ps, other]
Title: Neural Network architectures to classify emotions in Indian Classical Music
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[97]  arXiv:2102.01013 (cross-list from cs.CL) [pdf, other]
Title: End2End Acoustic to Semantic Transduction
Comments: Accepted at IEEE ICASSP 2021
Journal-ref: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[98]  arXiv:2102.01133 (cross-list from cs.SD) [pdf, other]
Title: Deep Music Information Dynamics
Authors: Shlomo Dubnov
Journal-ref: The 2020 Joint Conference on AI Music Creativity, October 19-23, 2020, Royal Institute of Technology (KTH), Stockholm, Sweden
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[99]  arXiv:2102.01243 (cross-list from cs.SD) [pdf, other]
Title: PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Comments: Published in IEEE/ACM Transactions on Audio Speech and Language Processing. Code at this https URL
Journal-ref: in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 29, pp. 3292-3306, 2021
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[100]  arXiv:2102.01547 (cross-list from cs.SD) [pdf, other]
Title: WeNet: Production oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit
Comments: 5 pages, 2 figures, 4 tables
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
