Sound

Authors and titles for recent submissions

Fri, 26 Apr 2024

[1] arXiv:2404.16619 [pdf, other]: Title: The THU-HCSI Multi-Speaker Multi-Lingual Few-Shot Voice Cloning System for LIMMITS'24 Challenge

Authors: Yixuan Zhou, Shuoyi Zhou, Shun Lei, Zhiyong Wu, Menglin Wu

Comments: Accepted in Grand Challenge of ICASSP 2024

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[2] arXiv:2404.16436 [pdf, ps, other]: Title: Leveraging tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics

Authors: Ben Williams, Bart van Merriënboer, Vincent Dumoulin, Jenny Hamer, Eleni Triantafillou, Abram B. Fleishman, Matthew McKown, Jill E. Munger, Aaron N. Rice, Ashlee Lillis, Clemency E. White, Catherine A. D. Hobbs, Tries B. Razak, Kate E. Jones, Tom Denton

Comments: 18 pages, 5 figures

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[3] arXiv:2404.16259 [pdf, other]: Title: An Experiment with Electric Guitar Signals for Exploring the Virtuosity based on the Entropy of Music

Authors: Igor Lugo, Martha G. Alatriste-Contreras

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[4] arXiv:2404.16743 (cross-list from cs.CL) [pdf, other]: Title: Automatic Speech Recognition System-Independent Word Error Rate Estimation

Authors: Chanho Park, Mingjie Chen, Thomas Hain

Comments: Accepted to LREC-COLING 2024 (long)

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[5] arXiv:2404.16547 (cross-list from eess.AS) [pdf, other]: Title: Developing Acoustic Models for Automatic Speech Recognition in Swedish

Authors: Giampiero Salvi

Comments: 16 pages, 7 figures

Journal-ref: European Student Journal of Language and Speech, 1999

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Sound (cs.SD)
[6] arXiv:2404.16305 (cross-list from cs.MM) [pdf, other]: Title: Semantically consistent Video-to-Audio Generation using Multimodal Language Large Model

Authors: Gehui Chen, Guan'an Wang, Xiaowen Huang, Jitao Sang

Subjects: Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[7] arXiv:2404.16216 (cross-list from cs.CV) [pdf, other]: Title: ActiveRIR: Active Audio-Visual Exploration for Acoustic Environment Modeling

Authors: Arjun Somayazulu, Sagnik Majumder, Changan Chen, Kristen Grauman

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[8] arXiv:2404.16104 (cross-list from eess.AS) [pdf, other]: Title: Evolution of Voices in French Audiovisual Media Across Genders and Age in a Diachronic Perspective

Authors: Albert Rilliard, David Doukhan, Rémi Uro, Simon Devauchelle

Comments: 5 pages, 2 figures. Keywords: Gender, Diachrony, Vocal Tract Resonance, Vocal register, Broadcast speech

Journal-ref: Radek Skarnitzl & Jan Volín (Eds.), Proceedings of the 20th International Congress of Phonetic Sciences (ICPhS), Prague 2023, pp. 753-757. Guarant International. ISBN 978-80-908 114-2-3

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[9] arXiv:2404.13101 (cross-list from eess.IV) [pdf, ps, other]: Title: DensePANet: An improved generative adversarial network for photoacoustic tomography image reconstruction from sparse data

Authors: Hesam Hakimnejad, Zohreh Azimifar, Narjes Goshtasbi

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD)

Thu, 25 Apr 2024 (showing first 1 of 4 entries)

[10] arXiv:2404.15637 [pdf, other]: Title: HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts

Authors: Xinlei Niu, Jing Zhang, Charles Patrick Martin

Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
