Voice Passing : a Non-Binary Voice Gender Prediction System for evaluating Transgender voice transition

Doukhan, David; Devauchelle, Simon; Girard-Monneron, Lucile; Ruz, Mía Chávez; Chaddouk, V.; Wagner, Isabelle; Rilliard, Albert

doi:10.21437/Interspeech.2023-1835

Full-text links:

Download:

Current browse context:

eess.AS

< prev | next >

new | recent | 2404

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Voice Passing : a Non-Binary Voice Gender Prediction System for evaluating Transgender voice transition

Authors: David Doukhan, Simon Devauchelle, Lucile Girard-Monneron, Mía Chávez Ruz, V. Chaddouk, Isabelle Wagner, Albert Rilliard

(Submitted on 23 Apr 2024)

Abstract: This paper presents a software allowing to describe voices using a continuous Voice Femininity Percentage (VFP). This system is intended for transgender speakers during their voice transition and for voice therapists supporting them in this process. A corpus of 41 French cis- and transgender speakers was recorded. A perceptual evaluation allowed 57 participants to estimate the VFP for each voice. Binary gender classification models were trained on external gender-balanced data and used on overlapping windows to obtain average gender prediction estimates, which were calibrated to predict VFP and obtained higher accuracy than $F_0$ or vocal track length-based models. Training data speaking style and DNN architecture were shown to impact VFP estimation. Accuracy of the models was affected by speakers' age. This highlights the importance of style, age, and the conception of gender as binary or not, to build adequate statistical representations of cultural concepts.

Comments:	5 pages, 1 figure, keywords: Transgender voice, Gender perception, Speaker gender classification, CNN, X-Vector
Subjects:	Audio and Speech Processing (eess.AS); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Sound (cs.SD)
Journal reference:	Proc. INTERSPEECH 2023, 5207-5211
DOI:	10.21437/Interspeech.2023-1835
Cite as:	arXiv:2404.15176 [eess.AS]
	(or arXiv:2404.15176v1 [eess.AS] for this version)

Submission history

From: David Doukhan [view email]
[v1] Tue, 23 Apr 2024 16:15:39 GMT (63kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2404.15176

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Voice Passing : a Non-Binary Voice Gender Prediction System for evaluating Transgender voice transition

Submission history