We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Child Speech Recognition in Human-Robot Interaction: Problem Solved?

Abstract: Automated Speech Recognition shows superhuman performance for adult English speech on a range of benchmarks, but disappoints when fed children's speech. This has long sat in the way of child-robot interaction. Recent evolutions in data-driven speech recognition, including the availability of Transformer architectures and unprecedented volumes of training data, might mean a breakthrough for child speech recognition and social robot applications aimed at children. We revisit a study on child speech recognition from 2017 and show that indeed performance has increased, with newcomer OpenAI Whisper doing markedly better than leading commercial cloud services. While transcription is not perfect yet, the best model recognises 60.3% of sentences correctly barring small grammatical differences, with sub-second transcription time running on a local GPU, showing potential for usable autonomous child-robot speech interactions.
Comments: Presented at 2024 International Symposium on Technological Advances in Human-Robot Interaction
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
Cite as: arXiv:2404.17394 [cs.CL]
  (or arXiv:2404.17394v1 [cs.CL] for this version)

Submission history

From: Ruben Janssens [view email]
[v1] Fri, 26 Apr 2024 13:14:28 GMT (605kb,D)

Link back to: arXiv, form interface, contact.