We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: To Whom are You Talking? A Deep Learning Model to Endow Social Robots with Addressee Estimation Skills

Abstract: Communicating shapes our social word. For a robot to be considered social and being consequently integrated in our social environment it is fundamental to understand some of the dynamics that rule human-human communication. In this work, we tackle the problem of Addressee Estimation, the ability to understand an utterance's addressee, by interpreting and exploiting non-verbal bodily cues from the speaker. We do so by implementing an hybrid deep learning model composed of convolutional layers and LSTM cells taking as input images portraying the face of the speaker and 2D vectors of the speaker's body posture. Our implementation choices were guided by the aim to develop a model that could be deployed on social robots and be efficient in ecological scenarios. We demonstrate that our model is able to solve the Addressee Estimation problem in terms of addressee localisation in space, from a robot ego-centric point of view.
Comments: Accepted v. of IJCNN 2023 publication. Funded by the Horizon Europe project TERAIS (G.A. 101079338), the UKRI Node on Trust (EP/V026682/1), the EU projects TRAINCREASE and MUSAE, and the US project THRIVE++. Cite: this https URL Code: this https URL Data: this https URL 10 pages, 8 Figures, 3 Tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
MSC classes: 68T07, 68T40
ACM classes: I.2.6; I.2.9; I.2.10; J.7
Journal reference: 2023 International Joint Conference on Neural Networks (IJCNN), pp. 1-10
DOI: 10.1109/IJCNN54540.2023.10191452
Cite as: arXiv:2308.10757 [cs.LG]
  (or arXiv:2308.10757v2 [cs.LG] for this version)

Submission history

From: Carlo Mazzola [view email]
[v1] Mon, 21 Aug 2023 14:43:42 GMT (15556kb,D)
[v2] Thu, 28 Mar 2024 08:26:50 GMT (15556kb,D)

Link back to: arXiv, form interface, contact.