Optimised Preprocessing for Automatic Mouth Gesture Classification

Autor/a: BRUMM, Maren; GRIGAT, Rolf-Rainer
Año: 2020
Editorial: ELRA
Tipo de código: Copyright
Soporte: Digital


Lingüística » Sistemas de transcripción de las Lenguas de Signos, Medios de comunicación y acceso a la información » Nuevas Tecnologías


Mouth gestures are facial expressions in sign language, that do not refer to lip patterns of a spoken language. Research on this topic has been limited so far. The aim of this work is to automatically classify mouth gestures from video material by training a neural network. This could render time-consuming manual annotation unnecessary and help advance the field of automatic sign language translation. However, it is a challenging task due to the little data available as training material and the similarity of different mouth gesture classes. In this paper we focus on the preprocessing of the data, such as finding the area of the face important for mouth gesture recognition. Furthermore we analyse the duration of mouth gestures and determine the optimal length of video clips for classification. Our experiments show, that this can improve the classification results significantly and helps to reach a near human accuracy.

En Proceedings of the 9th Workshop on the Representation and Processing of Sign Languages, pp. 13-20.