Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool

Authors: MUKUSHEV, Medet; KYDYRBEKOVA, Aigerim; KIMMELMAN, Vadim; SANDYGULOVA, Anara
Year: 2022
Publisher: European Language Resources Association
Code type: Copyright
Format: Digital

Topics

Media and access to information » New Technologies, Linguistics » Sign language transcription systems

Details

This paper presents a new dataset for Kazakh-Russian Sign Language (KRSL) created for Sign Language Processing. In 2020, Kazakhstan's schools switched quickly to online mode because of the COVID-19 pandemic. Every working day, the El-arna TV channel broadcast video lessons for grades 1 to 11 with sign language translation. This gave us the opportunity to record a corpus with a large vocabulary and spontaneous SL interpretation. The corpus contains video recordings of Kazakhstan's online school translated into Kazakh-Russian Sign Language by 7 interpreters. To date, we have collected and cleaned 890 hours of video material. A custom annotation tool was created to make data annotation simple and easy to use for the Deaf community. So far, around 325 hours of video have been annotated with glosses, and 4,009 of 4,547 lessons have been transcribed with automatic speech-to-text software.

In Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources.

Location