BeCoS Corpus: Belgian Covid-19 Sign Language Corpus: A Corpus for Training Sign Language Recognition and Translation

Autor/a: VANDEGHINSTE, V.; VAN DYCK, B.; DE COSTER, Mathieu; GODDEFROY, M.; DAMBRE, J.
Año: 2022
Editorial: Computational Linguistics in the Netherlands Journal, 12
Tipo de código: Copyright
Soporte: Digital

Temas

Lingüística » Corpus signados, Medios de comunicación y acceso a la información » Tecnologías

Detalles

We are presenting the Belgian Federal COVID-19 corpus, nicknamed the BeCoS (Belgian CovidSign language) corpus. It consists of the entire archive of official press conferences from the BelgianFederal Government concerning the COVID-19 pandemic. The speakers speak mostly in Dutchor French and occasionally in German, and nearly all speech is accompanied by a deaf signer whoperforms live interpreting from what is being said. We have preprocessed the corpus with speaker diarisation, applied Belgian Dutch ASR, and post-ASR language identification and punctuation prediction as well as signer diarisation, sign language identification and sign language key pointr ecognition. The corpus is made publicly available.

Ubicación