Overview of datasets for the Sign languages of Europe

Autor/a: Project Easier: Intelligent Automatic Sign Language Translation
Año: 2021
Editorial: Project Easier
Tipo de código: Copyright
Soporte: Digital




This document identifies linguistic corpora that can be explored as high- quality training data for automatic translation within EASIER (as opposed to loosely aligned broadcast data). For each data set, the document lists what parts of the data are available under what access conditions. It also lists the elicitation formats used in several corpora in order to identify those parts of the available corpora that could be explored to build multilingual resources. In order to support the construction of an interlingual index across European sign languages, the document also lists lexical resources (lexical databases and dictionaries) available and their characteristics.