Toward the Elaboration of a Spanish-Chinese Parallel Annotated Corpus

Title: Toward the Elaboration of a Spanish-Chinese Parallel Annotated Corpus
Publication Type: Conference Paper
Year of Publication: 2016
Authors: Cao S, da Cunha I, Iruskieta M
Conference Name: 34th International Conference of the AESLA. In Professional and Academic Discourse: an Interdisciplinary Perspective. The EPiC Series in Language and Linguistics Vol. 2. April 14-16. Alicante
ISBN Number: 2398-5283

Spanish and Chinese differ greatly at all linguistic levels. Therefore, translating between them (whether by human or machine translation) and learning either one as a foreign language are challenging tasks. Some automatic translation systems exist for this language pair, but there is still considerable room to improve translation quality between Spanish and Chinese. In addition, accessible resources for studying and understanding this language pair, such as parallel corpora, are still scarce. In this paper, we present how we created a Spanish-Chinese parallel corpus designed for language learning and translation tasks at the discourse level. The corpus has been automatically enriched with part-of-speech (POS) tags, so that several queries based on morpho-syntactic information can be run. We have made the parallel corpus available to the academic community.