A Corpus-based Approach for Spanish-Chinese Language Learning

Title: A Corpus-based Approach for Spanish-Chinese Language Learning
Publication Type: Conference Paper
Year of Publication: 2016
Authors: Cao S, da Cunha I, Iruskieta M
Conference Name: Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications, 26th International Conference on Computational Linguistics (COLING 2016), pages 97-106, Osaka, Japan, December 12, 2016.
ISBN Number: 978-4-87974-717-4

Due to the huge population that speaks Spanish and Chinese, these languages occupy an important position in language learning studies. Although there are some automatic translation systems that benefit the learning of both languages, there is still room to create resources that help language learners. As a quick and effective resource that can provide a large amount of language information, corpus-based learning is becoming more and more popular. In this paper we enrich a Spanish-Chinese parallel corpus automatically with part-of-speech (POS) information and manually with discourse segmentation (following Rhetorical Structure Theory (RST) (Mann and Thompson, 1988)). Two search tools allow Spanish-Chinese language learners to carry out different queries based on tokens and lemmas. The parallel corpus and the research tools are available to the academic community. We propose some examples to illustrate how learners can use the corpus to learn Spanish and Chinese.