CSTNews Corpus

Corpus acronym: 
CSTNews
Developer: 
NILC – Núcleo Interinstitucional de Linguística Computacional
Authors: 
Thiago Pardo, Lucia Castro, Erick Maziero, Vinicius Uzêda, Pedro Balage
Contact person(s): 
Amália Mendes
Availability: 
By request.
Languages covered: 
Brazilian Portuguese
Corpus size (documents): 
140
Corpus size (sentences): 
2,088
Corpus size (tokens): 
47,240
Mode: 
written
Genre: 
journalistic
Years of the data origin: 
2007
Types of DSDs annotated: 
words, intra-sentential, inter-sentential, explicit relations
Method of annotation: 
manual
Style/theory of annotation: 
RST, CST
Version number, release date: 
2008, finished

Citation (text format): 

Cardoso, P. et al. (2011). CSTNews - A Discourse-Annotated Corpus for Single and Multi-Document Summarization of News Texts in Brazilian Portuguese. Proceedings of the III Workshop "A RST e os Estudos do Texto", pp. 88-105, Cuiabá, MT, Brasil.

Notes: