Developer:
CLLE-ERSS, University of Toulouse Jean Jaurès
Authors:
Stergos D. Afantenos, Nicholas Asher, Farah Benamara, Myriam Bras, Cécile Fabre, Lydia-Mai Ho-Dac, Anne Le Draoulec, Philippe Muller, Marie-Paule Péry-Woodley, Laurent Prévot, Josette Rebeyrolle, Ludovic Tanguy, Marianne Vergez-Couret, Laure Vieu
Availability:
Creative Commons BY-NC-SA 3.0
Genre:
journalistic
science
reports and encyclopaedia articles
Genre (detailed):
news in brief (from the daily newspaper Est Républicain), encyclopaedia articles (from Wikipedia), research papers in linguistics, reports and articles from the French think tank IFRI
Years of the data origin:
Document structure:
documents, paragraph boundaries, headings and subheadings, bulleted and numbered lists, examples, citations
Types of DSDs annotated:
two types of DSDs are marked up:
- rhetorical relations annotation, including Elementary Discourse Units (EDU) and Complex Discourse Units (CDU) linked by rhetorical relations (e.g. contrast, elaboration, result, attribution)
- multi-level structures annotation, including Enumerative Structures (ES) and Topical Chains (TC) with their cues
Number of DSD instances:
3,188 EDU, 1,395 CDU, 3,355 rhetorical relations, 991 ES and 4,649 ES cues, 588 TC and 3,456 TC cues
Method of annotation:
manual for discourse relations and assisted for multi-level structures
Style/theory of annotation:
SDRT and Systemic Functional Linguistics
Version number, release date:
Further info about the discourse relations:
senses/semantic labels are annotated for the relations