Turkish Discourse Bank

Corpus acronym: 
TDB
Developer: 
METU, Institute of Informatics
Authors: 
Deniz Zeyrek, Ruket Çakıcı, Ümit Deniz Turan, Işın Demirşahin, Ayışığı Sevdik Çallı, Hale Ögel Balaban
Contact person(s): 
Ruken Çakıcı
Availability: 
corpora@metu.edu.tr
Languages covered: 
Turkish
Available translations: 
none
Corpus size (hours): 
-
Corpus size (documents): 
197
Corpus size (sentences): 
-
Corpus size (tokens): 
400,000
Mode: 
written
Genre: 
journalistic
fiction
science
Genre (detailed): 
Novel, Story, Research Survey, Article, Travel, Interview, Memoir, News
Register: 
casual
semi-formal
formal
Register (2): 
non-spontaneous
Text type: 
narrative
descriptive
argumentative
Years of the data origin: 
1990-2000
Document structure: 
headings, paragraphs.
Tools for annotation: 
DATT: http://medid.ii.metu.edu.tr/annotationTool.html
Tools for browsing: 
Browser: http://medid.ii.metu.edu.tr/browser.html
Tools for querying: 
Browser: http://medid.ii.metu.edu.tr/browser.html
Types of DSDs annotated: 
Explicit inter- and intra-sentential.
Implicit inter-sentential
Altlex inter- and intra-sentential.
EntRel
Number of DSD instances: 
TDB 1.0: 8483 (argument annotation of Explicit DRs)
TDB 1.1: 1923 (includes argument/sense annotation of Explicit, Implicit, Altlex and Entrel DRs)
Method of annotation: 
manual with a semi-automatic annotation tool
Style/theory of annotation: 
PDTB style
Format: 
XML
Version number, release date: 

1.1 April 2017

Previous versions and their release dates: 

1.0 February 2011

Citation (text format): 

TDB1.0: Zeyrek, D., Demirşahin, I., Sevdik-Çallı, Ayışığı, B., Çakıcı, R. (2013). Turkish Discourse Bank: Porting a discourse annotation style to a morphologically rich language. Dialogue & Discourse. Vol. 4, No. 2: 174-184.

TDB 1.1: Zeyrek, D., & Kurfalı, M. (2017). TDB 1.1: Extensions on Turkish discourse bank. In Proceedings of the 11th Linguistic Annotation Workshop (pp. 76-81).

Citation (bibTeX format): 

TDB1.0:

@article{zeyrek2013turkish,   title={Turkish Discourse Bank: Porting a discourse annotation style to a morphologically rich language.},   author={Zeyrek, Deniz and Demir{\c{s}}ahin, I{\c{s}}{\i}n and Sevdik-{\c{C}}all{\i}, AB and {\c{C}}ak{\i}c{\i}, Ruket},   journal={D\&D},   volume={4},   number={2},   pages={174--184},   year={2013} }

TDB 1.1: 

@inproceedings{zeyrek2017tdb,   title={TDB 1.1: Extensions on Turkish discourse bank},   author={Zeyrek, Deniz and Kurfal{\i}, Murathan},   booktitle={Proceedings of the 11th Linguistic Annotation Workshop},   pages={76--81},   year={2017} }
Notes: 

-

Further info about the discourse relations: 
information about arguments of each relation is available