Finnish PropBank

Developer: 
University of Turku
Authors: 
Haverinen, K.; Laippala, V.; Kohonen, S.; Missilä, A.; Nyblom, J.; Ojala, S.; Viljanen, T.; Salakoski, T. & Ginter, F.
Contact person(s): 
Veronika Laippala
Languages covered: 
Finnish
Corpus size (sentences): 
15,126 sentences)
Corpus size (tokens): 
204,399 tokens
Mode: 
written
Genre: 
journalistic
fiction
interactional (social networks, sms, everyday conversation, etc.)
Wikipedia, Jrc Acquis
Register: 
semi-formal
formal
Text type: 
instructive
narrative
expository
descriptive
argumentative
Years of the data origin: 
~2007-2012
Tools for browsing: 
our own
Tools for querying: 
our own
Types of DSDs annotated: 
Intrasentential (ArgMs), types location, extent, general, negation, modality, cause, time, purpose, manner, direction: 28001 relationsIntersententials (ArgMs Discourse): 2,254
Number of DSD instances: 
30,255
Method of annotation: 
double manual
Style/theory of annotation: 
PropBank
Format: 
will be xml
Citation (text format): 

Haverinen, K.; Laippala, V.; Kohonen, S.; Missilä, A.; Nyblom, J.; Ojala, S.; Viljanen, T.; Salakoski, T. & Ginter, F.: Towards a Dependency-based PropBank of General Finnish. 2013. Proceedings of the 19th Nordic Conference on Computational Linguistics (NoDaLiDa'13) , pp. 41-57.

Citation (bibTeX format): 

@inproceedings{haverinen13propbank, author = {Haverinen, Katri and Laippala, Veronika and Kohonen, Samuel and Missilä, Anna and Nyblom, Jenna and Ojala, Stina and Viljanen, Timo and Salakoski, Tapio and Ginter, Filip}, title = {Towards a Dependency-based PropBank of General Finnish}, booktitle = {Proceedings of the 19th Nordic Conference on Computational Linguistics (NoDaLiDa'13)}, year = {2013}, pages = {41--57}}

Notes: 
Further info about the discourse relations: 
information about arguments of each relation is available
senses/semantic labels are annotated for the relations
Other annotation layers: 
sentence morphosyntax, parse structure