
Babel Treebank of Public Messages in Croatian / Merkler, Danijela ; Agić, Željko ; Agić, Ana.

By: Merkler, Danijela.
Contributor(s): Agić, Ana [aut] | Agić, Željko [aut].
Material type: Article
Description: 490-497.
ISSN: 1877-0428.
Other title: Babel Treebank of Public Messages in Croatian [Title in English].
Subject(s): 5.04 | 6.03 | dependency treebank, dependency parsing ; public messages, non-standard text, Croatian language hrv | dependency treebank, dependency parsing ; public messages, non-standard text, Croatian language eng
In: Procedia -- Social and Behavioral Sciences 95C (2013), pp. 490-497
Summary: The paper presents the process of constructing a publicly available treebank of public messages written in Croatian. The messages were collected from various electronic sources – e-mail, blog, Facebook and SMS – and published on the Zagreb Museum of Contemporary Art LED facade within the Babel art project. The project aimed to use the facade as an open-space blog or social interface enabling citizens to publicly express their views. The construction and current state of the treebank are presented along with future work plans. A comparison of the Babel Treebank with the Croatian Dependency Treebank and the SETimes.HR treebank regarding differing domains and annotation schemes is briefly sketched. The treebank is used as a test platform for introducing a new standard for syntactic annotation of Croatian texts. An experiment with morphosyntactic tagging and dependency parsing of the treebank is conducted, providing a first insight into computational processing of non-standard text in Croatian.


