Normal view MARC view ISBD view

Croatian Dependency Treebank: Recent Development and Initial Experiments / Berović, Daša ; Agić, Željko ; Tadić, Marko.

By: Berović, Daša.
Contributor(s): Tadić, Marko [aut] | Agić, Željko [aut].
Material type: ArticleArticleDescription: 1902-1906 str.Other title: Croatian Dependency Treebank: Recent Development and Initial Experiments [Naslov na engleskom:].Subject(s): 5.04 | 6.03 | dependency treebank, dependency parsing, Croatian language hrv | dependency treebank, dependency parsing, Croatian language engOnline resources: Click here to access online | Click here to access online In: Eigth International Conference on Language Resources and Evaluation (LREC'12) (23-25.05.2012. ; Istanbul, Turska) Proceedings of the Eigth International Conference on Language Resources and Evaluation (LREC'12) str. 1902-1906Calzolari, Nicoletta ; Choukri, Khalid ; Declerck, Thierry ; Ugur Dogan, Mehmet ; Maegaard, Bente ; Mariani, Joseph ; Odijk, Jan ; Piperidis, SteliosSummary: We present the current state of development of the Croatian Dependency Treebank – with special empahsis on adapting the Prague Dependency Treebank formalism to Croatian language specifics – and illustrate its possible applications in an experiment with dependency parsing using MaltParser. The treebank currently contains approximately 2870 sentences, out of which the 2699 sentences and 66930 tokens were used in this experiment. Three linear-time projective algorithms implemented by the MaltParser system – Nivre eager, Nivre standard and stack projective – running on default settings were used in the experiment. The highest performing system, implementing the Nivre eager algorithm, scored (LAS 71.31 UAS 80.93 LA 83.87) within our experiment setup. The results obtained serve as an illustration of treebank’s usefulness in natural language processing research and as a baseline for further research in dependency parsing of Croatian.
Tags from this library: No tags from this library for this title. Log in to add tags.
No physical items for this record

We present the current state of development of the Croatian Dependency Treebank – with special empahsis on adapting the Prague Dependency Treebank formalism to Croatian language specifics – and illustrate its possible applications in an experiment with dependency parsing using MaltParser. The treebank currently contains approximately 2870 sentences, out of which the 2699 sentences and 66930 tokens were used in this experiment. Three linear-time projective algorithms implemented by the MaltParser system – Nivre eager, Nivre standard and stack projective – running on default settings were used in the experiment. The highest performing system, implementing the Nivre eager algorithm, scored (LAS 71.31 UAS 80.93 LA 83.87) within our experiment setup. The results obtained serve as an illustration of treebank’s usefulness in natural language processing research and as a baseline for further research in dependency parsing of Croatian.

Projekt MZOS 130-1300646-0645

Projekt MZOS 130-1300646-1776

ENG

There are no comments for this item.

Log in to your account to post a comment.

Powered by Koha

//