
Slovene-Croatian Treebank Transfer Using Bilingual Lexicon Improves Croatian Dependency Parsing / Agić, Željko ; Merkler, Danijela ; Berović, Daša.

By: Agić, Željko.
Contributor(s): Berović, Daša [aut] | Merkler, Danijela [aut].
Material type: Article
Description: pp. 5-9.
Other title: Slovene-Croatian Treebank Transfer Using Bilingual Lexicon Improves Croatian Dependency Parsing [title in English].
Subject(s): 5.04 | 6.03 | treebank transfer, bilingual lexicon, dependency parsing (hrv) | treebank transfer, bilingual lexicon, dependency parsing (eng)
In: Eighth Language Technologies Conference (8-9 October 2012; Ljubljana, Slovenia). Proceedings of the 15th International Multiconference Information Society (IS 2012), Volume C, Proceedings of the 8th Language Technologies Conference, pp. 5-9 / Erjavec, Tomaž ; Žganec Gros, Jerneja
Summary: A method is presented for transferring dependency treebanks between similar languages by using a bilingual lexicon, with the aim of improving dependency parsing accuracy on the target language. It is illustrated by transferring the Slovene Dependency Treebank to Croatian using a GIZA++ bilingual lexicon constructed from the Croatian-Slovene 1984 parallel corpus from the Multext East project. The transferred treebank is merged with the Croatian Dependency Treebank, and the merged treebank is used to train and test two graph-based dependency parsers. MSTParser and CroDep accuracy shows a statistically significant increase on parsing the 1984 fictional text and a similar decrease on parsing the Croatian Dependency Treebank newspaper text.


MZOS project 130-1300646-0645

MZOS project 130-1300646-1776

Language: English
