Normal view MARC view ISBD view

Evaluating Sentence Alignment on Croatian-English Parallel Corpora / Seljan, Sanja ; Agić, Željko ; Tadić, Marko.

By: Seljan, Sanja.
Contributor(s): Tadić, Marko [aut] | Agić, Željko [aut].
Material type: ArticleArticleDescription: 101-108 str.Other title: Evaluating sentence alignment on Croatian-English parallel corpora [Naslov na engleskom:].Subject(s): 2.09 | 5.04 | 6.03 | sentence alignment, croatian-english parallel corpora hrv | sentence alignment, croatian-english parallel corpora engOnline resources: Click here to access online In: 6th International Conference on Formal Approaches to South Slavic and Balkan Languages (FASSBL 2008) (25-28.09.2008. ; Dubrovnik, Hrvatska) Proceedings of the 6th International Conference on Formal Approaches to South Slavic and Balkan Languages str. 101-108Tadić, Marko ; Dimitrova-Vulchanova, Mila ; Koeva, SvetlaSummary: This paper describes an experiment in applying sentence alignment methods to Croatian-English parallel corpora and systematically evaluate their performance within the recall, precision and F-measure framework. It is our primary goal to provide an insight and a reference point on sentence alignment accuracy for Croatian-English language pair and also to extend the scope of (Tadić, 2000) – to our knowledge, the first experiment dealing with sentence alignment of Croatian-English parallel corpora – by utilizing newly implemented tools, creating corpora subsets defined by genre and finally by expanding and formalizing its preliminary observations on alignment accuracy. Therefore, in this paper we start off by briefly describing and argumenting sentence alignment paradigms of choice and presenting available language resources, subset of Croatian-English parallel corpus described in (Tadić, 2000) being our primary asset. These descriptions are followed by a formal definition of our testing framework. Results are then discussed in detail and conclusions are stated along with a brief insight on possible future work.
Tags from this library: No tags from this library for this title. Log in to add tags.
No physical items for this record

This paper describes an experiment in applying sentence alignment methods to Croatian-English parallel corpora and systematically evaluate their performance within the recall, precision and F-measure framework. It is our primary goal to provide an insight and a reference point on sentence alignment accuracy for Croatian-English language pair and also to extend the scope of (Tadić, 2000) – to our knowledge, the first experiment dealing with sentence alignment of Croatian-English parallel corpora – by utilizing newly implemented tools, creating corpora subsets defined by genre and finally by expanding and formalizing its preliminary observations on alignment accuracy. Therefore, in this paper we start off by briefly describing and argumenting sentence alignment paradigms of choice and presenting available language resources, subset of Croatian-English parallel corpus described in (Tadić, 2000) being our primary asset. These descriptions are followed by a formal definition of our testing framework. Results are then discussed in detail and conclusions are stated along with a brief insight on possible future work.

Projekt MZOS 130-1300646-0645

Projekt MZOS 130-1300646-0909

Projekt MZOS 130-1300646-1776

ENG

There are no comments for this item.

Log in to your account to post a comment.

Powered by Koha

//