Normal view MARC view ISBD view

Croatian Error-Annotated Corpus of Non- Professional Written Language / Vanja Štefanec ; Nikola Ljubešić ; JelenaKuvač Kraljević.

By: Štefanec, Vanja.
Contributor(s): Ljubešić, Nikola, informatičar [aut] | Kuvač Kraljević, Jelena [aut].
Material type: ArticleArticlePublisher: 2016Description: 3220-3226 str.Other title: Croatian Error-Annotated Corpus of Non- Professional Written Language [Naslov na engleskom:].Subject(s): 5.07 | error corpus; language disorders; Croatian | error corpus; language disorders; CroatianOnline resources: Elektronička verzija In: Proceedings of the Tenth International Conference on Language Resources and Evaluation ( International Conference on Language Resources and Evaluation, LREC (10 ; 2016 ; Portorož) str. 3220-3226Summary: In the paper authors will present the Croatian corpus of non-professional written language. Consisting of two subcorpora, i.e. the clinical subcorpus, consisting of written texts produced by speakers with various types of language disorders, and the healthy speakers subcorpus, as well as by the levels of its annotation, it offers an opportunity for different lines of research. Authors will present the corpus structure, describe the sampling methodology, explain the levels of annotation, and give some very basic statistic. On the basis of data from the corpus, existing language technologies for Croatian will be adapted in order to be implemented in a platform facilitating text production to speakers with language disorders. In this respect, several analyses of the corpus data will be presented.
Tags from this library: No tags from this library for this title. Log in to add tags.
No physical items for this record

In the paper authors will present the Croatian corpus of non-professional written language. Consisting of two subcorpora, i.e. the clinical subcorpus, consisting of written texts produced by speakers with various types of language disorders, and the healthy speakers subcorpus, as well as by the levels of its annotation, it offers an opportunity for different lines of research. Authors will present the corpus structure, describe the sampling methodology, explain the levels of annotation, and give some very basic statistic. On the basis of data from the corpus, existing language technologies for Croatian will be adapted in order to be implemented in a platform facilitating text production to speakers with language disorders. In this respect, several analyses of the corpus data will be presented.

Projekt MZOS projekt

ENG

There are no comments for this item.

Log in to your account to post a comment.

Powered by Koha

//