
Term extraction, tagging, and mapping tools for under-resourced languages / Pinnis, Mārcis ; Ljubešić, Nikola ; Ştefănescu, Dan ; Skadiņa, Inguna ; Tadić, Marko ; Gornostay, Tatiana.

By: Pinnis, Mārcis.
Contributor(s): Ştefănescu, Dan [aut] | Skadiņa, Inguna [aut] | Tadić, Marko [aut] | Ljubešić, Nikola, informatičar [aut].
Material type: Article
Description: pp. 193-208.
Other title: Term Extraction, Tagging, and Mapping Tools for Under-Resourced Languages [Title in English].
Subject(s): 5.04 | term extraction, term tagging, term mapping, under-resourced languages, comparable corpora (hrv) | term extraction, term tagging, term mapping, under-resourced languages, comparable corpora (eng)
In: Terminology and Knowledge Engineering Conference (20-21 June 2012; Madrid, Spain). Proceedings of the Terminology and Knowledge Engineering (TKE2012) Conference, pp. 193-208. Aguado de Cea.
Summary: Although term extraction has been researched for more than 20 years, only a few studies focus on under-resourced languages. Moreover, bilingual term mapping from comparable corpora for these languages has attracted researchers only recently. This paper presents methods for term extraction, term tagging in documents, and bilingual term mapping from comparable corpora for four under-resourced languages: Croatian, Latvian, Lithuanian, and Romanian. The methods described are language-independent, provided that the user supplies language-specific parameter data and has access to a part-of-speech or morpho-syntactic tagger.


MZOS project FP7-248347

ENG
