Use this url to cite publication: https://hdl.handle.net/20.500.12259/45587
Automatic inference of base forms for multiword terms in Lithuanian
Type of publication
Straipsnis konferencijos medžiagoje Web of Science ir Scopus duomenų bazėje / Article in conference proceedings in Web of Science and Scopus database (P1a)
Author(s)
Author | Affiliation | |
---|---|---|
LT | ||
LT | ||
LT | ||
LT |
Title [en]
Automatic inference of base forms for multiword terms in Lithuanian
Part Of
Human language technologies - the Baltic perspective : 5th international conference Baltic HLT, Tartu, Estonia, October 4–5, 2012 : proceedings
Date Issued
Date |
---|
2012 |
Publisher
Amsterdam : IOS press
Publisher (trusted)
Extent
p. 27-35
Abstract (en)
This paper reports on a specific problem of automatic terminology extraction in Lithuanian – base form inference. While the process of lemmatisation is properly carried out by existing tools, problems arise with normalizing multiword terms. It can be described as the discrepancy between the base form (i. e. lemma) of a term and the sequence of the base forms of constituent lexical items within a term. Lithuanian is a strongly inflected language and the lemmatisation of each word separately within a multiword term breaks the syntactic relations expressed by inflection (case, gender, number) which need to be kept in order to ensure the cohesion of the term.
Series/Report no.
(Frontiers in artificial inteligence and applications, v. 247)
Type of document
type::text::journal::journal article::research article
Language
Anglų / English (en)
Coverage Spatial
Nyderlandai / Netherlands (NL)
ISBN (of the container)
9781614991328
WOS
WOS:000349002400005
Other Identifier(s)
VDU02-000012490