Use this url to cite publication: https://hdl.handle.net/20.500.12259/101578
Options
Document classification to functional styles (Domains of use): Lithuanian case
Type of publication
Straipsnis kitoje duomenų bazėje / Article in other database (S4)
Author(s)
Baltijos pažangiųjų technologijų institutas | LT | Vilniaus universitetas | LT | |
LT | Baltijos pažangių technologijų institutas, Vilnius | LT | ||
Man, Ka Lok | Xian Jiaotong-Liverpool University, China | CN | Swinburne University of Technology Sarawak, Malaysia | CN |
Title
Document classification to functional styles (Domains of use): Lithuanian case
Is part of
International journal of design, analysis and tools for integrated circuits and systems. Hong Kong: Solari (HK) Co., 2019, vol. 8, iss. 1
Date Issued
Date Issued |
---|
2019 |
Publisher
Hong Kong: Solari (HK) Co
Is Referenced by
Extent
p. 38-41
Field of Science
Abstract
We report an experiment on classification of Lithuanian texts according to their domain (area of use), i.e. functional style. Functional style is a variety of standard language that is defined by domain, contents, functions, stylistic devices and linguistic means. In this paper we discuss an experiment on document classification into 3 functional styles of Lithuanian language – administrative, publicist and scientific. We compare results of 5 algorithms: Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis (QDA), k-Nearest Neighbors (k-NN), Support Vector Machine (SVM) with kernel function and Naïve Bayes. We also used 8 quantitative linguistic indicators as discriminating features. For administrative style SVM was the most effective (96.5 % of texts classified correctly), for publicist style – LDA (98.9 % of texts classified correctly) and for scientific style – QDA (93.1 % of texts classified correctly). We achieved the best F-score with SVM (94.7 % – for administrative style, 98.9 – for publicist style and 85.9 – for scientific style).
Type of document
type::text::journal::journal article::research article
Language
Anglų / English (en)
Coverage Spatial
Taivanas / Taiwan Province of China (TW)
Description
This volume is comprised of research papers from the International Conference on Recent Advancements in Computing in AI, Internet of Things (IoT) and Computer Engineering Technology (CICET), October 21-23, 2019, Taipei, Taiwan. CICET 2019 is hosted by The Tamkang University amid pleasant surroundings in Taipei, which is a delightful city for the conference and traveling around; and co-hosted