Please use this identifier to cite or link to this item:https://hdl.handle.net/20.500.12259/40798
Type of publication: Straipsnis Clarivate Analytics Web of Science ar/ir Scopus / Article in Clarivate Analytics Web of Science or / and Scopus (S1)
Field of Science: Informatika / Informatics (N009)
Author(s): Kapočiūtė-Dzikienė, Jurgita;Krupavičius, Algis
Title: Predicting party group from the Lithuanian parliamentary speeches
Is part of: Informacinės technologijos ir valdymas = Information technology and control. Kaunas : Technologija, 2014, t. 43, Nr. 3
Extent: p. 321-332
Date: 2014
Keywords: Computational linguistics;Machine learning;Text classification
Abstract: A number of recent research works have used supervised machine learning approaches with a bag-of-words to classify political texts –in particular, speeches and debates– by their ideological position, expressed with a party membership. However, our classification task is more complex due to the several reasons. First, we deal with the Lithuanian language which is highly inflective, has rich morphology, vocabulary, word derivation system, and relatively free-word-order in a sentence. Besides, we have more classes, as the Lithuanian Parliament consists of more party groups if compared to e.g. the European Parliament or the US Senate. Moreover, classes are not stable, because a considerable number of the Lithuanian parliamentarians migrate from one party group to another even within the same parliamentary term. In this research we experimentally investigated the influence of different pre-processing techniques and feature types on two datasets composed of the texts taken from two parliamentary terms. A classifier based on the bag-of-words and token bigrams interpolation gave the best results: i.e. it outperformed random and majority baselines by more than 0.13 points and achieved 0.54 and 0.49 accuracy on the 1st and the 2nd dataset, respectively. The error analysis revealed that the same confusion patterns stand for both datasets, besides, majority of these confusions can be explained on the basis of the ideological or pragmatic similarities between those party groups
Internet: http://www.itc.ktu.lt/index.php/ITC/article/view/5871
Affiliation(s): Informatikos fakultetas
Kauno technologijos universitetas
Taikomosios informatikos katedra
Vytauto Didžiojo universitetas
Appears in Collections:Universiteto mokslo publikacijos / University Research Publications

Files in This Item:
marc.xml8.72 kBXMLView/Open

MARC21 XML metadata

Show full item record
Export via OAI-PMH Interface in XML Formats
Export to Other Non-XML Formats

WEB OF SCIENCETM
Citations 5

1
checked on Jun 2, 2020

Page view(s)

142
checked on Mar 5, 2020

Download(s)

12
checked on Mar 5, 2020

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.