Parameswari_faith_nagaraju@Dravidian-CodeMixFIRE: A machine-learning approach using n-grams in sentiment analysis for code-mixed texts: A case study in Tamil and Malayalam

dc.contributor.author Krishnamurthy, Parameswari
dc.contributor.author Varghese, Faith
dc.contributor.author Vuppala, Nagaraju
dc.date.accessioned 2022-03-26T13:38:06Z
dc.date.available 2022-03-26T13:38:06Z
dc.date.issued 2020-01-01
dc.description.abstract Sentiment analysis is a fast growing research positioned to uncover the underlying meaning of a text by categorizing it into different levels. This paper is an attempt to decode the deeply entangled code-mixed Malayalam and Tamil datasets and classify its interlined meaning at five various levels. Along with the corpus creation, [1] propose a five-level classification for Malayalam and Tamil code-mixed datasets. In this paper, we follow the five-level annotated datasets and aim to solve the classification problem by implementing unigram and bigram knowledge with a Multinomial Naive Bayes model. Our model scores an F1-score of 0.55 for Tamil and 0.48 for Malayalam.
dc.identifier.citation CEUR Workshop Proceedings. v.2826
dc.identifier.issn 16130073
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/2042
dc.subject A Multinomial Naive Bayes model
dc.subject Code-mixed texts
dc.subject Malayalam
dc.subject N-gram
dc.subject Sentiment Analysis
dc.subject Tamil
dc.title Parameswari_faith_nagaraju@Dravidian-CodeMixFIRE: A machine-learning approach using n-grams in sentiment analysis for code-mixed texts: A case study in Tamil and Malayalam
dc.type Conference Proceeding. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: