AlbMoRe: A Corpus of Movie Reviews for Sentiment Analysis in Albanian
Abstract
The lack of available resources, such as text corpora, for low-resource languages seriously hinders research on natural language processing and computational linguistics. This paper presents AlbMoRe, a corpus of 800 sentiment-annotated movie reviews in Albanian. Each text is labeled as positive or negative and can be used for sentiment analysis research. Preliminary results from traditional machine learning classifiers trained on the AlbMoRe samples are also reported. They can serve as comparison baselines for future experiments.
Çano, Erion
Shortfacts
Category | Technical Report (Discussion Paper)
Divisions | Data Mining and Machine Learning
Subjects | Artificial Intelligence, Language Processing
Date | 14 June 2023