AT-ODTSA: a Dataset of Arabic Tweets for Open Domain Targeted Sentiment Analysis


Creative Commons License

Sahmoud S., Abudalfa S., ELMASRY W.

International Journal of Computing and Digital Systems, cilt.11, sa.1, ss.1299-1307, 2022 (Scopus) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 11 Sayı: 1
  • Basım Tarihi: 2022
  • Doi Numarası: 10.12785/ijcds/1101105
  • Dergi Adı: International Journal of Computing and Digital Systems
  • Derginin Tarandığı İndeksler: Scopus, INSPEC, Directory of Open Access Journals
  • Sayfa Sayıları: ss.1299-1307
  • Anahtar Kelimeler: Arabic Tweets, Open-Domain Targeted Sentiment Analysis, Sentiment Analysis, Target Dependent
  • İstanbul Kültür Üniversitesi Adresli: Evet

Özet

In the field of sentiment analysis, most of research has conducted experiments on datasets collected from Twitter for manipulating a specific language. Little number of datasets has been collected for detecting sentiments expressed in Arabic tweets. Moreover, very limited number of such datasets is suitable for conducting recent research directions such as target dependent sentiment analysis and open-domain targeted sentiment analysis. Thereby, there is a dire need for reliable datasets that are specifically acquired for open-domain targeted sentiment analysis with Arabic language. Therefore, in this paper, we introduce AT-ODTSA, a dataset of Arabic Tweets for Open-Domain Targeted Sentiment Analysis, which includes Arabic tweets along with labels that specify targets (topics) and sentiments (opinions) expressed in the collected tweets. To the best of our knowledge, our work presents the first dataset that manually annotated for applying Arabic open-domain targeted sentiment analysis. We also present a detailed statistical analysis of the dataset. The AT-ODTSA dataset is suitable for train numerous machine learning models such as a deep learning-based model.