Sentiment Analysis Dataset and Web Application for Turkish Tweets


ELMASRY V.

9th International Symposium on Innovative Approaches in Smart Technologies, ISAS 2025, Gaziantep, Türkiye, 27-28 June 2025 (Full-Text Paper)

  • Publication Type: Conference Paper / Full-Text Paper
  • DOI Number: 10.1109/isas66241.2025.11101972
  • City of Publication: Gaziantep
  • Country of Publication: Türkiye
  • Keywords: Data Preprocessing, Natural Language Processing (NLP), Sentiment Analysis, Turkish Tweets Dataset, Twitter (X)
  • Affiliated with İstanbul Kültür Üniversitesi: Yes

Abstract

Today, Twitter (X) is one of the most essential and popular social networking sites. Analyzing the sentiment of tweets posted on this platform is important for understanding people's opinions on any topic. Such analysis reveals what people think and say about a chosen topic, such as a brand, business, the economy, or education. In this study, a dataset is created from Turkish tweets collected using the Twitter API. Then, techniques such as Word2Vec and Bag of Words (BoW) are used to clean this dataset and make it easier to work with. Afterward, the cleaned dataset is classified as Positive, Negative, or Neutral using classification methods such as Decision Tree, Logistic Regression, Support Vector Machine (SVM), Random Forest, and XGBClassifier. Finally, a simple website is created using JavaScript so that users can use this application efficiently.
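
The abstract does not describe the preprocessing steps in detail. As a rough illustration only, the following Python sketch shows one way Turkish tweets might be cleaned and turned into Bag of Words count vectors before classification. The function names (`turkish_lower`, `clean_tweet`, `bag_of_words`), the cleaning rules, and the sample tweets are all assumptions made for this example and are not taken from the paper.

```python
import re
from collections import Counter

# Hypothetical cleaning rules; the paper's actual rules are not stated.
URL_RE = re.compile(r"https?://\S+")
MENTION_RE = re.compile(r"@\w+")
NON_LETTER_RE = re.compile(r"[^\w\s]|\d|_")


def turkish_lower(text):
    """Lowercase with Turkish casing: I -> ı and İ -> i."""
    # Python's default lower() maps "I" to "i", which is wrong for Turkish.
    return text.replace("I", "ı").replace("İ", "i").lower()


def clean_tweet(text):
    """Strip URLs, mentions, punctuation and digits; return a token list."""
    text = URL_RE.sub(" ", text)
    text = MENTION_RE.sub(" ", text)
    text = turkish_lower(text)
    text = NON_LETTER_RE.sub(" ", text)
    return text.split()


def bag_of_words(token_lists):
    """Build a sorted vocabulary and one count vector per document."""
    vocabulary = sorted({tok for tokens in token_lists for tok in tokens})
    index = {tok: i for i, tok in enumerate(vocabulary)}
    vectors = []
    for tokens in token_lists:
        row = [0] * len(vocabulary)
        for tok, count in Counter(tokens).items():
            row[index[tok]] = count
        vectors.append(row)
    return vocabulary, vectors


# Made-up sample tweets, for illustration only.
tweets = [
    "Bu ürün harika! https://example.com",
    "@marka Kargo çok geç geldi, hiç memnun değilim :(",
    "IŞIK bugün çok güzel #İstanbul",
]
tokens = [clean_tweet(t) for t in tweets]
vocabulary, vectors = bag_of_words(tokens)
```

Each row of `vectors` can then be passed, together with a Positive, Negative, or Neutral label, to any of the classifiers named in the abstract. A library vectorizer would normally replace the hand-written `bag_of_words` in practice; it is written out here only to make the representation explicit.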