Application of Text Mining Methods for Sentiment Analysis on Twitter Social Media

Authors

  • Nizam Sanjay Computer and Information Sciences Department, Faculty of Science and Information Technology, Universiti Teknologi Petronas (UTP), Malaysia

Keywords:

Sentiment Analysis, Twitter, Text Mining, Machine Learning, Deep Learning

Abstract

Social media platforms, particularly Twitter, serve as significant channels for public expression, making sentiment analysis a crucial tool for understanding opinions, trends, and emotions in various domains. However, analyzing sentiment on Twitter presents challenges due to the short text length, informal language, slang, emojis, sarcasm, and multilingual content. This research applies text mining methods to enhance sentiment analysis on Twitter by developing improved text preprocessing techniques, hybrid machine learning models, and contextual sentiment classification approaches. The study explores various machine learning and deep learning models, including Naïve Bayes, Support Vector Machines (SVM), Long Short-Term Memory (LSTM), and Bidirectional Encoder Representations from Transformers (BERT), to evaluate their effectiveness in sentiment classification. A hybrid approach integrating machine learning and deep learning is proposed to enhance accuracy while maintaining computational efficiency. The research also introduces an adaptive text preprocessing method to normalize informal language, improve handling of code-mixed tweets, and enhance sarcasm detection using context-aware models with attention mechanisms. The results demonstrate that the hybrid classification model outperforms traditional machine learning and standalone deep learning models, achieving higher accuracy and better contextual understanding of sentiments. Furthermore, multilingual sentiment analysis and sarcasm detection improvements contribute to more reliable classification outcomes. The findings have practical implications for various sectors, including brand reputation analysis, political sentiment tracking, financial market predictions, and public health monitoring. This study contributes to the advancement of natural language processing (NLP), sentiment analysis, and social media analytics by proposing novel methodologies for handling Twitter-specific linguistic challenges. Future research directions include real-time sentiment analysis integration and expanding multimodal sentiment detection using text, images, and videos.

Downloads

Download data is not yet available.

References

Atefeh, F., & Khreich, W. (2015). A survey of techniques for event detection in twitter. Computational Intelligence, 31(1), 132–164.

Blevins, J. L., Lee, J. J., McCabe, E. E., & Edgerton, E. (2019). Tweeting for social justice in# Ferguson: Affective discourse in Twitter hashtags. New Media & Society, 21(7), 1636–1653.

de Las Heras-Pedrosa, C., Sánchez-Núñez, P., & Peláez, J. I. (2020). Sentiment analysis and emotion understanding during the COVID-19 pandemic in Spain and its impact on digital ecosystems. International Journal of Environmental Research and Public Health, 17(15), 5542.

Ebrahimi, M., Yazdavar, A. H., & Sheth, A. (2017). Challenges of sentiment analysis for dynamic events. IEEE Intelligent Systems, 32(5), 70–75.

Giachanou, A., & Crestani, F. (2016). Like it or not: A survey of twitter sentiment analysis methods. ACM Computing Surveys (CSUR), 49(2), 1–41.

Griffin, A. (2014). Crisis, issues and reputation management. Kogan Page Publishers.

Halim, Z., Waqar, M., & Tahir, M. (2020). A machine learning-based investigation utilizing the in-text features for the identification of dominant emotion in an email. Knowledge-Based Systems, 208, 106443.

Joshi, A. (2019). Sentiment Analysis and Opinion Mining from Noisy Social Media Content. Master’s Thesis in IIIT, Hyderabad.

Kharde, V., & Sonawane, P. (2016). Sentiment analysis of twitter data: a survey of techniques. ArXiv Preprint ArXiv:1601.06971.

Lo, S. L., Cambria, E., Chiong, R., & Cornforth, D. (2017). Multilingual sentiment analysis: from formal to informal and scarce resource languages. Artificial Intelligence Review, 48, 499–527.

Nakov, P., Ritter, A., Rosenthal, S., Sebastiani, F., & Stoyanov, V. (2019). SemEval-2016 task 4: Sentiment analysis in Twitter. ArXiv Preprint ArXiv:1912.01973.

Novianti, W. (2013). The use of code switching in Twitter (A case study in English education department). Passage, 1(2), 1–10.

Oza, N. C., & Tumer, K. (2008). Classifier ensembles: Select real-world applications. Information Fusion, 9(1), 4–20.

Pathak, X., & Pathak-Shelat, M. (2017). Sentiment analysis of virtual brand communities for effective tribal marketing. Journal of Research in Interactive Marketing, 11(1), 16–38.

Patwa, P., Aguilar, G., Kar, S., Pandey, S., Pykl, S., Gambäck, B., Chakraborty, T., Solorio, T., & Das, A. (2020). Semeval-2020 task 9: Overview of sentiment analysis of code-mixed tweets. ArXiv Preprint ArXiv:2008.04277.

Salge, C. A. D. L., & Karahanna, E. (2018). Protesting corruption on Twitter: Is it a bot or is it a person? Academy of Management Discoveries, 4(1), 32–49.

Salur, M. U., & Aydin, I. (2020). A novel hybrid deep learning model for sentiment classification. IEEE Access, 8, 58080–58093.

Samad, M. D., Khounviengxay, N. D., & Witherow, M. A. (2020). Effect of text processing steps on twitter sentiment classification using word embedding. ArXiv Preprint ArXiv:2007.13027.

Stieglitz, S., & Dang-Xuan, L. (2013). Emotions and information diffusion in social media—sentiment of microblogs and sharing behavior. Journal of Management Information Systems, 29(4), 217–248.

Sykora, M., Elayan, S., & Jackson, T. W. (2020). A qualitative analysis of sarcasm, irony and related# hashtags on Twitter. Big Data & Society, 7(2), 2053951720972735.

Vijayarani, S., Ilamathi, M. J., & Nithya, M. (2015). Preprocessing techniques for text mining-an overview. International Journal of Computer Science & Communication Networks, 5(1), 7–16.

Xue, J., Chen, J., Hu, R., Chen, C., Zheng, C., Su, Y., & Zhu, T. (2020). Twitter discussions and emotions about the COVID-19 pandemic: Machine learning approach. Journal of Medical Internet Research, 22(11), e20550.

Yang, S. Y., Mo, S. Y. K., & Liu, A. (2015). Twitter financial community sentiment and its predictive relationship to stock market movement. Quantitative Finance, 15(10), 1637–1656.

Younis, E. M. G. (2015). Sentiment analysis and text mining for social media microblogs using open source tools: an empirical study. International Journal of Computer Applications, 112(5).

Downloads

Published

2025-03-30

How to Cite

Sanjay, N. (2025). Application of Text Mining Methods for Sentiment Analysis on Twitter Social Media. Idea: Future Research, 3(1), 9–17. Retrieved from https://idea.ristek.or.id/index.php/idea/article/view/35