Abstract
Sport-fanaticism is one of the social problems. Studying this problem in social network sites such as Twitter becomes important where social sites provide a mean for people to communicate and share emotions. Hence, a huge amount of data is posted on social media every day where text mining and sentiment analysis are essential to automatically analyze such data to extract the desired information and knowledge. In this paper, two main contributions are introduced. The first contribution is that we generated twelve large-scale fanatic-lexicons that can help in building fanatic-classification to automatically classify Arabic social text (e.g., tweets) into fanatic-text or non-fanatic text. The generated fanatic-lexicons can help in building anti-fanatic tools and automatically detecting and measuring sport-fanaticism in Arabic social text. As far as we know, the generated fanatic-lexicons are the first large-scale fanatic-lexicons. The generated resources are publicly available for research purpose. The second contribution is that we proposed a new method to automatically generate sentiment lexicons which is called Term Frequency-Inverse Context Frequency (TFICF). The performance of the proposed-TFICF method is analyzed and compared with one of the common methods in this path which is called Pointwise Mutual Information (PMI). Our proposed-TFICF method showed better performance where the highest accuracy of TFICF is 89%, and the highest accuracy of PMI is 82%.



Similar content being viewed by others
References
Aldayel HK, Azmi AM (2016) Arabic tweets sentiment analysis–a hybrid scheme. J Inf Sci 42(6):782–797
Al-Moslmi T, Albared M, Al-Shabi A, Omar N, Abdullah S (2017) Arabic senti-lexicon: constructing publicly available language resources for arabic sentiment analysis. J Inf Sci. https://doi.org/10.1177/0165551516683908
Alqmase MM (2019) Building Sentiment Resources, In: Sentiment analysis for sports fanaticism in Arabic social media text (https://eprints.kfupm.edu.sa/id/eprint/141005), King Fahd University of Petroleum & Minerals
Alqmase MM (2021) “Anti-fanatic Repository,” Anti-Fanatic, [Online]. https://github.com/qumasi/Anti-fanatic-resources. Accessed 22 May 2022
Alqmase M, Al-Muhtasab H, Rabaan H (2021) Sports-fanaticism formalism for sentiment analysis in Arabic Text. Soc Netw Anal Min (SNAM). https://doi.org/10.1007/s13278-021-00757-9
Alshahrani HA, Fong AC (2018) Arabic domain-oriented sentiment lexicon construction using latent Dirichlet Allocation, In: 2018 IEEE International Conference on Electro/Information Technology (EIT)
Al-Twairesh N, Al-Khalifa H, Al-Salman A (2014) Subjectivity and sentiment analysis of Arabic: trends and challenges, In: IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA), Doha, Qatar
Al-Twairesh N, Al-Khalifa H, AlSalman A (2016) Arasenti: large-scale twitter-specific arabic sentiment lexicons, In: Proceedings of the 54th annual meeting of the association for computational linguistics
Church KW, Hanks P (1990) Word association norms, mutual information, and lexicography. Comput Linguist 16(1):22–29
Darwish K, Mubarak H (2016) "Farasa: A new fast and accurate Arabic word segmenter," In: Proceedings of the Tenth International Conference on Language Resources and Evaluation
E--Beltagy SR (2017) WeightedNileULex: a scored Arabic sentiment lexicon for improved sentiment analysis. In: Gayar NE, Suen CY (eds) Language processing, pattern recognition and intelligent systems. Special issue on computational linguistics, speech and image processing for Arabic language. World Scientific Publishing Co
El-Beltagy SR (2016) NileULex: a phrase and word level sentiment lexicon for Egyptian and Modern Standard Arabic, In: LREC
El-Beltagy SR, Ali A (2013a) "Open issues in the sentiment analysis of Arabic social media: A case study," In: 2013a 9th International Conference on Innovations in Information Technology (IIT)
El-Beltagy S, Ali A (2013b) “unWeighted opinion mining Lexicon (Egyptian Arabic)
El-Beltagy S, Ali A (2013c) “unWeighted Opinion Mining Lexicon (Egyptian Arabic),” [Online]. http://bit.ly/MGtMqU. Accessed 4 Apr 2019
El-Masri M, Altrabsheh N, Mansour H (2017) Successes and challenges of Arabic sentiment analysis research: a literature review. Soc Netw Anal Min Springer 1(7):54
ElSahar H, El-Beltagy SR (2014) A fully automated approach for arabic slang lexicon extraction from microblogs," In: International conference on intelligent text processing and computational linguistics
ElSahar H, El-Beltagy SR (2015) Building large arabic multi-domain resources for sentiment analysis, In: International Conference on Intelligent Text Processing and Computational Linguistics
Esuli A, Sebastiani F (2006) Sentiwordnet: a publicly available lexical resource for opinion mining, In: LREC, Citeseer, pp 417–422
Hu M, Liu B (2014) Mining and summarizing customer reviews, In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Kiritchenko S, Zhu X, Mohammad SM (2014) Sentiment analysis of short informal texts. J Artif Intell Res 50:723–762
Liu B (2012) Sentiment analysis and opinion mining, Morgan and Claypool Publishers
Mahyoub FH, Siddiqui MA, Dahab MY (2014) Building an Arabic sentiment lexicon using semi-supervised learning. J King Saud Univ Comput Inf Sci 26(4):417–424
Mataoui M, Zelmati O, Boumechache M (2016) A proposed lexicon-based sentiment analysis approach for the vernacular Algerian Arabic. Res Comput Sci 110:55–70
Mohammad S, Turney P (2010) Emotions evoked by common words and phrases: Using mechanical turk to create an emotion lexicon, In: Proceedings of the NAACL HLT 2010 workshop on computational approaches to analysis and generation of emotion in text
Mohammad SM, Kiritchenko S, Zhu X (2013) Nrc-canada: Building the state-of-the-art in sentiment analysis of tweets, arXiv preprint arXiv:1308.6242
Mohammad S, Salameh M, Kiritchenko S (2016) Sentiment Lexicons for Arabic Social Media, In: LREC
Mohammad H, Sulaiman MN (2015) A review on evaluation metrics for data classification evaluations. Int J Data Min Knowl Manag Process 5(2):1
Mohammad SM, Turney PD (2013) Crowdsourcing a word–emotion association lexicon. Comput Intell 29(3):436–465
Nielsen F (2011) A new ANEW: Evaluation of a word list for sentiment analysis in microblogs, arXiv preprint arXiv:1103.2903
Niwa Y, Nitta Y (1994) Co-occurrence vectors from corpora vs. distance vectors from dictionaries, In: Proceedings of the 15th conference on Computational linguistics
Pasha A, Al-Badrashiny M, Mohamed MT, El Kholy A, Eskander R, Habash N, Pooleery M, Rambow O, Roth R (2014) Madamira: a fast, comprehensive tool for morphological analysis and disambiguation of arabic," In: LREC
QCRI Arabic Language Technologies Tools & Demos (2016) “FARASA” developed by Arabic Language Technologies Group at Qatar Computing Research Institute (QCRI), website accessed (2022) [Online]. https://farasa.qcri.org/. Accessed 13 May 2022
Refaee E, Rieser V (2014) An arabic twitter corpus for subjectivity and sentiment analysis, In: LREC
Wilson T, Wiebe J, Hoffmann P (2005) Recognizing contextual polarity in phrase-level sentiment analysis, In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing
Youssef M, El-Beltagy SR (2018) MoArLex: an Arabic sentiment lexicon built through automatic lexicon expansion. Procedia Comput Sci 142:94–103
Acknowledgements
The authors would like to acknowledge the support provided by King Fahd University of Petroleum & Minerals.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Alqmase, M., Al-Muhtaseb, H. Sport-fanaticism lexicons for sentiment analysis in Arabic social text. Soc. Netw. Anal. Min. 12, 56 (2022). https://doi.org/10.1007/s13278-022-00871-2
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-022-00871-2