Classification of Holy Quran Verses based on Imbalanced Learning
Abstract
Imbalanced Learning (IL) is considered as a special case of text classification. It is applied in order to classify Imbalanced classes that are not equal in the number of samples. There are many researches on classified Quranic text which depends on different methods of classification. However, there is no study that classifies the Quranic topics based on Imbalanced Leaning. So, this paper aims to apply the concept of IL to assign corresponding topics for the Quranic verses according to their contents. In this paper, two Quranic datasets have been classified by using Imbalanced Learning consecutively; the first dataset is Unification of God “Tawheed” and Polytheism of God “Shirk” verses, the second dataset is Meccan, and Medinan chapters. Imbalanced Classification is applied here since these topics have imbalanced classes which cannot be classified correctly by traditional methods. The results showed that applying Imbalanced Classification produces better outcomes than the results that are executed without using Imbalanced Classification techniques.
Full Text:
PDFReferences
• Adeleke, A. O., et al. (2017). "Comparative analysis of text classification algorithms for automated labelling of Quranic verses." Int. J. Advanc. Sci. Eng. Info. Tech 7: 1419-1427.
• Adeleke, A. O., et al. (2018). A Group-Based Feature Selection Approach to Improve Classification of Holy Quran Verses. International Conference on Soft Computing and Data Mining, Springer.
• Akour, M., et al. (2014). "MQVC: measuring Quranic verses similarity and sura classification using N-gram." WSEAS Transactions on Computers.
• Al-Azani, S. and E.-S. M. El-Alfy (2017). "Using word embedding and ensemble learning for highly imbalanced data sentiment analysis in short arabic text." Procedia Computer Science 109: 359-366.
• Al-Kabi, M. N., et al. (2013). A topical classification of Quranic Arabic text, NOORIC 2013: Taibah University International Conference on Advances in Information Technology for the Holy Quran and Its Sciences
• Al-Kabi, M. N., et al. (2005). "Statistical classifier of the holy Quran verses (Fatiha and Yaseen chapters)." Journal of Applied Sciences 5(3): 580-583.
• Blagus, R. and L. Lusa (2013). "SMOTE for high-dimensional class-imbalanced data." BMC Bioinformatics 14(1): 106.
• Chawla, N. V., et al. (2002). "SMOTE: synthetic minority over-sampling technique." Journal of artificial intelligence research 16: 321-357.
• Haixiang, G., et al. (2017). "Learning from class-imbalanced data: Review of methods and applications." Expert Systems with Applications 73: 220-239.
• Hamed, S. K. and M. J. Ab Aziz (2016). "A question answering system on Holy Quran translation based on question expansion technique and Neural Network classification." J. Comput. Sci 12(3): 169-177.
• Hassan, G. S., et al. (2015). "Categorization of ‘Holy Quran-Tafseer’using K-Nearest Neighbor Algorithm." International Journal of Computer Applications11/2015 129(12): 1-6.
• Jamil, N. S., et al. (2017). "Topic Identification Method For Textual Document." Journal of Multidisciplinary Engineering Science and Technology .
• Jamil, N. S., et al. (2017). "A subject identification method based on term frequency technique." International Journal of Advanced Computer Research 7(30): 103.
• Khalilia, M., et al. (2011). "Predicting disease risks from highly imbalanced data using random forest." BMC medical informatics and decision making 11(1): 51.
• Khedher, M. Z. (2017). "Multiword corpus of the Holy Quran." International Journal on Islamic Applications in Computer Science And Technology. 2017 5(1).
• Nassourou, M. (2012). Using Machine Learning Algorithms for Categorizing Quranic Chaptersby Major Phases of Prophet Mohammad's Messengership, Citeseer.
• Nayal, A., et al. (2017). "KerMinSVM for imbalanced datasets with a case study on arabic comics classification." Engineering Applications of Artificial Intelligence 59: 159-169.
• Patel, H. and G. S. Thakur (2017). "Classification of imbalanced data using a modified fuzzy-neighbor weighted approach." International Journal of Intelligent Engineering and Systems 10(1): 56-64.
• Skryjomski, P. and B. Krawczyk (2017). Influence of minority class instance types on SMOTE imbalanced data oversampling. First International Workshop on Learning with Imbalanced Domains: Theory and Applications.
• Tang, Y., et al. (2009). "SVMs modeling for highly imbalanced classification." IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 39(1): 281-288.
Refbacks
- There are currently no refbacks.