Quran Question Answering System Using Arabic Number Patterns (Singular, Dual, Plural)

Mohamed Ayed, Eric Atwell

Abstract


In the field of Information Retrieval (IR), it may be difficult to answer a question posed by the user, because the search engine retrieves a ranked list of documents that may contain the answer inside the documents, but this needs extra effort from the user to search for the answer inside the documents, and there may be no answer. The alternative to  IR search engine is a question answering system, which retrieves the answer to the question in the natural language text if found. A question answering system accepts the question in natural language, then applies a series of processes to extract the  answer. In general a question answering system is composed of three main components: question classification module, information retrieval module and answer extraction module. We developed a question answering system applied to the Holy Quran written in Classical Arabic. Some characteristics of the Arabic language were used to enhance the answer extraction: one of these important characteristics is number in nouns: singular, dual and plural. A version of the question-answering system was built which uses noun number patterns to process the number in Arabic questions and candidate answers, which enhances the result set of answers by adding more words and meaning. A corpus of questions and its answers about the Holy Quran was used to test and compare baseline and enhanced versions of our Quran Qurstion Answering system.


Full Text:

PDF

References


Abdelnasser, H. (2014). Al-Bayan : An Arabic Question Answering System for the Holy Quran. Proceedings of the 9th International Workshop on Semantic Evaluation, 57–64.

Abouenour, L., Bouzoubaa, K., & Rosso, P. (2008). Improving QA Using Arabic WordNet. In the 2008 International Arab Conference on Information Technology.

Adany, M. A. H., & Atwell, E. (2015). Islamic Applications of Automatic Question-Answering. SUST Journal of Engineering and Computer Science (JECS), 1(2), 51–57.

Allam, A., & Haggag, M. (2012). The Question Answering Systems: A Survey. International Journal of Research and Reviews in Information Sciences, IJRRIS, 2(3). Retrieved from http://aliallam.com/QA Survey Paper (IJRRIS).pdf

Al-Omari, A., & Abuata, B. (2014). Arabic light stemmer (ARS). Journal of Engineering Science and Technology, 9(6), 702–717.

Alotaibi, Y. A., & Selouani, S.-A. (2009). Evaluating the MSA West Point Speech Corpus. International Journal of Computer Processing of Languages, 22(4), 285–304. http://doi.org/10.1142/S1793840609002111

Attia, M. (2008). Handling Arabic Morphological and Syntactic Ambiguity within the LFG Framework with a View to Machine Translation. PhD Thesis, University of Manchester. Retrieved from http://attiaspace.com/Publications/Attia-PhD-Thesis.pdf

Cavalli-sforza, V., Soudi, A., & Mitamura, T. (2000). Arabic Morphology Generation Using a Concatenative Strategy. In Proceedings of NAACL, 86–93.

Gravano, A. (2009). Turn-taking and affirmative cue words in task-oriented dialogue. PhD Thesis, Columbia University. http://www.cs.columbia.edu/speech/ThesisFiles/agustin_gravano.pdf

Hammo, B., Sleit, A., El-Haj, M. (2007). Effectiveness of Query Expansion in Searching the Holy Quran. Colloque Internationale Traitement Automatique de La Langue Arabe:, CITALA, 7, 18–19. Retrieved from http://scholar.google.com/scholar?hl=en&btnG=Search&q=intitle:Effectiveness+of+Query+Expansion+in+Searching+the+Holy+Quran#0

Hammo, B., Abuleil, S., Lytinen, S., & Evens, M. (2004). Experimenting with a question answering system for the Arabic language. Computers and the Humanities, 38(4), 397–415. http://doi.org/10.1007/s10579-004-1917-3

Hammo, B., & El-haj, M. (2008). Enhancing Retrieval Effectiveness of Diacritisized Arabic Passages Using Stemmer and Thesaurus. 19th Midwest Artificial Intelligence And Cognitive Science Conference.

Heba Kurdi, Sara Alkhaider, N. A., & Department. (2014). Development And Evaluation Of A Web Based Question Answering System For Arabic Language. In Logic-based approach for improving Arabic question answering. Computational Intelligence and Computing Research (ICCIC), 2014 IEEE International Conference on. IEEE, 2014.‏ (pp. 187–202).

Ishkewy, H., Harb, H., & Farahat, H. (2014). Azhary: An Arabic Lexical Ontology. International Journal of Web & Semantic Technology, 5(4), 71–82. http://doi.org/10.5121/ijwest.2014.5405

Kanaan, G., Hammouri, A., Al-Shalabi, R., & Swalha, M. (2009). A New Question Answering System for the Arabic Language. American Journal of Applied Sciences, 6(4), 797–805. http://doi.org/10.3844/ajas.2009.797.805

Loni, B. (2011). A survey of state-of-the-art methods on question classification. Literature Survey, Published on TU Delft Repository, 1999(October). Retrieved from http://repository.tudelft.nl/assets/uuid:8e57caa8-04fc-4fe2-b668-20767ab3db92/A_Survey_of_State-of-the-Art_Methods_on_Question_Classification.pdf

Lopez, V. et al. (2011). Is Question Answering fit for the Semantic Web ?: a Survey. Semantic Web Journal, 2:2 pp.125-155.

Mohamed, A., Allam, N., & Haggag, M. H. (2012). The Question Answering Systems : A Survey. International Journal of Research and Reviews in Information Sciences (IJRRIS) Vol. 2, No. 3,

Shaalan, K. (2007). Person Name Entity Recognition for Arabic, Proceedings of 2007 Workshop On Computational Approaches To Semitic Languages pp. 17–24.

Shaalan, K. (2010). Rule-based Approach in Arabic Natural Language Processing. International Journal on Information and Communication Technologies, 3(3), 11–19.

Yauri, A. R., Kadir, R. A., Azman, A., & Murad, M. A. A. (2013). Ontology semantic approach to extraction of knowledge from holy quran. 2013 5th International Conference on Computer Science and Information Technology, 1–5. http://doi.org/10.1109/CSIT.2013.6588804

Zeroual, I., & Lakhouaja, A. (2016). A new Quranic Corpus rich in morphosyntactical information. International Journal of Speech Technology, pp. 1–8. http://doi.org/10.1007/s10772-016-9335-7


Refbacks

  • There are currently no refbacks.