Multiword corpus of the Holy Quran

Mohammed Zeki Khedher

Abstract


The text of the Quran has been made available in many forms, and recent efforts have presented it as a corpus. However, no previous work has addressed multiword expressions in the Quran in corpus form. This paper presents a multiword corpus of the Quran built on the roots of words rather than on the words themselves. The corpus therefore captures relations between occurrences of the same word as well as relations between different words that share a root. This enriches the corpus, because Arabic word structure is based on the root. The corpus should enable future research to go deeper into Quran ontology, Arabic language studies, and the interrelations between the Quran and other Islamic sources such as Hadith, Fiqh, and Usool.
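To make the idea of a root-based multiword index concrete, the following is a minimal sketch and not the procedure used in the paper. It assumes a hand-written toy lexicon (`TOY_ROOTS`) that maps transliterated surface words to their roots, and a few invented phrases (`TOY_PHRASES`) that are not Quranic text. The names `root_ngrams` and `build_index` are hypothetical helpers introduced only for illustration.

```python
from collections import defaultdict

# Assumption: a toy, hand-written lexicon (transliterated surface word -> root).
# A real corpus would take roots from a morphological annotation layer.
TOY_ROOTS = {
    "kitab": "ktb",   # book
    "katib": "ktb",   # writer
    "maktub": "ktb",  # written
    "ilm": "3lm",     # knowledge
    "alim": "3lm",    # scholar
    "ma3lum": "3lm",  # known
}

def root_ngrams(tokens, n=2):
    """Map each token to its root, then return consecutive root n-grams."""
    roots = [TOY_ROOTS.get(tok, tok) for tok in tokens]  # unknown words stay as-is
    return [tuple(roots[i:i + n]) for i in range(len(roots) - n + 1)]

def build_index(phrases, n=2):
    """Index every root n-gram by (phrase id, position) of its occurrences."""
    index = defaultdict(list)
    for phrase_id, tokens in phrases.items():
        for pos, gram in enumerate(root_ngrams(tokens, n)):
            index[gram].append((phrase_id, pos))
    return dict(index)

# Assumption: invented toy phrases, used only to exercise the index.
TOY_PHRASES = {
    "t1": ["kitab", "alim"],
    "t2": ["katib", "ilm"],
    "t3": ["maktub", "ma3lum", "kitab"],
}

index = build_index(TOY_PHRASES)
# Phrases with different surface words share one entry when their roots match.
print(index[("ktb", "3lm")])
```

In this sketch the three toy phrases contain no word pair in common, yet all three are grouped under the root pair `("ktb", "3lm")`, which is the kind of relation a word-based index would miss.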


