Learning to Rank for Arabic Transcriptions Retrieval

Farida Sabry, Mayada Hadhoud, Nevin Darwish

Abstract


The amount of spoken documents being shared on the web per minute is increasing dramatically posing a true challenge for any search engine in order to satisfy its customers’ queries. With the ingoing improvement in the speech recognizers accuracy, this research addresses the problem of ranking transcriptions that can be obtained by speech recognizers to enhance search engine ranking results. Depending on the title of the video and some of its meta-data only is not sufficient for some queries that have the information need to get relevant spoken segments within audio files. Feature extraction based on both the meta-data of the spoken documents and the timed spoken content transcription for an Arabic audio dataset for Quran is proposed. The results revealed that applying learning to rank techniques are superior to the baseline unsupervised BM25 scoring. In addition, using transcription-based features proved its effectiveness in terms of both the Normalized Discounted Cumulative Gain (NDCG@10) and Expected Reciprocal Rank (ERR@10).


Full Text:

PDF

References


• Breiman, L. 2001. Random Forests. Machine Learning 45: 5-32.

• Burges, C., T. Shaked, E. Renshaw, A. Lazier, M. Deeds, N. Hamilton, and G. Hullender. 2005. Learning to rank using gradient descent. In Proceedings of the 22nd International Conference on Machine Learning, 89-96. Bonn, Germany.

• Cao, Z., T. Qin, T. Y. Liu, M. F. Tsai, and H. Li. 2007. Learning to Rank: From Pairwise Approach to Listwise Approach. In Proceedings of the 24th international conference on Machine learning, 129-136. Oregon, USA.

• Chapelle, O. and D. Metzler, Y. Zhang, and P. Grinspan. 2009. Expected Reciprocal Rank for Graded Relevance. In Proceedings of the 18th ACM conference on Information and knowledge management, 621-630. Hong Kong.

• Chen, J., C. Xiong, and J. Callan. 2016. An Empirical Study of Learning to Rank for Entity Search. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 737-740. Pisa, Italy.

• Dang, V. The Lemur Project - RankLib. https://sourceforge.net/p/lemur/wiki/RankLib/ (last accessed November 2018).

• Donald, M, and W. B. Croft. Linear feature-based models for information retrieval. Information Retrieval, 10(3):257–274, June 2007.

• Freund, Y. and R. Iyer, R. Schapire, and Y. Singer. 2003. An efficient boosting algorithm for combining preferences. The Journal of Machine Learning Research 4: 933-969.

• Friedman, J. H. 2001. Greedy function approximation: A gradient boosting machine. The Annals of Statistics 29: 1189-1232.

• Kalervo, J., and J. Kekalainen. 2002. Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems 20: 422-446.

• Larkey, L. S., L. Ballesteros, and M.E.Connell. 2007. Light Stemming for Arabic Information Retrieval. In Arabic Computational Morphology: Knowledge-based and Empirical Methods, ed. by Abdelhadi Soudi, Antal van den Bosch and Günter Neumann, 221-243. Springer Netherlands.

• Larson, M., and G. J. F. Jones. 2011. Spoken Content Retrieval: A Survey of Techniques and Technologies. Foundations and Trends in Information Retrieval 5: 235-422.

• Liu, T. Y. 2011. Learning to Rank for Information Retrieval. Springer-Verlag Berlin Heidelberg.

• Ma, M. 2015. Spoken Keyword Rescoring and Document Retrieval for Low-resource Languages. In Proceedings of the 16th Annual Conference of the International Speech Communication Association. Dresden, Germany.

• Rybach, D. and C. Gollan, R. Schluter, and H. Ney. 2009. Audio segmentation for speech recognition using segment features. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan.

• Verse by verse Quran dataset. http://www.everyayah.com (last accessed November 2018)

• Weenink, D., P. Boersma. Praat: doing phonetics by computer.

• Wu, Q., C.J.C. Burges, K. Svore, and J. Gao. 2007. Adapting Boosting for Information Retrieval Measures. Journal of Information Retrieval 13: 254 - 270.

• Xu, J., and H. Li. 2007. AdaRank: a boosting algorithm for information retrieval. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval: 391-398. Amsterdam, The Netherlands.

• Zhai, C., and J. Lafferty. 2001. A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval. In Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 334-342. New York, USA: ACM Press.

• Zhao, X., X. Li, and Z. Zhang. 2015. Multimedia Retrieval via Deep Learning to Rank. IEEE Signal Processing Letters 22: 1487 - 1491.


Refbacks

  • There are currently no refbacks.