Intelligent Information Management

Volume 9, Issue 2 (March 2017)

ISSN Print: 2160-5912   ISSN Online: 2160-5920

Google-based Impact Factor: 1.6

A Comparative Survey on Arabic Stemming: Approaches and Challenges

PP. 39-67
DOI: 10.4236/iim.2017.92003

ABSTRACT

Arabic, as one of the Semitic languages, has a very rich and complex morphology that is radically different from that of the European and East Asian languages. The derivational system of Arabic is based on roots, which are inflected to compose words using a relatively large set of Arabic affixes, e.g., antefixes, prefixes, and suffixes. Stemming is the process of reducing all the inflected forms of a word to a common canonical form. It is one of the early and major phases in natural language processing, machine translation, and information retrieval tasks. A number of Arabic stemming approaches have been proposed, including light stemming, morphological analysis, statistical stemming, N-grams, and parallel corpora (collections). Motivated by the results reported in the literature, this paper attempts to exhaustively review current achievements in stemming Arabic texts. A variety of algorithms are discussed. The main contribution of the paper is to provide a better understanding of existing approaches, with the hope of building an error-free and effective Arabic stemmer in the near future.
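
To make the idea of reducing inflected forms to a common form concrete, the following is a minimal Python sketch of affix-stripping light stemming. It is not the algorithm of any stemmer surveyed in the paper: the function name `light_stem`, the short prefix and suffix lists, and the minimum stem length are all illustrative assumptions.

```python
# Illustrative affix lists (assumptions for this sketch, not taken from the paper).
# Longer prefixes come first so that "وال" is tried before "ال".
PREFIXES = ("وال", "بال", "كال", "فال", "ال", "لل")
SUFFIXES = ("ها", "ان", "ات", "ون", "ين", "ية", "ه", "ة", "ي")
MIN_STEM_LEN = 2  # assumed lower bound on what must remain after stripping


def light_stem(word: str) -> str:
    """Strip at most one listed prefix and one listed suffix from a word."""
    for prefix in PREFIXES:
        if word.startswith(prefix) and len(word) - len(prefix) >= MIN_STEM_LEN:
            word = word[len(prefix):]
            break
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= MIN_STEM_LEN:
            word = word[: -len(suffix)]
            break
    return word


if __name__ == "__main__":
    # "the book", "and the book", "her book" all reduce to the same form.
    for form in ("الكتاب", "والكتاب", "كتابها"):
        print(form, "->", light_stem(form))
```

A sketch like this only removes surface affixes; it does not recover the root, which is why root-based morphological analysis is treated as a separate family of approaches.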

Share and Cite:

Mustafa, M. , Eldeen, A. , Bani-Ahmad, S. and Elfaki, A. (2017) A Comparative Survey on Arabic Stemming: Approaches and Challenges. Intelligent Information Management, 9, 39-67. doi: 10.4236/iim.2017.92003.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.