TY - GEN
T1 - An experimental study for Arabic text classification techniques
AU - Al-Shargabi, Bassam
AU - Olayah, Fekry
PY - 2012/6/2
Y1 - 2012/6/2
N2 - Several algorithms have been implemented to resolve the problem of text categorization. Most of the work in this area geared for English text, whereas few researches have been conducted on Arabic text. However, the nature of Arabic text is different than English text; pre-processing of Arabic text are more challenging. In this paper an experimental study was conducted on three techniques for Arabic text classification; these techniques, Discriminative Multinominal Naive Bayes (DMNB), Naïve Bayesian (NB) and IBK Algorithms, The paper aimed to assess the accuracy for each classifier and to determine which classifier is more accurate for Arabic text classification based on stop words elimination. The accuracy for each classifier is measured by Percentage split method (holdout), and K-fold cross validation methods, along with the time needed to classify Arabic text.
AB - Several algorithms have been implemented to resolve the problem of text categorization. Most of the work in this area geared for English text, whereas few researches have been conducted on Arabic text. However, the nature of Arabic text is different than English text; pre-processing of Arabic text are more challenging. In this paper an experimental study was conducted on three techniques for Arabic text classification; these techniques, Discriminative Multinominal Naive Bayes (DMNB), Naïve Bayesian (NB) and IBK Algorithms, The paper aimed to assess the accuracy for each classifier and to determine which classifier is more accurate for Arabic text classification based on stop words elimination. The accuracy for each classifier is measured by Percentage split method (holdout), and K-fold cross validation methods, along with the time needed to classify Arabic text.
KW - Accuracy
KW - Arabic text classification
KW - categorizations algorithms
KW - error rate
UR - http://www.scopus.com/inward/record.url?scp=84874699276&partnerID=8YFLogxK
U2 - 10.1117/12.946039
DO - 10.1117/12.946039
M3 - Conference contribution
AN - SCOPUS:84874699276
SN - 9780819489913
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - Fourth International Conference on Digital Image Processing, ICDIP 2012
T2 - 4th International Conference on Digital Image Processing, ICDIP 2012
Y2 - 7 April 2012 through 8 April 2012
ER -