Abstract
This paper compares three techniques for Arabic text classification; these techniques are Support Vector Machine (SVM) with Sequential Minimal Optimization (SMO), Naïve Bayesian (NB), and J48. The main objective of this paper is to measure the accuracy for each classifier and to determine which classifier is more accurate for Arabic text classification based on stop words elimination. The accuracy for classifier is measured by Percentage split method (holdout), and K-fold cross validation methods,. The results show that the SMO classifier achieves the highest accuracy and the lowest error rate, and shows that the time needed to build the SMO model is the smallest time.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2nd International Conference on Intelligent Semantic Web-Services and Applications, ISWSA'11 |
DOIs | |
Publication status | Published - 18 Apr 2011 |
Externally published | Yes |
Event | 2nd International Conference on Intelligent Semantic Web-Services and Applications, ISWSA 2011 - Amman, Jordan Duration: 18 Apr 2011 → 20 Apr 2011 |
Conference
Conference | 2nd International Conference on Intelligent Semantic Web-Services and Applications, ISWSA 2011 |
---|---|
Country/Territory | Jordan |
City | Amman |
Period | 18/04/11 → 20/04/11 |
Keywords
- Arabic text classification
- Naive Bayesian
- Stop word elimination
- Support vector machine