
Paper Information

Journal: JOURNAL OF ADVANCES IN COMPUTER ENGINEERING AND TECHNOLOGY, Summer 2018, Volume 4, Number 3 (Serial 15), Pages 167 to 184.

A New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier

* Department of Computer Engineering, Urmia Azad University, Urmia, Iran
With the rapid growth in the number of documents, Text Document Classification (TDC) methods have become crucial. This paper presents a hybrid model of Invasive Weed Optimization (IWO) and the Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS), in order to reduce the large feature space in TDC. TDC comprises several steps: text processing, feature extraction, forming feature vectors, and final classification. In the presented model, the authors form a feature vector for each document by weighting features using IWO. The NB classifier is then trained on the documents, and in the test phase similar documents are classified together. FS increases accuracy and decreases computation time. IWO-NB was evaluated on the Reuters-21578, WebKb, and Cade 12 datasets. To demonstrate the superiority of the proposed model in FS, Genetic Algorithm (GA) and Particle Swarm Optimization (PSO) were used as comparison models. The results show that, with FS, the proposed model achieves higher accuracy than NB and the other models. In addition, comparing the proposed model with and without FS suggests that FS decreases the error rate.
Keyword(s): Text Document Classification, Invasive Weed Optimization, Naive Bayes, Feature Selection
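
The abstract describes a wrapper-style setup: a search method proposes a subset of features, and an NB classifier trained on that subset is scored on test documents. The following is a minimal sketch of that scoring step only, not the authors' implementation. The function names (`train_nb`, `predict_nb`, `fitness`), the boolean `mask` representation, the use of multinomial NB with Laplace smoothing, and the toy data are all assumptions made for illustration; the IWO search itself (seed production, spatial dispersal, competitive exclusion) is omitted.

```python
import math
from collections import Counter, defaultdict


def train_nb(docs, labels, mask):
    """Fit multinomial Naive Bayes using only the features kept by `mask`.

    `mask` maps each vocabulary word to True (keep) or False (drop).
    This representation is an assumption, not taken from the paper.
    """
    vocab = {word for word, keep in mask.items() if keep}
    class_docs = Counter(labels)          # documents per class
    word_counts = defaultdict(Counter)    # per-class counts of kept words
    for tokens, label in zip(docs, labels):
        word_counts[label].update(t for t in tokens if t in vocab)
    return vocab, class_docs, word_counts, len(docs)


def predict_nb(model, tokens):
    """Return the class with the highest log-posterior (Laplace smoothing)."""
    vocab, class_docs, word_counts, n_docs = model
    best_label, best_score = None, -math.inf
    for label in sorted(class_docs):      # sorted order makes ties deterministic
        total = sum(word_counts[label].values())
        score = math.log(class_docs[label] / n_docs)
        for t in tokens:
            if t in vocab:
                score += math.log((word_counts[label][t] + 1) / (total + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def fitness(mask, train, test):
    """Test accuracy of NB restricted to the selected features.

    A search method such as IWO, GA, or PSO would call this for each
    candidate mask and prefer masks with higher accuracy.
    """
    model = train_nb([d for d, _ in train], [y for _, y in train], mask)
    hits = sum(predict_nb(model, d) == y for d, y in test)
    return hits / len(test)


# Toy data (hypothetical): each item is (tokens, label).
train = [
    (["oil", "price", "the"], "energy"),
    (["crude", "oil", "the"], "energy"),
    (["wheat", "crop", "the"], "grain"),
    (["corn", "crop", "the"], "grain"),
]
test = [(["oil", "the"], "energy"), (["crop", "the"], "grain")]

vocabulary = {t for tokens, _ in train for t in tokens}
mask_all = {w: True for w in vocabulary}                 # keep every feature
mask_stopword = {w: (w == "the") for w in vocabulary}    # keep only an uninformative word

acc_all = fitness(mask_all, train, test)
acc_stopword = fitness(mask_stopword, train, test)
```

Comparing `acc_all` with `acc_stopword` shows why the choice of features matters: a mask that keeps only a word shared by every class gives the classifier nothing to discriminate on, whereas a mask that keeps class-specific words does.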