Journal of Intelligent Learning Systems and Applications

Volume 8, Issue 4 (November 2016)

ISSN Print: 2150-8402   ISSN Online: 2150-8410

Google-based Impact Factor: 1.5  Citations  

Improved Term Weighting Technique for Automatic Web Page Classification

HTML  XML Download Download as PDF (Size: 1467KB)  PP. 63-76  
DOI: 10.4236/jilsa.2016.84006    1,737 Downloads   3,104 Views  Citations

ABSTRACT

Automatic web page classification has become inevitable for web directories due to the multitude of web pages in the World Wide Web. In this paper an improved Term Weighting technique is proposed for automatic and effective classification of web pages. The web documents are represented as set of features. The proposed method selects and extracts the most prominent features reducing the high dimensionality problem of classifier. The proper selection of features among the large set improves the performance of the classifier. The proposed algorithm is implemented and tested on a benchmarked dataset. The results show the better performance than most of the existing term weighting techniques.

Share and Cite:

Thangairulappan, K. and Kanagavel, A. (2016) Improved Term Weighting Technique for Automatic Web Page Classification. Journal of Intelligent Learning Systems and Applications, 8, 63-76. doi: 10.4236/jilsa.2016.84006.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.