A Multi-Classifier Based Prediction Model for Phishing Emails Detection Using Topic Modelling, Named Entity Recognition and Image Processing - Circuits and Systems

CS > Vol.7 No.9, July 2016

A Multi-Classifier Based Prediction Model for Phishing Emails Detection Using Topic Modelling, Named Entity Recognition and Image Processing ()

HTML XML

Download as PDF (Size: 1674KB) PP. 2507-2520

DOI: 10.4236/cs.2016.79217 1,591 Downloads 3,182 Views Citations

Author(s)

C. Emilin Shyni¹, S. Sarju², S. Swamynathan³

Affiliation(s)

¹Department of Information Technology, KCG College of Technology, Chennai, India.
²Department of Computer Science, St. Joseph’s College of Engineering and Technology, Kerala, India.
³Department of Information Science and Technology, Anna University, Chennai, India.

ABSTRACT

Phishing is the act of attempting to steal a user’s financial and personal information, such as credit card numbers and passwords by pretending to be a trustworthy participant, during online communication. Attackers may direct the users to a fake website that could seem legitimate, and then gather useful and confidential information using that site. In order to protect users from Social Engineering techniques such as phishing, various measures have been developed, including improvement of Technical Security. In this paper, we propose a new technique, namely, “A Prediction Model for the Detection of Phishing e-mails using Topic Modelling, Named Entity Recognition and Image Processing”. The features extracted are Topic Modelling features, Named Entity features and Structural features. A multi-classifier prediction model is used to detect the phishing mails. Experimental results show that the multi-classification technique outperforms the single-classifier-based prediction techniques. The resultant accuracy of the detection of phishing e-mail is 99% with the highest False Positive Rate being 2.1%.

KEYWORDS

Phishing, Conditional Random Field Classifier, Latent Dirichlet Allocation, Natural Language Processing, Machine Learning, Image Segmentation, Image Processing

Share and Cite:

Shyni, C. , Sarju, S. and Swamynathan, S. (2016) A Multi-Classifier Based Prediction Model for Phishing Emails Detection Using Topic Modelling, Named Entity Recognition and Image Processing. Circuits and Systems, 7, 2507-2520. doi: 10.4236/cs.2016.79217.

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies