An Approach for Content Retrieval from Web Pages Using Clustering Techniques - Circuits and Systems

CS > Vol.7 No.9, July 2016

Circuits and Systems

Volume 7, Issue 9 (July 2016)

ISSN Print: 2153-1285 ISSN Online: 2153-1293

Google-based Impact Factor: 0.48 Citations

An Approach for Content Retrieval from Web Pages Using Clustering Techniques ()

HTML XML

Download as PDF (Size: 794KB) PP. 2663-2675

DOI: 10.4236/cs.2016.79230 1,691 Downloads 2,850 Views Citations

Author(s)

R. Manjula, A. Chilambuchelvan

Affiliation(s)

Department of CSE, R.M.K Engineering College, Chennai, India.

ABSTRACT

Mining the content from an information database provides challenging solutions to the industry experts and researchers, due to the overcrowded information in huge data. In web searching, the information retrieved is not an appropriate, because it gives ambiguous information for the user query, and the user cannot get relevant information within the stipulated time. To overcome these issues, we propose a new methodology for information retrieval EPCRR by providing the top most exact information to the user, by using the collaborative clustered automated filter which makes use of the collaborative data set and filter works on the prediction by providing the highest ranking for the exact data retrieved. The retrieval works on the basis of recommendation of data which consists of relevant data set with highest priority from the cluster of data which is on high usage. In this work, we make use of the automated wrapper which works similar to the meta crawler functionality and it obtains the content in the semantic usage data format. Obtained information from the user to the agent will be ranked based on the Enabled Pile clustered data with respect to the metadata information from the agent and end-user. The information is given to the end-user with the top most ranking data within the stipulated time and the remaining top information will be moved to the data repository for future use. The data collected will remain stable based on the user preference and works on the intelligence system approach in which the user can choose any information under any instances and can be provided with suitable high range of exact content. In this approach, we find that the proposed algorithm has produced better results than existing work and it costs less online computation time.

KEYWORDS

Collaborative Filter, Automated Wrapper, Clustering, Information Retrieval, Data Repository

Share and Cite:

Manjula, R. and Chilambuchelvan, A. (2016) An Approach for Content Retrieval from Web Pages Using Clustering Techniques. Circuits and Systems, 7, 2663-2675. doi: 10.4236/cs.2016.79230.

Cited by

[1]	An efficient approach for indexing web pages using various similarity features
	Advances in Natural and Applied Sciences, 2017

[2]	An novel approach to extract the content retrieval with the image perception using collaborative community oriented sifting (CCOS)
	Cluster Computing, 2017

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies