Hadoop-Based Similarity Computation System for Composed Documents

PP. 196-202
DOI: 10.4236/jcc.2015.35025

ABSTRACT

Universities produce a large number of composed documents during teaching, and most of them must be checked for similarity as part of validation. A similarity computation system is constructed for composed documents that contain both images and text. First, each document is split into two parts: its images and its text. The documents are then compared by computing the similarity of the images and the similarity of the text independently. With the Hadoop system, the text contents are separated easily and quickly. Experimental results show that the proposed system is efficient and practical.
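The split-then-compare flow described in the abstract can be illustrated with a minimal single-process sketch. The abstract does not state which similarity measures the system uses, so everything here is an assumption made for illustration: the `ComposedDocument` type, Jaccard similarity over word shingles for text, and exact-match hashing for images are all hypothetical choices, and the sketch does not model the Hadoop stage at all.

```python
from __future__ import annotations

import hashlib
from dataclasses import dataclass


@dataclass
class ComposedDocument:
    """Hypothetical container for a document already split into its two parts."""
    text: str
    images: list[bytes]  # raw bytes of each embedded image


def shingles(text: str, k: int = 3) -> set[tuple[str, ...]]:
    """Return the set of k-word shingles of the text (assumed text features)."""
    words = text.lower().split()
    if len(words) < k:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity; two empty sets are scored 0.0 by convention here."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def text_similarity(d1: ComposedDocument, d2: ComposedDocument, k: int = 3) -> float:
    return jaccard(shingles(d1.text, k), shingles(d2.text, k))


def image_similarity(d1: ComposedDocument, d2: ComposedDocument) -> float:
    """Assumed measure: overlap of exact image hashes (byte-identical images only)."""
    h1 = {hashlib.sha256(img).hexdigest() for img in d1.images}
    h2 = {hashlib.sha256(img).hexdigest() for img in d2.images}
    return jaccard(h1, h2)


def compare(d1: ComposedDocument, d2: ComposedDocument) -> dict[str, float]:
    """Compute the two similarities independently, as the abstract describes."""
    return {"text": text_similarity(d1, d2), "images": image_similarity(d1, d2)}


doc_a = ComposedDocument("the quick brown fox jumps", [b"img1", b"img2"])
doc_b = ComposedDocument("the quick brown fox sleeps", [b"img1"])
result = compare(doc_a, doc_b)
```

Keeping the two scores separate, rather than merging them into one number, mirrors the independent comparison stated in the abstract; how the system combines or thresholds them is not specified there.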

Share and Cite:

Zhang, X., Qin, Z., Liu, X., Hou, Q., Zhang, B. and Wu, J. (2015) Hadoop-Based Similarity Computation System for Composed Documents. Journal of Computer and Communications, 3, 196-202. doi: 10.4236/jcc.2015.35025.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.