TITLE:
Hadoop-Based Similarity Computation System for Composed Documents
AUTHORS:
Xiaoming Zhang, Zhipeng Qin, Xuwei Liu, Qianyun Hou, Baishuang Zhang, Jie Wu
KEYWORDS:
Similarity Computation, Composed Documents, Map Reduce, System Integration
JOURNAL NAME:
Journal of Computer and Communications,
Vol.3 No.5,
May
25,
2015
ABSTRACT:
There exist a large number of
composed documents in universities in the teaching process. Most of them are
required to check the similarity for validation. A kind of similarity
computation system is constructed for composed documents with images and text
information. Firstly, each document is split and outputs two parts as images
and text information. Then, these documents are compared by computing the
similarities of images and text contents independently. Through Hadoop system,
the text contents are easily and quickly separated. Experimental results show
that the proposed system is efficient and practical.