Journal of Computer and Communications

Volume 3, Issue 5 (May 2015)

ISSN Print: 2327-5219   ISSN Online: 2327-5227

Google-based Impact Factor: 1.12

Cluster Analysis Based on Contextual Features Extraction for Conversational Corpus

PP. 33-37
DOI: 10.4236/jcc.2015.35004

ABSTRACT

Cluster analysis in computational linguistics is seldom concerned with the pragmatic level. Corpus features at the pragmatic level relate to specific situations, including backgrounds, titles, and habits. Improving the accuracy of clustering for conversations collected from international students at Tsinghua University requires contextual features. Here, we collected four hundred conversations as a corpus and represented it as a Vector Space Model. Using the Oxford-Duden Dictionary and other methods, we modified the model and arrived at three groups. We tested our hypothesis with a self-organizing map neural network. The results suggest that the modified model produced a better clustering outcome.
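
As a rough illustration of the Vector Space Model step mentioned in the abstract, the sketch below turns a few tokenised conversations into weighted term vectors and compares them by cosine similarity. The abstract does not state the weighting scheme, so the smoothed TF-IDF weights, the function names `build_vsm` and `cosine`, and the toy conversations are all assumptions made for illustration. The dictionary-based modification and the self-organizing map clustering are not shown.

```python
import math
from collections import Counter


def build_vsm(docs):
    """Build TF-IDF vectors for a list of tokenised documents.

    Assumption: smoothed TF-IDF weighting; the paper's actual
    weighting scheme is not given in the abstract.
    """
    vocab = sorted({tok for doc in docs for tok in doc})
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(tok for doc in docs for tok in set(doc))
    # Smoothed inverse document frequency, always positive.
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1.0 for t in vocab}
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append([tf[t] * idf[t] for t in vocab])
    return vocab, vectors


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical toy conversations, already tokenised and lower-cased.
conversations = [
    ["hello", "professor", "wang"],
    ["hi", "professor", "wang"],
    ["lunch", "canteen", "today"],
]
vocab, vectors = build_vsm(conversations)
print(cosine(vectors[0], vectors[1]) > cosine(vectors[0], vectors[2]))
```

In this toy setting the first two conversations share the title-like tokens "professor" and "wang", so they come out closer to each other than to the third. Vectors of this kind could then be passed to a clustering method such as a self-organizing map.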

Share and Cite:

Chen, Q., Chen, Y. and Jiang, M. (2015) Cluster Analysis Based on Contextual Features Extraction for Conversational Corpus. Journal of Computer and Communications, 3, 33-37. doi: 10.4236/jcc.2015.35004.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.