Open Access Library Journal

Volume 7, Issue 12 (December 2020)

ISSN Print: 2333-9705   ISSN Online: 2333-9721

Google-based Impact Factor: 1.18  Citations  

Research on Chinese Text Feature Extraction and Sentiment Analysis Based on Combination Network

HTML  XML Download Download as PDF (Size: 741KB)  PP. 1-12  
DOI: 10.4236/oalib.1106905    267 Downloads   1,187 Views  Citations
Author(s)

ABSTRACT

The complexity of Chinese language system brings great challenge to sentiment analysis. Traditional artificial feature selection is easy to cause the problem of inaccurate segmentation semantics. High quality preprocessing results are of great significance to the subsequent network model learning. In order to effectively extract key features of sentences, retain feature words while removing irrelevant noise and reducing vector dimensions, an algorithm module based on sentiment lexicon combined with Word2vec incremental training is proposed in terms of feature engineering. Firstly, the data set is cleaned, and the sentence is segmented by loading a custom sentiment lexicon with Jieba. Secondly, the results after stopping words are obtained through Skip-gram training algorithm to obtain the word vector model. Secondly, the model is added to a large corpus for incremental training to obtain a more accurate word vector model. Finally, the features are learned and classified by inputting the embedding layer into the neural network model. Through the comparison experiment of multiple models, it is found that the combined model (CNN-BiLSTM-Attention) has better classification effect and better application ability.

Share and Cite:

Xu, H.Y. and Yang, L.H. (2020) Research on Chinese Text Feature Extraction and Sentiment Analysis Based on Combination Network. Open Access Library Journal, 7, 1-12. doi: 10.4236/oalib.1106905.

Copyright © 2025 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.