Comparison of Word Length Distributions in Spoken and Written Chinese

HTML  XML Download Download as PDF (Size: 419KB)  PP. 1-11  
DOI: 10.4236/oalib.1104660    490 Downloads   1,498 Views  Citations
Author(s)

ABSTRACT

In this study we apply Zipf-Alecseev’s function to word length distributions of Chinese prose and dialogue texts. Since there are two potential measurement units of Chinese word length, we applied Zipf-Alecseev’s function to both of them. The results show that all the word length distributions fit Zipf-Alecseev’s function, no matter the word length is measured in characters or components. The parameters a and b in Zipf-Alecseev’s function y = cxa bln(x) show no difference in different text styles (which are prose and dialogue in our case). However, the parameters are different when word length is measured in different units (character and component respectively). This indicates that the Zipf-Alecseev’s function is sensitive to word length measurement units, but not text styles.

Share and Cite:

Chen, H. (2018) Comparison of Word Length Distributions in Spoken and Written Chinese. Open Access Library Journal, 5, 1-11. doi: 10.4236/oalib.1104660.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.