Visualization of Special Features in “The Tale of Genji” by Text Mining and Correspondence Analysis with Clustering

DOI: 10.4236/jfcmv.2014.21001   PDF   HTML     3,182 Downloads   5,568 Views   Citations


In this paper, visualization of special features in “The Tale of Genji”, which is a typical Japanese classical literature, is studied by text mining the auxiliary verbs and examining the similarity in the sentence style by the correspondence analysis with clustering. The result shows that the text mining error in the number of auxiliary verbs can be as small as 15%. The extracted feature in this study supports the multiple authors of “The Tale of Genji”, which agrees well with the result by Murakami and Imanishi [1]. It is also found that extracted features are robust to the text mining error, which suggests that the classification error is less affected by the text mining error and the possible use of this technique for further statistical study in classical literatures.

Share and Cite:

H. Hosoi, T. Yamagata, Y. Ikarashi and N. Fujisawa, "Visualization of Special Features in “The Tale of Genji” by Text Mining and Correspondence Analysis with Clustering," Journal of Flow Control, Measurement & Visualization, Vol. 2 No. 1, 2014, pp. 1-6. doi: 10.4236/jfcmv.2014.21001.

Conflicts of Interest

The authors declare no conflicts of interest.


[1] M. Murakami and Y. Imanishi, “On a Quantitative Analysis of Auxiliary Verbs Used in Genji Monogatari,” Transactions of Information Processing Society of Japan, Vol. 40, No. 3, 1999, pp. 774-782.
[2] Y. Nakayama, M. Oki, K. Aoki and S. Takayama, “Jomon Pottery Observed from the Point of View of Fluid Mechanics: Did Jomon People Discover Twin and Karman Vortices?” Journal of Visualization, Vol. 7, No. 4, 2004, pp. 349-356.
[3] J. Hertzberg and A. Sweetman, “Images of Fluid Flow: Art and Physics by Student,” Journal of Visualization, Vol. 8, No. 2, 2005, pp. 145-152.
[4] N. Fujisawa, M. Verhoeckx, D. Dabiri, M. Gharib and J. Hertzberg, “Recent Progress in Flow Visualization Techniques toward the Generation of Fluid Art,” Journal of Visualization, Vol. 10, No. 2, 2007, pp. 163-170.
[5] P. Burge, “Hidden Patterns,” Journal of Visualization, Vol. 10, No. 2, 2007, pp. 171-178.
[6] M. Uchida and S. Shirayama, “Formation of Pattern from Complex Networks,” Journal of Visualization, Vol. 10, No. 3, 2007, pp. 253-255.
[7] K. Ohmi, “Music Visualization in Style and Structure,” Journal of Visualization, Vol. 10, No. 3, 2007, pp. 257-258.
[8] R. Sakashita, N. Fujisawa, F. Matsuura and K. Takizawa, “Anaglyph Stereo Visualization of Rhythmical Movements,” Journal of Visualization, Vol. 10, No. 4, 2007, pp. 345-346.
[9] N. Fujisawa, K. Brown, Y. Nakayama, J. Hyatt and T. Corby, “Visualization of Scientific Arts and Some Examples of Applications,” Journal of Visualization, Vol. 11, No. 4, 2008, pp. 387-394.
[10] M. Inami, H. Iwasaki, K. Miyazawa, H. Tuchiya, Y. Saito and K. Horii, “Love between Genji and Utsusemi in the Tale of Genji: Descrete Wavelets Multi-Resolution Analysis,” Transactions of Visualization Society of Japan, Vol. 25, No. 5, 2005, pp. 8-12.
[11] M. Yamada and Y. Murai, “Story Visualization of Literary Works,” Journal of Visualization, Vol. 12, No. 2, 2009, pp. 181-188.
[12] M. Yamada and Y. Murai, “Stereoscopic Story Visualization in Literary Works Demonstrated by Shakespeare’s Plays,” Journal of Visualization, Vol. 13, No. 4, 2010, pp. 355-363.
[13] P. Carpena, P. Bernaola-Galvan, M. Hackenberg, A. V. Coronado and J. L. Oliver, “Level Statistics of Words: Finding Keywords in Literary Texts and Symbolic Sequences,” Physical Review E, Vol. 79, No. 3, 2009, Article ID: 035102.
[14] N. A. Desbiens, “Ancient Japanese Medicine in the Tale of Genji,” The American Journal of Medicine, Vol. 120, No. 6, 2007, p. 560.
[15] K. Eremin, J. Stenger and M. L. Green, “Raman Spectroscopy of Japanese Artist’s Materials: The Tale of Genji by Tosa Mitsunobu,” Journal of Raman Spectroscopy, Vol. 37, No. 10, 2006, pp. 1119-1124.
[16] H. Ueda, M. Murakami, Y. Imanishi, T. Kabashima and Y. Ueda, “Vocabulary indices of The Tale of Genji (in Japanese),” Bensei Press, Tokyo, 1994.
[17] E. Shibuya, “Murasaki Shikibu, The Tale of Genji—The Intelligence & Database on GENJI-MONOGATARI Revised by Fujiwara Teika,” 2013.
[18] S. Petrovic, B. D. Basic, A. Morin, B. Zupan and J. H. Chauchat, “Textual Features for Corpus Visualization Using Correspondence Analysis,” Intelligent Data Analysis, Vol. 13, No. 5, 2009, pp. 795-813.
[19] D. Steingly, “K-Means Clustering: A Half-Century Synthesis,” British Journal of Mathematical and Statistical Psychology, Vol. 59, 2006, pp. 1-34.
[20] K. Ikeda, “Summarization of The Tale of Genji (in Japanese),” Chuokoronsha, Tokyo, 1951.

comments powered by Disqus

Copyright © 2020 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.