<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article">
 <front>
  <journal-meta>
   <journal-id journal-id-type="publisher-id">
    jss
   </journal-id>
   <journal-title-group>
    <journal-title>
     Open Journal of Social Sciences
    </journal-title>
   </journal-title-group>
   <issn pub-type="epub">
    2327-5952
   </issn>
   <issn publication-format="print">
    2327-5960
   </issn>
   <publisher>
    <publisher-name>
     Scientific Research Publishing
    </publisher-name>
   </publisher>
  </journal-meta>
  <article-meta>
   <article-id pub-id-type="doi">
    10.4236/jss.2025.139013
   </article-id>
   <article-id pub-id-type="publisher-id">
    jss-145802
   </article-id>
   <article-categories>
    <subj-group subj-group-type="heading">
     <subject>
      Articles
     </subject>
    </subj-group>
    <subj-group subj-group-type="Discipline-v2">
     <subject>
      Business 
     </subject>
     <subject>
       Economics, Social Sciences 
     </subject>
     <subject>
       Humanities
     </subject>
    </subj-group>
   </article-categories>
   <title-group>
    What Data Can Reveal in Analyzing Literature
    <br>—A Corpus-Based Study on Two Chinese Translations of A Room of One’s Own at the Lexical Level</br>
   </title-group>
   <contrib-group>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Ting
      </surname>
      <given-names>
       Hu
      </given-names>
     </name>
    </contrib>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Ziyue
      </surname>
      <given-names>
       Feng
      </given-names>
     </name>
    </contrib>
   </contrib-group> 
   <aff id="affnull">
    <addr-line>
     aSchool of Foreign Languages, Shanghai Institute of Technology, Shanghai, China
    </addr-line> 
   </aff> 
   <pub-date pub-type="epub">
    <day>
     08
    </day> 
    <month>
     09
    </month>
    <year>
     2025
    </year>
   </pub-date> 
   <volume>
    13
   </volume> 
   <issue>
    09
   </issue>
   <fpage>
    209
   </fpage>
   <lpage>
    216
   </lpage>
   <history>
    <date date-type="received">
     <day>
      22,
     </day>
     <month>
      August
     </month>
     <year>
      2025
     </year>
    </date>
    <date date-type="published">
     <day>
      19,
     </day>
     <month>
      August
     </month>
     <year>
      2025
     </year> 
    </date> 
    <date date-type="accepted">
     <day>
      19,
     </day>
     <month>
      September
     </month>
     <year>
      2025
     </year> 
    </date>
   </history>
   <permissions>
    <copyright-statement>
     © Copyright 2014 by authors and Scientific Research Publishing Inc. 
    </copyright-statement>
    <copyright-year>
     2014
    </copyright-year>
    <license>
     <license-p>
      This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/
     </license-p>
    </license>
   </permissions>
   <abstract>
    Quantitative methods have empowered researchers with new perspectives to assume, test, and interpret what can hardly be analyzed through manual reading. This study employs corpora and data processing techniques to analyze and compare the discourse of A Room of One’s Own by Virginia Woolf and its two Chinese translations by Zhou Yingqi and Jia Huifeng, respectively. WordSmith and Python are used for data processing and statistical analysis for quantitative examination at lexical levels. The study disclosed a paradox of Woolf’s stylistic complexity versus her low lexical diversity that may not have been adequately clarified in previous studies. The statistical findings offered a data-driven explanation for this apparent contradiction. The comparison also unveiled the diverse translation strategies employed by different translators due to their varying interpretations of the source novel and the distinct ideological positions or communicative goals that have informed their choices.
   </abstract>
   <kwd-group> 
    <kwd>
     Corpus
    </kwd> 
    <kwd>
      A Room of One’s Own
    </kwd> 
    <kwd>
      Translation
    </kwd> 
    <kwd>
      Strategies
    </kwd>
   </kwd-group>
  </article-meta>
 </front>
 <body>
  <sec id="s1">
   <title>1. Introduction</title>
   <p>
    <xref ref-type="bibr" rid="scirp.145802-"></xref>The growing corpora and data analysis technology are transforming translation comparison studies by providing empirical, large-scale, and systematic evidence. The corpus-based approach can offer a more objective way to identify patterns in translation that anecdotal observation and manual analysis can hardly uncover (<xref ref-type="bibr" rid="scirp.145802-5">
     Heine &amp; Narrog, 2015
    </xref>). The frequencies and statistical information about the structures and rules of texts that are invisible when looking into a single context can be retrieved, providing an overall picture out of thousands of usages (<xref ref-type="bibr" rid="scirp.145802-2">
     Atar &amp; Erdem, 2019
    </xref>). This study employs corpora techniques to analyze and compare the two translations of Virginia Woolf’s A Room of One’s Own by two translators, Zhou Yingqi and Jia Huifeng, at the lexical level, with an aim to disclose some trends that are unseen before.</p>
   <p>First published in 1929, A Room of One’s Own was composed based on two lectures given by the author in 1928 at Newnham College and Girton College, two higher education institutions for women at Cambridge at that time. By stating the status of women and women artists in particular, Woolf presents her central theme that if a woman were to write, having money and a room of her own was essential. Women are not short of creativity, but were undermined by centuries of discrimination and disadvantages in finance and education imposed and practiced by the culture and society of that time.</p>
   <p>In the novel, metaphors are used to illustrate social injustices and women’s lack of free expression. A fish is used to exemplify how a woman with some ideals for writing is rigidized and loses her inspiration eventually, for the society labels women as mere domestic child bearers, ignorant and chaste. Woolf’s arguments are extended to many political circumstances later, in which case the shape of social problems has shifted, but the absence of opportunity still causes isolation and inequality. These arouse social attention and awareness of women’s suffering from the pressures of society that limited women. Women have been marginalized for decades. In literature, a male-dominated field, women’s writing and publishing symbolize the rise of women and challenges to the discrimination.</p>
   <p>Virginia Woolf was thus recognized as a prominent literary figure in history, with A Room of One’s Own as the most famous non-fiction work on feminism. She is acknowledged as one of the best and brightest writers of the 20th century not only for her impressive descriptive powers, which are characterized by lively, graceful prose, but also for her efforts to push the boundaries of activism and literature (<xref ref-type="bibr" rid="scirp.145802-12">
     Transue, 1986
    </xref>).</p>
  </sec><sec id="s2">
   <title>2. Data Collection and Processing</title>
   <p>
    <xref ref-type="bibr" rid="scirp.145802-"></xref>Texts of Virginia Woolf’s A Room of One’s Own and two translations from Jia Huifeng and Zhou Yingqi are first cleaned in Word to remove garbled characters, spaces, tabs, carriage returns, and redundant information. WordSmith, primarily designed to tokenize based on whitespace, is used for Woolf’s text in English for word counting or clustering in either alphabetical or frequency order. Concordances are created in the Concord module to facilitate the search of previously defined text, statistical profiles of the words or word forms are unveiled in the Wordlist module, and keywords can be listed in the KeyWord module to show their frequency.</p>
   <p>However, as the Chinese language does not use spaces to separate words, tokenizing Chinese in WordSmith is a known challenge. Thus, Jieba, a Python library for Chinese text segmentation, is used with the jieba.cut() function on the two Chinese translation texts. The pre-processed texts with spaces between words are then imported into WordSmith to perform analysis.</p>
  </sec><sec id="s3">
   <title>3. Results and Discussion</title>
   <p>Linguistic variables of tokens and types are statistical terminologies used in corpus linguistics to unpack the model of texts. Words in different contexts are to be calculated and then compared to identify patterns of language use. Different analyses using different tools under different variable definition criteria may yield quite different results. Therefore, in this study, a token is defined as a word counted as one token that is separated by white space or punctuation. A type refers to how many different word forms exist in the text. The occurrence of the same word is counted as one type.</p>
   <sec id="s3_1">
    <title>3.1. Token and Type</title>
    <p>As shown in <xref ref-type="table" rid="table1">
      Table 1
     </xref>, the tokens of Jia’s translation and Zhou’s translation are respectively 34,543 and 34,502, while the number of types of these two Chinese translations are respectively 7,151 and 6,221, compared with Woolf’s 37,655 and 5,573. It indicates that there is no significant difference in the two translations from Woolf’s essay. However, the distinction in type is obvious between the translations and Woolf’s original work, and between the two translations. Word forms in Jia’s translation are significantly more diverse than Zhou’s, with the difference being 1,596, or 28.7%, and 930, or 11.9%, respectively. This signifies that the diversity of words in Jia’s translation is higher than Zhou’s (See <xref ref-type="table" rid="table2">
      Table 2
     </xref>).</p>
    <table-wrap id="table1">
     <label>
      <xref ref-type="table" rid="table1">
       Table 1
      </xref></label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145802-"></xref>Table 1. Token and type.</title>
     </caption>
     <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
      <tr> 
       <td class="custom-bottom-td acenter" width="33.33%"><p style="text-align:center">Text</p></td> 
       <td class="custom-bottom-td acenter" width="33.33%"><p style="text-align:center">Token</p></td> 
       <td class="custom-bottom-td acenter" width="33.34%"><p style="text-align:center">Type</p></td> 
      </tr> 
      <tr> 
       <td class="custom-top-td acenter" width="33.33%"><p style="text-align:center">Woolf’s</p></td> 
       <td class="custom-top-td acenter" width="33.33%"><p style="text-align:center">37,655</p></td> 
       <td class="custom-top-td acenter" width="33.34%"><p style="text-align:center">5573</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="33.33%"><p style="text-align:center">Zhou’s</p></td> 
       <td class="acenter" width="33.33%"><p style="text-align:center">34,502</p></td> 
       <td class="acenter" width="33.34%"><p style="text-align:center">6221</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="33.33%"><p style="text-align:center">Jia’s</p></td> 
       <td class="acenter" width="33.33%"><p style="text-align:center">34,543</p></td> 
       <td class="acenter" width="33.34%"><p style="text-align:center">7151</p></td> 
      </tr> 
     </table>
    </table-wrap>
    <table-wrap id="table2">
     <label>
      <xref ref-type="table" rid="table2">
       Table 2
      </xref></label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145802-"></xref>Table 2. Tokens by chapter.</title>
     </caption>
     <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Chapter</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">Woolf’s</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">Jia’s</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">Zhou’s</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">Jia vs. Woolf’s</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">Zhou vs. Woolf’s</p></td> 
       <td class="acenter" width="13.42%"><p style="text-align:center">Jia vs. Zhou</p></td> 
      </tr> 
      <tr> 
       <td class="custom-top-td acenter" width="19.58%"><p style="text-align:center">Chapter 1</p></td> 
       <td class="custom-top-td acenter" width="13.40%"><p style="text-align:center">7509</p></td> 
       <td class="custom-top-td acenter" width="13.40%"><p style="text-align:center">6857</p></td> 
       <td class="custom-top-td acenter" width="13.40%"><p style="text-align:center">7083</p></td> 
       <td class="custom-top-td acenter" width="13.40%"><p style="text-align:center">652</p></td> 
       <td class="custom-top-td acenter" width="13.40%"><p style="text-align:center">426</p></td> 
       <td class="custom-top-td acenter" width="13.42%"><p style="text-align:center">226</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Chapter 2</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">5300</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">4778</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">4746</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">522</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">554</p></td> 
       <td class="acenter" width="13.42%"><p style="text-align:center">32</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Chapter 3</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">5383</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">4915</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">4976</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">468</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">407</p></td> 
       <td class="acenter" width="13.42%"><p style="text-align:center">61</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Chapter 4</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">7000</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">6447</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">6412</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">553</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">588</p></td> 
       <td class="acenter" width="13.42%"><p style="text-align:center">35</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Chapter 5</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">5498</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">5254</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">5033</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">244</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">465</p></td> 
       <td class="acenter" width="13.42%"><p style="text-align:center">221</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Chapter 6</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">6965</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">6292</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">6252</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">673</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">713</p></td> 
       <td class="acenter" width="13.42%"><p style="text-align:center">40</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Total</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">37,655</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">34,543</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">34,502</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">3112</p></td> 
       <td class="acenter" width="13.40%"><p style="text-align:center">3153</p></td> 
       <td class="acenter" width="13.42%"><p style="text-align:center">41</p></td> 
      </tr> 
     </table>
    </table-wrap>
    <p>Tokens in each chapter are then calculated to observe the dispersion of words in the essay and translation. <xref ref-type="table" rid="table2">
      Table 2
     </xref> shows that the two translations are similarly close in tokens with two exceptions, Chapter 1 and Chapter 5. The first and fifth chapters introduce and reiterate the essay’s theme that a woman should have her own room for writing, with evidence exemplifying the inequality women endure. The divergence of the word count in these two chapters indicates that the two translators use different words and word lengths in translation.</p>
    <p>To investigate how words are diverse in addition to word count distribution, tokens by word class are investigated. As <xref ref-type="table" rid="table3">
      Table 3
     </xref> shows, in general, verb and noun are the two word classes used most in the Chinese translations, much higher than adverb and adjective, almost 3 to 3.5 times higher, while in Woolf’s work, verb and noun are 3 to 3.5 times higher than adverb and adjectives. This echoes what previous studies have found. The proportion of nouns and verbs in Chinese speakers is evidently higher than that of English speakers, even from childhood (<xref ref-type="bibr" rid="scirp.145802-3">
      Choi, 2000
     </xref>). Despite individual characteristics and activity context of children, the extent of nouns or verbs in Chinese children’s vocabularies is equally significant. When comparing noun and verb specifically, English speakers contain a much higher proportion of nouns in their vocabulary than verbs, while Chinese speakers use almost the same number of nouns and verbs. The proportion of verbs used by Chinese speakers is much higher than that of English speakers (<xref ref-type="bibr" rid="scirp.145802-11">
      Tardif, 2006
     </xref>). The same patterns are found in two Chinese translations.</p>
    <table-wrap id="table3">
     <label>
      <xref ref-type="table" rid="table3">
       Table 3
      </xref></label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145802-"></xref>Table 3. Tokens by word class.</title>
     </caption>
     <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
      <tr> 
       <td class="custom-bottom-td acenter" width="19.58%"><p style="text-align:center">Word Class</p></td> 
       <td class="custom-bottom-td acenter" width="16.08%"><p style="text-align:center">Zhou’s</p></td> 
       <td class="custom-bottom-td acenter" width="16.08%"><p style="text-align:center">Percentage</p></td> 
       <td class="custom-bottom-td acenter" width="16.08%"><p style="text-align:center">Jia’s</p></td> 
       <td class="custom-bottom-td acenter" width="16.08%"><p style="text-align:center">Percentage</p></td> 
       <td class="custom-bottom-td acenter" width="16.08%"><p style="text-align:center">Woolf’s</p></td> 
      </tr> 
      <tr> 
       <td class="custom-top-td acenter" width="19.58%"><p style="text-align:center">Noun</p></td> 
       <td class="custom-top-td acenter" width="16.08%"><p style="text-align:center">6901</p></td> 
       <td class="custom-top-td acenter" width="16.08%"><p style="text-align:center">9.14%</p></td> 
       <td class="custom-top-td acenter" width="16.08%"><p style="text-align:center">6928</p></td> 
       <td class="custom-top-td acenter" width="16.08%"><p style="text-align:center">8.98%</p></td> 
       <td class="custom-top-td acenter" width="16.08%"><p style="text-align:center">8726</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Verb</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">6981</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">9.25%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">7214</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">9.35%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">6622</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Adjective</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">2518</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">3.34%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">2505</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">3.25%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">2369</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Adverb</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">2659</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">3.52%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">2967</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">3.84%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">2298</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Pronoun</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">4214</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">5.58%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">3729</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">3.84%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">3565</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Conjunction</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">1363</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">1.81%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">1456</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">4.83%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">1877</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Preposition</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">1101</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">1.46%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">1044</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">1.89%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">5536</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Numerals</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">807</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">1.07%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">904</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">1.35%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">332</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="19.58%"><p style="text-align:center">Total</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">26,544</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">35.17%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">26,747</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">35%</p></td> 
       <td class="acenter" width="16.08%"><p style="text-align:center">25,943</p></td> 
      </tr> 
     </table>
    </table-wrap>
    <p>As widely recognized, nouns and abstract entities are often prioritized in Woolf’s prose to create a rhythm of contemplative stasis (<xref ref-type="bibr" rid="scirp.145802-1">
      Ahmed, Ahmad, &amp; Afzaal, 2024
     </xref>). Such economy of verbs embodies her feminist discourse in that the nominal weight highlights women’s material and intellectual space while verbs symbolize masculinized narratives of action and conquest. By contrast, a greater density of verbs and adverbs in the Chinese translations is firstly due to Chinese linguistic structure. Verb-driven constructions are more entitled in Chinese than in English, which illustrates the use of domestication strategies in translation to clarify logical relations and adverbs to intensify evaluative stance. However, it may soften Woolf’s stylistic resistance that she produces and demands for women (<xref ref-type="bibr" rid="scirp.145802-6">
      Huang, 2020
     </xref>).</p>
   </sec>
   <sec id="s3_2">
    <title>3.2. Type/Token Ratio and Standardized Type/Token Ratio</title>
    <p>To investigate how different words are used to explore their meanings for communication, lexical diversity statistics are often calculated to determine whether a wide range or a limited range of vocabulary is used in the text. TTR (type/token ratio) is the proportion of different word forms versus running words. A higher TTR value usually points to a more lexically varied text. STTR (standardized type/token ratio) is the text divided into standard-size segments, here in 1,000 words, to calculate the mean value of the TTRs for each segment. TTR is calculated as Formula 1:</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mfrac> 
        <mrow> 
         <mtext>
           type 
         </mtext> 
        </mrow> 
        <mrow> 
         <mtext>
           token 
         </mtext> 
        </mrow> 
       </mfrac> 
       <mtext>
         ratio 
       </mtext> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mrow> 
         <mtext>
           number 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
           of 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
           types 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
           in 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
           text 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
           or 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
           corpus 
         </mtext> 
        </mrow> 
        <mrow> 
         <mtext>
           number 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
           of 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
           tokens 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
           in 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
           text 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
           or 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
           corpus 
         </mtext> 
        </mrow> 
       </mfrac> 
      </mrow> 
     </math> (1)</p>
    <p>As shown in <xref ref-type="table" rid="table4">
      Table 4
     </xref>, the STTR (Standardized Type/token ratio) in Jia’s translation (55.95%) is relatively higher than in Zhou’s (53.01%), but the discrepancy (2.94%) is small, indicating no significant divergence in translations. Notably, the TTR and STTR in the Chinese translations are significantly higher than in Woolf’s work, with the difference being 9.25% and 12.19%, respectively.</p>
    <table-wrap id="table4">
     <label>
      <xref ref-type="table" rid="table4">
       Table 4
      </xref></label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145802-"></xref>Table 4. TTR and STTR.</title>
     </caption>
     <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
      <tr> 
       <td class="custom-bottom-td acenter" width="33.33%"><p style="text-align:center">Text</p></td> 
       <td class="custom-bottom-td acenter" width="33.33%"><p style="text-align:center">TTR</p></td> 
       <td class="custom-bottom-td acenter" width="33.34%"><p style="text-align:center">STTR</p></td> 
      </tr> 
      <tr> 
       <td class="custom-top-td acenter" width="33.33%"><p style="text-align:center">Woolf’s</p></td> 
       <td class="custom-top-td acenter" width="33.33%"><p style="text-align:center">14.80%</p></td> 
       <td class="custom-top-td acenter" width="33.34%"><p style="text-align:center">43.76%</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="33.33%"><p style="text-align:center">Zhou’s</p></td> 
       <td class="acenter" width="33.33%"><p style="text-align:center">18.12%</p></td> 
       <td class="acenter" width="33.34%"><p style="text-align:center">53.01%</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="33.33%"><p style="text-align:center">Jia’s</p></td> 
       <td class="acenter" width="33.33%"><p style="text-align:center">20.70%</p></td> 
       <td class="acenter" width="33.34%"><p style="text-align:center">55.95%</p></td> 
      </tr> 
     </table>
    </table-wrap>
    <p>The small difference between the two translations highlights how both the inherent linguistic distinctions of Chinese and English, and the specific strategies employed by the translators, contribute to the final translated text.</p>
   </sec>
   <sec id="s3_3">
    <title>3.3. Keywords</title>
    <p>Key words are investigated to explore why there is a paradox of Woolf’s reputation for stylistic complexity but with lower TTR and STTR compared with its translations. As Woolf’s A Room of One’s Own is about women and advocates equality and feminism, the world “women” and “equality” are examined. <xref ref-type="table" rid="table5">
      Table 5
     </xref> shows that Woolf used the word “women” in her essay 196 times and “female” 6 times while in the two Chinese translations. <xref ref-type="table" rid="table6">
      Table 6
     </xref> shows that “女性” is used 159 times and 269 times, and “女人” is used 127 times and 83 times respectively by Jia and Zhou. A chi-square test shows a statistically significant difference between Zhou’s and Jia’s usage of “女性” and “女人”, with p &lt; 0.00000005.</p>
    <p>As the statistics indicate, Woolf’s prose relies heavily on the repetition of such key words as ‘room’, ‘money’, ‘women’, and ‘freedom’. The recurrence produces the rhetorical backbone of her feminist argument and secures thematic cohesion across the texts. That also explains the paradox. Meanwhile, the higher lexical</p>
    <table-wrap id="table5">
     <label>
      <xref ref-type="table" rid="table5">
       Table 5
      </xref></label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145802-"></xref>Table 5. Keyword “women, girl, female” in Woolf’s text.</title>
     </caption>
     <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
      <tr> 
       <td class="custom-bottom-td acenter" width="50.00%"><p style="text-align:center">Word</p></td> 
       <td class="custom-bottom-td acenter" width="50.00%"><p style="text-align:center">TTR</p></td> 
      </tr> 
      <tr> 
       <td class="custom-top-td acenter" width="50.00%"><p style="text-align:center">women</p></td> 
       <td class="custom-top-td acenter" width="50.00%"><p style="text-align:center">196</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="50.00%"><p style="text-align:center">Zhou’s</p></td> 
       <td class="acenter" width="50.00%"><p style="text-align:center">15</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="50.00%"><p style="text-align:center">Jia’s</p></td> 
       <td class="acenter" width="50.00%"><p style="text-align:center">6</p></td> 
      </tr> 
     </table>
    </table-wrap>
    <table-wrap id="table6">
     <label>
      <xref ref-type="table" rid="table6">
       Table 6
      </xref></label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145802-"></xref>Table 6. Keyword “women” in translations.</title>
     </caption>
     <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
      <tr> 
       <td class="custom-bottom-td acenter" width="33.33%"><p style="text-align:center">Word</p></td> 
       <td class="custom-bottom-td acenter" width="33.33%"><p style="text-align:center">Zhou’s</p></td> 
       <td class="custom-bottom-td acenter" width="33.34%"><p style="text-align:center">Jia’s</p></td> 
      </tr> 
      <tr> 
       <td class="custom-top-td acenter" width="33.33%"><p style="text-align:center">女性</p></td> 
       <td class="custom-top-td acenter" width="33.33%"><p style="text-align:center">269</p></td> 
       <td class="custom-top-td acenter" width="33.34%"><p style="text-align:center">159</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="33.33%"><p style="text-align:center">女人</p></td> 
       <td class="acenter" width="33.33%"><p style="text-align:center">83</p></td> 
       <td class="acenter" width="33.34%"><p style="text-align:center">127</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="33.33%"><p style="text-align:center">女子</p></td> 
       <td class="acenter" width="33.33%"><p style="text-align:center">4</p></td> 
       <td class="acenter" width="33.34%"><p style="text-align:center">31</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="33.33%"><p style="text-align:center">女士</p></td> 
       <td class="acenter" width="33.33%"><p style="text-align:center">6</p></td> 
       <td class="acenter" width="33.34%"><p style="text-align:center">2</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="33.33%"><p style="text-align:center">女孩</p></td> 
       <td class="acenter" width="33.33%"><p style="text-align:center">10</p></td> 
       <td class="acenter" width="33.34%"><p style="text-align:center">5</p></td> 
      </tr> 
     </table>
    </table-wrap>
    <p>diversity, as illustrated in TTR and STTR in the two translations, is caused by synonymic variation and grammatical compensation. “女性”, “女人”, “女子” and “女孩” are used alternately in the Chinese translation to reduce repetition according to context. This increases the TTR/STTR. Meanwhile, meanings are often compressed in English nominalization, leading translators to unpack the nouns with verbs, adjectives, and adverbs to specify and elaborate, to account for Woolf’s abstract terms and to bridge cultural gaps. These extensions add lexical variety.</p>
    <p>What is interesting is the contrast in the number of “女性” in the two translations. To explore the specific meaning of “woman” and “female,” definitions from several dictionaries are examined. In the Oxford English Dictionary, “woman” is first a noun, referring to “an adult female human, the counterpart of man,” while the word “female” “describes the presence of ovaries and a uterus in some object that experiences sexual differentiation (human, mammal, lizard, dolphin, etc.)” (Oxford Dictionary). The word “woman” and its plural form “women” are used as opposed to “men,” while “female” emphasizes sexual distinction from a more biological view. That explains why in Woolf’s original work, the word “women” was used most of the time. In the English-Chinese Cambridge Dictionary, the word “woman” is translated as “成年女子，女人,” while “female” is translated as “雌性动物; 女人, 女性” as a noun. “女性” being specifically used in translating “female” rather than in “woman” signifies that in English, the word “女性” represents more sexual significance. In the Chinese Dictionary (汉语大词典), the definition of “女” is “女性, 与‘男’相对,” which is closer to “woman” in English in terms of literary meaning.</p>
    <p>The linguistic alternation of ‘women’ and ‘female’ is deliberately used in Woolf’s prose to distinguish identity and critique. To Woolf, “woman” is used to advocate women’s historical and social identities, who are deprived of education, financial independence, and access to the literary canon (<xref ref-type="bibr" rid="scirp.145802-9">
      Moi, 1985
     </xref>; <xref ref-type="bibr" rid="scirp.145802-8">
      Marcus, 1981
     </xref>) solely because of gender regardless of individual difference (<xref ref-type="bibr" rid="scirp.145802-10">
      Showalter, 1977
     </xref>). “Female” is used to signal the perspective imposed to reduce women to their biological sex rather than recognizing their intellectual or artistic subjectivity (<xref ref-type="bibr" rid="scirp.145802-4">
      Gubar
     </xref><xref ref-type="bibr" rid="scirp.145802-4">
      , 1981
     </xref>; <xref ref-type="bibr" rid="scirp.145802-7">
      Lanser, 1992
     </xref>), highlighting the objectification and categorization of women by male-dominated institutions. The rhetorical shifts are used to demonstrate how language itself is engaged in the construction and limitation of gender identities (<xref ref-type="bibr" rid="scirp.145802-8">
      Marcus, 1981
     </xref>).</p>
    <p>For the translations, Zhou’s preference for “女性,” which carries stronger academic, sociological, and feminist implications in modern Chinese discourse, illustrates the intention of maintaining consistency with a feminist, academic register that highlights women’s social identity instead of their biological role. This aligns with Zhou’s gender-oriented translation strategy.</p>
    <p>In comparison, Jia varies “women” into “女人”, “妇女” and “女性” to produce a more diverse lexical field. “女人” indicates the existence of women in everyday life with a broader cultural framing, and “妇女” implies the demographic presentation in Chinese, though it is used more in social policy contexts. This synonymic variation suggests an orientation toward accessibility to facilitate the readability of the general public, a domestication strategy.</p>
   </sec>
  </sec><sec id="s4">
   <title>4. Conclusion</title>
   <p>The use of corpora and data analysis has provided empirical grounds on which researchers can interpret, test, or exemplify what could never be analyzed through manual reading. Some subtle, non-obvious patterns become apparent, and paradox is explained in interpreting the statistical results. Woolf’s repetition of thematic keywords to increase feminist solidarity decreases its lexical diversity. The language structural differences in Chinese and English result in the disparity of word class in Woolf’s prose and the two Chinese translations. The distinctive translation strategies adopted by the two translators are manifested, with Zhou following more of the feminist strategy while Jia chooses domestication to avoid monopoly. The findings complement the theoretical assumptions with empirical evidence. The quantitative methods will advance our understanding of discourse and translation from a more diverse perspective.</p>
  </sec>
 </body><back>
  <ref-list>
   <title>References</title>
   <ref id="scirp.145802-ref1">
    <label>1</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Ahmed, S. N., Ahmad, S.,&amp;Afzaal, K. (2024). Exploring Subjectivity and Modernist Language Techniques: Time, Gender, and Identity in Virginia Woolf’s A Room of One’s Own. Journal of Social Sciences Review, 4, 169-180. &gt;https://doi.org/10.54183/jssr.v4i4.439
    </mixed-citation>
   </ref>
   <ref id="scirp.145802-ref2">
    <label>2</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Atar, C.,&amp;Erdem, C. (2019). The Advantages and Disadvantages of Corpus Linguistics and Conversation Analysis in Second Language Studies. 140-142.
    </mixed-citation>
   </ref>
   <ref id="scirp.145802-ref3">
    <label>3</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Choi, S. (2000). Caregiver Input in English and Korean: Use of Nouns and Verbs in Book-Reading and Toy-Play Contexts. Journal of Child Language, 27, 69-96. &gt;https://doi.org/10.1017/S0305000999004018
    </mixed-citation>
   </ref>
   <ref id="scirp.145802-ref4">
    <label>4</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Gubar, S. (1981). What Ails Feminist Criticism? Critical Inquiry, 8, 255-273. &gt;https://doi.org/10.1086/448153
    </mixed-citation>
   </ref>
   <ref id="scirp.145802-ref5">
    <label>5</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Heine, B.,&amp;Narrog, H. (2015). The Oxford Handbook of Linguistic Analysis. Oxford University Press. &gt;https://doi.org/10.1093/oxfordhb/9780199677078.001.0001
    </mixed-citation>
   </ref>
   <ref id="scirp.145802-ref6">
    <label>6</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Huang, Z. (2020). A Room of One’s Own in Suzhou: Analyzing the Dissemination of Woolf’s Feminism in China. Advances in Social Science, Education and Humanities Research, 638, 118-121.
    </mixed-citation>
   </ref>
   <ref id="scirp.145802-ref7">
    <label>7</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Lanser, S. S. (1992). Fictions of Authority: Women Writers and Narrative Voice. Cornell University Press, 304.
    </mixed-citation>
   </ref>
   <ref id="scirp.145802-ref8">
    <label>8</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Marcus, J. (1981). New Feminist Essays on Virginia Woolf. Macmillan. &gt;https://doi.org/10.1007/978-1-349-05486-2
    </mixed-citation>
   </ref>
   <ref id="scirp.145802-ref9">
    <label>9</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Moi, T. (1985). Sexual/Textual Politics: Feminist Literary Theory. Routledge.
    </mixed-citation>
   </ref>
   <ref id="scirp.145802-ref10">
    <label>10</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Showalter, E. (1977). A Literature of Their Own: British Women Novelists from Brontë to Lessing. Princeton University Press. &gt;https://doi.org/10.1515/9780691221960
    </mixed-citation>
   </ref>
   <ref id="scirp.145802-ref11">
    <label>11</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Tardif, T. (2006). But Are They Really Verbs? Chinese Words for Action. In K. A. Hirsh-Pasek,&amp;R. M. Golinkoff (Eds.), Action Meets Word: How Children Learn Verbs (p. 477). Oxford University Press. &gt;https://doi.org/10.1093/acprof:oso/9780195170009.003.0019
    </mixed-citation>
   </ref>
   <ref id="scirp.145802-ref12">
    <label>12</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Transue, P. J. (1986). Virginia Woolf and the Politics of Style. SUNY Press.
    </mixed-citation>
   </ref>
  </ref-list>
 </back>
</article>