The Influence of Clan Surname Diversity on County Economic Development Performance in China: An Empirical Study Based on Chinese Genealogy Data

Culture is considered a deep-seated factor affecting economic development, and the economic impact of cultural diversity in particular has attracted considerable attention. At the county level, clans play an important cultural and social role. Although traditional clans in China are all influenced by Confucianism, different clans have different cultural inheritances. Genealogies, ancestral precepts and clan rules are the carriers of each clan’s ideological inheritance. Different clans are distinguished by surname, and their cultural backgrounds and concepts are quite diverse. This paper examines the impact of this diversity on the performance of county economic development. It collects data from the Chinese Genealogy published by the Shanghai Library, constructs the number of clan surnames and a clan surname diversity index for 1087 counties, and uses the night light brightness data released by the National Oceanic and Atmospheric Administration (NOAA) to measure the economic development performance of counties in China. Empirical results show that the diversity of clan surnames has a significant positive impact on the performance of county economic development, and this result remains stable in robustness tests. The more diverse the clan surnames are, the better the economic development performance of a county.


Introduction
Traditional economic theory holds that the main factors affecting the performance of economic development are capital, labor and other factors of production, as well as technology and institutions. Differences in these factors lead to differences in regional economic development, and there are many empirical studies in this regard. But over the long course of history, these factors are relatively easy to change. Therefore, the three factors that traditional analysis uses to explain economic development are not enough to explain the whole history of national and regional economic development and the gap between rich and poor (Luo Hao, 2009) [1]. Culture, which has accumulated and settled gradually over the course of human development, is a relatively fixed set of beliefs and values (Guiso et al., 2006) [2]. Culture has a profound impact on people's behavior and ideas, and it is a deeper factor affecting economic development performance (Pan Yue et al., 2017) [3].
Early studies of culture and the economy were mainly theoretical. Weber was among the first to raise the issue. Weber (1904) [4] pointed out that the doctrines upheld by Protestants in Western Europe gave birth to the spirit of capitalism, thus promoting the economic development of Western society. Since the 1990s, many scholars have begun to analyze the impact of culture on national or regional economies empirically.
In recent years, more and more scholars have paid attention to the economic impact of cultural diversity. Xu Xianxiang et al. (2015) [5] studied the influence of dialect diversity on the economic growth of cities at prefecture level and above. Their empirical results show that diversity hinders the economic growth of cities. Li Guangqin et al. (2017) [6] believe that linguistic diversity is not conducive to the improvement of regional openness. Pan Yue et al. (2017) [3] believe that cultural diversity is conducive to the exchange and collision of ideas among people from different backgrounds, resulting in a relaxed cultural atmosphere that favors enterprise innovation. These studies emphasize the important influence of cultural diversity on China's economic development.
China's culture has a long history of more than two thousand years, and the
The scope of this special trust is relatively small, which will have a negative impact on regional business development, innovation and entrepreneurship, and the development of modern civilization (Greif and Tabellini, 2017) [10]. Therefore, from the perspective of cultural diversity, this paper further distinguishes clans by surname and measures the impact of clan surname diversity on county economic development performance.
In short, the economic impact of cultural diversity has attracted considerable attention in recent years. The empirical literature mostly studies the impact of linguistic diversity on economic development, and there is no empirical study of economic development from the perspective of clan surname diversity. This paper empirically analyzes the impact of clan surname diversity on economic development performance at the county level.

The Current Research of Cultural Diversity and Economy
Human society has different groups and social forms. Each group inherits and develops culture in its own way through different cultural manifestations, thus forming cultural diversity. Cultural diversity has rich connotations and is mainly shaped by race, language, religious beliefs, living environment and other factors. In China, cultural diversity is mainly manifested in ethnic cultural diversity.
The different cultures of different nationalities enrich Chinese culture and are an important driving force for the progress of civilization and the exchange of economic activities. In recent years, as research on the impact of culture on economic development has progressed, cultural diversity has become a hot topic among scholars. In the existing literature, opinions differ on the role of cultural diversity in the economy.
Cultural diversity has both positive and negative effects.
One strand of the literature finds that cultural diversity hinders economic development. It mainly holds that higher cultural diversity increases the cost of communication, reduces social identity and interpersonal trust, and to a certain extent hinders the cross-regional mobility of people. Tsui et al. (1992) [13] conducted a field study from the perspective of the cultural diversity of entrepreneurial teams and found that cultural diversity is not conducive to increasing enterprise output or developing enterprise financing. The mechanism is that cultural diversity hinders self-categorization and social identity. Xu Xianxiang et al. (2015) [5] carried out an empirical analysis using data on Chinese cities at prefecture level and above. They found that dialect diversity hinders the dissemination of knowledge and technology, and thus has a negative impact on economic growth. Zhang Bo and Fan Chenchen (2018) [14] found that cultural diversity is not conducive to the prosperity of private finance at the prefecture level in China. The mechanism is that diversity reduces the social capital formed by a shared regional culture and identity, thus increasing the transaction costs and risks of private lending. Ding Congming et al. (2018) [15] argued that cultural and dialectal areas have historically overlapped with administrative divisions, which may lead to underestimating the impact of diversity and reduce the accuracy of empirical analysis. They therefore combined each central city with its adjacent cities to construct city circles. Through empirical analysis, they found that dialect diversity hinders the integration of the domestic market. Li Guangqin et al. (2017) [6] first analyzed the impact of linguistic diversity on the degree of regional opening up. They concluded that the more diverse the languages, the lower the degree of regional opening up.
Another strand of the literature finds that cultural diversity promotes economic development. One empirical analysis shows that cultural diversity promotes stock market prosperity in general. Liu Gang and Wang Zeyu (2016) [18] studied the impact of cultural diversity on Internet venture financing. Using data on Internet product crowdfunding, they found that the cultural diversity of entrepreneurial teams is conducive to improving the performance of Internet financing. However, this promotion effect declines after a certain turning point, so the overall relationship is an inverted U shape, which is moderated by the education level and entrepreneurial experience of team members. Pan Yue et al. (2017) [3] studied the impact of cultural diversity on enterprise innovation. They used the number of dialects and a dialect differentiation index to measure cultural diversity. Their empirical results show that cultural diversity promotes enterprise innovation.

Measurement of Cultural Diversity
Alesina and Giuliano (2015) [11] summarized three methods of measuring culture in empirical analysis: 1) Using statistical survey data that include cultural variables. 2) Controlling for other economic and institutional environmental factors.
Second, from different perspectives of cultural connotation, the measurement of cultural diversity mainly covers ethnic diversity, religious diversity, linguistic diversity and birthplace diversity. Alesina and Ferrara (2005) [20] measured ethnic diversity at the city and county levels in the United States and analyzed its impact on economic growth. Their empirical analysis found that ethnic diversity hinders regional economic growth. Guo Yunnan and Wang Chunfei (2017) [21] found that investment in public goods is higher in villages with religious beliefs in China's rural areas. They further distinguished between religions, and their empirical analysis shows that Christian beliefs play a greater role in village investment in public goods. Owen and Videras (2007) [22] found that although religious believers are more willing to support environmental protection, believers of different religions hold different attitudes and ideas about specific measures. Qian (2013) [23] used data on the birth nationality of American urban residents to measure cultural diversity and analyze its impact on innovation.
Ding Congming et al. (2018) [15] added minority dialects to the Chinese dialects and took into account the dialect data of all administrative divisions in China, making the measurement of dialect diversity more complete. This paper measures the diversity of clan surnames, and the measurement method is introduced in the next section.
Scholars at home and abroad use two main methods to calculate a diversity index. The first is to count the number of different categories and use it directly as the diversity index. The advantage of this method is that it depends only on the categories, which do not change in a short time, so it is highly stable. However, this method may introduce some error, because it is equivalent to assuming that every category has equal weight and ignores differences in the specific situation of each category.

Data Selection and Measurement of Indicators
Therefore, the second calculation method is more common. It builds on the first by incorporating weights (usually population) into the calculation. Xu Xianxiang et al. (2015) [5] counted the number of dialects used within a region and took that number as a diversity index. In addition, considering the differences in the number of people using different dialects, they constructed a dialect diversity differentiation index for each city, calculated as follows:
$$1 - \sum_{j} S_{ji}^{2}$$
where $S_{ji}$ represents the proportion of people in city $i$ who use dialect $j$ in the total urban population.
This paper mainly follows the diversity index of Xu Xianxiang et al. (2015) [5] and the Herfindahl index, which represents the degree of concentration. The diversity index in this paper is calculated by subtracting the Herfindahl index from 1. The specific calculation is as follows:
$$diversity_i = 1 - \sum_{t=1}^{n} S_{ti}^{2}, \qquad S_{ti} = \frac{X_{ti}}{X_i}$$
where $diversity_i$ denotes the clan surname diversity index of county $i$; $X_{ti}$ denotes the number of genealogies with surname $t$ among the genealogies of county $i$; $X_i$ denotes the total number of genealogies in county $i$; $S_{ti}$ denotes the share of genealogies with surname $t$ in county $i$; and $n$ denotes the number of genealogy surnames in county $i$. The value of $diversity_i$ lies between 0 and 1. The larger the value of $diversity_i$, the higher the diversity of genealogy surnames and the stronger the cultural diversity of the county. In the robustness test, the number of genealogy surnames in the county is used as another measure of clan surname diversity.
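The two measures described above (the number of genealogy surnames and one minus the Herfindahl index of surname shares) can be illustrated with a minimal Python sketch. The function name `surname_diversity` and the sample county are hypothetical and are not taken from the paper's data or code.

```python
from collections import Counter


def surname_diversity(surnames):
    """Return (number of distinct surnames, 1 - Herfindahl index) for one county.

    `surnames` holds one entry per genealogy, giving that genealogy's surname.
    """
    counts = Counter(surnames)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("a county needs at least one genealogy")
    # Herfindahl index: sum of squared surname shares X_ti / X_i.
    herfindahl = sum((c / total) ** 2 for c in counts.values())
    return len(counts), 1.0 - herfindahl


# Hypothetical county with four genealogies: two Zhang, one Li, one Wang.
n_surnames, diversity = surname_diversity(["Zhang", "Zhang", "Li", "Wang"])
print(n_surnames, round(diversity, 3))  # 3 0.625
```

A county whose genealogies all share one surname gets an index of 0, and the index rises as genealogies are spread more evenly over more surnames.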

Data Sources
The indicators of county economic development performance are derived from night light brightness data published by the National Oceanic and Atmospheric Administration (NOAA). Since their publication, night light brightness data have been regarded as an objective indicator of the economic development performance of countries and regions and have been adopted in many empirical economic studies. Compared with GDP, the traditional indicator of regional economic development, night light brightness is recorded by satellite sensors, so the data are relatively accurate. Traditional GDP data, by contrast, are mostly compiled through manual statistical reporting and are heavily affected by human factors. Night light brightness does, however, have a limited range of values. The maximum value is 63, which means that if the night light brightness of an area is actually greater than 63, the value recorded in the data is still 63. A recorded value of 63 therefore does not necessarily reflect the true luminance. For most underdeveloped countries, however, the problem of light saturation can basically be ignored. Wang Xianbin et al. (2017) [26] also pointed out that the use of light data to measure regional economic development in China is little affected by this problem.
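The saturation problem described above is a form of top-coding at the ceiling of 63. The following Python sketch, using made-up pixel values rather than real NOAA data, shows how true brightness above the ceiling is lost and how the share of saturated pixels could be checked. The function names are hypothetical.

```python
SATURATION_VALUE = 63  # maximum recorded brightness value stated in the text


def record_brightness(true_brightness):
    """Mimic top-coding: values above the ceiling are stored as the ceiling."""
    return min(true_brightness, SATURATION_VALUE)


def share_saturated(pixels):
    """Fraction of recorded pixel values that sit at the ceiling."""
    if not pixels:
        raise ValueError("no pixels supplied")
    return sum(1 for p in pixels if p >= SATURATION_VALUE) / len(pixels)


# Hypothetical true brightness values for the pixels of one area (not real data).
true_values = [5, 12, 40, 63, 80, 120]
recorded = [record_brightness(v) for v in true_values]
print(recorded)                   # [5, 12, 40, 63, 63, 63]
print(share_saturated(recorded))  # 0.5
```

When the saturated share is small, as the text argues is the case for most counties, averages of the recorded values are close to averages of the true values.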

Measurement Methods
In this paper, the performance of county economic development is measured in two ways using night light brightness data from 2000 to 2012. One is the absolute level of a county's night light brightness. The other is a relative measure of the county's economic growth, namely the growth of its night light brightness compared with the previous year.
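The relative measure can be sketched in Python as a year-over-year percentage change. This assumes that "growth compared with the previous year" means the percentage change in brightness, which the text suggests but does not spell out as a formula. The function name and the sample series are hypothetical.

```python
def annual_growth_pct(brightness_by_year):
    """Year-over-year percentage growth of brightness, keyed by the later year."""
    years = sorted(brightness_by_year)
    growth = {}
    for prev, curr in zip(years, years[1:]):
        if curr - prev != 1:
            continue  # skip gaps: growth is only defined against the previous year
        base = brightness_by_year[prev]
        if base <= 0:
            continue  # percentage growth is undefined for a non-positive base
        growth[curr] = 100.0 * (brightness_by_year[curr] - base) / base
    return growth


# Hypothetical brightness series for one county (not real data).
series = {2000: 2.0, 2001: 2.5, 2002: 3.0}
print(annual_growth_pct(series))  # {2001: 25.0, 2002: 20.0}
```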

Major Control Variables
In addition to the cultural diversity studied in this paper, the level of regional economic development is affected by many other factors, such as natural conditions, education level and factor inputs. Based on the theory of economic development and the methods of previous studies, the control variables selected in this paper are land resources, labor resources, education level and capital investment. Cultivated land area per capita represents land resources; population per unit of land area represents labor resources; the number of primary and secondary school students per 10,000 people represents education level; and fixed asset investment per unit of GDP represents capital investment. The data come from the County Statistical Yearbook of China and the Regional Economic Statistical Yearbook of China.

Descriptive Statistics of Variables
Table 1 reports the descriptive statistics of the variables. The minimum and maximum county night light brightness are 0.028 and 62.165 respectively, which are within the value range of the lighting data, so there is no measurement deviation due to saturation of the night light data. The large differences in night light brightness indicate a large regional gap in the economic development performance of counties in China. The average annual growth of light brightness is 33.677, which indicates that the counties in China are growing as a whole. The maximum value of the clan surname diversity index is 0.983 and the average is higher than 0.5, while the number of clan surnames ranges from 1 to 110. It can be seen that clan surnames in Chinese counties are quite diverse, and that surname diversity differs considerably across counties.

Basic Model Construction
In order to study the impact of clan surname diversity on county economic development performance, this paper uses county-level night light brightness as the explained variable and county-level clan surname diversity as the explanatory variable, and estimates the model by ordinary least squares (OLS). The regression equation is constructed as follows:
$$nightlight_i = \alpha + \beta \, diversity_i + \gamma X_i + \mu_i \qquad (4.1)$$
where $i$ indexes counties; $nightlight_i$ represents the economic development performance of county $i$, measured by its night light brightness; $diversity_i$ represents the diversity of clan surnames in county $i$; $X_i$ is a group of control variables, including resources, labor, education level, capital, etc.; $\alpha$, $\beta$ and $\gamma$ are the intercept and the coefficients to be estimated; and $\mu_i$ is a random interference item.
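As a rough illustration of the OLS step, the following Python sketch estimates the specification without control variables, where the closed-form solution needs only the standard library. The data are synthetic and chosen so that the true intercept and slope are known. The function name `ols_simple` is hypothetical, and the full specification with controls would require a matrix-based estimator instead.

```python
def ols_simple(x, y):
    """Closed-form OLS for y = alpha + beta * x + error with one regressor."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length, at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("x has no variation")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    beta = sxy / sxx                 # slope: sample covariance over variance
    alpha = mean_y - beta * mean_x   # intercept: line passes through the means
    return alpha, beta


# Synthetic example (not the paper's data): brightness = 1 + 2 * diversity.
diversity = [0.0, 0.25, 0.5, 0.75, 1.0]
nightlight = [1.0, 1.5, 2.0, 2.5, 3.0]
alpha, beta = ols_simple(diversity, nightlight)
print(round(alpha, 6), round(beta, 6))  # 1.0 2.0
```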

Regression Results
Table 2 reports the estimated results of the baseline regression Equation (4.1). The regression coefficients of clan surname diversity on the county economic development performance indicator are positive, and all pass the 1% significance test. This result remains stable with or without control variables, indicating that the diversity of clan surnames has a positive impact on county economic development. The first column reports the regression with only the diversity of clan surnames. In this case, the coefficient of the clan surname diversity index is 2.939, which means that for every 1% increase in the diversity of clan surnames, county night light brightness increases by 2.939%. Columns 2-5 report the estimates when the control variables are introduced stepwise: county natural resources, labor resources, education level and capital endowment. Almost all of the coefficients are significant at the 1% level. The fifth column reports the regression after introducing all variables. In this case, the coefficient of clan surname diversity is 0.617, which is 2.322 lower than the coefficient in the first column. This shows that the control variables help remove the impact of other factors on economic development. It also means that after controlling for other factors affecting economic development, county night light brightness increases by 0.617% for every 1% increase in the diversity of clan surnames. In addition, the other control variables have a positive impact on county economic development performance, which is consistent with expectations. In a word, the results of the benchmark regression are consistent with the theoretical expectations. The diversity of clan surnames in counties has a significant positive impact on the economic development performance measured by night light brightness data. The more diverse the clan surnames in the county, the better the economic development performance.
Table 2. Benchmark regression results.
In the growth regression, $d\_light_i$ indicates the increase of night light brightness in county $i$ compared with the previous year (unit: percentage); $diversity_i$ represents the diversity index of clan surnames in county $i$; $X_i$ represents a group of control variables; and $\mu_i$ represents a random interference item.

Robustness Test
From the regression results of Table 3, we can see that the diversity of clan surnames still has a positive impact on the county economic growth, and it passes the test of 5% significance level as a whole.

Regression of GDP Indicator
The traditional index of economic development performance is gross domestic product (GDP). This paper uses the per capita GDP of the county (unit: 10,000 yuan) as the explained variable and re-estimates the basic regression Equation (4.1). The results are stable. The diversity of clan surnames promotes the economic development performance of the county.
From the regression results in Table 4, we can see that the impact of clan surname diversity on economic development measured by GDP per capita is significantly positive. The first column reports the results without control variables, where the regression coefficient of the clan surname diversity index is 0.220 and passes the 1% significance test. After adding all control variables, the regression coefficient of the clan surname diversity index is 0.238, which means that the per capita GDP of the county will increase by 23.8 yuan for every 1% increase in clan surname diversity.

Regression of the Number of Clan Surnames
In the benchmark regression, this paper uses the Herfindahl index to measure the concentration of clan surnames and takes one minus the concentration as the diversity index. In addition, following the method Xu Xianxiang et al. (2015) [5] used to measure dialect diversity, this paper uses the number of genealogy surnames in a county as another indicator of clan surname diversity and carries out a regression test. The results indicate that the annual growth of night light brightness will increase by 0.173% for each additional surname of county genealogy. In terms of per capita GDP, the regression coefficient in the sixth column is 0.004, which indicates that the per capita GDP of the county will increase by 40 yuan for each additional surname of county genealogy.
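The yuan figures quoted for the per capita GDP regressions follow from the unit of the explained variable (10,000 yuan). The short Python check below reproduces the arithmetic. It assumes that "a 1% increase in clan surname diversity" is read as a 0.01 increase in the index, which is an interpretation rather than something the text states. The function name is hypothetical.

```python
YUAN_PER_UNIT = 10_000  # per capita GDP is measured in units of 10,000 yuan


def effect_in_yuan(coefficient, change_in_regressor):
    """Convert a regression coefficient into a change in per capita GDP in yuan."""
    return coefficient * YUAN_PER_UNIT * change_in_regressor


# Diversity index coefficient 0.238, index rising by 0.01 (assumed reading of "1%").
print(round(effect_in_yuan(0.238, 0.01), 1))  # 23.8
# Number-of-surnames coefficient 0.004, one additional surname.
print(round(effect_in_yuan(0.004, 1), 1))     # 40.0
```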

Research Conclusions
The issue of economic development has always been the focus of people's attention.
How to cite this paper: Zhang, H.Y. (2019) The Influence of Clan Surname Diversity on County Economic Development Performance in China: An Empirical Study Based on Chinese Genealogy Data. Modern Economy, 10, 1073-1089. https://doi.org/10.4236/me.2019.104072
Economic development is closely related to human life. The study of economic development and of the development gap between countries and regions has always been an important topic in the economic literature.
Empirical research on the impact of cultural diversity has found both advantages and disadvantages. Cultural diversity includes the diversity brought about by different races, religions, languages and birthplaces. Most foreign scholars pay attention to the impact of ethnic and linguistic diversity on the economy. China is vast in territory and has a history of more than two thousand years. Owing to geographical distance and ethnic diversity, cultural diversity is a basic feature of Chinese culture and plays a profound role in economic development. In the specific context of China, linguistic diversity has attracted the most attention from domestic scholars. Dialect data have been used in recent years as indicators for the empirical analysis of cultural diversity (Xu Xianxiang et al., 2015; Pan Yue et al., 2017; Li Guangqin et al., 2017) [3] [5] [6].

A 2014 study [8] used the numbers of ancestral temples and genealogies to measure clans. Its empirical analysis found that clan networks narrow the income distribution gap in rural areas. Zhang (2016) [9] measured the strength of clan culture by the number of genealogies per 10,000 people in Chinese cities, and empirically tested the relationship between clan culture and private economic development. These studies analyzed the impact of clans on the economy from different angles, but they measured all clans in a region as a whole, without distinguishing how the distribution of different clans within the region affects economic development. Different clans have different family disciplines and rules and emphasize different aspects of culture. Some value moral cultivation and some value business. The impact of these clan differences on the economy is uncertain. In addition, as an organization based on consanguinity, the clan advocates mutual support and common development among its members, forming a special trust different from the general trust of society.

3) Autonomous collection of experimental data.
In the first method, the most commonly used sources are the World Values Survey (WVS), the Chinese General Social Survey (CGSS), the China Family Panel Studies (CFPS) and so on. The second method often uses immigration data to observe the economic output of immigrants from different countries and cultures within the same country. In addition, in order to cleanly eliminate the influence of self-selection of migration destination, empirical samples often use data on second-generation migrants. The third method mainly involves designing an experimental scheme, questionnaire and so on. People with different cultural backgrounds take part in the experiment, and the resulting data are used for analysis. However, experimental data cover a limited set of participants, and it is hard to reach a generally accepted conclusion on whether the results extend to society as a whole. Therefore, among the three methods, statistical survey data are the most widely used. This paper also uses statistical survey data to measure cultural diversity.
In terms of measurement content, because the connotation of culture is rich and abstract, the indicators differ, and the direction of measurement is often determined by the specific research perspective. At present, scholars at home and abroad mainly measure cultural diversity in two directions. One is to measure cultural diversity by differences in beliefs or values, starting from the essence of culture. Inglehart and Baker (2000) [19] applied factor analysis to data from the World Values Survey. They divided values into two categories, social characteristics and personal characteristics, forming a two-dimensional cultural model. This is also the official indicator for measuring cultural diversity with data from the World Values Survey. On this basis, Li Zhongfei et al. (2017) [17] regard the score of each country in the two-dimensional model as a two-dimensional vector, calculate cultural distance from these vectors, and then cluster the countries, using the number of clusters and the distances between them to represent cultural diversity.
Böheim et al. (2014) [24] and Trax et al. (2015) [25] respectively used samples of Australian and German enterprises to carry out empirical research, and found that the diversity of workers' birthplaces significantly improves the performance of enterprises and workers.
Most domestic scholars measure cultural diversity through dialects. Xu Xianxiang et al. (2015) [5] used the Dictionary of Chinese Dialects as a data source to measure dialect diversity in China. They sorted out 2356 counties and cities with complete dialect information and calculated the dialect diversity index of 278 cities at prefecture level and above. This method of measuring cultural diversity is highly representative and has been directly adopted by many scholars in empirical studies of cultural diversity (Pan Yue et al., 2017; Li Guangqin et al., 2017; Zhang Bo and Fan Chenchen, 2018) [3] [6] [14].

3.1. Clan Surname Diversity
3.1.1. Data Sources
Genealogy is a kind of cultural symbol that records the lineage and blood relationships of a clan, as well as its ancestors, celebrities and deeds. It is also the external carrier of the patriarchal system and the clan. The data used to measure the diversity of clan surnames are derived from "Chinese Genealogy Catalogue: Knowledge Service Platform of Family Genealogy in Shanghai Library". The general catalogue covers all the Chinese genealogies in the world, extracts a content summary from each genealogy, and compiles a joint catalogue of Chinese genealogies. The compilation started in 2000 and was completed in 2008. It was included and published by Shanghai Library. The collection and distribution of Chinese genealogies has also promoted the in-depth study of Chinese genealogy culture by scholars at home and abroad. Genealogy itself is an important carrier of the cultural heritage of the Chinese nation and plays a special role in uniting Chinese people at home and abroad. According to statistics from the end of 2003, Shanghai Library had collected 76,781 genealogical catalogue entries from all over the world. After deducting the roughly 30% that are duplicate copies, there are 52,401 genealogical items covering 608 surnames. So far, it has the largest number of genealogies and the most complete set of surnames in China. The genealogy catalogue is complete and detailed, including not only the name of the genealogy, the responsible person, the number of volumes, the date of the edition and similar information, but also the family name, ancestors, hall names, celebrities of past dynasties and other more detailed information.
3.1.2. Measurement Methods
Cultural diversity has rich connotations, including the diversity of language, race, religion, birth environment and other factors. This paper studies the diversity of clan surnames at the county level. The clan is rooted in Confucian culture and inherits its strict patriarchal character. The family bloodline is passed on through the family name. Genealogy is an important external carrier of clan inheritance, recording the lineage, rules, ancestors, celebrities and deeds of a clan and its consanguineous group. Genealogical surnames, which distinguish different clans and represent different kinship groups, are the external manifestation of the traditional Chinese concept of the clan. In this paper, the surname diversity index and the number of surnames in the genealogies of a county are used as the indicators of clan surname diversity in the county.
The third and fourth columns report the regression results for the influence of clan surname diversity on the annual growth of county night light brightness, and the fifth and sixth columns report the results for per capita GDP. The regression coefficient in the second column is 0.026, which means that county night light brightness will increase by 0.026 units for each additional surname of county genealogy. The regression coefficient in the fourth column is 0.173 for the annual growth of night light brightness (unit: percentage).
Studies in this field are endless. Most studies approach economic growth from knowledge, technology and other factors of production. As a broad and abstract concept, culture was seldom included in empirical economic studies in the past because it is difficult to quantify. Over the past decade, research on the impact of culture on the economy has deepened, and new progress has been made in empirical research. Many scholars at home and abroad now pay more attention to the economic impact of cultural diversity. This paper studies the economic development performance of counties in China. The clan is an active social organization at the county level, especially in areas where formal institutions are imperfect. It plays an important cultural and social function (Tsai, 2007; Ding Congming et al., 2018) [12] [15]. However, there is no empirical research on economic development from the perspective of clan surname diversity. Although the concept of the clan in China is deeply influenced by Confucianism, different clans have different historical and cultural backgrounds and cultural inheritances, which are reflected in ancestral precepts, family rules and other carriers. Because the surname of a genealogy distinguishes different clans, this paper constructs a clan surname diversity index for each county in China. Based on the Chinese Genealogy, this paper collects and calculates the number of genealogy surnames and the surname diversity index for 1087 counties in 27 provinces and 270 prefecture-level cities in China, so as to measure the diversity of clan surnames in each county. This paper studies the impact of clan surname diversity on county economic development performance. The performance of county economic development is measured by county night light brightness data from 2000 to 2012, and by traditional GDP in the robustness test. Empirical analysis shows that the diversity of clan surnames has a significant positive impact on the performance of county economic development. The benchmark regression shows that for every 1% positive change in the clan surname diversity index of a county, economic development performance changes by 2.939%. The results of this study support the strand of literature holding that cultural diversity is beneficial to economic growth. To some extent, this study provides a new perspective on the economic impact of cultural diversity.

2.1. Cultural Diversity: Definition of the Concept of Clan Surname Diversity
Obviously, understanding the connotation of cultural diversity requires first clarifying the connotation of culture. The concept of culture is very extensive and rich in connotation. It is the collective name for the various elements of human life. It has different concepts. Different groups and different societies are the basic conditions for cultural diversity, and different forms of cultural expression are the direct manifestation of cultural diversity. Every country and region in the world has its own unique culture, which is the most basic content of cultural diversity.

Table 1. Descriptive statistics of variables.

Table 3. Regression results of economic growth.

Table 4. Regression results of GDP indicator.

From the regression results in Table 5, we can see that clan surname diversity measured by the number of genealogy surnames in a county still has a positive impact on the county economy, and all coefficients pass the 5% significance test.
The first and second columns report the regression results for the influence of clan surname diversity on county night light brightness.

Table 5. Regression results of the number of clan surnames.