Determinants of the Wage Gap between Migrants and Local Urban Residents in China: 2002-2013

This study explores the determinants of the wage gaps between rural-to-urban migrants and local urban residents in China from 2002 to 2013. Using the Chinese Household Income Project (CHIP) 2002 and 2013 survey data, the study provides an analysis based on the Oaxaca-Blinder decomposition model. The estimation results indicate that individual characteristics, regional location, and the distribution differences among industries and public and private sectors were the main factors causing the wage gaps. Furthermore, the main factors causing the wage gaps between 2002 and 2013 are human capital factors, industry sectors, and gender discrimination.

In previous studies on the wage gap between migrants and local urban residents, Wang (2003) [2], Xie and Yao (2006) [8], Deng (2007) [9], Xing and Luo (2009) [10], and Ma (2011) [6] utilized the Oaxaca-Blinder model (Oaxaca, 1973 [11]; Blinder, 1973 [12]), the FFL model (Firpo, Fortin, and Lemieux, 2009 [13]), or the DFL model (DiNardo, Fortin, and Lemieux, 1996 [14]) to undertake decomposition analyses and found that both discrimination and human capital differentials affect the wage gap; however, the estimated influence of discrimination on the wage gap differs among these empirical studies. Meng (1998) [15], Meng and Zhang (2001) [1], and Ma and Li (2016) [7] analyzed occupational segregation and the wage gap between migrants and local urban residents and found that occupational discrimination is the main factor underlying the wage gap. However, these studies did not consider the sector selection bias problem in wage functions, and their analysis periods reach only the early 2000s. Thus, the recent factors contributing to wage gaps between migrants and local urban residents are not clear.
This study presents an empirical approach to answer the following questions: First, do wage gaps exist between migrants and local urban residents? Second, what factors determine the wage levels of migrants and local urban residents? Third, what factors determine the wage gaps between migrants and local urban residents? Finally, has the influence of these factors on the wage levels and wage gaps changed from 2002 to 2013?
This study contributes in the following ways. First, it investigates how unexplained differentials (i.e., discrimination) and explained differentials (e.g., those based on individual characteristics, including human capital factors) affected the wage gap between migrants and local urban residents during the 2000s (from 2002 to 2013). Second, it considers segmentation by industry sector, by public and private sector, and by region in the Chinese urban labor market. Finally, it discusses the changes in the influence that these factors have on the wage gap from 2002 to 2013. The results offer new evidence on the wage-gap issue.
This paper is structured as follows. Part 2 describes the analysis methods, including an introduction to the models and data. Part 3 presents the descriptive analysis results, and Part 4 states the quantitative analysis results. Part 5 presents the main conclusions.

Model
First, to explore the wage gaps between migrants and local urban residents, estimations based on OLS (ordinary least squares) and quantile regression models (Koenker and Bassett, 1978) [16] are utilized. The OLS analysis is expressed by Equation (1.1):

$$\ln Wage_i = a + \beta_m Mig_i + \beta_X X_i + u_i \tag{1.1}$$

In Equation (1.1), i represents the individual (a migrant or a local urban resident), lnWage is the logarithm of the average wage, Mig is the migrant dummy, X represents the other factors (e.g., education, years of experience, industries, occupations) that affect the wage, and u is a random error term. When the coefficient of the migrant dummy ($\beta_m$) is negative and statistically significant, the average wage level is lower for migrants than for local urban residents, holding the other factors constant.
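As an illustration of the wage regression with a migrant dummy described above, the following minimal sketch fits the model by ordinary least squares. It is not the authors' estimation code: the data are synthetic and noise-free, the helper names (`solve`, `ols`) and the single control variable (years of education) are assumptions made for the example, and the quantile regression is not shown.

```python
# Minimal OLS sketch for ln(Wage) = a + b_m * Mig + b_x * X + u.
# All data below are synthetic (hypothetical), not CHIP survey values.

def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(A)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [a - f * c for a, c in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]


def ols(X, y):
    """Return OLS coefficients from the normal equations (X'X) b = X'y."""
    k = len(X[0])
    xtx = [[sum(r[i] * r[j] for r in X) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(X, y)) for i in range(k)]
    return solve(xtx, xty)


# (migrant dummy, years of education) for six hypothetical workers
workers = [(0, 9), (0, 12), (0, 16), (1, 6), (1, 9), (1, 12)]
X = [[1.0, float(m), float(e)] for m, e in workers]  # constant, Mig, education
# Noise-free log wages generated with a migrant coefficient of -0.6
y = [1.5 - 0.6 * m + 0.05 * e for m, e in workers]

a_hat, beta_m_hat, beta_x_hat = ols(X, y)
print(round(beta_m_hat, 3))  # a negative value means lower wages for migrants
```

Because the synthetic wages contain no error term, the estimated migrant coefficient reproduces the value used to generate the data; with survey data the estimate would carry sampling error and would need a significance test.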
Considering the selection bias problem in Equation (1.1), a selection-bias-corrected wage function model is used (Maddala, 1983) [17]. Equation (1.2) expresses the probability of entry into an industry sector using a multinomial logit model. For example, the probability of becoming a worker in a given industry sector is expressed as $\Pr(Y_i^* = 1)$, and the probability of the alternative is expressed as $\Pr(Y_i^* = 0)$:

$$\Pr(Y_i^* = 1) = \Phi(\gamma_H H_i + \gamma_Z Z_i) \tag{1.2}$$

H represents factors identical to those expressed in Equation (1.1), including Mig and X, and Z is used as an identification variable (age and age-squared are used as identification variables in this study). Using the distribution function $\Phi(\cdot)$ and the density function $\phi(\cdot)$ estimated by the probit regression model, the selectivity items $\lambda_i$, formed from the ratio $\phi(\cdot)/\Phi(\cdot)$, are calculated. The corrected wage functions expressed by Equation (1.3) can be estimated using these selectivity items:

$$\ln Wage_i = a + \beta_m Mig_i + \beta_X X_i + \beta_\lambda \lambda_i + u_i \tag{1.3}$$

Second, to estimate what factors determine the average wage levels of migrants and local urban residents, the selection-bias-corrected wage functions are estimated separately for migrants and local urban residents. Finally, to estimate which factors determine the wage gap between migrants and local urban residents, the Oaxaca-Blinder decomposition model is utilized. Based on the wage functions, the Oaxaca-Blinder decomposition model can be derived as Equations (3.1) and (3.2).

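$$\ln \bar{W}_u - \ln \bar{W}_{rm} = \beta_u(\bar{X}_u - \bar{X}_{rm}) + (\beta_u - \beta_{rm})\bar{X}_{rm} \tag{3.1}$$

$$\ln \bar{W}_u - \ln \bar{W}_{rm} = \beta_{rm}(\bar{X}_u - \bar{X}_{rm}) + (\beta_u - \beta_{rm})\bar{X}_u \tag{3.2}$$

In Equations (3.1) and (3.2), u represents local urban residents, rm represents rural migrants, $\ln \bar{W}$ is the logarithm of the average wage, $\bar{X}$ is the vector of average values of the variables, and $\beta_u$ and $\beta_{rm}$ represent the estimated coefficients resulting from the wage functions of local urban residents and rural migrants, respectively. The first term on the right-hand side is the explained differential, and the second term is the unexplained differential.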
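The decomposition can be sketched numerically. The example below uses the standard two-fold form with the urban coefficients as the reference: explained = β_u·(X̄_u − X̄_rm) and unexplained = (β_u − β_rm)·X̄_rm. It is a minimal illustration under stated assumptions, not the authors' procedure: the means and coefficients are hypothetical, the function name `oaxaca_blinder` is invented for the example, and the selectivity terms are omitted.

```python
# Two-fold Oaxaca-Blinder decomposition evaluated at group means.
# Index 0 of every vector is the constant term (its mean is 1).

def dot(a, b):
    """Inner product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


def oaxaca_blinder(xbar_u, xbar_rm, beta_u, beta_rm):
    """Decompose the mean log-wage gap using urban coefficients as reference."""
    gap = dot(beta_u, xbar_u) - dot(beta_rm, xbar_rm)
    explained = dot(beta_u, [xu - xr for xu, xr in zip(xbar_u, xbar_rm)])
    unexplained = dot([bu - br for bu, br in zip(beta_u, beta_rm)], xbar_rm)
    return gap, explained, unexplained


# Hypothetical means and coefficients: constant, years of education, experience
xbar_u, xbar_rm = [1.0, 12.0, 20.0], [1.0, 9.0, 12.0]
beta_u, beta_rm = [0.50, 0.06, 0.015], [0.40, 0.04, 0.010]

gap, explained, unexplained = oaxaca_blinder(xbar_u, xbar_rm, beta_u, beta_rm)
share_explained = 100.0 * explained / gap      # contribution in percent
share_unexplained = 100.0 * unexplained / gap
print(gap, explained, unexplained)
```

The two components always sum to the total gap, so the percentage contributions sum to 100. A contribution above 100% for one component, as reported in some of the estimations later in the paper, implies that the other component is negative.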

Data
The survey data of CHIPs 2002 and CHIPs 2013 are used for the analysis. These data were obtained from the two waves of the Chinese Household Income Project survey conducted by the NBS, the Economic Institute of the Chinese Academy of Social Sciences (CASS), and Beijing Normal University (BNU) in 2003 and 2014, and they include information on the individual characteristics, industries, workplace ownership types, and wages of migrants and local urban residents. Because the questionnaire designs are similar, the same information can be used for the analysis of the two periods.

Variables Setting
The wage is defined as the total earnings from work (called "the total wage"). Here, it comprises the basic wage, bonus, cash subsidy, and non-cash subsidy. The CPI in 2002 is utilized as the standard to adjust the nominal wage in 2013.
The analytic objects of this study are workers, excluding the unemployed. To reduce the effect of the retirement system implemented in the public sector (the state-owned enterprises (SOEs) and government organizations) on the analysis results, the sample is limited to those between the ages of 16 and 60. Samples with no answer, abnormal values, or missing values are deleted.
The dependent variables are set as follows. First, to correct the sample selection bias arising from the probability of entry into the monopoly or the competitive industry sectors, the probability function of entry into industries is estimated. In this probability function, the dependent variable is a categorical variable. To maintain the analysis samples in each industry category and to reflect the features of the industry distribution of migrants, the industrial categories in the CHIPs are reclassified. Five kinds of industries (construction, manufacturing, retail and wholesale, service, and other industries) are utilized to construct the categorical variable.
Second, in the wage function, the dependent variable is the logarithm of the wage rate. The wage rates are calculated based on the total wage and work hours. The CHIP survey data for local urban residents include those who were re-employed as non-regular workers after the employment adjustment of state-owned enterprises. The total wage in those samples is the total value of base salary, bonuses, and goods valued in monetary terms, excluding layoff living assistance, minimum income assistance, living assistance from firms, income from assets and financial holdings, and security transfer income. Yearly work hours for local urban residents are calculated as "work hours daily × work days monthly × work months yearly", and monthly work hours for migrants are calculated as "work hours daily × work days weekly × 4". The wage rate is calculated as the total wage divided by work hours.
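The construction of the dependent variable can be sketched as follows. This is a minimal illustration under stated assumptions: the function names, the CPI index values, and the respondents' figures are hypothetical, and the wage is assumed to cover the same period as the work hours it is divided by (yearly for local urban residents, monthly for migrants).

```python
import math


def real_wage(nominal_wage, cpi_base, cpi_current):
    """Express a nominal wage in base-year (2002) prices."""
    return nominal_wage * cpi_base / cpi_current


def annual_hours(hours_per_day, days_per_month, months_per_year):
    """Yearly work hours, as computed for local urban residents."""
    return hours_per_day * days_per_month * months_per_year


def monthly_hours(hours_per_day, days_per_week):
    """Monthly work hours, as computed for migrants (4 weeks per month)."""
    return hours_per_day * days_per_week * 4


def log_wage_rate(total_wage, work_hours):
    """Logarithm of the hourly wage rate (total wage / work hours)."""
    return math.log(total_wage / work_hours)


# Hypothetical respondents (not CHIP records)
urban = log_wage_rate(24000.0, annual_hours(8, 22, 12))   # yearly wage, yearly hours
migrant = log_wage_rate(1500.0, monthly_hours(10, 6))     # monthly wage, monthly hours

# Hypothetical CPI index values: 100 in the base year, 130 in the later year
deflated = real_wage(1500.0, 100.0, 130.0)
print(urban, migrant, deflated)
```

Dividing by hours rather than comparing monthly earnings matters here because migrants and local urban residents may work different numbers of hours, so an earnings gap and an hourly wage-rate gap can differ.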
The independent variables are those likely to affect the wage; they are constructed as follows.
First, education (primary school or below, junior high school, senior high school/vocational school, college and above), years of experience, age, and health status (very good, good, fair, bad) are used as indices of human capital. It is thought that these factors might affect the wage level.
Second, it is thought that political membership may affect wage levels. A party membership dummy is used in the analysis.
Third, considering that gender, marital status, and ethnicity might affect wage levels, dummy variables for these are utilized.
Finally, because economic development levels differ across regions and labor markets differ by region, dummy variables for the East, Central, and West regions are used to control for these influences.

Individual Characteristics Differentials by Migrants and Local Urban Residents in 2002 and 2013
The descriptive statistics are shown in Table 1. The differentials in individual characteristics between migrants and local urban residents are as follows. First, the logarithms of the average wage rates are higher for local urban residents than for migrants in both 2002 and 2013. For local urban residents, they are 1.524 in 2002 and 2.608 in 2013, and for migrants, they are 0.861 in 2002 and 2.442 in 2013. In addition, the wage gaps between migrants and local urban residents narrowed from 2002 (−0.663) to 2013 (−0.166).
Second, the individual characteristic differentials between migrants and local urban residents show that years of experience are greater and ages are higher for local urban residents than for migrants in both 2002 and 2013. These results are consistent with the phenomenon that most of the younger labor force with rural registrations is moving to and working in urban areas. However, the differentials in years of experience between these two groups narrowed from 2002 to 2013.
Third, although in both 2002 and 2013 the proportion of workers with higher education (such as senior high school and college/university) is smaller for the migrant group, the proportion of migrant workers who have graduated from senior high school rises from 17.7% (2002) to 22.4% (2013), while the proportion who have graduated from college or university rises from 2.3% (2002) to 12.0% (2013). These results show that education differentials between local urban residents and migrants changed greatly from 2002 to 2013.
Fourth, in both 2002 and 2013, the proportion of Communist Party members is greater for local urban residents (29.3% in 2002, 20.8% in 2013) than for migrants (3.3% in 2002, 4.3% in 2013).
Fifth, in both 2002 and 2013, the proportion of respondents who answered that their health status is "very good" is greater for migrants than for local urban residents. However, the health differentials between these two groups decreased from 2002 to 2013.
Sixth, in both 2002 and 2013, there are differences in proportions in terms of gender and ethnicity between migrants and local urban residents, though these are smaller than for other factors.
Seventh, in both 2002 and 2013, although most migrants work in the retail/catering and service industry sectors, the industry distribution differentials between migrants and local urban residents become smaller from 2002 to 2013. Moreover, the proportion of workers in the private sector rises greatly for both migrants and local urban residents. For example, the proportion rises from 13.8% (2002) to 32.3% (2013) for local urban residents, and it rises from 11.6% (2002) to 39.1% (2013) for migrants. These results reveal that, along with the decrease in the share of workers in the public sector, the private sector absorbed more workers (both migrants and local urban residents) from 2002 to 2013.

Wage Distributions by Migrants and Local Urban Residents in 2002 and 2013
The estimated kernel density of the logarithm of the wage rate is shown in Figure 1. The main findings are as follows. The density of high-wage groups is greater for local urban residents than for migrants in both 2002 and 2013, which indicates that the proportion of workers with a high wage is greater for local urban residents than for migrants. It can be concluded that the majority of high-wage workers during the 2000s were local urban residents. Additionally, the proportion of individuals with low wages is also greater for local urban residents than for migrants. The within-group dispersion of the wage rate is thus greater for local urban residents than for migrants.
Moreover, a comparison of the estimated kernel density curves shows that the differentials between migrants and local urban residents narrowed from 2002 to 2013. This indicates that, along with the transition to a market economy system from 2002 to 2013, the wage gaps between these two groups decreased.
Table 2 shows the summary statistics of the logarithm of the wage rate. The mean values and the 25th, 50th, and 75th percentiles of wages are all higher for local urban residents than for migrants in both 2002 and 2013. Although these tabulation results indicate that the individual characteristics and the wage density distributions differ between migrants and local urban residents, and that wage gaps exist between the two groups, the factors that might affect the probabilities of entry into industry sectors and the wage level differentials have not been controlled for in these results. An econometric analysis is thus conducted as follows.
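The kernel density curves discussed in this section can be illustrated with a small sketch. The paper does not state which kernel or bandwidth was used, so the Gaussian kernel and the normal-reference rule-of-thumb bandwidth below are assumptions, as are the function names and the sample values.

```python
import math
import statistics


def gaussian_kde(sample, bandwidth):
    """Return a function estimating the density of `sample` with a Gaussian kernel."""
    n = len(sample)
    norm = 1.0 / (n * bandwidth * math.sqrt(2.0 * math.pi))

    def density(x):
        return norm * sum(
            math.exp(-0.5 * ((x - xi) / bandwidth) ** 2) for xi in sample
        )

    return density


def rule_of_thumb_bandwidth(sample):
    """Normal-reference rule of thumb: 1.06 * sd * n^(-1/5) (an assumed choice)."""
    return 1.06 * statistics.stdev(sample) * len(sample) ** (-0.2)


# Hypothetical log wage rates, not CHIP data
log_wages = [0.2, 0.6, 0.9, 1.1, 1.4, 1.9]
h = rule_of_thumb_bandwidth(log_wages)
f = gaussian_kde(log_wages, h)

# Evaluate the curve on a grid and check that it integrates to about 1
lo, hi, steps = min(log_wages) - 5 * h, max(log_wages) + 5 * h, 2000
dx = (hi - lo) / steps
area = sum(f(lo + (i + 0.5) * dx) for i in range(steps)) * dx
print(area)
```

Estimating one such curve per group and year, and plotting them on a common axis, gives the kind of comparison described for Figure 1: a group whose curve lies higher in both tails has the greater within-group dispersion.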

Do Wage Gaps Exist between Migrants and Local Urban Residents?
Table 3 presents the estimated wage gaps between migrants and local urban residents. First, the wage gap based on model 1 is estimated as −0.654 in 2002 and −0.166 in 2013. That is, when the other factors are not controlled, the average wage level for migrants is approximately 65.4% lower in 2002 and 16.6% lower in 2013 than that for local urban residents. When the main human capital factors (work experience and education) are held constant (model 2 and model 3), the wage gaps change to −0.481 in 2002 and 0.055 in 2013. These results indicate that education is the main factor that causes the wage gap.
Second, when workplace ownership is held constant, the wage gap decreases to −0.192 (model 9) in 2002. This result suggests that workplace ownership is also a main determinant of the wage gap.
Finally, compared to 2002, the wage gap is smaller in 2013. In particular, when education is held constant, the average wage level for migrants is higher than that for local urban residents. These results can be explained as follows: along with economic development, surplus labor decreased and labor productivity increased in the rural region; thus, the wage level of migrants, which is mainly determined by the subsistence level in the rural region, increased. Consequently, the wage gaps between migrants and local urban residents decreased from 2002 to 2013.
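A note on reading these coefficients: because the dependent variable is a logarithm, a dummy coefficient measures the gap in log points, which only approximates a percentage difference when the gap is small. The short sketch below converts the model 1 estimates quoted above (−0.654 and −0.166) with the usual exp(β) − 1 transformation; the function name is hypothetical, and the conversion is offered as an interpretive aid rather than as part of the original analysis.

```python
import math


def log_gap_to_percent(log_gap):
    """Exact percentage difference implied by a gap in log wages."""
    return 100.0 * (math.exp(log_gap) - 1.0)


# Model 1 migrant coefficients reported in the text for 2002 and 2013
for year, coef in [(2002, -0.654), (2013, -0.166)]:
    print(year, round(log_gap_to_percent(coef), 1))
```

Under this transformation the 2002 gap corresponds to roughly 48% lower wages for migrants and the 2013 gap to roughly 15% lower, so the narrowing between the two years is visible on either scale.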

What Factors Determine the Wage Levels of Migrants and Urban Local Residents?
Which factors determine the wage levels of migrants and local urban residents? To answer this question, wage functions are estimated; the results are shown in Table 3.
First, the Maddala model is utilized to adjust for the sample selection bias caused by the choice of entry into an industry. In both 2002 and 2013, the correction terms are statistically significant for the local urban residents group, and the coefficients of these correction terms are all negative. The results for the local urban residents group will thus be overestimated when these selection biases are not adjusted.
Second, U-shaped curves can be constructed from the estimated results for years of experience and their squared values, with turning points at 20 years for migrants and 46 years for local urban residents in 2002, and at 13 years for migrants and 23 years for local urban residents in 2013. Thus, while work experience affected wage levels for both migrants and local urban residents, the effect was greater for local urban residents than for migrants in both 2002 and 2013.
Third, in 2002, wages for the low-level education group (primary school) were lower than those of the mid-level education group (junior high school), and wages for the high-level education groups (senior high school, college/university) were higher, for both migrants and local urban residents. However, in 2013, wages for the low-level education group were higher than those of the mid-level education group for migrants. In addition, if other factors are constant, the wage differentials between the different education groups are statistically insignificant. These results show that the returns on education have recently risen for low-skill migrants.
Fourth, in 2002, there are gender wage gaps for both migrants and local urban residents, and the gap is larger for migrants (−0.518) than for local urban residents (−0.115). However, in 2013, although the gender wage gap persisted for local urban residents (−0.178), it was not statistically significant for migrants. These results indicate that if other factors remain constant, gender wage gaps increased for local urban residents (the female dummy coefficient changes from −0.115 to −0.178) but decreased for migrants.
Fifth, considering segmentation by workplace ownership type in 2002, for migrants, public-sector wages were lower than wages in the private sector or the self-employed sector, whereas for local urban residents, wages were lower for private-sector or self-employed workers. In addition, although wages for private-sector or self-employed workers remained lower for local urban residents in 2013, the differentials decreased from 2002 to 2013. Moreover, wage differentials between sectors were not statistically significant for migrants in 2013. These results indicate that wage gaps by workplace ownership type decreased recently, all other factors remaining the same.
Sixth, we consider industry-level segmentation. Both migrant and local urban resident workers earned more in the construction industry than in the manufacturing industry in 2002 and 2013. Moreover, wages for both types of workers were lower for those in the retail/catering and service industries in 2002. However, while the results for the retail/catering and service industries remained consistent with the results in 2002 for local urban residents, the coefficients increased and were not statistically significant for migrants in 2013. Thus, all other factors constant, the industrial wage gaps declined for migrants and increased for local urban residents.
Finally, in terms of regional location, both migrant and local urban resident workers in the central region earned less than those in the eastern and western regions in 2002 and 2013. These results show that regional wage disparities persisted from 2002 to 2013 for these two groups.
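The turning points of the experience-wage profile quoted in this section follow directly from the quadratic specification. As a worked sketch, let b_1 and b_2 denote the coefficients on years of experience and its square; these symbols and the numerical values are introduced here for illustration and are not taken from the paper's tables.

```latex
\ln Wage = \dots + b_1\,Exp + b_2\,Exp^2
\quad\Longrightarrow\quad
\frac{\partial \ln Wage}{\partial Exp} = b_1 + 2 b_2\,Exp = 0
\quad\Longrightarrow\quad
Exp^{*} = -\frac{b_1}{2 b_2}.
% Hypothetical values: b_1 = 0.04 and b_2 = -0.001 give Exp* = 20 years.
```

The sign of b_2 determines the shape of the profile: a negative b_2 makes the turning point a maximum, and a positive b_2 makes it a minimum.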

What Factors Determine the Wage Gaps between Migrants and Local Urban Residents?
Which factors determine the wage gap between migrants and local urban residents? The Oaxaca-Blinder model is utilized to decompose the factors that contribute to the wage gap based on the estimated results shown in Table 4 and Table 5. Table 6 and Table 7 report the decomposition results using the human capital factors and labor market segmentation (industry, workplace ownership, and region), respectively. The main findings are as follows. First, in both 2002 and 2013, the main factor causing the wage gap is the contribution of the explained differentials. These contributions are 63.1% (Estimation 1) and 105.1% (Estimation 2) in 2002, and 125.4% (Estimation 1) and 129.7% (Estimation 2) in 2013. The results of Estimation 1 are consistent with Wang (2005) [3] and Maurer-Fazio and Dinh (2004) [27], and the results of Estimation 2 are consistent with Xing (2008). In addition, if labor market segmentation factors are considered, the contribution of explained differentials becomes greater in both 2002 and 2013. These results indicate that the individual characteristic differentials and the different distributions among industry sectors and regions, as well as between the public and private sectors, are the main factors affecting the wage gap between migrants and local urban residents. It is thought that the different distributions among sectors might be caused by discrimination when migrants enter the monopoly industry sectors or the public sector. It can therefore be said that institutional labor market segmentation is one of the main factors affecting the wage gap.
The results indicate that industry, ownership, and education are the main factors causing the wage gaps in the 2002-2013 period.
Finally, considering in detail the factors underlying the unexplained differentials, Table 7 (Estimation 2) shows that experience (29.1%) and industry (22.9%) are the main causes of the wage gap in 2002, whereas in 2013, the influences of experience (26.9%) and gender (31.1%) are the greatest. These results indicate that discrimination within the industry sector, gender discrimination, and discrimination based on a seniority wage system are the main factors affecting the wage gap between migrants and local urban residents.

Conclusions
This study explores which factors determine the wage gap between rural-to-urban migrants and local urban residents in China from 2002 to 2013. Using the Chinese Household Income Project Surveys (CHIPs) for 2002 and 2013, the Oaxaca-Blinder model is utilized for a decomposition analysis of the wage gap. Several major conclusions emerge. First, when compared with unexplained differentials, the influence of explained differentials is greater in both 2002 and 2013. In addition, if labor market segmentation factors are considered, the contribution of explained differentials becomes greater for both 2002 and 2013. These results indicate that the individual characteristic differentials and the different distributions among industry sectors and regions, as well as between the public and private sectors, are the main factors affecting the wage gap between migrants and local urban residents. Second, considering the components of the explained differentials, the influences of workplace ownership types, education levels, and industry sectors are the greatest in both 2002 and 2013. This indicates that human capital differentials and sector segmentation are the main factors causing the wage gap in the 2002 to 2013 period.
Third, considering the components of the unexplained differentials, the influences of industry sectors, gender, and years of work experience are the greatest in both 2002 and 2013. This shows that discrimination within the same industry sector, gender discrimination, and discrimination based on a seniority wage system are the main factors causing the wage gap.
These findings indicate that to reduce wage gaps between migrants and local urban residents, employment equality laws and an equal pay for equal work policy are immediate priorities. Policies that aim to reduce human capital differentials between these two groups, such as those in education and years of experience, should also be implemented in the long term. Moreover, in order to address the labor market segmentation problems fundamentally, advancing the transition from the planned economy to a market economy is an important long-term issue for the Chinese government.
Note: In the decomposition, the explained differential is the gap resulting from differences between the migrant and local urban resident groups in individual characteristics, including human capital (e.g., education, years of experience) and the industry in which the individual works; the unexplained differential is the remaining gap, including discrimination.
For example, the mean values for local urban residents in 2002 and 2013 (1.516 and 2.605, respectively) are higher than those for migrants (0.855 in 2002 and 2.434 in 2013). Moreover, the maximum values of the logarithm of the wage rate are all higher for local urban residents than those for migrants. Similarly, the minimum values of the logarithm of the wage rate are all lower for local urban residents than those for migrants in both 2002 and 2013. For example, the maximum values of the logarithm of the wage rate for local urban residents in 2002 and 2013 (4.504 and 6.543, respectively) are higher than those for migrants (4.241 in 2002, 4.956 in 2013). In the same manner, the minimum values of the logarithm of the wage rate for local urban residents in 2002 and 2013 (−1.515 and −2.092, respectively) are lower than those for migrants (−1.253 in 2002, −1.030 in 2013). Thus, the standard deviation is greater for local urban residents (0.721 in 2002 and 0.792 in 2013) than for migrants (0.684 in 2002 and 0.729 in 2013). These results are consistent with the kernel density estimation results.

Figure 1.Kernel density estimates of the logarithm of wage rate distributions by migrants and local urban residents in 2002 and 2013.

Table 1. Differentials of variable distributions and mean values between migrants and local urban residents in 2002 and 2013.
Finally, in both 2002 and 2013, most local urban residents work in the public sector (66.7% in 2002, 40.7% in 2013), whereas the proportion of self-employed workers is greater for migrants (73.0% in 2002, 44.4% in 2013).

Table 2. Statistical values of the logarithm of the wage rate by migrants and local urban residents in 2002 and 2013.
Source: Calculated based on CHIPs 2002 and CHIPs 2013.

Table 3. Results of wage gaps between migrants and local urban residents.
Source: Calculated based on CHIPs 2002 and CHIPs 2013. Note: Samples limited to ages 16-60.