Spatial Heterogeneity Analysis of PM2.5 Concentration in Central Plains Economic Region

The research of the spatial heterogeneity of PM2.5 concentration in an area, is of great significance for understanding its regional spatial distribution structure, exploring the transmission relationship between regions, in order to for-mulate joint prevention and control measures within the entire area. Based on the daily monitoring data of PM2.5 concentration in the Central Plains Economic Region in 2019, this paper utilizes cluster analysis to divide the regional PM2.5 concentration into 5 classes, builds their spatial semi-variogram model, and then utilizes interpolation analysis method to study the regional overall distribution characteristics and transmission law. The results show that the PM2.5 concentration in the Central Plains Economic Region has a medium or higher spatial autocorrelation. The critical value of the overall PM2.5 concentration in the area is 150 μg/m 3 , as the overall PM2.5 concentration less than the value, the PM2.5 in a region mainly comes from local emissions, as the overall PM2.5 concentration higher than the value, the influence of spatial structure on the distribution of PM2.5 concentration is gradually obvious. PM2.5 has a certain degree of spatial transmission, which mainly includes two routes as Puyang-Xingtai and Puyang-Zhengzhou, and the transmission intensity of the former is greater than the latter.


Introduction
With the continuous development of China's economy, the rapid urbanization and industrialization, the increase of energy and natural resources consumption, a large amount of harmful substances have been discharging into the atmos-phere. Air pollution is becoming more and more seriously, the problem of air pollution control has been becoming one of the main problems to be solved in the sustainable development of China's economy and satisfying people's yearning for a better life (Liu et al., 2017;Ma et al., 2016;Lin & Wang, 2016).
The components of atmospheric pollutants in China mainly include PM2.5, PM10, SO 2 , NO 2 , CO and O 3 . The statistical results show that PM2.5 is the primary pollutant in most regions in pollution weather, which causes more than 50% pollution days of the total pollution weather. PM2.5 is atmospheric particulate matter with aerodynamic equivalent diameter less than 2.5 microns, mainly composed of water-soluble ions, carbon components and chemical elements, all of which have the characteristics of small diameter, large surface area, strong activity, strong adsorb ability for toxic and harmful substances (for example, heavy metals, microorganisms, etc.), long residence time in the atmosphere and long transport distance (Yuan et al., 2018;Bie et al., 2018;Zhang et al., 2019b;Zhang et al., 2019a).
In recent years, Many researchers have focused on the components analysis and diffusion model analysis, and the spatial distribution characteristics analysis in multi-scale of geoscience vision, as well as the influencing factors analysis, have achieved a lot of useful results (Yuan et al., 2018;Bie et al., 2018;Zhang et al., 2019b;Zhang et al., 2019a;Chen et al., 2017;Wen et al., 2018;Zhao et al., 2017;Zhang et al., 2019c;Chen et al., 2019;Li et al., 2019;Liao et al., 2016).
Although the amount of PM2.5 emissions is the main factor determining regional concentration, the regional topographic conditions, wind direction, wind speed, precipitation and temperature have some impacts on the its spread, regional transmission and transport is one of its important features, the concentration in a region is affected by the concentrations of its adjacent regions, and there is some certain correlations between regions (Zhao et al., 2017;Zhang, Meng, 2019b).
However, for the limited number of monitoring stations with an uneven distribution and the short of observation time, there are certain limitations in studying its temporal and spatial distribution characteristics and changing trends in a large-scale range. Therefore, based on the daily monitoring data of PM2.5 concentration in the Central Plains Economic region in 2019, this article uses the semi-variogram model to analyze the spatial heterogeneity of PM2.5 concentration, study the characteristics of its spatial variation and explore its transmission relationship in the area, in order to provide a scientific reference for the formulation of its joint prevention and control measures in the Central Plains Economic Region.

Research Area Overview and Data
The Central Plains Economic region (31˚23'N -37˚49'N, 110˚21'E -118˚11'E) with a total area of 280,000 km 2 , located in the central part of China, is a key development area in the planning of China's main functional zone, covering the whole province of Henan, the northwest part of Anhui Province (including Huaibei, Suzhou, Fuyang, Bozhou, Bengbu, and a part of Huainan), the southwest part of Shangdong Province(including Heze, Liaocheng, and a part of Tai'an), the south part of Hebei Province (including Handan and Xingtai), the southeast part of Shanxi Province (including Jincheng, Changzhi, and Yuncheng). The west and south sides are distributed by Funiu Mountain, Tongbai Mountain and Dabie Mountain along the boundary in a semi ring shape, the middle and east parts are the Huanghuaihai alluvial plain. Topographically, the west of the area is higher and lower in the east, higher in the south and lower in the north (see Figure 1).
The data used in the study mainly includes a 30 m × 30 m digital elevation model, which comes from Geospatial Data Cloud (http://www.gscloud.cn), the regional PM2.5 concentration data comes from China Air Quality Online Monitoring and Analysis Platform (https://www.aqistudy.cn/historydata/).

The Spatial Heterogeneity of PM2.5 Concentration
PM2.5 concentration ( ) ( ) , Z P Z x y = is a regionalized variable, which is different from an ordinary random variable. The value of an ordinary random variable conforms to a certain probability distribution, while a regionalized random variable takes a value according to its location in a region. In other words, the regionalized random variable is the specific value of the ordinary random variable at a certain time for a specific location. It is a random function based on the position, an ordinary random variable in one place (Chen et al., 2019;Li et al., 2019;Liao et al., 2016).
PM2.5 concentration has two significant characteristics of randomness and structure. Firstly, it is a random function, has locality, randomness and abnormality. Secondly, it has average structural properties, to some extent reflects the spatial correlation characteristics of the PM2.5 concentration ( ) 0 Z P and the PM2.5 concentration ( ) 1 Z P , where 1 P is h away from 0 P , these spatial correlations depends on the distance h between the two points and the characteristics of the PM2.5 concentration itself.
The correlation coefficient between two points is The research PM2.5 results with the confidence level of 0.95%, were shown. The PM2.5 concentration between adjacent regions in the Central Plains Economic Region presents a significant correlation characteristic, with the correlation coefficient of (Zhang et al., 2019a). There was a significant spatial autocorrelation characteristic showing that Xinxiang and Zhengzhou are the center of HH aggregation region, and the aggregation degree tends to be significant with the increase of the average PM2.5 concentration (Zhang et al., 2019b).
In order to study the overall diffusion and transport characteristics of PM2.5 in an area, explore the spatial variation of PM2.5 concentration at different distances, the distribution of the spatial variation at a specific distance, as well as the relationship between PM2.5 concentration distribution and distance, defines the semi-variogram as a half of the sum of covariance of the observation point

The Spatial Distribution Characteristics and Interpolation Model of PM2.5 Concentration
The regional PM2.5 concentration has a significant spatial correlation. In order to study the spatial variation and spatial heterogeneity at different distances, fit

Statistical Characteristics of PM2.5 Concentration
The daily monitoring data of PM2.5 concentration presents the average level of PM2.5 emissions around the monitoring stations and the diffusion from surrounding regions, which is affected by many factors such as daily emissions, meteorological conditions and etc., with a certain degree of randomness. for accurate research its spatial distribution and variation characteristics in the area, carries K-mean cluster analysis on the data, using the monitoring stations as the variables, the Euclidean distance between days as the criterion function, and the average concentration of PM2.5 corresponding to the first 5 levels of air quality, defined in the "Ambient Air Quality Index (AQI) Technical Regulations" (HJ 633-2012), as the starting clustering center vector, to classify the days into 5 classes, respectively corresponding to PM2.5 concentration of 5 levels, as Good, Moderate, lightly polluted, moderately polluted and heavily polluted. Calculate K. G. Zhang et al. their corresponding class centers as the basic data of variation characteristics research, and their spatial distribution is shown in the Figure 2. The spatial distributions of 5 classes show some non-uniformity and great difference, the high concentration regions show a zonal distribution, mainly located in the northern and central areas. The high concentration regions also have relatively high population densities, and relatively developed economy and relatively large proportion of total industrial output value, this is one of the main factors that lead to significantly higher concentration of PM2.5 in these areas than that in other areas.

Analysis of the Spatial Heterogeneity of PM2.5 Concentration
In order to quantitatively analyze the spatial heterogeneity characteristics of PM2.5 concentration at different levels, the semi-variogram model is used to calculate the digital characteristics of the spatial heterogeneity function of each level, the results are showed in Table 1.
The variation coefficient (Li et al., 2019) increases with the increase of PM2.5 concentration, which means that with the increase of overall regional concentration, the spatial variability gradually strengthens. The nugget value (C 0 ) represents the spatial heterogeneity caused by random factors, its variation range is between 2.00 and 549.00, as the overall concentration increases in the area, it exhibits a large fluctuation, and the value in level III is the minimum.
The partial sill value (C) represents the spatial heterogeneity of PM2.5 concentration caused by the spatial structure, and its variation range is 4.19 -2175, as the overall concentration increases, it shows a rapid growth trend.
The sill value ( 0 C C + ), is the sum of the nugget value and the partial sill value, which describes the total spatial heterogeneity, its variation is similar to that of the nugget value, and the variation range has been expanded to 6.85 -2724.
The nugget-sill ratio ( indicates the proportion of the spatial heterogeneity caused by random factors to the overall spatial heterogeneity, shows a relatively large fluctuation, ranging from 3% to 39%. This means that the spatial heterogeneity of PM2.5 concentration presents some certain nugget effect. As PM2.5 concentration is in level I, the proportion caused by random factors in the area is the maximum; as PM2.5 concentration is in level III, the proportion is the minimum, in other words, the spatial heterogeneity caused by the regional spatial structure is the maximum. In terms of correlation degree (Chen et al., 2019;Liao et al., 2016), levels II, III, and V have strong spatial autocorrelation, and levels 1 and 4 have moderate spatial correlation. In general, the fluctuation of the nugget-sill ratio is mainly affected by the changes of nuggets. Therefore, when studying the spatial heterogeneity of PM2.5 concentration, the influence of random factors must be considered.
The range indicates the variation range and uniformity of spatial heterogeneity.
In general, the range shows a certain fluctuation with the increase of the overall concentration, ranging from 121.40 to 589.070 km. The PM2.5 concentrations in level IV and V have a wide range of spatial heterogeneity, and the spatial distribution is relatively uniform; the PM2.5 concentrations in level II and III have a small spatial heterogeneity range, and the spatial distribution presents non-uniform characteristics.

Spatial Distribution Characteristics and Interpolation Analysis of PM2.5 Concentration
To study the overall spatial distribution of PM2.5 concentration, using the four semi-variogram models, mentioned before, to fit on the discrete variance values. According to the principle of optimal fitting criterion, obtain 5 optimal heterogeneity functions as The fitting coefficient R 2 is the ratio of the sum of regression squares to the total deviation in the fitting model. Among the five models, only IV and V, corresponding to levels IV and V, R 2 reach more than 90%, the fitting is significant, while the rest the three models is not significant. Carrying Kriging interpolation analysis on the PM2.5 concentrations of levels IV and V, the results are shown in Figure 3, the solid line is the 5 μg/m 3 contour line.
As the PM2.5 concentration is in level IV, see Figure 3(a), the regional PM2.5 concentration presents a large difference. The area with a concentration value within the corresponding concentration range of level 4 air quality ((115 -150] μg/m 3 ), Journal of Geoscience and Environment Protection accounts for 45.03% of the total area, distributes in the southwest-northeast distribution along Zhumadian-Heze. The proportion of 11.45% in the level belongs to level 3 air quality ((75 -115] μg/m 3 ), they mainly distribute in the south of the Xinyang-Huaibei. The proportion of 43.52% in the level belongs to level 5 air quality (150 -250] μg/m 3 , they show two pollution systems, the first is distributed in the north of Hebi, centers as Handan with north-south distribution; the second is located to the west of Zhengzhou, with Yuncheng as the center, shows north by east to west distribution. Xinxiang links two systems together forming a level 5 air quality area. As the PM2.5 concentration is in level V, see Figure 3(b) the regional PM2.5 concentration presents a central characteristic, spreading from to Puyang the surroundings. The area with a concentration value within the corresponding concentration range of level 5 ((150 -250] μg/m 3 ) accounts for 76.39% of the total area, mainly distributes in the north of Zhumadian-Shangqiu. The area with the air quality of level 4 ((115 -150] μg/m 3 ) in the level is mainly distributed in the south of the Xinyang-Huaibei, accounts for 22.44% of the total area. There are also 1.17% area with the concentration of level 3 air quality (≤ 115 μg/m 3 ), they distribute in the southern part of Xinyang, Huainan and Bengbu.
In the process of evolving from level IV to level V, the relative intensity of the first pollution system gradually weakens, and its center gradually moves to east. The relative intensity of the second pollution system gradually strengthens, and the center gradually moves southward, then they integrate into one, and the center is located near Puyang.

Conclusion
concentration, the spatial heterogeneity of concentration gradually increases. The critical value of the overall PM2.5 concentration in the area is 150 μg/m 3 . As the overall PM2.5 concentration is less than it, the PM2.5 in the area mainly comes from local emissions. When the overall PM2.5 concentration is higher than this value, the influence of geospatial structure on the distribution of PM2.5 concentration is gradually obvious, and PM2.5 has a certain degree of spatial transmission. The transmission mainly includes two routes as Puyang-Xingtai and Puyang-Zhengzhou, and the transmission intensity of the former is greater than the latter.