Analysis of Geographically Anomalous 2019 Novel Coronavirus Transmission in China

Approximately in the late-December of 2019, the coronavirus disease outbreak took place in China. COVID-19 (Coronavirus Disease 2019) is contagious and detrimental to the human body, which can even lead to death. As a result, the understanding of COVID-19 has become especially important. This paper studies four cases of anomalous disease-spreading in China (Guangdong Province, Heilongjiang province, Tianjin municipality, and Guizhou province) and analyzes four influencing factors of the transmission (temperature, transportation and passenger traffic volume, household size and distribution, and awareness). Major conclusions in this paper are as follows. Transportation and passenger traffic volume and the number of larger households are positively related to the extent of disease-spreading; the degree of awareness is negatively associated with the extent of disease-spreading. Provinces, municipalities, and autonomous regions with a more urbanized distribution of households are prone to experience a greater extent of disease transmission. Although the novel coronavirus prefers colder environment, temperature appears to be a secondary influencing factor, as regions with negative temperatures have fewer diagnoses. Disease transmission in Guangdong province is caused by a high volume of passenger traffic, large and urbanized households, and low awareness. Heilongjiang province is mainly a result of high passenger traffic volume, long travelling trips, and low public awareness. Guizhou province is benefited from high awareness, limited passenger volume, and scattered households. Tianjin municipality is protected from the severe disease-spreading owe to its beneficial temperature, low land transportation volume, and high public and government awareness.


Introduction
In the Late-December of 2019, the novel coronavirus outbreak took place in Wuhan City, Hubei province, China. Coronavirus disease is officially named by the World Health Organization as COVID-19 (Coronavirus Disease 2019) on 12th January 2020. It can result in influenza, or even Middle East Respiratory Syndrome and Severe Acute Respiratory Syndrome. Currently, it is commonly believed that COVID-19 is caused by the sale and consumption of bats. Since December 2019, coronavirus disease has spread through relocation diffusion and contagious diffusion from the center, Wuhan city.
The existing literature on contagious disease outbreak is mainly based on the researching of the pathology of novel coronavirus and drug development [1] [2], epidemic prediction [3], and disease transmission [4] [5]. Among the analysis of disease transmission, most are concentrated on assessing the potential of human-to-human transmission. As this conclusion is now verified, less study, however, focused on researching influencing factors that affect trends of disease transmissions. Additionally, while the existing papers based their study areas on the center of the outbreak, Wuhan City, or on China as a whole, fewer papers have concentrated on minor provinces or municipalities that experienced an anomalous transmission. Therefore, this research will study four provinces (Heilongjiang province, Guangdong province, Guizhou province, Tianjin municipality) that experience an anomalous disease-spreading (that is, disease-spreading does not influence by geographical locations to the center) from 21st January 2020 to 19th February 2020, and analyzes the four influencing factors (temperature, transportation and passenger traffic volume, urban planning, and policy-making and individual consciousness) with respect to transmission. This paper will provide helpful guidance for disease prevention and control in the capital city of China, Beijing, other cities and countries across the world, and for future copings with similar contagious diseases.
The paper is organized as follows. Section 2 presents past literature on disease transmission and their characteristics. Section 3 introduces the data sources and methodology. Section 4 presents and analyzes the influencing factors for anomalous disease-spreading in four provinces. Section 5 discusses plausible suggestions for disease control and prevention in Beijing, and concludes the paper.

Literature Review
Most papers at present have focused on researching disease transmission. In particular, Riou et al. studied transmission patterns and the potential for sustained human-to-human transmission of 2019-nCoV in China [6]. According to Lin et al., public health interventions implemented at both the social and personal levels are effective in preventing outbreaks of COVID-19 in Wuhan and the other 29 provinces in China [7]. Based on 41 cases of 2019-nCoV in Wuhan City, Tang et al. find out that intensive contact tracing followed by quarantine and isolation can effectively reduce the control reproduction number and transmission risk [8].
The existing literature has mostly pinpointed their study areas in China as a whole. Fewer studies have narrowed down their focus on provinces and regions that experience anomalous disease-spreading (transmitting trends in provinces, municipalities, autonomous regions that are not influenced by geographical locations to the center). In addition, these studies put less focus on analyzing other influencing factors, such as government and residential awareness, temperature, passenger traffic volume, and household size and distribution. Thus, this paper is going to focus on studying four influencing factors in four provinces that manifested anomalous disease-spreading geographically.

Data Source
The data of accumulated confirmed cases of 2019-nCoV comes from Jinritoutiao (Today's Headlines) APP by 19th February 2020 in all provinces/municipalities/ autonomous regions in China mainland. The data of passenger traffic volume, number of large households, household distribution, and temperature derived from 2019 China yearbook, which reflects the statistics in 2018. We assume that the general rankings and trends of development for each influencing factor did not change significantly in 2019. For temperature, we adopted the average value in provincial capitals to represent the overall value in each province for correlation analysis. In addition, all the correlation analysis has eliminated the statistics of an obvious outlier, Hubei province. The research utilizes Python language in calculating the mean and ranking the statistics

GIS Tools
TIANDITU is a map website of China, which provides official free web mapping services. It features detailed street-level geographic data for China, and it is constructed and maintained by the National Geomatics Center of China (NGCC). This paper adopts the GIS Functions in TIANDITU (https://www.tianditu.gov.cn/) to gauge the distance between different province/municipality/autonomous regions and Wuhan City, China. Thus, this study derives regions that experience geographically anomalous disease spreading.
ArcGIS is a geographic information system (GIS) software for creating and editing maps and compiling and analyzing geographic data. This research visualizes the extent of confirmed coronavirus disease in each province/municipality/ autonomous region in China mainland through ArcGIS. Confirmed cases are divided into seven intervals when presenting on maps in different colors, respectively 1 -15, 15 -60, 60 -99, 99 -499, 499 -999, 999 -4999, >4999 (unit: case). Table 1 displays the confirmed cases (volume from more to less), and Figure 1 illustrates the spatial distribution of confirmed diagnoses. According to Table 1,  Figure 1 further reveals that geographical locations influences disease-spreading significantly. In Journal of Geographic Information System Wuhan can easily arrive to for its developed railroad and highways, does not suffer greatly from coronavirus disease as well; Guangdong province and Heilongjiang province, situated more than 1000 km apart from Hubei province, has a large amount of confirmed diagnosis. Therefore, this paper will take these anomalous cases of disease-spreading into account and study the influencing factors that lead to the anomalies. Table 2 and Figure Figure 3 illustrates the correlation between average temperature of January and February and confirmed cases in each province/municipality/autonomous region. Accordingly, there is no relationship between confirmed cases and the temperature warmer than 0 degrees Celsius. For regions that are colder than 0 degrees Celsius, the low temperature is accompanied with low diagnoses. Besides Heilongjiang (an anomalous case), none of the provinces/municipalities/ autonomous regions with negative temperatures have more than 400 confirmed cases. Although the novel coronavirus prefers colder environment, temperature appears to be a secondary influencing factor of 2019-nCoV transmission. According to the data in Table 3, Guangdong's temperature ranked the top two among China. Based on the relationships illustrated in the scatterplot, Guangdong turns out to have a higher number of confirmed cases.   Figure 4 illustrates the correlation between the number of medium-and large-sized households and the accumulated confirmed cases in each province/municipality/ autonomous region in China mainland by 19th February 2020. There is a moderately strong, positive, approximately linear correlation. Since members within each household have close contact with each other every day, the larger families, the higher the possibility for infectious people to transmit 2019-nCoV to a broader group. Table 4 shows the statistics of medium-and large-sized households in each province. There are 17,860 million families that have more than two members in Guangdong. In this case, it is prone for Guangdong to have a greater scope of disease-spreading than other provinces. Figure 5 indicates the correlation between the number of urbanized population and confirmed cases. As implied in the scatterplot, there is a moderately strong, positive, approximately linear correlation between the two variables. Urbanized population displays a more concentrated residential distribution compared to rural population. A denser distribution of people is linked with increased contacts between individuals, resulting in a higher risk of contagion.

Household Size and Distribution
Guangdong has 80,220 million households that live in urban areas, ranking the first. This correlates to Guangdong's 1332 cases of diagnosis, as shown in Table   5. Figure 6 indicates, at the very beginning stage of coronavirus outbreak, a significant rate of increase in Guangdong compared to the other three provinces examined in this paper. This shows that Guangdong's government and the public lacked serious awareness for disease control, which causes a large amount of confirmed diagnosis in later stages (in February).

Transportation and Passenger Traffic Volume
Tianjin is located at the Bohai Bay in the Great Plains of North China. It has the Journal of Geographic Information System

Awareness
During the disease outbreak, Tianjin has carried out "Implementation measures during coronavirus disease outbreak for promoting economic and social development" early in time. As illustrated in Figure 7, at the very beginning stage, Tianjin has a slow rate of increase in confirmed cases per day. Additionally, the Jinyun application has shared detailed trip information of diagnosed patients to inform residents about ongoing situations efficiently. As indicated in Figure 8, Tianjin has the lowest number of diagnoses within the Beijing-Tianjin-Hebei region: the confirmed cases in Beijing and Hebei province are about 3 to 4 times of those in Tianjin. This underscores that the government control through on-time policy-making and the update of detailed patients' information has been especially critical for disease prevention and control.

Transportation and Passenger Traffic Volume
According to Table 2, the population traffic volume of Guizhou is 930,250 million people in 2018, ranking in the top ten. However, during the outbreak, the heavy snow forced Guizhou to close roads and airport runways, which dissuades population movement and reduces the spread of disease.

Household Size and Distribution
Guizhou is mainly constituted of mountainous, located at the Yungui Plateau. This reduces population density and concentration. In light of Table 4 and Table 5, Guizhou Province has only 6820 million urbanized households and 5080 million large households. The geography of Guizhou results in less urbanized residential concentrations and more scattered city arrangement across the provinces. Thus, the coronavirus cannot spread out easily.

Awareness
Guizhou province has carried out five pieces of implementation measures to strengthen disease control: first, identify suspected cases from data; second, verify their detailed profile; third, acquire their daily routine and mobility information; fourth, strictly ask for following report of their condition; fifth, restricted suspected cases from inflowing into Guizhou [9]. According to a detailed news report in Beijing Evening News, Guizhou has security officers collecting information about individuals' mobility and body temperature [10]. In addition, Guizhou has subsidized small and medium-sized enterprises to ensure their steady development during work extension. The high government awareness is conducive to disease control.

Transportation and Passenger Traffic Volume
Though according to Table 2, the passenger traffic volume is 315,680 million people in Heilongjiang, ranking low, its Northern location makes Heilongjiang prone to infection. Trips to Heilongjiang are longer than those to other provinces, which increases the potential exposure to the virus and suspected cases. This further increases the possibility of infection, resulting in more confirmed cases in Heilongjiang.

Temperature
Heilongjiang is located in the very northern part of China with an average outdoor temperature of −17.9 degree Celsius during January and February. The freezing weather dissuades people from room ventilations. Especially when people congregated, the indoor and enclosed areas make disease-spreading easier, causing a significant increase in diagnosis.

Awareness
Heilongjiang's policies and control are not so strict as Tianjin and Guizhou. Heilongjiang did not suffer from the outbreak of SARS in 2003, and neither the government institutions nor residents have taken timely measures and control during the 2019 outbreak of the novel coronavirus. The mobility information of infected people did not expose in time. The public has not taken high regard for self-isolation. According to Sohu news, 94 percent of cases of infection are due to indoor congregation in Heilongjiang [11]. Furthermore, in accordance with Figure 9, Heilongjiang has 4 to 5 times of confirmed cases than Liaoning and Journal of Geographic Information System

Discussions and Conclusions
This paper has studied the disease outbreak by 19th February 2020 in provinces and municipalities that experienced anomalous disease-spreading. Guangdong and Heilongjiang province have an unusually high quantity of confirmed diagnoses, whereas Tianjin municipality and Guizhou province are protected from a massive coronavirus transmission. This study focuses on four influencing factors that can account for these anomalies and obtains the following results. Transportation and passenger traffic volume and residential distribution and size are positively related to the extent of disease-spreading; the degree of government and individual consciousness has a negative correlation with disease-spreading.
Although the novel coronavirus prefers colder environment, temperature appears to be a secondary influencing factor of 2019-nCoV transmission in this study, as regions with negative temperatures have fewer diagnoses. Disease transmission in Guangdong province is caused by the high volume of passenger traffic, large and urbanized households, and low awareness. Heilongjiang province is mainly a result of high passenger traffic volume, long travelling trips, and low public awareness. Guizhou province is benefited from high awareness, limited passenger volume, and scattered households. Tianjin municipality is protected from the severe disease-spreading owe to its beneficial temperature, low land transportation volume, and high public and government awareness.
Based on the results mentioned above, this paper will provide some suggestions for Beijing's disease control and prevention. As a well-developed municipality and the center of political and cultural activities, Beijing deserves more attention, or it will affect a series of more regions. According to the positive cases of disease control in Tianjin municipality and Guizhou province, this paper proposes the followings for the Beijing government and community officers. 1) Beijing government should take high regard for disease control. For example, it should try best to limit the population inflow on-time. It should also carry out policies like verifying the identities of drivers and passengers at traffic stations carefully, to avoid infected people from entering Beijing. 2) For communities, officers should keep detailed track of mobility and information of incoming visitors and residents. However, as the capital of China, Beijing has a more complex procedure in its policy-making and implementation. So, at the same time, a higher awareness in public will be helpful for disease reduction, or Beijing may undergo a similar negative situation as Heilongjiang province. This paper further advises the public as follows. 1) Individual residents should reduce congregating activities and self-isolate at home. 2) For organizations and institutions, less congregating events should be held during the outbreak to reduce the possibility of wider 2019-nCoV transmission.
The shortcomings of this paper are, first, with increasing data in the future, this paper can conduct more accurate analysis; second, in the future study, this paper can adopt complex approaches and regressing methodologies to test correlations and to study disease transmission.

Author Contributions
Y.L.: conceptualization, data processing, formal analysis, writing original draft, writing review and editing; Y.L. and Z.D.: visualization.

Conflicts of Interest
The authors declared no conflicts of interest.