Analysis of Change Point in Surface Temperature Time Series Using Cumulative Sum Chart and Bootstrapping for Asansol Weather Observation Station, West Bengal, India

This paper aims to detect the short-term as well as long-term change point in the surface air temperature time series for Asansol weather observation station, West Bengal, India. Temperature data for the period from 1941 to 2010 of the said weather observatory have been collected from Indian Meteorological Department, Kolkata. Variations and trends of annual mean temperature, annual mean maximum temperature and annual minimum temperature time series were examined. The cumulative sum charts (CUSUM) and bootstrapping were used for the detection of abrupt changes in the time series data set. Statistically significant abrupt changes and trends have been detected. The major change point in the annual mean temperatures occurred around 1986 (0.57 ̊C) at the period of 25 years in the long-term regional scale. On the other side, the annual mean maximum and annual mean minimum temperatures have distinct change points at level 1. There are abrupt changes in the year 1961 (Confidence interval 1961, 1963) for the annual mean maximum and 1994 (Confidence interval 1993, 1996) for the annual mean minimum temperatures at a confidence level of 100% and 98%, respectively. Before the change, the annual mean maximum and annual mean minimum temperatures were 30.90 ̊C and 23.99 ̊C, respectively, while after the change, the temperatures became 33.93 ̊C and 24.84 ̊C, respectively. Over the entire period of consideration (1941-2010), 11 forward and backward changes were found in total. Out of 11, there are 3 changes (1961, 1986 and 2001) in annual mean temperatures, 4 changes (1957, 1961, 1980 and 1994) in annual mean maximum temperatures, and rest 4 changes (1968, 1981, 1994 and 2001) are associated with annual mean minimum temperature data set.


Introduction
Climate change and global warming are recognized worldwide as the most crucial environmental dilemma that the world is experiencing today [1]- [3].Concern in climate change and global warming by the international community, non-government organizations and governments has brought great interest to climate scientists leading to several studies on climate trend detection at global, hemispherical and regional scales [3]- [5].Nowadays, study of long-term temperature variability has been a topic of particular attention for climate researchers as temperature affects straightaway human activities in all domains.Increase in anthropogenic greenhouse gases' concentrations in the atmosphere is mainly due to human activities such as deforestation, burning of fossil fuel and the conversion of the Earth's land to urban uses driven largely by the rapid growth of the human population that are major causes of warming of the climate system and of the process of climate change [4]- [6].Several studies of long-term time series of temperatures have been done [7]- [9].Results have shown that the Earth's surface air temperature has increased by 0.6˚C -0.8˚C during the 20 th century, along with changes in the hydrologic cycle.Temperatures in the lower troposphere have augmented between 0.13˚C and 0.22˚C per decade since 1979, according to satellite temperature measurements [10].In an analysis of a time series combining global land and marine surface temperature records from 1850 to 2010 developed by the Climate Research Unit (CRU), the year 2005 was seen as the second warmest year, behind 1998 with 2003 and 2010 tied for third warmest year [8] [11]- [14].The two most recent decades were compared with the period 1979-1990.Warming has been observed to be concentrated in the most recent decade, from 2001 to 2010.The results were attributed to natural variability of the climate and/or to human activity but not to the El Niño-Southern Oscillation as previously suggested by other authors [13]- [15].Generally, there is consent among scientists that most of the observed increase in globally averaged temperatures since the mid-20 th century is unequivocal and very likely due to the observed increase in anthropogenic greenhouse gas concentrations.The 10 warmest years of the 20 th century all occurred in the last 15 years of the century, 1998 being the warmest.The Intergovernmental Panel on Climate Change (IPCC, 2007) projected that the average global surface temperature will continue to increase to between 1.4˚C and 5.8˚C above 1990 levels,by 2100 [2].To some extent, other factors, such as variations in solar radiation [8] and land use at regional scale, are also considered to be among the causes of the observed global warming [16]- [22].Though some studies have been done on climate change in different regions of India and whole of the adjacent sub-continent, the lack of reliable surface observational climate data still constitute a foremost gap affecting the detection capacity of impacts resulting from long-term climate changes.An effort is, therefore, required in maintaining existing observatories and increasing networks, and cooperation between countries.Regardless of the limited surface observational climate data, results from those studies indicate, in general, an increasing trend in temperature and a decreasing trend in rainfall during the last century.But key sources of errors in the detection of abrupt changes in climate data habitually consist of change of location of observatory; changes of instruments, change in observation times, missing data, and methods used to calculate daily means and increased urbanized and/or industrialized areas.Inhomogeneities in climate data time series can bring inaccuracies and make possible misinterpretation in the investigation of climate change over a region when analyzing a given climate parameter.Hence, there is a need for detecting change points in temperature time series and adjustment of data thereby.

Study Area
Asansol (23˚40'48"N and 86˚59'24"E) is situated in the Burdwan District of West Bengal It is Located between the Damodar and Ajay rivers over the extended part of Chhota Nagpur plateau, while the plateau proper occupies most of the state of Jharkhand.Another river, Barakar, joins the Damodar near Dishergarh.A small rivulet, Nunia, flows past Asansol.Dhanbad district lies on the western side of Asansol while, Durgapur sub division of Bardhman district lies on the eastern side and to the south, across the Damodar river are the Purulia and Bankura districts.To the north of Asansol are Dumka (Jharkhand) and Birbhum (West Bengal) districts.Dhanbad district across the Barakar river in Jharkhand is also a major mining area and has close links with Asansol.Both Dhanbad and Asansol lies in the Damodar valley.Mean an annual day time maximum temperature generally reaches as high as 31˚C.At night, the average annual minimum temperature drops down to around 22˚C.The average annual relative humidity is around 65%.The Heat Index is extreme and one can guess how hot it feels when relative humidity is added to actual air temperature.The average monthly amount of precipitation in rainy season has been recorded at around 78 mm.The average monthly wind speed around 2 km/h or 1 knot, approximately.In recent years, the maximum sustained wind speed has reached 56 km/h, or 30 knot.

Data Base
The data used in this study were collected from the Eastern Regional Centre of Indian Meteorological Department, Kolkata.They consisted of time series of year wise monthly average of maximum and minimum surface air temperature for the period ranging from 1941 to 2010 for the Asansol weather observatory.Those data were statistically processed and then condensed to annual mean values for further analysis.Studies of long-term climate change require that data be homogenous.Observed climate abrupt changes in a homogenous climate time series are caused only by variations in weather and climate [23].Several studies have been conducted on quality control and homogenization of climatological data for the detection of climate trends [24]- [27].Detailed explanations on the methods to be followed for data analysis are as under.

Cumulative Sum Charts (CUSUM) and Bootstrapping
The cumulative sum charts (CUSUM) and bootstrapping were performed as suggested by Taylor [28].Let, represent n data points of a time series, and 0 1 2 , , , b) Let, 0 ∑ be equal to zero.c) i ∑ are computed recursively as follows ( ) where, Actually, the cumulative sums are not the cumulative sums of the values.Instead they are the cumulative sums of differences between the values and the average.These differences sum to zero so the cumulative sum always ends at zero 0 n= ∑ .The confidence level can be determined for the apparent change by performing a bootstrap analysis [29] [30].Before performing the bootstrap analysis, an estimator of the magnitude of the change is required.One choice, which works well regardless of the distribution and despite multiple changes is, i ∆ which is defined as Once the estimator of the magnitude of the change has been selected, the bootstrap analysis can be performed.A single bootstrap is performed by a) Generating a bootstrap sample of n data points of time series, denoted as  , by randomly reordering the original n values.This is called sampling without replacement (SWOR).
b) Based on the bootstrap sample, the bootstrap CUSUM is calculated following same method and denoted as, j ∑ .c) The maximum, minimum, and difference of the bootstrap CUSUM are calculated and the difference between the maximum and minimum bootstrap CUSUM is defined as, 1 1 max min d) Determine whether, j i ∆ < ∆ .The bootstrap analysis consists of performing a large number of bootstraps and counting the number of bootstraps for which difference bootstrap j ∆ less than the original difference i ∆ .Let N is the number of boot- strap samples performed and let K be the number of bootstraps for which j i ∆ < ∆ .Then the confidence level that a change occurred as a percentage is calculated as follows Confidence Level ( ) Bootstrapping is a distribution free approach with only a single assumption, that of an independent error structure.
Once a change has been detected, an estimate of when the change occurred can be made.One such estimator is the CUSUM estimator.Let, i m = , such that max Then m is the point furthest from zero in the CUSUM chart.The point m estimates last point before the change occurred.The point 1 m + estimates the first point after the change.The second estimator of when the change occurred is the mean square error (MSE) estimator.Let MSE ( m ) be defined as where In MSE error estimation, the data series is split into two segments, 1 to m, and 1 m + to n, then it is esti- mated that how well the data in each segment fits their corresponding averages.The value of m, for which MSE (m) is minimized, gives the best estimate of the last point before change, while the point 1 m + denotes the first point after change.In the same way, data of each segment can be passed through the above method to find level 2 change points that divides corresponding segments into sub-segments.Repetition of the procedure mentioned above helps finding significant change points at subsequent levels for each of which associated confidence limits and levels can be determined by bootstrapping.In this manner multiple change points can be detected by incorporating additional change points each at successive passes that will continue to split the segments into two.Once the change points, along with associated confidence levels, have been detected a backward elimination procedure is then used to eliminate those points that no longer qualify test of significance.To reduce the rate of false detection, when a point is eliminated, the surrounding change points are re-estimated along with their significance level.Thus the significant change points have been detected for the temperature time series considered for this study.
Variations and trends of annual mean temperature, annual mean minimum temperature and annual minimum temperature values time series were examined following the method mentioned above.The cumulative sum charts (CUSUM) and bootstrapping were used for the detection of abrupt changes.Section of the CUSUM chart with an ascending trend indicates a period when the values remaining above the overall average.Likewise, a section with a descending trend indicates a period of time where the values lie below the overall average.The confidence level can be determined by performing bootstrap analysis.

Results and Discussion
Before the change point analysis, a simple regression model has been applied to show the long term trends of annual mean temperature series, annual mean maximum temperature series and annual mean minimum temperature series and results shows that the annual mean minimum temperature has been remarkably increasing trend than the annual mean maximum temperature in Figures 1(b) and (c).In both the cases, Adjusted R² values are 0.21 and 0.06, respectively.But annual mean temperature tend to be seriatim manner and it meets with Adjusted R² 0.58 in Figure 1(a).The results of change-point analysis for the annual mean temperature, annual mean minimum temperature and annual mean maximum temperature for Asansol weather observatory are presented in Figure 2 where period of change has been indicated with shaded background.In Table 1, confidence levels of those changes are mentioned.The table also gives a level associated with each change which is a measure of importance of the change.The level 1 change signifies the first change detected which is most visibly apparent in the CUSUM charts.Level 2 changes are detected on second pass of the data.There are as many numbers of levels as the numbers changes found.By means of the change point analysis from the independent error structure no outlier's assumptions were made in annual mean maximum and annual mean minimum temperature trend in one hand.On the other hand, the annual mean temperature appears to violate the assumption of independence error.The confidence levels and confidence interval may be affected.The errors are positively correlated, meaning that if single value is above average temperature trend, the next several values will also tend to be above average.As a result,  interval 1962, 1987); at a confidence level of 98% and 99% respectively, at level 3 and 1. Prior to changes 1961 and 1968, the annual mean temperatures for the two changing years were 27.11˚C and 27.79˚C respectively, while after the change the temperature became 27.79˚C and 28.36˚C respectively.But largest change of temperatures is associated with 1986 (0.57˚C) at the period of 25 years in the long term regional scale are presented in Table 1(a) and Figure 2(a).On the other side the annual mean maximum and annual mean minimum temperature series also have distinct change points and presents in Table 1(b), Figure 2(b) and Table 1(c), Figure 2(c) respectively.The annual mean maximum temperatures time series exhibits a level 1 change in the year 1961 (Confidence interval 1961(Confidence interval , 1963) at a confidence level of 100%, while for the annual mean minimum temperature series a level 1 changes is found to occur in 1994 (Confidence interval 1993, 1996) at a confidence level of 98%.Before these changes in 1961 and 1994, the annual mean maximum and annual mean minimum temperatures were 30.90˚C and 23.99˚C respectively, while after the change the mean temperatures became 33.93˚C and 24.84˚C, respectively.Over the entire period of consideration (1994-2010), 11 forward and backward changes were found.Out of 11, there are 3 changes (1961, 1986 and 2001) in annual mean temperature, 4 changes (1957, 1961, 1980 and 1994) in annual mean maximum temperature and rest 4 changes (1968, 1981, 1994 and 2001) in annual mean minimum temperature.
The CUSUM charts are presented in change in the year 1994 and chart line again gradually raises upward indicating values to be above the overall average of annual mean temperature (23.66˚C).
The Figure 4 with 4(a) and 4(c) presents that there is no significant change in annual mean and annual mean minimum temperature standard deviations and estimated standard deviations are 0.32˚C and 0.48˚C, respectively.But, a significant change in Table 2 and Figure 4 with 4(b) in annual means maximum temperature standard deviation was found.The CUSUM chart exhibits an abrupt change of level 1 category in the year 1963 (Confidence interval 1961, 1979) at a confidence level of 96% (Table 2).Prior to the change in 1963, annual mean maximum temperature standard deviation was 0.21˚C, while after the change the standard deviation of annual mean maximum temperature became 0.89˚C and another change in the year 1983 can be identified.Linear trends of standard deviations for each of the three data sets for the period (1941-2010) in question have been shown in Figure 5 with 5(a)-5(c) by simple plots of standard deviations against the years.between mean annual and mean minimum temperature trends.The inhomogeneity in an unadjusted temperature time series is often attributed to change in instrumental arrangements and/or measuring conditions.But if such was the case in the present context, all the data sets would have supposedly indicated changes within approximately common range of years which could not be found in case of Asansol.Hence, the change in temperature trend may be due to rapid industrialization and expansion of mining activities and subsequent urbanization since 1980s.Above all, those activities took a huge toll of large-scale forest destructions which may be responsible for local level increase in mean annual and mean maximum temperature.But drawing such inferences does require inclusion of more data from larger number of stations in the analyses and adjustment or homogenization of data sets before employing them in change point analysis.1941 1950 1959 1968 1977 1986 1995 2004 Year

Figure 1 .
Figure 1.Regression of adjusted long term time series of (a) mean annual temperature, (b) mean annual maximum temperature, and (c) mean annual minimum temperature (1941-2010) for Asansol Weather Observatory.

Figure 2 .
Figure 2. Temperature trends for (a) mean annual temperature, (b) mean annual maximum temperature, and (c) mean annual minimum temperature with maximum range of temperature fluctuation indicated by red lines under the situation of no change in trend.the analysis may incorrectly indicate that a change has taken place.But the associated level of confidence obtained from bootstrapping may confirm the change to have occurred.This analysis shows that the annual mean temperature reveal a quasi-change in the year of 2001 (Confidence interval 1999, 2003) at a confidence level of 100%, at level 2. Prior to the change in 2001, annual mean temperature was 28.36˚C; while subsequent to the change the average became 28.92˚C.It has been also noted that an abrupt change in mean annual temperature in the years of 1961 (Confidence interval 1960, 1978), and 1986 (Confidenceinterval 1962, 1987); at a confidence level of 98% and 99% respectively, at level 3 and 1. Prior to changes 1961 and 1968, the annual mean temperatures for the two changing years were 27.11˚C and 27.79˚C respectively, while after the change the temperature became 27.79˚C and 28.36˚C respectively.

Figure 3 .Figure 3 .
Figure 3. CUSUM charts for (a) mean annual temperature, (b) mean annual maximum temperature, and (c) mean annual minimum temperature.

Figure 4 .
Figure 4. CUSUM charts for (a) mean annual temperature standard deviation, (b) mean annual maximum temperature standard deviation, and (c) mean annual minimum temperature standard deviation.

Figure 5 .
Figure 5. Temperature trends for (a) mean annual temperature standard deviation, (b) mean annual maximum temperature standard deviation, and (c) mean annual minimum temperature standard deviation with maximum range of temperature fluctuation indicated by red lines under the situation of no change in trend.

Table 1 .
Significant changes in (a) mean annual temperature (b) mean annual maximum temperature and (c) mean annual minimum temperature.

Table 2 .
Significant changes in mean annual maximum temperature standard deviation.CUSUM charts have been prepared to identify the change points in the series, and bootstrapping technique has been employed to determine significance level associated with each of the detected change points.The analyses confirm a major change in mean annual temperature of Asansol to have occurred around 1986.For the mean maximum and mean minimum temperatures, such major changes have occurred somewhere around 1961 and 1994, respectively.Inspection of CUSUM Charts clearly demonstrates resemblance