A Neural Network Algorithm to Detect Sulphur Dioxide Using IASI Measurements ()
1. Introduction
Multi-layer perceptrons are used in many fields of science and technology. From the perspective of pattern recognition, they can be regarded as an extension of the many conventional statistical techniques which have been developed over several decades [1] . More extensive treatments of these topics can be found in the many textbooks on statistical pattern recognition, including [2] - [5] . Articles [6] and [7] have also emphasized the sta- tistical underpinnings of neural networks.
The main application of ANN to remote sense data analysis is scene classification, more commonly land cover classification, namely the process in which pixels are grouped according to the similarities of their spectral properties. If a pixel satisfies a certain set of criteria then it is assigned to the land cover class that corresponds to those criteria. This is the most common task achieved with remote sensing, and applications that involve both supervised and unsupervised ANNs are numerous. Due to the absence of any assumption about the data, many land cover classifications use data from different sensors such as optical and radar data [8] [9] , texture and ancillary data. Texture is incorporated in different ways: 1) precomputed per pixel texture information that is fed to the ANN together with the spectral information [10] or 2) incorporating neighbouring spectral information using a sliding window [11] .
It was reported that ANNs have more accurate performance than other techniques such as statistical classifiers, especially when the functional space is particularly complex and the source of data presents different statistical distributions [12] - [15] .
An additional advantage of the ANN approach is that they can perform supervised classification using fewer training data than those with maximum probability, because rules of recognition of a given category are based on the attributes, not only of training data of that particular category class, but also of other classes [16] .
The IASI instrument is used for operational measurements of meteorological parameters for numerical weather prediction. However, the high density of spectral information available and good signal-to-noise characteristics mean that the instrument can also provide measurements of many chemical species. The high density of spectral information and low instrument noise means that IASI could potentially provide very sensitive detection of SO2, especially for plumes in the upper-troposphere and lower-stratosphere, where the thermal contrast with the layers below is maximized. Usually full retrievals are time-consuming when searching for a sporadic event such as a volcanic eruption within a large data set. Detection methods do not require on-line radiative transfer cal- culations and can therefore be used to provide atmospheric measurements quickly, for example to help with volcanic hazard avoidance.
A commonly used method for the fast detection of trace atmospheric species is to calculate the difference in brightness temperature (BTD) between a small number of channels which are sensitive to the target species and others which are insensitive. The BTD between channels is then compared against a threshold which indicates a positive detection.
Simple BTD filters for the detection of SO2 are applied operationally to provide an early warning of volcanic activity and to help track volcanic plumes in near-real-time [17] - [19] . Although fast and robust, current BTD methods do not fully exploit the spectral range and resolution of hyperspectral instruments, and the noise associated with the measurements remains high. SO2 can be tracked from IASI using the difference in signal between two channels located at 1371.50 and 1371.75 cm−1 in the SO2 v3 vibrational band and two background channels at 1407.25 and 1408.75 cm−1 outside the SO2 band [20] . Recently the detection method of Walker [21] , uses channels between 1300 and 1410 cm−1 (441 channels). It is a fast method which uses many spectral channels to separate the contribution of SO2 from the contribution of other parameters affecting the measure- ment.
To date, no significant studies involving volcanic SO2 detection using ANNs have been carried out. The main aim of this study is to verify if such a technique may be useful and give substantial improvements for real time monitoring of SO2, representing a reliable ANN monitoring tool during preliminary detection phase.
The challenge was to assess a classification problem by using ANN in order to detect the presence in the atmosphere of volcanic SO2 using IASI hyperspectral data as input and the corresponding SO2 retrievals by [22] as target outputs.
Beside its fast usage in the real time application, the advantage of ANN approach is in the ability to model the non-linearity in the variable space and in exploiting all the spectral information contained in hyperspectral data, and it does not need to select any target and background channels for implementing detection filters.
In this study, we first provide in Section 2.1. an overview of the IASI sensor characteristics and description of SO2 total column estimation (Section 2.2.), then we discuss the neural network approach for SO2 detection (Section 2.3.). In Section 3, the scenario where the ANN approach has been applied is introduced and the results obtained are discussed. Finally the conclusions are reported in section 4.
2. Neural Network-based Classifier Algorithm
The neural network must first be “trained” by having it process a large number of input patterns and their as- sociated reference output patterns. Once trained, the neural network is able to recognize similarities when presented with a new input pattern, resulting in a predicted output pattern [23] .
2.1. Metop-IASI instrument
The IASI sensor is onboard Metop-A, a European weather satellite in operation since 2007. IASI is a Fourier transform spectrometer that covers the spectral range 645 - 2760 cm−1 (3.62 to 15.5 μm) with spectral sampling of 0.25 cm−1 and spectral resolution of 0.5 cm−1 [24] . It has a nominal radiometric accuracy of 0.25 - 0.58 K. The field-of-view (FOV) consists of four circular footprints of 12 diameter (at nadir) inside a square of 50 × 50 km, step-scanned across tracks (30 steps). It has a 2000 km wide swath and nominally can achieve global coverage in 12 h.
2.2. SO2 total column content description
Values used here as target outputs were obtained with the Optimal Estimation scheme to retrieve SO2 column amount and altitude from nadir satellite TIR measurements using the two SO2 absorption bands centred at about 8.7 and 7.3 μm, the v1 and v3 bands respectively; more details of the retrieval are in [22] . This retrieval technique uses an error covariance matrix based on an SO2-free climatology of differences between the IASI measure- ments and forward modelled spectra. Any differences not related to SO2 between IASI spectra and those simulated by a forward model are included in the covariance matrix, allowing a comprehensive error budget to be computed for every pixel.
As IASI measures atmospheric emission, it provides continuous measurements throughout an orbit. The IASI retrieval follows the method of [22] where SO2 concentration is modelled by a Gaussian profile. The optimal estimation technique of [25] is then used to estimate SO2 column amount and the height of the SO2 profile, and the surface skin temperature using IASI measurements from 1000 to 1200 cm−1 and from 1300 to 1410 cm−1 (the v1 and v3 SO2 bands).
2.3. Neural Network Methodology
In this work, Backpropagation Neural Networks (BPNN) have been used. These networks are the most widely used type for their flexibility and adaptability to model a wide spectrum of problems in many application areas [26] .
A neural network for SO2 detection was implemented using a training set of 59 IASI datasets, spanning the April-May 2010 Eyjafjallajökull and May 2011 Grímsvötn eruptions (Iceland). The training sets used IASI retrievals from [22] as representative of SO2 class.
The total number of training samples was 74,614, of which 43,437 were classified as SO2 affected of which 84% belong to the Eyjafjallajökull eruption.
The network topology consists of 441 inputs, namely the brightness temperatures in the 1300 - 1410 cm−1 channels, the so-called v3 band, comprising the range of wavelengths containing information used for SO2 amount estimation. Ten neurons were used in a single hidden layer, in order to cope with non linearly discrete problems [23] . Finally, outputs consisted of the two possible classification results, namely “SO2” and “not SO2”.
When training a neural network, it is of interest to obtain a network with an optimal generalization per- formance, i.e. the network performance should not degrade significantly when datasets other than the training one are analyzed.
Standard neural network architectures, like the fully connected multi layer perceptron, generally have too large a parameter space, and are prone to over-fitting [27] . During the training phase, although the network performance appears to constantly improve on the training sets, at some point it actually begins to worsen in terms of errors on unseen datasets. During supervised training of the neural network, cross validation can be used to detect when overfitting starts: training can then stopped before convergence to avoid overfitting, a process called early stopping [28] .
In this study, early stopping was carried out by splitting the training data into three sets: a training set, a cross validation set and a test set. These consisted respectively of 70%, 20% and 10% of the total number of training samples. The first set was used for network training. The cross validation set, consisting of 20%, was analyzed at a predefined number of epochs, to assess performance on datasets other than the training one. Finally, the test set (10%) was used as an independent data source to assess network performance after the training phase. The result obtained by processing 10% of the samples belonging to the test set, but not used for training, showed an overall accuracy of 100% without omission (false negatives) or commission (false positive) errors. The samples were 4092 pixels with SO2 and 1309 without, respectively.
3. Application to two operational scenarios
After completion of the training process, the ANN classifier algorithm has been proposed for a remote sensing application such as the monitoring of two volcanic eruptions that occurred in Iceland in 2010 and 2011. The 14 April to 24 May 2010 Eyjafjallajökull eruption caused widespread impacts [29] .
It was an explosive eruption with an injection of volcanic ash into the atmosphere and plumes that affected many regions of Europe. Plumes reached altitudes of 9 km, and the ash was transported across Europe and to the Atlantic Ocean. Despite its impact, the explosive phase was modest in size and the amount of SO2 released was low.
The Grímsvötn eruption occurred from May 21 to May 23 2011. It injected into the atmosphere a higher concentration of SO2 than Eyjafjallajökull. The height reached by the plume was 20 km [30] .
In order to evaluate the performance of neural network classifier in terms of its generalization capability and accuracy in detecting SO2 presence in the atmosphere from IASI datasets, we applied the trained ANN classifier to two independent IASI datasets from the May 2010 Eyjafjallajökull eruption and to two independent IASI datasets from the Grímsvötn eruption which occurred the following year. Note that ANN has been validated on datasets which include all IASI orbits, not only on SO2 plume data, with a significant number of pixels not affected by SO2 (more than 40%). Performance was assessed in terms of overall accuracy and Producer accuracy. The former represents the percentage of correct classifications, with respect to the total number of pixels analysed, considering all classes (i.e. SO2 and not SO2). The Producer accuracy instead represents the percentage of correct classifications with respect to the total obtained for that output class and it is an omission error indicator.
Tables 1-4 describe the confusion matrices obtained by applying the NN classifier to these independent datasets. Figures 1-4 show comparisons between classification maps obtained from the ANN classifier and from the SO2 retrieved by Carboni [22] for all validation dates.
In Table 1, Table 2, which summarize the results related to the Eyjafjallajökull eruptions of April 30 2010 and May 15 2010, the neural network classifier achieves an accuracy of 100%. This is a very good result, considering the ANN classifier has been applied to the entire IASI orbits. In fact, looking at Figure 1, despite most of the pixels not being affected by SO2, the ANN classifier perfectly detects SO2 plume. The same result is obtained when ANN is applied to May 2010 (Figure 2). Table 2 describes the confusion matrix and shows an overall accuracy of 100% for detecting the plume, despite the fact that it extends from 45˚ to 75˚N. Again, neither commission nor omission errors are present.
The results of applying the ANN classifier to the IASI datasets during the Grímsvötn eruption are described by confusion matrices depicted in Table 3 and Table 4 and depicted in Figure 3 and Figure 4, for May, 22 2011, at 12:00 and 20:00 UTC, respectively.
In the case of the morning orbit depicted in Table 3, the overall accuracy obtained is 99.1%. The producer accuracy, which measures the probability that the classifier correctly labels a pixel as SO2, is 70.3%. The lower accuracy obtained is due to the presence of an omission error, because 29.7% of pixels with SO2 have been
Table 1. Eyjafjallajökull. Confusion matrix assessing the classification accuracy on the IASI validation date of April 30, 2010, afternoon orbit.
Table 2. Eyjafjallajökull. Confusion matrix assessing the classification accuracy on the IASI validation date of May 15, 2010, morning orbit.
Table 3. Grímsvötn. Confusion matrix assessing the classification accuracy on the IASI validation date of May 22, 2011, 12:00 UTC.
Table 4. Grímsvötn. Confusion matrix assessing the classification accuracy on the IASI validation date of May 22, 2011, 20:00 UTC.
labelled as not affected by SO2.
Looking at the histograms in Figure 5 (top), representing the statistical distribution of false negatives for the 22 May 2011 date, we notice that most of the false negatives have values lower than 3 DU, which is around the SO2 detection threshold for a plume height between 2 and 4 km [21] . Looking at the maps depicted in Figure 6 we notice that they belong to a portion of the plume located close to the vent and are characterized by a height of between 2 - 4 km [30] . Considering that the SO2 detection threshold at these altitudes is 3.3 DU, we can assert that the ANN classifier shows a sensitivity in accord with the model for low SO2 amounts in the lower troposphere. Furthermore it puts in evidence the absorption band limit at 7.3 μm when observing SO2 in low troposhere.
For total SO2 column amounts of less than 0.2 DU concentrated in the boundary layer between 0 - 2 km the signal falls below the noise level of the instrument [21] . Instead, when SO2 is higher into the atmosphere due to an explosive volcanic event it is possible to discern very low concentrations. As shown in Figure 7, showing only positive detections for both Grímsvötn validation’s datasets, ANN classifier well detect low amount of SO2 when they are located at higher altitudes (low stratosphere) [30] .
The same assertion can be made for the afternoon orbit of the May, 22, 2011, 20:00 UTC: looking at Table 4, an overall accuracy of 98.5% is shown. Producer accuracy is 57% and, once again, this is due to false negatives related to pixels of plume at lower altitudes, as in the previous validation. In fact, looking at the histogram representing SO2 distribution (Figure 5, bottom) most of the values are around the detection threshold used for SO2 discrimination, but not higher than 3 DU, which characterizes the detection threshold for the part of the plume in the lower troposphere. Instead, when SO2 is higher into the atmosphere due to an explosive volcanic event it may be possible to discern very low concentrations.
The retrieval methodology of [22] establishes that for measurements in the thermal infrared spectral range the sensitivity is better for plume injected at high altitudes where the air is colder and the thermal contrast is higher,
Figure 1. Comparison between SO2 retrieved by Carboni [22] (top) and SO2 map from ANN classifier (bottom) for Eyjafjallajökull April 30, 2010 afternoon orbit. Pixels affected by SO2 are coloured red, pixels not affected by SO2 are coloured green.
Figure 2. Comparison between SO2 retrieved by Carboni [22] (top) and SO2 map from ANN classifier (bottom) for Eyjafjallajökull May 15, 2010 morning orbit. Pixels affected by SO2 are coloured red, pixels not affected by SO2 are coloured green.
Figure 3. Comparison between SO2 retrieved by Carboni [22] (top) and SO2 map from ANN classifier (bottom) for Grímsvötn May 22, 2011, 12:00 UTC. Pixels affected by SO2 are coloured red, pixels not affected by SO2 are coloured green.
Figure 4. Comparison between SO2 retrieved by Carboni [22] (top) and SO2 map from ANN classifier (bottom) for Grímsvötn May 22, 2011, 12:00 UTC. Pixels affected by SO2 are coloured red, pixels not affected by SO2 are coloured green.
Figure 5. SO2 amount distribution in DU, considering only false negatives, for Grímsvötn May 22, 2011, 12:00 UTC (top) and May 22, 2011, 20:00 UTC (bottom). Both histograms reveal that most values have an amount around the SO2 detection threshold.
Figure 6. Grímsvötn May 22, 2011, 12:00 UTC. SO2 Cloud height in mb (top) and SO2 amount in DU (bottom).
Figure 7. SO2 amount distribution in DU considering only positive detections, for Grímsvötn May 22, 2011, 12:00 UTC (top) and May 22, 2011, 20:00 UTC (botttom).
while at lower altitudes (e.g. the troposphere) the lower thermal contrast reduces the sensitivity of the detection method, so we can assert that the ANN classifier is in accord with the sensitivity of the model.
What can be underlined is that no commission errors are present at the validation stage (pixels erroneously labelled as affected by SO2) for both 2010 and 2011 eruptions, and the method shows the same accuracy when applied to IASI images with different illumination conditions (morning and afternoon orbits) and in cloudy sky conditions. Furthermore, it is noticeable that most of the training samples are from the Eyjafjallajökull eruption and only a low percentage from the Grímsvötn one (26%), which represents an event statistically not well known at the training stage. This can explain why the ANN classifier reveals a loss of accuracy when applied to Grímsvötn eruption.
4. Conclusions
This work proves the detection capability of volcanic SO2 by means of an algorithm neural network based classifier, applied to the thermal channels of the IASI hyperspectral sensor. SO2 amounts estimated by [22] were used as training target outputs.
The resulting confusion matrices associated with application to independent datasets spanning two Eyjafjalla- jökull volcanic eruptions revealed an accuracy of 100% on both tested images.
The results of applying the ANN classifier to the Grímsvötn eruption reveals an accuracy lower in detecting SO2 plume, but these cases included: a) The data-sets belonging to the Grímsvötn eruption represented a minor percentage of IASI time-series used as training samples; b) the presence of false negatives, i.e. pixels affected by SO2 not detected. In this case an analysis carried out on false negatives for both validation dates revealed that those pixel belonged to the portion of the plume at tropospheric altitudes where the sensitivity of the method is lower.
Nevertheless, the neural network classifier has been shown to work well both on daytime and night-time images and in cloudy sky conditions, and it successfully overcame the detection of false positives present in the validation dataset images, whose presence in multispectral or hyperspectral images can often undermine the performance of traditional classification algorithms. Therefore the proposed method represents a reliable new approach for volcanic SO2 detection and demonstrates also its usefulness in near real time volcanic monitoring activities, due to its fast application in the operational stage.