Effectiveness of Sentinel-1-2 Multi-Temporal Composite Images for Land-Cover Monitoring in the Indochinese Peninsula

The Indochinese Peninsula contains two thirds of the world’s tropical forests; however, it is one of the world’s most threatened habitats, with some of the highest rates of deforestation and land use change. The availability of higher resolution satellite data from missions such as Landsat 8 and Sentinel-1-2 has brought new opportunities for precise land cover monitoring in recent years. However, utilizing a massive volume of high spatial and temporal resolution data for ecological applications is challenging. One approach is to employ composite images generated from the multi-temporal satellite data. The research was conducted in two study sites located in the Indochinese Peninsula, Laos-Thailand and Vietnam-Cambodia, both vulnerable to deforestation and land use changes. We assessed the potential of recently available composite images, namely the Biophysical Image Composite (BIC), Forest Cover Composite (FCC), Enhanced Forest Cover Composite (EFCC), and Water Cover Composite (WCC), for the classification and mapping of land cover types. Three machine learning classifiers, k-Nearest Neighbors (KNN), Support Vector Machines (SVM), and Random Forests (RF), were employed, and the performance of the composite images was evaluated quantitatively with the support of ground truth data. The overall accuracies (Kappa coefficients) obtained from the combination of composite images were 0.92 (0.89) and 0.90 (0.86) for the Laos-Thailand and Vietnam sites, respectively. These results highlight the effectiveness of the composite images for the classification and mapping of land cover types.


Introduction
The Indochinese Peninsula, usually referred to as the mainland of Southeast Asia, consists of the countries of Myanmar, Thailand, peninsular Malaysia, Laos, Cambodia, and Vietnam (Keyes, 1994). The region is drained mostly by river systems flowing in a north-south direction from the Tibetan Plateau.
The Indochinese Peninsula is one of the world's top biodiversity hotspots (de Bruyn et al., 2014). It also contains two thirds of the world's tropical forests; however, it is one of the world's most threatened habitats, with some of the highest rates of deforestation (Keenan et al., 2015). Deforestation has affected the regional monsoon and climate (Kanae et al., 2001; Sen et al., 2004) as well as agricultural productivity (Lawrence & Vandecar, 2015). The construction of large dams on the rivers also affects wetlands and migratory birds, freshwater biodiversity, and rural livelihoods (Dudgeon, 2000).
Satellite remote sensing is a suitable technology for timely monitoring of a rapidly changing environment and can play a key role in protecting the unique biodiversity of the region (Wang et al., 2010; Miettinen et al., 2014; Mulatu et al., 2017). Multi-spectral, high-resolution satellite imagery has been recognized as a technology with the potential to greatly facilitate conservation, management, and decision-making (Kerr & Ostrovsky, 2003; Boyle et al., 2014; Hoan et al., 2013).
The availability of higher resolution satellite data from missions such as Landsat 8 and Sentinel-1-2 has brought new opportunities for precise land cover monitoring in recent years (Malenovský et al., 2012; Roy et al., 2014; Herold et al., 2016; Hoan et al., 2018). However, utilizing a massive volume of high spatial and temporal resolution data for ecological applications is challenging. One approach is to employ composite images generated from the multi-temporal satellite data. The composite images are designed by conceptualizing the spectral characteristics and temporal/phenological variations of the land cover types, harnessing multi-temporal satellite images of an entire year (Sharma et al., 2019).
In a previous study, Sharma et al. (2016) designed the Biophysical Image Composite (BIC) for the visualization and extraction of the major biophysical components on the surface of the Earth: barren/urban, vegetation, and snow/water areas. For the separate extraction and visualization of forest canopy, a new image composite called the Forest Cover Composite (FCC) was designed in previous research. For enhanced extraction and visualization of forested areas, the annual median backscattering intensity of the VH (vertical-transmit, horizontal-receive) polarization (VH_median) obtained from the Sentinel-1 mission was combined with the green term of the FCC (NDVI_mean), and the Enhanced FCC (EFCC) was proposed. The Water Cover Composite (WCC), made up of annual minimum green (Green_min) reflectance, annual minimum near infrared (Nir_min) reflectance, and annual maximum Superfine Water Index (SWI_max) values as the red (R), green (G), and blue (B) bands, respectively, was designed for better extraction of surface water bodies in previous studies (Sharma et al., 2015; Sharma et al., 2019).
N. T. Hoan et al. Journal of Geoscience and Environment Protection
The importance of assessing the applicability of composite images in different geographical zones for the classification of land cover types has been emphasized (Sharma et al., 2019). In this study, the four multi-temporal composite images (BIC, FCC, EFCC, and WCC) were utilized to obtain land cover information for different geographical zones of the Indochinese Peninsula. The general objective of the research is to enhance the ability of satellite remote sensing for land cover monitoring of the Indochinese Peninsula. The specific objectives of the research are as follows:
1) Evaluate the potential of recently available composite images (BIC, FCC, EFCC, and WCC) for the classification and mapping of major land cover types such as forest, grass, crop, and non-vegetation.
2) Assess the performance of different machine learning classifiers, namely k-Nearest Neighbors (KNN), Support Vector Machines (SVM), and Random Forests (RF), for classification.
3) Produce land cover maps for the study sites at 10 m spatial resolution using the best-performing classifier and composite images.

Study Areas
The research was conducted in two study sites, Laos-Thailand and Vietnam, located in the Indochinese Peninsula. Both sites were chosen because they are vulnerable to deforestation and land use changes. The study sites are depicted in

Generation of Composite Images
We processed all Sentinel-1 Ground Range Detected (GRD) product scenes and Sentinel-2 Top-Of-Atmosphere (TOA) reflectance product scenes available over the two study sites in 2018. Cloudy pixels in the Sentinel-2 scenes were masked out using a separate quality assessment band available with the data. Sentinel-2 images with spatial resolutions varying from 10 to 20 m were resampled to 10 m. The Sentinel-1 mission provides C-band SAR data; its scenes were processed for radiometric calibration and terrain correction. We extracted the VH polarization data and resampled it to 10 m resolution to match the Sentinel-2 data. Using all the multi-temporal Sentinel-1 SAR and Sentinel-2 optical images available over the study sites for the entire year of 2018, four composite images (BIC, FCC, EFCC, and WCC) were generated. The compositions of these images are given in Equations (1)-(4) in the following sections.
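The cloud-masking step can be illustrated with a minimal sketch in plain Python. The text does not state which quality assessment band or which bits were used; the sketch assumes a QA60-style bitmask in which bit 10 flags opaque clouds and bit 11 flags cirrus, and the function names are hypothetical.

```python
# Assumed bit positions (QA60-style convention); not stated in the paper.
OPAQUE_CLOUD_BIT = 10
CIRRUS_BIT = 11
CLOUD_BITS = (1 << OPAQUE_CLOUD_BIT) | (1 << CIRRUS_BIT)


def is_clear(qa_value):
    """Return True when neither assumed cloud flag is set in the QA value."""
    return (qa_value & CLOUD_BITS) == 0


def mask_cloudy(reflectance, qa):
    """Replace reflectance values flagged as cloudy with None.

    Both arguments are equal-length sequences, one entry per pixel
    (or one entry per acquisition date for a single pixel).
    """
    if len(reflectance) != len(qa):
        raise ValueError("reflectance and qa must have the same length")
    return [r if is_clear(q) else None for r, q in zip(reflectance, qa)]


# Toy values: the last three observations carry at least one cloud flag.
masked = mask_cloudy([0.1, 0.2, 0.3, 0.4], [0, 1024, 2048, 3072])
```

Representing masked observations as None lets the later compositing steps skip them when computing annual statistics.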
The BIC is an RGB (red, green, blue) color composite image made up of the Normalized Difference Vegetation Index (NDVI), short wave infrared reflectance, and green reflectance, each taken from the day of highest vegetation activity over an entire year (Sharma et al., 2016). The composition of the BIC is shown in Equation (1).
In Equation (1), NDVI is calculated by normalizing the difference between the near infrared (Nir) and red (Red) reflectance (Rouse et al., 1974). The annual maximum NDVI (NDVI_max) represents the period of highest vegetation activity. The short wave infrared (Swir_NDVImax) and green (Green_NDVImax) reflectance from the day when NDVI is maximum are used to expose barren and water areas, respectively. The BIC was designed to discriminate between vegetative (forests, crops, grasses) and non-vegetative (barren, urban, and water) areas.
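The BIC composition described above can be sketched for a single pixel's annual time series. This is an illustration in plain Python based only on the description in the text, not the authors' implementation; the function names and the use of None for cloud-masked observations are assumptions.

```python
def ndvi(nir, red):
    """Normalized difference of near infrared and red reflectance."""
    total = nir + red
    if total == 0:
        return None  # undefined when both reflectances are zero
    return (nir - red) / total


def bic_pixel(red, nir, swir, green):
    """Return (R, G, B) = (NDVI max, Swir at NDVI max, Green at NDVI max).

    Each argument is a list with one reflectance value per acquisition
    date; None marks a cloud-masked observation. Returns None when no
    date has a usable observation.
    """
    best = None  # (ndvi, swir, green) on the date of highest NDVI so far
    for r, n, s, g in zip(red, nir, swir, green):
        if any(value is None for value in (r, n, s, g)):
            continue  # skip dates with a masked band
        value = ndvi(n, r)
        if value is None:
            continue
        if best is None or value > best[0]:
            best = (value, s, g)
    return best


# Toy time series with three dates; the third date is cloud-masked.
rgb = bic_pixel(
    red=[0.10, 0.05, None],
    nir=[0.30, 0.45, 0.50],
    swir=[0.20, 0.15, 0.10],
    green=[0.08, 0.06, 0.05],
)
```

In this toy series the second date has the highest NDVI, so the short wave infrared and green values are taken from that same date rather than from their own annual extremes.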
The Forest Cover Composite (FCC) was designed for the separate extraction and visualization of forest canopy, on the assumption that annual mean values of forested areas are usually lower than those of non-forested areas. The composition of the FCC is shown in Equation (2). For enhanced extraction and visualization of forested areas, the annual median backscattering intensity of the VH (vertical-transmit, horizontal-receive) polarization (VH_median) obtained from the Sentinel-1 mission was combined with the green term of the FCC (NDVI_mean), giving the Enhanced FCC (EFCC) shown in Equation (3), which was also proposed in a previous study. The Water Cover Composite (WCC) was designed for better extraction of surface water bodies (Sharma et al., 2015; Sharma et al., 2019). It is made up of annual minimum green (Green_min) reflectance, annual minimum near infrared (Nir_min) reflectance, and annual maximum Superfine Water Index (SWI_max) values as the red (R), green (G), and blue (B) bands, respectively. The composition of the WCC is shown in Equation (4).
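The WCC bands are simple annual statistics, which can be sketched for a single pixel as follows. The Superfine Water Index formula is not given in this text, so the sketch takes precomputed SWI values as input; the function name and the None convention for masked observations are assumptions.

```python
def wcc_pixel(green, nir, swi):
    """Return (R, G, B) = (annual min Green, annual min Nir, annual max SWI).

    Each argument is a list with one value per acquisition date; None marks
    a masked observation. SWI values are assumed to be computed elsewhere.
    Returns None if any band has no usable observation.
    """
    def valid(values):
        return [v for v in values if v is not None]

    green_ok, nir_ok, swi_ok = valid(green), valid(nir), valid(swi)
    if not (green_ok and nir_ok and swi_ok):
        return None
    return (min(green_ok), min(nir_ok), max(swi_ok))


# Toy time series with three dates and some masked observations.
wcc = wcc_pixel(
    green=[0.08, None, 0.05],
    nir=[0.30, 0.02, None],
    swi=[-0.4, 0.7, None],
)
```

Unlike the BIC, each band here is reduced independently, so the three values may come from different dates.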

Preparation of Ground Truth Data
This research deals with the classification and mapping of four major land cover types present in the study sites: forest, crop, grass, and non-vegetation (water, barren, and built-up). We prepared the ground truth data by visual interpretation with reference to Google Earth imagery. Altogether, 100 geo-location points, each representing a homogeneous area of up to 90 × 90 m, were prepared for each class at each site.

Machine Learning and Mapping
Three machine learning classifiers, k-Nearest Neighbors (KNN; Cover & Hart, 1967; Altman, 1992), Support Vector Machines (SVM; Cortes & Vapnik, 1995), and Random Forests (RF; Breiman, 2001), were employed to evaluate the performance of the composite images (BIC, FCC, EFCC, and WCC) for the classification of land cover types. The performance of the composite images was evaluated using a 5-fold cross-validation approach. Further details on the machine learning and cross-validation procedure have been described in a previous study (Sharma et al., 2017). Two accuracy metrics, overall accuracy and the Kappa coefficient, calculated through the cross-validation approach, were used for quantitative evaluation. The parameters of each classifier were optimized by repeated trial and error, guided by the validation metrics. Land cover maps were also produced for each study site using the best-performing classifier.
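Both accuracy metrics can be computed from a confusion matrix of reference and predicted labels, for example after pooling the predictions from the cross-validation folds. The following is a minimal sketch in plain Python; the function names and the toy labels are illustrative assumptions, not the authors' code or data.

```python
def confusion_matrix(y_true, y_pred, labels):
    """Build a matrix with reference classes as rows and predictions as columns."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for truth, pred in zip(y_true, y_pred):
        matrix[index[truth]][index[pred]] += 1
    return matrix


def overall_accuracy(matrix):
    """Fraction of samples on the diagonal of the confusion matrix."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total


def kappa_coefficient(matrix):
    """Cohen's kappa: agreement corrected for chance agreement."""
    total = sum(sum(row) for row in matrix)
    observed = overall_accuracy(matrix)
    row_totals = [sum(row) for row in matrix]
    col_totals = [sum(col) for col in zip(*matrix)]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / total ** 2
    if expected == 1:
        return 0.0  # degenerate case: only one class present
    return (observed - expected) / (1 - expected)


# Toy example with the four classes used in this study (made-up labels).
labels = ["forest", "crop", "grass", "non-vegetation"]
y_true = ["forest", "forest", "crop", "crop",
          "grass", "grass", "non-vegetation", "non-vegetation"]
y_pred = ["forest", "forest", "crop", "grass",
          "grass", "grass", "non-vegetation", "non-vegetation"]
matrix = confusion_matrix(y_true, y_pred, labels)
accuracy = overall_accuracy(matrix)
kappa = kappa_coefficient(matrix)
```

Because the Kappa coefficient discounts agreement expected by chance, it is never higher than the overall accuracy, which matches the pattern of the values reported in this paper.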

Cross-Validation Results
The validation results for the three machine learning classifiers are shown in Figure 2 and

Land Cover Maps
We employed the best-performing classifier (Random Forests) to produce land cover maps for each study site using the combination of composite images

Conclusion
Image compositing techniques have evolved as an alternative approach for retrieving concise biophysical information from massive volumes of multi-temporal images acquired by different sensors. In this research, we evaluated the performance of recently available composite images (BIC, FCC, EFCC, and WCC) for the classification and mapping of land cover types in two study sites in the Indochinese Peninsula. The composite images were designed for the extraction of individual land cover types; however, the machine learning and validation approach employed in this research showed that they are also effective for the classification of major land cover types. The overall accuracies (Kappa coefficients) obtained from the combination of composite images were 0.92 (0.89) and 0.90 (0.86) for the Laos-Thailand and Vietnam sites, respectively. These results highlight the effectiveness of the composite images for the classification and mapping of land cover types. The methodology presented in this research is expected to be useful for land cover monitoring in other regions as well. Further improvement of the classification accuracy by aggregating the results from different classifiers is recommended.