A Study on Tropical Land Cover Classification Using ALOS PALSAR 50 m Ortho-Rectified Mosaic Data

The main objective of this study is to find better classifier of mapping tropical land covers using Synthetic Aperture Radar (SAR) imagery. The data used are Advanced Land Observing Satellite (ALOS) Phased Array type L-band Synthetic Aperture Radar (PALSAR) 50 m ortho-rectified mosaic data. Training data for forest, herbaceous, agriculture, urban and water body in the test area located in Kalimantan were collected. To achieve more accurate classification, a modified slope correction formula was created to calibrate the intensity distortions of SAR data. The accuracy of two classifiers called Sequential Minimal Optimization (SMO) and Random Forest (RF) were applied and compared in this study. We focused on object-based approach due to its capability of providing both spatial and spectral information. Optimal combination of features was selected from 32 sets of features based on layer value, texture and geometry. The overall accuracy of land cover classification using RF classifier and SMO classifier was 46.8% and 55.6% respectively, and that of forest and non-forest classification was 74.4% and 79.4% respectively. This indicates that RF classifier has better performance than SMO classifier.


Introduction
Remote sensing techniques have been used for land cover mapping for several decades.Since the Synthetic Aperture Radar (SAR) has been applied to monitor land cover distribution, the Earth's surface information can be observed in high resolution regardless of any weather condition [1].SAR is a form of radar that has different penetration and different wavelength.Therefore, it performs subtle function on observing different land target.Generally, longer wavelength L-band SAR is tending to be more helpful for forest distribution monitoring [2] [3].However, terrain influences caused by the side-looking technique of radar sensor had revealed significant brightness variations in SAR imagery.For example, the foreside of a slope area is always more illuminated than the backside in terms of a smaller incidence angle, even though the radar reflectivity is from homogeneous scattering type [4] [5].
In January 2006, Japan Aerospace Exploration Agency (JAXA) successfully launched the Advanced Land Observing Satellite (ALOS).The Phased Array type L-band Synthetic Aperture Radar (PALSAR) is one of three onboard remote-sensing instruments, which is used for day-and-night and all-weather land observation [6].The ALOS Kyoto and Carbon (K & C) Initiative, an international collaborative project led by the Earth Observation Research Center (EORC) of JAXA created ALOS PALSAR 50 m ortho-rectified mosaic dataset with the fine-beam dual polarization (FBD) mode to provide HH and HV channels covering Japan, Kalimantan, Central Africa and other eight large areas [7].As a free product, many studies have shown its high potential for land cover observation and classification because the geometric correction and geographic coordinate have been processed [8]- [10].However, some studies only chose the flat terrain as study areas due to slope correction has not yet been done for keeping the landscape characteristics [11] [12].
In order to extract high accuracy land cover information, the normalized SAR imagery is required for classification quality.In addition to using backscatter coefficient as radar measure, the gamma-naught (γ˚) which can be calculated with σ˚/cosθ loc shows a stronger capability for forest data analysis, where σ˚ and θ loc mean the normalized cross section and local incidence angle, respectively.However, there is still limitation of terrain height within this equation [1] [3] [13].Furthermore, other calibration methodologies, like the correction based on the covariance matrix and the ground scattering area variation, were frequently applied the terrain correction before geocoding that represent better effect [14] [15].Thus, the order of data processing would affect the calibration effect of SAR image.
On the other hand, for high resolution remote sensing imagery classification, image segmentation may provide lots of object information not only about spectral, but also about the spatial or shape features.With the using of object-based approaches, some scientific literatures have shown the improved classification accuracy when compared with traditional pixel-based techniques [16] [17].Therefore, object-based approach was chosen to be used in this study.
The objectives of this paper are: 1) to remove the terrain influence on HH and HV polarization of ALOS PALSAR 50 m ortho-rectified mosaic data; 2) to identify the better classification performance between Sequential Minimal Optimization (SMO) classifier and Random Forest (RF) classifier.

Study Area
In order to investigate the possibility of land cover classification by using ALOS PALSAR 50m ortho-rectified mosaic data, the proposed approach was tested on a tropical zoon where covered by tropical forest, herbaceous, agriculture, urban and water body.This testing area is located within the West Coast Division of Sabah, Malaysia (116d01'55.7685"E,5d56'25.2183"Nand 116d08'43.2737"E,5d51'19.5894"N),approximately 12.53 km × 9.39 km.The DEM (Digital Elevation Model) ranges from 0m to 507 m. Figure 1 shows the location of study area with the color composite image of PALSAR data (R = HH, G = HV, B = HH − HV).

Satellite Data and Image Preprocessing
The ALOS PALSAR 50 m ortho-rectified mosaic data and Shuttle Radar Topography Mission (SRTM) are the main satellite data used in this study.These datasets are available at the K & C mosaic homepage and the Consultative Group on International Agricultural Research-Consortium for Spatial Information (CGIAR-CSI) website.
PALSAR 50 m ortho-rectified mosaic data are obtained from July to October, 2008, which include HH polarization and HV polarization.We resampled 90 m SRTM data to a pixel size of 50 m × 50 m using the bilinear interpolation in order to correspond with HH and HV images.Then, Landsat images were chosen as standard to correct geometric location of PALSAR mosaic images and SRTM by using Ground Control Points (GCPs).
Digital Number (DN) of HH and HV polarization images were converted to the normalized radar cross section (Sigma-zero) by the following equation [18]: where DN is the digital number of HH and HV images, Calibration Factor (CF) for ALOS PALSAR 50 m orthorectified mosaic had been given as (−83) and σ is the backscattering coefficient (dB).

Previous Slope Correction Models
Some methods of calibration need to be applied to original SAR data before further investigation, because of the amount of distortion that happens on the image (e.g., Speckle filtering, geometric correction and radiometric correction).As an important processing step to reduce the topography influence, different slope correction models had been generated based on the cosine correction method and the scattering area changing method.Many of these models were dealt with the terrain correction with different code-level programming or software tool [19] [20].Therefore, the order of data processing or the processing environment may result in different slope correction effect.In this section, three existing slope correction models were tested to perform their restoration capability for ALOS PALSAR 50m ortho-rectified mosaic data.The brief description of these models is shown in Table 1.
Model-1 and model-2 were proposed by the cosine correction method [21]- [24], while model-3 was proposed based on the scattering changing method [25] [26].The main calculation steps consist of: 1) Calculation of local incidence angle ( ) In this study, loc θ was derived by the following equation which described by Akatsuka et al. [21]: ( ) Here, the slope α and aspect angle β of SRTM were exported from spatial analyst tools of ArcGIS software.The azimuth angle of PALSAR platform ∅ is 261.84 degree, and θ is equal with the off-nadir an- gle 34.3 degree.
2) Calculation of local ground scattering area ( ) A Castel et al. [25] provided a sample equation to describe A over a flat terrain as the following equation: where r a and r s represent the azimuth and slant range pixel spacing respectively.On the other hand, the method for computing A slope was selected from the literature published by Wegmuller [27]: where ψ is the projection angle which defined as the angle between the surface normal and the image plane normal [28]: cos sin cos cos sin sin Here, α , β and θ represent the same meaning within Equation (2).

A modified Slope Correction Model
Based on a sample backscatter terrain correction model, a modified slope correction model for specially calibrating ALOS PALSAR 50 m ortho-rectified mosaic data of this study was generated with the regulation that the homogeneous land cover target should have the similar backscattering property regardless of any topography terrain [23].This sample model had been published by Ulaby et al. [29] and Sun et al. [30] as: where σ and σ  are backscattering coefficient before and after terrain correction, respectively.Sun et al.
[30] carried out this model both for HH polarization and HV polarization of L-band wave, and successfully induced the terrain effect with the changing of power p, where 1 2 p ≤ ≤ .Figure 2 and Figure 3 show the corrected images of ALOS PALSAR 50 m mosaic data by using Equation (6) when p is 1.Slope corrected HV image (Figure 2(b)) shows a more efficient correction on brightness variation than HH image (Figure 3(b)) over the mountain areas.Therefore, the limitation of this model is required to be improved for HH image.
Figure 4 shows a sample scattering geometry on the ground surface.Suppose the scattering surface over flat area that has standard backscattering behavior, each target over the tilted area will get an assumptive standard reference.Therefore, in Figure 4  cos cos Therefore, the assumptive standard backscattering behavior (  ) can be calculated from Equation (7): Here, we call ( ) as the strengthened correction factor for HH polarization.In addition, according to the geometry theorem, the power of p is decided by the relationship of OB and OA: where H is the satellite's height, and h is the DEM.

Feature Selection
Based on the slope corrected images, firstly, a multi-layer image was composited using four layers that are HH, HV, the difference between HH and HV (HH-HV) and the ratio of HH and HV (HH/HV).Image segmentation was derived from a commercial software tool, eCognition 9.0.Multiresolution segmentation method was applied to the composited image together with the ground truth training samples, which consist of forest, herbaceous, agriculture, urban and water body.2552 objects were generated along with the scale decided as 2. We extracted 32 object features based on the layer value, texture and geometry properties for HH, HV, HH-HV and HH/HV individually (Table 2).
In order to reduce the useless features to improve the accuracy for the subsequent classifiers Sequential Minimal Optimization (SMO) and Random Forest (RF), we chose to use Weka software tool, which is used for a correction of fast machine learning algorithms of data mining, to find an optimal feature combination and classification.The feature selection result extracted from Wrapper Subset Evaluation is shown in Table 3.

Classification and Validation Results
The predicted land cover classification results exported from Weka software tool are shown in Figure 7(a) and Figure 7(b).Images available in Google Earth were used as ground truth to evaluate the accuracy of SMO and RF classifier.50 points were randomly collected for each class from the classification result.Due to the classification result of SMO that did not show any agriculture class, we collected 50 validation random points from agriculture ground truth to identify which class did they misclassified into.Table 4(a) and Table 5(a) show the result of accuracy assessment of five land cover classes classified by SMO classifier and RF classifier.The overall accuracy and kappa coefficient were 46.8% and 0.335 for SMO classification result, while 55.6% and 0.445 for RF classification result.From Table 4(a), 28 validation points of agriculture were misclassified as forest, 13 points were misclassified to herbaceous and 9 points were misclassified as urban.The validation result identified with random points of the water body is 100% for both SMO and RF, while one segmented object was obviously misclassified as agriculture over the ocean in Figure 7(b).In order to solve this problem, we considered to alter this misclassified segment by hand in this case, while to make water body mask using the different satellite image or replace water body class with the result from RF classifier when classify larger study area.Forest and non-forest classification accuracy was also assessed using the same random validation points (Table 4(b) and Table 5(b)).RF classifier showed a better performance with overall accuracy of 79.4% and kappa coefficient of 0.42, while the overall accuracy of SMO was 74.4% and kappa coefficient was 0.286.

Conclusions
Slope correction and object-based classification approach have been described in this study.In terms of slope correction, the visual inspection of the corrected images demonstrated the new modified slope correction model that has the ability of reducing the terrain influence on ALOS PALSAR 50m ortho-rectified mosaic HH polarization as well as HV polarization.The reason of unavailable of the exiting slope models are considered as that may be caused by using different software tool (model-1), different code-level programming (model-2) and the determination of parameter (model-3).
In terms of classification, Sequential Minimal Optimization (SMO) was compared with Random Forest (RF) classifier after feature selection.Eight features were selected for SMO, while four features were selected for RF automatically by Weka software tool.The accuracy assessment represented that the use of RF is better than SMO for multiclass land cover classification with a higher overall accuracy and kappa coefficient.In order to avoid the overfitting problem occurred on the result of SMO, adding more ground truth data is considered in the future.Then, we will test SMO and RF classifiers on different location and larger area multiclass land cover classification.

3 )
Value decision of ref θ and n ref θ of model-2 and model-3 means a reference incidence angle which was defined as 34.3 degree in this study.Model-3 was applied to correct ALOS PALSAR 50m ortho-rectified mosaic HH and HV image when n is 0.7.

Figure 4 .
Figure 4. Ground scattering geometry.loc θ is the local incidence angle, inc θ is 34.3 degree.
coefficient after slope correction for HH image and HV image, and o HH σ and o HV σ mean the original backscatter coefficient of HH image and HV image, respectively.The effect of each slope correction model is assessed by visual identification in Figure 5 and Figure 6.As can be seen, model-1, model-2 and model-3 have less impact on changing the backscattering brightness variation over mountain area, while the corrected image generated using Equation (10) and Equation (11) show the mountain area had been changed to flat.

Table 1 .
Three existing slope correction models.
Combing the equations above, leads to the new terrain correction model for ALOS PALSAR 50 m mosaic data following with:

Table 2 .
The list of generated features from eCognition Developer.

Table 3 .
Optimal feature combination for SMO and RF.

Table 4 .
(a) Accuracy of land cover classification result of SMO classifier; (b) Accuracy of forest and non-forest result of SMO classifier.

Table 5 .
(a) Accuracy of land cover classification result of RF classifier; (b) Accuracy of forest and non-forest result of RF classifier.