Internet-Based Spectral Database for Different Land Covers in Egypt

The spectral signatures of natural objects in the visible and near-infrared spectral range are influenced by the object’s physical and biochemical properties. These signatures can be compiled in a database and used to retrieve information of land cover types and their physical composition from actual hyperspectral observations. This paper describes development process of hyperstectral database of reflectance from different land cover types in Egypt. It has been compiled from data obtained using a ground-based spectroradiometer system that covers the spectral range from 350 to 2500 nm at 1 nm resolution. The database is accessible through a website http://www.spectraldb.narss.sci.eg/spectral, where the system includes also metadata that describes the site environment and measurement processes. The system provides flexible mechanisms and friendly interfaces to allow accessing the database by the non-specialized people, whereas spectral data can be sorted by sites, species or selected environmental parameters. The system presents sample results from different vegetation and soil covers. Development of such a database is essential for different remote sensing applications, satellite’s calibrations, data dissemination and linkage with other databases for scientific researches purposes.


Introduction
A hyperspectral signature of a certain surface cover (e.g.vegetation or rock type) is a record of reflectance from a surface at hundreds or thousands of continuous spectral bands with narrow bandwidth.A collection of such signatures for different surfaces called hyperspectral database (or library).The database can be used later against measured hyperspectral observations from ground instrument, airborne, or spaceborne sensor to define the surface and retrieve its geophysical properties.For example, observed hyperspectral data from an airborne or spaceborne sensor can be comparing against hyperspectral signatures of different surfaces in order to identify the surface with the closest signature.A wide range of applications using hyperspectral databases have been developed in the fields of vegetations and soil management.This includes vegetation classification, crop infestation, plant pigmentation, and soil composition and moisture contents [1][2][3][4][5][6][7][8][9][10][11].The hyperspectral signatures are used also to calibrate, validate, and simulate remote-sensing imagery obtained from visible and near infrared sensors (wavelength between 0.4 to 3 nm).The interpretation of the hyperspectral data is not usually simple due to their high dimensionality that is the result of sampling a wide spectral range in very narrow bands.
Instruments that are used to measure hyperspectral signature of surface cover called spectroradiometers.They allow non-destructive sampling of radiation from the surface and enable users to measure spectral signatures under different physical and geometrical conditions.Operation of these instruments tends to be relatively easy and data are collected quickly [12].Examples of commercial instruments include the ASD Field Spec (350 -2500 nm), Apogee/StellarNet SPEC-PAR/NIR (350 -950 nm), StallerNet EPP2000-NIR-InGaAs (400 -1700 nm) and Spectral Evolution PSR-3500 (350 -2100 nm).
The structure of hypercpectral database of spectral signatures should allow easy access by users in terms of searching, retrieving and viewing information.The users may be able to search the data using different criteria such as surface cover category (including classes and subclass), data acquisition time or location or just by typing a keyword.For efficient research, spectral data need to be augmented by metadata and stored in an organized way [13].It is stated that an organized, shareable and non-redundant storage of spectral data and associated metadata are important features towards better data quality and accessibility.Major hyperspectral databases include SPECCHIO [14] and SpectraProc DB [12].SPEC-CHIO is a spectral database that holds reference and actual field spectra obtained from several spectroradiometers.Different spectroradiometrs produce data encrypted differently.SPECCHIO can incorporate data from different spectral device.Metadata are also included since they ensure the durability of database and enables the sharing of spectral data between research groups.SpectraProc DB, on the other hand, was specifically designed to provide fast, repeatable processing of hyperspectral data only from any Analytical System Device (ASD) FieldSpec Pro spectral data files.It includes many features of data organization and standard preprocessing.
Sharing spectral signature datasets with other scientists is complicated due to differences in data collection techniques and sampling environment conditions [15].One common feature of all hyperspectral databases is their significant amounts of data that cover not only numerous electromagnetic wavelength but also a wide range of incidence angles and surface cover conditions (e.g.minerals impurity, crop infestation, etc.).
The National Authority of Remote Sensing and Space Science (NARSS) in Egypt have compiled a hyperspectral database of different vegetations, soil and rock types as well as sea and lake water.The data were obtained from field measurements using an ASD Field Spec 3 spectroradiometer.This was pursued through a collaborative project between NARSS and Mediterranean Agronomic Institute of Chania (MAICh).The project, called GI@MED aimed at the creation of a Greek-Egyptian bilateral axis for the promotion of applications of modern geoinformation technology in the sectors of Agriculture and the Environment in the South-eastern Mediterranean.More information can be accessed through http://www.gi-eastmed.net/.
The database has been incorporated in a web-based system with appropriate user interface that offers easy accessibility to the data.This paper addresses the procedure of the hyperspectral data acquisition and an overall description of the database systems, including storage and processing.Sample results of signatures of vegetation crops are also introduced.

Spectroradiometr
Radiometry is the subject of measuring electromagnetic reflection in the visible and near infrared bands.This requires measuring the input (i.e.irradiance) to the surface and the reflection (radiance) from the surface.In remote sensing, in-situ radiometry measurements are important in order to relate the airborne or spaceborne observations to ground measurements.Radiant energy and radiant flux are measured in Joule (J) and Watt (W), respectively.Radiance and spectral radiance are com- Field spectrometry is the quantitative measurement of radiance, irradiance, reflectance, or transmission in the field.That involves the collection of accurate spectra and requires an awareness of the influences of many parameters such as sources of illumination, atmospheric characteristics and stability, winds, instrument field-of-view, sample viewing and illumination geometry, instrument scanning time and spatial and temporal variability of the sample characteristics.
A Spectrometer is an instrument that measures the radiation over specific bands of the electromagnetic spectrum, usually from gamma rays into the far infrared.This instrument is also called spectroradiometer.It is used in measuring the spectral distribution of radiation in very narrow bands (measured in nanometers).Spectroradiometers are used to confirm the specifications of light sources and calibrate display devices such as screens of laptops.In remote sensing, they are used to acquire a reference database of the spectral signatures of ground cover types, and use them later to retrieve ground parameters from the observations.Accurate spectral measurements require awareness of many influences and parameters.This includes sources of illumination, atmospheric characteristics and stability, winds, the instrument's field-of-view, the sample viewing and illuminetion geometry, the instrument scanning time and the spatial and temporal variability of the sample characteristics.
In this study, a hyperspectral instrument called Analytical Spectrum Device (ASD FieldSpec ® 3HR) was used to measure the spectral signature of different land cover types.This a high resolution (HR) instrument, operates in the full optical spectral range (visible, near infrared and short wave infrared) starting from 350 nm to 2500 nm.The sampling interval is 1.4 nm within the range 350 -1050 nm and 2 nm within the range 1000 -2500 nm.The measured reflectance is resampled to produce a spectrum at 1 nm interval for the entire range.

Data Collection
A protocol of spectroscopy measurements was developed.It includes the following steps: 1) A visit to each site was planned before the actual field measurements in order to examine its features and estimate the time required for travelling between sites.This is important because the spectroradiometr can work only within a certain range of sun zenith angle (available between 10:00 am and 2:00 pm).2) The location of the site determined accurately using a Garmin Etrex vista HCX GPS device that connected to spectroradiometer controller unit to acquire location data automatically.In addition to the environmental conditions are suitable for taking the measurements.Measurements under cloudy sky require special procedures.3) The instrument should be calibrated before any measurements are taken or at least every 15 minutes.In the case of thick cloud cover, a new calibration should be performed after the sky is clear again.
The measurement scheme includes acquisition of hyperspectral signature from different "samples" and several "replications" within each sample.A replication is the average of 25 spectra; a fixed number by the instrument.The number of replications is between 3 and 10.The sample is obtained at a certain location for a certain type of surface cover.Examples of surface types include field crops, species of natural vegetations, soil types, rock types as well as natural and artificial water bodies.Several samples are taken of the same surface types at different locations.Metadata are recorded for each sample.This includes environmental, geographical, surface and instrument's parameters.Different samples are taken for the same surface under different environmental conditions (as described by the metadata) or if the surface conditions vary (e.g.irrigation conditions, stressed plants, infested areas, etc.).The wider the variety of the conditions the more number of samples that have to be taken.This warrants better representation of the averaged hyperspectral signature from the surface cover types.
Most of the data were collected at height range from 30 -50 cm above the target surface at nadir position (90˚) by hand held technique.In some situations the instrument was mounted on the 10 meter Mobile Unit for Field Spectroradiometric Measurements at Mediterranean (MUFSPEM@MED).This platform was designed specifically for this study.It was used in measuring the spectral signature of high trees or when a certain field of view is required to match the field of view from a remote sensing instrument.The spectroradiometer provides the spectral signature in either one of the two forms: Digital Number (DN) mode and reflectance mode.In the reflectance mode the instrument's processor divides the measured radiance by the stored reference spectral from a white panel (as described later).Figure 1 shows photographs of the instruments during data collection at different field situations including the case when it has to be mounted on the mobile unit.
The set of measurements for each sample is composed of three parts: 1) the spectral signature from a standard white panel, 2) the signature of the actual ground target from several replications (a replication is a full spectral signature), and 3) another measurement from the white panel.The white panel is provided with the instrument.It reflects all the incident sunlight, so the measured reflected spectrum is actually a measure of the incident light spectrum.The white reference spectrum needed to convert the DN measurement into reflectance (if requested later by the user).The replicates within each sample are measurements of the hyperspectral signature without the reflectance from reference white panel.
Data were collected over a wide variety of land cover in Egypt.This included field crops and different soil types in selected areas within the Nile Delta and the valley between Cairo and Bani-Sueif.Most of the rock samples, on the other hand, were obtained mainly from different point along the coast of the Red Sea, including areas around the city of Safaga.The geographic locations are included in the database.Hyperspectral signatures were also obtained from different water bodies: natural represented by lakes and sea water, and artificial represented by irrigation canals.The lake samples were obtained from the Manzala Lake and the Burullus Lake.The sea water samples were obtained from several points around the Za'farana and Safaga along the Red Sea coast.The samples from the irrigation canals were obtained from the Salam Canal in Tina Plain.
The spectral signature of any given crop (or rock type) was obtained by averaging the spectra obtained from several samples acquired from different locations under a variety of conditions.Field instruments cannot have the quality of a stable laboratory instrument because of temperature drifts and rapidly changing atmospheric absorptions.Averaging data from many samples and replications within each sample will remove any artifacts from field data caused by weather, soil or user's handling of the instrument.This will make the signature more representative of the surface type and therefore more appro-reflectance) and the measurements setup (e.g.handheld or on a mobile platform).priate for comparison against remote sensing data.

Generation of the Metadata
Availability of the metadata allows better identification of the product hyperspectral signature.It also facilitates comparison between data in different sets upon confirming the similarity of the conditions under which data were acquired.On the other hand, the lack of metadata can render previously collected data useless for different applications.
The metadata that accompany the spectral signature contain information about the object (i.e. the surface cover) and the sampling environment at the time of data capture.The data are acquired for each sample.They are composed of information on environmental, geographical, surface and instrument's parameters.Environmental metadata include cloud conditions, ambient temperature, relative humidity, air pressure, and wind speed and direction.These parameters measured by the sensors on an available meteorological station.The stations are established and operated by Central Laboratory for Agriculture Climate (CLAC).In the case when no station is available, the data obtained from climatic sites available on the Internet.Geographical metadata include an accurate geographical location of the sampling measured by a Geographical Positioning System (GPS) attached to the instrument.

Data Archiving Structure
Most database systems hierarchically structured because this scheme accommodates different levels of information.The first level is the broadest then it sub-divides into more detailed sub-levels [16].The method used in the current system is the standard FAO classification scheme for the land cover.The scheme presented in Figure 2. The Dichotomous phase ends with eight classes.These classes are included in the database.The Modular-Hierarchical phase is not part of the fixed structure of the database.The administrator only can add it.It contains optional detailed information about sub-classes that can be acquiring occasionally.An example of a piece of information that can be adding under any one of the eight classes (i.e.below the Dichotomous phase) would be about the irrigation system for the primary vegetated terrestrial cultivated and managed area.
The metadata for the surface include information regarding appropriate properties of the surface.For example, in the case of the crops, the information includes planting date, recent irrigation conditions, the brand of the crop (e.g.wheat Sids 1 or 5; wheat Sakha 93, etc.).In the case of the rocks, the information includes chemical descriptions such as Syenogranite (composed of K-fields, orthoclase, Quartz, Biotite, Fe-Oxide and Clay Mineral).Soil analysis is also part of the metadata.Metadata related to sensor's parameters include information about the viewing angle of the instrument, the illumination source (sun of artificial self-illumination), the lens used in measurements, the mode of the measurements (DN or The spectral database designed as a relational database.It takes into consideration implements of hierarchical structure as used for the field data to store species, site, date and spectrum data.Multiple datasets can hold spec- The database was primarily designed to hold spectral data of vegetative studies.Therefore it started with a simple structure that could hold spectral data sorted into sites and species.In the diagram the entities species, site and spectrum reflect the hierarchical structure of spectral data.The sample entity was added to the top of this structure to enable the storage of data belonging to different studies in the same database. The concepts and necessities of using spectral library data in remote sensing courses on e-learning education and the important of concepts (tools, software, data sampling,…) on the quality on collected data.In addition to, Web techniques can concentrate on the: description of spectral data acquisition, store & organization and Data preprocessing [17].

Spectral Database Implementation
NARSS spectral database user has two different pages.Admin page that ASD files can be imported into the database as part of a study and allows admin users to manage sites, species, spectrum files and its metadata.More-over, some of pre-processing functions were developed such as converting the DN data to reflectance data instantly online.Figure 4 shows the developing steps of NARSS spectral database development.
The database implemented in Postgres, GNU open source software.Postgres is a relational database management system that can handle large amounts of data.The system designed to work with any relational database system as shown in Figure 5.A web application developed to simplify data loading archiving and analyzing spectral data.
The technical requirements for such a system were (Graphical user interface to the database-Functions for loading spectral data into the database-Data pre-processing functions-Data analysis functions-File export functions to allow data analysis and plotting in 3rd party packages).The application developed using the latest in java and web 2.0 technologies using spring framework.Hibernate framework was used for the database access from java code.JQuery javascript framwork used at the client to get and show spectral data and graphs.

Sample Results
The database structure results showed that the species, classes, types of natural and artificial different land cover types are well archived spectral data.The Archiving of the spectrum were mainly based on the FAO classification system shown in Figure 6.The organized structure    of the database led to better planned sampling campaigns for spectral data collection.In addition, well-ordered data makes the data handling, pre-processing and processing much easy.
The spectral data archived as raw data in the database, both of admin and non admin users can demonstrate and visualize the data either as raw data or pre-processing by converting the DN measurement data to reflectance at http://www.spectraldb.narss.sci.eg/spectral.That gives ability for comparing between the different targets and makes the different targets signatures comparison much easily shown in Figures 7-9.

Conclusions
The high performance and flexibility of the NARSS spectral database of spectral data and metadata retrieving indicates well-defined relations without conflicts between the database attributes.The spatial location of every spectral sample is overlaid on Google maps which are connected to the first non-admin users page that clarify exactly spatial distribution of the collected spectral measurements.
Sustainability of using the archived spectral data on the long term for scientific and research purposes in addition to data distribution is return to NARSS spectral database well-organized storage system of the spectroscopy measurements and its metadata.The developed database provides multi admin users to upload, remove, modify and pre-process spectral datasets and non-admin users demonstrate pre-process the spectral files.In addition, all subsequent operations can be carried out on the original dataset that remains unchanged.Moreover, acquired spectral data in DN can be faster and easily converted to reflectance data.GI@MED project financed by the Hellenic Aid (Greek international development services) for supporting NARSS with the ASD spectroradiometer and using GI@MED lab facilities at NARSS premises to carry out this work.

Figure 1 .
Figure 1.A set of photographs showing different arrangements of measurements for different land cover types, including handheld instruments and instrument mounted on fixed and mobile platforms.

Figure 2 .
Figure 2. Hierarchical structures of the main classes according to FAO land cover system.

Figure 3 .
Figure 3. Hierarchical structures of the archived spectral data in the database.

Figure 5 .
Figure 5. Overview of the spectral database model shows entities and relations.

Figure 7 .Figure 6 .
Figure 7. Sample of reflectance spectral signatures visualization.Figure 6.The non-admin user main page of NARSS spectral data management and processing.

Figure 9 .
Figure 9.The online data conversion from DN to reflectance.