Band Selection using Hyperspectral Data of Subtropical Tree Species

Tung FUNG, Fung Yan MA, Wai Lok SIU
Department of Geography
The Chinese University of Hong Kong
Shatin, New Territories, Hong Kong, China
Tel: 852-2609-6535 Fax: 852-2603-5006
Email: tungfung@cuhk.edu.hk

Keywords: hyperspectral, band selection

Abstract
Band selection was performed on hyperspectral data acquired over the 400-900 nm spectral range for 25 subtropical tree species in Hong Kong. Stepwise discriminant analysis and hierarchical clustering were used to select bands with high discriminatory power. In addition to the blue, green, red and near-infrared bands, spectral bands along the blue-green edge, green-red edge and red edge were found to carry important information for discriminating the 25 tree species. Bands selected from these regions identified the tree species with 89% overall accuracy.

Introduction
The increasing availability of hyperspectral data and imagery provides better data for environmental monitoring, forest tree species identification, geological exploration and many other applications. Currently, most studies use either airborne hyperspectral sensors such as the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) or field spectra collected in situ. With hyperspectral scanners on board satellites (e.g. OrbView-4), hyperspectral image analysis will certainly become a major arena of research and applications. The use of hyperspectral data, however, raises research issues. Price (1994) noted three major ones: (1) the trade-off between the number of spectral intervals (bands) and the spatial resolution of the imagery; (2) the trade-off between higher spectral resolution and reduced signal-to-noise ratio; and (3) the appropriate positioning of the selected spectral bands to provide information about the objects of interest. Hyperspectral data undoubtedly possess a rich amount of information.
Nevertheless, redundancy in information among the hundreds of bands opens an area of research into the optimal selection of bands for analysis. Price (1994) indicated that for natural materials 15-25 spectral bands were sufficient, whereas the number of bands had to be doubled for studies of minerals. Petrie and Heasler (1998) examined optimal band selection strategies using spatial autocorrelation, spectral autocorrelation and optimization with distance metrics. Warner and Shank (1997) also examined the use of spatial autocorrelation for narrow band selection.

The objective of this study is to investigate how many bands, and in which spectral ranges, are needed to identify subtropical tree species from their hyperspectral reflectances. The tropical and subtropical environment is endowed with a rich diversity of flora, which also makes it a difficult and challenging environment for monitoring. Under this project, we earlier examined the effectiveness of subtropical tree identification using hyperspectral data (Fung et al., 1999). Classification with a back-propagation feed-forward neural network and with linear discriminant analysis revealed that about 80% overall accuracy could be attained, with autumn being the best season for hyperspectral data. The present study goes on to suggest suitable bands for tree species recognition.

Hyperspectral data collection
A high spectral resolution spectrometer, the PSD2000, was used for taking hyperspectral data. The spectrometer was linked to a notebook computer for data acquisition and analysis. It recorded data from 210 nm to 1050 nm with a spectral resolution of approximately 2.6 nm, and had a field of view of 22°. During data collection, two references, a white and a dark, were used for calibration. Based on the illumination condition, an integration time for collecting photons was selected to adjust the master/slave sampling frequency so as to avoid saturation or shortage of signal.
Sample spectra were then obtained by dividing the sample radiance by that of the standard white reference under the same illumination condition. In this study, only data from 400-900 nm were examined, to avoid bands with too much noise.

Subtropical trees were the prime focus of this study. Twenty-five tree species within the campus of the Chinese University of Hong Kong were selected for the study. They included native and exotic, needleleaf and broadleaf species. These trees are also commonly planted in the country parks and urban areas of Hong Kong. They are listed as follows:
Owing to the difficulty of acquiring in situ ground measurements in the subtropical environment (Fung et al., 1999), the experiment was undertaken in a controlled environment. The following procedures were adopted:
Data Analysis
Two algorithms were adopted for band selection in this study. A preliminary band selection was first done using stepwise discriminant analysis. By adjusting the F ratio required for bands to enter the discriminant model, the number of bands was substantially reduced to include only those with the highest discriminant power. Two criteria were used. Under the first, a band entered if the significance level of its F value was smaller than 0.05 and was removed if the significance level of its F value was greater than 0.1. Under the second, a band was removed if its F value was less than 2.71 and entered if its F value was greater than 3.84. The tree identification results obtained under the two criteria were compared with those obtained using all 138 bands, based on a significance test of Kappa.

In the second analysis, hierarchical clustering was performed using the autumn data set. The 400-900 nm spectral data were grouped into clusters. Based on analysis of the dendrogram, bands within clusters were selected for investigation. Different combinations of bands were tested for their accuracy in tree identification so as to find the most informative combinations.

Results and discussion
The result of band selection using stepwise linear discriminant analysis is shown in Table 1. Using the first criterion, the number of bands selected ranged from 9 to 18. Fewer bands, 7-9, were selected if the second criterion was adopted. The classification results of the two criteria showed no significant difference when the corresponding Kappa values were compared. However, adopting either criterion significantly improved the classification results compared with those using all 138 spectral bands. The improvement was particularly marked when the winter data were used: the overall accuracy using 9 bands was 91.27%, whereas only 70.16% was obtained when all the bands were used.
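The second entry/removal criterion can be illustrated with a minimal Python sketch of stepwise band selection. The paper does not state which software or test statistic was used, so the sketch assumes the common formulation in which the partial F of a band is derived from the ratio of Wilks' lambda with and without that band. The function names (wilks_lambda, partial_f, stepwise_select) and the synthetic data are hypothetical and serve only as illustration.

```python
import numpy as np


def wilks_lambda(X, y, cols):
    """Wilks' lambda det(W)/det(T) for the bands in `cols` (1.0 if empty)."""
    if not cols:
        return 1.0
    Z = X[:, cols]
    centered = Z - Z.mean(axis=0)
    T = centered.T @ centered                 # total scatter matrix
    W = np.zeros_like(T)                      # pooled within-class scatter
    for label in np.unique(y):
        d = Z[y == label] - Z[y == label].mean(axis=0)
        W += d.T @ d
    _, logdet_w = np.linalg.slogdet(W)
    _, logdet_t = np.linalg.slogdet(T)
    return float(np.exp(logdet_w - logdet_t))


def partial_f(X, y, base, band):
    """Partial F of `band` given the bands already in `base`."""
    n, g, p = X.shape[0], np.unique(y).size, len(base)
    ratio = wilks_lambda(X, y, base + [band]) / wilks_lambda(X, y, base)
    return (n - g - p) / (g - 1) * (1.0 - ratio) / ratio


def stepwise_select(X, y, f_enter=3.84, f_remove=2.71, max_steps=100):
    """Enter the best band if F > f_enter; drop bands whose F < f_remove."""
    selected = []
    for _ in range(max_steps):
        candidates = [j for j in range(X.shape[1]) if j not in selected]
        if not candidates:
            break
        scores = {j: partial_f(X, y, selected, j) for j in candidates}
        best = max(scores, key=scores.get)
        if scores[best] <= f_enter:
            break
        selected.append(best)
        while len(selected) > 1:
            f_out = {j: partial_f(X, y, [k for k in selected if k != j], j)
                     for j in selected}
            worst = min(f_out, key=f_out.get)
            if f_out[worst] >= f_remove:
                break
            selected.remove(worst)
    return sorted(selected)


# Synthetic illustration only (not the paper's data): 3 classes, 6 bands,
# with class information placed in bands 2 and 4.
rng = np.random.default_rng(0)
labels = np.repeat(np.arange(3), 40)
spectra = rng.normal(size=(120, 6))
spectra[:, 2] += 3.0 * labels
spectra[:, 4] += 3.0 * (labels == 1)
print(stepwise_select(spectra, labels))
```

The first criterion, which is based on significance levels rather than F values, would additionally need the F distribution to convert each partial F into a probability.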
The selected bands lay mainly in the green peak and the red edge, showing that bands in these regions had stronger discriminating power.
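The grouping of bands by hierarchical clustering, described under Data Analysis, can be sketched as follows. The paper does not state the distance measure or the linkage rule, so this sketch assumes a distance of 1 - |r| between two bands (r being their correlation across the sample spectra) and average linkage. The names cluster_bands and middle_band and the synthetic spectra are hypothetical.

```python
import numpy as np


def cluster_bands(spectra, n_clusters):
    """Agglomerative clustering of bands (columns) into `n_clusters` groups.

    Assumed settings, not taken from the paper: distance 1 - |r| between
    bands and average linkage between clusters.
    """
    corr = np.corrcoef(spectra, rowvar=False)   # bands x bands correlation
    dist = 1.0 - np.abs(corr)
    clusters = [[j] for j in range(spectra.shape[1])]
    while len(clusters) > n_clusters:
        best = (np.inf, 0, 1)
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Average linkage: mean distance over all band pairs.
                d = dist[np.ix_(clusters[a], clusters[b])].mean()
                if d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]  # merge the closest pair
        del clusters[b]
    return clusters


def middle_band(cluster):
    """Pick the central band of a cluster as its representative."""
    ordered = sorted(cluster)
    return ordered[len(ordered) // 2]


# Synthetic illustration only: 12 bands built from 3 underlying signals,
# so that each run of 4 adjacent bands is highly correlated.
rng = np.random.default_rng(0)
signals = rng.normal(size=(50, 3))
demo = np.repeat(signals, 4, axis=1) + 0.05 * rng.normal(size=(50, 12))
groups = cluster_bands(demo, 3)
print([sorted(g) for g in groups], [middle_band(g) for g in groups])
```

Cutting the merge sequence at different numbers of clusters plays the role of reading the dendrogram at different heights. Taking one band per cluster is only one possible rule; the study itself tested several band combinations.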
The result of hierarchical clustering showed that clusters appeared not only in the typical spectral regions such as blue (400-504 nm), green (526-582 nm), red (654-686 nm) and near infrared (766-900 nm); many of them were also generated along the edges. These include the blue-green edge (504-526 nm), the green-red edge (582-654 nm) and the red edge (697-755 nm). Seven different combinations of bands were selected to test the discriminating power of different spectral regions (Table 2). The results are shown in Table 3. The conventional 4-band combination (B, G, R, NIR) generated an average overall accuracy of 46.11%. This was not significantly different from that (44.66%) obtained using only 3 edges (BG, GR, R). Both results were inferior to that obtained using 5 bands along the red edge (55.62%). The result improved substantially when more bands and appropriate combinations were tested. The combination of 7 edges yielded an average overall accuracy of 72.97%, whilst the conventional 4 bands plus 3 edges gave 77.16%. Adding more bands along the edges improved the result further to 87.26% (11 bands) and 89.30% (13 bands). These results were also superior to those produced using discriminant analysis.

Conclusion
This study shows that band selection helps improve classification accuracy for tree species recognition. The bands selected by stepwise discriminant analysis lay mainly in the spectral regions around the green peak and the red edge. Hierarchical clustering provided a more detailed analysis for band selection. It showed the value of the spectral regions along the BG, GR and red edges, which have received less attention in many studies.
Acknowledgement
This research is supported by an RGC earmarked grant (CU96223/UPG).

References