Figure 2. Merged cluster classes representing high, medium and low biomass areas derived from red-edge (A), WA1 (B), WA3 (C), and WA4 (D).

Abstract
The expected increase in the availability of hyperspectral data derived from both private and public remote sensing sensors offers the potential for enhancing forest information extraction such as those used for estimating canopy structure and above-ground biomass. The objectives of this research are (1) to analyze the geometric properties of forest-related parameters to enhance information extraction of canopy structure and above-ground biomass by using hyperspectral data obtained from high-resolution, Airborne Visible/Infrared Image Spectrometer (AVIRIS) data, and (2) to develop algebraic algorithms based upon geometric properties of the red-edge and water-absorption bands to increase the amount of variance of forest parameters associated with structure and biomass.

The data were obtained from a special, low-altitude AVIRIS mission performed in fall 1998, with 3.8-meter spatial-resolution data captured above Congaree Swamp National Park in central South Carolina. A subset of 64 bands covering selected spectral regions of both visible and infrared bands is used for analysis.
      Methods include high-dimensional clustering of band groups that represent red-edge, near-infrared plateau, and four-water absorption regions. The results were used for geometric analysis that led to developing four separate algebraic algorithms to increase the variance among forest-related properties. Three classes of biomass were digitized from the visual interpretation of color composite images and consequently used for spatial correlation analysis.
      Results suggest that biomass and canopy structure estimates derived from water-absorption bands WA3 (1428.6 to 1490.2 nm) and WA4 (1902.2 to 1993.7 nm), as well as derived ratio images, provide better results than do other water-absorption bands, red-edge bands, or near-infrared plateau bands when compared to visually interpreted classes.

Introduction
As data from new sensors and satellites become increasingly available, forest-parameter estimates derived from remotely sensed data offer increasing benefits to ecological modeling. Hyperspectral sensors collect energy in narrow bands - usually 10 nanometers wide - between visible and long-wave infrared. These data have the potential to provide information associated with differences in the structure and biochemical composition of surface features.
      Ecological research has focused primarily on local-scale analysis of environmental processes. In order to improve our understanding of Earth phenomena, the gap between local and regional ecological research must be filled. Studies that focus on vegetation parameters are especially important because they provide a key link between local and global scales. To understand the role ecosystems play in controlling the composition of the atmosphere, it is necessary to quantify processes such as photosynthesis and primary production, decomposition and soil carbon storage, and trace gas exchanges. Photosynthesis is the link whereby surface-atmosphere exchanges such as the radiation balance and exchange of heat, moisture, and gas can be inferred. It also describes the efficiency of carbon dioxide exchange and is directly related to the primary production of vegetation.
      Canopy relationships with solar energy reflectance probably represent the most complex energy interactions among surface features. The various wavelengths of incoming solar radiation interact differently with the forest's physical and biochemical components. This interaction is determined by parameters such as crown density, stand height, canopy depth, leaf geometry, leaf pigment reflectance, leaf water content, and forest-floor reflectance. Many of these parameters can be measured by using remotely sensed data.
      The expected increase in the availability of hyperspectral data brings with it new challenges for image-processing technology. There are no algorithms developed at the present time that are able to produce results as accurate as the ones available for current multispectral analysis. The higher dimensionality of hyperspectral data imposes a significant obstacle to information extraction. Differences in hyperspectral space are subtler than those of multispectral space. Algorithms must be more sensitive to lower-magnitude changes in feature space in order to identify different features without losing valuable information. The experiences derived from multispectral analysis have shown important relationships between algorithm performance and data dimensionality for extracting spatial information. The required number of training samples is linearly related to the dimensionality of a linear classifier, and to the square of the dimensionality of a quadratic classifier. In a nonparametric case it is estimated that, as the number of dimensions increases, the sample needs to increase exponentially to have an effective estimate of multivariate densities. It is this reason that nonparametric schemes, including the currently popular neural network methods, are less attractive for remote sensing. Data dimensionality has an important effect on information extraction in conventional multispectral data, but it becomes the paramount concern when dealing with hyperspectral data.
       Studies focusing on the remote sensing analysis of physical and biochemical canopy properties are quite extensive. Ever since the academic work of Allen and others, models intended to simulate canopy reflectance for the purpose of improving remote sensing analysis have flourished. Most of these models take as input the following parameters: leaf reflectance and transmittance, soil reflectance, leaf area index, average leaf inclination, viewing geometry, and incoming irradiance. Results obtained from model simulation have shown that the leaf area index affects canopy reflectance over the entire spectrum. Leaf angle inclination acts very similarly to leaf area index. Changes in leaf mesophyll structure induce small variations, primarily concentrated where leaf absorption is low, such as in the near-infrared range. Chlorophyll concentrations and changes in water-equivalent thickness result in wide variations of canopy reflectance.
      Forest structure is determined by vegetation parameters such as tree density, canopy height, canopy layering, and tree-size distribution, and by topographic parameters such as elevation, slope, and aspect. Aboveground biomass is related to leaf volume, stem and branches volume, and leaf area. The reflectance of incoming solar radiation is affected in different ways by these forest parameters.
      Over the last ten years, many studies utilizing Airborne Visible/Infrared Image Spectrometer (AVIRIS) data have been developed. Several of those have demonstrated the potential of using hyperspectral technology for forest resource assessment and analysis. Most of these studies have focused on chaparral, savanna, and softwood environments. This article, however, focuses on an old-growth hardwood forest located in the Congaree National Park of central South Carolina.
      Geometry and red-edge shift provide information related to vegetation structure and composition. Red edge refers to a steep increase in vegetation reflectance adjacent to chlorophyll absorption in the transition zone between red and near-infrared regions of the spectrum. In the near-infrared region, vegetation reflectance has a higher value due to the increased reflectiveness of leaf cell walls. Spectral shifts of the red edge tend to minimize the influence of confounding factors such as soil reflectance, specular component, and atmospheric effects. The red-edge vegetation in AVIRIS data comprises eight contiguous spectral bands between 0.6852 and 0.7532 meters. The geometry of red-edge measurement can be related to changes in vegetation phenology and to such environmental stress as changes in precipitation and temperature.
      Absorption of solar radiation by liquid water can be detected by the reflectance pattern of incoming energy in different regions of the spectrum. Variable amounts of water in the plant tissue associated with various forms and quantities of leaves in a canopy can be used to discriminate different types of forests. The bands most sensitive to water in plant tissue are called liquid water bands and are located at these approximate regions of the spectrum: 970, 1220, 1480, 1940, and 2500nm.

Data and Methods
AVIRIS collects reflected energy in 224 contiguous spectral channels, distributed in narrow bands between 390 and 2500 nanometers. The data were collected in 873 columns perpendicular to the flight line, and at a variable length along the same flight line. In a regular AVIRIS image the sensor simultaneously views all 224 pixels of a given location, with 20-meter spatial resolution on the standard 20km flight altitude, and 16-bit radiometric resolution. In 1998, however, NASA's Jet Propulsion Laboratory performed a special, low-altitude mission for selected locations. During this mission AVIRIS data, including those used in this research, were collected with 3.8-meter spatial resolution. A small sample of the entire image was extracted from the main scene for this study, a total of 311 lines and 407 columns.
      As presented in Table 1, a subset of 64 bands is used for analysis. The AVIRIS column shows the original band numbers as collected by AVIRIS. The center and range columns show the center point and respective width of each band in nanometers. These bands were divided into seven groups: red edge 14-21, near-infrared plateau 22-28 (NIR plateau), water absorption one 29-37 (WA1), water absorption two 38-43 (WA2), water absorption three 44-49 (WA3), water absorption four 50-58 (WA4), and water absorption five 59-64 (WA5). With the exception of the green and red bands (1 through 13), these groups were used for high-dimensional clustering.
      High-dimensional clustering was implemented using the ISODATA Iterative Clustering Algorithm from Multispec image-processing software, plus an along-first covariance eigenvector for locating the initial cluster centers. A total of 120 initial cluster centers were assigned, and the system proceeded to place them equidistant along the first principal component. Once centers were determined, the algorithm proceeded to associate each pixel with the cluster center that had the smallest Euclidean distance from each center. The process continued until at least 98 percent of the pixels had been changed. The total number of clusters produced in each group of bands varied from 15 to 113, depending upon the amount of variance present in each group of bands or ratio images.
       Geometric analysis of the cluster classes thus produced was implemented to identify geometrical properties of the results. Algebraic algorithms (ratios) were developed based upon the geometric analysis that was performed to improve feature identification of related vegetation properties. The following equations were used for rationing.
       In equation 1, the higher steepness between bands 15 and 17 2 and bands 18 and 20 4 is computed against the lower steepness between bands 14 and 15 1, 17 and 18 3, and 20 and 21 5. The product is multiplied by a stretching factor (Sf) determined by the radiometric resolution of the data - for example, 100 for 16-bit- and then added to the radiance values of band 16. Equation 2 calculates the difference in steepness between bands 29 and 30 1 and bands 32 and 34 2 multiplied by a stretching factor (Sf = 100). Equations 3 and 4 take advantage of the clearer distinction between vegetation clusters and amplify this distinction by adding all three bands, then multiplying the product by a stretching factor (Sf = 10 and Sf = 5, respectively).
      The images produced by rationing were used for another round of high-dimensional clustering by using the same settings as described above. The clusters thus produced were grouped into three major classes of biomass (high, medium and low) and four classes of structure (zero to 10 percent, 10 to 25 percent, 25 to 60 percent, and 60 to 100 percent cover).
       Vector layers delineating three biomass classes and four structure classes were produced by the visual interpretation of color-composite images. These layers were later converted into a raster format for spatial correlation analysis.
       ERDAS Imagine was used to develop a spatial correlation analysis to obtain spatial statistics, comparing the different band and ratio classifications among themselves by overlaying them with the biomass and structure polygons produced thorough visual interpretation.

Results and Discussion
Results obtained from the high-dimensional clustering of selected band combinations allowed a geometric analysis of the data to produce the algebraic equations presented herein. Figure 1 shows four examples of cluster classes representing high, medium and low biomass as obtained from red-edge, WA1, WA3, and WA4 band groups. Low and medium biomass classes overlap between bands 16 and 21 in the red-edge region while, in WA1, these classes overlap in all bands. In WA3 and WA4 high and medium biomass classes overlap, but the distinction between these and low biomass is clearer than that of the other two groups.
       Figure 2 shows the spatial distribution of three biomass classes produced by the same band groups shown in Figure 1. WA3 and WA4 (C and D respectively in the figure) have better spatial distribution than do the other two groups, when all three classes are considered. The overlap between high and medium biomass shown on Figure 1, however, can also be noticed in the spatial distribution of these two classes.
       Figure 3 shows the distribution of three biomass classes as identified by the visual interpretation of color-composite images. This image was used for spatial correlation analysis with all classifications produced. Figure 4 shows the spatial distribution of biomass classes derived from clustering of the four combined ratio images. The results obtained from this classification are visually similar to the ones obtained from the WA3 and WA4 classifications.
       Figure 5 shows an overlay of the two images shown on Figures 3 and 4. Notice the number of pixels mixed between the high and medium classes (blue pixels), as well as the low and medium classes (green pixels). The same type of analysis was performed for each individual band group and ratio image. The results of this analysis are presented in Table 2. The top three rows of Table 2 show the total number of pixels that were classified as one of three biomass classes using band groups and ratio images. The bottom six rows show the total number of pixels produced by overlay analysis of each classification to the digitized biomass image. Note in these rows that WA3, WA3 Ratio, and WA4 Ratio produced the best results for high, medium and low biomass respectively. RE Ratio produced the lowest mixing value between high and medium biomass, and WA3 produced the lowest mixing values for both medium and low, and for high and low biomass. Note as well the increase in the number of pixels correctly classified as medium and low biomass from WA1 to WA1 Ratio, and from WA3 to WA3 Ratio.
      Table 3 shows the total number of clusters produced in each classification. Compare the same two examples above to the numbers presented on the same band groups and ratio images. In the first (WA1 to WA1 Ratio), the number of clusters decreased from 84 to 14. In the second (WA3 to WA3 Ratio), the number of clusters increased from 38 to 87. A similar decrease occurred between Red-edge and RE Ratio. The decrease in the number of clusters can be explained by the decrease in the amount of variance in the data, due primarily to the decreasing effect of the algorithms used to produce these two ratio images (refer to equations 1 and 2 above). Therefore ratio images WA3 and WA4, and band group WA4, provided the best overall performance in identifying the three proposed biomass classes. These data sets also hold the potential of identifying more detailed biomass classes due to the greater amount of variance contained in them as compared to the other band groups.
      Figure 6 presents the four forest-cover classes produced by visual interpretation of color-composite images. Figure 7 shows the result of merged clusters produced by WA3 ratio-image classification. These two images were overlaid, and the result is shown in Figure 8. The same operation was performed for WA3, WA4, and WA4 Ratio classifications, and the number of pixels produced by this procedure is presented in Table 4. The results shown in Table 4 are for the best four classifications only.
      Note in the table that overlay results are better for the same three biomass band combinations: WA4, WA3 Ratio, and WA4 Ratio. However, the amount of mixing classes between larger covers remained high. The large number of clusters merged into single classes explains this circumstance. These clusters represent variations in the surface features that can lead to a finer delineation and quantification of forest-cover types. The same assumption can be applied to biomass amounts. Detailed ground information on both biomass and forest cover need to be obtained in order to test the validity of this assumption.

Conclusions
Results show that estimates derived from water absorption bands WA3 (1428.6 to 1490.2nm) and WA4 (1902.2 to 1993.7nm), and derived ratio images provided better results than other water-absorption bands, the red-edge bands, and the near-infrared plateau bands when compared to visually interpreted classes. Results obtained during clustering suggest that ratio images WA3 and WA4 have the potential for identifying a larger number of biomass and structure classes. However, this assumption can only be tested if detailed ground information becomes available. The use of selected spectral bands provided by AVIRIS data and derived ratios allow the use of existing classification algorithms to analyze hyperspectral data, as shown by the methods used in this research project.

About the Author:
Nelson Dias is a faculty member of the Department of Geography, Geology and Anthropology at Indiana State University (Terre Haute). He may be reached via e-mail at: [email protected].

Back