Earth Observation Magazine

Validation of the Landsat 7 Long-term Acquisition Plan
Part 1

The launch of Landsat 7 on April 15, 1999 ushered in a new era of managing global survey missions. For the first time (for Landsat or any similar mission) an automated Long-Term Acquisition Plan (LTAP) was employed to populate the US Archive at the USGS EROS Data Center. The automated LTAP system was designed to globally collect sunlit, essentially cloud-free, and seasonally refreshed Landsat acquisitions for all land areas of the Earth. The LTAP has successfully populated the archive such that most users requiring data find it already in the archive, from the right time of year, substantially cloud-free, and under the right radiometric conditions, rather than having to order new acquisitions. Requests for data acquisitions outside of the LTAP have been lower than for any other Landsat mission. After three years of operations, we have undertaken an exhaustive analysis of the performance of the LTAP in terms of seasonality, cloud contamination, gain setting strategy, and satisfaction of science community requirements. This article addresses the evolution of the LTAP during Landsat-7 operations, the validation strategies that were designed, and the results of the validation process. In addition, lessons learned from the validation exercise and future updates to the LTAP strategy are discussed.

Terry Arvidson, Richard Irish, Brian Markham, Dr. Darrel Williams, Jay Feuquay, John Gasch, and Dr. Samuel N. Goward

A new automated approach to Landsat mission operations was introduced with the deployment of Landsat 7. A long-term acquisition plan (LTAP) was developed prior to launch to systematically populate the US archive, held at the USGS EROS Data Center, with seasonally refreshed, essentially cloud-free observations of all land areas of the planet (Arvidson, et al, 2001). The LTAP system addresses terrestrial seasonality, cloud climatology versus daily NOAA-predicted cloud cover, the two possible gain states on the Enhanced Thematic Mapper plus (ETM+) sensor on Landsat 7, and various technical and/or operational constraints such as sensor duty cycle and battery reserve power. All of these factors are merged in an Oracle-based information system that resides in the Landsat mission operations center (MOC). Information on predicted cloud cover and mission technical status are fed to the MOC daily. The MOC then prepares a mission operations plan, determining which Landsat WRS paths and rows will be acquired for the next 37 hours. An updated operations plan is uploaded to the satellite every 24 hours to command the payload data acquisition subsystems.
Prior to launch, a team of researchers at NASA Goddard and the University of Maryland developed the specific characteristics of the LTAP system. As much as possible, the LTAP team depended upon then-existing information about land seasonality from NOAA Advanced Very High Resolution Radiometer (AVHRR) global observations, and on cloud cover climatology, developed from AVHRR and GOES observations by the International Satellite Cloud Climatology Project (ISCCP) (Arvidson et al., 2001; Goward et al., 1996; Goward et al., 1999). A similar approach was taken with ETM+ gain state considerations. This automated approach was reviewed by the Landsat Science Team in 1998, prior to launch (Goward & Arvidson, 1998).
The LTAP system has now predominately operated Landsat 7 for the last 3 years, July 1999-July 2002. (There are times, such as during calibration maneuvers and orbital realignments, when mission operations are take over by ground control to accomplish tasks other than the primary observation missions. Also during national emergencies the system is targeted to meet other national needs.) The LTAP development team is currently pursuing a validation of the LTAP, to insure that it is accomplishing the goals of the LTAP, initiated in 1995. The specific goals include:

-- All Land Areas: Observations will be collected for all Landsat Worldwide Reference System (WRS) areas that contain any land (including islands and coral reefs where possible).
-- Seasonally Refreshed: All land locations will be revisited with sufficient temporal frequency to capture major seasonal changes (e.g. spring, summer, fall, winter, dry versus wet season, low versus high sun).
-- Essentially Cloud Free: To meet the previous requirements, at least one observation per season should be as cloud free as possible — this is an elusive goal in cloud-prone regions of the globe.
-- Radiometry: The LTAP considers the radiometric dynamic range across the optically reflective spectrum. The ETM+ sensor acquires measurements in either “low” or “high” gain for each spectral band. This is a trade-off between observation saturation and radiometric precision. The desire was to minimize saturation while maximizing scene radiometric contrast in the acquired observations. The low light factor and the two gain states available with the ETM+ sensor required a radiometric aspect to the LTAP method.

As the LTAP system was developed, members of the Landsat Science Team, as well as other members of the Landsat user community, pointed out that seasonality as defined by an AVHRR normalized difference vegetation index (NDVI) would not necessarily meet all their needs. Several such “niche” communities were identified, and special provisions within the LTAP were incorporated to meet their needs:
-- Glaciers & Ice Sheets
-- Volcanoes
-- Fire study sites
-- Ocean Islands
-- Calibration sites
-- Sea ice
-- Coral Reefs
-- Boreal forests
-- Rainforests
-- Agricultural areas of interest to the USDA Foreign Agricultural Service

A validation plan to evaluate the basics of the Landsat-7 LTAP approach was developed by the LTAP Working Group at Goddard and the University of Maryland. The team proposed and developed independent validation approaches for each of the unique aspects of the LTAP:
-- Seasonality: In this case, the acquired coverage, at monthly time steps, resident in the EROS Data Center archive was compiled to evaluate how well the seasonality was achieving desired goals. WRS-based maps of monthly coverage, as well as the lowest acquired cloud cover, were examined.
-- Cloud Cover: There are multiple aspects to this methodology, including the within-scene automated cloud cover assessment (ACCA), the ISCCP cloud climatology, and the comparative performance of the NOAA National Centers for Environmental Prediction (NCEP) cloud forecast versus the ACCA results and the ISCCP climatology. In effect: Are we better off using cloud forecasts versus climatology versus nothing?
-- Radiometry: Quite early in the Landsat-7 mission, the Landsat Science Team members reported observing measurement saturation in the near-infrared band for full canopy agricultural fields observed during the summer in the mid-latitudes. So began the saga of changing gain strategies as a function of user complaints. Thus the validation approach in this case has been predominately based on reaction to user input. Beyond these on-going changes, we also carried out a seasonal assessment of measurement saturation, as a function of spectral

This LTAP validation exercise is significant for the following reasons:

1 Refine LTAP-based mission operations for the remainder of the Landsat-7 mission (through 2006 at the earliest).
2 Identify potential acquisition reduction strategies for the purposes of extending mission life
3 Provide a refined assessment of the LTAP approach for the benefit of the Landsat Data Continuity Mission (LDCM) and all future Landsat missions.
Each of the validation analyses was complex and led to extensive in-depth consideration of the questions raised. In this article we will only present a high-level summary of the conclusions we have reached without presenting many of the details that underpin our conclusions. We hope that in the near future we will be able to compile for publication more detailed manuscripts on the various aspects of the Landsat-7 LTAP validation.

Land Definition
The Landsat-7 land database contains over 53,000 geographic tags (Figure 1) identifying for each WRS land scene the geopolitical extents, land-cover type, and other useful information. The geographic tags were not included in the initial system design but were added just before launch. They have proven extremely useful for answering queries, selectively applying scheduling rules, and conducting analyses.

These include:
-- Identification of nearly 17,000 Worldwide Reference System (WRS) scenes which consist of land and shallow coastal waters. Each of these scenes are acquisition candidates for the US archive (Arvidson et al., 2000).
-- Land scenes also set limits for scheduling International Ground Station (IGS) acquisitions, as IGS requests over water are not allowed.
-- Geographic tags serve as keys to the scheduling software, triggering special logic paths used during scheduling (e.g., high latitude overlap, night or ascending scene duty cycle constraints);
-- Geographic tags allow collection of statistics by interest group, such as the various science niches (e.g., reefs, volcanoes) (Arvidson et al., 2000).

Validation Strategies
The geographic tags were used to sort and display the data, evaluate the tag versus the WRS, and identify errors.
Land scenes that are surrounded on two or more sides by water were classified as “fringe coastal” scenes. Browse images of these scenes were visually inspected in the context of their neighboring scenes to determine whether the land portion of each fringe scene was included within the surrounding scenes. If so, then the scene could be removed from the land database with no loss of coverage.
Finally, efforts are underway to compare the LTAP land database with the land database generated by the Earth Satellite Corporation (EarthSat) in their compilation of the Global Landsat 2000 Scientific Data Purchase.
Validation Outcome
As expected with such a large database, we found 64 errors in the geographic tags and in the WRS assigned to some of the tags, and we found 32 WRS scenes in the land database with no entries in the seasonality file. These were corrected in August 2002. The analysis of the list of fringe coastal scenes identified 352 scenes that could be deleted from the land and seasonality databases. These were deleted in August 2002.
Future analysis of the large land database to refine the geographic tags and correct any reported errors in location is planned. As an example, future work will include adding a detailed reef definition to the land database, using inputs from the reef community to correct the location of reefs based on the imagery, and adding the names of the reefs to the database instead of using the generic “reef niche” tag. We have discovered islands in the EarthSat database that had not been included in the LTAP database, and reefs in the LTAP database that are not included in the EarthSat database. The comparison is ongoing and results will be exchanged with EarthSat.

Seasonality
The basic goal of the seasonality portion of the LTAP is to insure that Landsat 7 acquires new observations for each new variation in land surface conditions. In general this is with respect to the condition of local vegetation cover, although there are other reasonable definitions as noted by the various niche communities discussed later in this article. For vegetative cover, the typical mid-latitude perspective is “spring, summer, fall, winter,” approximate quarterly (3-month) sectors of the annual 12-month Earth orbital cycle. However, seasonality varies by region of the planet. For example, wet and dry cycles in lower latitude locations where temperature is not the primary determinant of seasonality, or high latitude locations where there is a very short summer and a very long winter. Our effort to characterize these regional variations in “seasonality” led us to employ the AVHRR observation record of land in the visible and near-infrared, converted to a measure of green vegetation presence with the normalized difference vegetation index (NDVI) (Arvidson et al., 2001; Justice et al., 1985). Our analysis produced monthly WRS location assessments to acquire-once or acquire-every opportunity (i.e. ~ twice) during that month (Figure 2).
Operational acquisition performance has been closely monitored throughout operations and has been fine-tuned through database, parameter, and software updates to conform to this definition of seasonality.

Validation Strategies
Our primary interest was to evaluate whether the LTAP was acquiring the basic spatio-temporal coverage we would have expected. Landsat 7’s sensor, limited by a 16% maximum duty cycle, should optimally be able to acquire nearly the entire Earth’s land areas every two 16-day orbital cycles. We therefore began with examination of WRS maps consisting of observations collected over a 32-day (2-cycle) period (Figure 3). Following our initial analysis of these 2-cycle maps, we determined that analysis of seasonal quarterly maps would also be important, where the seasons were defined to approximately coincide with the Earth’s orbital solstices and equinoxes (i.e. Dec-Feb, Mar-May, Jun-Aug., Sept.-Nov.) In support of this analysis, we also employed the newly implemented Landsat global visualization (GLOVIS) data exploration tool, available at the EROS Data Center website (http://glovis.usgs.gov/). This system provides interactive access to the Landsat-7 metadata, including the browse images. This facilitated rapid and qualitative evaluation of specific examples of the results shown in the global WRS maps noted previously.

Validation Outcome
Based upon the 2-cycle and quarterly WRS maps, we concluded that the LTAP is operating the Landsat-7 mission to acquire better than 90% global coverage for each quarter of the year. Thus for most locations, we are accomplishing the desired seasonal coverage originally specified as a goal of the LTAP.
However, we do note that the geographic distribution of the observations is not entirely in line with our expectations. Our conclusions concerning regional geographic coverage include:

-- Too many “desert” scenes are being acquired currently. Our estimate is that on average 8.1 scenes are acquired annually of each WRS desert scene. Of these 5 are cloud-free and 3.1 have cloud cover that averages over 43%! Our seasonality analysis was clearly too conservative (or too generous, depending on how you look at it) in addressing desert coverage. We do have to worry about desert-based irrigated agriculture, for example in central Saudi Arabia and Sudan. This activity appears to be rapidly expanding. Further analysis of the phenomenon is warranted.
-- The mission is consuming substantial acquisition resources in tropical, cloud-prone regions (as noted in Figure 3). A quick examination of the browse imagery suggests a) there are only selected seasons of the year when low cloud cover images are acquired, and b) the only probable way of producing “cloud-free” coverage in these locations is to employ image composition methods, such as those used with AVHRR and MODIS measurements today. These observations indicate that a more in-depth analysis of tropical coverage is warranted. This problem may in part be addressed with a new 80% cloud cover threshold cut-off rule that has recently been implemented in the LTAP. That is, if the NCEP cloud cover forecast is for 80% cloud cover or worse, acquisition will not be scheduled.
-- There appears to be excessive coverage of the high mid-latitudes (>45 degrees) in the winter. Apparently our conservative (generous) seasonality file prescribes acquisition every opportunity during the winter and the low cloud cover makes such scenes good candidates for acquisitions. However, having seen the frozen, snow-covered boreal and tundra regions once, there seems little value to acquiring them again.
-- The reverse is true in these mid- to high-latitude regions during the summer. As with the tropical forest regions, these boreal and tundra zones are cloud-prone during the summer. However, only one scene is acquired during the summer (June-August) timeframe. There may be a need to set a high summer priority on acquisitions for these locations during the summer, particularly for Russian Siberia and Northern China, where there is no international ground station coverage to augment the LTAP.
We now believe that the binary NDVI interpretation (acquire-once, acquire-every opportunity) we deployed in this version of the LTAP does not take full advantage of the LTAP system functions. In fact, use of a continuous variable extract of the NDVI record may well be far more successful in addressing a number of the shortcomings we have already discovered, specifically with respect to desert acquisitions. Implementation of such a new seasonality definition would require an additional commitment to development, testing and validation but we believe that such an approach would produce a substantial improvement in LTAP operations.

Clouds
To achieve the least cloud contamination in acquired imagery, the LTAP considers cloud conditions in the past, the present, and nominal:
-- Cloud forecasts: The National Centers for Environmental Prediction furnish daily global weather forecasts. These forecasts are used as a scheduling aid to avoid acquiring scenes likely to be cloud contaminated.
-- Cloud climatology: Acquisition scheduling decisions employ the forecasted cloud conditions normalized against the historical average cloud cover. Priority of a candidate acquisition appreciates if the forecasted cloud cover is better than the historical average or depreciates if the forecast is worse than average. (Gasch & Campana, 2000)
-- Cloud cover assessment: Image processing assesses the amount of cloud contamination in each acquired image. This automated cloud cover assessment (ACCA) score (Irish, 2000) is conveyed back to the scheduling system as an indicator of past success. Future acquisition decisions consider the results of prior acquisitions as defined by the ACCA.
Cloud avoidance is a major goal of the LTAP and is accomplished by using all three types of data. In some areas of the world, cloud avoidance is not attempted. These are the 50 United States, where data is required every opportunity regardless of cloud cover, and Antarctica where the reliability of the cloud predictions has been questionable due to persistent ground fog.

Validation Strategies
Inter-comparisons were made among the different types of cloud data used to understand the relationships, compare results, and identify any biases: climatological versus predicted, climatological versus assessed, assessed versus predicted.
To evaluate ACCA performance, a blind study was conducted on 192 images selected randomly across 8 latitudinal zones, 21 locations in each zone. 160 of the 192 images (for 20 of the 21 locations) were selected from the time interval when the Normalized Difference Vegetation Index (NDVI) was within 10% of its peak value. For the other location, four seasonal images were chosen for each latitudinal zone. Various Adobe Photoshop tools were used to generate binary cloud masks from each of the browse images; histograms of the cloud masks yielded cloud mask percentages for comparison with the calculated cloud cover scores.
To determine the regional success of cloud avoidance, maps showing the best cloud cover score for each WRS scene and the influence of cloud cover on acquisition frequency were generated and analyzed (Figure 4).

Validation Outcome
Our analysis has demonstrated that the ACCA algorithm is performing quite well (Figure 5). As expected, the ACCA algorithm did under-report very thin high cirrus clouds (Irish 2000), especially where water dominates a scene. Also, commanded changes
to the sensor gain during image acquisition presents a problem for the algorithm, which can over-estimate clouds in the image area following the gain change.
Considering the 7 scenes that differed by more than 15%:
-- Two scenes under-reported (17% and 39%) due to thin cirrus not visible to the ACCA algorithm
-- One scene under-reported (22%) due to weaker cirrocumulus cloud signature over water
-- One scene over-reported (17%) due to gain change
Three scenes with apparent over-reporting (26%, 38%, 30%) proved not true because the remaining clouds are not visible in the spectral bands shown in the browse images.
If the mask scores are adjusted for the three over-reporting cases where ACCA appeared to work, then ACCA is within 10% of truth for 95% of all scenes examined. Considering the 160 scenes at peak NDVI, ACCA is within 10% of truth 97% of the time. ACCA under-performed for peak NDVI scenes in the mid-latitudinal zone (30-45 degrees North), with results within 10% of truth only 85% of the time, however several of the over-reporting cases fell into this zone. A seasonal bias is suggested in the Boreal South zone (45-60 degrees South), but 2 of the 4 seasonal scenes were contaminated by gain changes and were thus over-estimated. We concluded that ACCA is performing as designed in most cases and is providing reliable feedback to the scheduling system.
Comparisons among the cloud file types revealed a slight bias (approximately 3%) in the cloud predictions to the high side; this value is considered to be in the noise. The predictions are also more accurate at the extremes (0% and 100%) as shown in Figure 6. Based on this, we recently decided to implement a cloud cover cut-off of 80%; if the forecasted cloud cover for a scene is worse than 80%, the scene is not considered a candidate for scheduling. This rule is applied only to scenes outside the 50 US states. Being a recent change, analysis of the consequences of this rule change is pending. Modeling runs predict that a tiny percentage of clear scenes will be missed and a significant amount of duty cycle will be reallocated to acquiring less cloudy scenes.
There was no apparent degradation due to the staleness of cloud forecasts out to 36 hours. Degradation of cloud forecasts at higher latitudes was evident during the campaigns over Antarctica. This combined with the ACCA algorithm’s difficulty in discriminating clouds from snow and ice (Irish 2000) led us to not consider cloud avoidance over Antarctica and to rely on the feedback of manual cloud cover assessments to guide the scheduler for future scene acquisitions over the continent.
Two interesting phenomena were observed in our analysis of browse scenes and the associated cloud assessments:
1. Island effect: For isolated oceanic islands and reefs, an assessment of low cloud cover is misleading because the few reported clouds are almost always over the island or reef, rendering the scene useless; conversely, for islands and reefs that are close to continental coasts, an assessment of high cloud cover is also misleading because the clouds are almost always over the continental landmass and the offshore islands and reefs are in the clear.
2. Western continental coastal cloud/fog banks: We found poor correlation between usefulness of the image and the ACCA score for these scenes, where the morning cloud/fog banks are hanging off the coast over the water, and the land is clear. These scenes are reported at 50 or 70% cloud cover but the land area is clear. This was primarily seen in North and South America.
Both of these phenomena highlight the importance for users to check the browse images and ascertain that their areas of interest are not under the clouds. Also, in their queries, it is important to keep the cloud cover threshold wide open in order to not exclude some of the scenes as described above that have high cloud cover assessments but generally clear land areas.
Future plans include investigating the feasibility of employing a land mask in the cloud assessment process, resulting in a scene utility score as well as a cloud cover score. We will consider manual cloud assessment in other areas of the world where automated cloud assessment is unreliable, and explore the compositing of images to construct cloud-free images in areas that are persistently cloud-covered. We will also investigate a cloud-confidence factor based on our assessment of latitudinal and regional reliability of the cloud forecasts.

(End of Part One—To be continued)

About the Authors
Terry Arvidson is a Senior Systems Engineer for Lockheed Martin.
Richard Irish is a Senior Systems Engineer for SSAI.
Brian Markham is a Landsat Calibration Scientist for NASA/Goddard Space Flight Center.
Dr. Darrel Williams is a Landsat Project Scientist for NASA/Goddard Space Flight Center.
Jay Feuquay is a Landsat 7 Program Data Acquisition Manager for USGS, EROS Data Center.
John Gasch is a Landsat 7 Mission Analyst for Emalico LLC, Goddard Space Flight Center.
Dr. Samuel N. Goward is a Geography Professor for the University of Maryland.

Back