| Literature DB >> 35236869 |
Li Lin1,2, Liping Di3,4, Chen Zhang1,2, Liying Guo1, Yahui Di1, Hui Li1,2, Anna Yang1.
Abstract
Space-based crop identification and acreage estimation have played a significant role in agricultural studies in recent years, due to the development of Remote Sensing technology. The Cropland Data Layer (CDL), which was developed by the U.S. Department of Agriculture (USDA), has been widely used in agricultural studies and achieved massive success in recent years. Although the CDL's accuracy assessments report high overall accuracy on various crops classifications, misclassification is still common and easy to discern from visual inspection. This study is aimed to identify and resolve inaccurate crop classification in CDL. A decision tree method was employed to find questionable pixels and refine them with spatial and temporal crop information. The refined data was then evaluated with high-resolution satellite images and official acreage estimates from USDA. Two validation experiments were also developed to examine the data at both the pixel and county level. Data generated from this research was published online in two repositories, while both applications allow users to download the entire dataset at no cost.Entities:
Year: 2022 PMID: 35236869 PMCID: PMC8891360 DOI: 10.1038/s41597-022-01169-w
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Comparing classification results for fields in Queen Anne’s County, Maryland, in 2019. The left image shows landcover data from USDA CDL, and the right image shows data from Refined CDL. The legend was adapted from the official CDL color map (https://nassgeodata.gmu.edu/CropScape).
Fig. 2Separating all CLD pixels into different categories based on connecting neighborhoods (Pixels with different colors indicating different CDL codes).
CDL codes and class names that were labeled as constant features in the algorithm[9,13].
| CDL Code | Classification Name |
|---|---|
| 63 | Forest |
| 82 | Developed |
| 83 | Water |
| 87 | Wetlands |
| 111 | Open Water Perennial |
| 121 | Developed/Open Space |
| 122 | Developed/Low Intensity |
| 123 | Developed/Med Intensity |
| 124 | Developed/High Intensity |
| 141 | Deciduous Forest |
| 142 | Evergreen Forest |
| 143 | Mixed Forest |
Fig. 3Proposed post-classification decision tree algorithm.
Fig. 4The refinement workflow of the proposed decision tree model.
Matrix of validation samples and their classifications in CDL and R-CDL, 267 respectively.
| RCDL | Corn | Cotton | Sorghum | Soybeans | Peanuts | Barley | Spring Wheat | Winter Wheat | Dbl Crop WinWht/Soybeans | Oats | Millet | Canola | Alfalfa | Sugarbeets | Dry Beans | Sugarcane | Peas | Christmas Trees | Dbl Crop WinWht/Corn | Blueberries | Non-Crop | Total | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CDL | 1 | 2 | 4 | 5 | 10 | 21 | 23 | 24 | 26 | 28 | 29 | 31 | 36 | 41 | 42 | 45 | 53 | 70 | 225 | 242 | NC | ||
| Corn | 1 | — | 0 | 0 | 14 | 0 | 0 | 0 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 48 | 65 |
| Cotton | 2 | 0 | — | 1 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 | 11 |
| Rice | 3 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| Sorghum | 4 | 1 | 0 | — | 2 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 | 13 |
| Soybeans | 5 | 20 | 2 | 0 | — | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 58 | 85 |
| Sunflower | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 2 |
| Peanuts | 10 | 0 | 0 | 0 | 0 | — | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| Barley | 21 | 0 | 0 | 0 | 0 | 0 | — | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 |
| Spring Wheat | 23 | 1 | 0 | 0 | 0 | 0 | 2 | — | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 | 12 |
| Winter Wheat | 24 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | — | 2 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 | 18 |
| Dbl Crop WinWht/Soybeans | 26 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | — | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 4 |
| Rye | 27 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 |
| Oats | 28 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | — | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 6 |
| Millet | 29 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | — | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 |
| Canola | 31 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | — | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 2 |
| Flaxseed | 32 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| Alfalfa | 36 | 4 | 0 | 0 | 4 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | — | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 | 28 |
| Sugarbeets | 41 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | — | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 3 |
| Dry Beans | 42 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | — | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| Other Crops | 44 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| Sugarcane | 45 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | — | 0 | 0 | 0 | 0 | 1 | 1 |
| Onions | 49 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| Cucumbers | 50 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| Tomatoes | 54 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| Herbs | 57 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| Sod/Grass Seed | 59 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 |
| Peaches | 67 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| Grapes | 69 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| Pecans | 74 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| Non-Crop | NC | 30 | 3 | 2 | 27 | 0 | 0 | 5 | 5 | 1 | 0 | 0 | 0 | 7 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | — | 81 |
| Total | 60 | 5 | 3 | 56 | 2 | 3 | 7 | 10 | 4 | 1 | 1 | 1 | 9 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 182 | 351 |
Confusion matrix of the post-classification result of the validation set.
| Classification result | R-CDL | ||
|---|---|---|---|
| correct | incorrect | ||
| CDL | correct | N/A | 35 |
| incorrect | 394 | 71 | |
Fig. 5Comparing NASS county-level planted area with CDL or R-CDL statistics for corn and soybean from 2017–2020. Correlation is measured by the coefficient of determination, R2. Depending on the data availability of NASS estimates, 838 counties were calculated for corn, and 836 counties were calculated for soybean.
Average percentage difference for corn planting areas at county level from NASS to CDL and R-CDL in different years.
| Corn | 2017 | 2018 | 2019 | 2020 |
|---|---|---|---|---|
| CDL | 4.2% | 6.7% | 6.7% | 9.6% |
| R-CDL | −2.1% | −0.5% | −1.1% | 0.6% |
Average percentage difference for soybean planting areas at the county-level from NASS to CDL and R-CDL in different years.
| Soybean | 2017 | 2018 | 2019 | 2020 |
|---|---|---|---|---|
| CDL | 2.7% | 3.9% | 5.1% | 4.1% |
| R-CDL | −3.6% | −3.0% | −2.2% | −3.5% |
| Measurement(s) | agricultural field |
| Technology Type(s) | decision tree |
| Sample Characteristic - Location | contiguous United States of America |