| Literature DB >> 30875732 |
Sean Hartling1, Vasit Sagan2, Paheding Sidike3, Maitiniyazi Maimaitijiang4, Joshua Carron5.
Abstract
Urban areas feature complex and heterogeneous land covers which create challenging issues for tree species classification. The increased availability of high spatial resolution multispectral satellite imagery and LiDAR datasets combined with the recent evolution of deep learning within remote sensing for object detection and scene classification, provide promising opportunities to map individual tree species with greater accuracy and resolution. However, there are knowledge gaps that are related to the contribution of Worldview-3 SWIR bands, very high resolution PAN band and LiDAR data in detailed tree species mapping. Additionally, contemporary deep learning methods are hampered by lack of training samples and difficulties of preparing training data. The objective of this study was to examine the potential of a novel deep learning method, Dense Convolutional Network (DenseNet), to identify dominant individual tree species in a complex urban environment within a fused image of WorldView-2 VNIR, Worldview-3 SWIR and LiDAR datasets. DenseNet results were compared against two popular machine classifiers in remote sensing image analysis, Random Forest (RF) and Support Vector Machine (SVM). Our results demonstrated that: (1) utilizing a data fusion approach beginning with VNIR and adding SWIR, LiDAR, and panchromatic (PAN) bands increased the overall accuracy of the DenseNet classifier from 75.9% to 76.8%, 81.1% and 82.6%, respectively. (2) DenseNet significantly outperformed RF and SVM for the classification of eight dominant tree species with an overall accuracy of 82.6%, compared to 51.8% and 52% for SVM and RF classifiers, respectively. (3) DenseNet maintained superior performance over RF and SVM classifiers under restricted training sample quantities which is a major limiting factor for deep learning techniques. Overall, the study reveals that DenseNet is more effective for urban tree species classification as it outperforms the popular RF and SVM techniques when working with highly complex image scenes regardless of training sample size.Entities:
Keywords: convolutional neural network (CNN); data fusion; deep learning; dense convolutional network (DenseNet); random forest (RF); support vector machine (SVM); tree species classification
Mesh:
Year: 2019 PMID: 30875732 PMCID: PMC6471063 DOI: 10.3390/s19061284
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Study area located at Forest Park in St. Louis, MO, USA. The red border indicates the boundary of the park.
List of tree species and number of training samples collected within the study area.
| Tree Species (Common Name) | Scientific Name | Genus | Family | Class | Training Samples |
|---|---|---|---|---|---|
| Green ash |
|
| Oleaceae | Deciduous | 348 |
| Bald cypress |
|
| Cupressaceae | Deciduous | 261 |
| Cottonwood |
|
| Salicaceae | Deciduous | 233 |
| Silver maple |
|
| Sapindaceae | Deciduous | 172 |
| Sugar maple |
|
| Sapindaceae | Deciduous | 145 |
| Pin oak |
|
| Fagaceae | Deciduous | 141 |
| Austrian pine |
|
| Pinaceae | Coniferous evergreen | 126 |
| Sycamore |
|
| Platanaceae | Deciduous | 126 |
| Total | 1552 |
Remotely sensed datasets.
| WorldView-2 | 22 September 2012 | Panchromatic: 450–800 | Pan: 0.5 m VNIR: 2.0 m | 11 bits per pixel |
| Coastal: 400–450 | ||||
| Blue: 450–510 | ||||
| Green: 510–580 | ||||
| Yellow: 585–625 | ||||
| Red: 630–690 | ||||
| Red Edge: 705–745 | ||||
| NIR1: 770–895 | ||||
| NIR2: 860–1040 | ||||
| WorldView-3 | 21 August 2015 | SWIR-1: 1195–1225 | SWIR: 7.5 m | 14 bits per pixel |
| SWIR-2: 1550–1590 | ||||
| SWIR-3: 1640–1680 | ||||
| SWIR-4: 1710–1750 | ||||
| SWIR-5: 2145–2185 | ||||
| SWIR-6: 2185–2225 | ||||
| SWIR-7: 2235–2285 | ||||
| SWIR-8: 2295–2365 | ||||
| LiDAR | 22 December 2012 | 1.5 m | ||
| NAIP | 18 June 2012 | Blue: 400–580 | VNIR: 1 m | 11 bits per pixel |
| Green: 500–650 | ||||
| Red: 590–675 | ||||
| NIR: 675–850 |
Figure 2Manually delineated ground truth samples of dominant tree species within study area. (a) Small scale view of tree species reference sample subset. (b) Large scale view of reference samples for individual tree canopies. (c) Individual tree species samples after background removal.
Figure 3Classification workflow.
Figure 4LiDAR data processing steps. (a) First return LAS data. (b) Last return LAS data. (c) Relative height model (last return subtracted from first return). (d) LiDAR intensity return image.
Feature variables (64 statistical spectral features, 40 textural features, 14 shape features).
| Feature Name | Description | Datasets Utilized | Total Bands |
|---|---|---|---|
| SpecMean | Spectral mean value of pixels forming region in band | VNIR/SWIR | 16 |
| SpecStd | Spectral standard deviation value of pixels forming region in band | 16 | |
| SpecMin | Spectral minimum value of pixels forming region in band | 16 | |
| SpecMax | Spectral maximum value of pixels forming region in band | 16 | |
| TextRange | Average data range of pixels comprising region inside kernel | VNIR/LiDAR/Pan | 10 |
| TextMean | Average value of pixels comprising region inside kernel | 10 | |
| TextVar | Average variance of pixels comprising region inside kernel | 10 | |
| TextEntro | Average entropy value of pixels comprising region inside kernel | 10 | |
| Area | Total area of polygon, minus area of holes | Extracted Image | 1 |
| Length | Combined length of all boundaries of polygon, including boundaries of holes | 1 | |
| Compactness | Indicates compactness of polygon [= √(4 * Area/ | 1 | |
| Convexity | Measures convexity of polygon [= length of convex hall/length] | 1 | |
| Solidity | Compares area of polygon to area of a convex hull surrounding polygon [= Area/area of convex hull] | 1 | |
| Roundness | Compares area of polygon to square of maximum diameter of polygon [= 4 * Area/( | 1 | |
| Form_Factor | Compares area of polygon to square of total perimeter [= 4 * Area/( | 1 | |
| Elongation | Ratio of major axis of polygon to minor axis of polygon [= Major_Length/Minor_Length] | 1 | |
| Rectangular_Fit | Compares area of polygon to area of oriented bounding box enclosing polygon [= Area/(Major_Length * Minor_Length)] | 1 | |
| Main_Direction | Angle subtended by major axis of polygon and | 1 | |
| Major_Length | Length of major axis of an oriented bounding box enclosing polygon | 1 | |
| Minor_Length | Length of minor axis of an oriented bounding box enclosing polygon | 1 | |
| Number_of_Holes | Number of holes in polygon in an integer value | 1 | |
| Hole_Area | Ratio of total area of polygon to area of outer contour of polygon [= Area/outer contour area] | 1 | |
| Total | 118 |
Vegetation indices utilized for tree species classification.
| Vegetation Index Name | Abbreviation | Formula | References |
|---|---|---|---|
| Atmospherically Resistant Vegetation Index | ARVI |
| [ |
| Canopy Chlorophyll Content Index | CCCI |
| [ |
| Green Normalized Difference Vegetation Index | GNDVI |
| [ |
| Normalized Difference Red Edge Index | NDRE |
| [ |
| Normalized Difference Red Edge Index—NIR2 | NDRE2 |
| [ |
| Normalized Difference Vegetation Index | NDVI |
| [ |
| Normalized Difference Vegetation Index—Green/Red Ratio | NDVI-GR |
| [ |
| Normalized Difference Vegetation Index—Yellow | NDVI-Y |
| [ |
| Normalized Difference Vegetation Index—NIR2 | NDVI2 |
| [ |
| Normalized Difference Water Index | NDWI |
| [ |
| Plant Senescence Reflectance Index | PSRI |
| [ |
| Soil Adjusted Vegetation Index | SAVI |
| [ |
| Visible Atmospherically Resistant Indices—Red Edge | VARI-Red Edge |
| [ |
Figure 5A deep DenseNet with three dense blocks. The layers between two adjacent blocks are referred to as transition layers and change feature-map sizes via convolution and pooling.
Classification accuracies for eight dominant tree species using fused datasets and DenseNet classification using 30% training and 70% testing of each species total reference samples. All values, except for kappa coefficient, are percentages.
| Tree Species | VNIR | VNIR+SWIR | VNIR+SWIR+LiDAR | VNIR+SWIR+LiDAR+PAN |
|---|---|---|---|---|
| Green Ash | 34.88 | 81.40 | 62.79 | 60.47 |
| Bald Cypress | 78.57 | 83.33 | 97.62 | 92.86 |
| Cottonwood | 78.95 | 60.53 | 76.32 | 78.95 |
| Silver Maple | 59.62 | 50.00 | 61.54 | 73.08 |
| Sugar Maple | 55.26 | 60.53 | 76.32 | 84.21 |
| Pin Oak | 95.19 | 90.38 | 94.23 | 97.12 |
| Austrian Pine | 81.43 | 82.86 | 82.86 | 88.57 |
| Sycamore | 85.90 | 80.77 | 80.77 | 71.79 |
| Overall Accuracy | 75.91 | 76.77 | 81.08 | 82.58 |
| Kappa Coefficient | 0.72 | 0.73 | 0.78 | 0.80 |
| Average Accuracy | 71.22 | 73.72 | 79.05 | 80.88 |
Classification accuracies for 8 dominant tree species using fused datasets with SVM and RF classifiers using 30% training and 70% testing of each species total reference samples. All values, except for the kappa coefficient, are percentages.
| VNIR | VNIR+SWIR | VNIR+SWIR+LiDAR | VNIR+SWIR+LiDAR+PAN | |
|---|---|---|---|---|
| SVM Classification Accuracy | ||||
| Green Ash | 34.88 | 34.88 | 41.86 | 44.19 |
| Bald Cypress | 42.86 | 42.86 | 61.90 | 61.90 |
| Cottonwood | 26.32 | 23.68 | 31.58 | 31.58 |
| Silver Maple | 13.46 | 19.23 | 15.38 | 15.38 |
| Sugar Maple | 26.32 | 28.95 | 34.21 | 34.21 |
| Pin Oak | 72.12 | 69.23 | 73.08 | 73.08 |
| Austrian Pine | 72.86 | 77.14 | 74.29 | 75.71 |
| Sycamore | 48.72 | 47.44 | 43.59 | 43.59 |
| Overall Accuracy | 48.17 | 48.60 | 51.40 | 51.83 |
| Kappa Coefficient | 0.39 | 0.40 | 0.43 | 0.44 |
| Average Accuracy | 42.19 | 42.93 | 46.99 | 47.46 |
| RF Classification Accuracy | ||||
| Green Ash | 20.93 | 16.28 | 30.23 | 20.93 |
| Bald Cypress | 30.95 | 33.33 | 42.86 | 42.86 |
| Cottonwood | 0.00 | 2.63 | 2.63 | 5.26 |
| Silver Maple | 13.46 | 13.46 | 15.38 | 11.54 |
| Sugar Maple | 15.79 | 26.32 | 15.79 | 18.42 |
| Pin Oak | 89.42 | 88.46 | 92.31 | 92.31 |
| Austrian Pine | 82.86 | 80.00 | 82.86 | 82.86 |
| Sycamore | 51.28 | 58.97 | 60.26 | 58.97 |
| Overall Accuracy | 48.60 | 50.11 | 53.12 | 52.04 |
| Kappa Coefficient | 0.38 | 0.40 | 0.43 | 0.42 |
| Average Accuracy | 38.09 | 39.93 | 42.79 | 41.64 |
Figure 6Overall accuracies for SVM, RF and DenseNet classifiers using various dataset combinations.
Figure 7Overall accuracies for individual tree species using SVM, RF, and DenseNet (DN) classifiers using (a) eight VNIR WorldView-2 bands, (b) 8 VNIR WorldView-2 bands plus eight SWIR WorldView-3 bands, (c) eight VNIR WorldView-2 bands plus eight SWIR WorldView-3 bands plus LiDAR intensity return image and (d) eight VNIR WorldView-2 bands plus eight SWIR WorldView-3 bands plus LiDAR intensity return image plus WorldView-2 panchromatic band.
Classification accuracies for eight dominant tree species incorporating 13 VIs and 118 extracted features for SVM, RF, and DenseNet classifiers in addition to 18-band data fusion set (Base).
| SVM | RF | DenseNet | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Species | Base | VIs | VIs/Features | Base | VIs | VIs/Features | Base | VIs | VIs/Features |
| Green ash | 44.19 | 51.16 | 51.16 | 20.93 | 30.23 | 20.93 ** | 60.47 | 65.12 | 74.42 |
| Bald cypress | 61.90 | 69.05 | 73.81 | 42.86 | 40.48 * | 59.52 | 92.86 | 92.86 | 78.57 ** |
| Cottonwood | 31.58 | 26.32 * | 31.58 | 5.26 | 7.89 | 15.79 | 78.95 | 68.42 * | 55.26 ** |
| Silver maple | 15.38 | 23.08 | 36.54 | 11.54 | 21.15 | 34.62 | 73.08 | 67.31 * | 57.69 ** |
| Sugar maple | 34.21 | 47.37 | 52.63 | 18.42 | 15.79 * | 36.84 | 84.21 | 76.32 * | 47.37 ** |
| Pin oak | 73.08 | 78.85 | 75.96 ** | 92.31 | 91.35 * | 94.23 | 97.12 | 94.23 * | 80.77 ** |
| Austrian pine | 75.71 | 77.14 | 65.71 ** | 82.86 | 88.57 | 84.29 ** | 88.57 | 82.86 * | 90.00 |
| Sycamore | 43.59 | 66.67 | 53.85 ** | 58.97 | 73.08 | 65.38 ** | 71.79 | 89.74 | 84.62 ** |
| Overall Accuracy | 51.83 | 60.00 | 58.28 ** | 52.04 | 56.77 | 60.22 | 82.58 | 82.37 * | 74.62 ** |
| Kappa Coefficient | 0.44 | 0.53 | 0.51 ** | 0.42 | 0.48 | 0.52 | 0.80 | 0.79 * | 0.70 ** |
| Average Accuracy | 47.46 | 54.95 | 55.16 | 41.64 | 46.07 | 51.45 | 80.88 | 79.61 * | 71.09 ** |
* Addition of VIs decreased classification accuracy from previous dataset; ** Addition of textural features/VIs decreased classification accuracy from previous dataset.
Overall accuracies for SVM, RF, and DenseNet classifiers using fused imagery dataset combined with different segmented feature types or vegetation indices.
| Datasets | SVM | RF | DenseNet |
|---|---|---|---|
| VNIR/SWIR/LiDAR/Pan (V/S/L/P) | 51.83 | 52.04 | 82.58 * |
| V/S/L/P + Shape Features | 50.97 | 56.13 | 78.06 |
| V/S/L/P + Statistical Spectral Features | 53.12 | 55.91 | 74.19 |
| V/S/L/P + Texture Features | 53.98 | 59.78 | 80.43 |
| V/S/L/P + Vegetation Indices | 60.00 * | 56.77 | 82.37 |
| V/S/L/P + All Features + VIs | 58.28 | 60.22 * | 74.62 |
* Highest classification accuracy of all data combinations per classifier.
Figure 8Overall accuracy for SVM, RF, and DenseNet classifiers using 10% training, 30% training, 50% training and 70% training from the total samples for each species using 18 bands.
Overall accuracy for SVM, RF and DenseNet classifiers for eight dominant tree species using varying percentages of the total samples for each individual class.
| Overall Accuracy (%) | |||
|---|---|---|---|
| Reference Sample % | SVM | RF | DenseNet |
| 10 | 32.52 | 43.41 | 70.77 |
| 30 | 44.57 | 49.63 | 79.56 |
| 50 | 48.97 | 49.61 | 80.62 |
| 70 | 51.83 | 52.04 | 82.58 |