Literature DB >> 31410287

Utilizing the density of inventory samples to define a hybrid lattice for species distribution models: DISTRIB-II for 135 eastern U.S. trees.

Matthew P Peters1, Louis R Iverson1, Anantha M Prasad1, Stephen N Matthews1,2.   

Abstract

Species distribution models (SDMs) provide useful information about potential presence or absence, and environmental conditions suitable for a species; and high-resolution models across large extents are desirable. A primary feature of SDMs is the underlying spatial resolution, which can be chosen for many reasons, though we propose that a hybrid lattice, in which grid cell sizes vary with the density of forest inventory plots, provides benefits over uniform grids. We examine how the spatial grain size affected overall model performance for the Random Forest-based SDM, DISTRIB, which was updated with recent forest inventories, climate, and soil data, and used a hybrid lattice derived from inventory densities.Modeled habitat suitability was compared between a uniform grid of 10 × 10 and a hybrid lattice of 10 × 10 and 20 × 20 km grids to assess potential improvements. The resulting DISTRIB-II models for 125 eastern U.S. tree species provide information on individual habitat suitability that can be mapped and statistically analyzed to understand current and potential changes.Model performance metrics were comparable among the hybrid lattice and 10-km grids; however, the hybrid lattice models generally had higher overall model reliability scores and were likely more representative of the inventory data.Our efforts to update DISTRIB models with current information aims to produce a more representative depiction of recent conditions by accounting for the spatial density of forest inventory data and using the latest climate data. Additionally, we developed an approach that leverages a hybrid lattice to maximize the spatial information within the models and recommend that similar modeling efforts be used to evaluate the spatial density of response and predictor data and derive a modeling grid that best represents the environment.

Entities:  

Keywords:  Forest Inventory and Analysis; habitat suitability; importance value; species abundance; statistical prediction

Year:  2019        PMID: 31410287      PMCID: PMC6686326          DOI: 10.1002/ece3.5445

Source DB:  PubMed          Journal:  Ecol Evol        ISSN: 2045-7758            Impact factor:   2.912


INTRODUCTION

Modeling a species' potential niche and mapping habitat suitability (HS) are standard practices for environmental research examining aspects of an ecosystem that influence species distributions, especially those impacted by ongoing change (Guisan et al., 2013). Such efforts can assist in defining a species' habitat range, be combined with field sampling to verify model performance, or aid in conservation and resource planning. Whether process‐based (Wang et al., 2014; Yospin et al., 2014) or statistical (Iverson, Prasad, Matthews, & Peters, 2008; Prasad, Iverson, Matthews, & Peters, 2016; Warwell, Rehfeldt, & Crookston, 2010), habitat modeling requires, at a minimum, information about the species' occurrence (i.e., importance or presence/absence, often from in situ plot data) and spatially indexed environmental conditions. Knowing what environmental information to include can be challenging in that variables are (a) scale dependent since values can differ at locations of species presence compared to aggregated values provided to the model (Kadmon, Farber, & Danin, 2003) and (b) must be relevant to the scope of the model, whether a macro‐ versus site‐level analysis or predicting occurrence versus abundance. Models that encompass regional extents (e.g., areas greater than ~10,000 km2) generally rely on remotely sensed data representing environmental conditions, and the spatial resolutions of these datasets have improved over the past few decades (Pettorelli et al., 2014). Additionally, techniques have been developed to downscale and relate climate data to local scales (Daly et al., 2008; Wang, Hamann, Spittlehouse, & Murdock, 2012) and HS models are being developed at these finer resolutions (Franklin et al., 2013; Gottschalk, Aue, Hotes, & Ekschmitt, 2011). While it seems advantageous to model HS at the finest resolution possible, over large extents the resultant models may not adequately match the spatial density of inventory data used for model training, which often only represents a fraction of the modeling extent. Additionally, issues related to model extrapolation (see Dormann, 2007; Peters, Herrick, Urban, Gardner, & Breshears, 2004; Rastetter, 2017) need consideration related to what is being modeled. The availability and accuracy of in situ inventory data collected by field sampling can be the biggest limitation to spatial resolutions when modeling over large extents. Field inventories are costly to implement at high densities, and thus, inventories may not fully sample representative habitats for rare species (Guisan et al., 2006; Mao & Colwell, 2005). Regardless of these drawbacks, inventory data are generally well‐suited for ecological modeling of habitats, provided positional errors are minimal (e.g., via data aggregation or smoothing) and/or modeling approaches that can reduce the influence of such errors are employed (Guisan et al., 2007). Dealing with locations where inventory data are not available can be done by omitting these locations from the model altogether or excluding them from the training dataset to then predict a value at these locations. However, the area of omitted or predicted locations can be quite large depending on the extent and grain size of the model in relation to the density of inventory samples. We propose that the spatial density of inventory plots be used to develop a hybrid lattice of grid cells (Stevens, 1997; Tsui & Brimicombe, 1997) for summarizing model predictor variables. The U.S. Department of Agriculture, Forest Service Forest Inventory and Analysis (FIA) dataset is a systematic random sample of forest conditions with one survey plot per ~2,428 ha (Bechtold & Patterson, 2005); however, due to the spatial distribution of forestland, densified sampling in some states from state‐funded sampling, and the randomness of plot locations, each cell within the modeling lattice will have a varying number of inventory plots. One solution is to use Thiessen or Voronoi polygons to provide spatial structure among inventory plots, where irregular‐shaped polygons containing a single data point at their centroid partition the landscape. Holland, Aegerter, Dytham, and Smith (2007) examined regular and irregular geometries for use in modeling movement across a landscape and concluded that irregular geometries reduced bias resulting from the spatial structure representing the landscape. Irregular geometries are ideal for nonparametric point‐pattern analyses (Boots, 1980; Vincent, Haworth, Griffiths, & Collins, 1976), since no assumptions about the statistical distribution of response and dependent variables are made. However, a drawback to Thiessen polygons is that in regions that are sparsely sampled, large polygons represent a single inventory plot which may be uncharacteristic of the total area (Wilkin, King, & Sheldon, 2009). Therefore, a gridded network may provide a better representation of landscape conditions irrespective of sampling densities since variance of environmental conditions within each grid is reduced compared to the entire extent. In this paper, we propose that a hybrid lattice may incorporate benefits of both Thiessen polygons and uniform grid networks for HS modeling. We compare whether models parameterized with data summarized with nested grids of both 10 × 10 and 20 × 20 km cells perform better in terms of model accuracy and reliability than 10 × 10 km uniform grid models. Concurrently, the evaluation of the hybrid lattice provides an update to our HS model, DISTRIB (described below), in which DISTRIB‐II attempted to model 135 tree species of the eastern United States.

MATERIALS AND METHODS

Species distribution model parameterization

The HS model, DISTRIB, uses FIA (www.fia.fs.fed.us) data to derive individual tree species importance values (IV, i.e., weighted abundance; Curtis & McIntosh, 1951) which are correlated to environmental conditions using Random Forest (RF hereafter, Iverson et al., 2008; Prasad, Iverson, & Liaw, 2006). It used a grain size of 20 × 20 km to summarize 38 environmental variables and aggregate species IVs, generally among two or more inventory plots, within the eastern United States (Iverson et al., 2008). DISTRIB has been used to predict potential current and future HS for 134 tree species under various scenarios of climate change; outputs are available from the Climate Change Tree Atlas (www.fs.fed.us/nrs/atlas, Prasad, Iverson, Peters, & Matthews, 2014) as are various vulnerability assessments (Brandt et al., 2017; Swanston et al., 2011) and general summaries of potential impacts (Iverson et al., 2017; Matthews & Iverson, 2017; Prasad et al., 2016). We introduce DISTRIB‐II, which incorporates an overhaul of data sources, updates to the RF modeling technique, and the hybrid lattice approach of modeling. DISTRIB‐II takes advantage of the increased resolution of available environmental variables, and the newer and more comprehensive inventory data available through the FIA database. The FIA program collects and reports information about the nation's forest lands, and beginning in 1999, implemented annual inventories completed over a 5‐ to 7‐year cycle (O'Connell et al., 2017). However, due to insufficient funding, cycles for some states were extended. Sampling of forest conditions and individual trees ≥5.0 inches in DBH is performed on four 24‐foot radius subplots (O'Connell et al., 2017). For privacy, locations and information have been “fuzzied and swapped,” respectively (see Lister et al., 2005), and we use these records to calculate individual IV from the number of stems (e.g., relative density) and basal area (e.g., relative dominance). The resulting IVs range from 0 to 100 and were used as the response value for 135 eastern U.S. tree species (Appendix A1) and 45 environmental variables (Table 1) were aggregated from native resolutions to a 100 (10 × 10 km) and 400 km2 (20 × 20 km) lattice. Efforts to improve modeled HS by increasing the spatial resolution at which data are provided to RF have been conducted (Peters, Iverson, Prasad, & Matthews, 2013), and 84,204 annualized FIA records (Forest Inventory & Analysis Database, 2017) from the most recently completed cycle for 37 eastern states sampled during the period 2000–2016 were processed for DISTRIB‐II. Most states completed inventory cycles in 6 years initiated in 2005, 2007, or 2008; however, Louisiana, Texas, and West Virginia had longer cycles (11, 12, and 10 years, respectively). The underlying response grids used to develop HS models were refined to 10‐ and 20‐km grids (Figure 1), where each 20‐km cell was divided into four 10‐km cells. In addition to using a uniform grid (hereafter DISTRIB‐10), a hybrid lattice (hereafter DISTRIB‐hybrid), composed of 10‐ and 20‐km grids established by FIA plot density, was used to represent landscape conditions. An iterative algorithm determined whether sufficient FIA plots existed within each 20‐km grid to warrant increasing the resolution to 10 km. The 10‐km grids were accepted if ≥50% of the four 10‐km cells within a 20‐km cell contained two or more FIA plots; otherwise, the focal 20‐km cell was retained.
Table A1

Model performance statistics for DISTRIB‐10 and DISTRIB‐hybrid models for 135 tree species. Model Reliability has been colored to indicate high (green), medium (yellow), low (pink), and unacceptable (purple). Species models having a negative RF R2 are colored red and Model Reliability is colored gray. Range class characterizes the overall distribution of the species range within the eastern United States

FIA CodeScientific nameRange ClassDISTRIB‐10DISTRIB‐hybrid
RF R 2 Fuzzy KappaTSSCV devTop5Model ReliabilityRF R 2 Fuzzy KappaTSSCV devTop5Model Reliability
12 Abies balsamea NDH0.660.970.800.950.300.930.660.790.960.960.390.93
43 Chamaecyparis thyoides NSH0.110.890.580.660.160.500.110.540.860.670.220.47
61 Juniperus ashei NDH0.610.990.730.880.380.890.650.750.990.920.430.92
68 Juniperus virginiana WDH0.210.800.630.860.320.590.260.610.790.910.550.64
71 Larix laricina NSH0.450.900.700.840.400.780.450.680.90.890.610.80
94 Picea glauca NSL0.150.890.740.790.140.600.170.730.890.800.270.61
95 Picea mariana NSH0.500.920.720.890.310.800.490.700.920.850.480.80
97 Picea rubens NDH0.590.960.790.920.060.850.590.790.960.900.520.91
105 Pinus banksiana NSH0.340.940.700.800.160.680.340.660.940.820.200.66
107 Pinus clausa NDH0.330.930.640.800.290.680.350.620.930.720.430.68
110 Pinus echinata WDH0.490.890.700.900.300.780.480.710.880.860.550.81
111 Pinus elliottii NDH0.550.950.700.940.440.850.600.690.950.950.370.85
115 Pinus glabra NSL0.120.780.650.640.170.520.130.630.770.630.210.50
121 Pinus palustris NSH0.280.890.640.820.300.640.280.640.890.880.330.64
123 Pinus pungens NSL0.070.820.660.520.120.490.070.670.830.670.150.49
125 Pinus resinosa NSH0.270.920.640.890.480.680.270.620.910.910.330.63
126 Pinus rigida NSH0.500.860.680.730.190.740.520.670.860.830.370.77
128 Pinus serotina NSH0.340.840.640.710.380.670.340.630.840.790.430.67
129 Pinus strobus WDH0.370.890.680.930.130.680.360.660.880.930.360.70
131 Pinus taeda WDH0.650.920.740.970.290.880.640.740.910.960.370.89
132 Pinus virginiana NDH0.340.910.700.860.340.710.330.700.90.910.700.76
221 Taxodium distichum NSH0.220.860.650.820.130.580.250.630.870.780.230.59
222 Taxodium ascendens NSH0.300.920.680.780.330.680.300.650.920.750.310.65
241 Thuja occidentalis WSH0.390.940.690.900.360.740.390.690.930.920.400.74
261 Tsuga canadensis NSH0.370.910.690.910.030.670.360.690.90.940.380.72
311 Acer barbatum NSL0.150.610.580.580.090.460.150.600.620.720.190.47
313 Acer negundo WSH0.090.810.600.710.180.490.130.590.820.800.380.54
314 Acer nigrum NSH0.000.740.450.520.150.350.020.520.730.550.170.37
315 Acer pensylvanicum NSL0.280.770.630.870.180.590.270.630.770.870.450.63
316 Acer rubrum WDH0.470.710.680.950.320.730.420.670.640.950.390.70
317 Acer saccharinum NSH0.050.880.570.340.100.440.080.580.890.730.230.48
318 Acer saccharum WDH0.420.860.670.960.150.700.420.670.840.920.390.73
319 Acer spicatum NSL0.180.410.240.700.250.310.180.250.410.740.360.32
331 Aesculus glabra NSL0.020.800.560.570.230.440.050.580.770.340.140.40
332 Aesculus flava NSL0.080.790.610.540.100.460.090.640.80.610.100.47
356 Amelanchier spp. NSL0.090.450.350.530.130.290.090.340.450.440.150.26
367 Asimina triloba NSL0.010.540.300.270.070.220.010.310.520.270.100.20
371 Betula alleghaniensis NDL0.540.900.750.950.020.780.530.740.90.940.650.87
372 Betula lenta NDH0.390.940.750.920.200.750.400.740.930.900.270.75
373 Betula nigra NSL0.010.780.620.280.120.430.010.620.790.480.170.42
375 Betula papyrifera WDH0.460.940.770.930.300.810.460.770.930.930.190.78
379 Betula populifolia NSL0.110.690.640.650.180.490.100.640.70.660.190.48
381 Sideroxylon lanuginosum ssp. lanuginosum NSL0.020.730.470.390.120.360.040.520.710.330.170.36
391 Carpinus caroliniana WSL0.110.710.620.630.120.480.090.610.70.590.160.45
401 Carya aquatica NSL0.160.790.590.460.140.500.220.650.810.620.140.55
402 Carya cordiformis WSL0.050.780.590.460.300.470.060.610.780.540.270.46
403 Carya glabra WDL0.220.830.670.930.280.620.210.680.810.940.350.62
404 Carya illinoinensis NSH0.020.800.590.630.070.430.080.640.830.660.280.50
405 Carya laciniosa NSL0.020.760.470.170.170.360.020.460.760.190.100.31
407 Carya ovata WSL0.120.830.620.790.370.560.180.640.830.850.360.58
408 Carya texana NDL0.330.910.730.800.200.690.350.740.910.790.380.73
409 Carya alba WDL0.180.800.650.900.030.540.170.650.780.940.660.63
421 Castanea dentata NSLX0.010.370.09−0.200.110.080.010.090.33−0.280.150.02
452 Catalpa speciosa NSHX −0.01 0.820.470.480.180.38 −0.01 0.530.810.270.160.35
461 Celtis laevigata NDH0.170.810.620.750.120.530.270.630.80.800.470.64
462 Celtis occidentalis WDH0.080.820.580.700.350.510.150.610.840.850.310.55
471 Cercis canadensis NSL0.070.650.580.550.200.440.070.560.640.610.130.40
491 Cornus florida WDL0.190.750.660.690.110.550.170.640.740.630.530.58
521 Diospyros virginiana NSL0.010.510.540.460.060.340.020.580.530.570.110.36
531 Fagus grandifolia WDH0.410.830.680.910.050.680.390.670.810.920.450.73
541 Fraxinus americana WDL0.220.770.620.900.330.590.240.630.750.930.140.56
543 Fraxinus nigra WSH0.290.860.690.910.340.680.290.680.860.900.040.62
544 Fraxinus pennsylvanica WSH0.100.730.570.580.030.430.130.550.730.820.030.45
546 Fraxinus quadrangulata NSL0.060.740.560.630.170.440.060.610.760.580.180.45
551 Gleditsia aquatica NSLX −0.01 0.880.460.370.120.37 0.00 0.460.810.250.050.30
552 Gleditsia triacanthos NSH0.030.820.590.470.180.450.110.610.840.660.180.49
555 Gordonia lasianthus NSH0.170.860.640.760.400.600.180.660.860.740.330.58
571 Gymnocladus dioicus NSLX −0.01 0.830.470.380.050.35 −0.01 0.580.830.380.170.39
580 Halesia spp. NSL0.100.700.280.470.260.330.090.260.710.550.330.31
591 Ilex opaca NSL0.260.690.620.850.070.550.280.610.690.880.350.60
601 Juglans cinerea NSLX 0.00 0.690.470.560.170.36 0.00 0.470.70.510.050.31
602 Juglans nigra WDH0.090.840.600.650.270.510.140.620.840.730.260.53
611 Liquidambar styraciflua WDH0.470.900.730.950.150.760.450.720.890.980.230.76
621 Liriodendron tulipifera WDH0.420.870.700.940.240.730.410.700.860.960.450.76
641 Maclura pomifera NDH0.060.890.600.700.270.500.160.610.90.710.310.55
651 Magnolia acuminata NSL0.170.760.560.770.030.480.160.540.770.810.290.50
652 Magnolia grandiflora NSL0.050.690.560.560.160.420.040.550.70.500.140.39
653 Magnolia virginiana NSL0.240.820.680.850.220.620.230.680.810.840.170.59
654 Magnolia macrophylla NSL0.180.470.200.510.130.270.180.160.460.500.080.22
655 Magnolia fraseri NSL0.140.790.630.520.290.540.140.610.790.560.280.50
682 Morus rubra NSL0.010.600.490.390.060.330.070.550.630.660.200.41
691 Nyssa aquatica NSH0.200.840.640.760.150.570.210.630.850.770.160.56
693 Nyssa sylvatica WDL0.240.820.700.900.200.630.220.700.80.930.530.66
694 Nyssa biflora NDH0.240.910.680.890.320.650.240.680.910.880.130.61
701 Ostrya virginiana WSL0.130.680.620.670.080.480.130.610.680.550.300.48
711 Oxydendrum arboreum NDL0.500.890.750.900.500.840.480.750.880.910.560.83
721 Persea borbonia NSL0.170.690.600.610.320.530.170.600.710.660.370.53
722 Planera aquatica NSL0.010.740.540.200.180.390.030.550.750.680.220.41
731 Platanus occidentalis NSL0.040.820.620.290.130.450.050.630.820.390.050.42
741 Populus balsamifera NSH0.320.870.690.780.320.690.360.680.870.760.250.68
742 Populus deltoides NSH0.010.870.61−1.770.010.330.030.590.890.670.250.46
743 Populus grandidentata NSL0.180.860.660.850.300.600.180.660.850.830.340.59
746 Populus tremuloides WDH0.560.920.710.940.090.790.570.700.910.940.170.80
761 Prunus pensylvanica NSL0.020.470.400.610.060.280.020.400.480.640.280.31
762 Prunus serotina WDL0.250.680.610.850.640.640.270.600.640.850.290.57
763 Prunus virginiana NSLX −0.02 0.340.270.380.080.18 −0.02 0.330.350.500.110.20
766 Prunus americana NSLX −0.01 0.460.170.230.090.15 −0.01 0.230.460.060.030.12
802 Quercus alba WDH0.400.770.660.940.020.650.370.650.740.950.360.68
804 Quercus bicolor NSL0.000.840.560.560.120.420.010.580.840.480.160.41
806 Quercus coccinea WDL0.300.880.690.900.150.650.300.690.870.840.320.67
809 Quercus ellipsoidalis NSH0.310.910.700.830.130.660.310.690.910.800.240.66
812 Quercus falcata WDL0.210.860.690.890.070.590.200.700.840.870.440.64
813 Quercus pagoda NSL0.180.850.650.730.320.590.190.660.860.770.260.58
816 Quercus ilicifolia NSLX 0.00 0.650.44−0.140.210.31 −0.01 0.410.60.080.070.23
817 Quercus imbricaria NDH0.110.890.620.760.080.510.170.700.880.440.350.58
819 Quercus laevis NSH0.220.820.660.740.170.580.210.640.850.770.150.57
820 Quercus laurifolia NDH0.270.850.670.900.370.660.290.670.850.910.360.66
822 Quercus lyrata NSL0.150.850.640.680.140.540.190.650.860.680.260.57
823 Quercus macrocarpa NDH0.190.880.630.850.450.620.240.610.890.850.180.58
824 Quercus marilandica NSL0.210.770.640.800.190.560.290.680.770.840.190.62
825 Quercus michauxii NSL0.030.690.600.590.080.420.030.600.70.540.040.39
826 Quercus muehlenbergii NSL0.170.850.650.800.090.540.190.640.840.810.230.57
827 Quercus nigra WDH0.330.900.720.960.380.730.320.720.890.960.390.72
828 Quercus texana NSH0.150.830.570.590.100.490.180.610.840.550.340.54
830 Quercus palustris NSH0.060.920.630.700.160.500.110.660.910.710.140.52
831 Quercus phellos NSL0.120.790.640.660.080.500.130.640.790.700.140.50
832 Quercus prinus NDH0.480.920.710.950.390.800.470.720.920.950.370.79
833 Quercus rubra WDH0.300.770.640.920.320.640.290.620.740.860.400.62
834 Quercus shumardii NSL0.040.750.580.530.060.420.040.620.770.590.140.44
835 Quercus stellata WDH0.420.840.660.910.310.720.500.680.830.900.550.80
837 Quercus velutina WDH0.350.810.650.920.400.690.360.650.790.910.500.70
838 Quercus virginiana NDH0.350.900.620.890.520.710.440.660.910.870.400.74
842 Quercus incana NSL0.060.620.410.470.160.340.060.430.630.330.180.32
901 Robinia pseudoacacia NDH0.110.870.670.780.260.560.110.650.870.660.180.51
912 Sabal palmetto NDH0.270.960.610.840.570.680.310.630.950.840.380.67
921 Salix amygdaloides NSLX −0.01 0.570.340.190.110.25 −0.02 0.470.580.430.180.31
922 Salix nigra NSH0.010.770.600.010.080.390.030.590.80.470.160.42
931 Sassafras albidum WSL0.140.700.610.740.180.500.140.620.690.620.270.50
935 Sorbus americana NSL0.050.420.180.280.120.170.070.150.390.400.320.18
951 Tilia americana WSL0.190.830.630.780.270.580.220.620.830.880.480.62
971 Ulmus alata WDL0.250.850.710.860.250.650.290.720.850.890.130.65
972 Ulmus americana WDH0.140.730.570.880.380.530.220.590.720.910.360.56
973 Ulmus crassifolia NDH0.140.910.580.820.080.510.220.640.930.870.370.63
975 Ulmus rubra WSL0.080.760.600.470.410.510.100.610.760.600.300.49
977 Ulmus thomasii NSLX −0.01 0.650.310.400.100.25 −0.01 0.450.710.480.190.32
Table 1

Environmental data used to predict habitat suitability of eastern U.S. tree species. Data were either aggregated to 10‐ and 20‐km grids or derived from aggregated data

CategoryVariableDescriptionNative resolution
Climatea [PANN] Annual precipitationMean 30‐year (1981–2010) monthly precipitation (mm)800 m
[PGrow] May‐September precipitationMean 30‐year (1981–2010) monthly precipitation for May–September (mm)
[TANN] Annual mean temperatureMean 30‐year (1981–2010) monthly temperature (°C)
[TGrow] May‐September mean temperatureMean 30‐year (1981–2010) monthly temperature for May–September (°C)
[TWINavg] Mean temperature of coldest monthMean 30‐year (1981–2010) monthly temperature of coldest month (°C)
[TSUMavg] Mean temperature of warmest monthMean 30‐year (1981–2010) monthly temperature of warmest month (°C)
[Aridity] Aridity IndexA conditional ratio of precipitation and Thornthwaite potential evapotranspiration (see Koch, Smith, & Coulston, 2013)10 and 20 km
Elevationb [ElvMIN] MinimumMinimum value90 m
[ElvMEAN] MeanMean value
[ElvMAX] MaximumMaximum value
[ElvMEDIAN] MedianMedian value
[ElvMIN] RangeRange between minimum and maximum values
[ElvStdDev] Standard deviationAmount of deviance among elevation
[ElvCV] Coefficient of variationThe CV of elevation
Solarc [DayLenCV] Day length coefficient of variationThe CV of 12‐monthly day lengths derived from the latitude of grid cells10 and 20 km
Soild [AWC] Available water capacity (cm)The quantity of water that the soil is capable of storing for use by plants30 m
[AWS] Available water supply (cm)The total volume of water that should be available to plants when the soil, inclusive of rock fragments, is at field capacity
[BD3RDBAR] Bulk density (g/cm3)The ovendry weight of the soil material <2 mm in size per unit volume of soil at water tension of 1/3 bar
[CACO3] Calcium carbonateThe percent of carbonates, by weight, in the fraction of the soil <2 mm in size
[CEC7] Cation‐exchange capacityThe total amount of extractable cations that can be held by the soil, expressed in terms of milliequivalents per 100 g of soil at neutrality (pH 7.0) or at some other stated pH
[DEP2WATTBL] Depth to water table (cm)Depth to a saturated zone in the soil
[KSAT] Permeability (cm/hr)Saturated hydraulic conductivity or the ease with which pores in a saturated soil transmit water
[KFACTRF] Erosion K factorThe susceptibility of a soil to sheet and rill erosion by water estimated by percentage of silt, sand, and organic matter and on soil structure and saturated hydraulic conductivity
[TFACTOR] Erosion T factor (tons/acre/year)An estimate of the maximum average annual rate of soil erosion by wind and/or water that can occur without affecting crop productivity over a sustained period
[CLAY] Percent clayMineral soil particles that are <0.002 mm in diameter
[SAND] Percent sandMineral soil particles that are 0.05–2 mm in diameter
[SILT] Percent siltMineral soil particles that are 0.002–0.05 mm in diameter
[OM] Organic matter content (% by weight)Plant and animal residue in soil material <2 mm in diameter at various stages of decomposition
[PH] pHA measure of acidity or alkalinity
[SIEVE10] Percent passing sieve No. 10Soil fraction passing a number 10 sieve (2.00 mm square opening)
[SIEVE200] Percent passing sieve No. 200Soil fraction passing a number 200 sieve (0.074 mm square opening)
[SProd] Soil productivitye Productivity Index derived from family‐level soil taxonomy information
Soil taxonomic orderThe percentage of each of nine taxonomic orders: Alfisols, Aridisols, Entisols, Histosols, Inceptisols, Mollisols, Spodosols, Ultisols, and Vertisols
Soil textureThe percentage of texture class as defined by USDA standard terms: clayey, loamy, sandy, or other

PRISM Climate Group (2014).

Farr et al. (2007).

Forsythe, Rykiel, Stahl, Wu, and Schoolfield (1995).

Soil Survey Staff (2016).

Schaetzl, Krist, and Miller (2012).

Figure 1

Extent of DISTRIB models and distribution of 10‐km and 20‐km grid cells within the hybrid lattice

Environmental data used to predict habitat suitability of eastern U.S. tree species. Data were either aggregated to 10‐ and 20‐km grids or derived from aggregated data PRISM Climate Group (2014). Farr et al. (2007). Forsythe, Rykiel, Stahl, Wu, and Schoolfield (1995). Soil Survey Staff (2016). Schaetzl, Krist, and Miller (2012). Extent of DISTRIB models and distribution of 10‐km and 20‐km grid cells within the hybrid lattice In DISTRIB‐II, RF models were developed using the randomForest library (Liaw & Wiener, 2002) in R version 3.1.1 (R Development Core Team, 2014). Mean IV among FIA plots within each grid cell was modeled with 45 environmental variables consisting of climate, elevation, and soil properties (Table 1) for the hybrid and uniform grids. Only species occurring in at least 60 grid cells were considered for HS modeling. For each species, cells were excluded from the training data if (a) fewer than two FIA plots were present, (b) forest cover was <5% defined by the 2006 NLCD (Fry et al., 2011) (classes 41, 42, 43, and 90), or (c) the mean IV was >1.5 times the interquartile range of all cell IVs for the species so outlier values would not influence the models. We excluded cells from the training dataset with <5% forest cover (i.e., highly agricultural regions) containing two or more FIA plots because environmental drivers in those cells are likely to only marginally relate to forest species. We excluded IV outliers because they are unlikely to represent the broad 100‐ or 400‐km2 area, likely representing an artifact of recent forest or land use change. RF was parameterized to generate 1,001 regression trees, use eight randomly chosen predictor variables at each node (i.e., mtry), and grow each regression tree with a minimum of 10 observations. We set mtry to eight instead of the default, one‐third of the number of predictors (15 in this case), because predictor set redundancy resulted in better model performance statistics with fewer variables used at each node. Once the RF model was trained, predictions of IV were made to all cells regardless of size for the hybrid lattice, whether sufficient FIA plots were present, or percent forest cover was less than five percent.

Modeling species' importance

Each of the 1,001 regression trees built by RF provides information about the predicted IV, and the default is to report the mean prediction. However, the resampling of only eight of 45 variables at each node can result in spurious trees due to, for example, omission of an entire class (e.g., climate); while this does not influence overall prediction (Breiman, 2001), outliers can influence prediction distributions at a given cell (Roy & Larocque, 2012). Therefore, we compared the mean predicted value to the median for each cell; if the median = 0 and among all 1,001 predicted values the coefficient of variation (CV) ≥2.75, then 0 was used as the predicted IV rather than the mean, which was 0 < IVmean <8 among all species. This “mean–median” combination is a modification to the approach suggested by Roy and Larocque (2012) which limits the influence on outlier predictions, yet retaining some marginally suitable habitat (e.g., mean prediction). In doing so, it gives more weight to half of the forest predicting a zero compared to a few trees predicting values >0 when the deviation of values is 2.75 times greater than the mean.

Evaluating model performance

We assessed statistical performance, or model reliability (ModRel), for each species' model among the DISTRIB‐hybrid and DISTRIB‐10 with five variables: (a) a pseudo‐R 2 obtained from the RF model (RF R 2); (b) a Fuzzy Kappa (FK) comparing the imputed RF map to the FIA‐derived map (Hagen‐Zanker, 2006, 2009); (c) a true skill statistic (TSS) of the imputed RF, after removing records with very high CV (e.g., mean–median combination); (d) the deviance of the CV (CVdev) among 30 regression trees via bagging (Iverson et al., 2008; Prasad et al., 2006); and (e) the stability of the top five variables (Top5) from 30 regression trees (Iverson et al., 2008). The five variables were normalized to a 0–1 scale and weighted as follows to arrive at a final ModRel score: 0.33*RF R 2 + 0.33*FK + 0.11*TSS + 0.11*CVdev + 0.11*Top5 which gives more weighting to RF R 2 and FK, a primary performance metric and a comparison of predicted to observed values, respectively. Then, ModRel scores were assigned to one of four classes: high (ModRel ≥ 0.7), medium (0.7 > ModRel > 0.54), low (0.55 > ModRel ≥ 0.14), and unreliable (ModRel < 0.14). Any species with negative RF R 2 were deemed unreliable and excluded from HS modeling. Described in Iverson et al. (2008), FK is derived from a cell‐by‐cell comparison between the FIA IV and RF‐modeled IV (see Prasad et al., 2006), producing a 0–1 scale where one is a perfect match. FK is a better measure than percentage correct because the Kappa statistics account for uneven quantities of classes (Hagen‐Zanker, 2006, 2009), while the “fuzzy” part considers the proximity between classes (e.g., IV 1–3 vs. IV 4–6), which are a closer match than classes farther apart (e.g., IV 0 vs. IV 21–30). The variation among 30 regression trees via bagging allowed an assessment of the consistency of the model outputs and provided two components of the ModRel scoring (Prasad et al., 2006). With a stable model, the deviance explained among 30 regression trees would vary little while an unstable model would yield trees explaining varying degrees of deviance. The CVdev variable (CV among 30 models) was calculated by (a) taking the weighted sums of the predictor deviance explained for each of the top five predictors; (b) calculating the CV (0–1) among the 30 bagging trees; and (c) subtracting this value from one to obtain a 0–1 score with one being most stable. Thus, it considers the amount and consistency of contribution of the top five predictors. The Top5 variable uses the rank order of the top (up to five) predictors to compare the top RF variables among the 30 regression trees (via bagging; Iverson et al., 2008). We chose five variables arbitrarily to represent the primary drivers of the model, though for some species, fewer than five variables were needed to create suitable regression trees among the two grain sizes. The 0–1 scale was derived by summing the inverse ranks and dividing by the perfect match sum of 15 (assuming five points for first variable and one point for fifth variable entered into RF or bagging model). A score of one indicated that all five variables match the order exactly between RF and a bagging output, while a score of zero indicated no matches of top variables. True skill statistic indicates how well the predicted values correspond to observed data (Allouche, Tsoar, & Kadmon, 2006); however, it can only be calculated for cells that contain observed IV and not for those in which imputation was used to predict an IV. TSS is only informative for a portion of cells, and we calculated TSS from IV by assuming that IV > 0 represents a predicted presence and IV = 0 represents an absence for the species. In addition to calculating the performance statistics, confidence values (Wager, Hastie, & Efron, 2014) were calculated as a percentage of the 1,001 predicted values within one standard deviation of the mean or the median absolute deviation for records utilizing the mean–median combination, respectively. These confidence values can then be mapped to reveal spatial patterns of performance.

Species' range, detection, and abundance

For each of the 135 species modeled, information related to the spatial distribution of FIA data was used to classify the (a) distribution as narrow or wide, (b) density of FIA plots (commonness) as dense or sparse, and (c) FIA mean IV (i.e., abundance) as high or low. These classifications allow us to collectively evaluate the quality of the models as well as generalize some species characteristics. A species' distribution was considered narrow if the area of grid cells with FIA IV > 0 occupied <10% of the eastern United States; otherwise, it was assigned as wide. The density of FIA plots was considered dense if ≥40% of FIA plots among grid cells with IV > 0 for the species reported presence; otherwise, density was assigned to sparse. The abundance was considered high for average IV ≥ 6.0, the median of mean IV where the species occurs among all species, and low for values <6. These three categories were combined and coded with the first letter, where WDH indicates the species has a wide distribution, dense FIA plot ratio, and high IV. Codes for species that were withdrawn due to poor performance metrics were appended with an “X.”

Predictor variable importance

An evaluation of predictor importance was performed as described by Iverson et al. (2008) where a variable importance index (VarImpIndx) was derived from the average of three normalized (0–100) scores: (a) the sum of predictor importance scores from RF (percent increase) across all species (SumVarImp); (b) the sum of the reciprocal of ranked predictor importance across all species (SumRankRecip); and (c) the frequency of the top 10 predictors with the highest importance across all species (FreqTop10).

RESULTS

Representing species importance

Across the eastern United States, uniform grids containing 41,681 and 10,691 cells had an average of 2.9 and 9.4 FIA plots at 10 and 20 km, respectively (Figure 2a,b). In contrast, the hybrid lattice contained 29,357 cells with an average of 3.2 FIA plots. Among the 20‐km grids, a maximum of 40 FIA plots occurred within some cells as a result intensified inventories. However, the hybrid lattice, of which 84.7% of the 29,357 cells are 10‐km grids, reduced the number of FIA plots to range from one to 12, with a mean plot count similar to the 10‐km uniform grid (Figure 2c). Thus, the hybrid lattice may provide a more representative sampling of the species' distribution across environmental conditions compared to a uniform 20‐km grid, which aggregates information from 2 to 40 plots for modeling.
Figure 2

Count of Forest Inventory and Analysis (FIA) plots within (a) 10‐km grids, (b) 20‐km grids, and (c) hybrid lattice of 10‐ and 20‐km grids. The accompanying table includes the total cell count and mean (range) of FIA plots among cells

Count of Forest Inventory and Analysis (FIA) plots within (a) 10‐km grids, (b) 20‐km grids, and (c) hybrid lattice of 10‐ and 20‐km grids. The accompanying table includes the total cell count and mean (range) of FIA plots among cells

Predicting species importance

For several species, the resulting DISTRIB‐hybrid models contained many mean predictions of relatively low IV in locations where a species would not be expected under current conditions (Figure 3a,b). When compared to the RF median prediction, these locations often had IV = 0 (Figure 3c,d); however, some of these locations might be suitable for a species though half of the 1,001 predicted values are zero. Combining the mean–median predictions ensures that some of these possibly suitable habitats remain (Figure 3e,f) in locations where the variance of predicted values varies little from the mean. Additionally, the confidence values provide a degree of certainty among the 1,001 predictions across the modeled spatial distribution (Figure 3g,h). Comparing predicted IVs between the DISTRIB‐hybrid and DISTRIB‐10 models for two species with differing range extents contrasts how the hybrid lattice informs the models (Figure 4). DISTRIB‐hybrid with 1,890 10‐km cells and 203 20‐km cells predicted an IV sum of 10,356 for Abies balsamea, while DISTRIB‐10 resulted in an IV sum of 10,426 (Figure 4a,b). Within the cells depicted in Figure 4, Ulmus americana has a predicted IV sum of 4,906 and 4,990 from the DISTRIB‐hybrid and DISTRIB‐10 models, respectively (Figure 4c,d). The modeling extent shown in Figure 4e,f had fewer cells containing <2 FIA plots for the DISTRIB‐hybrid model, an area of 58,500‐km2, compared to the 84,500‐km2 area within the DISTRIB‐10 model.
Figure 3

Modeled importance values for two species calculated by (a and b) the mean of 1,001 Random Forest (RF) predictions, (c and d) the median RF prediction, (e and f) a combination of mean and median RF predictions where the RF mean is used unless the median value is 0 and the coefficient of variation among predicted values is ≥2.75, then the median is accepted. (g and h) indicate the spatial confidence of RF predictions calculated as percentage of regression trees within ± 1 standard deviation of the mean or the median absolute deviation for the mean and the median prediction, respectively

Figure 4

Comparison of predicted importance values between (a and c) the DISTRIB‐hybrid model and (b and d) DISTRIB‐10 model for a species corresponding to more 10‐km cells (Abies balsamea) and a species corresponding to more 20‐km cells (Ulmus americana). A comparison between the number of inventory plots per (e) the DISTRIB‐hybrid and (f) DISTRIB‐10 cells (see Appendix A4 for areal analysis)

Modeled importance values for two species calculated by (a and b) the mean of 1,001 Random Forest (RF) predictions, (c and d) the median RF prediction, (e and f) a combination of mean and median RF predictions where the RF mean is used unless the median value is 0 and the coefficient of variation among predicted values is ≥2.75, then the median is accepted. (g and h) indicate the spatial confidence of RF predictions calculated as percentage of regression trees within ± 1 standard deviation of the mean or the median absolute deviation for the mean and the median prediction, respectively Comparison of predicted importance values between (a and c) the DISTRIB‐hybrid model and (b and d) DISTRIB‐10 model for a species corresponding to more 10‐km cells (Abies balsamea) and a species corresponding to more 20‐km cells (Ulmus americana). A comparison between the number of inventory plots per (e) the DISTRIB‐hybrid and (f) DISTRIB‐10 cells (see Appendix A4 for areal analysis)
Table A4

The number of FIA inventory plots corresponding to the hybrid lattice and uniform grid cells for the region depicted in Figure 4 E and F

FIA PlotsDISTRIB‐hybridDISTRIB−10
Number of cellsArea (km2)Number of cellsArea (km2)
010927,60052952,900
116230,90031631,600
2–346159,00048248,200
4–679291,30076876,800
7–951751,80050950,900
10–12515,000505,000
1311001100
Total2,093265,7002,655265,500
The same modeling framework and underlying data were used to model HS for 135 species, and we examined how changes to the model's spatial resolution affected performance statistics (Table 2 and Appendix A1). Nine DISTRIB‐hybrid models were deemed unacceptable, having negative RF R 2 values, while one species had excessively low model reliability (ModRel < 0.14, Appendix A1); this left 125 modeled species. DISTRIB‐hybrid models resulted in higher RF R 2 than the DISTRIB‐10 models for 79 species, while the remaining 46 species were lower (Figure 5). TSS values were higher for 123 DISTRIB‐hybrid models, while FK values were lower for 124 species compared to the DISTRIB‐10 models (Appendix A1).
Table 2

Characterization of species' ranges and FIA records

CODEScaleSpecies countPercent of speciesRF R 2 FKCV devianceTop 5 variablesTSSModRel
NDH10 km1813.30.4940.9230.8790.4750.8450.740
Hybrid0.5490.8580.8540.5280.9220.762
NDL10 km32.20.6880.9090.9070.3750.9310.828
Hybrid0.6910.9360.9050.7590.9060.871
NSH10 km2417.80.3290.8750.6840.3200.7970.623
Hybrid0.3560.7970.7660.4060.8790.638
NSHX10 km10.700.8280.4960.2880.5870.409
Hybrid00.6630.2810.2290.8180.376
NSL10 km4130.40.1550.7150.5810.2400.6800.473
Hybrid0.1710.6890.6190.2930.7180.473
NSLX10 km96.70.0010.6110.2870.1810.4230.275
Hybrid00.490.3060.1610.6030.263
WDH10 km1914.10.560.8510.9250.3940.8450.74
Hybrid0.5780.8450.9340.5470.8360.762
WDL10 km96.70.3490.8130.8930.3580.8390.652
Hybrid0.3500.8390.8910.5420.7950.667
WSH10 km43.00.3250.8430.7960.3580.8000.629
Hybrid0.3610.7910.8810.3060.8430.632
WSL10 km75.20.1780.7630.6660.3870.7700.550
Hybrid0.2010.7750.6790.4400.7600.550

The range code field describes the distribution, density, importance, and model status. Distributions were narrow or wide, density is dense or sparse, importance is high or low, and model status was given an “X” if withdrawn (see text for assignment criteria). Mean scores (0–1) for RF R 2 obtained from Random Forest model, Fuzzy Kappa (FK) calculated on imputed mixed model predictions, coefficient of variation (CV) of deviance from 30 bagging models, top 5 variables from 30 bagging models, and true skill statistic (TSS) were used to derive model reliability (ModRel). See Appendix A1 for a list of species corresponding to the range codes.

Figure 5

Comparison of Random Forest R 2 values for the DISTRIB‐10 and the DISTRIB‐hybrid models for 126 tree species with positive values. Points above the 1:1 line have a higher DISTRIB‐hybrid RF R 2 value compared to the DISTRIB‐10 models

Characterization of species' ranges and FIA records The range code field describes the distribution, density, importance, and model status. Distributions were narrow or wide, density is dense or sparse, importance is high or low, and model status was given an “X” if withdrawn (see text for assignment criteria). Mean scores (0–1) for RF R 2 obtained from Random Forest model, Fuzzy Kappa (FK) calculated on imputed mixed model predictions, coefficient of variation (CV) of deviance from 30 bagging models, top 5 variables from 30 bagging models, and true skill statistic (TSS) were used to derive model reliability (ModRel). See Appendix A1 for a list of species corresponding to the range codes. Comparison of Random Forest R 2 values for the DISTRIB‐10 and the DISTRIB‐hybrid models for 126 tree species with positive values. Points above the 1:1 line have a higher DISTRIB‐hybrid RF R 2 value compared to the DISTRIB‐10 models The weighted metrics used for ModRel varied slightly among the two modeling grids. For DISTRIB‐hybrid models, 29, 47, 49, and 10 species were classified as having high, medium, low, and unacceptable ModRel, while the DISTRIB‐10 models resulted in 24, 43, 58, and 10 species, respectively (Appendix A1). Additionally, for all species, the DISTRIB‐hybrid models had higher mean confidence values, especially for species with high ModRel (Appendix A2).
Table A2

Confident values among Importance Value classes for DISTRIB‐10 and DISTRIB‐hybrid models for 135 tree species. Confidence values are a combination of the percentage of RF trees predicting a mean value within ± 1 standard deviation of the mean or the median absolute deviation among the 1,001 regression trees

FIA CodeScientific NameModel ReliabilityIV 0IV 1–3IV 4–6IV 7–10IV 11–20IV 21–30IV 31–50IV 51–100Model ReliabilityIV 0IV 1–3IV 4–6IV 7–10IV 11–20IV 21–30IV 31–50IV 51–100
12 Abies balsamea High0.8410.9710.8210.7620.7340.7250.6980High0.8190.9690.8180.7650.740.7260.6950
43 Chamaecyparis thyoides Low0.6590.9840.6620.5420.4680.5300Low0.6860.9830.6470.5350.4920.51800
61 Juniperus ashei High0.6610.9920.9080.8660.8030.7210.6570.745High0.8090.9910.9080.8620.7880.7180.680.748
68 Juniperus virginiana Medium0.9090.9640.8380.7260.6260.5830.5770Medium0.8810.9560.8440.7420.6410.5970.6190.521
71 Larix laricina High0.7770.9650.7720.6570.6530.6790.7290.677High0.7880.9630.7770.6650.6540.6890.7290.676
94 Picea glauca Medium0.6870.9620.7520.6190.5040.43700Medium0.8610.9580.7620.6150.530.47700
95 Picea mariana High0.6680.9690.8150.7250.690.7120.6810.746High0.5750.9660.810.7260.6850.7060.6850.751
97 Picea rubens High0.8620.9810.8270.7660.7260.6720.630High0.8240.9810.8320.7740.7250.670.6570
105 Pinus banksiana Medium0.6840.9780.8110.7160.6820.6360.6050Medium0.8460.9770.8080.7290.6850.6180.6150
107 Pinus clausa Medium0.6510.9870.8970.8030.6320.6110.5620.772Medium0.8180.9860.8980.8040.6420.6180.5920.744
110 Pinus echinata High0.9340.9590.7820.6910.6570.6970.7050.69High0.840.9530.7920.70.6570.6920.7070.687
111 Pinus elliottii High0.7330.9810.890.8540.7870.7020.7010.711High0.7470.9790.880.8490.7810.7060.7130.725
115 Pinus glabra Low0.7970.9630.5220.4780.411000Low0.670.9610.5270.4990.584000
121 Pinus palustris Medium0.7910.9750.8720.7550.6250.5890.6640Medium0.8740.9730.8710.7650.6350.5940.660
123 Pinus pungens Low0.7310.9780.5680.3990.445000Low0.730.9760.5420.4590000
125 Pinus resinosa Medium0.940.9720.8150.7030.6290.6370.6280Medium0.7950.9670.8150.7140.6320.640.6270
126 Pinus rigida High0.9430.9760.6570.6180.6980.660.6780.755High0.8290.9740.6580.6130.6640.6770.6910.738
128 Pinus serotina Medium0.7190.9780.7910.6460.6180.6160.6170.67Medium0.7490.9750.8120.6470.6250.6350.6090.681
129 Pinus strobus Medium0.8440.9650.8170.7110.6650.6670.650High0.8710.9570.8230.7240.6670.660.6670
131 Pinus taeda High0.790.9810.8920.8530.7820.7260.7360.735High0.830.9770.8910.8470.780.7280.7380.736
132 Pinus virginiana High0.890.9740.8190.7090.6450.6040.5070High0.9520.9710.8230.7130.6480.5930.5110
221 Taxodium distichum Medium0.9390.9670.7160.590.5650.6280.670Medium0.7150.9640.7270.620.5670.6270.6610
222 Taxodium ascendens Medium0.7480.9760.8210.7270.6490.540.6340.717Medium0.760.9740.8360.7310.680.5420.6250.814
241 Thuja occidentalis High0.6960.9710.8370.7440.6840.6550.6540.572High0.8490.9680.8380.7510.6860.6470.6620.578
261 Tsuga canadensis Medium0.9160.9620.7970.7110.6820.6330.5230High0.8370.9580.8010.720.6840.6350.510
311 Acer barbatum Low0.7720.960.5980.5750.326000Low0.8090.9580.5680.5590.327000
313 Acer negundo Low0.9180.9640.7330.6140.5390.4820.4850Low0.910.9610.7750.6810.5890.5390.4660
314 Acer nigrum Low0.9670.9860.3730.3920.319000Low0.9680.9820.4310.4140.38000
315 Acer pensylvanicum Medium0.8140.9210.6090.4550000Medium0.7970.9210.6140.4820000
316 Acer rubrum High0.8440.9410.8150.7660.7280.7060.6650.52High0.8620.9270.8230.7810.7340.7010.6710.537
317 Acer saccharinum Low0.9260.9760.7990.5960.4780.440.4150Low0.9470.9750.830.6210.5210.4640.4810
318 Acer saccharum High0.8450.9620.8440.7630.6970.6810.6680.478High0.8410.9550.8520.7770.7120.6760.6670.606
319 Acer spicatum Low0.6450.9220.5100000Low0.7030.9230.50600000
331 Aesculus glabra Low0.880.9740.430.4190.371000Low0.8590.9720.4970.4110.411000
332 Aesculus flava Low0.870.9690.5370.4930.429000Low0.830.9680.5320.5340.435000
356 Amelanchier spp. Low0.8920.9310.5070.4140.335000Low0.910.9280.5120.4120.357000
367 Asimina triloba Low0.840.9640.3740.7520.394000Low0.8950.9590.3770.5920000
371 Betula alleghaniensis High0.7850.9430.7360.7130.7110.66500High0.8790.940.7410.7120.7060.67400
372 Betula lenta High0.7340.9630.7750.7050.650.53400High0.890.9620.7820.7060.650.57800
373 Betula nigra Low0.8620.9720.4760.4240.4480.49700Low0.9270.9690.480.4060.440.69200
375 Betula papyrifera High0.8070.9450.7760.710.6920.6780.5250High0.7890.9390.7820.7130.6860.6920.5150
379 Betula populifolia Low0.6850.9670.6380.5080.433000Low0.8140.9650.6250.5060.462000
381 Sideroxylon lanuginosum ssp. lanuginosum Low0.8380.9690.5290.4550.4520.57600Low0.7650.9630.580.5030.4460.58900
391 Carpinus caroliniana Low0.8750.930.5590.520.465000Low0.8680.9230.5570.5020.461000
401 Carya aquatica Low0.9030.9710.640.550.5390.54800Medium0.8750.9690.7080.5990.5650.58300
402 Carya cordiformis Low0.9050.9520.5710.4790.4490.49700Low0.9030.9470.6270.50.4790.4400
403 Carya glabra Medium0.9290.9410.6930.5990.5460.69900Medium0.8880.930.7050.6020.557000
404 Carya illinoinensis Low0.9070.9750.7150.5340.4430.4120.4540Low0.8470.9720.7480.5960.5270.4510.4320
405 Carya laciniosa Low0.9320.9750.4360.360.378000Low0.8860.9730.4850.3870.402000
407 Carya ovata Medium0.9180.950.6830.5910.5250.42300Medium0.9040.9430.7180.6190.5590.5400
408 Carya texana Medium0.7740.970.740.6640.6260.5650.5780High0.8590.9680.7490.6790.6320.550.5450
409 Carya alba Low0.8610.9290.670.5860.5120.50900Medium0.8520.9190.6760.5850.5010.54700
421 Castanea dentata Unacceptable0.7890.9490.6840.2620000unacceptable0.7330.94800.2810000
452 Catalpa speciosa Unacceptable0.7740.9910.4460.2840.4830.1900Low
461 Celtis laevigata Low0.8260.9610.7530.6430.5950.5560.5420Medium0.7820.960.7740.7020.6370.6560.5840
462 Celtis occidentalis Low0.9310.9630.7290.5730.5070.500Medium0.9110.9610.7950.680.5890.5270.5040
471 Cercis canadensis Low0.8780.9420.4970.4430.488000Low0.9110.9340.5110.4380.541000
491 Cornus florida Medium0.8340.9070.570.4750.580.44300Medium0.8350.8970.5790.4810.5360.73100
521 Diospyros virginiana Low0.8250.9440.5830.50.5280.44700Low0.8110.9370.580.4750.4440.47300
531 Fagus grandifolia Medium0.8940.950.740.680.670.6870.6930High0.8880.9430.7520.6870.6670.6910.6850
541 Fraxinus americana Medium0.8830.9410.7490.6660.6380.5380.4660Medium0.860.9310.7650.6930.6470.530.5140
543 Fraxinus nigra Medium0.9250.960.7820.7030.6310.49400Medium0.7920.9550.7950.7080.6370.51700
544 Fraxinus pennsylvanica Low0.8650.9450.7430.6140.570.5080.4460Low0.8250.9360.7810.6710.5960.550.5070
546 Fraxinus quadrangulata Low0.8780.9810.4660.4440.41000Low0.8730.9780.5250.4680.419000
551 Gleditsia aquatica Unacceptable0.7830.9780.3870.2790.212000Low0.9180.9770.4080.3370.155000
552 Gleditsia triacanthos Low0.8880.9680.670.4940.4430.41800Low0.9440.9640.750.6170.5270.5160.3660
555 Gordonia lasianthus Medium0.8340.9740.7270.570.530.55300Medium0.7060.9710.7370.5880.5420.57200
571 Gymnocladus dioicus Unacceptable0.7640.9850.3310.7150.18900
580 Halesia spp. Low0.7070.9520.52700000Low0.7380.9580.50400000
591 Ilex opaca Medium0.740.9350.6680.5990.575000Medium0.80.9310.6710.6210.583000
601 Juglans cinerea Unacceptable0.8350.9770.3510.350.682000Low0.9470.9750.3920.3950.301000
602 Juglans nigra Low0.9060.9590.7250.5810.5260.4320.490Low0.9070.9560.7750.660.5760.4780.4620
611 Liquidambar styraciflua High0.8730.9620.8380.7810.7210.6310.50High0.8680.9580.8430.7920.7310.6330.510
621 Liriodendron tulipifera High0.9250.960.7950.7350.7020.6380.5280High0.9210.9530.8040.7470.7060.6430.540
641 Maclura pomifera Low0.9110.9770.8230.6030.4820.460.4340Medium0.9120.9750.8180.6960.5530.5580.570
651 Magnolia acuminata Low0.7730.9520.550.5680000Low0.9050.9520.5320.5740000
652 Magnolia grandiflora Low0.680.9520.5170.4320.401000Low0.6980.950.4860.4470.491000
653 Magnolia virginiana Medium0.8940.9510.7160.6540.6110.51500Medium0.9120.9470.7230.6470.6130.52500
654 Magnolia macrophylla Low0.7750.9680.6160.6370000Low0.7960.9670.5570.6190000
655 Magnolia fraseri Low0.8790.9580.5840.4420.487000Low0.8220.9560.5720.4460.464000
682 Morus rubra Low0.9220.9550.4980.4160.3920.45700Low0.9120.9460.6770.5520.4940.49100
691 Nyssa aquatica Medium0.7710.9780.6920.5310.5480.5930.6910Medium0.8480.9750.7080.5570.5170.5630.7520
693 Nyssa sylvatica Medium0.8510.9170.7050.6120.5180.49600Medium0.8520.9080.7070.6090.5220.50200
694 Nyssa biflora Medium0.7810.9690.7940.6750.6030.5510.4350Medium0.7420.9670.8060.6830.610.570.4250
701 Ostrya virginiana Low0.8840.9290.6030.5480.492000Low0.8540.9180.6140.5510.489000
711 Oxydendrum arboreum High0.7920.9360.6970.7050.667000High0.890.9330.7010.6930.679000
721 Persea borbonia Low0.7250.9440.6230.5380.60.36900Low0.7670.940.6390.5520.597000
722 Planera aquatica Low0.7110.9790.4190.4070.3420.35800Low0.7190.9770.4720.4070.387000
731 Platanus occidentalis Low0.8650.9590.580.4770.4780.3900Low0.9060.9520.5990.5070.4730.42800
741 Populus balsamifera Medium0.650.9650.7630.730.6510.5770.6330Medium0.6950.9620.7620.7160.6530.650.6650
742 Populus deltoides Low0.9440.9790.7150.5280.4180.3920.690Low0.9560.9740.7730.5630.4490.4340.3930
743 Populus grandidentata Medium0.8580.9520.7060.6010.5730.52200Medium0.80.9460.7150.6020.5740.54400
746 Populus tremuloides High0.90.9610.8260.7620.7230.7050.7040.746High0.7920.9550.8360.7730.7270.7090.7020.769
761 Prunus pensylvanica Low0.8450.9550.5540.5180.491000Low0.9220.9540.5820.4780.5000
762 Prunus serotina Medium0.8520.9270.7650.7060.6650.620.6170.512Medium0.7930.9160.7770.7050.6740.6470.6160.495
763 Prunus virginiana Unacceptable0.770.9580.4480.2990.36100
766 Prunus americana Unacceptable0.940.9790.4390.2810.2360.2480
802 Quercus alba Medium0.9020.9460.7980.7160.6760.6890.6760Medium0.8890.9360.8090.7320.6820.6830.6730
804 Quercus bicolor Low0.9060.9770.4680.4140.4220.35200Low0.9510.9740.5420.4530.3980.3700
806 Quercus coccinea Medium0.9170.9520.7130.6520.6150.5310.4560Medium0.8880.9470.7210.6520.620.6010.4650
809 Quercus ellipsoidalis Medium0.7740.9790.7620.6630.6360.59500Medium0.8290.9730.7690.6830.6410.600
812 Quercus falcata Medium0.8480.940.7310.6280.5420.48800Medium0.7780.9330.7350.6340.5420.48400
813 Quercus pagoda Medium0.7270.9540.630.5840.5620.37400Medium0.8740.9490.6440.60.555000
816 Quercus ilicifolia Unacceptable0.9620.980.2980.96100.7040
817 Quercus imbricaria Low0.840.9780.690.5690.4960.4080.4750Medium0.9540.9770.730.6570.5520.4180.50
819 Quercus laevis Medium0.7020.9760.7290.5980.5930.60700Medium0.830.9730.750.6230.5850.60400
820 Quercus laurifolia Medium0.8420.9550.7910.6960.6160.5730.5590Medium0.750.9520.80.7040.6190.60.5880
822 Quercus lyrata Low0.8770.9680.6570.5550.5360.49400Medium0.890.9650.6930.5820.5530.6010.3570
823 Quercus macrocarpa Medium0.9270.9740.7970.6760.5990.580.5480Medium0.9420.9680.830.730.6480.5720.6050
824 Quercus marilandica Medium0.8210.9640.7590.680.6250.5380.5130Medium0.8040.9580.7660.6690.6040.620.5660
825 Quercus michauxii Low0.7980.9550.4460.4140.438000Low0.9080.9520.4260.4830.46000
826 Quercus muehlenbergii Low0.9390.9660.6240.5960.49000Medium0.8760.9590.6640.5940.535000
827 Quercus nigra High0.6930.950.8280.7340.6620.5120.4340High0.8830.9430.8360.7450.6620.5280.40
828 Quercus texana Low0.7360.9770.6540.5260.5990.41200Low0.7910.9770.7240.5950.6360.4410.3640
830 Quercus palustris Low0.8780.9810.6820.5050.4460.45500Low0.8910.980.7160.5610.5010.44400
831 Quercus phellos Low0.7610.9540.6520.5560.5170.46900Low0.8290.9490.6760.5590.5470.46400
832 Quercus prinus High0.840.9710.8090.7350.6990.6860.6390High0.9180.970.8170.7420.7030.6720.6620
833 Quercus rubra Medium0.8980.9360.7590.6780.6640.5550.5640Medium0.880.9260.7780.6950.6590.5530.610
834 Quercus shumardii Low0.8110.9710.4830.4250.421000Low0.8180.9660.5060.4540.419000
835 Quercus stellata High0.8220.9510.8170.7640.7050.6710.6640.701High0.7710.9410.8120.760.7080.6860.7020.705
837 Quercus velutina Medium0.8920.9460.7440.6680.6680.6660.6370High0.8750.9360.7630.6870.6680.6750.6490
838 Quercus virginiana High0.6640.9740.880.8170.7320.6280.6340.699High0.6270.9710.8750.8220.7420.6770.6650.686
842 Quercus incana Low0.8270.970.4990.4720.384000Low0.7130.9670.5330.4990.426000
901 Robinia pseudoacacia Medium0.9420.9710.7570.6160.510.4180.3250Low0.9290.9640.7750.6270.520.4560.340
912 Sabal palmetto Medium0.7110.9830.8960.8570.7070.6170.5820.67Medium0.5830.9810.8810.8490.750.6650.6120.546
921 Salix amygdaloides Unacceptable0.8250.9860.3810.3230.2660.2350
922 Salix nigra Low0.9380.9730.6520.4830.4390.3950.3570Low0.9260.9660.7070.5170.4940.4560.4360
931 Sassafras albidum Low0.8590.9420.6670.5590.4730.53600Low0.8690.9340.6790.5860.4980.56300
935 Sorbus americana Low0.7110.9300.3080000Low0.7070.9310.38200000
951 Tilia americana Medium0.9290.9530.7060.630.5870.4920.5620Medium0.9060.9440.7280.6360.6120.54800
971 Ulmus alata Medium0.8550.9420.7570.6920.5860.50600Medium0.7750.9340.7620.6940.6120.5160.3810
972 Ulmus americana Low0.8460.9330.7510.6370.6050.5250.510Medium0.8070.9250.7860.7050.6420.5550.5030
973 Ulmus crassifolia Low0.7120.9730.8620.7210.5670.5090.5160Medium0.7860.9730.8560.770.6530.5830.5030
975 Ulmus rubra Low0.9080.9440.5810.5360.4650.400Low0.8890.9360.6320.5380.4950.44800
977 Ulmus thomasii Unacceptable0.8550.9820.2770.261000
AverageAll0.8290.9610.6660.5850.5150.4040.2580.090All0.8370.9570.6820.5970.5230.4010.2730.094
High0.8050.9640.8110.7470.7060.6370.5560.366High0.8320.9590.8080.7420.6990.6420.5710.308
Medium0.8310.9570.7530.6570.5940.5370.3640.078Medium0.8280.9530.7580.6630.5910.5170.3400.081
Low0.8380.9610.5860.4960.4240.2540.1000.000Low0.8500.9600.5690.4760.3880.1930.0730.000

Species' range characteristics

Among the 135 species modeled, 96 were considered to have narrow distributions within the eastern United States, 85 had sparse FIA densities, and 68 had low mean IVs (Table 2). Among the 10 species withdrawn resulting from narrow and uncommon distributions, one was from the NSH (i.e., narrow range, sparse density, high abundance) and nine were from NSL (narrow range, sparse density, low abundance) classes. Scaled RF R 2 and FK tended to be higher among species with dense FIA records rather than sparse records, especially for narrow distributions (Table 2). The NDL class (N = 3) had the highest ModRel (0.83–0.87) for each scale among all the distributional range classes, while the NSL class (N = 41) had the lowest ModRel (0.47). Overall, ModRel was generally higher (e.g., mean score of 0.76) for species with dense FIA records compared to species with sparse density of FIA plots (mean of 0.57). ModRel was also somewhat higher for species with high mean IV (0.69) versus low mean IV (0.64).

Predictor importance

The assessment of importance for 45 predictor variables placed TSUMavg, 30‐year mean temperature of the warmest month, as the most influential variable, followed by TWINavg, the 30‐year mean temperature of the coldest month (Appendix A3). The seven climate variables were in the top 50% of VarImpIndx scores, as well as day length CV and some soil properties (e.g., PH, SIEVE10, SProd, and KSAT). However, highly correlated variables, like some of the temperature variables, are often interchangeable within the RF models and likely provide the same information. The variables of elevation, soil properties, and types scored lower, having overall FreqTop10 and SumRankRecip scores under 40. Such lower values suggest that this type of information is often important for HS models of specific species or regions, but not so much across all species or the entire eastern United States.
Table A3

Overall predictor variable importance for 45 environmental data layers used to model suitable habitat of 135 tree species among DISTRIB‐hybrid grids. Values have been normalized to a 0–100 scale

SumVarImp1 SumRankRecip2 FreqTop103 VarImpIndex4
Climate
PANN84.254.780.273
PGrow74.639.452.355.4
TANN96.36698.887
TGrow95.162.594.283.9
TSUMavg99.410010099.8
TWINavg10092.398.897.1
Aridity76.642.661.660.3
Elevation
ElvCV5811.512.827.4
ElvMAX67.923.629.140.2
ElvMEAN70.93034.945.3
ElvMEDIAN6115.915.130.7
ElvMIN61.515.11430.2
ElvRANGE64.519.925.636.7
ElvStdDev73.334.138.448.6
Geographic
DayLenCV90.751.777.973.5
Soil properties
AWC68.330.534.944.6
AWS65.523.723.337.5
BD3RDBAR65.626.329.140.3
CACO344.711.610.522.2
CEC763.520.125.636.4
DEP2WATTBL70.128.532.643.7
KFACTRF65.520.522.136
TFACTOR62.719.923.335.3
KSAT69.541.14351.2
OM69.627.534.944
CLAYEY30.61512.819.5
LOAMY53.610.211.625.2
SANDY50.96.44.720.6
OTHER4411.38.121.1
CLAY65.225.324.438.3
SAND65.928.730.241.6
SILT68.134.937.246.8
PH84.151.166.367.2
SIEVE1074.143.944.254.1
SIEVE2006724.227.939.7
SProd72.746.14354
Soil type
Alfisols57.334.329.140.2
Aridisols0000
Entisols17.153.58.5
Histosols29.55.88.114.5
Inceptisol62.62927.939.8
Mollisols44.724.827.932.5
Spodosols248.711.614.8
Ultisols46.625.430.234.1
Vertisols18.513.28.113.3

Variable names are described in Table 1. SumVarImp1 = sum of predictor importance scores from Random Forest (percent increase) across all species. SumRankRecip2 = sum of the reciprocal of rank of each predictor across all species. FreqTop103 = frequency of the top 10 predictors with the highest importance across all species. VarImpIndx4 = Variable Importance Index is overall mean importance defined by three normalized metrics which indicates the variables influence in models for the 135 species.

DISCUSSION

The DISTRIB models

DISTRIB‐II, which will use the DISTRIB‐hybrid models, is a statistical model which predicts HS of individual tree species as IVs, based on inventory and environmental data. The spatial resolution of DISTRIB‐II has been increased where inventory densities are high by a hybrid lattice; however, each cell still represents a relatively large area (100–400 km2) for which the model output indicates a potential mean importance of a species. Assessing output from DISTRIB‐II at local sites is beyond the scope of the model as inventories within cells are aggregated to provide a representation of the species across the grid cell. Additionally, site‐specific conditions mediate establishment and competition among species (Clark, Gelfand, Woodall, & Zhu, 2014), which are only indirectly accounted for by inventory plots and are better assessed by modeling of colonization likelihoods (Prasad, Gardiner, Iverson, Matthews, & Peters, 2013; Prasad et al., 2016) and by local forest managers. The DISTRIB‐II models are intended to provide macroscale information about the IVs for 125 tree species modeled within the eastern United States under current and future (see Iverson, Peters, Prasad, & Matthews, 2019) conditions. The models are highly dependent on FIA plot data, and the hybrid lattice approach is somewhat akin to the first DISTRIB models which predicted species IVs among counties using regression trees (Iverson & Prasad, 1998). While the underlying framework (e.g., statistical regression trees, climate, elevation, and soil data) of the DISTRIB‐II modeling approach is similar to previous versions, predicted values between DISTRIB versions will differ as a result of changes in source data and modifications to how environmental data were processed. Thus, each DISTRIB version represents a snapshot derived from current data and scientific knowledge, and one would expect each version to differ in the modeled spatial patterns and trends of abundance for each species. In our current efforts, differences arise from using a hybrid lattice approach, but also from (a) newer FIA records, (b) recent 30‐year climate normals, (c) a newer set of predictor variables, (d) removal of outlier training data, and (e) modifying predicted IVs with the mean–median combination. The FIA data from the most recently completed cycles were inventoried during the periods 2000–2016, and while many individual trees likely established several decades earlier, we chose the current 30‐year climate normal (i.e., 1981–2010) to account for recent changes and potential stressors that may be reflected by the inventories compared to earlier climate normals. Trees are long‐lived, and newly established individuals are likely responding to more recent climatic conditions and disturbances; using current conditions, we aim to capture these responses when exploring changes under future projections. By refining the spatial resolution of DISTRIB‐II models and examining how cells size affected model performance, we have shown that for some metrics, the DISTRIB‐hybrid models had higher model performance as compared to the DISTRIB‐10 models. This behavior is likely due to added information in cells that would otherwise be excluded from DISTRIB‐10 training data because of too few FIA plots. However, by combining metrics that target specific aspects (e.g., presence/absence or fuzzy values) of the overall model's performance, the DISTRIB‐hybrid models for many species perform similarly or better than DISTRIB‐10. In the case of FK scores, however, the DISTRIB‐10 models were higher than the DISTRIB‐hybrid model, likely resulting from the increased number of cells and more closely predicted values between classes. Additionally, ModRel scores and the confidence values derived from RF predictions help to interpret potential responses from individual species by indicating overall confidence and where predictions agree or disagree.

Comparing the hybrid grid to other approaches

Thiessen polygons, though used in models for birds (Schlicht, Valcu, & Kempenaers, 2014; Wilkin, Perrins, & Sheldon, 2007) and amphibians (Holcombe, Stohlgren, & Jarnevich, 2007), like other irregular geometries used to model movement (Holland et al., 2007), have not been used generally to model vegetation patterns. This trend may be a result from the underlying vegetation inventory area having higher densities of sampling data or the plethora of gridded digital data available. Our use of a hybrid lattice borrows the spatial structure component of Thiessen polygons, but retains the uniformness of gridded data to stratify the landscape. Approaches that use regional models (Ellenwood, Krist, & Romero, 2015) within a larger extent to predict local HS are useful when considering current ranges, but may not fully capture range‐wide conditions necessary to explore habitat changes arising from climate change. Therefore, we did not attempt to employ such techniques, but acknowledge that for current conditions, such approaches may improve model performance by reducing zero‐inflated datasets (Savage, Lawrence, & Squires, 2015). Other modeling efforts have produced predictions at spatial resolutions <500 m (Evans & Cushman, 2009; Rehfeldt, Worrall, Marchetti, & Crookston, 2015; Wilson, Lister, & Riemann, 2012) and as computational power increases and downscaled climate datasets become more widely available, this trend may become more prominent. However, while high‐resolution habitat models are desirable, the sampling density of inventory data remains a limitation in that sparsely sampled regions may not provide sufficient information for modeling. A spatial grid, derived by the density of a focal object (e.g., FIA plots), may provide a more representative dataset for model training, and as such, we explored here the potential use of a hybrid lattice to model tree HS across the eastern United States, which has forest coverage ranging from nearly null in the “Corn Belt” to ~100% in the Appalachian Mountains. We found that the hybrid lattice capitalizes on the varying density of FIA plots to maximize the information content across the region; with ~85% of cells across the eastern United States having sufficient FIA plots for the 10‐km grid, we realize a fourfold increase in spatial resolution on tree species attributes over our earlier estimates with a 20‐km grid.

Transferability of modeling approach

Forest managers and decision makers often rely on modeled HS, primarily based on a uniform grid, which can have spatial resolutions too coarse for local needs. Although data aggregation techniques can readily process gridded and vector datasets independently, and methods exist to aggregate values between these two forms, results can include accuracy errors (Openshaw, 1983; Turner, O'Neill, Gardner, & Milne, 1989). The hybrid lattice approach presented here was applied to forest inventory data which was the main limitation to increasing the spatial resolution for our HS models. All environmental predictor datasets were available at native resolutions <1‐km grids. The process to derive the hybrid lattice can be useful for any information meeting specified criteria and applied iteratively to create multinested grids across the landscape. Random Forest does not directly consider any spatial information (e.g., proximity or size), and our models were developed with tabular data, rather than gridded datasets. Other statistical techniques commonly used to model HS would have to accept tabular records or vector data, as raster data would have to be based on the smallest area within the hybrid lattice. A raster dataset would produce many duplicate values among the larger grids of a hybrid lattice and could influence the models similarly to zero‐inflated datasets by artificially increasing combinations of response and covariates. Thus, we believe the hybrid lattice approach presented here represents an improvement in DISTRIB‐II, and we advocate for its wider use in HS modeling.

CONFLICT OF INTEREST

None declared.

AUTHORS' CONTRIBUTIONS

MPP processed environmental data, modeled habitat suitability, and led the writing of the manuscript; LRI and SNM developed model reliability scores and contributed to the manuscript; AMP processed FIA records, provided guidance on Random Forest modeling, and contributed to the manuscript.
  8 in total

1.  A new model to simulate climate-change impacts on forest succession for local land management.

Authors:  Gabriel I Yospin; Scott D Bridgham; Ronald P Neilson; John P Bolte; Dominique M Bachelet; Peter J Gould; Constance A Harrington; Jane A Kertis; Cody Evers; Bart R Johnson
Journal:  Ecol Appl       Date:  2015-01       Impact factor: 4.657

2.  Using niche-based models to improve the sampling of rare species.

Authors:  Antoine Guisan; Olivier Broennimann; Robin Engler; Mathias Vust; Nigel G Yoccoz; Anthony Lehmann; Niklaus E Zimmermann
Journal:  Conserv Biol       Date:  2006-04       Impact factor: 6.560

3.  Exploring tree species colonization potentials using a spatially explicit simulation model: implications for four oaks under climate change.

Authors:  Anantha M Prasad; Judith D Gardiner; Louis R Iverson; Stephen N Matthews; Matthew Peters
Journal:  Glob Chang Biol       Date:  2013-05-02       Impact factor: 10.863

4.  Modeling plant species distributions under future climates: how fine scale do climate projections need to be?

Authors:  Janet Franklin; Frank W Davis; Makihiko Ikegami; Alexandra D Syphard; Lorraine E Flint; Alan L Flint; Lee Hannah
Journal:  Glob Chang Biol       Date:  2012-11-09       Impact factor: 10.863

5.  More than the sum of the parts: forest climate response from joint species distribution models.

Authors:  James S Clark; Alan E Gelfand; Christopher W Woodall; Kai Zhu
Journal:  Ecol Appl       Date:  2014-07       Impact factor: 4.657

6.  Confidence Intervals for Random Forests: The Jackknife and the Infinitesimal Jackknife.

Authors:  Stefan Wager; Trevor Hastie; Bradley Efron
Journal:  J Mach Learn Res       Date:  2014-01       Impact factor: 3.654

7.  Landscape as a model: the importance of geometry.

Authors:  E Penelope Holland; James N Aegerter; Calvin Dytham; Graham C Smith
Journal:  PLoS Comput Biol       Date:  2007-09-05       Impact factor: 4.475

8.  Predicting species distributions for conservation decisions.

Authors:  Antoine Guisan; Reid Tingley; John B Baumgartner; Ilona Naujokaitis-Lewis; Patricia R Sutcliffe; Ayesha I T Tulloch; Tracey J Regan; Lluis Brotons; Eve McDonald-Madden; Chrystal Mantyka-Pringle; Tara G Martin; Jonathan R Rhodes; Ramona Maggini; Samantha A Setterfield; Jane Elith; Mark W Schwartz; Brendan A Wintle; Olivier Broennimann; Mike Austin; Simon Ferrier; Michael R Kearney; Hugh P Possingham; Yvonne M Buckley
Journal:  Ecol Lett       Date:  2013-10-17       Impact factor: 9.492

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.