Literature DB >> 23437239

Contrasting diversity values: statistical inferences based on overlapping confidence intervals.

Ian MacGregor-Fors1, Mark E Payton.   

Abstract

Ecologists often contrast diversity (species richness and abundances) using tests for comparing means or indices. However, many popular software applications do not support performing standard inferential statistics for estimates of species richness and/or density. In this study we simulated the behavior of asymmetric log-normal confidence intervals and determined an interval level that mimics statistical tests with P(α) = 0.05 when confidence intervals from two distributions do not overlap. Our results show that 84% confidence intervals robustly mimic 0.05 statistical tests for asymmetric confidence intervals, as has been demonstrated for symmetric ones in the past. Finally, we provide detailed user-guides for calculating 84% confidence intervals in two of the most robust and highly-used freeware related to diversity measurements for wildlife (i.e., EstimateS, Distance).

Entities:  

Mesh:

Year:  2013        PMID: 23437239      PMCID: PMC3577692          DOI: 10.1371/journal.pone.0056794

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Measuring biodiversity is one of the major goals of ecologists around the world [1]. As suggested by Hubbell [2], biodiversity can be summarized by the species richness and relative abundances of a community in a given space and time. For decades, ecologists have used many different methods to calculate and contrast species richness, relative abundances, and/or diversity values. Most simply, ecologists often contrast species richness and abundances relative to sampling effort among different conditions using tests for comparing means (e.g., ANOVA, Kruskal-Wallis; [3]–[5]). Also, many indices have been developed to measure species richness and diversity (see Moreno [1], [6]–[8] for further details). However, many popular software applications do not support performing standard inferential statistics for estimates of diversity (e.g., species richness, density). Recently, the use of two methods for quantifying species richness and individual densities have became very popular due to their robustness: (1) rarefaction curves produced by randomly re-sampling the pool of total individuals or sampling units, plotting the estimated number of species in relation to a given number of individuals or sampling units [9]–[11], and (2) distance-sampling calculation of densities (number of individuals per area unit - e.g., hectares, square kilometers), calculated based on the probability of detection of individuals at increasing distances from the observer and the size of the successfully surveyed area [8]. Both methods can be calculated using freeware. Rarefaction curves can be generated using the output from the software EstimateS [12], which computes the expected number of species as a function number of accumulated samples (sample-based rarefaction, denoted Sobs [Mao Tao] in EstimateS) with symmetric 95% confidence intervals (Sobs 95% CI Upper and Lower Bounds). Densities can be calculated using the software Distance [13], for which asymmetric 95% confidence intervals, based on assuming the distributions of the density estimate is log-normal, are output as a default by the program. As software programs such as EstimateS and Distance output results that cannot be contrasted directly though inferential statistics, degree of overlap between confidence intervals has been proposed to assess statistical differences [14]. Such comparisons allow testing null hypotheses regarding different environmental conditions (e.g., habitats, treatments). Although other approaches to hypothesis testing for Distance have been shown to contrast density values effectively (e.g., ANOVA, t-tests), they often require experience using sophisticated processes in statistical packages. As demonstrated by Payton et al. [14], when comparing overlapping 95% confidence intervals of independent treatments with similar standard errors, non-overlapping confidence intervals represent significant differences in expectations with extremely low probabilities of Type I error (α <0.01), while no statistical inferences can be drawn with certainty if confidence intervals overlap but are not coincident. However, Payton et al. [14] showed that comparing 83–84% confidence intervals, instead of 95%, represents statistical tests with an α level of 0.05 (Fig. 1), the conventional criterion of significance for biological and ecological analyses [15].
Figure 1

Hypothetical scenario comparing diversity values with 95% and 84% confidence intervals.

In the example on the left (A vs. B), with 95% confidence intervals, no conclusion can be drawn regarding statistical difference in diversity values at P = 0.05. In the example on the right (A′ vs B′), with 84% confidence intervals but the same means as on the left, we can confidently infer that diversity values differ at P<0.05.

Hypothetical scenario comparing diversity values with 95% and 84% confidence intervals.

In the example on the left (A vs. B), with 95% confidence intervals, no conclusion can be drawn regarding statistical difference in diversity values at P = 0.05. In the example on the right (A′ vs B′), with 84% confidence intervals but the same means as on the left, we can confidently infer that diversity values differ at P<0.05. As the 83–84% rule has previously been demonstrated only for normally distributed confidence intervals, in this study we simulated how asymmetric log-normal confidence intervals behave and determined a confidence interval level for mimicking two-sample statistical tests with α = 0.05. As the log-normal distribution is a normal distribution on the log-scale, we predicted that the 83–84% rule should also apply to asymmetric log-normal confidence intervals. We also describe how to calculate different percentage confidence intervals for rarefaction curves and distance-sampling based densities and indicate how to contrast them, representing a novel way to statistically compare species richness and density values robustly.

Materials and Methods

Simulations for Mimicking Pairwise Tests Based on Asymmetric Confidence Intervals

We performed simulations to establish the confidence intervals at which P<0.01, 0.05, and 0.10 Type I error was achieved, mimicking pairwise tests with PC SAS [16]. In order to explore how the proposed method behaves for various types of log-normal distributions, we created several combinations of the two parameters of the log-normal distribution (µ and σ). Specifically, we created 48 different log-normal distributions by utilizing 6 different levels for μ and 8 different levels for σ in an effort to cover a variety of different distributions. For the purposes of these simulations, we generated samples from parent populations which were generated by assuming different means and corresponding standard errors, which are functions of the parameters utilized to create these parent populations. Thus, as we assessed the behavior of asymmetric confidence intervals, we calculated a confidence interval for each of two samples drawn from the same population, each with alpha values varying from 0.05 to 0.25, at 0.01 increments. We calculated 10,000 iterations of each simulation scenario, including populations with different means extracted from the same parent populations. For each iteration, we calculated 0.75% to 0.95% confidence intervals in 1% increments, and we used this series of confidence intervals to determine the proportion of times the simulated confidence intervals overlap for each nominal level of confidence. Note that the log-normal distribution’s coefficient of variation is a function of σ only [17], so changing the mean of the distribution changes, by definition, the variance also.

Results

For almost all of the scenarios contrasting samples with different means, the 84% confidence intervals provided overlap probability that best mimicked a two-tailed two population test with a 0.05 error rate. To mimic a 0.01 test, 94% confidence intervals would appear to be the proper choice. Confidence intervals at the 76% level best mimic a test with a 0.10 error rate (Tables 1, 2).
Table 1

Simulation results of 10,000 iterations calculating the overlap of confidence intervals of various sizes generated from log-normal populations with mean of 12.2 and variance of 0.08 (log-normal parameter values of μ = 2.5 and σ2 = 0.0005).

Size of CIs (%)Average overlapSize of CIs (%)Average overlap
950.994684*0.9543*
940.9898830.9494
930.9899820.9448
920.9855810.9312
910.9838800.9318
900.9802790.9276
890.9768780.9149
880.9687770.909
870.967176*0.9026*
860.9624750.8966
850.9608

Values that represent the preferred choice of confidence interval to mimic tests with alpha of 0.05 and 0.10 are marked with an asterisk (*). This table represents only one set of parameter values considered (among 48 sets considered) and is meant to represent typical results associated with the other population values considered in our simulations.

Table 2

Appropriate sizes of confidence intervals to simulate P = 0.05 and P = 0.10 size tests for various combinations of log-normal parameter values and associated means and variances.

μσ 2MeanVarianceSize of CIs (%)Average overlap
4.50.000590.044.05840.9541
760.9032
0.00190.068.12840.9546
760.903
0.001590.0812.18840.9507
760.9049
0.00290.1116.25840.953
760.9036
0.002590.1320.33830.9506
760.9065
0.00390.1524.41840.9541
750.9001
0.003590.1728.51840.9566
770.9114
0.00490.232.61830.9505
760.9042
5.50.0005244.7529.96840.952
770.9071
0.001244.8159.96830.9517
760.9043
0.0015244.8890.01840.9554
750.904
0.002244.94120.11840.9509
760.9049
0.0025245150.25850.9582
760.901
0.003245.06180.43840.9534
760.9022
0.0035245.12210.66840.9525
760.9057
0.004245.18240.94830.9514
760.9041
6.50.0005665.31221.37840.9533
760.9039
0.001665.47443.08840.9535
760.9011
0.0015665.64665.12840.9557
750.9017
0.002665.81887.49840.9524
760.9012
0.0025665.971110.19840.9517
760.901
0.003666.141333.23830.9508
770.9091
0.003566.311556.6830.9504
760.9042
0.004666.471780.3840.9521
760.9028

Results from simulations including 10,000 iterations.

Values that represent the preferred choice of confidence interval to mimic tests with alpha of 0.05 and 0.10 are marked with an asterisk (*). This table represents only one set of parameter values considered (among 48 sets considered) and is meant to represent typical results associated with the other population values considered in our simulations. Results from simulations including 10,000 iterations.

Discussion

As predicted, our results show that comparing the overlap, or lack of it, between 84% asymmetric confidence intervals pertaining to different means mimics 0.05 tests surprisingly well (Fig. 1, 2). Thus, this study provides empirical evidence that the 84% rule is suitable for mimicking 0.05 statistical tests for both symmetric and asymmetric confidence intervals. However, we did not explore the statistical power of the method (regarding Type II errors), since the primary concern of this paper was to create a process that best mimicked an alpha-level test, and the use of overlapping 84% confidence intervals for this method would be more powerful, by definition, than using 95% intervals. Assessing power for this situation would involve constructing distributions with different means (and, by virtue of the nature of the log-normal distribution, different variances) and assessing the ability of the method to detect differences in overlapping confidence intervals with different means. Though our results have been demonstrated effective only for normal symmetric intervals and for log-normal asymmetric intervals, we believe that the 84% rule for mimicking 0.05 tests with overlapping confidence intervals might work effectively for other distributions. For example, comparing 84% confidence intervals for species estimation comparisons using widely used non-parametric estimators (e.g., Chao1, Chao2, ICE, ACE, Jackknife, Bootstrap), could mimic 0.05 tests. However, it remains to be tested.
Figure 2

Comparison of the use of 95% and 84% confidence intervals in three replicates of our simulations.

For this representative example, the data were created from a log-normal population with a mean of 90.2 and variance of 32.6. In case 1, the both sets of intervals overlap, both suggesting that no significant (NS) differences exist. Note, however, that the 95% confidence intervals will yield an error rate of less than 1%, while the 84% confidence intervals better mimic a 0.05 level test. In case 2, 95% confidence intervals slightly overlap, while 84% ones do not. For this situation, these two approaches would lead to different conclusions: (a) significant differences (*) when considering 84% confidence intervals, and (b) no statistical differences can be inferred using 95% confidence intervals (?). In case 3, none of the sets of intervals overlap, both suggesting that significant differences exist. Note, however, that statistical differences using 95% confidence intervals are assumed with an error rate of less than 1%, while that of 84% confidence intervals better mimic a 0.05 level test.

Comparison of the use of 95% and 84% confidence intervals in three replicates of our simulations.

For this representative example, the data were created from a log-normal population with a mean of 90.2 and variance of 32.6. In case 1, the both sets of intervals overlap, both suggesting that no significant (NS) differences exist. Note, however, that the 95% confidence intervals will yield an error rate of less than 1%, while the 84% confidence intervals better mimic a 0.05 level test. In case 2, 95% confidence intervals slightly overlap, while 84% ones do not. For this situation, these two approaches would lead to different conclusions: (a) significant differences (*) when considering 84% confidence intervals, and (b) no statistical differences can be inferred using 95% confidence intervals (?). In case 3, none of the sets of intervals overlap, both suggesting that significant differences exist. Note, however, that statistical differences using 95% confidence intervals are assumed with an error rate of less than 1%, while that of 84% confidence intervals better mimic a 0.05 level test. In order to generate 84% confidence intervals for rarefaction analyses, the standard deviation of the observed species (Mao Tao SD) from the output file from EstimateS is needed. As standard deviations equal standard errors in EstimateS because infinite degrees of freedom are assumed in the calculation of Mao Tao SD, the latter must be multiplied by 1.372, the quantile (normal curve z-score) corresponding to two-sided intervals of 84% probabilities, with alpha = 0.16, and cumulative probabilities of 0.08 and 0.92. For example, if Mao Tao SD = 5.55, for example, 84% confidence intervals for that specific value of Mao Tao SD, which can vary in relation to the number of accumulated computed individuals in a rarefaction plot, are equal to the average value ±7.61. As the Distance program can calculate user-selected levels for confidence intervals (default = 0.95) for distance-sampling density calculations, setting the confidence interval limits solves the issue. To accomplish this, go the “Analyses” button on the toolbar, select “Analysis details” and a new window will appear. Finally, select the “Misc” tab and modify the default value for confidence intervals (i.e., 95) to 84. Results output from the Distance program will now include 84% confidence intervals. Wildlife species richness and density measurements of ecosystems are imperative in order to concentrate conservation actions in highly biodiverse areas [1]. In this paper, we demonstrated that the 84% rule mimics 0.05 pairwise statistical tests for both symmetric and asymmetric confidence intervals, with detailed users’ guides for calculating 84% confidence intervals in two of the most robust and highly-used freeware applications related to biodiversity (i.e., EstimateS, Distance). Thus, we encourage ecologists to use these programs to calculate species richness and individual density statistical expectations, applying this easy-to-use overlapping confidence interval method when making statistical inferences, which represents an alternative to the use of diversity indices.
  3 in total

1.  A consistent terminology for quantifying species diversity?

Authors:  Claudia E Moreno; Pilar Rodríguez
Journal:  Oecologia       Date:  2010-03-06       Impact factor: 3.225

2.  Distance software: design and analysis of distance sampling surveys for estimating population size.

Authors:  Len Thomas; Stephen T Buckland; Eric A Rexstad; Jeff L Laake; Samantha Strindberg; Sharon L Hedley; Jon Rb Bishop; Tiago A Marques; Kenneth P Burnham
Journal:  J Appl Ecol       Date:  2010-02       Impact factor: 6.528

3.  Overlapping confidence intervals or standard error intervals: what do they mean in terms of statistical significance?

Authors:  Mark E Payton; Matthew H Greenstone; Nathaniel Schenker
Journal:  J Insect Sci       Date:  2003-10-30       Impact factor: 1.857

  3 in total
  21 in total

1.  Pathogen shifts in a honeybee predator following the arrival of the Varroa mite.

Authors:  Kevin J Loope; James W Baty; Philip J Lester; Erin E Wilson Rankin
Journal:  Proc Biol Sci       Date:  2019-01-16       Impact factor: 5.349

2.  Effect of heifer-raising practices on E. coli antimicrobial resistance and Salmonella prevalence in heifer raisers.

Authors:  R V Pereira; J D Siler; K J Cummings; M A Davis; L D Warnick
Journal:  Epidemiol Infect       Date:  2015-04-23       Impact factor: 2.451

3.  The specificity of host-bat fly interaction networks across vegetation and seasonal variation.

Authors:  Mariana Zarazúa-Carbajal; Romeo A Saldaña-Vázquez; César A Sandoval-Ruiz; Kathryn E Stoner; Julieta Benitez-Malvido
Journal:  Parasitol Res       Date:  2016-06-22       Impact factor: 2.289

4.  Patterns of Bird Diversity and Endemism Along an Elevational Gradient in the Southern Mexican Highlands.

Authors:  Edson A Alvarez-Alvarez; Rosalba Rodríguez-Godínez; Pablo Sierra-Morales; Sandy A Medina-Valdivia; Estefanía Vázquez-Salgado; Marlene Brito-Millán; R Carlos Almazán-Núñez
Journal:  Zool Stud       Date:  2020-12-15       Impact factor: 2.058

5.  Clonemate cotransmission supports a role for kin selection in a puppeteer parasite.

Authors:  Charles D Criscione; Bradley J van Paridon; John S Gilleard; Cameron P Goater
Journal:  Proc Natl Acad Sci U S A       Date:  2020-03-02       Impact factor: 11.205

6.  Effect of preweaned dairy calf housing system on antimicrobial resistance in commensal Escherichia coli.

Authors:  R V Pereira; J D Siler; J C Ng; M A Davis; L D Warnick
Journal:  J Dairy Sci       Date:  2014-10-11       Impact factor: 4.034

7.  Crop landscapes reduced taxonomic and functional richness but increased evenness of aquatic macroinvertebrates in subtropical rivers.

Authors:  Gomez Daniela; Molineri Carlos
Journal:  Environ Monit Assess       Date:  2019-10-31       Impact factor: 2.513

8.  Bat-fruit interactions are more specialized in shaded-coffee plantations than in tropical mountain cloud forest fragments.

Authors:  Jesús R Hernández-Montero; Romeo A Saldaña-Vázquez; Jorge Galindo-González; Vinicio J Sosa
Journal:  PLoS One       Date:  2015-05-18       Impact factor: 3.240

9.  Combining point counts and autonomous recording units improves avian survey efficacy across elevational gradients on two continents.

Authors:  Anna Drake; Devin R de Zwaan; Tomás A Altamirano; Scott Wilson; Kristina Hick; Camila Bravo; José Tomás Ibarra; Kathy Martin
Journal:  Ecol Evol       Date:  2021-06-01       Impact factor: 2.912

10.  Citizens' Willingness to Support New Taxes for COVID-19 Measures and the Role of Trust.

Authors:  Erick Lachapelle; Thomas Bergeron; Richard Nadeau; Jean-François Daoust; Ruth Dassonneville; Éric Bélanger
Journal:  Polit Policy       Date:  2021-04-19
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.