| Literature DB >> 35415952 |
Sean Hoban1, Frederick I Archer2, Laura D Bertola3, Jason G Bragg4, Martin F Breed5, Michael W Bruford6, Melinda A Coleman7, Robert Ekblom8, W Chris Funk9, Catherine E Grueber10, Brian K Hand11, Rodolfo Jaffé12, Evelyn Jensen13, Jeremy S Johnson14, Francine Kershaw15, Libby Liggins16, Anna J MacDonald17, Joachim Mergeay18,19, Joshua M Miller20, Frank Muller-Karger21, David O'Brien22, Ivan Paz-Vinas23, Kevin M Potter24, Orly Razgour25, Cristiano Vernesi26, Margaret E Hunter27.
Abstract
Biodiversity underlies ecosystem resilience, ecosystem function, sustainable economies, and human well-being. Understanding how biodiversity sustains ecosystems under anthropogenic stressors and global environmental change will require new ways of deriving and applying biodiversity data. A major challenge is that biodiversity data and knowledge are scattered, biased, collected with numerous methods, and stored in inconsistent ways. The Group on Earth Observations Biodiversity Observation Network (GEO BON) has developed the Essential Biodiversity Variables (EBVs) as fundamental metrics to help aggregate, harmonize, and interpret biodiversity observation data from diverse sources. Mapping and analyzing EBVs can help to evaluate how aspects of biodiversity are distributed geographically and how they change over time. EBVs are also intended to serve as inputs and validation to forecast the status and trends of biodiversity, and to support policy and decision making. Here, we assess the feasibility of implementing Genetic Composition EBVs (Genetic EBVs), which are metrics of within-species genetic variation. We review and bring together numerous areas of the field of genetics and evaluate how each contributes to global and regional genetic biodiversity monitoring with respect to theory, sampling logistics, metadata, archiving, data aggregation, modeling, and technological advances. We propose four Genetic EBVs: (i) Genetic Diversity; (ii) Genetic Differentiation; (iii) Inbreeding; and (iv) Effective Population Size (Ne ). We rank Genetic EBVs according to their relevance, sensitivity to change, generalizability, scalability, feasibility and data availability. We outline the workflow for generating genetic data underlying the Genetic EBVs, and review advances and needs in archiving genetic composition data and metadata. We discuss how Genetic EBVs can be operationalized by visualizing EBVs in space and time across species and by forecasting Genetic EBVs beyond current observations using various modeling approaches. Our review then explores challenges of aggregation, standardization, and costs of operationalizing the Genetic EBVs, as well as future directions and opportunities to maximize their uptake globally in research and policy. The collection, annotation, and availability of genetic data has made major advances in the past decade, each of which contributes to the practical and standardized framework for large-scale genetic observation reporting. Rapid advances in DNA sequencing technology present new opportunities, but also challenges for operationalizing Genetic EBVs for biodiversity monitoring regionally and globally. With these advances, genetic composition monitoring is starting to be integrated into global conservation policy, which can help support the foundation of all biodiversity and species' long-term persistence in the face of environmental change. We conclude with a summary of concrete steps for researchers and policy makers for advancing operationalization of Genetic EBVs. The technical and analytical foundations of Genetic EBVs are well developed, and conservation practitioners should anticipate their increasing application as efforts emerge to scale up genetic biodiversity monitoring regionally and globally.Entities:
Keywords: biodiversity monitoring; environmental policy; indicators; interoperability; metadata; molecular ecology
Mesh:
Year: 2022 PMID: 35415952 PMCID: PMC9545166 DOI: 10.1111/brv.12852
Source DB: PubMed Journal: Biol Rev Camb Philos Soc ISSN: 0006-3231
Fig. 1The four Genetic Essential Biodiversity Variables (EBVs; bullet points) are indicated below each level of biological organization (Species, Populations, Individuals) for which they can be calculated. The species level corresponds to the combined genetic diversity for the species. The population level pie charts reflect the relative population sizes and the proportion of genotypes in each population (i.e. population genetic structure resulting from gene flow and migration). The smallest circles represent unique individuals with the colors depicting their genotypes.
The four proposed Genetic Composition Essential Biodiversity Variables (EBVs), their definitions, possible values, interpretation, and a ranking from low (+) to high (+++) on various EBV criteria (rows)
| Genetic EBV | Genetic Diversity | Genetic Differentiation | Inbreeding | Effective Population Size ( | ||
|---|---|---|---|---|---|---|
| Number of genetic units | Differentiation between units | |||||
| Richness | Evenness | |||||
| Definition | Count of the number of alleles in a population | Expected proportion of heterozygotes in a population at equilibrium | Number of genetic lineages/units within a species | Degree of genetic differentiation among populations or units | Degree of relatedness between pairs of individuals, mating among relatives, or identity by descent | Size of an ideal population that loses genetic variation at the same rate as the focal population |
| Examples | Allelic richness, nucleotide diversity (π) |
| Number of distinct evolutionarily significant units or management units |
|
| LD‐based |
| Range of values | 0 to 1 | 0 to 1 | ≥1 | 0 to 1 | ‐1 to 1 | Usually smaller than census population size |
| Level of organization | Population | Individual/population | Across species range or specific region | Across species range or specific region | Individual (pairs or families)/population | Population |
| Interpretation of a change identified across loci | Decline indicates loss of adaptive capacity | Decline indicates loss of adaptive capacity or increased levels of inbreeding | Loss of independent evolutionary or demographic units (probable loss of adaptations and long‐term persistence) | Increased differentiation indicates more fragmentation (reduced gene flow) between units; decreased differentiation indicates homogenization | Increased inbreeding increases the chance of expression of deleterious alleles, reducing fitness | Lower |
| Relevance to species status | +++ | ++ | +++ | ++ | +++ | +++ |
| Sensitivity to change over time | +++ | + | +++ | ++ (depends on life history and | +++ (depends on method) | ++ |
| Spatial scalability | +++ | + | +++ | ++ | + | + |
| Feasibility | +++ | +++ | +++ | +++ | ++ (depends on measure – | +/++ (depends on method – single sample is more feasible |
| Data available | +++ | +++ | +++ | +++ | + | + |
H e = expected heterozygosity under Hardy–Weinberg assumptions, and H = observed heterozygosity, yielding the probability of randomly drawing two different alleles from the population. F ST = genetic differentiation (fixation index) measured as the proportion of the total genetic variance contained in a subpopulation relative to the total genetic variance. G ST = genetic differentiation of subpopulations relative to the total population, a generalized index of F ST for loci with two or more alleles. D est = unbiased genetic differentiation among subpopulations. F IS = genetic differentiation of individuals relative to the subpopulation, reflecting deviations from random mating within subpopulations (inbreeding). F ROH = inbreeding by autozygosity estimated from runs of homozygosity, i.e. the proportion of the autosomal genome above a specified length. F H = the increase in individual homozygosity relative to mean Hardy–Weinberg expected homozygosity. t m = the multilocus outcrossing rate, which is 1 – the self‐fertilization rate. t s = the degree of outcrossing among unrelated individuals only. t m–t s = the estimated rate of biparental inbreeding. LD, linkage disequilibrium.
See Appendix SI for discussion of alternative spatial scalability ranking. Note that each EBV can be represented by numerous statistics which have individual calculations (Leroy et al., 2018; see Appendix S1).
Fig. 2A sample of four diploid individuals from a population, with various representations of genetic composition data structures. The workflow process includes genetic sequencing, aligning the sequences from each individual, and polymorphic loci identification. The data from the polymorphic sites (single nucleotide polymorphisms; SNPs) in the sequence can be summarized as a matrix of genotypes for each locus (L1–L4). When these loci are bi‐allelic SNPs, the data can be summarized as the Minor Allele Count – the number of occurrences of the least frequent allele at that locus, a convenient summary format for certain statistics and models. A Genetic Diversity EBV for evenness, such as observed heterozygosity (H o), can also be summarized for each individual (rows), or by locus (columns), as illustrated for the SNP matrix. Note that some measures of diversity include invariant sites which are calculated from sequence alignments, not a matrix of SNP genotypes.
Fig. 3The four Genetic Composition Essential Biodiversity Variables (EBVs). Green background shading indicates the preferred genetic state (high or low levels) in many conservation/management situations. The preferred state for genetic differentiation is context dependent, represented by a lighter shade of green (see text). Distance of genetic units illustrates high genetic distance (black versus white), and low genetic distance (dark gray versus light gray). The contemporary effective population size (N e) is represented with black (breeding) and gray (non‐breeding) individuals and the graphs denote projections after the present time (p) of the future losses of genetic diversity.
Fig. 4Steps in generating Genetic Composition Essential Biodiversity Variables (EBVs) include field (or archive) collection of DNA, laboratory work, computational processing of raw data, analysis/calculation, publishing, archiving, modeling and/or synthesis and communication to inform management decisions. GEOME, Genomic Observatories MetaDatabase; Pop, population; QA/QC, quality assurance/quality control.
Fig. 5Sources of DNA for genetic analyses. Genetic material may be obtained directly from tissue samples from extant populations (biopsy or non‐invasive), biological collections (e.g. museums), or sub‐fossils (sometimes called ancient DNA) or from the environment (i.e. environmental DNA, eDNA). Older samples may have low‐quality and low‐quantity DNA, restricting the use of certain Genetic Essential Biodiversity Variables (EBVs); eDNA is challenging to use for Genetic EBVs since the DNA is typically of lower quality and quantity. These examples indicate information typically assessed with these types of data, and do not represent all possibilities. N e, effective population size.
Examples of three broad categories of large‐scale studies that establish a foundation to operationalize Essential Biodiversity Variables (EBVs): taxa examined, data source, EBVs compiled, size of data set, and DNA marker type (mtDNA, mitochondrial DNA; microsat, microsatellite; AFLP, amplified fragment length polymorphism). Category 1 = large scale spatial patterns; Category 2 = temporal change; Category 3 = quantitative relationship between driver and Genetic EBV response
| Study | Category | Taxa | Data source | Genetic EBV | Number of species | Marker type |
|---|---|---|---|---|---|---|
| Manel | 1 | Freshwater and marine fishes | GenBank | Genetic Diversity: richness | 5426 | mtDNA |
| Lawrence | 1 | Terrestrial vertebrates | From literature | Genetic Diversity: richness and evenness; Genetic Differentiation: differentiation between units | 897 | Microsat |
| De Kort | 1 | Animals and plants | From literature | Genetic Diversity: evenness | 727 | Microsat and AFLP |
| Leigh | 2 | Animals | From literature | Genetic Diversity: richness and evenness | 91 | Predominantly microsat |
| Jordan | 2 | Plants | From literature | Genetic Diversity: richness and evenness | 48 | Various |
| Pinsky & Palumbi ( | 3 | Fish | From literature | Genetic Diversity: richness and evenness | 140 | Microsat |
| Breed | 3 | Plants | From literature | Inbreeding | 40 | Predominantly microsat |