| Literature DB >> 35430878 |
Megan Aylward1, Vinay Sagar1, Meghana Natesh2, Uma Ramakrishnan1,3.
Abstract
Unprecedented advances in sequencing technology in the past decade allow a better understanding of genetic variation and its partitioning in natural populations. Such inference is critical to conservation: to understand species biology and identify isolated populations. We review empirical population genetics studies of Endangered Bengal tigers within India, where 60-70% of wild tigers live. We assess how changes in marker type and sampling strategy have impacted inferences by reviewing past studies, and presenting three novel analyses including a single-nucleotide polymorphism (SNP) panel, genome-wide SNP markers, and a whole-mitochondrial genome network. At a broad spatial scale, less than 100 SNPs revealed the same patterns of population clustering as whole genomes (with the exception of one additional population sampled only in the SNP panel). Mitochondrial DNA indicates a strong structure between the northeast and other regions. Two studies with more populations sampled revealed further substructure within Central India. Overall, the comparison of studies with varied marker types and sample sets allows more rigorous inference of population structure. Yet sampling of some populations is limited across all studies, and these should be the focus of future sampling efforts. We discuss challenges in our understanding of population structure, and how to further address relevant questions in conservation genetics. This article is part of the theme issue 'Celebrating 50 years since Lewontin's apportionment of human diversity'.Entities:
Keywords: SNPs; conservation; endangered species; genetic markers; genomes; tiger
Mesh:
Year: 2022 PMID: 35430878 PMCID: PMC9014192 DOI: 10.1098/rstb.2020.0418
Source DB: PubMed Journal: Philos Trans R Soc Lond B Biol Sci ISSN: 0962-8436 Impact factor: 6.671
Figure 1(a) Schematic showing the power to infer population structure as a function of sample size and the number of markers. (b) Actual numbers of markers (represented categorically) and sample size for a set of studies investigating population structure in species of large mammals (see electronic supplementary material, table for studies and species).
The empirical datasets reviewed in this study on population structure in Bengal tigers. We provide information on the marker types, sampling schemes and data sources used in the presented analyses. For nuclear DNA the number of loci corresponds to actual number of variants used, whereas for mitochondrial DNA number of loci is the total number of base pairs sequenced.
| DNA type | type of variants | number of loci | number of individuals | number populations sampled | data source | analysis |
|---|---|---|---|---|---|---|
| nuclear | SNP | 198 930 | 37 | 8 | Khan | this study |
| Armstrong | ||||||
| Liu | ||||||
| nuclear | SNP | 10 184 | 38 | 15 | Natesh | Natesh |
| nuclear | SNP | 81 | 155 | 24 | Sagar | this study |
| Khan | ||||||
| Armstrong | ||||||
| Liu | ||||||
| nuclear | microsatellite | 11 | 158 | 34 | Kolipakam | Kolipakam |
| nuclear | microsatellite | 8 | 56 | 28 | Mondol | Mondol |
| mitochondrial | sequence | 15435 bp | 42 | 4 | Khan | this study |
| Armstrong | ||||||
| Liu | ||||||
| Mt | sequence | 1200 bp | 57 | 27 | Mondol | Mondol |
| Mt | sequence | 932 bp | 77 | 13 | Sharma | Sharma |
Figure 2Mitogenome analysis using 42 individuals (dataset table 1, row 6) from four regional populations: CI (n = 13), NW (n = 14), SW (n = 11) and NE (n = 4). (a) Maximum-parsimony network for four regional populations of tigers using mitochondrial genomes minus the control region. Numbers on the network indicate the number of mutations between the nodes; lines without a number indicate only one mutation. Circle size is proportional to the number of individuals with that haplotype. (b) Pairwise Fst among all four populations CI, NW, SW and NE, calculated from mitogenomes using PopGenome in R, v. 3.4.2 [55,56].
Figure 3An overview of the nuclear DNA studies in Bengal tigers to date reveals population structure as inferred based on population structure analysis of these populations at optimal K. We include three previously published studies (table 1, rows 2, 4, and 5) [12,38,39] and the two nuclear marker analyses in this paper. We show this in the context of assumed differences in power to detect structure based on the sampling scheme (number of markers and number of individuals) and number of areas sampled as the number of protected areas.
Figure 4Population structure across India, which can be classified into six regions; northwest (NW), southwest (SW), Central India (CI), north (NOR), northeast (NE) and Sundarbans (SU); (a) shows sample locations of the two different datasets (table 1 row 1 and 3); whole-genome data from [33–36,38,39,44] and SNP data from [10]. (b) Population structure based on genome-wide SNPs from whole-genome sequence data. Results are shown for K = 2–6 for 37 individuals from eight protected areas that represent six regions (NW, n = 10, SW, n = 9, CI, n = 10, NOR, n = 3, NE, n = 3, SU, n = 2). (c) Population structure based on a panel of 81 SNP variants for 175 individuals from 24 protected areas (classified as tiger reserves), which represent six regions (NW, n = 21, SI, n = 52, CI, n = 78, NOR, n = 5, NE, n = 7, SU, n = 12). Results are shown for K = 2–7. Preferred K (based on the Evanno method [45]) is shown in red and is K = 3 for both structure plots. Labels above structure plots correspond to the geographic region, whereas labels below the structure plots are the protected areas where samples were collected. Specific protected area labels allow comparison of which of these overlap between the two datasets.