| Literature DB >> 36199020 |
Alexandra Carrier1,2,3, Julien Prunier4, William Poisson1,2,3, Mallorie Trottier-Lavoie1,2,3, Isabelle Gilbert1,2,3, Maria Cavedon5, Kisun Pokharel6, Juha Kantanen6, Marco Musiani7, Steeve D Côté8, Vicky Albert9, Joëlle Taillon9, Vincent Bourret9, Arnaud Droit4, Claude Robert10,11,12.
Abstract
BACKGROUND: Development of large single nucleotide polymorphism (SNP) arrays can make genomic data promptly available for conservation problematic. Medium and high-density panels can be designed with sufficient coverage to offer a genome-wide perspective and the generated genotypes can be used to assess different genetic metrics related to population structure, relatedness, or inbreeding. SNP genotyping could also permit sexing samples with unknown associated metadata as it is often the case when using non-invasive sampling methods favored for endangered species. Genome sequencing of wild species provides the necessary information to design such SNP arrays. We report here the development of a SNP-array for endangered Rangifer tarandus using a multi-platform sequencing approach from animals found in diverse populations representing the entire circumpolar distribution of the species.Entities:
Keywords: Genomics; Genotyping; Next generation sequencing; Rangifer tarandus; SNP chip
Mesh:
Year: 2022 PMID: 36199020 PMCID: PMC9533608 DOI: 10.1186/s12864-022-08899-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Sequencing approaches and SNP yields in this study
| Description | WGS 30X - | WGS 5X - EastCan | WGS 5X - West Can | WGS 5X - EUR | GBS-QC |
|---|---|---|---|---|---|
| Number of samples | 20 | 182 | 10 | 10 | 672 |
| Tissues | Ear punches | Ear punches and hairs | Ear punches | Ear punches and hairs | Ear punches and hairs |
| Samples population | N-E Canada | N-E Canada | N-W Canada | Eurasia | N-E Canada |
| Libraries | Shotgun DNA library | Shotgun DNA library | Shotgun DNA library | Shotgun DNA library | Two-enzyme protocol (Pst1/Msp1) |
| Illumina sequencer | HiSeqX | NovaSeq 6000 | HiSeq 4000 | NovaSeq 6000 | NovaSeq 6000 |
| Average number of reads per sample (M) | 479.9 | 43.8 | 60.1 | 47.0 | 3.7 |
| Read length | PE 150 | PE 150 | PE 150 | PE150 | PE 150 |
| N raw SNPs | 22914.4k | 29826.9k | 6119.8k | 14675.6k | 1061.5k |
| N high quality SNPs | 6748k | 139.4k | 177.7k | 3568.3k | 16.4k |
Fig. 1SNP discovery flowchart; including sampling, sequencing technologies, computer programs, and filtering criteria used for SNP discovery and integration into the genotyping array. 1Eura: Eurasian; 2GBS: Genotyping-By-Sequencing; maf: minor allele frequency; HQ: high-quality
Fig. 2Geographic distribution of Caribou sampled in the East Canada. Ranges for the migratory ecotype are based on recent data (2019) whereas Mountain and Boreal ranges are based on data collected from previous years. The assignment of individuals to an ecotype was ensured by telemetry monitoring. TRAF: “Rivière-aux-feuilles” herd. TRG: “Rivière-Georges” herd. Photo credit: Joëlle Taillon
Fig. 3Venn diagram for the number of SNPs discovered in the East Canada group with each technology. 5X: whole genome sequencing with a depth of 5X; 30X: whole genome sequencing with a depth of 30X; GBS: Genotyping-By-Sequencing; N indicates the sample size
Fig. 4Discriminant analysis of principal components (DAPC) showing (A) Boreal, Mountain and Migratory ecotypes, (B) the three boreal and the migratory genetic groups and (C) the three boreal forest populations. Inset indicates the number and contribution of axes from principal component analysis retained in the DA.
Average linkage disequilibrium within ecotypes and genetic groups from the Quebec province (Canada) for all SNPs.
| Ecotype/genetic group | Average-LD |
|---|---|
| Mountain_Gaspésie | 0.1710 |
| Boreal_Western Québec | 0.1099 |
| Boreal_Central Québec | 0.0996 |
| Boreal_Eastern Québec | 0.0397 |
| Migratory_Rivière George | 0.0579 |
| Migratory_Rivière aux Feuilles | 0.0260 |
List of Supplementary data
S1: Details for each step of the stacks workflow
S2: List of SNPs positioned on the X chromosome
S3: List of IDs of faulty probes that may yet be found in the resulting vcf
Fig. 5Sexing samples using the SNP chip. (A) Distribution of observed heterozygosity over the 1438 SNPs from the X chromosome; the left peak (max = 0.005) represents the homozygous genotypes from males (in blue) while the right peak (max = 0.33) represents the more heterozygous genotypes from females (in red). (B) Combining the extent of the chromosome X heterozygosity with call-rates of the SNPs in the SRY gene (absent from female genome); the cluster on the bottom left represents the males where a positive call was generated for all 6 probes while the upper cluster represents the females with complete missing genotypes