| Literature DB >> 35698034 |
Verena E Kutschera1, Marcin Kierczak2, Tom van der Valk2, Johanna von Seth3,4,5, Nicolas Dussex3,4,5, Edana Lord3,4,5, Marianne Dehasque3,4,5, David W G Stanton3,4, Payam Emami Khoonsari6, Björn Nystedt2, Love Dalén3,4,5, David Díez-Del-Molino7,8,9.
Abstract
BACKGROUND: Many wild species have suffered drastic population size declines over the past centuries, which have led to 'genomic erosion' processes characterized by reduced genetic diversity, increased inbreeding, and accumulation of harmful mutations. Yet, genomic erosion estimates of modern-day populations often lack concordance with dwindling population sizes and conservation status of threatened species. One way to directly quantify the genomic consequences of population declines is to compare genome-wide data from pre-decline museum samples and modern samples. However, doing so requires computational data processing and analysis tools specifically adapted to comparative analyses of degraded, ancient or historical, DNA data with modern DNA data as well as personnel trained to perform such analyses.Entities:
Keywords: Ancient DNA; Bioinformatics; Conservation genomics; Endangered species; Reproducibility; Snakemake; Temporal genomic erosion; Whole genome re-sequencing data
Mesh:
Year: 2022 PMID: 35698034 PMCID: PMC9195343 DOI: 10.1186/s12859-022-04757-0
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1Overview of the GenErode pipeline data processing tracks. Input and output file formats, dependencies between steps, and main software used are shown. Optional steps are highlighted in red
Fig. 2Overview of the GenErode pipeline data analysis tracks and final report. Input file formats and main software used are shown
Fig. 3Results from the downstream analyses for historical and modern Sumatran rhinoceros samples from the test dataset. a Principal component analysis (PCA) of historical and modern Sumatran rhinoceros samples using the Sumatran rhinoceros reference. Historical samples: JvS008, JvS009, JvS022. Modern samples: JvS033, JvS034, JvS035. See “PCA” in Additional file 3. b Maximum likelihood estimates of genome-wide heterozygosity (θ) and 95% confidence intervals from mlRho in historical and modern Sumatran rhinoceros samples using the Sumatran rhinoceros reference. See “mlRho” in Additional file 3. c Inbreeding coefficient estimated as the proportion of the genome in runs of homozygosity (F) for ROH of length > = 2 Mb in historical and modern Sumatran rhinoceros samples. Parameter settings in the configuration file: homozyg-snp: 25; homozyg-kb: 100; homozyg-window-snp: 250; homozyg-window-het: 3; homozyg-window-missing: 15; homozyg-het: 750. See “ROH” in Additional file 3. d Number of variants in each impact category in historical and modern Sumatran rhinoceros samples as estimated by snpEff and using the white rhinoceros reference with 253 protein-coding gene annotations. Note that none of the samples contained any high impact variants which are therefore not shown. See “snpEff” in Additional file 2. e Relative mutational load estimated from GERP scores for historical and modern Sumatran rhinoceros samples using the white rhinoceros reference. GERP scores were estimated using 30 outgroup species from across the tree of mammals. Only derived alleles from sites with GERP scores above the 99th percentile were included (corresponding to GERP > 0.00048928). See “GERP” in Additional file 2