| Literature DB >> 30346553 |
Ellie E Armstrong1, Ryan W Taylor1, Stefan Prost1,2, Peter Blinston3, Esther van der Meer3, Hillary Madzikanda3, Olivia Mufute4, Roseline Mandisodza-Chikerema4, John Stuelpnagel5, Claudio Sillero-Zubiri6, Dmitri Petrov1.
Abstract
Background: A high-quality reference genome assembly is a valuable tool for the study of non-model organisms. Genomic techniques can provide important insights about past population sizes and local adaptation and can aid in the development of breeding management plans. This information is important for fields such as conservation genetics, where endangered species require critical and immediate attention. However, funding for genomic-based methods can be sparse for conservation projects, as costs for general species management can consume budgets. Findings: Here, we report the generation of high-quality reference genomes for the African wild dog (Lycaon pictus) at a low cost (<$3000), thereby facilitating future studies of this endangered canid. We generated assemblies for three individuals using the linked-read 10x Genomics Chromium system. The most continuous assembly had a scaffold and contig N50 of 21 Mb and 83 Kb, respectively, and completely reconstructed 95% of a set of conserved mammalian genes. Additionally, we estimate the heterozygosity and demographic history of African wild dogs, revealing that although they have historically low effective population sizes, heterozygosity remains high. Conclusions: We show that 10x Genomics Chromium data can be used to effectively generate high-quality genomes from Illumina short-read data of intermediate coverage (∼25x-50x). Interestingly, the wild dog shows higher heterozygosity than other species of conservation concern, possibly due to its behavioral ecology. The availability of reference genomes for non-model organisms will facilitate better genetic monitoring of threatened species such as the African wild dog and help conservationists to better understand the ecology and adaptability of those species in a changing environment.Entities:
Mesh:
Year: 2019 PMID: 30346553 PMCID: PMC6350039 DOI: 10.1093/gigascience/giy124
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:(A) Pack of African wild dogs. (B) Shared heterozygous sites between the three de novo assemblies (calculated using a posterior cutoff of 0.99). More of the heterozygous sites are shared between the two sisters than between either sister and Eureka. (C) Reconstruction of the individuals’ demographic history using the PSMC program. Bootstrap replicates are plotted in lighter colors. Time is in years before present.
Assembly statistics.
| Sister 1 | Sister 2 | Eureka | ||
|---|---|---|---|---|
| Input | Reads (m) | 1200 | 801.56 | 427.6 |
| Average coverage | 69 | 46 | 25 | |
| Mean molecule size (kb) | 19.91 | 77.03 | 52.00 | |
| Contig | N50 (kb) | 61.34 | 83.47 | 50.15 |
| Longest (kb) | 524.60 | 615.40 | 450.50 | |
| Number (k) | 78.62 | 68.64 | 108.00 | |
| Scaffold | N50 (mb) | 7.91 | 21.34 | 15.31 |
| Longest (mb) | 43.96 | 69.63 | 41.67 | |
| Number (k) | 11.78 | 17.64 | 25.78 | |
| Total size (gb) | Scaffolds ≥ 10 kb | 2.27 | 2.26 | 2.20 |
| Scaffolds ≥ 500 bp | 2.34 | 2.40 | 2.42 |
Assembly statistics for the three African wild dog genomes reported by the Supernova assembler. Coverage was assessed using SAMtools depth.
Conserved gene statistics.
| Assembly | Species | Complete | Single copy | Duplicated | Fragmented | Missing | Total searched |
|---|---|---|---|---|---|---|---|
| Sister 1 |
| 3,914 | 3,875 | 39 | 102 | 88 | 4,104 |
| Sister 2 |
| 3,903 | 3,845 | 58 | 107 | 94 | 4,104 |
| Eureka |
| 3,829 | 3,789 | 40 | 169 | 106 | 4,104 |
| canFam3.1 |
| 3,910 | 3,857 | 53 | 98 | 96 | 4,104 |
| Kenya |
| 3,849 | 3,823 | 26 | 136 | 119 | 4,104 |
| South Africa |
| 3,892 | 3,867 | 25 | 104 | 108 | 4,104 |
| Wolf |
| 3,890 | 3,849 | 41 | 110 | 104 | 4,104 |
| Hawaiian monk seal |
| 3,881 | 3,833 | 48 | 118 | 105 | 4,104 |
Results of the BUSCO v2 gene annotation from three African wild dog genome assemblies, canFam3.1, low-coverage wild dog genomes [21], the recently published wolf genome [36], and the Hawaiian monk seal genome [37].