| Literature DB >> 33604669 |
Margarida Gonçalves1,2, Hans R Siegismund3, Bettine Jansen van Vuuren4, Klaus-Peter Koepfli5,6, Nuno Ferrand1,2,4, Raquel Godinho1,2,4.
Abstract
Roan antelope (Hippotragus equinus) is the second-largest member of the Hippotraginae (Bovidae), and is widely distributed across sub-Saharan mesic woodlands. Despite being listed as "Least Concern" across its African range, population numbers are decreasing with many regional Red List statuses varying between Endangered and Locally Extinct. Although the roan antelope has become an economically-important game species in Southern Africa, the vast majority of wild populations are found only in fragmented protected areas, which is of conservation concern. Genomic information is crucial in devising optimal management plans. To this end, we report here the first de novo assembly and annotation of the whole-genome sequence of a male roan antelope from a captive-breeding program. Additionally, we uncover single-nucleotide variants (SNVs) through re-sequencing of five wild individuals representing five of the six described subspecies. We used 10X Genomics Chromium chemistry to produce a draft genome of 2.56 Gb consisting of 16,880 scaffolds with N50 = 8.42 Mb and a BUSCO completeness of 91.2%. The draft roan genome includes 1.1 Gbp (42.2%) repetitive sequences. De novo annotation identified 20,518 protein-coding genes. Genome synteny to the domestic cow showed an average identity of 92.7%. Re-sequencing of five wild individuals to an average sequencing depth of 9.8x resulted in the identification of a filtered set of 3.4x106 bi-allelic SNVs. The proportion of alternative homozygous SNVs for the individuals representing different subspecies, as well as differentiation as measured by PCA, were consistent with expected divergence from the reference genome and among samples. The roan antelope genome is a valuable resource for evolutionary and population genomic questions, as well as management and conservation actions.Entities:
Keywords: zzm321990 Hippotragus equinuszzm321990 ; zzm321990 de novo genome assembly; 10X Genomics Chromium; Bovidae; genomic diversity; roan antelope
Mesh:
Year: 2021 PMID: 33604669 PMCID: PMC8022987 DOI: 10.1093/g3journal/jkab002
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Roan antelope distribution and sampling. (A) Roan antelope (Hippotragus equinus). Photo by Raquel Godinho; (B) Geographic distribution of roan antelope subspecies according to Ansell (1972) and current species distribution, following IUCN SSC Antelope Specialist Group (2017). The geographic location of the five samples used in this study for whole-genome re-sequencing is represented by dark dots labelled with sample identifications (as in Table 1).
Roan antelope native samples used for whole-genome re-sequencing
| Sample ID | Original label ID | Subspecies | Year | Sex | Locality | N PCR cycles | Mapping (%) | Mean coverage |
|---|---|---|---|---|---|---|---|---|
| He80 | 7324 |
| 1998 | F | Kablima, Ghana | 8 | 99.8 | 5.2x |
| HeNI226 | NN.252 |
| 1925 | M | Nana Barya Reserve, CAR | 9 | 99.7 | 5.9x |
| He108 | 9293 |
| ― | M | Akobe septum, Ethiopia | 8 | 99.8 | 11.7x |
| He95 | 8250 |
| 1997 | M | Maswa Game Reserve, Tanzania | 10 | 99.7 | 12.2x |
| HeNI207 | SWA.74 |
| 1937 | F | Cubango river, Angola | 7 | 99.7 | 14.1x |
Contemporary samples from tissue collection of the University of Copenhagen, Department of Biology (responsible: H.R. Siegismund).
Historic samples donated by the Powell-Cotton Museum, Kent, UK; CAR—Central African Republic.
Assembly statistics based on gene completeness scores by BUSCO v.3.0.2 for the whole-genome assembly of roan antelope
| BUSCOs | Total number | Percentage |
|---|---|---|
| Complete | 3,743 | 91.2 |
| Complete and single-copy | 3,677 | 89.6 |
| Complete and duplicated | 66 | 1.6 |
| Fragmented | 194 | 4.7 |
| Missing | 167 | 4.1 |
BUSCO scores using the Mammalia OrthoDB v9 gene set, containing 4,104 genes.
Summary of the repetitive content found in roan antelope genome assembly, using RepeatMasker
| Number | Length occupied (bp) | Percent masked | |
|---|---|---|---|
| SINEs | 222,376 | 27,885,908 | 1.1 |
| LINEs | 2,731,480 | 799,087,834 | 30.4 |
| LTR elements | 496,668 | 106,731,053 | 4.0 |
| DNA elements | 377,402 | 54,829,227 | 2.1 |
| Unclassified | 413,329 | 65,638,505 | 2.5 |
| Total interspersed repeats | 1,054,172,527 | 40.1 | |
| Small RNA | 298,364 | 38,764,415 | 1.4 |
| Simple repeats | 470,855 | 18,232,272 | 0.6 |
| Low complexity | 80,149 | 3,946,798 | 0.1 |
| Total repeats | 1,106,510,938 | 42.2 |
Figure 2Synteny blocks between roan antelope and the domestic cow genomes. Synteny between roan scaffolds larger than 10 Kb and the domestic cow chromosomes (29 autosomes and the X). Scaffolds were ordered to obtain the best pairwise alignment blocks between the two genomes which are visualized using colored lines.
Figure 3Intraspecific variation in roan antelope. (A) Bar chart comparing the number of quality-filtered alternative homozygous and heterozygous single-nucleotide variants (millions of SNVs) among the five re-sequenced wild individuals. (B) Plot of principal component analysis (PCA) for the same individuals. PC1 and PC2 show the first and second axes with corresponding percentage of explained variance. Circles identify each individual and colors correspond to the respective subspecies, following Figure 1.