| Literature DB >> 32051450 |
Ye Yin1,2, Ting Yang3,4, Huan Liu5,3,6, Ziheng Huang3,7, Yaolei Zhang3,7, Yue Song3,7, Wenliang Wang3,7, Xuanmin Guang3, Sunil Kumar Sahu3,7,6, Karsten Kristiansen8,9.
Abstract
Mandrill (Mandrillus sphinx) is a primate species, which belongs to the Old World monkey (Cercopithecidae) family. It is closely related to human, serving as a model for human health related research. However, the genetic studies on and genomic resources of mandrill are limited, especially in comparison to other primate species. Here we produced 284 Gb data, providing 96-fold coverage (considering the estimated genome size of 2.9 Gb), to construct a reference genome for the mandrill. The assembled draft genome was 2.79 Gb with contig N50 of 20.48 Kb and scaffold N50 of 3.56 Mb. We annotated the mandrill genome to find 43.83% repeat elements, as well as 21,906 protein-coding genes. The draft genome was of good quality with 98% gene annotation coverage by Benchmarking Universal Single-Copy Orthologs (BUSCO). Based on comparative genomic analyses of the Major Histocompatibility Complex (MHC) of the immune system in mandrill and human, we found that 17 genes in the mandrill that have been associated with disease phenotypes in human such as Lung cancer, cranial volume and asthma, barbored amino acids changing mutations. Gene family analyses revealed expansion of several genes, and several genes associated with stress environmental adaptation and innate immunity responses exhibited signatures of positive selection. In summary, we established the first draft genome of the mandrill of value for studies on evolution and human health.Entities:
Mesh:
Year: 2020 PMID: 32051450 PMCID: PMC7016171 DOI: 10.1038/s41598-020-59110-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Published primate genome sequences. (modified based on[37]).
| Common name | Species name | Bases in contigs | Contig N50 | Scaffold N50 | Reference |
|---|---|---|---|---|---|
| Chimpanzee | 2.7 Gb | 15.7 kb | 8.6 Mb | [ | |
| Chimpanzee (updated) | 2.9 Gb | 50.7 kb | 8.9 Mb | [ | |
| Bonobo | 2.7 Gb | 67 kb | 9.6 Mb | [ | |
| Gorilla | 2.7 Gb | 11.8 kb | 914 kb | [ | |
| Gorilla (updated) | 2.8 Gb | 9.6 Mb | 23.1 Mb | [ | |
| Orang-utan | 3.1 Gb | 15.5 kb | 739 kb | [ | |
| Indian rhesus macaque | 2.9 Gb | 25.7 kb | 24.3 Mb | [ | |
| Indian rhesus macaque (updated) | 3.1 Gb | 107.2 kb | 4.2 Mb | ||
| Chinese rhesus macaque | 2.8 Gb | 11.9 kb | 891 kb | [ | |
| Vietnamese cynomolgus macaque | 2.9 Gb | 12.5 kb | 652 kb | [ | |
| baboons | Papio | 2.9 Gb | 149.87 kb | 140.35 Mb | [ |
| Aye-aye | 3.0 Gb | NA | 13.6 kb | [ | |
| Vervet | 2.8 Gb | 90.4 kb | 81.8 Mb | [ | |
| Gibbon | 2.8 Gb | 35.1 kb | 22.7 Mb | [ | |
| Marmoset | 2.3 Gb | 29 kb | 6.7 Mb | [ | |
| Mouse lemur | 2.4 Gb | 210.7 kb | 108.2 Mb | ||
| Pig-tailed macaque | 2.8 Gb | 106.9 kb | 15.2 Mb | ||
| Sifaka | 2.1 Gb | 28.1 kb | 5.6 Mb | ||
| Sooty mangabey | 2.8 Gb | 112.9 kb | 12.8 Mb | ||
| Squirrel monkey | 2.5 Gb | 38.8 kb | 18.7 Mb | ||
| Bushbaby | 2.4 Gb | 27.1 kb | 13.9 Mb | ||
| Mouse lemur | 2.4 Gb | 182.9 kb | 3.7 Mb | ||
| Tarsier | 3.4 Gb | 38.2 kb | 401 Mb | [ |
Summary of the mandrill genome assembly.
| Contig | Scaffold | |||
|---|---|---|---|---|
| Size (bp) | Number | Size (bp) | Number | |
| N90 | 5,266 | 141,475 | 638,217 | 936 |
| N80 | 9,025 | 101,618 | 1,303,160 | 634 |
| N70 | 12,638 | 75,505 | 1,962,294 | 457 |
| N60 | 16,336 | 56,061 | 2,730,696 | 332 |
| N50 | 20,483 | 40,751 | 3,564,730 | 241 |
| Longest | 211,017 | – | 19,105,867 | – |
| Total size | 2,798,997,503 | – | 2,882,689,325 | – |
| Total number (> = 100 bp) | – | 455,069 | – | 215,140 |
| Total number (> = 2 kb) | – | 194,923 | – | 4,742 |
Figure 1Phylogeny and evolution of the mandrill’s gene families. (a) The Venn diagram of the gene families of human (Homo sapiens), macaque (Macaca mulatta), gorilla (Gorilla gorilla), marmoset (Callithrix jacchus) and mandrill. (b) Comparison of orthologous genes among 11 primates and mouse. (c) The maximum-likelihood phylogenetic tree based on the 4-fold degenerate sites of 5,133 single-copy gene families in the 12 species.
Figure 2The demographic changes of mandrill compared to human. The x-axis indicates the time (from left to right indicates recent to ancient), and the y-axis indicates the estimated effective population size.
Figure 3Synteny of MHC class I region between mandrill and human. Red boxes indicate the MHC genes. Green boxes indicate other protein-coding genes. Yellow and green indicate deletions and insertions in mandrill, respectively.
Genes with disease and its’ mutation in mandrill.
| Gene name | Full name | Position | Wildtype AA | Mutation AA | Disease Description |
|---|---|---|---|---|---|
| Aminolevulinate Dehydratase | 59 | K | N | Amyotrophic lateral sclerosis | |
| Class II Major Histocompatibility Complex Transactivator | 500 | G | A | Multiple sclerosis | |
| Crumbs Cell Polarity Complex Component 1 | 959 | G | S | Retinitis pigmentosa | |
| Interleukin 4 Receptor | 75 | I | V | Asthma, atopic | |
| Microcephalin 1 | 761 | A | V | Cranial volume | |
| NPHS2 Stomatin Family Member, Podocin | 192 | I | V | Nephrotic syndrome | |
| Tumor Protein P53 Binding Protein 1 | 353 | D | E | Lung cancer |