| Literature DB >> 24194868 |
Xin Ma1, Joanna L Kelley, Kirsten Eilertson, Shaila Musharoff, Jeremiah D Degenhardt, André L Martins, Tomas Vinar, Carolin Kosiol, Adam Siepel, Ryan N Gutenkunst, Carlos D Bustamante.
Abstract
To gain insights into evolutionary forces that have shaped the history of Bornean and Sumatran populations of orang-utans, we compare patterns of variation across more than 11 million single nucleotide polymorphisms found by previous mitochondrial and autosomal genome sequencing of 10 wild-caught orang-utans. Our analysis of the mitochondrial data yields a far more ancient split time between the two populations (~3.4 million years ago) than estimates based on autosomal data (0.4 million years ago), suggesting a complex speciation process with moderate levels of primarily male migration. We find that the distribution of selection coefficients consistent with the observed frequency spectrum of autosomal non-synonymous polymorphisms in orang-utans is similar to the distribution in humans. Our analysis indicates that 35% of genes have evolved under detectable negative selection. Overall, our findings suggest that purifying natural selection, genetic drift, and a complex demographic history are the dominant drivers of genome evolution for the two orang-utan populations.Entities:
Mesh:
Year: 2013 PMID: 24194868 PMCID: PMC3806739 DOI: 10.1371/journal.pone.0077175
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Phylogenetic tree of mtDNA HVRI region and allele sharing matrix of mtDNA and autosomes.
(A) The phylogenetic tree among human, chimpanzee, Bornean (red) and Sumatran (blue). The nodes shaded in cyan represent the grouping of the three Sumatran individuals that has the furthest distance to the rest clustering of the Sumatran group. The 10 re-sequenced individuals from our study are colored as red (Bornean) or blue (Sumatran). The corresponding geographic origins are shown on the right with the following annotation (SU: Sumatran; TU/SL: Tuanan or Sungai Lading; DV/K: Danum Valley or Kinabatangan; SA/GP: Sabangau or Gunung Palung; EK: East Kalimantan including Kutai national park or Sangatta; SAR/DS: Semongok Wildlife Rehabilitation Centre or Danau Sentarum). (B) The IBS allele sharing matrix among the 10 individuals from our study with the same group coloring scheme. The upper and lower triangular matrix represents the IBS sharing of all mtDNA variation loci (n = 1084) and autosomal variation loci (n = 11,866,619), respectively.
Figure 2SFS analysis.
(A) SFS for all genomic SNPs (Genome), synonymous (S) or non-synonymous (NS) SNPs in Bornean (Red) and Sumatran (Blue) population. (B) Two-dimensional SFS (Bornean on Y axis and Sumatran on X axis). Top row: Spectra for different functional categories of SNPs. To make all the spectra directly comparable, the non-coding and synonymous spectra have been re-scaled to represent the same number of segregating SNPs as the non-synonymous spectrum. Bottom row: residuals between pairs of spectra. Bottom-left compares synonymous and non-coding spectra, and bottom-right compares non-synonymous and synonymous spectra. Red and blue entries indicate, respectively, that the first spectrum has greater or fewer SNPs in that entry than the second.
Summary statistics of McDonald-Kreitman table entries for Bornean and Sumatran population using Human (hg19) as outgroup.
| Bornean (Modest Population Size Decline) | |||||
| Divergence | Divergence(%) | Polymorphism | Polymorphism(%) | Length(Mb) | |
| Syn | 50177 | 2.90 | 5302 | 0.31 | 1.73 |
| Non-Syn | 33701 | 0.67 | 4042 | 0.08 | 5.03 |
| Non-Syn/Syn divergence ratio: 22.97% | |||||
| Non-Syn/Syn polymorphism ratio: 26.22% | |||||
|
| |||||
|
|
|
|
|
| |
| Syn | 50011 | 2.89 | 7076 | 0.41 | 1.73 |
| Non-Syn | 33198 | 0.66 | 4847 | 0.09 | 5.03 |
| Non-Syn/Syn divergence ratio: 22.94% | |||||
| Non-Syn/Syn polymorphism ratio: 23.56% | |||||
This is total aligned length.
Figure 3Distribution of the proportion SNPs within each of the PolyPhen categories based on PolyPhen-2 result.
Error bars denote 95% confidence intervals on the proportion of SNPs in each category.
Figure 4Inferred cumulative distribution of selection coefficients.
Solid lines are for the full demographic model, dashed are for the model with no migration, and dotted are for the model with no growth. The break in the axes indicates that the scale is not continuous, because a logarithmic scale cannot span zero.
Percent of mutations within various selective classes for different models of the distribution of selection coefficients and different underlying demographic models.
| demographic scenario | selection model |
| −10−2< | −10−3< | −10−4< |
| full model | pt + lethal | 64 | 0 | 0 | 36 |
| expon + lethal | 64 | 0 | 7 | 29 | |
| norm + lethal | 64 | 0 | 0 | 36 | |
| gamma | 47 | 16 | 11 | 26 | |
| no migration | pt + lethal | 65 | 0 | 0 | 35 |
| expon + lethal | 65 | 0 | 5 | 30 | |
| norm + lethal | 57 | 0 | 27 | 16 | |
| gamma | 48 | 16 | 11 | 25 | |
| no growth | pt + lethal | 57 | 0 | 0 | 43 |
| expon + lethal | 54 | 0 | 19 | 27 | |
| norm + lethal | 56 | 0 | 16 | 28 | |
| gamma | 20 | 35 | 21 | 24 |
Genes under positive selection identified by SnIPRE.
| Gene under positive selection | ||
| Transcript ID | Description | Gene |
| knownGene.uc001vjy.1.1 | CDNA FLJ35379 fis, clone SKMUS2006481. | AX747676 |
| knownGene.uc002kni.1.1 | Homo sapiens cDNA FLJ38028 fis, clone CTONG2013222. | AK095347 |
| knownGene.uc002yvu.1.1 | Homo sapiens DSCR6a mRNA, complete cds. | DSCR6a |
| refGene.NM_000201.1 | intercellular adhesion molecule 1 precursor | ICAM1 |
| refGene.NM_002099.1 | glycophorin A precursor | GYPA |
| refGene.NM_002864.1 | pregnancy-zone protein | PZP |
| refGene.NM_004900.1 | apolipoprotein B mRNA editing enzyme, catalytic | APOBEC3B |
| refGene.NM_005397.1 | podocalyxin-like precursor isoform 2 | PODXL |
| refGene.NM_005547.1 | Involucrin | IVL |
| refGene.NM_014508.1 | apolipoprotein B mRNA editing enzyme, catalytic | APOBEC3C |
| refGene.NM_016190.1 | Cornulin | CRNN |
| refGene.NM_017716.1 | membrane-spanning 4-domains, subfamily A, member | MS4A12 |
| refGene.NM_023922.1 | taste receptor, type 2, member 14 | TAS2R14 |
| refGene.NM_030972.1 | zinc finger protein 611 | ZNF611 |
| refGene.NM_033004.1 | NLR family, pyrin domain containing 1 isoform 1 | NLRP1 |
| refGene.NM_033049.1 | mucin 13, epithelial transmembrane | MUC13 |
| refGene.NM_138337.1 | myeloid inhibitory C-type lectin-like receptor | CLEC12A |
| refGene.NM_172241.1 | cutaneous T-cell lymphoma-associated antigen 1 | CTAGE1 |
| vegaGene.OTTHUMT00000150638.1 | carcinoembryonic antigen-related cell adhesion molecule 3 | CEACAM3 |
| refGene.NM_001423.1 | epithelial membrane protein 1 | EMP1 |
| refGene.NM_001099733.1** | NULL | NULL |
| refGene.NM_001008743.1** | sulfotransferase family, cytosolic, 1C, member | SULT1C3 |
| knownGene.uc003byh.1.1** | TPRXL protein | TPRXL |
and ** denote genes identified as putative positively selected on orang-utan and human lineage, respectively, in previous study [22].