| Literature DB >> 21270892 |
Devin P Locke1, LaDeana W Hillier, Wesley C Warren, Kim C Worley, Lynne V Nazareth, Donna M Muzny, Shiaw-Pyng Yang, Zhengyuan Wang, Asif T Chinwalla, Pat Minx, Makedonka Mitreva, Lisa Cook, Kim D Delehaunty, Catrina Fronick, Heather Schmidt, Lucinda A Fulton, Robert S Fulton, Joanne O Nelson, Vincent Magrini, Craig Pohl, Tina A Graves, Chris Markovic, Andy Cree, Huyen H Dinh, Jennifer Hume, Christie L Kovar, Gerald R Fowler, Gerton Lunter, Stephen Meader, Andreas Heger, Chris P Ponting, Tomas Marques-Bonet, Can Alkan, Lin Chen, Ze Cheng, Jeffrey M Kidd, Evan E Eichler, Simon White, Stephen Searle, Albert J Vilella, Yuan Chen, Paul Flicek, Jian Ma, Brian Raney, Bernard Suh, Richard Burhans, Javier Herrero, David Haussler, Rui Faria, Olga Fernando, Fleur Darré, Domènec Farré, Elodie Gazave, Meritxell Oliva, Arcadi Navarro, Roberta Roberto, Oronzo Capozzi, Nicoletta Archidiacono, Giuliano Della Valle, Stefania Purgato, Mariano Rocchi, Miriam K Konkel, Jerilyn A Walker, Brygg Ullmer, Mark A Batzer, Arian F A Smit, Robert Hubley, Claudio Casola, Daniel R Schrider, Matthew W Hahn, Victor Quesada, Xose S Puente, Gonzalo R Ordoñez, Carlos López-Otín, Tomas Vinar, Brona Brejova, Aakrosh Ratan, Robert S Harris, Webb Miller, Carolin Kosiol, Heather A Lawson, Vikas Taliwal, André L Martins, Adam Siepel, Arindam Roychoudhury, Xin Ma, Jeremiah Degenhardt, Carlos D Bustamante, Ryan N Gutenkunst, Thomas Mailund, Julien Y Dutheil, Asger Hobolth, Mikkel H Schierup, Oliver A Ryder, Yuko Yoshinaga, Pieter J de Jong, George M Weinstock, Jeffrey Rogers, Elaine R Mardis, Richard A Gibbs, Richard K Wilson.
Abstract
'Orang-utan' is derived from a Malay term meaning 'man of the forest' and aptly describes the southeast Asian great apes native to Sumatra and Borneo. The orang-utan species, Pongo abelii (Sumatran) and Pongo pygmaeus (Bornean), are the most phylogenetically distant great apes from humans, thereby providing an informative perspective on hominid evolution. Here we present a Sumatran orang-utan draft genome assembly and short read sequence data from five Sumatran and five Bornean orang-utan genomes. Our analyses reveal that, compared to other primates, the orang-utan genome has many unique features. Structural evolution of the orang-utan genome has proceeded much more slowly than other great apes, evidenced by fewer rearrangements, less segmental duplication, a lower rate of gene family turnover and surprisingly quiescent Alu repeats, which have played a major role in restructuring other primate genomes. We also describe a primate polymorphic neocentromere, found in both Pongo species, emphasizing the gradual evolution of orang-utan genome structure. Orang-utans have extremely low energy usage for a eutherian mammal, far lower than their hominid relatives. Adding their genome to the repertoire of sequenced primates illuminates new signals of positive selection in several pathways including glycolipid metabolism. From the population perspective, both Pongo species are deeply diverse; however, Sumatran individuals possess greater diversity than their Bornean counterparts, and more species-specific variation. Our estimate of Bornean/Sumatran speciation time, 400,000 years ago, is more recent than most previous studies and underscores the complexity of the orang-utan speciation process. Despite a smaller modern census population size, the Sumatran effective population size (N(e)) expanded exponentially relative to the ancestral N(e) after the split, while Bornean N(e) declined over the same period. Overall, the resources and analyses presented here offer new opportunities in evolutionary genomics, insights into hominid biology, and an extensive database of variation for conservation efforts.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21270892 PMCID: PMC3060778 DOI: 10.1038/nature09687
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 69.504
Sumatran orangutan assembly statistics (ponAbe2).
| Total Contig Bases | 3.09 Gb |
| Total Contig Bases >Phred Q20 | 3.05 Gb (98.5%) |
| Ordered/Oriented Contigs & Scaffolds | 3.08 Gb |
| Number of Contigs >1 kb | 410,172 |
| N50 Contig Length | 15.5 kb |
| N50 Contig Number | 55,989 |
| Number of Scaffolds >2 kb | 77,683 |
| N50 Scaffold Length | 739 kb |
| N50 Scaffold Number | 1,031 |
| Average Read Depth | 5.53x |
Figure 1Divergence among great apes, a lesser ape, and an old world monkey with respect to humans
We estimated nucleotide divergence in unique gap-free sequence, indicated at each node, from the alignment of rhesus macaque (yellow), gibbon (purple), orangutan (orange), gorilla (aqua), chimpanzee (green) and human (blue) whole genome shotgun reads to the human reference (Hs.35)(S3). Note that the Bornean (Pongo pygmaeus) and Sumatran (Pongo abelii) orangutan species showed nucleotide identity comparable to that of bonobo (Pan paniscus) and chimpanzee (Pan troglodytes). *Yu et al. 200329, #Chen and Li 200130.
Number of genome rearrangements by species.
| Species | Rearrangements >100 kb | Rearrangements >5 kb |
|---|---|---|
| Orangutan | 38 | 861 |
| Chimpanzee | 85 (+124%) | 1095 (+27%) |
| Human | 54 (+42%) | 1238 (44%) |
The number in parentheses indicates the %Δ with respect to the orangutan genome. Note 40 events >100 kb and 532 events >5 kb were assigned to the human-chimpanzee ancestor by ancestral reconstruction (S6).
Figure 2The neocentromere of orangutan chromosome 12
Note the identical order of four BAC-derived FISH probes (IDs in S8) between the normal (left panel) and neocentromere-bearing (right panel) configurations of orangutan chromosome 12, despite discordant centromere positions indicated by arrows on the adjacent DAPI-only images. The neocentromere recruits centromeric proteins CENP-A and CENP-C and lies within a ~225 kb gene-free and alpha satellite-free region. The neocentromere-bearing variant is polymorphic in both Bornean and Sumatran populations, suggesting the neocentromere arose prior to the Bornean/Sumatran split, yet has not been fixed in either species.
Figure 3Alu quiescence in the orangutan lineage
We identified only ~250 lineage-specific Alu retroposition events in the orangutan genome, a dramatically lower rate than that of other sequenced primates, including humans. The total number of lineage-specific L1, SVA and Alu insertions is shown (pie chart), along with the rate of insertion events per element type (bar graph). Reduced Alu retroposition potentially limited the effect of a wide variety of repeat-driven mutational mechanisms in the orangutan lineage that played a major role in restructuring other primate genomes.
Figure 4Enrichment for positive selection in the cerebroside-sulfatid metabolism pathway
We identified six genes (indicated in yellow) under moderate to strong positive selection in primates (P<0.05) that fall within the cerebroside-sulfatid region of the sphingolipid metabolism pathway (adapted from human KEGG pathway 00600; http://www.genome.jp/kegg/kegg2.html). This pathway is associated with several human lysosomal storage disorders, such as Gaucher’s disease, Sandhoff’s disease, Tay Sachs disease and metachromatic leukodystrophy.
Figure 5Orangutan population genetics and demographics
a. Site-frequency spectra (SFS) for 13.2 million Bornean (blue) and Sumatran (red) SNPs are shown, note the enrichment of low-frequency SNPs among Sumatran individuals. b. The majority of SNPs were restricted to their respective island populations as the ‘heat’ of the 2D SFS, representing high allele counts, lay along the axes. c. Our demographic model estimated the ancestral orangutan population (N = 17,900) split approximately 400,000 years ago, followed by exponential expansion of Sumatran N and a decline of Bornean N, culminating in higher diversity among modern Sumatran orangutans despite a lower census population size. The model also supported low-level gene flow (<1 individual/generation) indicated by arrows.