| Literature DB >> 25671092 |
Cai Li1, Yong Zhang2, Jianwen Li2, Lesheng Kong3, Haofu Hu2, Hailin Pan2, Luohao Xu2, Yuan Deng2, Qiye Li1, Lijun Jin2, Hao Yu2, Yan Chen2, Binghang Liu2, Linfeng Yang2, Shiping Liu2, Yan Zhang2, Yongshan Lang2, Jinquan Xia2, Weiming He2, Qiong Shi2, Sankar Subramanian4, Craig D Millar5, Stephen Meader3, Chris M Rands3, Matthew K Fujita6, Matthew J Greenwold7, Todd A Castoe8, David D Pollock9, Wanjun Gu10, Kiwoong Nam11, Hans Ellegren12, Simon Yw Ho13, David W Burt14, Chris P Ponting3, Erich D Jarvis15, M Thomas P Gilbert16, Huanming Yang17, Jian Wang2, David M Lambert4, Jun Wang18, Guojie Zhang19.
Abstract
BACKGROUND: Penguins are flightless aquatic birds widely distributed in the Southern Hemisphere. The distinctive morphological and physiological features of penguins allow them to live an aquatic life, and some of them have successfully adapted to the hostile environments in Antarctica. To study the phylogenetic and population history of penguins and the molecular basis of their adaptations to Antarctica, we sequenced the genomes of the two Antarctic dwelling penguin species, the Adélie penguin [Pygoscelis adeliae] and emperor penguin [Aptenodytes forsteri].Entities:
Keywords: Adaptation; Antarctica; Avian genomics; Evolution; Penguins
Year: 2014 PMID: 25671092 PMCID: PMC4322438 DOI: 10.1186/2047-217X-3-27
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Basic statistics of assembly and annotation of the two penguin genomes
| Species | Contig N50 length (bp) | Scaffold N50 length (bp) | Assembly size (bp) | (G + C)% | Repeats (%) | #Protein coding genes |
|---|---|---|---|---|---|---|
|
| 19,134 | 5,047,175 | 1,226,103,150 | 41.7% | 6.47 | 15,270 |
|
| 30,463 | 5,071,598 | 1,257,483,768 | 41.8% | 7.38 | 16,070 |
Figure 1Phylogenetic relationships and changes in effective population sizes of two penguin species. (A) Phylogeny of two penguins and six closely related aquatic species (northern fulmar Fulmarus glacialis; great cormorant Phalacrocorax carbo; crested ibis Nipponia nippon; dalmatian pelican Pelecanus crispus; little egret Egretta garzetta; red-throated loon Gavia stellata) (blue names), and a land bird (zebra finch Taeniopygia guttata). The estimates of topology and divergence times are from our avian phylogenomic study [24]. Horizontal bars at each node represent 95% credibility intervals of estimated divergence times. Above the tree are the geological timescale and temperature changes over the past 65 million years, relative to the present [26]. PETM, Paleocene–Eocene Thermal Maximum. (B) Dynamic changes of effective population sizes (N ) of two penguins inferred by the pairwise sequentially Markovian coalescent (PSMC) method. The thick curves depict the estimated N values of the two penguins, and the thin curves represent PSMC bootstrapping estimates. (C) Enlargement of the period from 100 KYA to 10 ka in panel (B). MIS 4, Marine Isotope Stage 4; LGM, last glacial maximum. Temperature change data are from [33].
Figure 2Penguin-specific duplications of keratinocyte β-keratin genes. (A) Numbers of keratinocyte β-keratin genes in the two penguins and other birds. Error bars indicate standard deviations. P-values were calculated by phylogenetic ANOVA. (B) Phylogenetic cladogram of keratinocyte β-keratin genes of the Adélie penguin (PYGAD, in blue), emperor penguin (APTFO, in red), and five aquatic relatives (northern fulmar, FULGL; crested ibis, NIPNI; great cormorant, PHACA; little egret, EGRGA; dalmatian pelican, PELCR). Shading in the tree indicates putative penguin-specific gene duplication events.
Figure 3Cases of positively selected sites and non-neutral penguin-specific amino acid changes. (A) Positively selected sites in emperor penguin CNGB1 protein sequence. Cytoplasmic and transmembrane regions are separated by the dashed lines. Blue shading represents the membrane-spanning helix, and the cAMP binding domain is shown in grey. The posterior probabilities were calculated using BEB method in CODEML. (B) Positively selected sites in the FASN protein in the Adélie lineage (green dots) and the ancestral lineage (blue dots). The molecular binding domains of FASN are shown in light red, whereas the major catalytic domains are shown in grey. From left to right, beta-ketoacyl synthase (KS), acyl and malonyl transferases (MAT), enoyl reductase (ER), beta-ketoacyl reductase (KR), and thioesterase (TE). The posterior probabilities were calculated using BEB method in CODEML. (C) Non-neutral penguin-specific amino acid changes in the EVC2 protein. One substitution site is located in the Pfam domain EVC_like (PF12297).