| Literature DB >> 35260629 |
D Noll1,2,3, F Leon1, D Brandt4, P Pistorius5, C Le Bohec6,7, F Bonadonna8, P N Trathan9, A Barbosa10, A Raya Rey11,12,13, G P M Dantas14, R C K Bowie15, E Poulin2,3, J A Vianna16,17,18.
Abstract
Although mitochondrial DNA has been widely used in phylogeography, evidence has emerged that factors such as climate, food availability, and environmental pressures that produce high levels of stress can exert a strong influence on mitochondrial genomes, to the point of promoting the persistence of certain genotypes in order to compensate for the metabolic requirements of the local environment. As recently discovered, the gentoo penguins (Pygoscelis papua) comprise four highly divergent lineages across their distribution spanning the Antarctic and sub-Antarctic regions. Gentoo penguins therefore represent a suitable animal model to study adaptive processes across divergent environments. Based on 62 mitogenomes that we obtained from nine locations spanning all four gentoo penguin lineages, we demonstrated lineage-specific nucleotide substitutions for various genes, but only lineage-specific amino acid replacements for the ND1 and ND5 protein-coding genes. Purifying selection (dN/dS < 1) is the main driving force in the protein-coding genes that shape the diversity of mitogenomes in gentoo penguins. Positive selection (dN/dS > 1) was mostly present in codons of the Complex I (NADH genes), supported by two different codon-based methods at the ND1 and ND4 in the most divergent lineages, the eastern gentoo penguin from Crozet and Marion Islands and the southern gentoo penguin from Antarctica respectively. Additionally, ND5 and ATP6 were under selection in the branches of the phylogeny involving all gentoo penguins except the eastern lineage. Our study suggests that local adaptation of gentoo penguins has emerged as a response to environmental variability promoting the fixation of mitochondrial haplotypes in a non-random manner. Mitogenome adaptation is thus likely to have been associated with gentoo penguin diversification across the Southern Ocean and to have promoted their survival in extreme environments such as Antarctica. Such selective processes on the mitochondrial genome may also be responsible for the discordance detected between nuclear- and mitochondrial-based phylogenies of gentoo penguin lineages.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35260629 PMCID: PMC8904570 DOI: 10.1038/s41598-022-07562-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Distribution of gentoo penguins around the Southern Ocean. The figure shows the sample locations (colored circles): northern gentoo: Martillo and Falkland/Malvinas I.; southern gentoo: Signy Island, O’Higgins base, Stranger Point and Gabriel Gonzalez Videla base; southeastern gentoo: Courbet Peninsula, Kerguelen I.; eastern gentoo: Marion and Crozet I. and open circles represent areas with unsampled colonies. Map images provided by Shutterstock database and edited in Adobe Illustrator.
Indices of diversity of mt-genes (N: number of sequences, H: number of haplotypes, S: number of polymorphic sites, π: nucleotide diversity, Np: number of peptides generated after translation of sequences, ω: dN/dS).
| mt-gen | N | Size (bp) | H | S | π | Np | ω |
|---|---|---|---|---|---|---|---|
| ATP6 | 62 | 681 | 6 | 28 | 0.0127 | 3 | 0.038 |
| ATP8 | 62 | 162 | 2 | 2 | 0.0046 | 2 | – |
| CYTB | 62 | 1137 | 10 | 30 | 0.0088 | 6 | 0.100 |
| COX1 | 62 | 1533 | 7 | 33 | 0.0076 | 4 | 0.019 |
| COX2 | 62 | 675 | 5 | 16 | 0.0084 | 3 | 0.021 |
| COX3 | 62 | 783 | 3 | 11 | 0.0045 | 2 | 0.021 |
| ND1 | 62 | 978 | 7 | 18 | 0.0065 | 4 | 0.075 |
| ND2 | 62 | 1029 | 9 | 22 | 0.0071 | 8 | 0.179 |
| ND3 | 62 | 348 | 5 | 11 | 0.0105 | 4 | 0.142 |
| ND4 | 62 | 1368 | 10 | 35 | 0.0086 | 4 | 0.110 |
| ND4L | 62 | 294 | 6 | 7 | 0.0055 | 4 | 0.248 |
| ND5 | 62 | 1803 | 7 | 41 | 0.0080 | 5 | 0.097 |
| ND6 | 62 | 516 | 7 | 18 | 0.0109 | 2 | 0.048 |
*ω was not estimated for ATP8 because it only has 2 haplotypes.
Figure 2(a) Haplotype distribution of gentoo penguin lineages around the Southern Ocean; (b) Distribution of haplotypes translated into amino acid sequences. Map images provided by Shutterstock database and edited in Adobe Illustrator.
Figure 3Bayesian phylogenetic reconstruction obtained from partitions of 13 protein coding genes (11,170 bp). On the phylogenies, stars, circles, and squares represent codons under selection obtained using different approaches. Stars and circles represent codons with positive selection signals obtained from only TreeSAAP or only codeml analyses, respectively. Squares represent sites under selection using both programs. Abbreviations on the map: APF (Antarctic Polar Front) and STF (Sub-Tropical Front). All nodes were supported by PP = 1.0. Map images provided by Shutterstock database and edited in Adobe Illustrator.
Figure 4Mito-nuclear discordance between topologies of gentoo penguins. Left: Genomic phylogeny (SNAPP) generated using 4429 SNPs in Pertierra et al. (2020). Right: Bayesian phylogeny from 13 mitochondrial protein coding genes (11,170 bp).
Detection of positive selection in codons of OXPHOS mitochondrial genes using four methods: TreeSAAP, Fubar and codeml.
| OXPHOS Complex | Gene | Codon | TreeSAAP | Fubar | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| From–to | Branch | Cat | Prop | Branch | PP | M2 | M8 | ||||
| V | ATP6 | 28 | Ala–Thr | SE + N + S | 6 | ↓Pα | ns | ns | ns | ns | ns |
| 60 | Thr–Ala | SE + N + S | 6 | ↑Pα | ns | ns | ns | 0.75* | ns | ||
| 181 | Ala–Thr | SE | 6 | ↓Pα | ns | ns | ns | ns | ns | ||
| I | ND1 | 22 | Ile–Val | E | 8 | ↑EC | SE + N + S | 0.75* | ns | ns | ns |
| 90 | Ile–Leu | N + S | 8 | ↑EC | ns | ns | ns | ns | ns | ||
| 137 | Ile–Val | E | 8 | ↑EC | ns | ns | ns | ns | ns | ||
| I | ND2 | 2 | ns | ns | ns | ns | ns | 1*** | 1*** | 0.921 | |
| 321 | ns | ns | ns | ns | ns | 1*** | 1*** | 0.929 | |||
| I | ND3 | 105 | Tyr–His | E | 7 | ↑αm | ns | ns | ns | ns | ns |
| I | ND4 | 14 | Thr–Ala | E | 6 | ↑ Pα | ns | ns | ns | ns | ns |
| 15 | Ala–Thr | E | 6 | ↓ Pα | ns | ns | ns | ns | ns | ||
| 29 | Thr–Ala | S | 6 | ↑ Pα | N + S | 0.78* | ns | ns | ns | ||
| 191 | Thr–Ala | E | 6 | ↑ Pα | ns | ns | ns | ns | ns | ||
| 280 | Thr–Ala | E | 6 | ↑ Pα | ns | ns | ns | ns | ns | ||
| 418 | ns | ns | ns | ns | ns | 0.78* | 0.81** | ns | |||
| I | ND5 | 22 | Thr–Ala | SE + N + S | 6 | ↑Pα | ns | ns | ns | ns | ns |
| 428 | Thr–Ala | N + S | 6 | ↑Pα | ns | ns | 0.89** | 0.89** | 0.92 | ||
| 500 | Ala–Thr | SE + N + S | 6 | ↓Pα | ns | ns | 0.79* | 0.81** | ns | ||
| 518 | ns | ns | ns | N + S | 0.96** | ns | ns | ns | |||
Pα: α-helical tendencies; αm: Power to be at the middle of alpha-helix; EC equilibrium constant, E eastern gentoo, SE southeastern gentoo, N northern, S southern gentoo. Codeml BS (Branch-site model) and SM (site model).
Posterior probability estimates by Bayes Empirical Bayes in site and branch-site models (PP: Posterior probability; *:PP > 0.7, **:PP > 0.8, ***: PP > 0.99).
Estimation ΔLRT of M1a/M2a and M7/M8 nested codeml site models (*p < 0.05) to detect genes candidates of positive selection.
| Gen | Alternative model | Null model | ΔLRT | DF | |
|---|---|---|---|---|---|
| ATP6 | Model 2a | Model 1a | 0.0150 | 2 | 0.9920 |
| ATP6 | Model 8 | Model 7 | 0.5650 | 2 | 0.7530 |
| COX1 | Model 2a | Model 1a | 0.0002 | 2 | 0.9998 |
| COX1 | Model 8 | Model 7 | 0 | 2 | 1 |
| COX2 | Model 2a | Model 1a | 0.0001 | 2 | 0.9999 |
| COX2 | Model 8 | Model 7 | 0 | 2 | 1 |
| COX3 | Model 2a | Model 1a | 0 | 2 | 1 |
| COX3 | Model 8 | Model 7 | 0 | 2 | 1 |
| CYTB | Model 2a | Model 1a | 0.0002 | 2 | 0.9990 |
| CYTB | Model 8 | Model 7 | 0 | 2 | 1 |
| ND1 | Model 2a | Model 1a | 0 | 2 | 1 |
| ND1 | Model 8 | Model 7 | 0 | 2 | 1 |
| ND2 | Model 2a | Model 1a | 12.0300 | 2 | 0.0024* |
| ND2 | Model 8 | Model 7 | 12.4700 | 2 | 0.0020* |
| ND3 | Model 2a | Model 1a | 0 | 2 | 1 |
| ND3 | Model 8 | Model 7 | 0 | 2 | 1 |
| ND4 | Model 2a | Model 1a | 0.3390 | 2 | 0.8430 |
| ND4 | Model 8 | Model 7 | 2.1970 | 2 | 0.3330 |
| ND4L | Model 2a | Model 1a | 0.0001 | 2 | 0.9990 |
| ND4L | Model 8 | Model 7 | 0.0053 | 2 | 0.9990 |
| ND5 | Model 2a | Model 1a | 2.1680 | 2 | 0.3380 |
| ND5 | Model 8 | Model 7 | 9.4800 | 2 | 0.0088* |
| ND6 | Model 2a | Model 1a | 0 | 2 | 1 |
| ND6 | Model 8 | Model 7 | 0 | 2 | 1 |
Estimation ΔLRT of nested codeml branch-site model A (*p < 0.05).
| Branch | Gen | ΔLRT (Null) | ΔLRT (M1a) | df | ||
|---|---|---|---|---|---|---|
| SE + N + S | ATP6 | 0.50 | 3.98 | 1 | 0.478 | 0.046* |
| SE + N + S | ND1 | 2.55 | 2.01 | 1 | 0.110 | 0.150 |
| N + S | ND4 | 0.94 | 0.94 | 1 | 0.330 | 0.330 |
| SE | ND5 | 7.94 | 0.01 | 1 | 0.005* | 0.931 |
| N + S | ND5 | 8.65 | 2.72 | 1 | 0.003* | 0.099 |