| Literature DB >> 25946223 |
Yuan Li1, Björn Canbäck1, Tomas Johansson1, Anders Tunlid1, Honor C Prentice1.
Abstract
The dimeric metabolic enzyme phosphoglucose isomerase (PGI, EC 5.3.1.9) plays an essential role in energy production. In the grass Festuca ovina, field surveys of enzyme variation suggest that genetic variation at cytosolic PGI (PGIC) may be adaptively important. In the present study, we investigated the molecular basis of the potential adaptive significance of PGIC in F. ovina by analyzing cDNA sequence variation within the PgiC1 gene. Two, complementary, types of selection test both identified PGIC1 codon (amino acid) sites 200 and 173 as candidate targets of positive selection. Both candidate sites involve charge-changing amino acid polymorphisms. On the homology-modeled F. ovina PGIC1 3-D protein structure, the two candidate sites are located on the edge of either the inter-monomer boundary or the inter-domain cleft; examination of the homology-modeled PGIC1 structure suggests that the amino acid changes at the two candidate sites are likely to influence the inter-monomer interaction or the domain-domain packing. Biochemical studies in humans have shown that mutations at several amino acid sites that are located close to the candidate sites in F. ovina, at the inter-monomer boundary or the inter-domain cleft, can significantly change the stability and/or kinetic properties of the PGI enzyme. Molecular evolutionary studies in a wide range of other organisms suggest that PGI amino acid sites with similar locations to those of the candidate sites in F. ovina may be the targets of positive/balancing selection. Candidate sites 200 and 173 are the only sites that appear to discriminate between the two most common PGIC enzyme electromorphs in F. ovina: earlier studies suggest that these electromorphs are implicated in local adaptation to different grassland microhabitats. Our results suggest that PGIC1 sites 200 and 173 are under positive selection in F. ovina.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25946223 PMCID: PMC4422690 DOI: 10.1371/journal.pone.0125831
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Festuca ovina individuals analyzed.
| Individual | Sample code | Sampling site | Microhabitat category | PGIC electromorph phenotype |
|---|---|---|---|---|
| 1 | AB7:1:2aIII | AB | 7 | 1,2 |
| 2 | TR6:2:5aIII | TR | 6 | 2 |
| 3 | PE2:1:2bIII | PE | 2 | 1,4 |
| 4 | BY2:2:2aIII | BY | 2 | 1,2,4 |
| 5 | BY7:2:2aII | BY | 7 | 2,4 |
| 6 | GL1:2:5aIII | GL | 1 | 2,5 |
| 7 | GL6:2:2aII | GL | 6 | 2,4 |
| 8 | PE1:1:4aII | PE | 1 | 2,4,6 |
| 9 | TR6:2:2a | TR | 6 | 1,4 |
| 10 | AB2:1:1aIII | AB | 2 | 2 |
| 11 | AB2:1:3aIII | AB | 2 | 1,6 |
| 12 | PE7:1:3aIII | PE | 7 | 2 |
| 13 | GL7:2:1aIII | GL | 7 | 1,2 |
| 14 | BY1:1:2aIII | BY | 1 | 1,2 |
| 15 | TR1:2:2aII | TR | 1 | 2 |
NOTES.—Sample code is the collecting code for each individual. Sampling sites are located in the alvar grasslands of Öland, Sweden, from south to north: Albrunna (AB) 56° 19′ N/16° 25′ E, Penåsa (PE) 56° 26′ N/16° 27′ E, Gösslunda (GL) 56° 29′ N/16° 30′ E, Bårby alvar (BY) 56° 31′ N/16° 29′ E and Torrör (TR) 56° 34′ N/16° 34′ E. Microhabitat categories are 1 (low pH, moist), 2 (low pH, dry), 6 (high pH, dry) and 7 (high pH, moist). PGIC electromorph phenotypes were scored as in Prentice et al. [17], [18].
Tests for positive selection using ω-ratio tests: candidate amino acid sites identified by all the ω-ratio tests are underlined.
| Candidate targets (amino acid sites) of positive selection identified by | ||||
|---|---|---|---|---|
|
| PAML | PAML | HyPhy/REL | omegaMap |
| 19–195 | (2 | (2 | 53 | |
| 196–561 | 109, | 109, |
| |
| 562–855 |
|
|
| |
| 856–1650 | (2 | (2 | ||
| 19–1650 | 43, 46–53, 128, 145, 148–150, 170, 172, | |||
aAll the PAML and HyPhy analyses were run on four non-recombinant PgiC1 segments that are delimited on the basis of the three recombinant breakpoints (nucleotide positions 195, 561 and 855) identified by GARD.
bWithin the PAML package, two nested tests, each including one neutral (M1a/M7) and one selection model (M2a/M8), were used to test for selection. Within each of the two nested tests, 2dL is the likelihood-ratio statistic used to test whether the selection model fits the data significantly better than the neutral model (df = 2).
Candidate targets of positive selection identified in M2a, M8 are those with posterior probabilities >50% in Bayes Empirical Bayes analyses. Sites with posterior probabilities >99% are indicated in bold text.
*: 0.01 < p < 0.05.
cThe model REL within the Hyphy package was used to test for selection. Candidates targets of positive selection are those with Bayes factors > 50, candidates with Bayes factors larger than 500 are shown in bold text.
dCandidate targets of positive selection identified in omegaMap are those with posterior probabilities >50%. Sites with posterior probabilities >99% are shown in bold text.
Amino acid polymorphism among the 29 translated PGIC1 amino acid sequences from F. ovina.
| 2 | 3 | 5 | 6 | 7 | 8 | 9 | 14 | 18 | 19 | 20 | 22 | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Hap | pI | Ind | Ind EM | 43 | 47 | 48 | 49 | 53 | 64 | 109 | 128 | 145 | 149 | 150 | 170 | 173 | 200 | 227 | 334 | 423 | 425 | 452 | 474 | 521 |
| Consensus | A | G | I | Y | A | K | H | Q | P | V | V | T | E | D | I | G | T | S | S | F | Q | |||
| Hap1 | 6.89 | 1 |
| - | - | - | - | - | - | N | - | - | - | - | - |
|
| - | - | - | T | - | L | - |
| Hap1 | 6.89 | 9 |
| - | - | - | - | - | - | N | - | - | - | - | - |
|
| - | - | - | T | - | L | - |
| Hap13 | 6.89 | 3 |
| - | - | - | - | S | - | N | - | - | - | - | - |
|
| - | - | - | - | - | - | - |
| Hap14 | 6.89 | 3 |
| - | - | - | - | S | - | N | - | - | - | - | - |
|
| L | - | - | - | - | - | - |
| Hap8 | 6.89 | 6 | 2,5 | - | - | - | - | - | - | N | - | - | - | - | - |
|
| - | - | - | - | - | - | - |
| Hap3 | 6.93 | 9 |
| - | - | - | - | - | R | - | - | - | - | - | - |
|
| - | - | - | - | - | - | - |
| Hap17 | 6.93 | 11 |
| - | - | - | - | - | - | - | - | - | - | - | - |
|
| - | - | - | - | - | - | - |
| Hap20 | 6.93 | 13 |
| - | - | - | - | - | - | - | - | - | - | - | - |
|
| - | - | - | - | - | - | - |
| Hap21 | 6.93 | 14 |
| - | - | - | - | - | - | - | - | - | - | - | - |
|
| - | - | - | - | - | - | - |
| Hap5 | 6.96 | 4 |
| - | - | V | H | S | - | - | - | - | - | - | - |
|
| - | - | A | - | - | - | - |
| Hap4 | 7.05 | 8 |
| - | S | - | - | - | - | N | - | - | - | - | - |
| N | L | - | - | - | G | - | - |
| Hap16 | 7.05 | 11 | 1,6 | - | - | - | - | - | - | N | - | - | - | - | - |
| N | - | - | - | - | - | - | - |
| Hap9 | 7.07 | 6 |
| - | - | - | - | S | - | - | - | - | - | - | - | Q |
| - | - | A | - | - | - | - |
| Hap6 | 7.07 | 7 |
| - | - | - | - | - | - | - | - | - | - | - | - | Q |
| - | - | A | - | - | - | - |
| Hap6 | 7.07 | 15 |
| - | - | - | - | - | - | - | - | - | - | - | - | Q |
| - | - | A | - | - | - | - |
| Hap15 | 7.07 | 10 |
| - | - | - | - | - | - | - | - | - | - | - | - | Q |
| - | A | - | - | - | - | - |
| Hap15 | 7.07 | 15 |
| - | - | - | - | - | - | - | - | - | - | - | - | Q |
| - | A | - | - | - | - | - |
| Hap19 | 7.07 | 12 |
| - | - | V | H | S | - | N | - | - | - | - | - | Q |
| - | - | - | - | - | - | - |
| Hap7 | 7.07 | 7 |
| - | - | - | - | - | - | - | - | - | - | - | - |
| G | - | - | - | - | - | - | - |
| Hap7 | 7.07 | 13 | 1, | - | - | - | - | - | - | - | - | - | - | - | - |
| G | - | - | - | - | - | - | - |
| Hap2 | 7.07 | 1 | 1, | - | - | - | - | - | - | - | - | Q | - | - | - |
| G | - | - | - | - | - | - | - |
| Hap2 | 7.07 | 8 |
| - | - | - | - | - | - | - | - | Q | - | - | - |
| G | - | - | - | - | - | - | - |
| Hap2 | 7.07 | 4 | 1, | - | - | - | - | - | - | - | - | Q | - | - | - |
| G | - | - | - | - | - | - | - |
| Hap2 | 7.07 | 2 |
| - | - | - | - | - | - | - | - | Q | - | - | - |
| G | - | - | - | - | - | - | - |
| Hap2 | 7.07 | 14 | 1, | - | - | - | - | - | - | - | - | Q | - | - | - |
| G | - | - | - | - | - | - | - |
| Hap10 | 7.07 | 5 | 2,4 | - | - | - | - | - | - | - | - | - | - | I | A |
|
| - | - | - | - | - | - | R |
| Hap18 | 7.07 | 12 |
| - | - | - | - | - | - | - | - | - | - | - | - |
| N | - | - | - | - | - | - | - |
| Hap12 | 7.11 | 2 |
| V | - | - | - | - | - | - | H | - | - | - | - |
| N | - | - | - | - | - | - | - |
| Hap11 | 7.21 | 5 | 2,4 | - | - | - | - | - | - | N | - | - | I | - | - | Q | G | - | - | - | - | - | - | - |
NOTES.—The exon number is shown in the first row of the table header. The vertically arranged numbers in the second row of the table header indicate the location of each polymorphic amino acid site. The first row beneath the table header is the consensus sequence, where each amino acid represents the most frequent residue in the alignment at each position. Each of the subsequent rows represents a single predicted amino acid sequence; for each sequence (apart from the consensus sequence), Hap = haplotype, pI = isoelectric point (predicted from the translated haplotype sequence using Protein calculator v3.3 http://www.scripps.edu/~cdputnam/protcalc.html), Ind = the F. ovina individual from which the sequence was derived, Ind EM = the PGIC enzyme electromorph phenotype (coded for by the PgiC1 and PgiC2 loci) of the individual. Apart from the consensus sequence, all the sequences are listed in ascending order of predicted pI values. Dashes (-) within a sequence indicate that the amino acid residue at a polymorphic site is the same as in the consensus sequence. The negatively charged residues at the selected polymorphic amino acid sites 173 and 200 are shown in bold italics. In the column Ind EM, bold italics indicate: EM 1 in the sequences with negatively charged residues at both sites 173 and 200; EM 2 in sequences with negatively charged residue at either sites 173 or 200, but not at both sites.
Fig 1Homology-modeled 3-D structure of F. ovina PGIC1 and the candidate targets of positive selection.
(A) A PGIC1 dimer, with the two monomers shown in green and yellow, respectively. The candidate sites 200 and 173 (space-filled, orange) are mapped onto one of the monomers. Site 200 is located on the edge of the inter-monomer boundary. Shown in the active sites are the four most conserved PGI residues (equivalent to Lys516, Glu360, His391 and Arg274 in F. ovina) [35] (space-filled, magenta). (B) A PGIC1 monomer showing that the candidate site 173 (space-filled, orange) is located on the edge of the cleft between the small (green) and large (yellow) domains. The active site is partially located within the cleft. The two domains are interconnected by a single polypeptide (Domain connection: dark blue). (C)-(E) Local structures showing that the charge changes at the candidate sites may affect the inter-monomer interaction or the domain-domain packing. (C) Shows all the residues occurring within a distance of 6Å from the candidate site 200 (space-filled, orange). The acidic Asp200 is adjacent to two basic residues: Lys199 (space-filled, green) is located in the same monomer (monomer I, green) as Asp200; Lys179 (space-filled, yellow) is in the opposite monomer (monomer II, yellow). (D) & (E) Show all the residues occurring within a distance of 6Å from the candidate site 173 (space-filled, orange) which is located on the small domain (green). Site 173 is close to Lys297 (space-filled, dark blue) which is located on the domain connection (dark blue). When the amino acid variant at the candidate site 173 is Gln, a hydrogen bond (magenta dotted line, panel (D)) is predicted between Gln173 and Lys297 by the DeepView/Swiss-PdbViewer.
3-D structural locations of PGI amino acid sites in a range of organisms.
| Species | Amino acid site | Location in the PGI 3-D protein structure | PGI 3-D protein structure |
|---|---|---|---|
|
| 114 | Edge of the cleft between the large and small domains |
|
|
| 200 | Edge of the inter-monomer boundary |
|
|
| 112 | Edge of the cleft between the large and small domains |
|
| 238 | Surface of the small domain |
| |
|
| 111 | Edge of the cleft between the large and small domains |
|
| 372 | Interpenetrating loop across interface between monomers |
—
| |
|
| 66 | N-terminus |
|
| 301 | Edge of the cleft between the large and small domains |
| |
|
| 369 | Interpenetrating loop across interface between monomers |
—
|
|
| 195 | Edge of the inter-monomer boundary |
|
| 83 | Close to the edge of the cleft between the large and small domains |
| |
| 100 | Close to the edge of the cleft between the large and small domains |
|
aAmino acid sites that have been proposed as being potential targets of selection.
bFour functionally important human PGI amino acid sites that share similar locations to the two candidate targets of positive selection in F. ovina are also included in the table [36], [37].
cThe location of the amino acid site was identified in the present study, but see
dThe locations of the amino acid sites 372 and 375 in M. cinxia and of the sites 369 and 375 in C. eurytheme were identified in earlier studies [11], [15].
ePGI 3-D protein structures that have been used, in the present study, for identifying the locations of the amino acid sites listed in the second column.
fBecause the majority of the PGI amino acid sequence is not available for L. stylosa, the homology-modeled PGI 3-D structure from the related L. crassa was used as a proxy (see Material and Methods).
Fig 23-D structural locations of PGI amino acid sites in a range of organisms.
In panels (A)-(E), the amino acid sites shown in red are those that have been proposed as the potential targets of positive/balancing selection in (A) Melitaea cinxia (site 111 [14]), (B) Dioscorea tokoro (site 112 [9]), (C) Arabidopsis thaliana (site 114 [38]), (D) Tigriopus californicus (site 301 [12]) and (E) Leavenworthia stylosa (site 200 [13]): the sites are located on the edge of either the inter-monomer boundary or the inter-domain cleft. In panel (F), amino acid sites shown in red are those at which mutation has been shown to significantly alter the activity of PGI in Homo sapiens [36] (for the sake of simplicity, only two of the four sites listed in Table 4 are shown here: site 83 is located close to the edge of the inter-domain cleft, while site 195 is located on the edge of the inter-monomer boundary). The four most conserved residues in the active site [35] are indicated in dark magenta in all the panels. The small and large domains, in the PGI monomers in panels (A)-(D) and in one of the two PGI monomers in panels (E) and (F), are shown in dark green and yellow, respectively; the remaining monomer in each of panels E and F is shown in grey.
Fig 3Multi-species PGI amino acid sequence alignment around the F. ovina candidate targets of positive selection.
The multi-species alignment shows that the F. ovina PGIC1 candidate site 200 is close to (potentially) functionally important sites in other species. The F. ovina PGIC1 site 200 is next to the L. stysola PGI site 200 that has been proposed as a candidate target of balancing selection [13], and is just two amino acid residues away from the human PGI site 195, a mutation at which has been shown to significantly reduce the enzyme stability of PGI [36]. The alignment includes PGI amino acid sequences from F. ovina (Hap 2), L. crassa (GenBank protein id/gb: AF054455 [39]) and Homo sapiens (PDB code/pdb: 1jlh [40]). Because the majority of the PGI amino acid sequence is not available for L. stylosa, a sequence from the related L. crassa was used instead.