| Literature DB >> 28529468 |
Yuan Li1, Bengt Hansson1, Lena Ghatnekar1, Honor C Prentice1.
Abstract
BACKGROUND: Phosphoglucose isomerase (PGI, EC 5.3.1.9) is an essential metabolic enzyme in all eukaryotes. An earlier study of the PgiC1 gene, which encodes cytosolic PGI in the grass Festuca ovina L., revealed a marked difference in the levels of nucleotide polymorphism between the 5' and 3' portions of the gene.Entities:
Keywords: Cytosolic phosphoglucose isomerase; Festuca ovina; Nucleotide polymorphism; PgiC1; Purifying selection
Mesh:
Substances:
Year: 2017 PMID: 28529468 PMCID: PMC5437402 DOI: 10.1186/s41065-017-0032-6
Source DB: PubMed Journal: Hereditas ISSN: 0018-0661 Impact factor: 3.271
Fig. 1The structure of the PgiC1 gene and the 3-D protein structure of its gene product. a PGI dimer, coded for by PgiC1, in F. ovina and homology modelled in an earlier study [34]. One monomer is shown in yellow. In the other monomer, the large domain is shown in green and the small domain is shown in dark blue. The three active site residues (equivalent to Lys516, Glu360, and His391 in F. ovina) that are directly involved in the PGI isomerization reaction are shown in red. The rest of the monomer is represented in light blue. b The gene structure is summarized for the part of the PgiC1 gene corresponding to the 1 633 bp sequence of the 29 Öland sequences characterized in the present study. Boxes represent the exons and lines represent introns. The gene structure is scaled according to an earlier published PgiC1 gene sequence (GenBank accession numbers HQ616103). The 5’ and 3’ portions (see Fig. 2) of the PgiC1 gene that are compared in the present study include, respectively, exons 1–12 and exons 13–22
Fig. 2Sliding window analyses of nucleotide diversity (π T), Tajima’s D and ω. The ticks on the x axis represent the boundary of each analysed PgiC1 exon within the PgiC1 coding sequence. In F. ovina, PgiC1 exons 5–12 encode the small domain of a PGI monomer while exons 13–21 encode the large domain. The two dots on the x axis show the locations of the two PgiC1 codon sites (173 and 200) that were earlier identified as being candidates for positive diversifying selection [34]. The three stars on the x axis represent the three active site residues (equivalent to Lys516, Glu360, and His391 in F. ovina) that are directly involved in the PGI isomerization reaction [35]. The grey dotted vertical line shows the location of intron 12, which is used as the demarcation point for defining the 5’ and 3’ portions of PgiC1 sequence. The brown dotted vertical line indicates codon site 200 which is under positive diversifying selection and located at or near to peaks of π T, Tajima’s D and ω
Analyses of the 3’ and 5’ portions of 29 PgiC1 sequences from Öland F. ovina
|
|
|
|
|
|
|
| Mean | Tajima’s | Fay & Wu’s | Wall’s | Wall’s | Mean conservation score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5’ portion | 872 | 68 | 21 | 19 | 20 | 20 | 0.383 | 0.280 | −0.195 n.s. | 3.355 n.s. | 0.015* | 0.030* | 0.105 |
| 3’ portion | 761 | 20 | 15 | 4 | 7 | 1 | 0.026 | 0.128 | −1.363 n.s. | −1.768 n.s. | 0.000 n.s. | 0.000 n.s. | −0.121 |
The table shows the lengths of the 5’ and 3’ portions of the sequence (L), the number of segregating sites (S), the number of haplotypes (N h), the total nucleotide diversity (π), the total Watterson’s estimator of population mutation rate (per site) (θ W), the minimum number of recombination events (R M), the population recombination parameter (ρ) (per site), the average ω ratio, Tajima’s D, Fay & Wu’s H, and Wall’s B and Q, as well as the mean conservation score (the smaller the value, the more conserved) estimated using the online ConSurf Server. Because parts of the coding sequence are not available for the outgroup sequence from F. altissima, the 5’ and 3’ portions of F. ovina PgiC1 that were considered in the Fay and Wu’s H test span, respectively, coding sequence nucleotide positions 259-828 & 919-1530
*0.01 < P < 0.05; n.s. non-significant
Fig. 3Levels of recombination and linkage disequilibrium (LD) within the F. ovina PgiC1 sequence. a Population recombination rate (ρ) across PgiC1. The heavy line represents the mean value of ρ, while the thin lines represent the upper and lower 95% HPD (highest posterior density) interval bounds for the posterior distribution of ρ. b LD matrix for PgiC1. The level of LD is measured by the pairwise correlation coefficient r 2 values [42] for all the polymorphic nucleotide sites (except for one that segregates into more than two nucleotides [34]). Shades of grey indicate the r 2 values, ranging from r 2 = 0 (white) to r 2 = 1 (black). The proportional spacing of the polymorphic sites, which is scaled according to an earlier published PgiC1 gene sequence (GenBank accession number HQ616103), is indicated by black vertical lines on a white horizontal bar (shown above the LD matrix)
MacDonald-Kreitman tests for the 3’ and 5’ portions of F. ovina PgiC1
| Gene | Synonymous | Non-synonymous | Fisher’s exact test | ||
|---|---|---|---|---|---|
| Portion | Fixeda ( | Polyb ( | Fixeda ( | Polyb ( |
|
| 5’ portion | 17 | 31 | 4 | 10 | 0.755 n.s. |
| 3’ portion | 31 | 9 | 2 | 5 | 0.018* |
Because parts of the coding sequence are not available for the outgroup sequence from F. altissima, the 5’ and 3’ portions of F. ovina PgiC1 that were considered in the test span, respectively, coding sequence nucleotide positions 259-828 & 919-1530
aThe number of fixed substitutions (per gene) between F. ovina and the outgroup (the number of nucleotide sites that are fixed for different nucleotides in F. ovina and the outgroup)
bThe number of nucleotide polymorphic sites (per gene) within F. ovina
*0.01 < P < 0.05; n.s. non-significant