| Literature DB >> 22731960 |
Giovanni Marco Dall'Olio1, Hafid Laayouni, Pierre Luisi, Martin Sikora, Ludovica Montanucci, Jaume Bertranpetit.
Abstract
BACKGROUND: Asparagine N-Glycosylation is one of the most important forms of protein post-translational modification in eukaryotes. This metabolic pathway can be subdivided into two parts: an upstream sub-pathway required for achieving proper folding for most of the proteins synthesized in the secretory pathway, and a downstream sub-pathway required to give variability to trans-membrane proteins, and involved in adaptation to the environment and innate immunity. Here we analyze the nucleotide variability of the genes of this pathway in human populations, identifying which genes show greater population differentiation and which genes show signatures of recent positive selection. We also compare how these signals are distributed between the upstream and the downstream parts of the pathway, with the aim of exploring how forces of population differentiation and positive selection vary among genes involved in the same metabolic pathway but subject to different functional constraints.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22731960 PMCID: PMC3426484 DOI: 10.1186/1471-2148-12-98
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Overview of the Asparagine N-Glycosylation pathway. The Quality Control Cycle (also known as Calnexin/Calreticulin Cycle), which divides the pathway into two parts, is shown as an octagon. Genes classified as ‘upstream’ in the analysis are in blue; genes classified as ‘downstream’ are in green. Genes in gray have been excluded from the network analysis (see methods).
Summary of relevant results: Loci with extreme patterns of differentiation between continental groups. Loci showing extreme gene-level FST p-value after Bonferroni multiple comparison correction is applied taking into account the number of continental groups and the two methods applied
| Branching 1 | MAN2A2 | CSASIA | 3 × 10-12 | 4.2 × 10-11 |
| | MGAT3 | CSASIA | 2 × 10-8 | 2.8 × 10-7 |
| Branching 2 | B4GALT2 | MENA | 2 × 10-7 | 2.8 × 10-6 |
| | MGAT4A | EUR | 3 × 10-6 | 4.2 × 10-5 |
| | | EASIA | 3 × 10-6 | 4.2 × 10-5 |
| | ST3GAL4 | SSAFR | 3 × 10-7 | 4.2 × 10-6 |
| | ST8SIA3 | EUR | 2 × 10-7 | 2.8 × 10-6 |
| | ST8SIA6 | CSASIA | 3 × 10-6 | 4.2 × 10-5 |
| Substrates | DPM1 | MENA | 3 × 10-8 | 4.2 × 10-7 |
| | DPM3 | CSASIA | 2 × 10-24 | 2.8 × 10-23 |
| PMM1 | EASIA | 8 × 10-7 | 1.1 × 10-5 |
Populations were grouped as following: SSAFR, MENA, EUR, CSASIA, EASIA, OCE and AME standing respectively for Sub-Saharan Africa, Middle East-North Africa, Europe, Central South Asia, East Asia, Oceania and America.
Figure 2Distribution of Fvalues per SNP on the nearby chromosomal regions for a set of 10 genes (shown here as example) in the Asparagine N-Glycosylation pathway that show a signature of population differentiation in at least one continental group. The same representation but for all the genes in the pathway is presented as Supplementary Figure S2. Points in darker gray represent SNPs inside the gene. A blue smoothing line (calculated with the Loess function) is shown to facilitate the reading. Note that although 800 kb regions are plotted here, only the SNPs within 100 kb upstream and downstream of the gene have been included in the analysis.
Summary of relevant results: Genes putatively affected by positive selection. Loci showing gene-level iHS extreme empirical p-value after Bonferroni multiple comparison correction is considered taking into account the number of continental groups and the two methods applied
| Precursor Biosynthesis | ALG12 | MENA | 3 × 10-10 | 4.2 × 10-9 |
| | | EUR | 1 × 10-10 | 1.4 × 10-9 |
| | | EASIA | 5 × 10-11 | 7.0 × 10-10 |
| | GANAB | SSAFR | 4 × 10-7 | 5.6 × 10-6 |
| CNX_CRT | EDEM3 | OCE | 1 × 10-10 | 1.4 × 10-9 |
| | GCS1 | MENA | 2 × 10-14 | 2.8 × 10-13 |
| | | EUR | 2 × 10-16 | 2.8 × 10-15 |
| | | CSASIA | 2 × 10-6 | 2.8 × 10-5 |
| | UGCGL2 | SSAFR | 4 × 10-5 | 5.6 × 10-4 |
| Branching 1 | FUT8 | SSAFR | 1 × 10-4 | 1.4 × 10-3 |
| | MAN1A1 | AME | 5 × 10-13 | 7.0 × 10-12 |
| | MAN1A2 | SSAFR | 8 × 10-4 | 1.12 × 10-2 |
| | MAN2A1 | SSAFR | 4 × 10-23 | 5.6 × 10-22 |
| | | EASIA | 3 × 10-10 | 4.2 × 10-9 |
| | MGAT2 | EASIA | 4 × 10-7 | 5.6 × 10-6 |
| | MGAT3 | EASIA | 1 × 10-14 | 1.4 × 10-13 |
| Branching 2 | MGAT5B | MENA | 1 × 10-7 | 1.4 × 10-6 |
| | | EUR | 7 × 10-8 | 9.8 × 10-7 |
| | ST8SIA3 | MENA | 5 × 10-10 | 7.0 × 10-9 |
| | | EUR | 2 × 10-8 | 2.8 × 10-7 |
| Substrates | DPM1 | MENA | 5 × 10-12 | 7.0 × 10-11 |
| | | EUR | 1 × 10-8 | 1.4 × 10-7 |
| | | CSASIA | 2 × 10-16 | 2.8 × 10-15 |
| | DOLPP1 | CSASIA | 2 × 10-5 | 2.8 × 10-4 |
| | DPM2 | EUR | 5 × 10-7 | 7.0 × 10-6 |
| DPM3 | EASIA | 6 × 10-30 | 8.4 × 10-29 |
Populations were grouped as following: SSAFR, MENA, EUR, CSASIA, EASIA, OCE and AME standing respectively for Sub-Saharan Africa, Middle East-North Africa, Europe, Central South Asia, East Asia, Oceania and America.
Figure 3Distribution of iHS values per SNP on a the nearby chromosomal regions for a set of 15 genes (shown here as example) in the Asparagine N-Glycosylation pathway that show a signature of positive selection in at least one continental group. The same representation but for all the genes in the pathway is presented as Supplementary Figure S3. Points in darker gray represent SNPs inside the gene. A blue smoothing line (calculated with the Loess function) is shown to facilitate the reading. Note that although 800 kb regions are plotted here, only the SNPs within 100 kb upstream and downstream of the gene have been included in the analysis.
Mann–Whitney test between genes that do not show signals of positive selection (Group 1, 40 genes) and genes that show signatures of positive selection (Group 2, 17 genes), for each of the node centrality measured
| Betweenness | 1151.00 | 502.00 | 331.00 | −0.16 | 0.8749 |
| Centroid | 1201.50 | 451.50 | 298.50 | 0.73 | 0.4674 |
| Closeness | 1173.50 | 479.50 | 326.50 | 0.24 | 0.8135 |
| Eccentricity | 1157.00 | 496.00 | 337.00 | −0.05 | 0.9578 |
| Node.degree | 1183.50 | 469.50 | 316.50 | 0.42 | 0.6758 |
A p-value < 0.05 for a centrality measure would indicate that genes belonging to the two groups have different values for that centrality measure.
Mann–Whitney test between genes that do not show signals of population differentiation (Group 1, 47 genes) and genes that show signatures of population differentiation (Group 2, 10 genes), for each of the node centrality measured
| Betweenness | 1425.50 | 227.50 | 172.50 | 1.32 | 0.1885 |
| Centroid | 1357.50 | 295.50 | 229.50 | −0.12 | 0.9078 |
| Closeness | 1404.50 | 248.50 | 193.50 | 0.87 | 0.3830 |
| Eccentricity | 1485.00 | 168.00 | 113.00 | 2.59 | 0.0097** |
| Node Degree | 1254.00 | 399.00 | 126.00 | −2.33 | 0.0196* |
A p-value < 0.05 for a centrality measure would indicate that genes belonging to the two groups have different values for that centrality measure. Only Eccentricity remains marginally significant after a Bonferroni correction is applied.
Figure 4Box plot of network centrality parameters between genes that have empirical significant population differentiation (S) and genes showing no significant population differentiation (NS). Comparisons for (D) Eccentricity and (E) Node Degree were significant at p < 0.05.