| Literature DB >> 29321589 |
S R Chaudhry1,2, N Lwin1, D Phelan1, A A Escalante3, F U Battistuzzi4,5.
Abstract
Low complexity regions (LCRs) are a common feature shared by many genomes, but their evolutionary and functional significance remains mostly unknown. At the core of the uncertainty is a poor understanding of the mechanisms that regulate their retention in genomes, whether driven by natural selection or neutral evolution. Applying a comparative approach of LCRs to multiple strains and species is a powerful approach to identify patterns of conservation in these regions. Using this method, we investigate the evolutionary history of LCRs in the genus Plasmodium based on orthologous protein coding genes shared by 11 species and strains from primate and rodent-infecting pathogens. We find multiple lines of evidence in support of natural selection as a major evolutionary force shaping the composition and conservation of LCRs through time and signatures that their evolutionary paths are species specific. Our findings add a comparative analysis perspective to the debate on the evolution of LCRs and harness the power of sequence comparisons to identify potential functionally important LCR candidates.Entities:
Mesh:
Year: 2018 PMID: 29321589 PMCID: PMC5762703 DOI: 10.1038/s41598-017-18695-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Frequency and composition of homopolymeric regions (HPRs) in Plasmodium. (a) Frequency of HPRs in P. falciparum (Pf), P. vivax (Pv), P. cynomolgi (Pcy), and P. knowlesi (Pk) (CVK group). Frequencies are calculated as the ratio of proteins with ≥1 HPRs and the total number of proteins in each species. The inset shows the frequency of HPRs for 5 Pv strains (Pvb: P. vivax Brazil I; Pvi: P. vivax India VII; Pvm: P. vivax Mauritania; Pvnk: P. vivax North Korea; Pvs: P. vivax Salvador-1). (b) and (c) usage of amino acids in the proteome (b) and the HPRs (c) of Pf and the CVK group. (d) Ratio of amino acid usage in HPRs and proteomes. Only amino acids with >1% frequency in any of the species are shown in decreasing order of frequency in Pf.
Figure 2Asparagine and Lysine usage in HPRs and proteome of Pf (a) and the CVK lineages (b). Asterisks show significant t-test p-values at the 1% level. Additional statistical t-tests not shown are: HPRs Pf Asn vs. CVK Asn and Pf Lys vs. CVK Lys, both significant at p ≪ 0.001.
Figure 3Ancestrally reconstructed low complexity regions (LCRs) in the CVK group. Results are shown for regions conserved in the Pv strains as HPRs (a) or LCRs (b); in these cases, these regions are present as homopolymers in at least one of the other species). From tips to root, numbers in grey represent conserved states (HPR → HPR or LCR → LCR) and numbers in black represent changes (HPR → LCR or LCR → HPR).
Figure 4Preferred amino acid usage in HPRs of the Plasmodium genus. Values for Pv are averages of the 5 strains. Py includes values calculated for strains 17X (darker shade) and YM (lighter shade).