| Literature DB >> 34791220 |
Pablo Mier1, Miguel A Andrade-Navarro1.
Abstract
Polyglutamine (polyQ) regions are highly abundant consecutive runs of glutamine residues. They have been generally studied in relation to the so-called polyQ-associated diseases, characterized by protein aggregation caused by the expansion of the polyQ tract via a CAG-slippage mechanism. However, more than 4,800 human proteins contain a polyQ, and only nine of these regions are known to be associated with disease. Computational sequence studies and experimental structure determinations are completing a more interesting picture in which polyQ emerge as a motif for modulation of protein-protein interactions. But long polyQ regions may lead to an excess of interactions, and produce aggregates. Within this mechanistic perspective of polyQ function and malfunction, we discuss polyQ definition and properties such as variable codon usage, sequence and context structure imposition, functional relevance, evolutionary patterns in species-centered analyses, and open resources.Entities:
Keywords: CAG-expansion diseases; aggregation; homorepeat; polyglutamine; protein–protein interaction
Mesh:
Substances:
Year: 2021 PMID: 34791220 PMCID: PMC8763233 DOI: 10.1093/gbe/evab246
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Number of PolyQ Regions in the Human Proteome (UniProt release 2020_06)
| Pure polyQ | |
|---|---|
|
|
|
| 4 | 744 |
| 5 | 260 |
| 6 | 134 |
| 7 | 73 |
| 8 | 57 |
| 9 | 19 |
| 10 | 49 |
| >11 | 165 |
|
| |
|
| |
|
| |
|
|
|
| ≥4/5 | 2,950 |
| ≥4/6 | 4,808 |
| ≥4/7 | 6,871 |
| ≥8/10 | 397 |
| ≥9/10 | 291 |
Fig. 1.PolyQ context and structure. PolyQ regions (red) are usually positioned C-terminally to a coiled coil (gray) and N-terminally to a polyP or Proline-rich region (green). Leucine residues (blue) are found typically at positions -1 and -5 from the polyQ. PolyQ regions can be either (a) disordered, (b) in helical conformation to aid protein–protein interaction, or (c) forming beta-aggregates.
Fig. 2Examples of polyQ evolution patterns in orthologous proteins. The analysis is centered in the human species and considers a series of species at various taxonomic distances. Alignment of orthologs of proteins (a) Mef2a, (b) CXXC1, (c) Gemi2, and (d) MGAP from species Gallus gallus (top), Bos taurus, Mus musculus, Macaca mulatta, Pongo abelii and H. sapiens (bottom). The phylogenetic tree was built using taxonomic information from the NCBI Taxonomy resource Common Taxonomy Tree (Sayers et al. 2009) and the R library phyloseq version 1.29.0 (McMurdie and Holmes 2013). The complete alignments can be found in supplementary files S2–S5, Supplementary Material online.
Resources Related to PolyQ Regions
| Resource | Content | Reference |
|---|---|---|
| sQanner | Evaluation of the abundance of polyQ regions in a protein data set |
|
| EvoPPI 1.0 | Comparison of PPI data from several databases within and among ten species. The authors illustrate its use with a detailed case study using all nine polyQ-associated disease proteins |
|
| HDNetDB | Interaction knowledge related to Huntington’s disease, a polyQ-associated disease |
|
| dAPE | Assessment of the evolution of homorepeats and their protein context |
|
| HRaDis | Database of associations between homorepeats and diseases |
|
| PolyQ database | Information about 135 mouse models of polyQ-associated diseases, plus detailed descriptions about phenotypes and therapeutic approaches tested in vivo |
|
| HRaP | Database of occurrence of homorepeats and patterns in 122 proteomes (110 species) |
|