| Literature DB >> 31612960 |
Bálint Mészáros1, Gábor Erdős1, Beáta Szabó2, Éva Schád2, Ágnes Tantos2, Rawan Abukhairan2, Tamás Horváth2, Nikoletta Murvai2, Orsolya P Kovács2, Márton Kovács2, Silvio C E Tosatto3, Péter Tompa2,4, Zsuzsanna Dosztányi1, Rita Pancsa2.
Abstract
Membraneless organelles (MOs) are dynamic liquid condensates that host a variety of specific cellular processes, such as ribosome biogenesis or RNA degradation. MOs form through liquid-liquid phase separation (LLPS), a process that relies on multivalent weak interactions of the constituent proteins and other macromolecules. Since the first discoveries of certain proteins being able to drive LLPS, it emerged as a general mechanism for the effective organization of cellular space that is exploited in all kingdoms of life. While numerous experimental studies report novel cases, the computational identification of LLPS drivers is lagging behind, and many open questions remain about the sequence determinants, composition, regulation and biological relevance of the resulting condensates. Our limited ability to overcome these issues is largely due to the lack of a dedicated LLPS database. Therefore, here we introduce PhaSePro (https://phasepro.elte.hu), an openly accessible, comprehensive, manually curated database of experimentally validated LLPS driver proteins/protein regions. It not only provides a wealth of information on such systems, but improves the standardization of data by introducing novel LLPS-specific controlled vocabularies. PhaSePro can be accessed through an appealing, user-friendly interface and thus has definite potential to become the central resource in this dynamically developing field.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31612960 PMCID: PMC7145634 DOI: 10.1093/nar/gkz848
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Data integrated into PhaSePro. Manual curation (left) is based on the literature. The biological function of the organelle, the partners required for and other determinants of LLPS are expressed as free text descriptions, while all other annotations are encoded using references to UniProt sequences or as terms in various ontologies and controlled vocabularies (CVs) (see Supplementary Tables). Automated annotations (top right) are added from the outputs of various sequence-based prediction methods (IUPred (38) for disorder prediction and Pfam (39) for conserved protein regions), from ProtVista (40) (for sequence variants omitting predicted variants), from PhosphoSitePlus (37) (for phosphorylation, methylation, acetylation and ubiquitination sites) and from PDBe (41) (for structures overlapping the LLPS driver region). Entries are cross-linked to various data resources and the data contained in PhaSePro can be accessed via download or the dedicated API.
Figure 2.Entry pages in PhaSePro. Each entry page is structured to display information pertaining to specific aspects of the liquid–liquid phase separation (LLPS) or the formed membraneless organelles. The figure uses the entry of human Tau protein (https://phasepro.elte.hu/entry/P10636–8).