Literature DB >> 31824649

An intrinsically disordered proteins community for ELIXIR.

Norman E Davey1, M Madan Babu2, Martin Blackledge3, Alan Bridge4, Salvador Capella-Gutierrez5, Zsuzsanna Dosztanyi6, Rachel Drysdale7, Richard J Edwards8, Arne Elofsson9, Isabella C Felli10, Toby J Gibson11, Aleksandras Gutmanas12, John M Hancock7, Jen Harrow7, Desmond Higgins13, Cy M Jeffries14, Philippe Le Mercier4, Balint Mészáros6, Marco Necci15, Cedric Notredame16,17, Sandra Orchard18, Christos A Ouzounis19, Rita Pancsa20, Elena Papaleo21, Roberta Pierattelli10, Damiano Piovesan15, Vasilis J Promponas22, Patrick Ruch23, Gabriella Rustici24, Pedro Romero25, Sirarat Sarntivijai7, Gary Saunders7, Benjamin Schuler26, Malvika Sharan11, Denis C Shields13, Joel L Sussman27, Jonathan A Tedds7, Peter Tompa28, Michael Turewicz29, Jiri Vondrasek30, Wim F Vranken31, Bonnie Ann Wallace32, Kanin Wichapong33, Silvio C E Tosatto15.   

Abstract

Intrinsically disordered proteins (IDPs) and intrinsically disordered regions (IDRs) are now recognised as major determinants in cellular regulation. This white paper presents a roadmap for future e-infrastructure developments in the field of IDP research within the ELIXIR framework. The goal of these developments is to drive the creation of high-quality tools and resources to support the identification, analysis and functional characterisation of IDPs. The roadmap is the result of a workshop titled "An intrinsically disordered protein user community proposal for ELIXIR" held at the University of Padua. The workshop, and further consultation with the members of the wider IDP community, identified the key priority areas for the roadmap including the development of standards for data annotation, storage and dissemination; integration of IDP data into the ELIXIR Core Data Resources; and the creation of benchmarking criteria for IDP-related software. Here, we discuss these areas of priority, how they can be implemented in cooperation with the ELIXIR platforms, and their connections to existing ELIXIR Communities and international consortia. The article provides a preliminary blueprint for an IDP Community in ELIXIR and is an appeal to identify and involve new stakeholders. Copyright:
© 2019 Davey NE et al.

Entities:  

Keywords:  ELIXIR; cellular regulation; community standards; databases; intrinsically disordered proteins; protein dynamics; protein function; protein-protein interactions

Mesh:

Substances:

Year:  2019        PMID: 31824649      PMCID: PMC6880265          DOI: 10.12688/f1000research.20136.1

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


Introduction

Intrinsically disordered regions (IDRs), protein segments that lack persistent secondary or tertiary structure ( Chouard, 2011; Dyson & Wright, 2005; Tompa, 2011), are predicted to cover almost a third of the residues in eukaryotic proteomes ( Pancsa & Tompa, 2012; Xue ). IDRs play a central role in cell regulation and contribute significantly to the cellular complexity of higher eukaryotes ( Dunker ; Dyson & Wright, 2005; Forman-Kay & Mittag, 2013; Gouw ; Mitrea & Kriwacki, 2016; Schad ; Tompa, 2005; Van Roey ). They represent a major source of protein diversity and versatility on the level of organisms and during evolution ( Babu ; Buljan ; Davey ; Light ; Weatheritt & Gibson, 2012). In the human proteome, IDRs are expected to contain up to one hundred thousand interaction interfaces and a million sites of post-translational modification ( Tompa ). However, to date, only a small fraction of these functional modules have been characterised ( Gouw ; Schad ). IDR-mediated interactions are commonly found in dynamic and transient complexes that underlie enzyme inhibition, signal transduction and liquid-liquid phase transition ( Borgia ; Ivarsson & Jemth, 2019; Kriwacki ; Martin & Mittag, 2018; Mitrea & Kriwacki, 2016; Olsen ; Scott & Pawson, 2009; Wright & Dyson, 2015). IDR-mediated interactions are also major determinants of protein regulation, and are so far known to affect: (i) the post-translational modification state of a protein by acting as docking sites to recruit modifying enzymes; (ii) the cellular half-life of a protein by recruiting E3 ubiquitin ligases resulting in ubiquitin-dependent proteasomal degradation of the protein; and (iii) the localisation of a protein by acting as signals that target proteins to specific subcellular locations ( Beltrao ; Davey & Morgan, 2016; Gouw ; Guharoy ; Iakoucheva ; Mészáros ; Van Roey ). Many functions of IDRs are directly associated with their structural attributes, without directly contributing to binding events that result in complex formation; for example, IDRs can act as entropic springs, flexible linkers or spacers ( Tompa, 2005; van der Lee ), or fly-casting regions to capture binding partners ( Shoemaker ). Furthermore, IDRs are subject to extensive pre- and post-translational regulation to modulate protein function in response to cellular stimuli ( Bah & Forman-Kay, 2016; Csizmok & Forman-Kay, 2018; Van Roey ; Van Roey ; Weatheritt ). As a result of the fundamental regulatory functions performed by IDRs, and the cell-state conditionality of these regulatory processes, IDRs encode many of critical steps of a protein life-cycle from the ribosome to the proteasome. Consequently, IDRs play a key role in many human diseases, including cancer (p53), Alzheimer’s disease (Aβ, Tau) or Parkinson’s disease (α-synuclein) ( Shigemitsu & Hiroaki, 2018; Uversky ) and human IDR interfaces are often mimicked by pathogens to hijack host pathways and deregulate the cell ( Davey ; Dyson & Wright, 2018; Kruse ; Via ; Xue ). Given their therapeutic relevance, IDP-mediated interactions are now seen as potential drug targets ( Corbi-Verge & Kim, 2016). Therefore, a better understanding of their structure and function will help to develop new strategies to fight human diseases. IDP research spans several experimental fields studying protein structure and function including structural biology, biophysics, biochemistry, cell biology, proteomics, comparative genomics, systems biology, synthetic biology and pharmacology ( Figure 1A) ( Blikstad & Ivarsson, 2015; Corbi-Verge & Kim, 2016; Felli & Pierattelli, 2015; Forman-Kay & Mittag, 2013; Plitzko ). On a structural level, IDRs do not adopt a single stable highly populated structure, instead, they are structurally heterogeneous, continuously sample a wide ensemble of conformations, with preferentially sampled intramolecular contacts driving local transient secondary structure and compaction of conformations ( Davey, 2019; Dyson & Wright, 2005; Forman-Kay & Mittag, 2013; Holehouse & Pappu, 2018). Therefore, IDRs are more easily described by probabilistic models than the intuitive visual representations of structures of folded protein regions. Classical methods for structural characterisation of a protein, such as X-Ray crystallography, are unable to capture the dynamic structures of IDRs. Instead, the structural aspects of the IDRs are studied by a range of biophysical methods including Nuclear Magnetic Resonance (NMR), Small-angle X-ray scattering (SAXS), circular dichroism (CD) or Förster resonance energy transfer (FRET) ( Felli & Pierattelli, 2015; Fuertes ; Holmstrom ; Plitzko ; Tompa, 2011; Tolchard ). The results of these complementary methods can be integrated to build a model of IDR structure and dynamics.
Figure 1.

( A) Key experimental methods used across the three major research focuses of the IDP field: structure, interactome and function. ( B) The growth in structural IDP data curated by the DisProt database ( Piovesan ). DisProt was established over a decade ago in the USA, and recently brought to Europe after years of inactivity and completely re-annotated, this explains the lag in the curation. A huge amount of IDP/IDR data in the IDP literature remain uncurated and the vast majority of IDP regions remain to be characterised ( Pancsa & Tompa, 2012; Xue ). ( C) The growth in functional IDP data curated by the Eukaryotic Linear Motif (ELM) resource ( Gouw ). Similar to the structural IDR data, a huge amount of functional IDP/IDR data in the IDP literature remains uncurated and the vast majority of functional modules in IDPs remain to be characterised ( Tompa ).

( A) Key experimental methods used across the three major research focuses of the IDP field: structure, interactome and function. ( B) The growth in structural IDP data curated by the DisProt database ( Piovesan ). DisProt was established over a decade ago in the USA, and recently brought to Europe after years of inactivity and completely re-annotated, this explains the lag in the curation. A huge amount of IDP/IDR data in the IDP literature remain uncurated and the vast majority of IDP regions remain to be characterised ( Pancsa & Tompa, 2012; Xue ). ( C) The growth in functional IDP data curated by the Eukaryotic Linear Motif (ELM) resource ( Gouw ). Similar to the structural IDR data, a huge amount of functional IDP/IDR data in the IDP literature remains uncurated and the vast majority of functional modules in IDPs remain to be characterised ( Tompa ). The functional aspects of IDPs are studied by a battery of well-established cell biology and biophysical approaches ( Gibson ) ( Figure 1). Many of the approaches were developed for protein interaction elucidation, for example, by coupling mutagenesis to affinity-purification. However, interactions mediated by IDRs are often defined by their low affinity and cell-state dependent conditionality, two properties that are not always amenable to the available experimental protein interaction detection methods ( Gibson ; Van Roey ). Therefore, there are a growing number of approaches that have been specifically designed to characterise low affinity IDR-mediated interactions such as peptide arrays, proteomic phage display (ProP-PD) and peptides attached to Microspheres with Ratiometric Barcode Lanthanide Encoding (MRBLE-pep) ( Blikstad & Ivarsson, 2015; Davey ; Nguyen ; Volkmer, 2009). As IDRs often adopt a dominant conformation in their bound state, many IDR-containing interfaces can be studied in complex with the structured binding partners using structural approaches including NMR spectroscopy and X-Ray Crystallography ( Bonetti ; Fuxreiter, 2018; Iešmantavicius ; Schad ). Distinct IDR functionalities are also studied using isolation, deletion or mutagenesis of a functional module combined with bespoke assays to characterise phase separation, protein localisation, stability and post-translational modification ( Gibson ; Gouw ; Martin & Mittag, 2018). The computational IDP field tackles several distinct research tasks including: (i) the prediction of IDRs ( Cilia ; Dosztányi ), (ii) the identification of functional modules in IDRs ( Dosztányi ; Edwards & Palopoli, 2015; Krystkowiak & Davey, 2017; Neduva & Russell, 2006), (iii) docking of peptides that are intrinsically disordered in their unbound state ( Raveh ; Trabuco ), (iv) force field development and molecular simulations for IDR structure ( Best, 2017; Chong ; Huang & MacKerell, 2018; Stanley ), (v) in silico inhibitor design and development for IDRs ( Baggett & Nath, 2018; Santofimia-Castaño ; Yu ) and (vi) the processing of the data produced by experimental structural and functional analyses of IDRs ( Bernadó ; Franke ; Nodet ; Ozenne ) ( Table 1). The members of the European IDP community are major contributors across these computational IDP fields ( Figure 2).
Table 1.

Representative core software of the IDP field developed and hosted in Europe.

NameDescription
IUPredPrediction of intrinsically disordered regions. URL: https://iupred2a.elte.hu ( Dosztányi et al., 2005)
ESpritzPrediction of intrinsically disordered regions URL: http://protein.bio.unipd.it/espritz/ ( Walsh et al., 2012)
FoldIndexPrediction of intrinsically disordered regions URL: https://fold.weizmann.ac.il/fldbin/findex ( Prilusky et al., 2005)
MobiDB-litePrediction of intrinsically disordered regions URL: http://protein.bio.unipd.it/mobidblite/ ( Necci et al., 2017)
DynaMinePrediction of protein backbone dynamics URL: https://dynamine.ibsquare.be ( Cilia et al., 2014)
DiLiMotPrediction of functional modules in intrinsically disordered regions. URL: http://dilimot.russelllab.org/ ( Neduva & Russell, 2006)
SLiMSearchPrediction of functional modules in intrinsically disordered regions. URL: http://slim.ucd.ie/slimsearch/ ( Krystkowiak & Davey, 2017)
ANCHORPrediction of functional modules in intrinsically disordered regions. URL: https://iupred2a.elte.hu ( Dosztányi et al., 2009)
PepSite 2Prediction of IDP binding sites. URL: http://pepsite2.russelllab.org/ ( Trabuco et al., 2012)
FlexPepDockDocking of IDPs to their ordered binding partner. URL: http://flexpepdock.furmanlab.cs.huji.ac.il/ ( Raveh et al., 2011)
δ2DSecondary structure propensity from NMR data URL: http://www-mvsoftware.ch.cam.ac.uk/ ( Camilloni et al., 2012)
ncSPCStructural propensity calculator from NMR data URL: http://linuxnmr02.chem.rug.nl/ncSPC ( Tamiola & Mulder, 2012)
Flexible-MeccanoGeneration of ensemble descriptions of IDPs URL: http://www.ibs.fr/research/scientific-output/software/flexible-meccano/ ( Ozenne et al., 2012)
ATSASA data analysis suite for SAXS data including the EOM tool URL: https://www.embl-hamburg.de/biosaxs/ ( Bernadó et al., 2007; Franke et al., 2017)
GROMACSMolecular Dynamics software applicable to IDPs URL: http://www.gromacs.org/ ( Hess et al., 2008)
PLUMED2Library for free energy calculations URL: https://www.plumed.org/ ( Tribello et al., 2014)
Figure 2.

Representative core software ( Table 1, marked with a diamond) and resources ( Table 2) for the IDP field developed and hosted in Europe.

Europe hosts many of the key IDP resources ( Table 2). DisProt is the largest database of manually curated experimental data on IDP structure ( Piovesan ; Sickmeier ). MobiDB and D are central resources of pre-computed structural prediction of IDPs ( Piovesan ; Oates ). ELM is a manually curated database of binding regions residing in IDPs ( Gouw ). SASBDB and PCDDB are central resources for Small-angle X-ray Scattering (SAXS) and Circular Dichroism (CD) data ( Valentini ; Whitmore ). PED is a database of conformational ensembles from Nuclear Magnetic Resonance (NMR) and SAXS data, and Molecular Dynamics (MD) simulations ( Varadi ). It is important to note that the vast majority of the computational frameworks developed by the IDP field had to be created from scratch given the unique structural and functional aspects of IDRs. For example, the analysis of IDP structure drew predominantly from polymer biophysics rather than the existing structural biology of folded proteins ( Holehouse & Pappu, 2018; Milles ; Schuler ).
Table 2.

Representative core resources of the IDP field developed and hosted in Europe.

NameDescription
DisProtCurated database of experimentally validated intrinsically disordered regions. URL: http://www.disprot.org ( Piovesan et al., 2017)
MobiDBDatabase of predicted intrinsically disordered regions. URL: http://mobidb.bio.unipd.it ( Piovesan et al., 2018)
D2P2Database of predicted intrinsically disordered regions. URL: http://d2p2.pro/ ( Oates et al., 2013)
PED3Database of IDP conformational ensembles. URL: http://pedb.vib.be/ ( Varadi et al., 2014)
CheZODDatabase of structural propensities from NMR chemical shift data. URL: http://www.protein-nmr.org/ ( Nielsen & Mulder, 2016)
SASBDBRepository for Small-angle X-ray Scattering (SAXS) data. URL: https://www.sasbdb.org/ ( Valentini et al., 2015)
PCDDBRepository for Circular Dichroism (CD) data. URL: http://pcddb.cryst.bbk.ac.uk/ ( Whitmore et al., 2017)
ELMCurated database of Short, Linear Motifs. URL: http://elm.eu.org/ ( Gouw et al., 2018)
DIBSCurated database of intrinsically disordered binding regions. URL: http://dibs.enzim.ttk.mta.hu/ ( Schad et al., 2018)
Switches.ELMCurated conditional interactomics database for IDPs. URL: http://switches.elm.eu.org ( Van Roey et al., 2013)
The ELIXIR IDP Community has grown out of the NGP-NET COST Action (Non-globular proteins: From sequence to structure, function and application in molecular physiopathology), a scientific cooperation funded in 2015 under a Horizon 2020 EU Framework Programme. The NGP-NET community spanned 30 different countries, plus EMBL-EBI and EMBL Heidelberg. NGP-NET held a series of thematic workshops on IDPs to drive the development of computational resources and community standards. A strategic workshop titled “Intrinsically Disordered Proteins in Core Data Resources” was organised by NGP-NET at the EBI campus, Hinxton, the UK on June 1–2, 2017 to discuss the integration of IDP-related data and computational resources into the ELIXIR framework. A major outcome of this workshop was the recognition that IDP annotations are significantly underrepresented in the ELIXIR Core Data Resources (CDRs) ( Durinx ). These initial insights evolved into a comprehensive plan to develop the IDP field and integrate key IDP resources and tools into the CDRs. That plan formed the basis for the ELIXIR IDP Community proposal that was submitted and presented at the ELIXIR Head of Nodes meeting in Basel, Switzerland on September 13th, 2017. After completing the ELIXIR Community application process, including the creation of this white paper, the IDP Community proposal was accepted by the ELIXIR Head of Nodes on May 14th, 2019. In parallel with these developments, the IDP research community has developed closer integration with ELIXIR activities. A follow up strategic meeting, “The 2 nd Workshop on Intrinsically Disordered Proteins in Core Data Resources”, was held in Prague on March 13–14, 2019 and brought together data producers, database developers and a representative of the ELIXIR Data Platform to discuss the integration of IDP data into the CDRs; and a member of the IDP research community attended the ELIXIR Interoperability Platform Face To Face Meeting on 1–2 April 2019. Furthermore, several actions have already taken place within the ELIXIR framework to tackle time-sensitive priorities including two implementation studies, “Implementation study for the integration of ELIXIR-IIB in ELIXIR Data Curation activities” and “Integration and standardisation of intrinsically disordered protein data”, focussing on interoperability. An additional, fundamental development associated with these implementation studies was the initialisation of the HUPO (Human Proteome Organisation) Proteomics Standards Initiative (PSI) Intrinsic Disorder workgroup. The goal of the IDP Community in ELIXIR is to support the development of standards, tools and resources to accelerate the identification, analysis and functional characterisation of intrinsically disordered regions. Here, we introduce the major areas of priority for the development of an e-infrastructure that will allow the community to realise these goals while supporting the needs of the generators, users and consumers of IDP data.

Identification of community challenges

A strategic workshop titled “An intrinsically disordered protein user community proposal for ELIXIR” was held on October 31st, 2018 at the University of Padua, Italy. The meeting was attended by participants from 13 ELIXIR nodes: Belgium, Cyprus, Czech Republic, EBI, Germany, Hungary, Ireland, Israel, Italy, Netherlands, Spain, Switzerland, and the United Kingdom. ELIXIR members in both Greece and Sweden were also interested, but could not attend. The ELIXIR Hub team was represented by John Hancock, the ELIXIR Communities and Services Coordinator and representatives from the Interoperability, Tools and Training ELIXIR platforms. Members of the 3DBioInfo and Proteomics ELIXIR Communities, the Instruct-ERIC structural biology research infrastructure, and the ELIXIR CDRs of high relevance to the IDP research, namely PDBe ( Mir ), IMEx/IntAct ( Orchard ) and UniProt ( UniProt Consortium, 2019), were also present. During the meeting, the following topics were identified as key areas of priority for the ELIXIR IDP Community.

Area 1 – Standards and exchange formats for IDP data

The IDP field has not yet established official standards to allow consistent storage and dissemination of data. Hence, a key responsibility of the IDP Community is the definition of guidelines and standards to improve the reproducibility, interpretation, and dissemination of experimental data. The development of novel standards would result in a shift from the current “organic” interoperability of the community, where each group defines their own formats and creates a range of parsers to read the formats developed by other groups, to standardised data dissemination accessible to the larger biological community. For functional data describing protein interactions, molecular interaction interchange standards based on the guidelines of the HUPO-PSI-MI molecular standards ( Sivade Dumousseau ) can be applied. However, for structural data, no pre-existing solution exists. In the simplest cases, experimental evidence for IDRs can be mapped to protein sequences as sequence features. Structural descriptions such as IDR conformational ensembles (e.g. based on experimental NMR, SAXS or MD simulations data) or secondary structural propensities (e.g. based on NMR data) will require more descriptive standards, especially if they are probabilistic. Only a subset of the required experimental and functional ontologies are already available and most IDP data cannot be described by pre-existing terminology. IDPs will require novel entries to controlled vocabularies describing specific experimental methods and protein structural concepts. The first step of this process was undertaken at a workshop for the ELIXIR implementation study: “Integration and standardisation of intrinsically disordered protein data” held at the University of Padua, October 29–30th, 2018. The meeting proposed a first draft Minimum Information About Disorder Experiments (MIADE) standard defining the data required for reporting an IDP experiment. A further key outcome of the meeting was the establishment of the HUPO-PSI intrinsic disorder (HUPO-PSI ID) workgroup to drive the development of the required standards, storage and dissemination formats and controlled vocabularies for the community. Upon completion, the HUPO-PSI ID recommendations will be adopted by the key community resources promoting effortless integration of IDP data into the ELIXIR CDRs. An important goal of the roadmap will be the initiation of collaborations with the experimental communities for each of the distinct structural and functional methods used in IDP research to develop: (i) experimental method specific standards and (ii) workflows with minimal required experimental detail for the characterisation of IDPs. These developments would require extensive collaboration with the data generators and the experimental method specific data repositories including SASBDB ( Valentini ), PCDDB ( Whitmore ) and the BMRB ( Ulrich ) to allow data produced by IDP researchers to be stored and disseminated in the most descriptive and efficient way possible. This will simplify the integration of the results of these analyses into the community resources. The recent development of the PDB-Dev repository of non-atomistic or part-atomistic structural data can provide a prototype for the deposition of experimental IDP data ( Vallat ; Peng ).

Area 2 - Automated and community-driven curation

The vast majority of the experimental data describing IDPs, the functional modules encoding their function, the regulatory mechanisms conditionally controlling that function, and their dysregulation in disease, is isolated in the text of research and review articles and in poorly formatted supplementary tables. This hampers the integration of IDP information with data such as protein function, modification, splice variants and disease-causing single-nucleotide polymorphisms (SNPs). Consequently, data created at great expense is significantly underutilised. Annotation of bona fide IDPs, especially for their function, is currently a labour-intensive process. The IDP community has already been experimenting with crowdsourced curation, a topic which is of interest to the ELIXIR Data Platform, and included within Task 3 of the ELIXIR Data Platform 2019–2023 ELIXIR Programme. DisProt is a successful example, leveraging the collective expertise of around 40 researchers from a dozen different IDP labs in as many countries. The rate and accuracy of curation can be improved by integrating the available ELIXIR e-infrastructure into the curation process through partial annotation of articles and through automatic selection, classification and prioritisation of the relevant articles for curation. For example, the annotation of IDP-mediated interactions directly into the IMEx consortium annotation portal would provide a pre-built environment for such an endeavour. Automatic triage and pre-compiling of data based on Europe PMC data would also greatly boost productivity, allowing the community to cope with the increasing amount and complexity of data being published ( Britan ). The large number of articles describing the structure and function of IDPs has highlighted the need to exploit text-mining approaches in order to eventually leverage automatic annotation from the literature. ELIXIR would facilitate integration with existing text-mining frameworks. Finally, a future goal of the community is the early capture of IDP data pre-publication directly from the data producers to reduce the need for manual curation. The coordinated experimental and computational fields within the ELIXIR IDP Community can provide a single contact point to lobby journals to require data deposition prior to publication.

Area 3 - Integration with ELIXIR Core Data Resources

IDP annotations are significantly underrepresented in the ELIXIR CDRs and a key goal of the ELIXIR IDP community is to facilitate the integration of IDP data and services into these resources. The CDRs do not currently annotate or import experimental data describing IDR structure and function despite their high abundance and functional importance. Recently, InterPro, PDBe, and UniProt adopted IDP predictions from sequence retrieved from MobiDB, as part of an ELIXIR Implementation Study “Integration and standardisation of intrinsically disordered protein data”. This work can be used as a blueprint for the successful integration of additional IDP data into further CDRs. It would be advantageous both for the IDP Community and the CDRs to comprehensively integrate the available IDP data into these resources. Interoperability can be guaranteed by the implementation of IDP specific ontologies to describe IDP specific features and the availability of standards-compliant RESTful APIs, enabling cross-linking and programmatic access to IDP resources. Many of the key IDP resources already host RESTful APIs and utilise persistent identifiers from CDRs. However, these APIs are currently not fully interoperable with each other without bespoke adaptors, though this will be tackled by Priority Area 1 “Standards and exchange formats for IDP data”. The development of curation guidelines and standards in line with the requirements of the CDRs, or the adoption of CDR guidelines, will streamline the integration process. The IDP resources that annotate functional modules and their interactions are currently not members of the International Molecular Exchange (IMEx) consortium ( Orchard ). A key step on the roadmap is joint curation efforts between the ELIXIR IDP Community and the IMEx consortium of IDP-mediated interactions, thereby reducing duplication of effort.

Area 4 – Standardisation, benchmarking and indexing of computational tools

The IDP research field is currently developing best practices for scientific (i/o) file formats, data analysis pipelines and benchmarking of scientific tools. These steps are being taken to raise the quality and accessibility of software developed by the community to produce more accurate, faster, more stable and user-friendly software implementation. The development and adoption of the storage and dissemination formats in Priority Area 1 will help standardise the (i/o) file formats of IDP tools. However, several use cases will not be covered by these formats (e.g. standardised i/o formats for residue-specific IDR scoring) and additional effort will be required to formalise the output formats in these areas. Containerised software, such as those available via BioContainers, or package managers, such as BioConda, are rarely used in IDP research. However, such advances would make IDP tools more accessible to the wider community and would simplify their benchmarking. The field of IDP research could also benefit from the development of reusable experimental workflows such as those implemented with the Common Workflow Language for commonly used IDP analysis pipelines built on top of ELIXIR CDRs and Deposition Databases. These workflows could then be managed using the Galaxy workflow manager platform. Benchmarking remains an issue for the community as the lack of common benchmarking datasets has hampered the systematic assessment of IDP tools. As a result, many publications have claimed superior performance for biased datasets, furthering the need for standardisation. Due to the availability of high-quality manually curated IDP data from DisProt, the community can provide gold standard blind datasets to run periodic benchmarking assessments of IDR prediction tools. This approach has recently been successfully applied by the Critical Assessment of Intrinsic Disorder (CAID) initiative. A similar platform for the comparison of methods predicting functional modules within IDPs or aligning homologous IDRs would also benefit the community and drive advances in these developing fields. The development of Open Source software to benchmark IDP analysis methods across a wide range of performance metrics will simplify and standardise the assessment of IDP tools. OpenEbench can be used as an information hub to distribute reference datasets, to run comparative assessments and to publish benchmarking results. From a technical point of view, the adoption of BioContainers and/or Galaxy, and the standardisation of input and output formats would streamline the assessment process. Benchmarking results covering both scientific and technical aspects of the available IDP analysis tools can be hosted at OpenEBench to simplify method selection by users. Finally, the addition of these computational tools for the analysis of IDPs to the bio.tools registry would improve the visibility of these tools.

Area 5 – Development of a centralised knowledge-base for IDP data

Computational IDP researchers based in Europe develop many of the tools and resources that underlie the global IDP e-infrastructure. However, these assets are currently spread over numerous institutes and universities across Europe. The development of an umbrella resource, DisProtCentral (founded June 2017), consolidating the European IDP resources and tools through a single portal will improve the accessibility of these resources for the wider biological community. The DisProtCentral consortium will provide a central hub to access high-quality curation, annotation and predictions of structural and functional information on IDPs, in addition to providing stable identifiers/URIs to describe regions of disorder within specific proteins, enabling cross-referencing and linking between resources. The consortium will include all the stakeholders in the IDP field and provide a centralised repository for the protein disorder-related databases and tools. This will future-proof these key IDP resources against issues arising from the loss of funding or change of group focus. This initiative to produce such a centralised resource draws attention to the need for a fit-for-purpose interoperability tooling adaptor that translates across the metadata annotations of individual resources. This will allow pre-existing data from distinct resources to be integrated via normalisation functionalities such as ontology cross-referencing for data previously mapped to different ontologies/vocabularies, or ontology term mapping for those that have not been mapped to any standard terminologies. Ideally, newly generated data should be produced FAIR-at-source (Findable, Accessible, Interoperable, and Reusable) ( Wilkinson ). DisProtCentral will play a vital role in encouraging IDP data generators to follow the standardised set of best practices: identifying recommended ontologies that are universally adopted by those data generators, defining a common schema markup strategy, and validating persistent identifiers for data to be deposited into the resource. A wider goal of the DisProtCentral resource is to provide a single point of contact to promote discourse and collaboration to ensure that the needs of the IDP data generators, users and consumers are all being met by the computational IDP researchers. An important aspect of this effort will be the development of training material for the wider biological communities describing the best practices for IDP analyses.

Alignment with ELIXIR Activities

The community challenges identified are already well-aligned with on-going ELIXIR platform goals and activities. In particular: The task proposed in Priority Area 3 “Integration with ELIXIR Core Data Resources” aligns with the goals of the Data Platform. The community is already in the process of integrating IDP annotation into the ELIXIR CDRs as part of two ELIXIR implementation studies, “ Implementation study for the integration of ELIXIR-IIB in ELIXIR Data Curation activities” and “ Integration and standardisation of intrinsically disordered protein data”. The recent integration of MobiDB into InterPro, PDBe and UniProt can be taken as a blueprint for the further integration of IDP information into ELIXIR CDRs. Furthermore, initial discussions have taken place to plan the integration of additional sources of IDP data. A separate task, which can benefit from ongoing Data Platform activities, is the development of an IDP curation framework as described in Priority Area 2 “Automated and community-driven curation”. This aligns with ELIXIR Data Platform Task 3 “Scalable curation” of the 2019–2023 ELIXIR Programme. The distributed community curation for IDP resources will benefit from stronger interaction with Europe PMC, with the SciLite framework providing literature mining. This is especially important where data is scattered throughout the extensive corpus of biological literature and not properly indexed. Together with an automated curation triage step selecting relevant papers, this can boost the productivity of community curation while ensuring high-quality annotation of the IDP literature. Initial efforts have started with the design of a dedicated curation-support prototype, which has been used by DisProt curators since 2018. The service capitalises on the neXtA5 platform ( Mottin ), designed by the Text Mining group of the Swiss Institute of Bioinformatics. The demonstrator ( http://candy.hesge.ch/disprotGUI/) is able to rank articles based on a scoring function, which prioritises articles with a high density of IDP-related concepts. Thanks to the curation-support tool, it will be possible to obtain a high quality curated benchmark, with the aim to evolve the current system from a triage system to a more accurate binary classifier, being able to select not only relevant papers but also to highlight short passages of text in full-text papers likely to support the expert curation. The tasks proposed in Priority Area 4 “Standardisation, benchmarking and indexing of computational tools” fall under the objectives of the Tools Platform. A major goal of Priority Area 4 is to increase productivity by developing reusable experimental workflows for commonly used IDP analysis pipelines built on top of ELIXIR CDRs. The Tools Platform can advise the ELIXIR IDP Community on the development of such workflows using the Common Workflow Language in collaboration with the ELIXIR Interoperability platform. The IDP analysis tool benchmarking by the CAID initiative is complementary to the OpenEBench benchmarking platform. OpenEBench represents an information hub for the distribution of reference datasets, the application of metrics and comparative assessments and the distribution of benchmarking results. CAID can also drive prototype development for software containers and reusable experimental workflows for the community. The outcomes could then be generalised for other applications. The bio.tools service registry can be a comprehensive census of the available software for IDP analysis and the BioContainers and Bioconda platforms can assist in the generation of software containers and their registry across different technologies to facilitate access and use of software by the IDP communities and beyond. Several of the identified priorities will benefit from the input of the Interoperability Platform. The main interaction with the platform will be to develop FAIR-compliant standards and guidelines for reporting, data exchange and retrieval of data on IDP structure and function ( Wilkinson ). The recent development of a HUPO PSI-ID workgroup and a draft MIADE standard are key steps towards this goal. IDP resources themselves will benefit from becoming FAIR and adhering and contributing to the further development of standards, controlled vocabularies and ontologies. These advances are of paramount importance to improve the dissemination of IDP data. FAIR-compliant data will also aid in the FAIRification of analysis tools where reusable computational workflows can make programmatic calls to retrieve reusable data from an integrated centralised source. The Interoperability Platform service framework has already defined the various key activities to support the “FAIRification” of data which can be applied to IDP data. The effort to make IDP data FAIR can exploit the various key services that are offered by the Interoperability Platform such as those identified as recommended practices through the ELIXIR Recommended Interoperability Resources (RIRs), and the platform mission-critical initiatives (i.e., Bioschemas, and Common Workflow Language). As this is an emerging community with new data resources in development, deploying these recommended practices in new IDP databases will highlight the IDP Community as a use case example in the mission to make data FAIR-at-source. This work can be supported by the Interoperability Platform under their remit to provide support for interoperability for ELIXIR Communities. Work being carried out in the Compute Platform can be leveraged in two main ways. Community curation requires authentication for a wide range of participants, which can benefit from the federated ELIXIR AAI approach for curator login. This could also facilitate the attribution of credit to curators. A second element would be to use the ELIXIR distributed computing infrastructure including identity and access management, data integration and container deployment across a range of appropriate compute endpoints as required, both for updating large-scale databases and to run the CAID experiment. A range of new Compute Platform activities are underway to enable an integrated approach to the deployment of relevant workflows and containers. The IDP Community can take advantage of these activities to deploy IDP-related workflows and containers. The IDP Community is focussed on the development of the next generation of IDP researchers. Extensive training has been funded and carried out by the COST Action NGP-NET and additional training schools are planned as part of the MSCA-RISE-funded IDPfun project. The Italian ELIXIR node, which has been actively providing training, will organise a yearly training course on IDP resources. The input of the ELIXIR Training Platform will be indispensable to facilitate the development of a core set of training materials, its dissemination and inclusion in ELIXIR training courses. These materials will comprise introductory and advanced lessons on databases access, in silico analysis of IDPs (e.g. IDP prediction, IDP docking, MD simulations and IDP-specific primary sequence analysis techniques), and computational processing of experimental data (e.g. NMR spectra, phage display). There will also be a requirement for the development of training material related to IDP interoperability to train experimentalists, software developers, database developers and curators in topics such as IDP standards, IDP controlled vocabularies and Common Workflow Language (CWL) for IDPs. This aligns with the Interoperability Platform’s training and outreach task in the 2019–2023 ELIXIR Programme to collaborate with Training Platform members to provide Interoperability training. In collaboration with the Training Platform, the IDP Community will be able to adopt and implement a wide range of best practices and guidelines available through the ELIXIR Training Toolkit, which aims to provide a comprehensive reference resource for developing training capacity, rolling out new training programs, as well as expanding existing ones. All materials and activities developed for IDP training will be shared through TeSS for further dissemination. Additionally, train-the-trainer activities will be planned to increase the size of the IDP trainers pool and build additional training capacity.

Alignment with other communities

Links to ELIXIR Communities

ELIXIR 3DBioInfo is a community of structural bioinformaticians with the remit of continuing the development of the e-infrastructure for the storage, visualisation, analysis, annotation, and prediction of the structure of biological macromolecules and complexes. The probabilistic approach to protein structure employed by the IDP researchers is highly complementary to the work of the ELIXIR 3DBioInfo community and together, a complete structural description of proteins can be achieved. With the two communities addressing different challenges, there are numerous opportunities for synergistic connections between them, particularly in the development of structural ontologies and standards, structure boundary definition for structural studies, methods to predict IDR-globular domain interactions and whole protein structure prediction tools. . The ELIXIR Proteomics Community aspires to improve the research on proteoforms including protein forms caused by post-translational modifications (PTMs) and sequence variants. To this end, proteoform-centric annotations of proteomes are needed. This includes information on the co-occurrences of IDRs with PTMs and non-constitutive exons, and if possible, the functional outcomes of these conditional changes to the protein. Close collaboration is desirable between the communities regarding data interoperability to promote the use of IDP and proteomics data in data analysis workflows. IDP research also applies proteomics-related technologies such as ion-mobility mass spectrometry and cross-linking mass spectrometry. In these cases, close collaboration with the proteomics community within the HUPO-PSI Mass Spectrometry workgroup will support the standardisation efforts of the IDP community. The overarching goal of the ELIXIR Rare Diseases Community is to create a sustainable, reusable, and interoperable infrastructure that will enable researchers to discover, access, and analyse rare disease data. A key aspect of the analysis of rare diseases data is the elucidation of the mechanism(s) by which genetic change(s) result in a diseased state. While the interpretation of the effect of disease mutations altering globular domains is well established, IDPs represent challenges in this context. Interaction and collaboration with the ELIXIR IDP Community will provide invaluable information to aid in the understanding of disease-causing variations, truncations, and/or chimeric oncogene(s) related to IDRs. The development, and dissemination of standards for IDR structural and functional data as proposed in priority area 1 are important aspects for the collaboration between the ELIXIR IDP and Rare Diseases Communities. Galaxy is a web-based e-infrastructure for computational biomedical research. It allows users with minimal computational proficiency to run and share data analysis workflows. This promotes reproducibility and simplifies sharing of data and results. Currently, there is limited use of Galaxy by the ELIXIR IDP community. However, one of the priority areas of the roadmap, “ Standardisation, benchmarking and indexing of computational tools”, is the development of reusable experimental workflows for commonly used IDP analysis pipelines built on top of ELIXIR CDRs. A collaboration with the ELIXIR Galaxy Community would be highly beneficial for such an endeavour, particularly in the development of IDP pipelines using the ELIXIR scientific workflow platform.

Links to non-ELIXIR communities

Instruct-ERIC is a European research infrastructure for structural biology that makes high-end technologies and methods available to European researchers. Similarly to the proposed ELIXIR 3DBioInfo Community, the interdependency of the research performed by Instruct-ERIC and the IDP researchers, covering the structured and unstructured parts of the proteome, allows for extensive synergies. Instruct-ERIC coordinates access to facilities which permit the structural analysis of IDPs including several centres specialised in NMR and biophysical techniques. Furthermore, Instruct-ERIC has developed a set of high-quality training courses in structural biology. Given the complementary teaching focus of the ELIXIR IDP Community and the Instruct-ERIC initiative, future training schools with contributions from both sources would be of great benefit to any participant. The Dark Proteome Initiative is a US consortium of experimentalists with the goal of fostering collaborations amongst IDP researchers and lobbying for funding to address open and biologically important questions in the IDP field. The Dark Proteome Initiative is a readymade community of world-class generators of IDP data utilising a wide range of experimental approaches allowing the ELIXIR IDP Community to access a single entity for guidance on the needs of experimental data generators. The Dark Proteome Initiative will also be able to provide invaluable advice on the detailed definitions and requirements for the development of ontologies and standards. The PLUMED consortium has been recently established as an open community including PLUMED developers, contributors and users to help to establish reproducibility, open access data, and harmonisation of the protocols for molecular dynamics simulations, free energy calculations and other simulations that can be run with the PLUMED software (PLUMED consortium, 2019).

Conclusions

Recent years have seen a rapid growth of interest in IDPs. This has coincided with significant advances in the in vivo, in vitro and in silico methods to study the structure and function of IDPs. Unfortunately, the basic requirements for the organisation and dissemination of the tools and data produced by the field have not advanced in step with these developments. The field is on the cusp of an era where the high-throughput characterisation of the extensive intrinsically disordered regions (IDRs) of proteins is possible. This roadmap will provide the foundation that supports this data explosion and provide a solid platform for the future biological research of IDPs.

Data availability

No data are associated with this article. The paper describes importance of intrainsically disordered proteins with thorough set of examples with good references. It also explains computational and experimental challenges involved in IDP research. A good summary of available tools and platforms are also provided. Key challenges in IDP research are identified and key activities to be carried out to tackle these challenges are provided. ELIXIR community has many of the researchers working in the field within the consortium. All the needed activities align well with ELIXIR activities as well. So it is natural that ELIXIR plays a centralized role in these activities. The paper also identified key players around the world that they can collaborate with establishing a task force to tackle major challenges IDP research needs to tackle. Only issue I have with the paper is the lack of focus on reproducibility issues. That is a major problem life science research is facing today. most of the experimental results are not reproducible even computational models are not reproducible resulting in waste of billions of euros spent in funding such research activities. They should have a special focus group within their consortium to ensure reproducibility of experimental results within IDP community (define key information needed to reproduce results, endorse use of e notebooks and use of protocols established withing the network, etc.). They should also do the same for computational models that are developed to tackle several problems related to IDP. A section added on these issues would strengthen the paper. Also if they include other initiatives focusing on reproducbility that would strengthen their task force. I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. There has been a significant contribution from Europe to the field of intrinsically disordered protein (IDP) research, including development of computational tools. This white paper recognizes that science and methodological development, and reports on a meeting focused on organizing the software and the software developers supporting the IDP community in Europe. While it is extremely important to provide infrastructure support for the IDP community and efforts to increase communication and usage of tools are essential, it is not clear that the significant effort to put these resources under a single umbrella and unify data formats is of value beyond that of encouraging independent development with opportunities to interact. This white paper provides a useful report on the meeting that will be of value to the IDP computational community. However, it could be improved to provide greater clarity on the proposal and some of the suggested priorities could be re-evaluated. Specific comments: Minor points: The white paper itself is challenging to read. It is unclear who the target audience of the article is? Is it the IDP/IDR community or the ELIXIR community, or both? In the present form, the article might be confusing to both communities, as it provides a lot (at times too much) detail for some aspects, resources, and individual tools, while omitting crucial information for others. For instance, it is unclear what from the current resources would be relocated into a new ELIXIR infrastructure and what would continue to exist independently and simply being "pointed at" from a central resource? It is also unclear exactly where central data repositories and central software platforms would reside. Many presently available resources are outlined and suggested at different points, making it unclear how everything would fit together. In addition, it is unclear if there will be a central resource(s), such as DisProtCentral, and if so would this resource(s) ultimately have independent developers, next to the participating research groups. Some of the software and the databases that will be the key components are listed in Figure 2. However, not all of the outlined resources are presently functional or consistently maintained. It is therefore unclear if the initiative aims at reviving some of the resources in the future, and if so, whether this be done by the individual research groups. For the present time, it would be important to include a table with links to the websites/repositories of the operational resources, and a note on the current status of all other, e.g. "not actively developed", "under development", "under restructuring", etc. It would be very helpful to include schematics to help understanding of the proposed initiative and improve the flow of the article. For instance, it would be good to use flow charts to outline the hierarchy of priorities to help understand how presently available resources fit into different priority areas and what will be the first tasks undertaken by different work groups. The priorities relating to the Area 1 (“Standards and exchange formats for IDP data") would be especially important to clarify in that sense, so that the community can be informed where to look for updates on the matter and thus help aid these efforts. Finally, the history of the interactions between the IDP community and ELIXIR and the subsequent integration into the ELIXIR core could be represented with a timeline and be in a separate subsection of the article, as this presently interrupts the flow of the article. It would be important to define the strategy that the community wants to undertake towards integration of resources. For instance this could be a) integrated, dedicated databases with API access that are continuously maintained for data storage; and b) open source software with modular architecture that will enable integration into, e.g., BioConda for software development. At present, the general strategy is not clear or explicit, making the article quite confusing to follow. It would be highly valuable to use this article and this platform to promote open source software development and established platforms for this purpose, such as GitHub. GitHub has proven effective in countless large-scale projects and it connects thousands of developers. The comparatively very small community of IDP computational researchers would benefit by developing, integrating, and sharing their resources through GitHub. In addition, such a platform would ensure inclusiveness. Related to this, it is not clear how the proposed centralization will solve the major issue of long term support for software, including funding for people to continue to maintain and develop the tools. A good way to assure a project will continue is for the project source/algorithm be fully open, in a repository such as GitHub, to enable use of modular code within other projects. It is stated that DisProtCentral would be used for consolidating the European IDP resources and tools. However, it has not been mentioned what is the plan (if any) for integration of non-European resources and tools? Would there be some way for developers from outside of Europe to qualify to submit or include their software and resources in DisProtCentral? There is a lot of emphasis on 'interoperability', and adaptors that could be used to this end. However, apart from the simplest workflows, e.g. annotation of predicted disordered regions on a protein sequence, it is likely that the demands for the workflows will be rapidly changing in the future. Spending effort on defining formats for data is not likely to be productive since by the time these standards can be applied they are could already be outdated. Technically speaking, it is not a problem if data are used in different formats, as long as those formats are easy to parse, since writing a parser is trivial. Therefore, the emphasis should be shifted to the modular software development that would enable developers to 'mix and match' various routines and dynamically create their own workflows adapt to the task at hand. The proposal for integration into bio.tools or BioConda in this context is a very good one. There are two tools that are actively maintained to current standards, GROMACS and PLUMED2, and they are not IDP specific and they run well  regardless of ELIXIR. These points should be noted. Capabilities to download the software, see the development flow and interact with the project directly are available for these tools, but it is not clear whether these principles are a general priority. A full name and abbreviation for ELIXIR should be given in the abstract. International Molecular Exchange (IMEx) is mentioned in the article before the abbreviation is introduced (the abbrev. is introduced later). We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
  106 in total

1.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation.

Authors:  Berk Hess; Carsten Kutzner; David van der Spoel; Erik Lindahl
Journal:  J Chem Theory Comput       Date:  2008-03       Impact factor: 6.006

2.  How Do Intrinsically Disordered Viral Proteins Hijack the Cell?

Authors:  H Jane Dyson; Peter E Wright
Journal:  Biochemistry       Date:  2018-06-28       Impact factor: 3.162

3.  Relationship of Sequence and Phase Separation in Protein Low-Complexity Regions.

Authors:  Erik W Martin; Tanja Mittag
Journal:  Biochemistry       Date:  2018-03-16       Impact factor: 3.162

4.  Programmable Microfluidic Synthesis of Over One Thousand Uniquely Identifiable Spectral Codes.

Authors:  H Q Nguyen; B C Baxter; K Brower; C A Diaz-Botia; J L DeRisi; P M Fordyce; K S Thorn
Journal:  Adv Opt Mater       Date:  2016-10-18       Impact factor: 9.926

Review 5.  Intrinsically disordered proteins in human diseases: introducing the D2 concept.

Authors:  Vladimir N Uversky; Christopher J Oldfield; A Keith Dunker
Journal:  Annu Rev Biophys       Date:  2008       Impact factor: 12.981

6.  Structural disorder in eukaryotes.

Authors:  Rita Pancsa; Peter Tompa
Journal:  PLoS One       Date:  2012-04-05       Impact factor: 3.240

7.  DisProt 7.0: a major update of the database of disordered proteins.

Authors:  Damiano Piovesan; Francesco Tabaro; Ivan Mičetić; Marco Necci; Federica Quaglia; Christopher J Oldfield; Maria Cristina Aspromonte; Norman E Davey; Radoslav Davidović; Zsuzsanna Dosztányi; Arne Elofsson; Alessandra Gasparini; András Hatos; Andrey V Kajava; Lajos Kalmar; Emanuela Leonardi; Tamas Lazar; Sandra Macedo-Ribeiro; Mauricio Macossay-Castillo; Attila Meszaros; Giovanni Minervini; Nikoletta Murvai; Jordi Pujols; Daniel B Roche; Edoardo Salladini; Eva Schad; Antoine Schramm; Beata Szabo; Agnes Tantos; Fiorella Tonello; Konstantinos D Tsirigos; Nevena Veljković; Salvador Ventura; Wim Vranken; Per Warholm; Vladimir N Uversky; A Keith Dunker; Sonia Longhi; Peter Tompa; Silvio C E Tosatto
Journal:  Nucleic Acids Res       Date:  2016-11-28       Impact factor: 16.971

8.  PDBe: towards reusable data delivery infrastructure at protein data bank in Europe.

Authors:  Saqib Mir; Younes Alhroub; Stephen Anyango; David R Armstrong; John M Berrisford; Alice R Clark; Matthew J Conroy; Jose M Dana; Mandar Deshpande; Deepti Gupta; Aleksandras Gutmanas; Pauline Haslam; Lora Mak; Abhik Mukhopadhyay; Nurul Nadzirin; Typhaine Paysan-Lafosse; David Sehnal; Sanchayita Sen; Oliver S Smart; Mihaly Varadi; Gerard J Kleywegt; Sameer Velankar
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

9.  ANCHOR: web server for predicting protein binding regions in disordered proteins.

Authors:  Zsuzsanna Dosztányi; Bálint Mészáros; István Simon
Journal:  Bioinformatics       Date:  2009-08-28       Impact factor: 6.937

10.  D²P²: database of disordered protein predictions.

Authors:  Matt E Oates; Pedro Romero; Takashi Ishida; Mohamed Ghalwash; Marcin J Mizianty; Bin Xue; Zsuzsanna Dosztányi; Vladimir N Uversky; Zoran Obradovic; Lukasz Kurgan; A Keith Dunker; Julian Gough
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

View more
  4 in total

1.  MobiDB: intrinsically disordered proteins in 2021.

Authors:  Damiano Piovesan; Marco Necci; Nahuel Escobedo; Alexander Miguel Monzon; András Hatos; Ivan Mičetić; Federica Quaglia; Lisanna Paladin; Pathmanaban Ramasamy; Zsuzsanna Dosztányi; Wim F Vranken; Norman E Davey; Gustavo Parisi; Monika Fuxreiter; Silvio C E Tosatto
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

2.  PED in 2021: a major update of the protein ensemble database for intrinsically disordered proteins.

Authors:  Tamas Lazar; Elizabeth Martínez-Pérez; Federica Quaglia; András Hatos; Lucía B Chemes; Javier A Iserte; Nicolás A Méndez; Nicolás A Garrone; Tadeo E Saldaño; Julia Marchetti; Ana Julia Velez Rueda; Pau Bernadó; Martin Blackledge; Tiago N Cordeiro; Eric Fagerberg; Julie D Forman-Kay; Maria S Fornasari; Toby J Gibson; Gregory-Neal W Gomes; Claudiu C Gradinaru; Teresa Head-Gordon; Malene Ringkjøbing Jensen; Edward A Lemke; Sonia Longhi; Cristina Marino-Buslje; Giovanni Minervini; Tanja Mittag; Alexander Miguel Monzon; Rohit V Pappu; Gustavo Parisi; Sylvie Ricard-Blum; Kiersten M Ruff; Edoardo Salladini; Marie Skepö; Dmitri Svergun; Sylvain D Vallet; Mihaly Varadi; Peter Tompa; Silvio C E Tosatto; Damiano Piovesan
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

3.  DisProt in 2022: improved quality and accessibility of protein intrinsic disorder annotation.

Authors:  Federica Quaglia; Bálint Mészáros; Edoardo Salladini; András Hatos; Rita Pancsa; Lucía B Chemes; Mátyás Pajkos; Tamas Lazar; Samuel Peña-Díaz; Jaime Santos; Veronika Ács; Nazanin Farahi; Erzsébet Fichó; Maria Cristina Aspromonte; Claudio Bassot; Anastasia Chasapi; Norman E Davey; Radoslav Davidović; Laszlo Dobson; Arne Elofsson; Gábor Erdős; Pascale Gaudet; Michelle Giglio; Juliana Glavina; Javier Iserte; Valentín Iglesias; Zsófia Kálmán; Matteo Lambrughi; Emanuela Leonardi; Sonia Longhi; Sandra Macedo-Ribeiro; Emiliano Maiani; Julia Marchetti; Cristina Marino-Buslje; Attila Mészáros; Alexander Miguel Monzon; Giovanni Minervini; Suvarna Nadendla; Juliet F Nilsson; Marian Novotný; Christos A Ouzounis; Nicolás Palopoli; Elena Papaleo; Pedro José Barbosa Pereira; Gabriele Pozzati; Vasilis J Promponas; Jordi Pujols; Alma Carolina Sanchez Rocha; Martin Salas; Luciana Rodriguez Sawicki; Eva Schad; Aditi Shenoy; Tamás Szaniszló; Konstantinos D Tsirigos; Nevena Veljkovic; Gustavo Parisi; Salvador Ventura; Zsuzsanna Dosztányi; Peter Tompa; Silvio C E Tosatto; Damiano Piovesan
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

4.  DichroWeb, a website for calculating protein secondary structure from circular dichroism spectroscopic data.

Authors:  Andrew J Miles; Sergio G Ramalli; B A Wallace
Journal:  Protein Sci       Date:  2021-07-21       Impact factor: 6.725

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.