Literature DB >> 28713550

A community proposal to integrate proteomics activities in ELIXIR.

Juan Antonio Vizcaíno1, Mathias Walzer1, Rafael C Jiménez2, Wout Bittremieux3, David Bouyssié4, Christine Carapito4, Fernando Corrales5, Myriam Ferro4, Albert J R Heck6,7, Peter Horvatovich8, Martin Hubalek9, Lydie Lane10,11, Kris Laukens3, Fredrik Levander12, Frederique Lisacek13,14, Petr Novak15, Magnus Palmblad16, Damiano Piovesan17, Alfred Pühler18, Veit Schwämmle19, Dirk Valkenborg20,21,22, Merlijn van Rijswijk23,24, Jiri Vondrasek9, Martin Eisenacher25, Lennart Martens26,27, Oliver Kohlbacher28,29,30,31.   

Abstract

Computational approaches have been major drivers behind the progress of proteomics in recent years. The aim of this white paper is to provide a framework for integrating computational proteomics into ELIXIR in the near future, and thus to broaden the portfolio of omics technologies supported by this European distributed infrastructure. This white paper is the direct result of a strategy meeting on 'The Future of Proteomics in ELIXIR' that took place in March 2017 in Tübingen (Germany), and involved representatives of eleven ELIXIR nodes. These discussions led to a list of priority areas in computational proteomics that would complement existing activities and close gaps in the portfolio of tools and services offered by ELIXIR so far. We provide some suggestions on how these activities could be integrated into ELIXIR's existing platforms, and how it could lead to a new ELIXIR use case in proteomics. We also highlight connections to the related field of metabolomics, where similar activities are ongoing. This white paper could thus serve as a starting point for the integration of computational proteomics into ELIXIR. Over the next few months we will be working closely with all stakeholders involved, and in particular with other representatives of the proteomics community, to further refine this paper.

Entities:  

Keywords:  bioinformatics infrastructure; computational proteomics; data standards; databases; mass spectrometry; multi-omics approaches.; proteomics; training

Year:  2017        PMID: 28713550      PMCID: PMC5499783          DOI: 10.12688/f1000research.11751.1

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


Introduction

Proteomics is generally defined as the large-scale experimental study of the proteome. High-throughput proteomics approaches have matured significantly, becoming an increasingly used tool in biological research. The rapid development of the field over the last decade has been primarily driven by technological progress in mass spectrometry instrumentation, chromatographic separation, genomics (increased availability of sequenced genomes) and bioinformatics [1, 2]. The primary workhorse of proteomics today is mass spectrometry coupled to liquid chromatography (LC-MS), with less commonly used high-throughput proteomics approaches based on antibodies (e.g., protein arrays and other immunofluorescence-based techniques). Key applications of proteomics are the study of (differential) protein expression in time and space, characterization of protein primary structures and their post-translational modifications (PTMs), such as phosphorylation and glycosylation, elucidating protein structures, and protein-protein interactions. It is the primary technology driving progress in unravelling signalling networks (e.g. protein phosphorylation driven signalling) and protein interaction networks, and is indispensable for understanding biological function of protein isoforms and disentangling their specific functions. In complex systems biology and systems medicine studies, proteomics often complements information gained from other omics levels, such as genomics and transcriptomics (the so-called proteogenomics and proteotranscriptomics studies [3]), metagenomics (metaproteomics), glycomics, and metabolomics. As already highlighted, remarkable advances in computational methods have been a key driver in the fast development of the field of proteomics. ELIXIR ( https://www.elixir-europe.org/) is a European Research Infrastructure (ESFRI), which coordinates, integrates and sustains bioinformatics resources across its member states. Some of the most prominent research groups in proteomics are active in Europe, such as Prof. Matthias Mann (Martinsried, Munich, Germany), Prof. Ruedi Aebersold (Zurich, Switzerland), Prof. Albert Heck (Utrecht, the Netherlands) and Prof. Mathias Uhlén (Stockholm, Sweden). In addition, Europe also hosts worldwide renowned groups that are focused on the development and application of widely-used bioinformatics tools and resources, including MaxQuant [4], the OpenMS framework [5], CompOmics [6] tools, such as PeptideShaker [7], the PRIDE database, as the world-leading proteomics repository [8] (also coordinating the global ProteomeXchange Consortium of proteomics resources [9]), and a collection of open data standards and related software [10], emphasizing the European leading role in the activities of the HUPO (Human Proteome Organisation) Proteomics Standards Initiative (PSI). Outside mass spectrometry proteomics, the Human Protein Atlas [11], located in Sweden, is the world-leading resource for antibody-based characterization of the human proteome. In this context, it should also be highlighted that two of the three sites behind the development of UniProt [12], the most-widely used protein knowledgebase, are European: the Swiss Institute of Bioinformatics (SIB) and the European Bioinformatics Institute (EMBL-EBI). Furthermore, neXtProt [13], which is the reference knowledgebase for human proteins in the context of the HUPO Human Proteome Project, is also developed and maintained at SIB. Additionally, it is of note to highlight that a number of national proteomics-dedicated infrastructures have already prioritised structuring and developing computational proteomics among their activities. This is the case of the French proteomics infrastructure ProFI (which has, for instance, devoted a major investment to develop the Proline tool) and the Spanish infrastructure ProteoRed (which has, for example, contributed heavily to PSI activities). Also, proteomics is represented in other national scientific infrastructures. One example is the Netherlands DTL (Dutch Techcentre for Life Sciences), having an active role in advocating for FAIR data management. In the context of providing infrastructure for storing proteomics data, it is worth highlighting here that, although it was not the case just a few years ago, thanks to many of these efforts, and with the support of scientific publishers and funders, public availability of proteomics data has increased exponentially in recent years, becoming a common scientific practise, similarly to how it routinely happens in disciplines such as genomics and transcriptomics [14]. Figure 1 summarises the growth of the PRIDE database in recent years.
Figure 1.

Number of datasets ( A), and the total size of the PRIDE database ( B), from 2005 to 2016. Data was retrieved directly from the PRIDE Archive Oracle TM database instance, which contains the file sizes and the dates when the datasets where originally submitted.

Number of datasets ( A), and the total size of the PRIDE database ( B), from 2005 to 2016. Data was retrieved directly from the PRIDE Archive Oracle TM database instance, which contains the file sizes and the dates when the datasets where originally submitted. Although the European bioinformatics community has been very active in the proteomics field (see above), proteomics activities have not been highly represented in ELIXIR so far. There was a proteomics component in two small ELIXIR pilot actions (collaborations between EMBL-EBI, now ELIXIR central node, and Bioinformatics Services to Swedish Life Science [BILS], now ELIXIR-Sweden). In addition, a selection of proteomics tools and training events have been included in the ELIXIR tool registry [15] ( http://bio.tools) and in TeSS, the ELIXIR training portal ( https://tess.elixir-europe.org/), respectively. These platforms were recently presented to the proteomics community [16]. However, we propose that, due to the growing importance of the field and the prominence of proteomics bioinformatics activities in Europe, it is the right time to formally integrate proteomics activities in ELIXIR. In this context, in February 2017, EMBL-EBI and ELIXIR-DE initiated the first ELIXIR ‘Implementation study’ involving proteomics approaches, as a starting point for the field. Within this implementation study, suggested by ELIXIR management, the meeting “The future of proteomics in ELIXIR” took place in Tübingen (Germany), as a general strategy meeting for future proteomics activities in ELIXIR. In this white paper, we first summarize the main conclusions of the meeting, and then explain possible future directions for the incorporation of proteomics activities in ELIXIR, taking into account the current overall ELIXIR structure, split in platforms and use cases.

Methods

Meeting “The future of proteomics in ELIXIR”

The meeting took place on March 1 st–2 nd 2017 in Tübingen (Germany). Attendance was widely advertised through ELIXIR dissemination channels (e.g., mailing lists, newsletter) and was open to any interested member of the community. There were 24 attendees representing eleven ELIXIR nodes: Germany (host), Belgium, Czech Republic, Denmark, France, Italy, Netherlands, Sweden, Switzerland, EMBL-EBI, and one representative from the ELIXIR Hub. The detailed minutes of the meeting are available as Supplementary File 1. The meeting started with a presentation given by Rafael Jiménez (ELIXIR Chief Technical Officer) who provided a general overview of the current ELIXIR activities. This initial talk was followed by a series of presentations where the representatives of each node summarized their ongoing activities related to proteomics. All the presentations are freely available at http://tinyurl.com/elixir-proteomics. The remainder of the meeting was devoted to an open discussion on how to bring together activities, experience, stakeholders, and emerging needs. First of all, ten potential ELIXIR stakeholders in this domain were identified, namely: funding agencies, regulatory bodies, educators, infrastructures, publishers, core facilities, bioinformaticians, life scientists, industries and hospitals/patients. Second, a series of needs and challenges were outlined for each of the stakeholders, and these were then mapped to each of the existing ELIXIR platforms: Data, Tools, Interoperability, Compute and Training (see below). The output of this activity is summarized in Supplementary Table 1. In the second day of the meeting, more concrete topics, derived from the identified needs and challenges were outlined by the attendees, and then organised in wider areas, so called “clusters”. Finally, they were prioritised, with the idea that these could form the basis for future proteomics activities in ELIXIR.

Current ELIXIR internal structure

Here we describe the current status of ELIXIR platforms and use cases, by May 2017. ELIXIR’s activities are structured around platforms and use cases. They bring together resources and expertise from the ELIXIR Nodes and form the basic unit of operation within ELIXIR. The ELIXIR platforms are responsible of the implementation of the ELIXIR programme and are organised in five key areas: Data, Tools, Compute, Interoperability and Training. The platforms are complemented by four use cases that represent four scientific communities: Human data, Rare diseases, Marine metagenomics and Plant sciences ( Figure 2). The use cases drive the work of the ELIXIR platforms by defining their bioinformatics needs and requirements. The close collaboration between the ELIXIR use cases and platforms ensures that the services developed by the ELIXIR platforms are fit for purpose. Each platform and use case is led by a group of senior scientists from across the ELIXIR nodes. In addition to the funding available in each national ELIXIR node, the main source of financial support for ELIXIR activities comes from the ELIXIR-EXCELERATE EU H2020 project. Additional activities are funded through other complementary grants as well as ‘Implementation studies’ supported by the ELIXIR Hub.
Figure 2.

Overview of current ELIXIR platforms and use cases (by May 2017).

The Data platform focuses on sustaining Europe’s life science data infrastructure. This platform is working on guidelines and indicators for data resources to improve their impact and sustainability [17]. It also works on improving links between curated data resources and literature data. The Tools platform is dedicated to services and connectors to drive access and exploitation of bioinformatics research software. The main key activities within this platform are centred to facilitate the discovery, benchmarking and interoperability of software. It does also focus on software development best practices, as well as on a strategy for workflows and software containers. The Interoperability platform supports the discovery, integration and analysis of biological data. Activities driven by this platform are organised in projects around identifiers, metadata standards and linked data. It also works on the description of interoperability services as well as specialised workshops named BYOD (Bring Your Own Data) [18] to improve the “FAIRness” (Findable, Accessible, Interoperable and Re-usable) of data resources [19]. The Compute platform is dedicated to the compute, storage, transfer, authentication and authorization of biological data relying on services provided by ELIXIR nodes and e-infrastructures. Finally, the Training platform aims to increase the professional skills for managing and exploiting data. Part of activities are meant to train researchers, trainers and service providers, but it also includes other activities related to e-learning, to improve the discovery and availability of training materials and to measure the impact of training. There are four use cases. First of all, the Human data use case develops long-term strategies for managing and accessing sensitive human data. The Rare diseases use case supports the development of new therapies for rare diseases. The Marine metagenomics use case works towards a sustainable metagenomics infrastructure to nurture research and innovation in the marine domain. Finally, the Plant sciences use case develops an infrastructure to support genotype-phenotype analysis for crop and tree species.

Results and Discussion

Areas prioritised in the meeting

In an attempt to assess the relative priorities of the various areas of proteomics that might steer integration into ELIXIR, attendees voted on the relative importance of the topics. The following list shows the top-ranked areas for future ELIXIR related proteomics activities (called “Clusters” from now on), sorted in descending order by the number of votes received: It includes topics such as data integration of proteomics and other types of omics data, correlation between gene and protein expression, and development of data standards for “multi-omics” data types. A closely related group of activities was “Cancer proteomics”, comprising topics such as support for clinical proteomics data (including large patient cohorts) and cancer “multi-omics” (proteogenomics) studies. The term proteoform [20] represents the different molecular forms in which the protein product of a single gene can be found, including changes due to genetic variations, alternatively spliced RNA transcripts and PTMs, among other events. This cluster of activities included topics such as the handling, validation of proteoforms and creation of standards for their description, improvement of the existing connection between proteoforms, genes and metabolites (topic related to Cluster 1 above), and activities devoted to explain unidentified spectral signals. In any analytical discipline, QC is essential. Due to the fact that proteomics is a newer and still rapidly developing field, QC has historically not been as well-developed in proteomics as in, for instance, the more established small molecule mass spectrometry field [14]. This cluster is therefore focused on activities to develop automatic and reliable pipelines for QC of proteomics data at different levels. The concrete activities outlined here could be summarized as the development of robust, reproducible, scalable, user-friendly, integrated, QC-controlled, data analysis pipelines, ideally enabling the use of compatible cloud infrastructures, which in addition, could also be used for data storage. Infrastructure supporting efficient development of such pipelines and workflows is also important, including tool repositories and documentation, workflow management systems and interfaces for accessing computational resources. This topic encompasses the improvement of protein inference in shotgun proteomics approaches, the use of peptides that match to more than one protein precursor, and enhanced data integration/harmonisation for quantitative proteomics. It was perceived by many attendees that, although many of these issues are often considered solved, there are still many improvements possible in this area. This topic also includes the improvement in the annotation of proteomics datasets, in particular in data repositories like PRIDE (to facilitate public data reuse by third parties), the development and/or extension of existing Laboratory Information Management Systems (LIMS), standard data formats, and guidelines summarising best practises for data management, following FAIR principles. The rest of the areas discussed got only one or two votes from the attendees, and included activities related to interactomics, structural proteomics, metabolomics, metaproteomics, the development of benchmarking datasets, and training efforts. Finally, it is worth highlighting an additional proposal for the creation of a repository for tool-related ideas.

Alignment between ELIXIR activities and the needs of the proteomics community

As mentioned in the ‘Methods’ section, ELIXIR activities are currently structured in five platforms ( Data, Tools, Interoperability, Compute and Training) and four longitudinal use cases ( Human data, Rare diseases, Marine metagenomics and Plant sciences) ( Figure 2). Our preferred option is that proteomics becomes the main focus of one additional ELIXIR use case in the near future. If there is no scope for proteomics to have its own use case, other options could be possible, for instance the integration of proteomics activities into the existing ELIXIR use cases. We think that the current use cases, heavily focused on genomics data, would also benefit from having a “multi-omics” perspective. Out of the existing cases, Human data would be an obvious choice. The other three could also benefit from proteomics activities: Rare diseases (clinical proteomics), Marine metagenomics (metaproteomics), and Plant sciences (plant proteomics). In any case, and without considering specific use cases, it is clear that there are several topics that, in our opinion, would fit very well into the scope of the current five ELIXIR platforms: 1- Data platform. Metadata, standardisation, annotation and data management activities (Cluster 6), and “multi-omics” approaches (Cluster 1), involving data integration efforts from different omics data types, would be highly relevant in this context. Moreover, QC efforts (Cluster 3) are essential for all such types of data re-use. Indeed, data re-use is a very important aspect in proteomics data, as only 30–40% of the acquired data is typically exploited [14]. This creates extremely exciting opportunities for proteomics data re-use with specialized tools that can lead to the discovery of new biological information [21]. 2- Tools platform. In this platform, the overall aim would be to increase the visibility, quality and sustainability of proteomics software developed following best practices. First of all, more proteomics tools should be effectively included into the ELIXIR tool registry, and highlighted there appropriately. However, it is worth highlighting that there are around 350 tools proteomics tools represented in this resource already. In addition, the development of improved and user-friendly quantification algorithms and tools, along with direct coupling to dedicated and performant statistical analysis (Cluster 5), also connects directly to the ELIXIR tools platform. Other possible activities would be related to Cluster 4, and would involve the improvement of the description and sharing of proteomics workflows, facilitating the encapsulation of workflows, data and tools into proteomics software containers that could be shared across ELIXIR, taking advantage of existing resources. Finally, the proposed idea in the meeting to create of a repository for tool-related ideas, would also be applicable in the wider ELIXIR context. 3- Interoperability platform. Obviously, the activities of this platform are very relevant in the case of “multi-omics” approaches (Cluster 1), for instance the development of data standards for these type of approaches, e.g. to enable better data integration and visualisation. A close connection can also be made to metabolomics with regards to QC (Cluster 3), as the main technology of choice (mass spectrometry) is shared between these two fields. 4- Compute platform. Workflow analysis pipelines and activities related to the development of cloud infrastructures (Cluster 4) and QC activities (Cluster 3) should be highlighted here. This would involve different pieces of infrastructure supporting the efficient development of such workflows, e.g. the EDAM ontology and the ELIXIR tool registry [15], open data formats, and workflow management systems, among others. It is important to note that, although originally set up in the Data platform, the proteomics ELIXIR implementation study mentioned in the ‘Introduction’ section aims to make a first step forward in this direction, for the popular shotgun (MS/MS) approaches. As a proof of concept, these pipelines will be deployed first in the EMBL-EBI “Embassy Cloud”, a cloud infrastructure based in the Open Stack operating system, with the idea that in the future they can be made available in other cloud systems (e.g. Amazon EC2, Google Cloud, Microsoft Azure), so that the developed pipelines can be freely reused by any interested researcher in the community. In the context of the implementation study, the pipelines are connected to the PRIDE database, bringing the analysis tools closer to the data as datasets become larger in size and complexity. This trend is ongoing for other omics technologies in the context of ELIXIR: for instance, the pipelines developed in the context of the Marine metagenomics use case [22]. 5- Training platform. ELIXIR is already working actively in coordinating training activities across Europe (e.g. the already mentioned TeSS portal). Although training was not specifically highlighted as one of the main areas for future development during the meeting, the reason is that everyone assumes that this is implicitly a key need in all bioinformatics fields. While excellent training courses and workshops have already been created in the field, a higher degree of coordination across Europe should be achieved for bioinformatics training activities, and in particular for computational proteomics topics, possibly in coordination with other fields such as metabolomics (see next section).

Connection to metabolomics and other ESFRIs

As mentioned already, metabolomics and proteomics share a common experimental platform: mass spectrometry. A parallel effort is currently ongoing to integrate metabolomics activities in ELIXIR. There are many topics of common interest where both fields could benefit from a closer interaction: e.g. the development of common software (for QC, data visualisation, signal processing, to name but a few) and open data standards. Some initiatives are already working towards this goal, e.g. the computational mass spectrometry group ( http://compms.org/), and some recent activities recently carried out by the PSI, but a lot of work remains to be done. One concrete proposal would be to coordinate efforts concerning training activities. ELIXIR would be an ideal platform to enable this coordination effort. Coming back to the proteomics field, PRIME-XS ( http://www.primexs.eu) was a four-year Infrastructure project funded by the EU FP7 Programme (2011–2015), coordinated by Prof. Albert Heck. The main aim of PRIME-XS was to provide funded access to an infrastructure of state-of-the-art proteomics technology to the European biological and biomedical research community. It is anticipated that future tailored EU H2020 infrastructure grant calls might provide an opportunity to create a second iteration of this successful project. The potential synergies with ELIXIR, considering the outlined activities in this white paper, would be obvious at different levels, for instance the provision of a cloud computing infrastructure to store and analyse the acquired data in a scalable and reproducible way, and the development of pipelines to allow data integration between proteomics and other omics data types.

Conclusions

We hope that this white paper acts as a guide to achieve the overall goal of integrating proteomics into ELIXIR. In our opinion, this would not only make perfect sense from a scientific point of view (proteomics information provides an essential ingredient in a “full picture” of life), but also because it would enable computational proteomics to work in close contact with bioinformatics activities in other high-throughput fields, which will undoubtedly trigger many possible interactions, exchanges, and exciting novel developments. This white paper presents a framework for integrating computational proteomics into ELIXIR as the result of a strategy meeting on ‘The Future of Proteomics in ELIXIR’ that took place in March 2017 in Tübingen (Germany). The meeting discussions lead to six priority areas: Multi-omics approaches; Proteoforms and PTMs; Quality Control (QC) activities; Data analysis workflows and cloud computing; Protein quantification and statistics; Metadata, standardization, annotation, and data management. The priority areas are well aligned with the current ELIXIR platforms: Data platform; Tools platform; Interoperability platform; Compute platform; Training platform and can be integrated into ELIXIR. The paper is well structured and written. Minor change: On page 7 of PDF version of paper, paragraph "2 - Tools platform ...": "it is worth highlighting that there are around 350 tools proteomics tools represented in this resource already", the first "tools" should be removed. I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. Proteomics has reached a point in which bioinformatics is of paramount importance for the process to extract the maximum information out of the enormous amount of data obtained in practically any study. The scientific community in general and proteomics in particular is eager to get new and effective solutions that may implement that way. In this white paper manuscript, Vizcaino et al. explain the ELIXIR initiative and the importance of integrating proteomics in this European Research Infrastructure. Authors cover in a detailed way what was covered on the meeting that took place in Tübingen (Germany) on February 2017 under the name “The future of proteomics in ELIXIR”. The paper is well written and structured. Conclusions are clearly stated. I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
  22 in total

1.  PeptideShaker enables reanalysis of MS-derived proteomics data sets.

Authors:  Marc Vaudel; Julia M Burkhart; René P Zahedi; Eystein Oveland; Frode S Berven; Albert Sickmann; Lennart Martens; Harald Barsnes
Journal:  Nat Biotechnol       Date:  2015-01       Impact factor: 54.908

2.  Proceedings of the EuBIC Winter School 2017.

Authors:  Sander Willems; David Bouyssié; Matthieu David; Marie Locard-Paulet; Karl Mechtler; Veit Schwämmle; Julian Uszkoreit; Marc Vaudel; Viktoria Dorfer
Journal:  J Proteomics       Date:  2017-04-04       Impact factor: 4.044

3.  OpenMS: a flexible open-source software platform for mass spectrometry data analysis.

Authors:  Hannes L Röst; Timo Sachsenberg; Stephan Aiche; Chris Bielow; Hendrik Weisser; Fabian Aicheler; Sandro Andreotti; Hans-Christian Ehrlich; Petra Gutenbrunner; Erhan Kenar; Xiao Liang; Sven Nahnsen; Lars Nilse; Julianus Pfeuffer; George Rosenberger; Marc Rurik; Uwe Schmitt; Johannes Veit; Mathias Walzer; David Wojnar; Witold E Wolski; Oliver Schilling; Jyoti S Choudhary; Lars Malmström; Ruedi Aebersold; Knut Reinert; Oliver Kohlbacher
Journal:  Nat Methods       Date:  2016-08-30       Impact factor: 28.547

Review 4.  Proteogenomics from a bioinformatics angle: A growing field.

Authors:  Gerben Menschaert; David Fenyö
Journal:  Mass Spectrom Rev       Date:  2015-12-15       Impact factor: 10.946

Review 5.  Development of data representation standards by the human proteome organization proteomics standards initiative.

Authors:  Eric W Deutsch; Juan Pablo Albar; Pierre-Alain Binz; Martin Eisenacher; Andrew R Jones; Gerhard Mayer; Gilbert S Omenn; Sandra Orchard; Juan Antonio Vizcaíno; Henning Hermjakob
Journal:  J Am Med Inform Assoc       Date:  2015-02-28       Impact factor: 4.497

6.  UniProt: a hub for protein information.

Authors: 
Journal:  Nucleic Acids Res       Date:  2014-10-27       Impact factor: 16.971

7.  The ProteomeXchange consortium in 2017: supporting the cultural change in proteomics public data deposition.

Authors:  Eric W Deutsch; Attila Csordas; Zhi Sun; Andrew Jarnuczak; Yasset Perez-Riverol; Tobias Ternent; David S Campbell; Manuel Bernal-Llinares; Shujiro Okuda; Shin Kawano; Robert L Moritz; Jeremy J Carver; Mingxun Wang; Yasushi Ishihama; Nuno Bandeira; Henning Hermjakob; Juan Antonio Vizcaíno
Journal:  Nucleic Acids Res       Date:  2016-10-18       Impact factor: 16.971

Review 8.  A Golden Age for Working with Public Proteomics Data.

Authors:  Lennart Martens; Juan Antonio Vizcaíno
Journal:  Trends Biochem Sci       Date:  2017-01-22       Impact factor: 13.807

9.  The FAIR Guiding Principles for scientific data management and stewardship.

Authors:  Mark D Wilkinson; Michel Dumontier; I Jsbrand Jan Aalbersberg; Gabrielle Appleton; Myles Axton; Arie Baak; Niklas Blomberg; Jan-Willem Boiten; Luiz Bonino da Silva Santos; Philip E Bourne; Jildau Bouwman; Anthony J Brookes; Tim Clark; Mercè Crosas; Ingrid Dillo; Olivier Dumon; Scott Edmunds; Chris T Evelo; Richard Finkers; Alejandra Gonzalez-Beltran; Alasdair J G Gray; Paul Groth; Carole Goble; Jeffrey S Grethe; Jaap Heringa; Peter A C 't Hoen; Rob Hooft; Tobias Kuhn; Ruben Kok; Joost Kok; Scott J Lusher; Maryann E Martone; Albert Mons; Abel L Packer; Bengt Persson; Philippe Rocca-Serra; Marco Roos; Rene van Schaik; Susanna-Assunta Sansone; Erik Schultes; Thierry Sengstag; Ted Slater; George Strawn; Morris A Swertz; Mark Thompson; Johan van der Lei; Erik van Mulligen; Jan Velterop; Andra Waagmeester; Peter Wittenburg; Katherine Wolstencroft; Jun Zhao; Barend Mons
Journal:  Sci Data       Date:  2016-03-15       Impact factor: 6.444

10.  Tools and data services registry: a community effort to document bioinformatics resources.

Authors:  Jon Ison; Kristoffer Rapacki; Hervé Ménager; Matúš Kalaš; Emil Rydza; Piotr Chmura; Christian Anthon; Niall Beard; Karel Berka; Dan Bolser; Tim Booth; Anthony Bretaudeau; Jan Brezovsky; Rita Casadio; Gianni Cesareni; Frederik Coppens; Michael Cornell; Gianmauro Cuccuru; Kristian Davidsen; Gianluca Della Vedova; Tunca Dogan; Olivia Doppelt-Azeroual; Laura Emery; Elisabeth Gasteiger; Thomas Gatter; Tatyana Goldberg; Marie Grosjean; Björn Grüning; Manuela Helmer-Citterich; Hans Ienasescu; Vassilios Ioannidis; Martin Closter Jespersen; Rafael Jimenez; Nick Juty; Peter Juvan; Maximilian Koch; Camille Laibe; Jing-Woei Li; Luana Licata; Fabien Mareuil; Ivan Mičetić; Rune Møllegaard Friborg; Sebastien Moretti; Chris Morris; Steffen Möller; Aleksandra Nenadic; Hedi Peterson; Giuseppe Profiti; Peter Rice; Paolo Romano; Paola Roncaglia; Rabie Saidi; Andrea Schafferhans; Veit Schwämmle; Callum Smith; Maria Maddalena Sperotto; Heinz Stockinger; Radka Svobodová Vařeková; Silvio C E Tosatto; Victor de la Torre; Paolo Uva; Allegra Via; Guy Yachdav; Federico Zambelli; Gert Vriend; Burkhard Rost; Helen Parkinson; Peter Løngreen; Søren Brunak
Journal:  Nucleic Acids Res       Date:  2015-11-03       Impact factor: 16.971

View more
  5 in total

1.  ABRF Proteome Informatics Research Group (iPRG) 2016 Study: Inferring Proteoforms from Bottom-up Proteomics Data.

Authors:  Joon-Yong Lee; Hyungwon Choi; Christopher M Colangelo; Darryl Davis; Michael R Hoopmann; Lukas Käll; Henry Lam; Samuel H Payne; Yasset Perez-Riverol; Matthew The; Ryan Wilson; Susan T Weintraub; Magnus Palmblad
Journal:  J Biomol Tech       Date:  2018-06-21

2.  The future of metabolomics in ELIXIR.

Authors:  Merlijn van Rijswijk; Charlie Beirnaert; Christophe Caron; Marta Cascante; Victoria Dominguez; Warwick B Dunn; Timothy M D Ebbels; Franck Giacomoni; Alejandra Gonzalez-Beltran; Thomas Hankemeier; Kenneth Haug; Jose L Izquierdo-Garcia; Rafael C Jimenez; Fabien Jourdan; Namrata Kale; Maria I Klapa; Oliver Kohlbacher; Kairi Koort; Kim Kultima; Gildas Le Corguillé; Pablo Moreno; Nicholas K Moschonas; Steffen Neumann; Claire O'Donovan; Martin Reczko; Philippe Rocca-Serra; Antonio Rosato; Reza M Salek; Susanna-Assunta Sansone; Venkata Satagopam; Daniel Schober; Ruth Shimmo; Rachel A Spicer; Ola Spjuth; Etienne A Thévenot; Mark R Viant; Ralf J M Weber; Egon L Willighagen; Gianluigi Zanetti; Christoph Steinbeck
Journal:  F1000Res       Date:  2017-09-06

Review 3.  Integrated Chemometrics and Statistics to Drive Successful Proteomics Biomarker Discovery.

Authors:  Anouk Suppers; Alain J van Gool; Hans J C T Wessels
Journal:  Proteomes       Date:  2018-04-26

Review 4.  Subcellular Transcriptomics and Proteomics: A Comparative Methods Review.

Authors:  Josie A Christopher; Aikaterini Geladaki; Charlotte S Dawson; Owen L Vennard; Kathryn S Lilley
Journal:  Mol Cell Proteomics       Date:  2021-12-16       Impact factor: 5.911

5.  The proBAM and proBed standard formats: enabling a seamless integration of genomics and proteomics data.

Authors:  Gerben Menschaert; Xiaojing Wang; Andrew R Jones; Fawaz Ghali; David Fenyö; Volodimir Olexiouk; Bing Zhang; Eric W Deutsch; Tobias Ternent; Juan Antonio Vizcaíno
Journal:  Genome Biol       Date:  2018-01-31       Impact factor: 13.583

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.