| Literature DB >> 20829833 |
Emek Demir1, Michael P Cary, Suzanne Paley, Ken Fukuda, Christian Lemer, Imre Vastrik, Guanming Wu, Peter D'Eustachio, Carl Schaefer, Joanne Luciano, Frank Schacherer, Irma Martinez-Flores, Zhenjun Hu, Veronica Jimenez-Jacinto, Geeta Joshi-Tope, Kumaran Kandasamy, Alejandra C Lopez-Fuentes, Huaiyu Mi, Elgar Pichler, Igor Rodchenkov, Andrea Splendiani, Sasha Tkachev, Jeremy Zucker, Gopal Gopinath, Harsha Rajasimha, Ranjani Ramakrishnan, Imran Shah, Mustafa Syed, Nadia Anwar, Ozgün Babur, Michael Blinov, Erik Brauner, Dan Corwin, Sylva Donaldson, Frank Gibbons, Robert Goldberg, Peter Hornbeck, Augustin Luna, Peter Murray-Rust, Eric Neumann, Oliver Ruebenacker, Oliver Reubenacker, Matthias Samwald, Martijn van Iersel, Sarala Wimalaratne, Keith Allen, Burk Braun, Michelle Whirl-Carrillo, Kei-Hoi Cheung, Kam Dahlquist, Andrew Finney, Marc Gillespie, Elizabeth Glass, Li Gong, Robin Haw, Michael Honig, Olivier Hubaut, David Kane, Shiva Krupa, Martina Kutmon, Julie Leonard, Debbie Marks, David Merberg, Victoria Petri, Alex Pico, Dean Ravenscroft, Liya Ren, Nigam Shah, Margot Sunshine, Rebecca Tang, Ryan Whaley, Stan Letovksy, Kenneth H Buetow, Andrey Rzhetsky, Vincent Schachter, Bruno S Sobral, Ugur Dogrusoz, Shannon McWeeney, Mirit Aladjem, Ewan Birney, Julio Collado-Vides, Susumu Goto, Michael Hucka, Nicolas Le Novère, Natalia Maltsev, Akhilesh Pandey, Paul Thomas, Edgar Wingender, Peter D Karp, Chris Sander, Gary D Bader.
Abstract
Biological Pathway Exchange (BioPAX) is a standard language to represent biological pathways at the molecular and cellular level and to facilitate the exchange of pathway data. The rapid growth of the volume of pathway data has spurred the development of databases and computational tools to aid interpretation; however, use of these data is hampered by the current fragmentation of pathway information across many databases with incompatible formats. BioPAX, which was created through a community process, solves this problem by making pathway data substantially easier to collect, index, interpret and share. BioPAX can represent metabolic and signaling pathways, molecular and genetic interactions and gene regulation networks. Using BioPAX, millions of interactions, organized into thousands of pathways, from many organisms are available from a growing number of databases. This large amount of pathway data in a computable form will support visualization, analysis and biological discovery.Entities:
Mesh:
Year: 2010 PMID: 20829833 PMCID: PMC3001121 DOI: 10.1038/nbt.1666
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 54.908
Figure 1BioPAX is a shared language for biological pathways. BioPAX reduces the effort required to efficiently communicate between pathway users, databases and software tools. Without a shared language, each system must speak the language of all other systems in the worst case (black lines). With a shared language, each system only needs to speak that language (central red box).
Figure 2BioPAX enables computational data gathering, publication and use of information about biological processes. Traditional pathway information processing: Observations considering prior models published as text and figures. Computable pathway information processing: Scientist’s description represented using formal, computable framework (ontology) published in a computer software readable format for analysis by scientists.
Figure 6The relationship among popular standard formats for pathway information. BioPAX and PSI-MI are designed for data exchange to and from databases and pathway and network data integration. SBML and CellML are designed to support mathematical simulations of biological systems and SBGN represents pathway diagrams.
Figure 3The AKT pathway as represented by a traditional method (top left, from http://www.biocarta.com), a formalized SBGN diagram (http://www.sbgn.org 84) (left), and using the BioPAX language (right). An important advantage of the BioPAX representation is that it can be interpreted by computer software and used in multiple ways, including automatic diagram creation, information retrieval and analysis. Online documentation at http://www.biopax.org contains more details about how to represent diverse types of biological pathways. Actual samples of pathway data in BioPAX OWL XML format are available in Supplementary Tables S2 and S3.
BioPAX covers five main types of biological pathways and coverage has increased over time with new levels of the ontology.
| Type of | Main BioPAX Classes Used | Introduced |
|---|---|---|
| Metabolic | All types of physical entities (most common use of protein, small | Level 1 |
| Signaling | All types of physical entities (most common use of protein, | Level 2 |
| Molecular | All types of physical entities (most common use of protein, | Level 2 |
| Gene | All types of physical entities, TemplateReaction, | Level 3 |
| Genetic | Gene, GeneticInteraction | Level 3 |
Figure 4High-level view of the BioPAX ontology. Classes are shown as boxes and arrows represent inheritance relationships. The three main types of classes in BioPAX are colored, Pathway (red), Interaction (green) and PhysicalEntity and Gene (blue). For brevity, only the properties of the Protein class are shown as an example at the top right. Refer to BioPAX documentation at http://www.biopax.org for full details of all classes and properties.
What is included in BioPAX
| Ontology specification | Web Ontology Language (OWL) XML file, |
| Language documentation | Explanation of BioPAX entities, example |
| Example files | Example files for biochemical pathway, protein |
| Graphical representation | Recommendations for graphical representation |
| Paxtools software | Java programming library supporting |
| List of data sources & supporting software | Databases making data available in BioPAX |
Figure 5Example uses of pathway information in BioPAX format. Red colored boxes or lines indicate use of BioPAX.
Databases and software supporting BioPAX. Note, PSI-MI data sources can be converted to BioPAX Level 2 using the PSI-MI to BioPAX converter.
| Database | Type | URL | Format | License | Statistics |
|---|---|---|---|---|---|
| BIND | Protein |
| PSI-MI | Free to | >85,000 interactions |
| BioCyc | Metabolic and |
| BioPAX | Free to | ~500 mostly |
| BioGRID | Protein- |
| PSI-MI | Free to | >265,000 interactions |
| BioModels | Metabolic and |
| SBML, | Free to | >450 pathways, >240 |
| Cancer Cell | Signaling |
| BioPAX | Free to | Pathways: 10 |
| DIP | Protein- |
| PSI-MI | Free for | >57,000 interactions |
| Ecocyc | Metabolic and |
| BioPAX, | Free to | Pathways: 246 |
| HPRD | Protein- |
| PSI-MI | Free for | >38,000 interactions |
| IMID | Signaling |
| BioPAX | Free to | >2000 interactions |
| INOH | Signaling |
| BioPAX | Free to | >60 pathways |
| IntAct | Protein- |
| PSI-MI | Free to | >200,000 interactions |
| KEGG | Metabolic |
| BioPAX | Free for | >330 reference pathways |
| MetaCyc | Metabolic and |
| BioPAX | Free to | 1399 curated pathways, |
| MINT | Protein- |
| PSI-MI | Free to | >80,000 interactions |
| MIPS | Protein- |
| PSI-MI | Free to | >12,000 interactions |
| NCI/Nature | Signaling |
| BioPAX | Free to | >400 curated pathways |
| NetPath | Signaling |
| BioPAX | Free to | 20 large curated pathways |
| OPHID | Protein- |
| PSI-MI | Free for | >424,000 interactions |
| Pathway | Pathways and |
| BioPAX | Free to | >1,400 collected pathways |
| Reactome | Metabolic and |
| BioPAX, | Free to | >50 curated pathways |
| RegulonDB | Regulatory |
| BioPAX | Free to | Regulatory interactions: |
| Rhea | Metabolic |
| BioPAX, | Free to | >11,000 reactions |