| Literature DB >> 16351751 |
Paolo Romano1, Domenico Marra, Luciano Milanesi.
Abstract
BACKGROUND: The completion of the Human Genome Project has resulted in large quantities of biological data which are proving difficult to manage and integrate effectively. There is a need for a system that is able to automate accesses to remote sites and to "understand" the information that it is managing in order to link data properly. Workflow management systems combined with Web Services are promising Information and Communication Technologies (ICT) tools. Some have already been proposed and are being increasingly applied to the biomedical domain, especially as many biology-related Web Services are now becoming available. Information on biological resources and on genomic sequences mutations are two examples of very specialized datasets that are useful for specific research domains.Entities:
Mesh:
Year: 2005 PMID: 16351751 PMCID: PMC1866383 DOI: 10.1186/1471-2105-6-S4-S24
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1General architecture of the system. The general architecture of the system includes three blocks: workflows creation and annotation (WCA), user interface (UI) and workflows execution (WE). The "Taverna" and "FreeFluo" boxes refer to open source software developed externally from the project. The boxes named "Repository Manager", "Workflow Executor" and "Interface" refer to purpose applications developed within the project. All other boxes refer to databases, either local or remote.
External links to/from CABRI catalogues.
| ICLC Human and animal cell lines | 0 | 294 | 0 |
| DSMZ Human and animal cell lines | 0 | 905 | 0 |
| NCCB Plasmids | 0 | 371 | 0 |
| BCCM/LMBP™ Plasmids | 154 | 405 | 816 |
| NCCB Phages | 0 | 30 | 0 |
| BCCM/LMG™ Bacteria strains | 799 | 0 | 0 |
| DSMZ Bacteria strains | 2,618 | 0 | 0 |
| NCIMB Bacteria strains | 334 | 0 | 0 |
| CIP Bacteria strains | 545 | 0 | 0 |
| BCCM/IHEM™ Fungi and Yeasts strains | 18 | 0 | 0 |
| BCCM/MUCL™ Fungi and Yeasts strains | 119 | 0 | 0 |
| CBS Filamentous fungi strains | 9,667 | 454 | 0 |
| CBS Yeasts strains | 52,983 | 1,132 | 0 |
| CABI Bioscience Fungi strains | 206 | 0 | 0 |
| DSMZ Fungi and Yeasts strains | 31 | 0 | 0 |
| Total | 67,256 | 3,591 | 816 |
The reported figures represents the number of external links to/from CABRI catalogues. Only catalogues having at least one link are listed. EMBL links are included in the EMBL Data Library since EMBL version 81. Links to Medline and plasmids' maps are included in CABRI catalogues' records.
Figure 2ACD definitions for two Soaplab based Web Services searching CABRI cell lines catalogues. This figure shows the definitions used for the set-up of the Web Services that return unique identifiers of cell lines in CABRI catalogues after a search either by name or by property (free text search). These definitions are written in the ACD language and must be converted into XML before they can be used. Differences between the two definitions consist in the comment line where the actual parameters for the remote call to the CABRI site are defined
List of Web Services related to IARC TP53 Mutation Database.
| getP53MutationsByProperty | lib, text | Full record |
| getP53MutationsByIds | Id | Full record |
| getP53MutationIdsByType | lib, mutation type | id(s) |
| getP53MutationIdsByEffect | lib, effect | id(s) |
| getP53MutationIdsByExon | lib, exon number | id(s) |
| getP53MutationIdsByIntron | lib, intron number | id(s) |
| getP53MutationIdsByCodonNumber | lib, codon number | id(s) |
| getP53MutationIdsByCpgSite | lib, cpg site (true/false) | id(s) |
| getP53MutationIdsBySpliceSite | lib splice site (true/false) | id(s) |
| getP53MutationIdsByMetastasisLocalization | lib, metastasis localization (organ) | id(s) |
| getP53MutationIdsByTumorOrigin | lib, origin (primary, secondary, ...) | id(s) |
This table lists all Web Services that have been created with the aim of allowing data retrieval from IARC TP53 Mutation Database. Here id refers to the Mutation_ID defined by IARC, while lib refers to the name of involved database that currently is 'tp53_iarc' but has been parameterized in view of future extension of the system, allowing the search of other databases as well.
List of Web Services related to CABRI catalogues.
| getBacteriaIdsByName | Bacteria strains | lib(s), name | id(s) |
| getBacteriaIdsByProperty | Bacteria strains | lib(s), text | id(s) |
| getBacteriaById | Bacteria strains | id | full record |
| getFungiIdsByName | Filamentous fungi strains | lib(s), name | id(s) |
| getFungiIdsByProperty | Filamentous fungi strains | lib(s), text | id(s) |
| getFungiById | Filamentous fungi strains | id | full record |
| getYeastIdsByName | Yeasts strains | lib(s), name | id(s) |
| getYeastIdsByProperty | Yeasts strains | lib(s), text | id(s) |
| getYeastsById | Yeasts strains | id | full record |
| getPlasmidIdsByName | Plasmids | lib(s), name | id(s) |
| getPlasmidIdsByProperty | Plasmids | lib(s), text | id(s) |
| getPlasmidsById | Plasmids | id | full record |
| getPhageIdsByName | Phages | lib(s), name | id(s) |
| getPhageIdsByProperty | Phages | lib(s), text | id(s) |
| getPhagesById | Phages | id | full record |
| getCellLinesIdsByName | Human and animal cell lines | lib(s), name | id(s) |
| getCellLinesIdsByProperty | Human and animal cell lines | lib(s), text | id(s) |
| getCellLinesById | Human and animal cell lines | id | full record |
| getResourceIdsByName | All | lib(s), name | id(s) |
| getResourcesById | All | id | full record |
This table lists all Web Services that have been created with the aim of allowing data retrieval from CABRI catalogues. Here, id refers to the unique identifier of resources in the catalogues and can therefore be a strain number, an accession number or a collection number. Instead, lib refers to the name of involved catalogues, while name is the scientific name where spaces must be substituted by the '_SP_' string and text is any single word.
List of some workflows. .
This table lists some of the workflows that have been created and are available on-line for demonstration purposes in the Scufl format. These workflows can be launched by using the Taverna Workbench
Figure 3Diagram of the GetBacteriaByName workflow. This diagram lists one of the workflows that have been created and are available on-line for demonstration purposes. Two web services are used (getBacteriaIdsByname and getBacteriaById), together with some local string elaborations. This output has been generated by the Taverna Workbench.