| Literature DB >> 15608287 |
Susie M Stephens1, Jake Y Chen, Marcel G Davidson, Shiby Thomas, Barry M Trute.
Abstract
As database management systems expand their array of analytical functionality, they become powerful research engines for biomedical data analysis and drug discovery. Databases can hold most of the data types commonly required in life sciences and consequently can be used as flexible platforms for the implementation of knowledgebases. Performing data analysis in the database simplifies data management by minimizing the movement of data from disks to memory, allowing pre-filtering and post-processing of datasets, and enabling data to remain in a secure, highly available environment. This article describes the Oracle Database 10g implementation of BLAST and Regular Expression Searches and provides case studies of their usage in bioinformatics. http://www.oracle.com/technology/software/index.html.Entities:
Mesh:
Year: 2005 PMID: 15608287 PMCID: PMC540068 DOI: 10.1093/nar/gki114
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Experimental approach for the identification of protein–protein interactions using ODM BLAST.
Figure 2A sample of the protein–protein interaction results gained from ODM BLAST analysis.
SQL and PL/SQL interfaces for Oracle Regular Expressions
| SQL function | Description |
|---|---|
| REGEXP_LIKE | Determine whether pattern matches |
| REGEXP_SUBSTR | Determine what string matches the pattern |
| REGEXP_INSTR | Determine where the match occurred in the string |
| REGEXP_REPLACE | Search and replace a pattern |
Figure 3Depicts a segment of the results from the Regular Expressions Search. The sequence for protein ‘NP_003983’ is shown with the Regular Expressions highlighted in boldface.
Figure 4Most frequently occurring motifs in a protein–protein interaction database.
Figure 5Frequency of motifs within proteins.