| Literature DB >> 29515813 |
Srinivasareddy Putluri1, Md Zia Ur Rahman1, Shaik Yasmeen Fathima2.
Abstract
Cloud computing offers significant research and economic benefits to healthcare organisations. Cloud services provide a safe place for storing and managing large amounts of such sensitive data. Under conventional flow of gene information, gene sequence laboratories send out raw and inferred information via Internet to several sequence libraries. DNA sequencing storage costs will be minimised by use of cloud service. In this study, the authors put forward a novel genomic informatics system using Amazon Cloud Services, where genomic sequence information is stored and accessed for processing. True identification of exon regions in a DNA sequence is a key task in bioinformatics, which helps in disease identification and design drugs. Three base periodicity property of exons forms the basis of all exon identification techniques. Adaptive signal processing techniques found to be promising in comparison with several other methods. Several adaptive exon predictors (AEPs) are developed using variable normalised least mean square and its maximum normalised variants to reduce computational complexity. Finally, performance evaluation of various AEPs is done based on measures such as sensitivity, specificity and precision using various standard genomic datasets taken from National Center for Biotechnology Information genomic sequence database.Entities:
Keywords: AEP; DNA; DNA analysis; DNA sequencing; adaptive signal processing; base periodicity; bioinformatics; cloud computing; cloud-based adaptive exon prediction; disease identification; gene information; gene sequence; genomic sequence database; healthcare; molecular biophysics
Year: 2018 PMID: 29515813 PMCID: PMC5830887 DOI: 10.1049/htl.2017.0032
Source DB: PubMed Journal: Healthc Technol Lett ISSN: 2053-3713
Fig. 1Traditional genome informatics system
Fig. 2Novel genome bioinformatics cloud-based system for exon prediction
Performance comparison in terms of execution times
| Sequence number | Execution time with proposed model, s | Execution time without cloud, s |
|---|---|---|
| 1 | 116 | 286 |
| 2 | 183 | 314 |
| 3 | 224 | 375 |
| 4 | 257 | 412 |
| 5 | 292 | 483 |
Dataset of DNA sequences from NCBI database on node 1
| Sequence number | Accession number | Sequence definition |
|---|---|---|
| 1 | E15270.1 | human gene for osteoclastogenesis inhibitory factor gene |
| 2 | X77471.1 | Homo sapiens human tyrosine aminotransferase gene |
| 3 | AB035346.2 | Homo sapiens T cell lymphoma/leukaemia 6 gene |
| 4 | AJ225085.1 | Homo sapiens Fanconi anaemia group A gene |
| 5 | AF009962 | Homo sapiens CC chemokine (CCR-5) receptor gene |
Fig. 3Location of exon (3934-4581) for a DNA sequence with accession AF009962 predicted using various AEPs
a LMS-based AEP
b VNLMS-based AEP
c VNSRLMS-based AEP
d VNSLMS-based AEP
e VNSSLMS-based AEP
f MVNLMS-based AEP
g MVNSRLMS-based AEP
h MVNSLMS-based AEP
i MVNSSLMS-based AEP
Performance measures of various AEPs with respect to Sn, Sp and Pr calculations
| Sequence number | Parameter | LMS | VNLMS | VNSRLMS | VNSLMS | VNSSLMS | MVNLMS | MVNSRLMS | MVNSLMS | MVNSSLMS |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Sn | 0.6286 | 0.8128 | 0.7972 | 0.7795 | 0.7581 | 0.7692 | 0.7507 | 0.7416 | 0.7302 |
| Sp | 0.6435 | 0.8021 | 0.7836 | 0.7732 | 0.7565 | 0.7684 | 0.7423 | 0.7465 | 0.7212 | |
| Pr | 0.5922 | 0.8137 | 0.7783 | 0.7697 | 0.7488 | 0.7595 | 0.7512 | 0.7396 | 0.7323 | |
| 2 | Sn | 0.6384 | 0.8024 | 0.7835 | 0.7769 | 0.7597 | 0.7691 | 0.7456 | 0.7432 | 0.7318 |
| Sp | 0.6628 | 0.7992 | 0.7841 | 0.7685 | 0.7586 | 0.7635 | 0.7523 | 0.7476 | 0.7311 | |
| Pr | 0.5894 | 0.8136 | 0.7924 | 0.7715 | 0.7526 | 0.7463 | 0.7392 | 0.7257 | 0.7186 | |
| 3 | Sn | 0.6457 | 0.8028 | 0.7882 | 0.7793 | 0.7581 | 0.7692 | 0.7517 | 0.7446 | 0.7306 |
| Sp | 0.6587 | 0.8121 | 0.7936 | 0.7592 | 0.7465 | 0.7682 | 0.7423 | 0.7365 | 0.7212 | |
| Pr | 0.5934 | 0.7994 | 0.7823 | 0.7667 | 0.7488 | 0.7596 | 0.7532 | 0.7456 | 0.7323 | |
| 4 | Sn | 0.6273 | 0.8145 | 0.7936 | 0.7735 | 0.7557 | 0.7638 | 0.7537 | 0.7374 | 0.7214 |
| Sp | 0.6405 | 0.8024 | 0.7835 | 0.7529 | 0.7497 | 0.7691 | 0.7476 | 0.7402 | 0.7318 | |
| Pr | 0.5858 | 0.8137 | 0.7941 | 0.7775 | 0.7586 | 0.7598 | 0.7443 | 0.7376 | 0.7251 | |
| 5 | Sn | 0.6481 | 0.7989 | 0.7884 | 0.7615 | 0.7526 | 0.7663 | 0.7546 | 0.7457 | 0.7306 |
| Sp | 0.6518 | 0.8058 | 0.7812 | 0.7596 | 0.7461 | 0.7692 | 0.7583 | 0.7446 | 0.7382 | |
| Pr | 0.5904 | 0.8121 | 0.7936 | 0.7782 | 0.7565 | 0.7682 | 0.7525 | 0.7465 | 0.7296 |