| Literature DB >> 25834725 |
Christian Castaneda1, Kip Nalley2, Ciaran Mannion3, Pritish Bhattacharyya3, Patrick Blake2, Andrew Pecora4, Andre Goy4, K Stephen Suh5.
Abstract
As research laboratories and clinics collaborate to achieve precision medicine, both communities are required to understand mandated electronic health/medical record (EHR/EMR) initiatives that will be fully implemented in all clinics in the United States by 2015. Stakeholders will need to evaluate current record keeping practices and optimize and standardize methodologies to capture nearly all information in digital format. Collaborative efforts from academic and industry sectors are crucial to achieving higher efficacy in patient care while minimizing costs. Currently existing digitized data and information are present in multiple formats and are largely unstructured. In the absence of a universally accepted management system, departments and institutions continue to generate silos of information. As a result, invaluable and newly discovered knowledge is difficult to access. To accelerate biomedical research and reduce healthcare costs, clinical and bioinformatics systems must employ common data elements to create structured annotation forms enabling laboratories and clinics to capture sharable data in real time. Conversion of these datasets to knowable information should be a routine institutionalized process. New scientific knowledge and clinical discoveries can be shared via integrated knowledge environments defined by flexible data models and extensive use of standards, ontologies, vocabularies, and thesauri. In the clinical setting, aggregated knowledge must be displayed in user-friendly formats so that physicians, non-technical laboratory personnel, nurses, data/research coordinators, and end-users can enter data, access information, and understand the output. The effort to connect astronomical numbers of data points, including '-omics'-based molecular data, individual genome sequences, experimental data, patient clinical phenotypes, and follow-up data is a monumental task. Roadblocks to this vision of integration and interoperability include ethical, legal, and logistical concerns. Ensuring data security and protection of patient rights while simultaneously facilitating standardization is paramount to maintaining public support. The capabilities of supercomputing need to be applied strategically. A standardized, methodological implementation must be applied to developed artificial intelligence systems with the ability to integrate data and information into clinically relevant knowledge. Ultimately, the integration of bioinformatics and clinical data in a clinical decision support system promises precision medicine and cost effective and personalized patient care.Entities:
Keywords: Artificial intelligence; Bioinformatics; Clinical decision support system; Clinical informatics; Clinical outcome; Integrated knowledge environment; Patient care; Personalized medicine; Precision medicine; Watson
Year: 2015 PMID: 25834725 PMCID: PMC4381462 DOI: 10.1186/s13336-015-0019-3
Source DB: PubMed Journal: J Clin Bioinforma ISSN: 2043-9113
Overview of popular electronic health record vendors with various measures of market share and user satisfaction
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 12.1% | 19.6% | +7.6% | 10.2% | 15.1% | +4.9% | 10 th | 18 th | 9 th | 6 th | 22% | 6.4% | 15.5% | 25.6% |
|
| 13.0% | 11.0% | −2.0% | 3.4% | 3.9% | +0.5% | 15 th/21 st/23 rd | 16 th/24 th/29 th | 17 th/24 th/30 th | 13 th | 10% | 11.0% | 15.2% | 15.3% |
|
| 12.4% | 8.6% | −3.8% | N/A | N/A | N/A | 11 th | 12 th | 11 th | 8 th | 6% | 13.8% | 11.8% | 6.4% |
|
| 9.6% | 8.2% | −1.4% | 0.6% | 1.4% | +0.8 | 27 th | 30 th | 26 th | 17 th | 5% | 5.5% | 5.8% | 5.8% |
|
| 6.1% | 6.5% | +0.5% | 0.3% | 0.4% | +0.1 | 14 th/17 th | 19 th/20 th | 13 th/21st | 10 th | 6% | 6.2% | 11.6% | 7.7% |
|
| 1.7% | 3.7% | +2.1% | 15.1% | 14.4% | −0.7 | N/A | N/A | N/A | 11 th | 9% | N/A | 5.8% | 15.6% |
|
| 1.4% | 2.9% | +1.5% | N/A | N/A | N/A | 9 th | 3 rd | 7 th | 2 nd | 2% | N/A | N/A | N/A |
|
| 4.9% | 2.4% | −2.5% | N/A | N/A | N/A | 22 nd | 21 st | 19 th | N/A | N/A | 3.3% | 3.6% | N/A |
|
| 2.5% | 2.1% | −0.4% | 7.5% | 9.5% | +2.0 | N/A | N/A | N/A | 16 th | 2% | 4.1% | N/A | 2.4% |
|
| 3.1% | 2.2% | −0.9% | 0.0% | 0.0% | 0.0 | 28 th | 28 th | 22 nd | N/A | 1% | N/A | N/A | N/A |
“Provider” is defined as a health care professional who has (1) registered for the CMS Medicare or Medicaid EHR Incentive Programs, (2) attested to Meaningful Use as part of the CMS Medicare EHR Incentive Program, or (3) is receiving technical assistance from the ONC REC Program in order to meet the milestones of the CMS EHR Incentive Programs. “Eligible hospital” is defined as a non-acute care hospital [16]. Primary vendors have EHR products in a provider’s EHR system that meet the majority of “Meaningful Use” criteria; are the sole vendor of EHR products for the provider; or are vendors of a complete EHR product as reported in the Certified EHR Product List.
Figure 1Typical office-visit workflow, highlighting points at which CDSS may improve care. Electronic health records act as a reservoir of information used by clinicians and clinical decision support systems to plan healthcare. Along with this, information research data feeds back into the workflow allowing a self-improving cycle of information exchange. CDSS then affects many stages of the office visit and optimizes patient care through warnings, reminders, and suggestions.
Publications in pathology CDSS implementation
|
|
|
|
|
|---|---|---|---|
|
| Prostate Cancer | Created a system used to store topographical information about prostate sample biopsy specimens to create heat-maps for areas of most likely to be prostatic carcinoma, essential for clinical decision-making, prognosis, and research. | Between 2010 and 2011, generated 259 biopsy case reports uploaded to the database with 100% data completeness and a source-to-database error of 10.3 per 10,000 fields. Serves as an implementation of pathology data sharing through a healthcare information system. |
|
| Hematologic Neoplasms | Developed of web-enabled relational database, by pooling literature for cell surface marker definitions of 37 hematologic neoplasms. Using this expression profile, an algorithm was created by pathologists to assist in teaching flow cytometry diagnosis of hematologic neoplasms | Algorithm for identifying hematologic marker expression patterns validated using 92 clinical cases with an identification success rate of 89%. Tool has been used by pathologists-in-training to develop flow cytometry interpretation skills. |
|
| Prostate Cancer | Developed a Bayesian belief network (BBN) for Gleason grading of prostate adenocarcinoma, to allow subjective evaluation of prostatic carcinoma slides by computer. | As histological diagnosis of prostate carcinoma is often produces wide inter-interpreter variability. This tool serves as a decision support tool to interpret descriptive terms in pathology reports and accurately determine tumor grading. |
|
| Bladder Cancer | Designed a decision support system, employed by pathologists in microscopic observation of tissue samples and measurements of nuclear characteristics, allowing automatic assessment of urinary bladder tumor grade and cancer recurrence probability. | The system employed classified tumors with an accuracy of 82%, 80.5%, and 93.1% for tumors of grade I, II, and III. Suggested prognosis in 72.8% of samples with a confidence of 74.5%. |
|
| Information Sharing | Established a graphic-based computerized system for display of clinical pathology data to permit improved data access and sharing. | By displaying laboratory data in a graphical manner, designers aimed to create a user-friendly system meant to highlight pertinent trends necessary for decision management. |
|
| Colorectal cancer | Deployed a new guideline to improve recognition of patients at risk for Lynch syndrome by a multidisciplinary team of surgeons, pathologists and clinical geneticists. | An electronic reminder system for pathologists increasing identification of patients at high risk for Lynch syndrome by 18% when compared to the control arm. |
|
| Hematologic Neoplasms | Developed an image based retrieval system containing 261 digitized images of lymphocyte disorders and regular lymphocytes. Queries to the system extract features of the histopathological images for comparison and identification. | Based solely on morphological characteristics, the analysis system performed better than humans in identifying certain lymphoproliferative disorders according to a ten-fold cross-validated confusion matrix. |
Selected manuscripts focused upon clinical decision support for pathologists. Ranging from image identification to computerized reminders, the potential for decision support is broad. The purpose of each support system and a summary of conclusions are also noted.
Overview of software systems designed to integrate data from multiple databases
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| “A biological data warehouse called Atlas that locally stores and integrates biological sequences, molecular interactions, homology information, functional annotations of genes, and biological ontologies”. | Open Source | Manual | 13 | GenBank, RefSeq, UniProt, Human Protein Reference Database (HPRD), Biomolecular Interaction Network Database (BIND), Database of Interacting Proteins (DIP), Molecular Interactions Database (MINT), IntAct, NCBI Taxonomy, Gene Ontology (GO), Online Mendelian Inheritance in Man (OMIM), LocusLink, Entrez Gene and HomoloGene | British Columbia University - Vancouver, BC |
|
| “An open source toolkit for constructing bioinformatics database warehouses using the MySQL and Oracle relational database managers. Integrates multiple public bioinformatics databases into a single relational database system within a common bioinformatics schema”. | Open Source | Dependent on the individual databases | 12 | ENZYME, KEGG, BioPax, Eco2dbase, Metacyc, Mage-ML and BioCyc, UniProt, GenBank, NCBI Taxonomy, CMR databases, and Gene Ontology. | Stanford Research Institute – Menlo Park, Ca |
|
| “Facilitates the creation of protein structure data sets for many structure-based studies. It allows combining queries on a number of structure-related databases not covered by other projects at present”. | Free-Use | Dependent on the individual databases | 12 | PDB, SCOP, CATH, DSSP, ENZYME, Boehringer, KEGG, Swiss-Prot, GO, GOA, Taxonomy, PISCES | Humboldt-Universität zu Berlin – Berlin Germany |
|
| “To provide an integrated bioinformatics platform for a systems biology approach to the biology of pseudomonads in infection and biotechnology”. | Free-Use | Unknown | 4 | KEGG, Pseudomonas Genome Database v2, PRODORIC, and BRENDA | Technische Universität Braunschweig - Braunschweig, Germany |
|
| “A cancer microarray database and web-based data-mining platform aimed at facilitating discovery from genome-wide expression analyses”. | Free-Use, Subscription-based for expanded functionality | Annually for Free Version, Regular data updates for subscription | - | 65 Gene expression datasets, from 4700 microarray experiments. | Life Technologies Corporation |
|
| “BioMart enables scientists to perform advanced querying of biological data sources through a single web interface. The power of the system comes from integrated querying of data sources regardless of their geographical locations”. | Open Source | Unknown | 25 (as of 2009), 46 as of 5/2014 | Ensembl Genes, Ensembl Homology, Ensembl Variation, Ensembl Genomic Features, Vega, HTGT, Gramene, Reactome, Wormbase, Dictybase, RGD, PRIDE, EURATMart, MSD, Uniprot, Pancreatic Expression Database, PepSeeker, ArrayExpress, GermOnLine, DroSpeGe, HapMap, VectorBase, Paramecium, Eurexpress, Europhenome | Collaboration between many institutes and Universities. |
|
| “The Ondex data integration platform enables data from diverse biological datasets to be linked, integrated and visualised through graph analysis techniques. Ondex can be used in a number of important application areas such as transcription analysis, protein interaction analysis, data mining and text mining”. | Open Source | Unknown | 28 | AraCyc, AtRegNet, BioCyc, BioGRID, Brenda, Cytoscape, EcoCyc, GOA, Gramene, Grassius, KEGG, Medline, MetaCyc, O-GlycBase, OMIM, PDB, Pfam, Prolog (limited functionality), SGD, TAIR, TIGR, Transfac, transpath, UniProt, WordNet, ChEBI, ChEMBL, GFF3 | Rothamsted Research Harpenden, UK |
|
| “InterMine is an open-source data warehouse system that facilitates the building of databases with complex data integration requirements and a need for a fast customizable query facility. Using InterMine, large biological databases can be created from a range of heterogeneous data sources, and the extensible data model allows for easy integration of new data types”. | Open Source | Unknown | 23 | GO Annotation, GO OBO, Treefam, Homologene, OrthoDB, Panther, Ensembl, Compara, BioGRID, IntAct, PSI-MI Ontology, KEGG, Reactome, UniProt, Protein Data Bank, InterPro, PubMed, Ensembl SNP, Chado, Ensembl Core, FASTA, GFF3, OMIM, Uberon | University of Cambridge - Cambridge, United Kingdom |
|
| “An integrated, growing biomarker repository of over 2,000 breast, ovarian, colorectal, non-Hodgkin’s lymphoma and melanoma biomarkers mined and manually curated by PhD. scientist from full-text papers. Annotations include 33 critical data elements (CDEs) organized in computable Sophic Cancer Biomarker Objects (SCBOs). SCan-MarK allows researchers to mine, explore and expose complex biomarker, disease, treatment, outcome relationships graphically displayed as knowledge networks”. | Free Trial | Manual | 30 | Examples: TCGA, dbSNP, Cancer Gene Index, Drugbank, PDB, Sophic’s non-redundant Sanger COSMIC, Medline, ENSEMBL, ENZYME, Go, Interpro, Pfam, Pubchem, Unigene, Taxonomy, Uniprot, Refseq, Entrezgene, Reactome Pathway | Sophic Alliance |
Displayed are current software solutions whose primary goal is to facilitate research workflow through data-mining algorithms. These software solutions range from open-source to paid subscription, and target specific subgroups of scientists. A common underlying goal amongst these examples is centralization of multiple databases through the use of algorithms and standardization.