| Literature DB >> 25717408 |
Elisabeth Scheufele1, Dina Aronzon2, Robert Coopersmith2, Michael T McDuffie2, Manish Kapoor2, Christopher A Uhrich2, Jean E Avitabile2, Jinlei Liu2, Dan Housman2, Matvey B Palchuk1.
Abstract
The tranSMART knowledge management and high-content analysis platform is a flexible software framework featuring novel research capabilities. It enables analysis of integrated data for the purposes of hypothesis generation, hypothesis validation, and cohort discovery in translational research. tranSMART bridges the prolific world of basic science and clinical practice data at the point of care by merging multiple types of data from disparate sources into a common environment. The application supports data harmonization and integration with analytical pipelines. The application code was released into the open source community in January 2012, with 32 instances in operation. tranSMART's extensible data model and corresponding data integration processes, rapid data analysis features, and open source nature make it an indispensable tool in translational or clinical research.Entities:
Year: 2014 PMID: 25717408 PMCID: PMC4333702
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1.N-tier Architecture of tranSMART
Figure 2.Best Practice Approach to Data Integration
Data Categories Supported by tranSMART platform
| Category | Type | Description | Example | Usage | Storage |
|---|---|---|---|---|---|
| Level 1 | Raw | • Raw data from source platform | • Raw binary machine reads | • Processing pipeline | File system |
| Level 2 | Processed | • Normalized data through curation or data processing pipelines | • Clinical trial data | • Dataset Explorer | Database: DeApp, i2b2DemoData |
| Level 3 | Interpreted | • Interpreted or aggregated data from processed data | • Z-scores for gene expression data | • Dataset Explorer | Database: DeApp, BioMart |
| Level 4 | Summary and Findings | • Quantified association and analysis across multiple samples. | • Fold changes | • Search | Database: BioMart |
| Master Data | Slow changing data | • Data about key business entities in the system. | • Study design | • Dataset Explorer | Database: i2b2Mctadata, i2b2DemoData, BioMart, SearchApp |
| Reference Data | Slow changing data used as reference | • Data from other system that’s used as identifier data or as a reference to other systems | • Affymetrix annotation files | • Dataset Explorer | Database: DeApp, BioMart |
| MetaData - Structural | Metadata | • Data that describes data structure | • Data dictionary | • Documentation | File system |
| MetaData – Administrative (Operational) | Metadata | • Data associated with application/data access and operation | • ETL auditing and QC results | • Search | Database: searchApp, rdc_cz |
tranSMART Implementations
| Organization Types | # Instances |
|---|---|
| AMC | 4 |
| Biopharma | 11 |
| Cancer Center | 2 |
| Commercial Software | 1 |
| Government | 4 |
| Non-profit | 5 |
| Research | G |