| Literature DB >> 23548156 |
Imad Abugessaisa1, David Gomez-Cabrero, Omri Snir, Staffan Lindblad, Lars Klareskog, Vivianne Malmström, Jesper Tegnér.
Abstract
BACKGROUND: Sequencing of the human genome and the subsequent analyses have produced immense volumes of data. The technological advances have opened new windows into genomics beyond the DNA sequence. In parallel, clinical practice generate large amounts of data. This represents an underused data source that has much greater potential in translational research than is currently realized. This research aims at implementing a translational medicine informatics platform to integrate clinical data (disease diagnosis, diseases activity and treatment) of Rheumatoid Arthritis (RA) patients from Karolinska University Hospital and their research database (biobanks, genotype variants and serology) at the Center for Molecular Medicine, Karolinska Institutet.Entities:
Mesh:
Year: 2013 PMID: 23548156 PMCID: PMC3623742 DOI: 10.1186/1479-5876-11-85
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Figure 1A class diagram for the translational medicine computing platform for RA.
List of selected SNPs
| Rs number | ||||
| 6314 | HTR2A | C | T | C |
| 1328674 | HTR2A | C | T | T |
| 548234 | PRDM1 | T | C | C |
| 4781003 | CIITA | C | T | T |
| 4535211 | PLCL2 | G | A | A |
| 10431908 | CIITA | A | G | G |
| 544167 | C2 | G | T | G |
| 12746613 | FCGR2A | C | T | T |
| 4810485 | CD40 | G | T | G |
| 10498441 | NID2 | A | G | A |
| 10499194 | OLIG3,TNFAIP3 | C | T | C |
| 2064476 | HLA-DPB2 | A | G | A |
| 706778 | IL2RA | C | T | T |
| 2736340 | BLK | A | G | G |
| 26232 | C5orf30 | C | T | C |
| 540386 | TRAF6 | C | T | C |
| 231707 | C4orf8 | G | A | A |
| 10402677 | CEACAM1 | G | A | A |
| 42041 | CDK6 | C | G | G |
| 2024301 | CLEC4A;POU5F1P3 | A | T | T |
| 3807306 | IRF5 | A | C | A |
| 10488631 | IRF5;TNPO3 | T | C | C |
| 3761847 | TRAF1/C5 | A | G | G |
| 7026551 | C5 | A | C | C |
| 11586238 | CD2,CD58 | C | G | G |
| 231735 | CTLA4 | G | T | G |
| 13017599 | REL | A | G | G |
| 394581 | TAGAP | T | C | T |
| 2263484 | C21orf74 | A | C | C |
| 6682654 | CD244 | | | |
| 6859219 | ANKRD55 | C | A | C |
| 13031237 | REL | A | C | C |
| 934734 | SPRED2 | A | G | G |
| 11676922 | AFF3 | A | T | T |
| 3087243 | CTLA4 | G | A | G |
| 1678542 | KIF5A | C | G | C |
| 951500 | CCL21 | A | G | A |
| 892188 | GLP-1;FDX1L;ICAM5 | C | T | T |
| 1133104 | CLEC4A;POU5F1P3 | G | T | T |
| 1980422 | CD28 | T | C | C |
| 1859341 | CEACAM8 | A | G | G |
| 3087456 | CIITA | A | G | G |
| 2271077 | GALNTL2 | A | G | A |
| 2377422 | CLEC4A;POU5F1P3 | C | T | T |
| 2476601 | PTPN22 | C | T | T |
| 2812378 | CCL21;C9orf144B | A | G | G |
| 2240340 | PADI4 | C | T | T |
| 6416647 | CIITA | T | C | C |
| 3890745 | MMEL1 | T | C | T |
| 4272626 | NHLH2 | C | T | T |
| 10258735 | RPA3 | A | G | G |
| 3093023 | CCR6 | G | A | A |
| 3218253 | IL2RB | G | A | A |
| 6822844 | IL2,IL21 | G | T | G |
| 7234029 | PTPN2 | A | G | G |
| 6457620 | HLA-DRA | G | C | G |
| 6920220 | OLIG3,TNFAIP3 | G | A | A |
| 10413014 | CEACAM8 | A | G | G |
| 7574865 | STAT4 | G | T | T |
| 10468473 | MAP2K4 | G | A | A |
| 10410147 | CEACAM8 | G | A | A |
| 10919563 | PTPRC | G | A | G |
| 4750316 | DKFZp667F0711/PRKCQ | G | C | G |
| 2523451 | MICA | A | G | G |
| 6457617 | HLA-DQ | C | T | C |
Figure 2RA Biobank and the cell registry attributes.
Figure 3Illustration of Genotype and serology vocabulary.
Figure 4The Swedish Rheumatology quality register. Entities, attributes, and relationships. Dmard: Disease-modifying Anti-rheumatic Drugs. HAQ: Health Assessment Questionnaire. DAS: The Disease Activity Score.
Figure 5Clinical development center with its components and information flow (
Figure 6Information architecture and user interface of CDR.
Figure 7Visual query builder implementation in CDC.
List of requirements and a comparison with existing solutions
| R1 | Database loading and integration | The platform should provide a visual and easy-to-use user interface to load and transfer data across all studies. The user should be protected from error by having the system validate the source data files before loading them into the database(s). | |||
| R2 | User-role authentication | System and application level authentication techniques should be supported, the PI wants to grant access to his/her co-workers and collaborator and define specific role and privilege. | |||
| R3 | Support for bioinformatics work flow developments | Having all the data loaded into the database, the platform should support development of bioinformatics workflow with less scripting effort. | |||
| R4 | Visual Query Builder | The platform should provide a query builder to easily create and execute queries against the database tables contained in the study. | |||
| R5 | Data export | The platform should allow exporting query results into different formats, e.g., CSV, spreadsheet | |||
| R6 | Version control and traceability | The platform should offer version control for all datasets that are stored in the database. Multiple versions of each file are necessary for traceability between inputs and outputs maintained so that the user can view the earlier versions of each file. | |||
| R7 | Minimal programming effort | The integration of different databases (e.g., biobank, clinical, genotype, etc.) and development of workflow must be easy and require as little programming effort as possible. | |||
| R8 | Source schema customization and metadata management | The platforms should support creation and documentation of metadata for all data files. | |||
| R9 | Dashboard display for studies | With a single click on the list of studies, the PI can navigate to different studies running in his/her group and its associated databases. | |||
| R10 | Installation | The customization and installation effort is minimal. | |||
| Total of 10 requirements |
Figure 8Classification of personal health information (PHI).
List of PHI per data sources
| RA-Biobank | Personnummer - Civic registration number |
| Efternamn – last name | |
| Förnamn- first name | |
| Swedish Rheumatology Quality Register | Personnummer - Civic registration number |
| Patientkod: patient registration number in the registry | |
| Namn: full name | |
| registrerad_pa_mottagning: clininc address |
Figure 9Security layers of the system.
Result of query implemented in CDC visual query builder
| AG | SyF0420 | 53 | 01:G09 | *04/*15 | SFMC | 938.9 |
| AG | SyF0420 | 53 | 01:G10 | *04/*15 | SFMC | 938.9 |
| AG | SyF0420 | 53 | 01:H01 | *04/*15 | SFMC | 938.9 |
| AG | SyF0420 | 53 | 01:H02 | *04/*15 | SFMC | 938.9 |
| AG | SyF0420 | 53 | 01:H03 | *04/*15 | SFMC | 938.9 |
| AG | SyF0420 | 53 | 01:H04 | *04/*15 | SFMC | 938.9 |
| AG | SyF0420 | 53 | 01:H05 | *04/*15 | SFMC | 938.9 |
Different naming used for same attribute in different database sources
| Project number | SyRnr | Patienter::Pat.nr. (patient number) |
| Personnr. + Personnr2 (Date of birth + Personal identification 4 digits number) | | Patienter::Personnummer + Patienter::Pers.dat (Date of birth + Personal identification 4 digits number) |
| Gender | | Patienter::Man_Kv |
| | Provdatum (sample date) | Provdat (sample date) |
| | Provsort (sample type) | Provtyp (sample type) |
| Namn (Name) | Patienter::Efternamn (last name) + Patienter::Förnamn (first name) |