| Literature DB >> 28775673 |
Eleanor Williams1,2, Josh Moore1, Simon W Li1, Gabriella Rustici1, Aleksandra Tarkowska1, Anatole Chessel3,4, Simone Leo1,5, Bálint Antal3, Richard K Ferguson1, Ugis Sarkans2, Alvis Brazma2, Rafael E Carazo Salas3,6, Jason R Swedlow1.
Abstract
Access to primary research data is vital for the advancement of science. To extend the data types supported by community repositories, we built a prototype Image Data Resource (IDR) that collects and integrates imaging data acquired across many different imaging modalities. IDR links data from several imaging modalities, including high-content screening, super-resolution and time-lapse microscopy, digital pathology, public genetic or chemical databases, and cell and tissue phenotypes expressed using controlled ontologies. Using this integration, IDR facilitates the analysis of gene networks and reveals functional interactions that are inaccessible to individual studies. To enable re-analysis, we also established a computational resource based on Jupyter notebooks that allows remote access to the entire IDR. IDR is also an open source platform that others can use to publish their own image data. Thus IDR provides both a novel on-line resource and a software infrastructure that promotes and extends publication and re-analysis of scientific image data.Entities:
Mesh:
Year: 2017 PMID: 28775673 PMCID: PMC5536224 DOI: 10.1038/nmeth.4326
Source DB: PubMed Journal: Nat Methods ISSN: 1548-7091 Impact factor: 28.547
Data sets in IDR
| Study identifier | Species | Type | Screens or experiments | 5D images | Size (TB) | Pheno- typesa | Targetsb | Experimentsc | Reference |
|---|---|---|---|---|---|---|---|---|---|
| idr0001-graml-sysgro | Gene deletion screen | 1 | 109,728 | 10.06 | 19 | 3,005 | 18,432 | [ | |
| idr0002-heriche-condensation | Human | RNAi screen | 1 | 1,152 | 2.10 | 2 | 102 | 1,152 | [ |
| idr0003-breker-plasticity | Protein screen | 1 | 97,920 | 0.20 | 14 | 6,234 | 32,640 | [ | |
| idr0004-thorpe-rad52 | Gene deletion screen | 1 | 3,765 | 0.17 | 1 | 4,195 | 4,512 | [ | |
| idr0005-toret-adhesion | RNAi screen | 2 | 45,792 | 0.14 | 1 | 13,035 | 15,264 | [ | |
| idr0006-fong-nuclearbodies | Human | Protein localization screen | 1 | 240,848 | 1.40 | 8 | 12,743 | 16,224 | [ |
| idr0007-srikumar-sumo | Protein localization screen | 1 | 3,456 | 0.02 | 23 | 377 | 1,152 | [ | |
| idr0008-rohn-actinome | RNAi screen | 2 | 55,944 | 0.12 | 46 | 12,826 | 26,496 | [ | |
| idr0009-simpson-secretion | Human | RNAi screen | 2 | 397,056 | 3.25 | 3 | 17,960 | 397,056 | [ |
| idr0010-doil-dnadamage | Human | RNAi screen | 1 | 56,832 | 0.08 | 2 | 18,675 | 56,832 | [ |
| idr0011-ledesmafernandez-dad4 | Gene deletion screen | 5 | 8,957 | 0.4 | 1 | 5,209 | 8,736 | NA | |
| idr0012-fuchs-cellmorph | Human | RNAi screen | 1 | 45,692 | 0.38 | 18 | 16,701 | 26,112 | [ |
| idr0013-neumann-mitocheck | Human | RNAi screen | 2 | 200,995 | 14.54 | 18 | 18,393 | 206,592 | [ |
| idr0015-UNKNOWN-taraoceans | Multi-species | Geographic screen | 1 | 32,776 | 2.49 | 0 | 84 | 84 | [ |
| idr0016-wawer- bioactivecompoundprofiling | Human | Small molecule screen | 1 | 869,820 | 3.19 | 2 | 29,542 | 144,000 | [ |
| idr0017-breinig-drugscreen | Human | Small molecule screen | 1 | 147,456 | 2.48 | 0 | 1,281 | 36,864 | [ |
| idr0018-neff-histopathology | Histopathology of gene knockouts | 1 | 899 | 0.27 | 48 | 9 | 248 | — | |
| idr0019-sero-nfkappab | Human | HCS image analysis | 1 | 25,872 | 0.03 | 0 | 198 | 2,156 | [ |
| idr0020-barr-chtog | Human | RNAi screen | 1 | 36,960 | 0.03 | 2 | 241 | 1,232 | [ |
| idr0021-lawo- pericentriolarmaterial | Human | Protein localization using 3D-SIM | 1 | 414 | 0.0003 | 1 | 9 | 414 | [ |
| idr0023-szymborska- nuclearpore | Human | Protein localization using dSTORM | 1 | 524 | 0.0005 | 1 | 7 | 359 | [ |
| idr0027-dickerson- chromatin | 3D-tracking of tagged chromatin loci | 1 | 229 | 0.03 | 0 | 8 | 112 | [ | |
| idr0028-pascualvargas-rhogtpases | Human | RNAi screen | 4 | 155,332 | 0.18 | 9 | 170 | 5,544 | [ |
| idr0032-yang-meristem | 1 | 458 | 0.003 | 5 | 115 | 115 | [ | ||
| Sum | 35 | 2,538,777 | 42 | 224 | 161,119 | 1,002,328 | |||
| Average | 105,782 | 1.73 | 9 | 6,713 | 41,764 |
aThe number of submitted phenotypes.
bThe number of genes, compounds or proteins identified as targets for analysis.
cThe number of individual wells (in HCS studies) or imaging experiments (in nonscreen data sets). NA, not applicable (unpublished data).
Example URLs and views of IDR data sets
| Study identifier | IDR URL |
|---|---|
| idr0001-graml-sysgro | |
| idr0002-heriche-condensation | |
| idr0003-breker-plasticity | |
| idr0004-thorpe-rad52 | |
| idr0005-toret-adhesion | |
| idr0006-fong-nuclearbodies | |
| idr0007-srikumar-sumo | |
| idr0008-rohn-actinome | |
| idr0009-simpson-secretion | |
| idr0010-doil-dnadamage | |
| idr0011-ledesmafernandez-dad4 | |
| idr0012-fuchs-cellmorph | |
| idr0013-neumann-mitocheck | |
| idr0015-UNKNOWN-taraoceans | |
| idr0016-wawer-bioactivecompoundprofiling | |
| idr0017-breinig-drugscreen | |
| idr0018-neff-histopathology | |
| idr0019-sero-nfkappab | |
| idr0020-barr-chtog | |
| idr0021-lawo-pericentriolarmaterial | |
| idr0023-szymborska-nuclearpore | |
| idr0027-dickerson-chromatin | |
| idr0028-pascualvargas-rhogtpases | |
| idr0032-yang-meristem |
Figure 1Sampling of phenotypes in the IDR.
Each sample represents a well from a microwell plate in a screen or an image from a data set. Wells annotated as controls were not included. User-submitted phenotype terms were mapped to the CMPO terms shown here. Colors represent higher-level groupings of phenotype terms. Point size represents the number of studies each phenotype is linked to (1, 2, 3 or 4 studies).
Figure 2Network analysis of genes linked to the elongated cell phenotype in the IDR.
(a) Protein–protein interaction network based on the genes linked to the elongated cell phenotype (CMPO_0000077) in three IDR studies. Genes from S. pombe (green, idr0001-A)[5], HeLa cell morphology (blue, idr0012-A)[39] and HeLa Actinome (red, idr0008-B)[40] are displayed with linkages (gray) from StringDB[33]. To enable comparisons in Cytoscape, the human orthologs of S. pombe genes are used for the genes identified in idr0001-A (Supplementary Note). (b) Close-up view of network in a. Genes are listed in Supplementary Note.