| Literature DB >> 28350385 |
Kenneth Daily1, Shannan J Ho Sui2, Lynn M Schriml3, Phillip J Dexheimer4, Nathan Salomonis4, Robin Schroll5, Stacy Bush5, Mehdi Keddache6, Christopher Mayhew7, Samad Lotia8, Thanneer M Perumal1, Kristen Dang1, Lorena Pantano2, Alexander R Pico8, Elke Grassman9, Diana Nordling10, Winston Hide2,11,12, Antonis K Hatzopoulos13, Punam Malik3, Jose A Cancelas5, Carolyn Lutzko5, Bruce J Aronow4, Larsson Omberg1.
Abstract
The use of induced pluripotent stem cells (iPSC) derived from independent patients and sources holds considerable promise to improve the understanding of development and disease. However, optimized use of iPSC depends on our ability to develop methods to efficiently qualify cell lines and protocols, monitor genetic stability, and evaluate self-renewal and differentiation potential. To accomplish these goals, 57 stem cell lines from 10 laboratories were differentiated to 7 different states, resulting in 248 analyzed samples. Cell lines were differentiated and characterized at a central laboratory using standardized cell culture methodologies, protocols, and metadata descriptors. Stem cell and derived differentiated lines were characterized using RNA-seq, miRNA-seq, copy number arrays, DNA methylation arrays, flow cytometry, and molecular histology. All materials, including raw data, metadata, analysis and processing code, and methodological and provenance documentation are publicly available for re-use and interactive exploration at https://www.synapse.org/pcbc. The goal is to provide data that can improve our ability to robustly and reproducibly use human pluripotent stem cells to understand development and disease.Entities:
Mesh:
Year: 2017 PMID: 28350385 PMCID: PMC5369318 DOI: 10.1038/sdata.2017.30
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Integrated workflow for stem cell characterization and data integration.
The types of cell lines evaluated by the PCBC Cell Characterization Core, assays and data processing pipelines are indicated. Numbers in parentheses indicate number of distinct cell line samples, except at assay level, where they indicate the number of individual assays (including replicates).
High level organization of available raw and processed data folders on Synapse for PCBC samples.
| mRNA | syn1773112 | Raw fastq |
| syn1773111 | Tophat BAM | |
| syn2246521 | Cufflinks FPKM | |
| syn3270268 | eXpress | |
| syn2822494 | htseq-count | |
| syn2247799 | Summarized FPKM matrix | |
| syn2247543 | Summarized eXpress matrices | |
| miRNA | syn2247098 | Raw fastq |
| syn2247164 | mirExpress | |
| syn2247832 | Summarized mirExpress counts matrix | |
| syn5014443 | Summarized seqBuster counts matrix | |
| methylation | syn2653626 | Raw idat |
| syn2233188 | Summarized beta value matrix |
Figure 2Principal components analysis of mRNA, miRNA, and methylation features.
The first two PCs are shown, with sample points colored by the differentiation state.
Figure 3Screenshot of the interactive data explorer of the PCBC data.
The mRNA, miRNA, and methylation data, along with the associated metadata, are queryable. Users can also search across assays (for example, miRNAs that may target genes). Data can be downloaded for further use.