| Literature DB >> 32442913 |
Kanika Kanchan1, Kruthika Iyer1, Lisa R Yanek1, Ivan Carcamo-Orive2, Margaret A Taub3, Claire Malley1, Kristin Baldwin4, Lewis C Becker1, Ulrich Broeckel5, Linzhao Cheng1, Chad Cowan6, Matteo D'Antonio7, Kelly A Frazer7, Thomas Quertermous2, Gustavo Mostoslavsky8, George Murphy8, Marlene Rabinovitch2, Daniel J Rader9, Martin H Steinberg10, Eric Topol11, Wenli Yang12, Joshua W Knowles2, Cashell E Jaquish13, Ingo Ruczinski3, Rasika A Mathias1.
Abstract
Human induced pluripotent stem cell (hiPSC) lines have previously been generated through the NHLBI sponsored NextGen program at nine individual study sites. Here, we examined the structural integrity of 506 hiPSC lines as determined by copy number variations (CNVs). We observed that 149 hiPSC lines acquired 258 CNVs relative to donor DNA. We identified six recurrent regions of CNVs on chromosomes 1, 2, 3, 16 and 20 that overlapped with cancer associated genes. Furthermore, the genes mapping to regions of acquired CNVs show an enrichment in cancer related biological processes (IL6 production) and signaling cascades (JNK cascade & NFκB cascade). The genomic region of instability on chr20 (chr20q11.2) includes transcriptomic signatures for cancer associated genes such as ID1, BCL2L1, TPX2, PDRG1 and HCK. Of these HCK shows statistically significant differential expression between carrier and non-carrier hiPSC lines. Overall, while a low level of genomic instability was observed in the NextGen generated hiPSC lines, the observation of structural instability in regions with known cancer associated genes substantiates the importance of systematic evaluation of genetic variations in hiPSCs before using them as disease/research models.Entities:
Keywords: GWAS, VanillaICE; Oncogenes; Structural integrity; Tumor suppressor genes; hiPSCs
Mesh:
Substances:
Year: 2020 PMID: 32442913 PMCID: PMC7575060 DOI: 10.1016/j.scr.2020.101803
Source DB: PubMed Journal: Stem Cell Res ISSN: 1873-5061 Impact factor: 2.020
Genomic structural integrity of the NextGen hiPSC lines. Out of 506 hiPSC lines, 149 lines acquired CNVs during reprogramming/passaging.
| Site | Donor | hiPSC | Pairs (N) | hiPSC lines with | Acquired |
|---|---|---|---|---|---|
| Site1 | 132 | 132 | 132 | 2 | 2 |
| Site2 | 7 | 7 | 7 | 0 | 0 |
| Site 3 | 1 | 1 | 1 | 1 | 1 |
| Site4 | 171 | 171 | 171 | 104 | 203 |
| Site5 | 46 | 83 | 83 | 21 | 26 |
| Site6 | 9 | 9 | 9 | 2 | 2 |
| Site7 | 49 | 60 | 60 | 7 | 7 |
| Site8 | 0 | 0 | 0 | 0 | 0 |
| Site9 | 38 | 43 | 43 | 12 | 17 |
| Total | 453 | 506 | 506 | 149 | 258 |
>1 hiPSC line for some donor DNA.
Fig. 1.Characteristics of the Copy Number Variants in NextGen hiPSC lines. [A] Histogram showing the number of hiPSC lines with number of detected CNVs. [B] Histogram showing the cumulative size of CNVs in mega base pairs (Mb) per hiPSC line. Black dashed line shows that cumulative CNV coverage for 85% hiPSC lines was less than 2 Mb. [C] Distribution of 258 CNVs acquired by hiPSC lines across the genome. The chromosomal regions harboring cluster of CNVs are indicated.
Fig. 2.Gene set enrichment analysis for the set of 1,395 genes mapping to 258 CNVs from 149 hiPSC lines for enrichment with regards to biological processes. [A] Fold enrichment of the 22 GO Terms with an FDR of <5% (q < 0.05) are displayed, q values <0.01 and <0.001 are shown as “*” and “**”, respectively. [B] Sequential Gene Set Enrichment of 4 biological process: I-kappaB kinase/NF-kappaB signaling cascade (GO:0007249), JNK cascade (GO:0007254), Cytokine production (GO:0001816) and IL-6 production (GO:0032635). Input set 1/2/3/4 and 5 include gene sets mapping to regions ≥1, ≥2, ≥3, ≥4 and ≥5 acquired CNVs in lines. The FE and FDR-P values increase markedly from input set1 through to input set5. [C] Sequential GSEA for gene sets mapping to regions of acquired Dels. Input set 1/2/3 and 4 include gene sets mapping to regions of ≥1, ≥2, ≥3 and ≥4 acquired Dels in hiPSCs. The FE and FDR-P values are higher for gene sets mapping to regions of Dels than all CNVs combined.
Fig. 3.Chromosomal regions harboring clusters of CNVs. The chromosomal position of each CNV cluster is shown in a red box over the ideogram. The amplifications are represented as blue bars and deletions are shown in red bars. Dashed vertical lines delineate the core overlap regions, and in some cases nearby CNVs within the same genomic locus are also shown. The bottom panel shows the genes lying in the overlap region that could be potentially affected by the CNV in the hiPSC lines.
Fig. 4.Expression levels of the transcripts mapping to the core overlap region of chr20 amplification. The chromosomal position of the amplification is shown in a red box over the ideogram. hiPSC lines harboring chr20 amplification are shown as blue bars and dashed vertical lines delineate the core overlap region. The bottom panel shows the fold changes comparing carriers to non-carries of the genes mapping to the overlap region, and their respective 95% confidence interval. Statistically significant up-regulation is observed for the genes marked in red. The expression of each gene is drawn positionally to the center of the gene.
Details of sibling hiPSC lines. 25 hiPSC lines acquired CNVs as compared to their corresponding donor cell.
| Site | Donor | iPSC | Pairs (N) | hiPSC lines with | Acquired |
|---|---|---|---|---|---|
| Site5 | 37 | 74 | 74 | 17 | 22 |
| Site7 | 11 | 22 | 22 | 2 | 2 |
| Site9 | 5 | 10 | 10 | 6 | 11 |
| Total | 53 | 106 | 106 | 25 | 35 |