| Literature DB >> 27283945 |
Behnam Ahmadian Baghbaderani1, Adhikarla Syama2, Renuka Sivapatham3, Ying Pei4, Odity Mukherjee2, Thomas Fellner1, Xianmin Zeng3,4, Mahendra S Rao5,6.
Abstract
We have recently described manufacturing of human induced pluripotent stem cells (iPSC) master cell banks (MCB) generated by a clinically compliant process using cord blood as a starting material (Baghbaderani et al. in Stem Cell Reports, 5(4), 647-659, 2015). In this manuscript, we describe the detailed characterization of the two iPSC clones generated using this process, including whole genome sequencing (WGS), microarray, and comparative genomic hybridization (aCGH) single nucleotide polymorphism (SNP) analysis. We compare their profiles with a proposed calibration material and with a reporter subclone and lines made by a similar process from different donors. We believe that iPSCs are likely to be used to make multiple clinical products. We further believe that the lines used as input material will be used at different sites and, given their immortal status, will be used for many years or even decades. Therefore, it will be important to develop assays to monitor the state of the cells and their drift in culture. We suggest that a detailed characterization of the initial status of the cells, a comparison with some calibration material and the development of reporter sublcones will help determine which set of tests will be most useful in monitoring the cells and establishing criteria for discarding a line.Entities:
Keywords: Consent; Embryonic stem cells; Induced pluripotent stem cells; Manufacturing; Markers; cGMP
Mesh:
Substances:
Year: 2016 PMID: 27283945 PMCID: PMC4919381 DOI: 10.1007/s12015-016-9662-8
Source DB: PubMed Journal: Stem Cell Rev Rep ISSN: 2629-3277 Impact factor: 5.739
Assays used to characterize the iPSC lines
| Assay | Objective | Evaluation Criteria | Category | Tested iPSC Line |
|---|---|---|---|---|
| Assay Release | ||||
| Pluripotency Markers | Identity & Purity | SSEA-4 > 70 %, Tra-1-60 > 70 %, Tra-1-81 > 70 %, Oct3/4 > 70 %; Purity: CD34 < 5 % | Release assay | All lines |
| Karyotype Analysis | Safety | 46, XX or 46, XY | Release assay | All Lines |
| Mycoplasma Testing | Safety | Negative | Release assay | All Lines |
| Sterility Testing | Safety | Negative | Release assay | All Lines |
| Endotoxin Testing | Safety | Standard QC release (<0.5 EU/ml) | Release assay | All lines |
| Vector Clearance | Safety | No trace of episomal plasmid DNA detected | Release assay | All lines |
| STR Genotyping | Purity & Identity | STR Profile of starting population and iPSC line are identical | Release assay | Al Lines |
| Cell Count & Viability | Viability | % viability >50; minimum cell number/vial | Release Assay | All Lines |
| Viral Panel Testing | Safety | Standard MCB Release Panel | Release Assay | LiPSC-GR1.1 |
| Characterization Assays | ||||
| EB Formation | Identity & Potency | Detection of at least one marker per germ layer | FIO* | All lines |
| Gene Array Analysis | Identity | Clustering with established hPSCs | FIO* | All Lines |
| Colony morphology | Identity & Purity | Characteristic morphology of culture/colonies; lack of spontaneously differentiated cells | FIO* | All lines |
| Post-thaw Plating | Thawing efficiency and Viability | 20+ colonies / vial (after 7 days or 50 % confluency) | FIO* | All Lines |
| HLA Typing | Identity | HLA-A, B, C, DRB1 and DQB1 | FIO* | All lines |
| Type | ||||
| CGH + SNP microarray | Identity | Amplifications and/ or deletions of specific genes | FIO* | LiPSC-GR1.1 and ER2.2 |
| Whole Genome Sequencing | Identity | HiSeq X Human Whole Genome | FIO* | LiPSC-GR1.1 and ER2.2 |
| Sequence | ||||
Table summarizes the tests that were performed on the three engineering run lines and the two cGMP lines (all). Note that the three engineering lines were generated at different times from the same donor sample (Female), while the two cGMP lines were generated from a different donor (Male)
• For Information Only (FIO)
Fig. 1Generation, expansion, and characterization of human iPSCs (LiPSC-ER2.1) - engineering runs. Panel a illustrates priming of CD34+ Cells isolated from cord blood unit and expanded in culture on day 3 prior to the nucleofection (Priming), iPSC colony emerged on day 9 post nucleofection (D9 Post-Nucleofection), iPSCs at passage 6 (P6 Colonies), and iPSCs at passage 18 (P18 culture). Panel b illustrates iPSCs positively stained with OCT4, TRA-1-60, SSEA4, NANOG, TRA-1-81, and alkaline phosphatase (AP). Panel c shows the iPSCs expressing the pluripotent stem cell surface markers SSEA4, TRA-1-60, and TRA-1-81 (dark blue). Light blue exhibits the isotype control. Panel d shows iPSCs differentiated into embryoid bodies and readily expressed the markers for early ectoderm (TUJ1), endoderm (Alpha-Feto Protein (AFP)), and mesoderm (Smooth Muscle Actin (SMA)). DAPI shows the nuclei stain in blue. The iPSCs demonstrated normal karyotype after 17 passages (e). STR analysis showed that the iPSCs matched the starting CD34+ donor sample (f). Scale bar in all images in Panel a is 500 microns except the Priming image which is 250 microns. Scale bar in all images in Panel b is 250 microns except the AP image which is 500 microns. Scale bar in all images in Panel D is 125 microns
Fig. 2Human iPSC manufacturing process flow diagram with in process testing of samples. The process for manufacturing of human iPSCs under defined and cGMP conditions include (1) isolation of CD34+ cells from fresh cord blood unit, (2) priming CD34+ cells for 4 days, (3) reprogramming of CD34+ cells into iPSCs using 4D Nucleofector system and an episomal based technology, (4) isolation of about 9 iPSC colonies, serial subculturing of iPSCs up to 6 passages, (5) in process cryopreservation of all iPSC colonies to select the two best iPSC colonies based on the results of in process control IP-QC3, (6) expansion of two selected iPSC colonies and confirmation of plasmid clearance, (7) further expansion into large tissue culture flasks, (8) banking, and (9) final characterizations and QC testing. Multiple in-process samples (shown in green) were taken at different stages of the process and submitted for relevant testing. Each in-process test has been described in the figure. The number of in-process samples exhibits the number of tests carried out at each step
iPSC lines generated by Lonza: STR, HLA, and donor information
| Line | M/F | STR | HLA Type | ABO/RH+/− | Ethnicity |
|---|---|---|---|---|---|
| LiPSC-GR1.1 | M | Amelogenin (X, Y); vWA (16); D8S1179 (13, 14); TPOX (8); FGA (18, 25): D3S1358 (15); THO1 (6, 9.3); D21S11 (31.2, 33.2); D18S51 (14); Penta E (7, 11); D5S818 (11); D13S317 (9, 11); D7S820 (10, 11); D16S539 (11, 13); CSF1PO (12), Penta D (11, 13) | A*02/*03, B*07/*39, C*07/*15 DRB1*04/*08,DQB1*03/*04 | O+ | Hispanic |
| LiPSC-GR1.2 | |||||
| LiPSC-ER2.1 | F | Amelogenin (X); vWA (16); D8S1179 (14, 15); TPOX (8, 9); FGA (22, 23): D3S1358 (16, 18); THO1 (8, 9.3); D21S11 (24.2, 30.2); D18S51 (12); Penta E (10); D5S818 (11, 13); D13S317 (11); D7S820 (8, 10); D16S539 (11, 12); CSF1PO (10, 13), Penta D (9, 14) | A*03/*24, B*27/*27, C*01/*05 DRB1*01/*11,DQB1*03/*05 | Inferred A | Caucasian |
| LiPSC-ER2.2 | |||||
| LiPSC-ER2.3 |
Table summarizes the information for identity and HLA typing and ethnic background that was collected. This data does not distinguish between clones from the same individual, and tracing and tracking of clones represents an issue that will require a solution to complement the barcoding and physical tracking methods we recommend
aCGH-SNP analysis of the iPSCs Manufactured during cGMP runs
| Chromosome | Start | Stop | Genes | Cytoband | LiPSC GR-1.1 | LiPSC GR-1.2 |
|---|---|---|---|---|---|---|
| List A: Common changes observed in GR-1.1 and GR-1.2 | ||||||
| chr2 | 45,168,836 | 45,171,902 | SIX3 | P21 | Present | Present |
| chr15 | 34,695,166 | 34,841,446 | GOLGA8B, GOLGAA8A, MIR1233-1, MIR1233-2 | q14 | Present | Present |
| chr16 | 31,959,074 | 33,773,134 | HERC2P4, LOC390705, TP53TG3, TP53TG3B, LOC653550, SLC6A10P | P11.2 | Present | Present |
| chr20 | 57,463,534 | 57,464,754 | GNAS | q13.32 | Present | Present |
| chrX | 130,813,232 | 131,201,564 | LOC286467, MST4 | q26.2 | Present | Present |
| chrY | 17,130,014 | 17,630,471 | q11.221 | Present | Present | |
| Total No. of common aCGH | 6 | |||||
| List B. Non overlapping changes observed GR-1.1 and GR-1.2 | ||||||
| chr2 | 81,631,218 | 84,380,876 | LOC1720 | p12 - p11.2 | Present | |
| chr2 | 166,180,491 | 166,815,270 | SCN2A, CSRNP3, GALNT3, TTC21B | q24.3 | Present | |
| chr3 | 46,620,840 | 46,622,617 | TDGF1 | p21.31 | Present | |
| chr5 | 124,645,064 | 128,776,611 | GRAMD3, ALDH7A1, PHAX, C5orf48, LMNB1, MARCH3, FLJ44606, MEGF10, PRRC1, CTXN3, FLJ33630, SLC12A2, FBN2, SLC27A6, ISOC1 | q23.2 - q23.3 | Present | |
| chr6 | 47,311,409 | 54,276,380 | CD2AP, GPR111, GPR115, OPN5, C6orf138, MUT, CENPQ, GLYATL3, C6orf141, RHAG, CRISP2, CRISP3, PGK2, CRISP1, DEFB133, DEFB114, DEFB113, DEFB110, DEFB112, TFAP2D, TFAP2B, PKHD1, MIR206, MIR133B, IL17A, IL17F, MCM3, PAQR8, EFHC1, TRAM2, LOC730101, TMEM14A, GSTA7P, GSTA2, GSTA1, GSTA5, GSTA3, GSTA4, ICK, FBXO9, GCM1, ELOVL5, GCLC, KLHL31, LRRC1, C6orf142, TINAG | p12.3 - p12.1 | Present | |
| chr7 | 155,596,206 | 155,601,974 | SHH | q36.3 | Present | |
| chr10 | 114,549,196 | 117,896,129 | VTI1A, LOC143188, TCF7L2, HABP2, NRAP, CASP7, C10orf81, DCLRE1A, NHLRC2, ADRB1, C10orf118, MIR2110, TDRD1, VWA2, AFAP1L2, ABLIM1, FAM160B1, TRUB1, ATRNL1, GFRA1 | q25.2 - q25.3 | Present | |
| chr11 | 19,664,494 | 20,906,873 | NAV2, LOC100126784, DBX1, HTATIP2, PRMT3, SLC6A5, NELL1 | p15.1 | Present | |
| chr12 | 19,001,106 | 19,925,941 | PLEKHA5, AEBP2 | p12.3 | Present | |
| chr19 | 50,816,568 | 51,294,521 | KCNC3, NAPSB, NAPSA, NR1H2, POLD1, SPIB, MYBPC2, FAM71E1, C19orf63, JOSD2, ASPDH, LRRC4B, SNAR-F, SYT3, LOC342918, SHANK1, CLEC11A, GPR32, ACPT | q13.33 | Present | |
| chrX | 306,955 | 329,692 | PPP2R3B | p22.33 | Present | |
| Total No. of aCGH | 5 | 6 | ||||
aCGH-SNP analysis of the iPSCs GR1.1 and GR1.2 which are clones. Panel A identifies common changes detected and Panel B lists non- overlapping changes. The genes are color coded; Red- Known to be affected in disease. Black –Not reported as affected In database . Teal in a pathway related to diseaseNote the analysis was done at the same time on the same run. The clones were manufactured in two separate runs
aCGH-SNP analysis of the iPSCs Manufactured during the Engineering Runs. aCGH-SNP analysis of the iPSCs ER2.1, ER2.2, ER2.3 which are clones. Panel A identifies common changes detected and Panel B lists non- overlapping changes. The genes are color coded; Red- Known to be affected in disease. Black –Not reported as affected In database . Teal in a pathway related to disease. Note the analysis was done at the same time for ER 2.2 and ER 2.3 and on a separate run for ER2.1. on the same run. The clones were manufactured in three separate runs
| Chromosome | Start | Stop | Overlap | Genes | Cytoband | LiPSC ER-2.1 | LiPSC ER-2.2 | LiPSC ER-2.3 |
|---|---|---|---|---|---|---|---|---|
| A. Common aCGH between LiPSC ER-2.1, LiPSC ER-2.2, and LiPSC ER-2.3 lines | ||||||||
| chr16 | 34,482,042 | 34,743,643 | 100 | LOC283914, LOC146481, LOC100130700 | p11.2 - p11.1 | Present | Present | Present |
| Total No. of Common aCGH | 1 | |||||||
| B. Uncommon aCGH between LiPSC ER-2.1, LiPSC ER-2.2, and LiPSC ER-2.3 lines | ||||||||
| chr2 | 192,581,125 | 199,052,600 | SDPR, TMEFF2, PCGEM1, SLC39A10, DNAH7, STK17B, HECW2, CCDC150, LOC100130452, GTF3C3, C2orf66, PGAP1, ANKRD44, SF3B1, COQ10B, HSPD1, HSPE1, HSPE1-MOBKL3, MOBKL3, RFTN2, MARS2, BOLL, PLCL1 | q32.3 - q33.1 | Present | |||
| chr2 | 193,974,618 | 199,390,001 | SLC39A10, DNAH7, STK17B, HECW2, CCDC150, LOC100130452, GTF3C3, C2orf66, PGAP1, ANKRD44, SF3B1, COQ10B, HSPD1, HSPE1, HSPE1-MOBKL3, MOBKL3, RFTN2, MARS2, BOLL, PLCL1 | q32.3 - q33.1 | Present | |||
| chr4 | 45,882 | 68,211 | ZNF595, ZNF718 | p16.3 | Present | |||
| chr4 | 62,069,285 | 63,216,102 | LPHN3 | q13.1 | Present | |||
| chr5 | 145,468,046 | 147,372,489 | PLAC8L1, LARS, RBM27, POU4F3, TCERG1, GPR151, PPP2R2B, STK32A, DPYSL3, JAKMIP2, SPINK1, SCGB3A2, C5orf46 | q32 | Present | |||
| chr5 | 146,215,672 | 147,372,489 | PPP2R2B, STK32A, DPYSL3, JAKMIP2, SPINK1, SCGB3A2, C5orf46 | q32 | Present | |||
| chr5 | 156,407,838 | 156,619,217 | HAVCR1, HAVCR2, MED7, FAM71B, ITK | q33.3 | Present | |||
| chr6 | 296,674 | 418,057 | DUSP22, IRF4 | p25.3 | Present | |||
| chr6 | 296,674 | 462,491 | DUSP22, IRF4 | p25.3 | Present | |||
| chr7 | 82,012,870 | 85,939,933 | CACNA2D1, PCLO, SEMA3E, SEMA3A, SEMA3D | q21.11 | Present | |||
| chr7 | 121,256,234 | 122,561,644 | PTPRZ1, AASS, FEZF1, LOC154860, CADPS2, RNF133, RNF148 | q31.32 | Present | |||
| chr8 | 7,220,322 | 7,752,586 | ZNF705G, DEFB4B, DEFB103B, DEFB103A, SPAG11B, DEFB104B, DEFB104A, DEFB106B, DEFB106A, DEFB105B, DEFB105A, DEFB107A, DEFB107B, FAM90A7, FAM90A14, FAM90A13, FAM90A19, FAM90A18, FAM90A8, FAM90A9,FAM90A10, SPAG11A, DEFB4A | p23.1 | Present | |||
| chr9 | 129,453,130 | 129,725,863 | LMX1B, ZBTB43, ZBTB34, RALGPS1 | q33.3 | Present | |||
| chr9 | 138,157,660 | 138,283,617 | q34.3 | Present | ||||
| chr11 | 2,016,471 | 2,024,343 | H19, MIR675 | p15.5 | Present | |||
| chr11 | 26,859,414 | 28,019,072 | FIBIN, BBOX1, CCDC34, LGR4, LIN7C, BDNF-AS1, BDNF | p14.2 - p14.1 | Present | |||
| chr11 | 67,374,414 | 67,378,114 | NDUFV1 | q13.2 | Present | |||
| chr12 | 19,039,306 | 20,040,127 | PLEKHA5, AEBP2 | p12.3 - p12.2 | Present | |||
| chr13 | 54,124,303 | 58,474,271 | MIR1297, PRR20B, PRR20C, PRR20D, PRR20E, PRR20A, PCDH17 | q14.3 - q21.1 | Present | |||
| chr14 | 107,145,681 | 107,182,658 | q32.33 | Present | ||||
| chr15 | 22,587,129 | 22,849,189 | GOLGA8DP, GOLGA6L1, TUBGCP5 | q11.2 | Present | |||
| chr20 | 57,463,534 | 57,464,754 | GNAS | q13.32 | Present | Present | ||
| chr22 | 19,749,830 | 19,758,306 | TBX1 | q11.21 | Present | |||
| chr22 | 51,063,432 | 51,110,017 | ARSA | q13.33 | Present | |||
| chrX | 306,955 | 319,352 | PPP2R3B | p22.33 | Present | |||
| chrX | 443,311 | 445,970 | p22.33 | Present | ||||
| chrX | 8,514,158 | 8,547,642 | KAL1 | p22.31 | Present | |||
| chrX | 53,449,448 | 53,459,515 | SMC1A, RIBC1, HSD17B10 | p11.22 | Present | |||
| Total No. of aCGH | 4 | 15 | 10 | |||||
Microarray Analysis for pluripotent cells
| Data Set analysis |
|---|
| QC Tests |
| Total reads |
| Intensity range |
| Distribution |
| No detected > 100 RFU |
| Normalization |
| Other Results |
| Overall profile (relatedness) |
| Chromosomal Bias |
| M/F |
| X chr inactivation |
| Pluritest |
| Pluripotent marker set |
| Contaminating cell detection |
| Incomplete programming |
| Gene Sets |
| MicroRNA |
| Mitochondrial profiling |
| Pathway analysis |
| Positional markers |
| Imprinted genes |
The table summarizes the routine QC tests and the results with GR1.1. Note that both gene subsets as well as examination of overall quality can be inferred using tests such as Pluritest, correlation co-efficients and unbiased hierarchical clustering. The data quality is improved by comparing to a database and with sufficient numbers of lines one can establish ranges for a comparability assay
Fig. 3Pluritest analysis of test and control samples. a. Model-based multi-class pluripotency score: pluripotency score between red lines indicates a 95 % pluripotency signature. Samples between blue line indicates 95 % of non-pluripotent samples. Line PR1.0, TR1.1, TR1.2, GR1.1, GR1.2, ER2.1, ER2.2, ER2.3, XCL1iPSC and H14 ESC are all localized between or close to the red lines, hence all pluripotent. The CD34+ cord blood line and ER2.2 NSC are (negative controls) are located between the blue lines demonstrates non-pluripotent signature. b. Novelty Score: This score is based on well-characterized pluripotent samples, color-coded green and non-pluripotent samples color-coded red. All iPSC and ES samples are in green demonstrates pluripotent samples, whereas CD34+ cord blood line and ER2.2 NSC demonstrates non-pluripotent color-code red. c. Hierarchical Clustering of vst-transformed samples: Samples were transformed using variance stabilizing transformation (VST) and robust spline normalization _ENREF_35[40]. Distance on x -axis is based on Pearson correlations. d. Overview: This combines novelty score on X-axis and Pluripotency score on y-axis. The red background, where the iPSC and ESC samples are located, suggest the empirical distribution of pluripotent cells. CD34+ cord blood line and ER2.2 NSC are located closer to the non-pluripotent background colored blue
Fig. 4Whole Genome analysis conducted on two iPSC lines generated using the cGMP compliant manufacturing process. a. WGS data characterisation pipeline. This figure outlines the work flow followed in this study. The filters applied at various stages have been mentioned in the methods section. The fastq files were aligned using Isaac aligner to generate bam file. The bam file generated was checked for its mapping quality using samstat. The fastq files were also used for prediction of HLA types using HLAVBseq. The variants called by Issac variant caller was annotated using SnpEff and this data was used for predicting blood groups using BOOGIE. Only non-synonymous variants were considered for its implication with PD and cross validated with expression data. Structural variation (deletions and duplications) were filtered (see methods) and annotated for genes using UCSC. This data was cross validated with expression and microarray data. The differentially expressed genes were verified for gene enrichment relevant to disease and pathway through DAVID. b. The x-axis shows the various chromosomes and Y-axis represents the max-depth computed by Issac Variant Caller across each chromosome on a log scale for both the cell lines under study. c. Bar graph representing number of variants identified for SNPs, small Insertions, small deletions, synonymous, non-synonymous variants, CNVs and different types of structural variants including duplications, large insertions (length > 50), large deletions (length > −50), inversions and translocations. Deletions were higher in number than other types of SVs
Whole Genome Analysis for pluripotent cells
| Data Set analysis |
|---|
| QC Tests |
| Total reads |
| Unmapped reads |
| Mapped reads |
| Chr. coverage |
| Error Rate |
| Other Results |
| Mutations detected |
| Mitochondrial Seq |
| Comparison with CGH |
| STR verification |
| Distinguishing alleles |
| HLA |
| Minor Blood groups |
| Other transplant antigens |
| Other gene subsets |
| Plasmid/Viral Insertion sites |
| Utilizing unmapped data |
| Mycoplasma |
| Other infectious agents |
| Foreign DNA |
The table summarizes the routine QC tests and the results with GR1.1. Note that both gene subsets as well as examination of overall quality can be inferred using tests such as Pluritest, correlation co-efficients and unbiased hierarchical clustering. The data quality is improved by comparing to a database and with sufficient numbers of lines one can establish ranges for a comparability assay
Basic QC analysis of WGS data. Basic QC and mapping quality summary statistics for both the cell lines computed using samtools and Samstat
| QC anlysis Summary | LiPSC-ER2.2 | LiPSC-GR1.1 |
|---|---|---|
| Total reads | 840146878 | 898394686 |
| Total reads mapped | 33656253 | 768473456 |
| Mean depth | 34.49 | 30.83 |
| deduplicated reads | 90375272 | 157504614 |
| Total reads after removing duplicates (%) | 749771606 (89.2 %) | 740890072 (82.5 %) |
| Reads mapped to human genome | 693727158 | 613291081 |
| MAPQ > =30 | 86.9 % | 79.6 % |
| MAPQ <30 | 0.1 % | 0.0 % |
| MAPQ <20 | 0.5 % | 0.5 % |
| MAPQ <10 | 0.7 % | 0.7 % |
| MAPQ <3 | 0.3 % | 0.3 % |
Fig. 5Extracting blood type and HLA from the whole genome sequencing data. a. Table showing the blood group predicted by BOOGIE for four different blood group systems namely ABO, Rh, MN and Rh associated glycoprotein with the score for both the cell lines. The validity of this approach is provided as means of comparison with the actual available donor information. b. Table showing HLA types predicted in-silico by HLAVBseq using WGS data for both the cell lines along with their depth. The predicted HLA type obtained using the WGS data is compared with the results obtained using the HLA AssureTM SE SBT kit
Identification of PD implicated genes from WGS. The table shows the genes that are identified by WGS, having damaging SIFT and polyphen scores that are implicated in PD. These genes were cross verified for the changes in their RNA expression
| SAMPLE | SYMBOL | CHR | POS | SNP ID | SIFT | POLYPHEN 2 | 1000Gp3_EUR_AF | ER1.2 (F) | GR1.1(M) | H9(F) | H7(F) | NCRM6(F) | MIM_disease |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LiPSC-ER2.2 | SYNJ1 | 21 | 34059352 | rs2254562 | 0.43 | 0.629 | 0.29 | 56 | 69 | 65 | 61 | [MIM:615530]Parkinson disease | |
| EIF4G1 | 3 | 184039151 | rs367754765 | 0.912 | 0.647 | 1483 | 1740 | 1426 | 1404 | [MIM:614251]Parkinson disease | |||
| LiPSC-GR1.1 | MAPT | 17 | 44060775 | rs63750417 | 0.91 | 0.72 | 0.24 | 4 | 37 | 26 | 10 | [MIM:600274]Parkinson Disease | |
| SYNJ1 | 21 | 34059352 | rs2254562 | 0.43 | 0.63 | 0.29 | 84 | 69 | 65 | 61 | [MIM:615530]Parkinson disease | ||
| MAPT | 17 | 44061278 | rs17651549 | 0.91 | 0.72 | 0.24 | 4 | 37 | 26 | 10 | [MIM:260540]Parkinson-dementia syndrome |
* These variants did not show its signature at Clinvar database
Cross-validation of the structural variants identified across various platforms: WGS, Microarray, and SNP analyses
| Microarry platform | EVENT | ovelap with WGS-unfiltered | overlap with expression data | H9_H7 | NCRM6 | |
|---|---|---|---|---|---|---|
| LiPSC-ER2.2 | IRF4 | DEL | no | yes | 0.27 | 0 |
| DUSP22 | DEL | no | yes | 0.64 | 0 | |
| LOC283914 | AMP | no | yes | - | −1 | |
| LOC146481 | AMP | no | yes | −4 | 0 | |
| LOC10013070 | AMP | no | yes | 2.85 | - | |
| PPP2R3B | AMP | no | yes | 1.3 | 0 | |
| LiPSC-GR1.1 | SIX3 | GAIN | yes | 1 | 4 | |
| SCN2A | GAIN | yes | 2 | - | ||
| GALNT3 | GAIN | yes | 0 | 3 | ||
| TTC21B | GAIN | yes | −13 | 1 | ||
| CSRNP3 | GAIN | yes (but as del) | yes | 0 | 3 | |
| TDGF1 | GAIN | yes | 0 | −1 | ||
| NAV2 | GAIN | yes | 0 | 1 | ||
| HTATIP2 | GAIN | yes | 0 | 0 | ||
| SLC6A5 | GAIN | yes | - | - | ||
| NELL1 | GAIN | yes | 1 | 0 | ||
| LOC100126784 | GAIN | yes | 1 | 0 | ||
| DBX1 | GAIN | yes | −1 | - | ||
| PRMT3 | GAIN | yes | 0 | 0 | ||
| PLEKHA5 | GAIN | yes | 0 | 0 | ||
| AEB2 | GAIN | |||||
| GOLGA8B | LOSS | yes | −2 | 0 | ||
| GOLGA8A | LOSS | yes | - | - | ||
| MIR1233-1 | LOSS | |||||
| MIR1233-2 | LOSS | |||||
| HERC2P4 | LOSS | yes | 1 | 2 | ||
| LOC390705 | LOSS | yes | 0 | 1 | ||
| TP53TG3 | LOSS | yes | 0 | 2 | ||
| TP53TG3B | LOSS | |||||
| LOC653550 | LOSS | yes | - | −1 | ||
| SLC6A10P | LOSS | yes | −1 | −1 | ||
| GNAS | GAIN | yes | 0 | 0 | ||
| PPP2R3B | GAIN | yes | yes | 0 | 1 | |
| LOC286767 | GAIN | |||||
| MST4 | GAIN | yes | −1 | 0 |
Structural variants identified by WGS and its enrichment in any disease related pathways (if any)
| Event | No. after filter | Max size | Min size | no ofgenes log2 ratio >1.5 with H7_H9 | no of genes log2 ratio >1.5 with NCRM6 | overlap if any* | DAVID-OMIM disease GE | DAVID- pathway GE | |
|---|---|---|---|---|---|---|---|---|---|
| LiPSC-ER2.2 | DEL | 653 | 16797153 | 89 | 11 | 18 | no | no | no |
| DUP | 58 | 387 | 128375322 | 28 | 53 | 6 | Chemdependency (Genetic_Association_DB_Disease_Class): HTR6, GRIK3, OPRD1 | Eicosanoid Metabolism (BIOCARTA)-CYP2J2,PTGER3 | |
| LiPSC-GR1.1 | DEL | 555 | 16797153 | 122 | 24 | 20 | 4 | no | T Cytotoxic Cell Surface Molecules, T Helper Cell Surface Molecules(BIOCARTA)-CD3D, THY1 |
| DUP | 58 | 17376 | 462 | 0 | 0 | no | NA | NA |
Summary of imprinted genes identified by WGS
| Imprinted genes | Status | Expressed allele | LiPSC-ER2.2 | LiPSC-GR1.1 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Allelic depth | SNP-HET | INS-HET | DEL-HET | Allelic depth | SNP-HET | INS-HET | DEL-HET | |||
| ABCC9 | Predicted | Maternal | 27.4 | 50 | 2 | 3 | 26.0 | 54 | 5 | 5 |
| ABCG8 | Predicted | Maternal | 27.0 | 73 | 4 | 5 | 25.8 | 65 | 3 | 3 |
| ACD | Predicted | Maternal | 26.5 | 2 | 23.0 | 1 | ||||
| ADAMTS16 | Predicted | Maternal | 28.0 | 254 | 20 | 23 | 26.0 | 319 | 20 | 24 |
| AIM1 | Imprinted | Paternal | 29.3 | 82 | 4 | 10 | 23.4 | 54 | 4 | |
| ALDH1L1 | Predicted | Maternal | 25.5 | 148 | 3 | 9 | 25.9 | 47 | 6 | 6 |
| ANO1 | Imprinted | Maternal | 22.8 | 42 | 1 | 2 | 26.0 | 1 | ||
| APBA1 | Predicted | Paternal | 29.0 | 118 | 8 | 11 | 27.5 | 8 | 4 | |
| ATP10A | Imprinted | Maternal | 25.3 | 256 | 12 | 18 | 24.0 | 203 | 10 | 20 |
| B4GALNT4 | Predicted | Maternal | 24.3 | 17 | 2 | 1 | 23.4 | 21 | 3 | 1 |
| BLCAP | Imprinted | Isoform Dependent | 23.8 | 12 | 5 | 26.5 | 4 | 3 | 1 | |
| BMP8B | Predicted | Paternal | 23.4 | 3 | 22.4 | 10 | 2 | |||
| BTNL2 | Predicted | Maternal | 27.8 | 34 | 2 | 3 | 25.5 | 80 | 4 | 6 |
| C10orf91 | Predicted | Maternal | 25.5 | 1 | 35.0 | |||||
| C20orf20 | Predicted | Maternal | 25.4 | 14 | 1 | 2 | 27.8 | 14 | 1 | 2 |
| C9orf116 | Predicted | Paternal | 18.9 | |||||||
| C9orf85 | Predicted | Paternal | 27.9 | 3 | 26.8 | |||||
| CCDC85A | Predicted | Paternal | 27.1 | 245 | 19 | 14 | ||||
| CDH18 | Predicted | Paternal | 28.3 | 893 | 83 | 94 | 25.6 | 244 | 28 | 29 |
| CDK4 | Predicted | Maternal | 24.3 | 1 | 2 | 21.8 | 3 | 1 | ||
| CDKN1C | Imprinted | Maternal | 27.0 | 1 | 15.0 | |||||
| CHMP2A | Predicted | Maternal | 31.0 | 1 | 23.5 | 4 | ||||
| CHST8 | Predicted | Maternal | 24.3 | 96 | 4 | 6 | 25.1 | 121 | 8 | 8 |
| COL9A3 | Predicted | Maternal | 23.1 | 50 | 2 | 2 | 21.9 | 34 | 6 | |
| CPA4 | Imprinted | Maternal | 25.9 | 2 | 22.5 | 8 | 1 | 1 | ||
| CSF2 | Predicted | Maternal | 19.0 | 1 | 27.0 | |||||
| CYP1B1 | Predicted | Paternal | 26.1 | 1 | 28.4 | 9 | 2 | |||
| DDC | Imprinted | Isoform Dependent | 26.3 | 190 | 11 | 9 | 25.6 | 138 | 5 | 8 |
| DGCR6 | Imprinted | Random | 23.2 | 2 | 24.1 | 8 | 1 | |||
| DGCR6L | Imprinted | Random | 22.6 | 22.1 | 6 | 1 | ||||
| DIRAS3 | Imprinted | Paternal | 15.0 | 1 | ||||||
| DLGAP2 | Imprinted | Paternal | 25.7 | 4 | 2 | 24.8 | 21 | 2 | 1 | |
| DLK1 | Imprinted | Paternal | 20.4 | 9 | 1 | 21.7 | ||||
| DLX5 | Imprinted | Maternal | 25.3 | 1 | 1 | 1 | 23.7 | 1 | 1 | |
| DNMT1 | Imprinted | Paternal | 23.2 | 36 | 1 | 9 | 23.0 | 50 | 3 | 5 |
| DVL1 | Predicted | Maternal | 26.3 | 1 | 1 | 23.6 | 3 | 1 | ||
| E2F7 | Predicted | Maternal | 30.2 | 28 | 29.6 | 23 | 1 | |||
| EGFL7 | Predicted | Paternal | 21.2 | 1 | 22.1 | 23 | ||||
| EVX1 | Predicted | Paternal | 24.0 | |||||||
| FAM50B | Imprinted | Paternal | 31.0 | 1 | 25.0 | 1 | ||||
| FBRSL1 | Predicted | Maternal | 23.7 | 128 | 7 | 8 | 20.5 | 10 | 3 | |
| FERMT2 | Predicted | Paternal | 29.0 | 6 | 28.2 | 86 | 11 | 3 | ||
| FGFRL1 | Predicted | Maternal | 21.6 | 19 | 1 | 2 | 22.9 | 17 | 4 | 2 |
| FOXF1 | Predicted | Maternal | 18.6 | 2 | 1 | 1 | 22.5 | 1 | ||
| FOXG1 | Predicted | Paternal | 27.8 | 54 | 4 | 25.0 | ||||
| FUCA1 | Predicted | Paternal | 27.7 | 1 | ||||||
| GAREM | Predicted | Paternal | 27.9 | 36 | 4 | 3 | ||||
| GATA3 | Predicted | Paternal | 24.3 | 28 | 4 | 1 | 21.0 | 9 | 1 | |
| GDAP1L1 | Imprinted | Paternal | 27.0 | 3 | 1 | 24.3 | 2 | 1 | ||
| GFI1 | Predicted | Paternal | 26.0 | 9 | 2 | 23.4 | 10 | 3 | ||
| GLI3 | Predicted | Maternal | 27.8 | 234 | 19 | 16 | 25.5 | 299 | 21 | 18 |
| GLIS3 | Imprinted | Paternal | 26.3 | 517 | 22 | 36 | 25.5 | 611 | 25 | 37 |
| GNAS | Imprinted | Isoform Dependent | 28.0 | 68 | 8 | 4 | 26.8 | 32 | 2 | 4 |
| GPT | Predicted | Maternal | 15.0 | 1 | 20.5 | |||||
| GRB10 | Imprinted | Isoform Dependent | 27.1 | 246 | 13 | 20 | 24.9 | 180 | 10 | 9 |
| H19 | Imprinted | Maternal | 21.5 | 6 | 22.2 | 6 | ||||
| HOXA11 | Predicted | Maternal | 23.1 | 4 | 1 | 24.8 | 1 | |||
| HOXA2 | Predicted | Maternal | 31.0 | 19.5 | 1 | |||||
| HOXA3 | Predicted | Maternal | 26.5 | 5 | 3 | 1 | 22.7 | 11 | 2 | 1 |
| HOXA4 | Predicted | Maternal | 21.0 | 25.0 | ||||||
| HOXA5 | Predicted | Maternal | 19.0 | 37.0 | 2 | |||||
| HOXB2 | Predicted | Maternal | 25.2 | 3 | 1 | 1 | 27.5 | |||
| HOXB3 | Predicted | Maternal | 25.2 | 15 | 1 | 2 | 22.9 | 7 | ||
| HOXC4 | Predicted | Maternal | 26.3 | 13 | 6 | 5 | 24.1 | 24 | 1 | 8 |
| HOXC9 | Predicted | Maternal | 23.3 | 4 | 21.0 | 1 | ||||
| HSPA6 | Predicted | Maternal | 23.5 | 1 | 23.8 | 4 | ||||
| HYMAI | Imprinted | Paternal | 27.0 | 21.5 | 2 | 1 | ||||
| IFITM1 | Predicted | Maternal | 24.5 | 1 | 13.0 | |||||
| IGF2 | Imprinted | Paternal | 21.8 | 1 | 2 | 22.1 | 20 | 4 | ||
| ISM1 | Predicted | Paternal | 26.8 | 121 | 5 | 13 | 25.5 | 48 | 4 | 5 |
| KBTBD3 | Predicted | Paternal | 31.8 | 5 | 1 | 29.5 | 2 | 1 | ||
| KCNK9 | Imprinted | Maternal | 24.1 | 135 | 5 | 4 | 23.0 | 38 | 1 | 3 |
| KCNQ1 | Imprinted | Maternal | 25.7 | 343 | 13 | 25 | 23.6 | 212 | 11 | 11 |
| KCNQ1DN | Imprinted | Maternal | 14.0 | 1 | 21.0 | |||||
| KLF14 | Imprinted | Maternal | 21.7 | 2 | 18.4 | 3 | ||||
| L3MBTL | Imprinted | Paternal | 25.2 | 2 | 21.5 | 14 | ||||
| LDB1 | Predicted | Maternal | 22.0 | 4 | 21.8 | 1 | 1 | |||
| LILRB4 | Predicted | Maternal | 21.4 | 7 | 21.4 | 7 | 1 | |||
| LIN28B | Imprinted | Paternal | 26.9 | 17 | 1 | 5 | 27.6 | 48 | 3 | 4 |
| LMX1B | Predicted | Maternal | 24.0 | 68 | 8 | 5 | 24.1 | 53 | 1 | 3 |
| LRRTM1 | Imprinted | Paternal | 15.5 | 1 | ||||||
| LY6D | Predicted | Paternal | 27.3 | 3 | 1 | 25.0 | ||||
| MAGEL2 | Imprinted | Paternal | 33.5 | 2 | ||||||
| MAGI2 | Imprinted | Maternal | 29.4 | 1035 | 78 | 79 | 27.4 | 1485 | 120 | 112 |
| MEG3 | Imprinted | Maternal | 22.1 | 21 | 2 | 3 | 21.7 | 24 | 1 | |
| MEG8 | Imprinted | Maternal | 21.1 | 2 | 19.7 | 6 | ||||
| MEST | Imprinted | Paternal | 26.3 | 9 | 26.4 | 16 | 1 | |||
| MESTIT1 | Imprinted | Paternal | 29.5 | 2 | 23.0 | 1 | ||||
| MIMT1 | Imprinted | Paternal | 27.0 | 28.2 | ||||||
| MYEOV2 | Predicted | Paternal | 23.7 | 16 | 1 | 21.5 | ||||
| MZF1 | Predicted | Maternal | 23.8 | 4 | 24.7 | 12 | 1 | |||
| NAA60 | Imprinted | Maternal | 24.0 | 10 | 1 | 23.5 | ||||
| NAP1L5 | Imprinted | Paternal | 25.5 | 1 | 1 | |||||
| NDN | Imprinted | Paternal | 26.0 | 1 | 22.0 | 1 | ||||
| NKAIN3 | Predicted | Paternal | 27.2 | 539 | 28 | 35 | 25.9 | 924 | 69 | 53 |
| NKX6-2 | Predicted | Maternal | 19.0 | |||||||
| NLRP2 | Imprinted | Maternal | 22.5 | 62 | 5 | 4 | 20.5 | 39 | 5 | |
| NPAP1 | Imprinted | Unknown | 25.0 | 1 | 1 | 24.0 | ||||
| NTM | Imprinted | Maternal | 28.2 | 793 | 53 | 68 | 25.9 | 585 | 49 | 49 |
| OBSCN | Predicted | Paternal | 25.2 | 160 | 7 | 7 | 23.7 | 169 | 10 | 6 |
| OR11L1 | Predicted | Paternal | 24.7 | 23.4 | ||||||
| OSBPL5 | Imprinted | Maternal | 23.8 | 115 | 9 | 9 | 22.4 | 89 | 4 | 4 |
| OTX1 | Predicted | Maternal | 21.8 | 4 | 20.7 | 3 | ||||
| PAOX | Predicted | Maternal | 25.8 | 6 | 1 | 21.2 | 2 | 1 | ||
| PEG10 | Imprinted | Paternal | 34.6 | 2 | 1 | 27.0 | 4 | |||
| PEG3 | Imprinted | Paternal | 25.4 | 1 | 23.7 | 2 | ||||
| PEX10 | Predicted | Maternal | 19.8 | 8 | 1 | 19.5 | 12 | |||
| PHLDA2 | Imprinted | Maternal | 16.0 | 1 | ||||||
| PHPT1 | Predicted | Maternal | 21.5 | |||||||
| PLAGL1 | Imprinted | Paternal | 28.2 | 29 | 3 | 4 | 26.6 | 13 | 2 | 1 |
| PPAP2C | Predicted | Maternal | 18.3 | 16 | 1 | 26.3 | 276 | 16 | 26 | |
| PPP1R9A | Imprinted | Maternal | 26.5 | 71 | 4 | 9 | 22.1 | 12 | 2 | |
| PRDM16 | Predicted | Paternal | 21.8 | 231 | 10 | 17 | 21.7 | 382 | 10 | 19 |
| PTPN14 | Predicted | Maternal | 27.0 | 10 | 2 | 3 | 27.3 | 55 | 4 | 6 |
| PURG | Predicted | Paternal | 28.2 | 26.8 | 20 | 3 | 2 | |||
| PYY2 | Predicted | Paternal | 31.0 | 1 | ||||||
| RAB1B | Predicted | Maternal | 25.1 | 7 | 2 | 2 | 23.7 | 3 | 1 | |
| RB1 | Imprinted | Maternal | 26.9 | 52 | 7 | 9 | 26.2 | 11 | 1 | 1 |
| RBP5 | Imprinted | Maternal | 32.0 | 1 | 1 | 18.8 | 1 | 1 | ||
| RPL22 | Predicted | Paternal | 31.9 | 7 | 26.0 | 1 | ||||
| RTL1 | Imprinted | Paternal | 20.3 | 3 | 22.0 | 1 | ||||
| SALL1 | Predicted | Maternal | 26.9 | 2 | 2 | 23.3 | 5 | 3 | 1 | |
| SGCE | Imprinted | Paternal | 30.3 | 18 | 3 | 2 | 28.8 | 17 | 3 | 1 |
| SGK2 | Imprinted | Paternal | 23.3 | 6 | 1 | 1 | 25.1 | 18 | 2 | 1 |
| SIM2 | Predicted | Paternal | 26.9 | 28 | 3 | 2 | 24.2 | 16 | 1 | |
| SLC22A18 | Imprinted | Maternal | 22.8 | 29 | 3 | |||||
| SLC22A18AS | Provisional Data | Maternal | 22.3 | 56 | 4 | 1 | ||||
| SLC22A2 | Imprinted | Maternal | 28.9 | 5 | 1 | 25.8 | 4 | |||
| SLC22A3 | Imprinted | Maternal | 27.0 | 82 | 2 | 8 | 26.8 | 119 | 2 | 7 |
| SLC26A10 | Predicted | Maternal | 27.0 | 1 | 25.7 | 2 | ||||
| SLC4A2 | Predicted | Maternal | 21.6 | 1 | 23.2 | 9 | ||||
| SNRPN | Imprinted | Paternal | 24.9 | 117 | 15 | 13 | 24.3 | 156 | 9 | 18 |
| SNURF | Imprinted | Paternal | 25.7 | 13 | 2 | 2 | ||||
| SOX8 | Predicted | Paternal | 19.1 | 10 | 20.9 | 1 | ||||
| SPON2 | Predicted | Paternal | 21.7 | 30 | 2 | 22.3 | 1 | 1 | ||
| TCEB3C | Imprinted | Maternal | 73.0 | 1 | ||||||
| TFPI2 | Imprinted | Maternal | 24.0 | |||||||
| TIGD1 | Predicted | Paternal | 28.4 | 7 | 25.6 | 7 | ||||
| TMEM52 | Predicted | Paternal | 24.7 | 2 | 1 | 16.0 | ||||
| TMEM60 | Predicted | Paternal | 29.0 | 3 | 28.3 | 1 | ||||
| TP73 | Imprinted | Maternal | 23.6 | 73 | 4 | 5 | 23.0 | 93 | 3 | 7 |
| TSHZ3 | Predicted | Paternal | 26.6 | 34 | 4 | 8 | 24.7 | 77 | 7 | 10 |
| UBE3A | Imprinted | Maternal | 28.3 | 11 | 1 | 2 | 26.6 | 105 | 7 | 10 |
| VAX2 | Predicted | Maternal | 28.0 | 27 | 2 | 4 | 24.7 | 14 | 3 | 2 |
| VENTX | Predicted | Maternal | 25.0 | 1 | 24.0 | 12 | ||||
| WT1 | Imprinted | Paternal | 28.3 | 53 | 4 | 2 | 24.9 | 9 | 1 | 3 |
| ZC3H12C | Imprinted | Paternal | 28.0 | 106 | 7 | 11 | 26.9 | 102 | 9 | 18 |
| ZDBF2 | Imprinted | Paternal | 28.4 | 10 | 2 | 29.8 | 7 | |||
| ZFAT | Imprinted | Paternal | 24.1 | 2 | 1 | 23.1 | 268 | 9 | 16 | |
| ZFAT-AS1 | Imprinted | Paternal | 28.6 | 7 | ||||||
| ZFP36L2 | Predicted | Maternal | 21.0 | 1 | 1 | 15.0 | 1 | |||
| ZIC1 | Predicted | Maternal | 37.7 | 2 | 1 | 30.7 | 2 | 1 | ||
| ZIM2 | Imprinted | Paternal | 26.0 | 1 | 23.8 | 14 | ||||
| ZNF225 | Predicted | Paternal | 22.1 | 25.4 | 1 | |||||
| ZNF229 | Predicted | Maternal | 27.5 | 4 | 2 | 25.8 | 22 | 5 | ||
| ZNF264 | Unknown | Unknown | 27.1 | 34 | 5 | 2 | 25.2 | 30 | 1 | 4 |
| ZNF597 | Imprinted | Maternal | 33.0 | 1 | 1 | 24.8 | 1 | |||
For each cell line the mean depth of the alternate allele, the number of heterozygous and homozygous variants identified from WGS is provided