Joseph E Kaserman1, Killian Hurley1, Mark Dodge2, Carlos Villacorta-Martin2, Marall Vedaie2, Jyh-Chang Jean2, Derek C Liberti2, Marianne F James2, Michelle I Higgins2, Nora J Lee2, George R Washko3, Raul San Jose Estepar3, Jeffrey Teckman4, Darrell N Kotton1, Andrew A Wilson5. 1. Center for Regenerative Medicine (CReM) of Boston University and Boston Medical Center, Boston, MA 02118, USA; The Pulmonary Center and Department of Medicine, Boston University School of Medicine, Boston, MA 02118, USA. 2. Center for Regenerative Medicine (CReM) of Boston University and Boston Medical Center, Boston, MA 02118, USA. 3. Brigham and Women's Hospital, Boston, MA 02118, USA. 4. Saint Louis University, St Louis, MO 63104, USA. 5. Center for Regenerative Medicine (CReM) of Boston University and Boston Medical Center, Boston, MA 02118, USA; The Pulmonary Center and Department of Medicine, Boston University School of Medicine, Boston, MA 02118, USA. Electronic address: awilson@bu.edu.
Abstract
Individuals with the genetic disorder alpha-1 antitrypsin deficiency (AATD) are at risk of developing lung and liver disease. Patient induced pluripotent stem cells (iPSCs) have been found to model features of AATD pathogenesis but only a handful of AATD patient iPSC lines have been published. To capture the significant phenotypic diversity of the patient population, we describe here the establishment and characterization of a curated repository of AATD iPSCs with associated disease-relevant clinical data. To highlight the utility of the repository, we selected a subset of iPSC lines for functional characterization. Selected lines were differentiated to generate both hepatic and lung cell lineages and analyzed by RNA sequencing. In addition, two iPSC lines were targeted using CRISPR/Cas9 editing to accomplish scarless repair. Repository iPSCs are available to investigators for studies of disease pathogenesis and therapeutic discovery.
Individuals with the genetic disorder alpha-1 antitrypsin deficiency (AATD) are at risk of developing lung and liver disease. Patient induced pluripotent stem cells (iPSCs) have been found to model features of AATD pathogenesis but only a handful of AATD patient iPSC lines have been published. To capture the significant phenotypic diversity of the patient population, we describe here the establishment and characterization of a curated repository of AATD iPSCs with associated disease-relevant clinical data. To highlight the utility of the repository, we selected a subset of iPSC lines for functional characterization. Selected lines were differentiated to generate both hepatic and lung cell lineages and analyzed by RNA sequencing. In addition, two iPSC lines were targeted using CRISPR/Cas9 editing to accomplish scarless repair. Repository iPSCs are available to investigators for studies of disease pathogenesis and therapeutic discovery.
Alpha-1 antitrypsin deficiency (AATD) is a prominent cause of inherited lung and liver disease, affecting approximately 100,000 patients in the United States and accounts for an estimated 1%–4% of chronic obstructive pulmonary disease cases (American Thoracic Society and European Respiratory Society, 2003, Lieberman et al., 1986). Clinical disease results from mutations in the SERPINA1 gene and associated dysfunction of alpha-1 antitrypsin (AAT) protein. While much has been learned about the mechanisms through which misfolded AAT proteins affect both the lung and the liver in the half century since the disease was first described (Eriksson, 1964, Laurell and Eriksson, 1963), significant gaps in our understanding of disease pathogenesis remain, underscoring an unmet need for models that faithfully recapitulate human disease mechanisms and capture AATD patient heterogeneity.Patient-derived induced pluripotent stem cells (iPSCs) have emerged in recent years as a model system capable of reproducing human disease phenotypes, including AATD (Rashid et al., 2010, Segeritz et al., 2018, Tafaleng et al., 2015, Wilson et al., 2015). Through the application of directed differentiation protocols, these pluripotent cells can be utilized to study disease mechanisms in a variety of differentiated cell types, including those representing the two organs most affected in AATD, lung and liver (Jacob et al., 2017, McCauley et al., 2017, Rashid et al., 2010, Segeritz et al., 2018, Tafaleng et al., 2015, Wilson et al., 2015). Patient iPSCs likewise allow the generation of multiple cell types from a single genetic background, a significant advantage in modeling multiorgan diseases (Leung et al., 2013).However, despite the promise represented by this model for AATD studies, significant challenges must be overcome to fully realize their potential. For instance, only a small number of AATD patient iPSC lines have been published to date. Moreover, both institutional and legal barriers limit or prohibit sharing of some existing lines with academic and industry investigators thus preventing their use for further understanding disease mechanisms, for drug discovery, or for cell therapies. In addition, line-to-line variability in differentiation efficiency can limit the utility of specific lines to generate disease-relevant lineages and is not often reported. To address these challenges, we have developed a unique, large repository of iPSCs derived from comprehensively phenotyped AATD patients with associated participant consent that explicitly allows sharing and dissemination of both cells and associated clinical data. To further enhance the utility of this repository, we have systematically differentiated a subset of lines to lung and liver lineages to both quantify their differentiation capacity and analyze their transcriptome in comparison with primary controls. Here, we present this repository together with salient details of clinical lung and liver characterization to facilitate the application of constituent iPSC lines for advanced disease studies.
Results
Establishment of an AATD iPSC Repository Representing Multiple Genotypes
To establish a freely accessible iPSC repository representing multiple AATD genotypes, we first compiled detailed clinical data, including radiographic and spirometric data, liver biochemistry and function testing, liver elastography, and biopsy results from patients attending the Alpha-1 Center at Boston University and Boston Medical Center and from subjects enrolled in two ongoing clinical studies. At the time of enrollment in the repository, we collected a linked sample of peripheral blood mononuclear cells (PBMCs) or in some cases skin fibroblasts suitable for reprogramming to generate iPSCs from each individual. We cryopreserved a total of 168 patient samples (Figure 1 and Table 1) representing both genders, a wide range of ages, and diverse SERPINA1 genotypes, including common disease-causing variants (PiZZ, PiSZ), rare deficiency variants (PiQ0BoltonQ0Bolton, PiFZ, PiZMHeerlen) (Reeves et al., 2018), and one newly described mutation (PiZVGrantspass).
Figure 1
Flow Diagram of Individuals Included in Repository
See also Figures S1 and S2.
Table 1
AATD Patient-Specific Repository
Line
Genotype
Sex
Age (years)
Line
Genotype
Sex
Age (years)
Line
Genotype
Sex
Age (years)
PiZZ1a,b
ZZ
M
44
CA-003a
ZZ
F
20
MO-005a
ZZ
F
57
PiZZ2a,b
ZZ
M
65
CA-004a
ZZ
M
70
MO-006a
ZZ
F
44
PiZZ3a,b
ZZ
M
65
CA-005a
ZZ
M
63
MO-007a
ZZ
F
63
PiZZ4a
ZZ
F
55
CA-006a
ZZ
M
45
MO-008a
ZZ
F
54
PiZZ5a
ZZ
F
57
CA-007a
ZZ
M
77
MO-009a
ZZ
F
63
PiZZ6a,c
ZZ
F
16
CA-008a
ZZ
M
58
MO-010a
ZZ
F
36
PiZZ7a
ZZ
F
62
CA-009a
ZZ
F
61
MO-011a,c
ZZ
M
53
PiZZ8a
ZZ
F
48
CA-010a
ZZ
F
31
MO-012a
ZZ
F
58
PiZZ9a
ZZ
F
69
CA-011a
ZZ
F
68
MO-013a
ZZ
F
57
PiZZ10a
ZZ
M
64
CA-012a
ZZ
F
53
MO-014a
ZZ
M
35
PiZZ11a
ZZ
F
35
CA-013a
ZZ
M
72
MO-015a
ZZ
F
60
PiZZ12a
ZZ
F
71
CA-014a
ZZ
M
72
MO-016a
ZZ
M
74
PiZZ13a
ZZ
M
49
CA-015a
ZZ
M
71
MO-017a
ZZ
M
68
PiZZ14a
ZZ
F
62
CA-016a
ZZ
M
55
MO-018a
ZZ
F
58
PiZZ15a
ZZ
F
52
CA-017a
ZZ
M
63
MO-019a
ZZ
F
54
PiZZ16a
ZZ
M
58
CA-018a
ZZ
F
70
MO-020a
ZZ
F
80
PiZZ17a
ZZ
M
61
CA-019a
ZZ
F
18
MO-021a
ZZ
M
60
PiZZ18a
ZZ
F
66
CA-020a
ZZ
F
45
MO-022a
ZZ
M
45
PiZZ19a
ZZ
F
31
CA-021a
ZZ
M
51
MO-023a
ZZ
M
67
PiZZ20a
ZZ
F
65
CA-022a
ZZ
M
59
MO-024a
ZZ
F
40
PiZZ21a
ZZ
F
71
CA-023a
ZZ
F
54
MO-025a
ZZ
F
51
PiZZ22a
ZZ
M
58
CA-024a
ZZ
M
45
MO-026a
ZZ
M
50
PiZZ23a
ZZ
M
68
CA-025a
ZZ
F
68
MO-027a
ZZ
F
50
PiZZ24a
ZZ
M
43
CA-026a
ZZ
M
62
MO-028a
ZZ
F
26
PiZZ25a
ZZ
F
38
CA-027a
ZZ
F
63
MO-029a
ZZ
M
37
PiZZ26a
ZZ
M
82
CA-028a
ZZ
F
42
MO-030a
ZZ
F
62
PiZZ27a
ZZ
F
51
CA-029a
ZZ
F
63
MO-031a
ZZ
F
76
PiZZ28a
ZZ
F
25
CA-030a
ZZ
F
72
MO-032a
ZZ
F
56
PiZZ29a
ZZ
F
52
MA-001a
ZZ
F
58
PiSZ1a,b
SZ
F
68
PiZZ30a
ZZ
F
19
MA-002a,c
ZZ
M
69
PiSZ2a,b
SZ
F
62
PiZZ31a
ZZ
F
72
MA-003a
ZZ
F
65
PiSZ3a,b
SZ
M
56
PiZZ32a
ZZ
M
59
MA-004a
ZZ
M
65
PiSZ4a
SZ
F
30
PiZZ33a
ZZ
M
64
MA-005a
ZZ
F
40
PiSZ5a
SZ
F
59
PiZZ34a
ZZ
F
58
MA-006a
ZZ
F
65
PiSZ6a
SZ
F
54
PiZZ35a
ZZ
F
49
MA-007a
ZZ
M
69
PiSZ7a
SZ
M
72
PiZZ36a
ZZ
M
55
MA-008a
ZZ
M
56
PiSZ8a
SZ
F
47
PiZZ37a
ZZ
F
59
MA-009a
ZZ
F
51
PiSZ9a
SZ
M
64
PiZZ38a
ZZ
F
68
MA-010a
ZZ
M
42
PiSZ10a
SZ
F
64
PiZZ39a
ZZ
F
71
MA-011a,c
ZZ
F
37
PiSS1a
SS
M
42
PiZZ100b,d
ZZ
F
67
MA-012a
ZZ
M
54
PiSS2a
SS
M
51
PiZZ101b,d
ZZ
F
65
MA-013a
ZZ
M
44
PiZMh1a,b
ZMHeerlen
F
76
PiZZ102b,d
ZZ
F
51
MA-014a
ZZ
M
32
PiZMh2a
ZMHeerlen
M
37
PiZZ103b,d
ZZ
F
61
MA-015a
ZZ
M
58
PiFZ1a,b
FZ
M
56
CBZ1a,b
ZZ
M
NA
MA-016a
ZZ
F
23
PiBB1b,d
Q0BoltonQ0Bolton
M
46
CBZ3a,b
ZZ
M
NA
MA-017a
ZZ
M
55
PiZVGrantspass1a
ZVGrantspass
F
67
CBZ4a,c
ZZ
F
NA
MA-018a
ZZ
M
36
PiZQ0Newcity1a
ZQ0Newcity
F
71
CBZ5a,c
ZZ
M
NA
MA-019a
ZZ
F
54
PiMZ2a
MZ
F
79
CBZ6a,c
ZZ
F
NA
MA-020a
ZZ
M
71
PiMZ3a
MZ
M
59
CBZ7a,c
ZZ
M
NA
MA-022a
ZZ
F
45
PiMZ4a
MZ
F
64
CBZ8a,c
ZZ
F
NA
MA-023a
ZZ
M
58
PiMZ5a
MZ
M
38
CBZ9a,c
ZZ
M
NA
MA-025a
ZZ
F
63
PiMZ6a
MZ
M
53
CBZ12a,c
ZZ
M
NA
MA-026a
ZZ
M
58
PiMZ7a
MZ
M
50
CBZ13a,c
ZZ
M
NA
MO-001a
ZZ
F
71
PiMZ8a
MZ
M
37
CBZ14a,c
ZZ
M
NA
MO-002a
ZZ
M
40
PiMZ9a
MZ
M
53
CA-001a
ZZ
M
47
MO-003a
ZZ
M
68
PiMZ10a
MZ
F
51
CA-002a
ZZ
M
62
MO-004a
ZZ
M
50
PiMS1a
MS
F
41
Summary of the 168 peripheral blood or skin fibroblast samples obtained from subjects with SERPINA1 mutations. Included within the repository are samples representative of common disease-causing variants (S,Z) as well as rare and novel mutations. Twenty-eight samples (highlighted) have been reprogrammed into established iPSC lines. See also Figures S1 and S2.
PBMC, peripheral blood mononuclear cell; M, male; F, female; NA, not available.
PBMC.
STEMCCA.
Sendai virus.
Skin fibroblast.
Flow Diagram of Individuals Included in RepositorySee also Figures S1 and S2.AATD Patient-Specific RepositorySummary of the 168 peripheral blood or skin fibroblast samples obtained from subjects with SERPINA1 mutations. Included within the repository are samples representative of common disease-causing variants (S,Z) as well as rare and novel mutations. Twenty-eight samples (highlighted) have been reprogrammed into established iPSC lines. See also Figures S1 and S2.PBMC, peripheral blood mononuclear cell; M, male; F, female; NA, not available.PBMC.STEMCCA.Sendai virus.Skin fibroblast.To provide a representative, diverse sampling of the repository, we next selected ten “featured” samples derived from individuals with PiZZ and PiSZ AAT phenotypes representing the most common pathogenic allele combinations for in-depth characterization of clinical phenotypic data as well as iPSC reprogramming and functional assays of resulting iPSCs following directed differentiation to disease-relevant lineages (Tables 2 and 3). Consistent with previous epidemiologic studies of AATD (American Thoracic Society and European Respiratory Society, 2003), the majority of the samples selected for characterization (8/10) are derived from individuals with radiographic (seven with emphysema, one with cystic lung disease) and spirometrically confirmed lung disease, while the remaining two samples represent younger subjects who have not to date developed detectable lung pathology (overall mean forced expiratory volume in 1 s [FEV1]/forced vital capacity [FVC] 49.4%; FEV1 58%; diffusion capacity for carbon monoxide in percent predicted 54.1%) (Table 2). To evaluate for heterogeneity in distribution and pattern of lung disease among individuals in this group, we next analyzed the upper, mid, and lower portions of the lungs on computed tomography (CT) scan through application of a modified National Emphysema Treatment Trial scoring system (Table 2) (Hersh et al., 2015) as well as quantification of lung CT densitometry (Table S1) (Onieva et al., 2016, Ross et al., 2009). While we did find classic lower lobe panlobular emphysema in some scans, we likewise noted significant heterogeneity in emphysema type and distribution (Tables 2 and S1). We additionally assembled test results to characterize the presence or absence and degree of liver disease for each donor, including biochemistry, transaminases, international normalized ratio, and platelet count together with elastography and biopsy results when available (Table 3). Elastography detected some degree of liver fibrosis in five of five PiZZ individuals tested with two of five having probable or definite cirrhosis.
Table 2
Demographic and Clinical Pulmonary Characteristics
Subject ID
Genotype, Age (years)
FEV1
FEV1/FVC
DLCO
CXR Image
CT Image
CT Read
CT Score
PiZZ6
ZZ, 16
3.40 (108%)
85%
92%
no emphysema
0
0
0
MA-011
ZZ, 37
2.43 (85%)
78%
data unavailable
no emphysema
0
0
0
PiZZ1
ZZ, 44
2.60 (75%)
63%
79%
diffuse cystic lung disease. No emphysema
0
0
0
PiZZ2
ZZ, 65
0.59 (18%)
21%
21%
moderate centrilobular & paraseptal emphysema
2
3
3
PiZZ3
ZZ, 65
1.44 (48%)
52%
data unavailable
moderate centrilobular emphysema
2
3
3
PiZZ100
ZZ, 67
0.91 (39%)
46%
31%
panlobular emphysema
1
2
4
MA-002
ZZ, 69
1.82 (68%)
39%
66%
moderate centrilobular emphysema
2
4
3
PiSZ3
SZ, 56
0.81 (24%)
19%
40%
data unavailable
data unavailable
emphysemaa
data unavailable
PiSZ2
SZ, 62
2.09 (94%)
59%
83%
mild diffuse centrilobular emphysema
1
1
1
PiSZ1
SZ, 68
0.55 (21%)
32%
21%
advanced destructive centrilobular emphysema
4
3
3
PFT, pulmonary function tests; FEV1, forced expiratory volume in 1 s in liters (percent predicted); FVC, forced vital capacity; DLCO, diffusion capacity for carbon monoxide in percent predicted; CXR, chest X-ray; CT, computed tomography.
CT score: 0, no emphysema; 1, mild emphysema; 2, moderate emphysema; 3, severe emphysema; 4, very severe emphysema.
Ishak fibrosis 0, PAS-D+ hepatocytes zones 1 and 2
PiSZ3
SZ, 56
24
0.2
3.5
303
1.00
data unavailable
data unavailable
PiSZ2
SZ, 62
24
0.9
4.5
228
0.91
data unavailable
data unavailable
PiSZ1
SZ, 68
21
0.3
3.7
201
data unavailable
data unavailable
data unavailable
ALT, alanine aminotransferase; INR, international normalized ratio; RUQ US, right upper quadrant ultrasound; VCTE, vibration-controlled transient elastography.
VCTE fibrosis score: F0, no scarring; F1, mild scarring; F2, moderate scarring; F3, severe scarring; F4, cirrhosis.
Ishak fibrosis score: 0, no fibrosis; 1, fibrous expansion of some portal areas; 2, fibrous expansion of most portal areas; 3, fibrous expansion of most portal areas with occasional portal to portal bridging; 4, fibrous expansion of portal areas with marked portal-portal bridging; 5, marked bridging with occasional nodules; 6, cirrhosis.
Demographic and Clinical Pulmonary CharacteristicsPFT, pulmonary function tests; FEV1, forced expiratory volume in 1 s in liters (percent predicted); FVC, forced vital capacity; DLCO, diffusion capacity for carbon monoxide in percent predicted; CXR, chest X-ray; CT, computed tomography.CT score: 0, no emphysema; 1, mild emphysema; 2, moderate emphysema; 3, severe emphysema; 4, very severe emphysema.See also Table S1.Clinical report.Demographic and Clinical Hepatic CharacteristicsALT, alanine aminotransferase; INR, international normalized ratio; RUQ US, right upper quadrant ultrasound; VCTE, vibration-controlled transient elastography.VCTE fibrosis score: F0, no scarring; F1, mild scarring; F2, moderate scarring; F3, severe scarring; F4, cirrhosis.Ishak fibrosis score: 0, no fibrosis; 1, fibrous expansion of some portal areas; 2, fibrous expansion of most portal areas; 3, fibrous expansion of most portal areas with occasional portal to portal bridging; 4, fibrous expansion of portal areas with marked portal-portal bridging; 5, marked bridging with occasional nodules; 6, cirrhosis.
Differentiation Capacity of AATD iPSCs to Disease-Relevant Lineages
To demonstrate the utility of patient-specific cells for disease modeling, we selected PBMC samples from our repository for reprogramming to generate iPSCs. In addition to the 10 featured samples detailed above, we reprogrammed 18 samples representing a variety of SERPINA1 deficiency allele combinations (Figure 1; Table 1). Each iPSC line was confirmed to have a normal karyotype and to express markers of pluripotency before proceeding to directed differentiation experiments (Figures S1 and S2; additional pluripotency staining can be found at https://stemcellbank.bu.edu). We have previously established directed differentiation protocols to generate iPSC-derived hepatic cells (iHeps) (Wilson et al., 2015, Kaserman and Wilson, 2017, Reeves et al., 2018), and to derive either proximal or distal differentiated lung lineages (Hawkins et al., 2017, Jacob et al., 2017, McCauley et al., 2017, McCauley et al., 2018) for disease modeling and assessment of drug responsiveness. To apply these protocols to each of the ten featured iPSC lines, a single vial of cryopreserved iPSCs representing each line was thawed, expanded, and split to accomplish parallel liver- and lung-directed differentiation (Figure 2A). Undifferentiated iPSCs were first patterned into definitive endoderm, the germ layer from which both the lung and liver derive embryologically. These cells were then further specified through sequential exposure to lineage-specific growth factors to derive either iHeps or lung progenitors (Figures 2A, 2B, and S3). We sorted cells on day 15 of our lung-directed differentiation protocol based on the cell surface marker profile, CD47hi/CD26lo, which enriches for lung progenitor cells expressing the earliest known lung lineage marker, NKX2-1 (Hawkins et al., 2017). This sort strategy achieved an enrichment in NKX2-1+ putative lung progenitors of >85% in eight out of nine iPSC lines tested (mean 87.6% ± 11.2%) from a pre-sort mean NKX2-1 percentage of 39.5% ± 14.3% (Figure S3). These lung progenitors can be further differentiated to either distal (Gotoh et al., 2014, Jacob et al., 2017) or proximal lung lineages for disease modeling of specific compartments (Firth et al., 2015, McCauley et al., 2017).
Figure 2
Directed Differentiation of AATD Patient-Specific iPSCs into Endodermal Lineages
(A) Schematic demonstrating the specification of AATD patient-derived iPSCs into the two most disease-relevant endodermal lineages.
(B) Representative flow cytometric analysis using stage-specific markers at key directed differentiation time points, including definitive endoderm (day 3 or 4), lung progenitors (day 15), and iHeps (day 26).
(C) MFI of intracellular AAT for all 10 AATD lines. n = 3 independent experiments. Data are represented as mean ± SEM.
See also Figure S3.
Directed Differentiation of AATD Patient-Specific iPSCs into Endodermal Lineages(A) Schematic demonstrating the specification of AATD patient-derived iPSCs into the two most disease-relevant endodermal lineages.(B) Representative flow cytometric analysis using stage-specific markers at key directed differentiation time points, including definitive endoderm (day 3 or 4), lung progenitors (day 15), and iHeps (day 26).(C) MFI of intracellular AAT for all 10 AATD lines. n = 3 independent experiments. Data are represented as mean ± SEM.See also Figure S3.iHeps differentiated from these ten featured iPSC lines in these experiments consistently acquired a characteristic hepatocyte-like morphology (Wilson et al., 2015). Intracellular AAT, quantified by mean fluorescence intensity (MFI), demonstrated significant AAT protein retention with an average MFI of 427.13 ± 88.64 (Figures 2B, 2C, and S3). We and others have previously found hepatic-directed differentiation of iPSCs to be efficient across multiple lines but with some line-to-line variability (Ogawa et al., 2013, Pashos et al., 2017, Warren et al., 2017, Wilson et al., 2015), consistent with the differentiation efficiencies we observed across our ten featured lines. Together, these findings demonstrate the ability of featured iPSC lines to differentiate into multiple downstream lineages relevant to AATD disease pathogenesis.
Transcriptional Profiling of iPSC-Derived Lung Progenitors, iHeps, and Primary Human Hepatocytes
We next profiled the global transcriptomes of both iHeps and lung progenitors derived from the ten featured lines. We used digital gene expression (DGE), a platform for high-fidelity RNA sequencing (Cacchiarelli et al., 2015). In addition to differentiated iPSC-derived cell types, we collected RNA from undifferentiated iPSCs and from a panel of primary adult human hepatocytes (PHH) for comparison (Figure 3A). Principal-component analysis (PCA) of the sequencing data revealed four distinct groups (Figure 3B), with undifferentiated and lung progenitor samples clustering separately from both PHH and iHeps. PHH and iHep transcriptomes displayed a similar first-principal-component variance (Figure 3B) consistent with less transcriptomic variance among hepatic cells of primary or engineered origin, relative to other cell types. We next analyzed expression levels of selected genes known to mark hepatic, lung, and undifferentiated lineages (Figure 3C). Using empirical Bayes moderated t-statistical analysis, the expression of specific key pluripotency-associated genes, including POU5F1 (OCT4) and NANOG, were significantly upregulated in undifferentiated iPSCs (false discovery rate [FDR] < 0.001), while NKX2-1 and CPM, known markers of iPSC-derived lung progenitor cells (Gotoh et al., 2014, Hawkins et al., 2017), as well as markers of immature fetal lung, such as SOX9, GATA6, and ELF5 (Hawkins et al., 2017, Nikolić et al., 2017), were significantly upregulated in the iPSC-lung progenitors (FDR < 0.001). Conversely, established hepatic lineage genes, such as TTR, SERPINA1, and ALB (Wilson et al., 2015), were upregulated (FDR < 0.001) in both iHeps and PHH consistent with the PCA groupings.
Figure 3
Transcriptomic Profile of AATD Patient-Derived iPSC-Lung Progenitors and Hepatic Cells
(A) Schematic demonstrating transcriptomic profiling from 10 AATD iPSC lines using DGE.
(B) PCA demonstrates samples clustered predominantly by cell type.
(C) Heatmap of selected lineage-specific markers across iPSC-derived and primary samples.
(D) Gene set variation analysis demonstrates enrichment within relevant organ-specific hallmark pathways for each group.
(E) Quantification of lung and hepatic differentiation efficiency for each iPSC line based on percent enrichment for NKX2-1 and application of a 30-gene transcriptomic signature of hepatic enrichment.
(F) Ranking of all samples by hepatic enrichment score.
See also Table S2.
Transcriptomic Profile of AATD Patient-Derived iPSC-Lung Progenitors and Hepatic Cells(A) Schematic demonstrating transcriptomic profiling from 10 AATD iPSC lines using DGE.(B) PCA demonstrates samples clustered predominantly by cell type.(C) Heatmap of selected lineage-specific markers across iPSC-derived and primary samples.(D) Gene set variation analysis demonstrates enrichment within relevant organ-specific hallmark pathways for each group.(E) Quantification of lung and hepatic differentiation efficiency for each iPSC line based on percent enrichment for NKX2-1 and application of a 30-gene transcriptomic signature of hepatic enrichment.(F) Ranking of all samples by hepatic enrichment score.See also Table S2.To identify the biological or pathogenic pathways that characterize our iHeps and iPSC-lung progenitors we used gene set variation analysis, a non-parametric, unsupervised method for estimating variation of gene set enrichment through the samples of an expression dataset (Figure 3D). This analysis demonstrated enrichment of relevant organ-specific hallmark pathways for each of the groups studied (Liberzon et al., 2015). For example, iPSC-lung progenitors exhibited enrichment of “Lung Morphogenesis,” “Alveolar Development,” “Lung Epithelium Development,” and “Lung Cell Differentiation” pathways, while “Regulation of Hepatocyte Proliferation” and “Liver Regeneration” pathways were enriched in iHeps. SERPINA1 is one of the most highly upregulated genes in iHeps relative to undifferentiated controls (Wilson et al., 2015). Consistent with this expression pattern and the known contribution of the ZAAT variant to hepatic injury, AATD iHeps also demonstrated higher enrichment scores within pathways associated with fibrosis and cirrhosis (e.g., transforming growth factor β signaling, epithelial-mesenchymal transition, hepatocyte apoptotic process) in comparison with PHH, a finding with potential relevance to AAT liver disease modeling.The ability to assess the efficiency of hepatic differentiation between different iPSC lines and across experiments is a recognized challenge in iPSC-directed differentiation protocols (Wilson et al., 2015, Pashos et al., 2017). To compare relative efficiencies for each of the 10 ZAAT iHep differentiations, we assigned an individual hepatic enrichment score using a panel of 30 genes significantly upregulated in PHH compared with undifferentiated iPSCs (Figure 3E; Table S2). Application of this score revealed a moderate degree of variability in hepatic enrichment among iHep lines, consistent with our experience and published reports describing line-to-line variability in directed differentiation protocols (Pashos et al., 2017). To test the specificity of the score we likewise applied it to iPSC-lung progenitors and found reduced hepatic enrichment relative to iHep comparators as expected (average 0.5695 ± .024; p < 0.0001) (Figure 3F).
CRISPR/Cas9 Gene Editing of the SERPINA1 Locus in AATD Patient-Derived iPSCs
To isolate the cellular effects that result directly from endogenous ZAAT expression within a single genetic background, we next applied the CRISPR/Cas9 endonuclease system to achieve scarless correction of the Z mutation (Figures 4 and S4). Using this approach, we were able to target multiple PiZZ iPSC lines to generate corrected, syngeneic PiMM daughter lines achieving an overall biallelic correction efficiency of 6% (Figure 4C). After gene correction was confirmed via Sanger sequencing, edited lines underwent repeat karyotyping and staining for markers of pluripotency (data not shown). To evaluate the downstream effects of gene correction, syngeneic PiZZ and PiMM iPSC lines were then differentiated in parallel to generate iHeps for comparison. As expected, edited PiMM iHeps displayed reduced levels of intracellular AAT protein accumulation together with an increase in secreted AAT in comparison with their parental PiZZ iHeps (Figures 4D–4G).
Figure 4
Scarless Gene Correction of the Z Mutation in AATD Patient-Derived iPSCs Using CRISPR/Cas9
(A) Targeting strategy for the Z mutation located within exon five of SERPINA1. A 70-bp single-stranded oligodeoxynucleotide (ssODN) repair template sequence is shown with the wild-type sequence highlighted in pink, the new ClaI restriction enzyme site in blue, and silent mutations to reduce CRISPR targeting of the donor sequence in green and red.
(C) Efficiency of biallelic homology-directed repair across genetically distinct ZZ AATD iPSCs.
(D) Flow cytometry of fixed, permeabilized iHeps demonstrates significant reduction of intracellular AAT protein within CRISPR-corrected syngeneic PiMM iHeps.
(E) MFI of AAT in syngeneic PiZZ and PiMM iHeps.
(F) ELISA of total secreted AAT levels from syngeneic PiZZ and PiMM iHep supernatants.
(G) Immunoblot probed with anti-AAT antibody demonstrates reduced intracellular levels of the native 52-kDA AAT protein with increased levels of secreted mature 55-kDa AAT protein in the syngeneic PiMM relative to PiZZ iHeps. n = 3 independent experiments. Data represented as mean ± SEM. ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001 by two-tailed t test.
See also Figure S4.
Scarless Gene Correction of the Z Mutation in AATD Patient-Derived iPSCs Using CRISPR/Cas9(A) Targeting strategy for the Z mutation located within exon five of SERPINA1. A 70-bp single-stranded oligodeoxynucleotide (ssODN) repair template sequence is shown with the wild-type sequence highlighted in pink, the new ClaI restriction enzyme site in blue, and silent mutations to reduce CRISPR targeting of the donor sequence in green and red.(B) Representative fluorescence-activated cell sorting plots demonstrating nucleofection efficiency in AATD iPSCs.(C) Efficiency of biallelic homology-directed repair across genetically distinct ZZ AATD iPSCs.(D) Flow cytometry of fixed, permeabilized iHeps demonstrates significant reduction of intracellular AAT protein within CRISPR-corrected syngeneic PiMM iHeps.(E) MFI of AAT in syngeneic PiZZ and PiMM iHeps.(F) ELISA of total secreted AAT levels from syngeneic PiZZ and PiMM iHep supernatants.(G) Immunoblot probed with anti-AAT antibody demonstrates reduced intracellular levels of the native 52-kDA AAT protein with increased levels of secreted mature 55-kDa AAT protein in the syngeneic PiMM relative to PiZZ iHeps. n = 3 independent experiments. Data represented as mean ± SEM. ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001 by two-tailed t test.See also Figure S4.
Discussion
iPSC technology and the associated ability to derive disease-associated cell types in a specific genetic context has provided a new avenue to study the contribution of gene variants to disease. Their unlimited capacity as a source of differentiated cell types together with their amenability to genetic manipulation further underscore the potential utility of the iPSC model to elucidate cellular disease mechanisms. AATD is characterized by significant inter-individual variability in disease manifestation, suggesting the need for a model that captures the genetic heterogeneity of the patient population. Therefore, we have generated a repository of iPSCs from numerous patients with SERPINA1 mutations to allow the generation and study of disease-affected cell types in multiple genetic backgrounds.It is likely that additional as-of-yet undefined polymorphisms in disease-modifying genes contribute to heterogeneity of outcomes in AATD patients observed in clinical practice. The ability to apply this model for either discovery or validation of additional genetic modifiers will require careful clinical phenotyping to correctly classify and characterize clinical outcomes of interest. For this reason, the clinical database compiled in these studies and associated with this repository greatly enhances its potential value. This database can be interrogated upon request to identify samples representative of specific clinical phenotypes. In addition to association of biological samples with clinical data, iPSC repositories, such as the one described, here possess other important attributes that distinguish them and their reagents from those held by individual investigators. These include curated selection of iPSC lines to represent significant clinical phenotypes, maintenance of quality control standards to ensure that constituent iPSCs do not harbor significant karyotypic abnormalities, documentation of donor consents that allow sharing of lines and associated clinical data with academic and industry-associated investigators, and streamlined material transfer agreements that minimize administrative barriers to sharing (Park et al., 2017).In these studies, we defined the functional ability of each characterized iPSC line to differentiate into multiple disease-relevant lineages. The capacity to generate multiple differentiated lineages from each line allows the experimental modeling of multiorgan diseases in a genetically controlled fashion, a feature that separates this model system from heterologous cell lines (Giadone et al., 2019). Published work delineating protocols for deriving lung progenitor cells and their differentiated progeny have largely been based on a limited number of pluripotent stem cell lines (Gotoh et al., 2014, Hawkins et al., 2017, Huang et al., 2013, Jacob et al., 2017, Longmire et al., 2012, McCauley et al., 2017, McCauley et al., 2018, Mou et al., 2012). While protocol optimization for specific lines can enhance differentiation efficiency, this work demonstrates the application of a single differentiation protocol applied to numerous donor iPSC lines, each of which efficiently generated lung progenitor cells, precursor cells for all lung epithelium that can be subsequently used to model disease in the proximal and distal lung (Hawkins et al., 2017, Jacob et al., 2017, Longmire et al., 2012, McCauley et al., 2017, McCauley et al., 2018).In addition to characterizing PiZZ patient cells in our experiments, we have likewise included PiSZ iPSCs heterozygous for the Z and S mutations, an allele combination not associated with high levels of polymerization and not known to confer significant additional risk for the development of liver disease (American Thoracic Society and European Respiratory Society, 2003, Tan et al., 2014). In addition, we chose to profile multiple genetically distinct primary hepatocyte samples as comparators to derive a more robust primary cell expression profile that accounts for known inter-individual differences in gene expression (Pashos et al., 2017).Directed differentiation protocols for the derivation of hepatic cells from iPSCs and their application to model liver disease are well established (Rashid et al., 2010, Segeritz et al., 2018, Smith et al., 2015, Wilson et al., 2015). A significant hurdle in these protocols, however, has been the ability to compare the relative efficiency of differentiation across experiments as liver-specific lineage markers, such as AFP, are expressed in early hepatoblasts while more mature hepatocyte genes, such as ASGR1, mark only a minority of cells in current differentiation protocols (Warren et al., 2017). To quantify differences in the differentiation efficiency for iHeps generated in our studies, we applied the primary hepatocyte gene expression profile to generate a signature of relative hepatic enrichment using an unbiased, computational approach. Application of this signature identified modest differences across the ten profiled genetic backgrounds. Whether these observations result in differences in meaningful functional outcomes will require additional testing to determine.Previous reports have demonstrated the application of gene-editing technologies to generate biallelic correction of the Z mutation (Choi et al., 2013, Smith et al., 2015, Yusa et al., 2012). The ability to generate such isogenic, gene-corrected variants is important to control for genetic heterogeneity in the conduct of experiments evaluating the contribution of specific mutations to disease pathogenesis. Here, we illustrate a detailed, reproducible approach to CRISPR/Cas9-based editing of the Z mutation using open source reagents that we have applied to multiple PiZZ iPSC lines in our repository. These edited lines, together with other lines described here, are now freely available for sharing with academic and industry researchers.In summary, we have created a repository of iPSCs with detailed associated clinical characterization from individuals with AATD. Through systematic characterization of the capacity of selected lines to generate disease-associated cell types, we have sought to illustrate and enhance the utility of this repository to interested investigators. The lines characterized in this study were created with donor consent explicitly allowing for sharing and commercialization, are available by request, and can be found online at https://stemcellbank.bu.edu. We are hopeful that the repository and its constituent lines will thus be applied to advance understanding of AATD-associated lung and liver disease pathogenesis and thereby benefit AATD patients.
Experimental Procedures
Subjects
The recruitment of human subjects and all iPSC studies were approved by the Boston University institutional review board (BUMC IRB protocols H-29791, H-32506, and H-33064). Subjects were recruited from clinical practice at the Alpha-1 Foundation-associated patient support group and patient education meetings and in association with clinical studies of AATD patients (ClinicalTrials.gov identifiers: NCT02014415; NCT01379469). At the time of recruitment demographic and detailed clinical data were collected and managed using REDCap electronic data capture tools hosted at Boston University (Harris et al., 2009, Harris et al., 2019).
iPSC Line Generation and Maintenance
Derivation of iPSC lines in the repository was performed as described previously (Choi et al., 2014, Somers et al., 2010, Sommer et al., 2009, Wilson et al., 2015). In brief, each iPSC line was generated from either human dermal fibroblasts or PBMCs. After expansion, cells were reprogrammed using either the STEMCCA lentiviral reprogramming vector (Somers et al., 2010) or Sendai virus CytoTune2 Reprogramming Kit (Thermo Fisher Scientific). Following reprogramming, each line was maintained on murine embryonic fibroblast feeders with knockout serum replacement supplemented medium, or in feeder free conditions with mTeSR-1 medium (STEMCELL Technologies).
iPSC Sharing and Distribution
To facilitate sharing of iPSC lines in the repository, an online catalog was created with the URL https://stemcellbank.bu.edu. This catalog includes detailed information pertaining to each repository iPSC line, including but not limited to genotype, race/ethnicity, sex, age at sample collection, karyotype, short tandem repeat analysis, pluripotency characterization, testing for residual Sendai virus, associated gene-edited lines, and links to associated genomic datasets. Of note, iPSC lines are only made routinely available for distribution once they have undergone this detailed quality control processing.
As described previously (Jacob et al., 2017, McCauley et al., 2018), NKX2-1+ lung progenitors were generated from iPSCs as follows: Cells were patterned into definitive endoderm using the STEMdiff Definitive Endoderm Kit (STEMCELL Technologies) as per the manufacturer's instructions. Cells were further differentiated using stage-specific factors to generate first anterior foregut endoderm and then iPSC-lung progenitors. On day 15 of differentiation, primordial lung progenitor cells were sorted based on CD47hi/CD26lo gating (Hawkins et al., 2017).
iHep Generation
iPSC-directed differentiation toward the hepatic lineage was performed using our previously published protocol (Kaserman and Wilson, 2017, Wilson et al., 2015). In brief, cells were patterned into definitive endoderm using the STEMdiff Definitive Endoderm Kit as per the manufacturer's instructions. Cells were passaged on day 5 of differentiation and then cultured for an additional 21 days using stage-specific growth factors to specify the hepatic lineage and induce maturation. Detailed protocols for derivation of iPSC lung and hepatic cells are freely accessible at: https://www.bu.edu/dbin/stemcells/protocols.php.
PHH
Commercially available, cryopreserved PHH (human plateable, induction qualified) were obtained from Thermo Fisher Scientific. Single vials containing 8–14 × 106 cells were obtained from six genetically distinct lots and thawed as per the manufacturer's instructions. Cells (2 × 106) were used to seed four type I collagen-coated wells in 24-well format to verify viability; the remaining cells were immediately pelleted at the time of thaw and lyzed in QIAzol Reagent (QIAGEN) for RNA sequencing.
Fluorescence-Activated Cell Sorting Analysis
Efficiency of definitive endoderm induction was quantified using CD184-PE (STEMCELL Technologies) and CD117-APC (Thermo Fisher Scientific) antibodies. To quantify intracellular protein content, fixed and permeabilized iHeps were stained using anti-human AAT (Santa Cruz Biotechnologies) and anti-human FOXA1 (Santa Cruz Biotechnologies) antibodies followed by anti-mouse IgG1-Alexa Fluor 647 (Jackson ImmunoResearch) and IgG2a-DyLight 488 (Jackson ImmunoResearch) antibodies (Wilson et al., 2015).
RNA Sequencing
Total RNA was isolated using an miRNeasy kit (QIAGEN) as per the manufacturer's instruction. For DGE the concentrations of all samples were normalized and sent to the Broad Institute for library construction and sequencing (Cacchiarelli et al., 2015, Park et al., 2017). Reads were then aligned to the ENSEMBL human reference genome. The count matrix was filtered and normalized to facilitate linear model fitting and differential expression testing. A liver enrichment score was derived by identifying differentially expressed genes between PHH and undifferentiated iPSCs.
CRISPR-Based Editing of Z Mutation
The CRISPR/Cas9 endonuclease system was utilized to induce scarless correction of the “Z” mutation in the SERPINA1 gene. A 70 bp single-stranded oligodeoxynucleotide repair template was synthesized (Integrated DNA Technologies) and delivered along with a single plasmid containing both the gRNA and Cas9-2A-GFP sequence via nucleofection (Lonza) to undifferentiated iPSCs. After 48 h, GFP+ cells were sorted and sparsely plated to achieve clonal outgrowth. Colonies were then screened for repair template incorporation and were gene sequence confirmed via DNA sequencing.
ELISA
Secreted AAT was quantified from AATD iHep supernatants using the human alpha-1-antitrypsin ELISA quantification kit (GenWay Biotech) as per the manufacturer's instructions.
Western/Immunoblot
Cell protein lysates and supernatants were collected as described previously (Wilson et al., 2015), denatured, and run on a 4%–12% Bis-Tris gel in MOPS SDS buffer using the NuPAGE system (Invitrogen). Proteins were transferred to a polyvinylidene fluoride membrane. Membranes were probed with antibodies against AAT (Santa Cruz Biotechnologies) and GAPDH (EMD Millipore). Signal was detected using goat anti-mouse-conjugated horseradish peroxidase (Bio-Rad) and imaged on an LAS-4000 chemiluminescent imager (Fuji).
Data and Code Availability
The GEO accession number for the DGE data reported in this paper is GSE140732.
Author Contributions
J.E.K., K.H., D.N.K., and A.A.W. contributed to conception and design of the study. J.E.K., K.H., M.D., M.V., J.-C.J., M.F.J., M.H., and N.J.L. contributed to the data collection. M.D., J.T., D.N.K., and A.A.W. contributed to participant recruitment and collection of data and samples. J.E.K., K.H., G.R.W., R.S.J.E., C.V.-M., D.N.K., and A.A.W. contributed to the data analysis. J.E.K., K.H., D.N.K., and A.A.W. contributed to the manuscript preparation.
Authors: Paul A Harris; Robert Taylor; Robert Thielke; Jonathon Payne; Nathaniel Gonzalez; Jose G Conde Journal: J Biomed Inform Date: 2008-09-30 Impact factor: 6.317
Authors: Emer P Reeves; Ciara A O'Dwyer; Danielle M Dunlea; Mark R Wormald; Padraig Hawkins; Mohammad Alfares; Darrell N Kotton; Steven M Rowe; Andrew A Wilson; Noel G McElvaney Journal: Am J Respir Crit Care Med Date: 2018-10-15 Impact factor: 21.405
Authors: Edgar N Tafaleng; Souvik Chakraborty; Bing Han; Pamela Hale; Wanquan Wu; Alejandro Soto-Gutierrez; Carol A Feghali-Bostwick; Andrew A Wilson; Darrell N Kotton; Masaki Nagaya; Stephen C Strom; Jayanta Roy-Chowdhury; Donna B Stolz; David H Perlmutter; Ira J Fox Journal: Hepatology Date: 2015-04-13 Impact factor: 17.425
Authors: Aba Somers; Jyh-Chang Jean; Cesar A Sommer; Amel Omari; Christopher C Ford; Jason A Mills; Lei Ying; Andreia Gianotti Sommer; Jenny M Jean; Brenden W Smith; Robert Lafyatis; Marie-France Demierre; Daniel J Weiss; Deborah L French; Paul Gadue; George J Murphy; Gustavo Mostoslavsky; Darrell N Kotton Journal: Stem Cells Date: 2010-10 Impact factor: 6.277
Authors: Andrew A Wilson; Lei Ying; Marc Liesa; Charis-Patricia Segeritz; Jason A Mills; Steven S Shen; Jyhchang Jean; Geordie C Lonza; Derek C Liberti; Alex H Lang; Jean Nazaire; Adam C Gower; Franz-Josef Müeller; Pankaj Mehta; Adriana Ordóñez; David A Lomas; Ludovic Vallier; George J Murphy; Gustavo Mostoslavsky; Avrum Spira; Orian S Shirihai; Maria I Ramirez; Paul Gadue; Darrell N Kotton Journal: Stem Cell Reports Date: 2015-04-02 Impact factor: 7.765
Authors: Su Mi Choi; Yonghak Kim; Joong Sup Shim; Joon Tae Park; Rui-Hong Wang; Steven D Leach; Jun O Liu; Chuxia Deng; Zhaohui Ye; Yoon-Young Jang Journal: Hepatology Date: 2013-06 Impact factor: 17.425
Authors: Lu Tan; Jennifer A Dickens; Dawn L Demeo; Elena Miranda; Juan Perez; S Tamir Rashid; James Day; Adriana Ordoñez; Stefan J Marciniak; Imran Haq; Alan F Barker; Edward J Campbell; Edward Eden; Noel G McElvaney; Stephen I Rennard; Robert A Sandhaus; James M Stocks; James K Stoller; Charlie Strange; Gerard Turino; Farshid N Rouhani; Mark Brantly; David A Lomas Journal: Eur Respir J Date: 2014-03-06 Impact factor: 16.671
Authors: Charis-Patricia Segeritz; Sheikh Tamir Rashid; Miguel Cardoso de Brito; Maria Paola Serra; Adriana Ordonez; Carola Maria Morell; Joseph E Kaserman; Pedro Madrigal; Nicholas R F Hannan; Laurent Gatto; Lu Tan; Andrew A Wilson; Kathryn Lilley; Stefan J Marciniak; Bibek Gooptu; David A Lomas; Ludovic Vallier Journal: J Hepatol Date: 2018-06-05 Impact factor: 25.083
Authors: Alberto Posabella; Andrea B Alber; Hendrik J Undeutsch; Raoul A Droeser; Anthony N Hollenberg; Laertis Ikonomou; Darrell N Kotton Journal: Front Endocrinol (Lausanne) Date: 2021-04-20 Impact factor: 5.555
Authors: Adam J Hume; Baylee Heiden; Judith Olejnik; Ellen L Suder; Stephen Ross; Whitney A Scoon; Esther Bullitt; Maria Ericsson; Mitchell R White; Jacquelyn Turcinovic; Tran T N Thao; Ryan M Hekman; Joseph E Kaserman; Jessie Huang; Konstantinos-Dionysios Alysandratos; Gabor E Toth; Ferenc Jakab; Darrell N Kotton; Andrew A Wilson; Andrew Emili; Volker Thiel; John H Connor; Gabor Kemenesi; Daniel Cifuentes; Elke Mühlberger Journal: PLoS Pathog Date: 2022-02-04 Impact factor: 6.823
Authors: Christoph Schaniel; Priyanka Dhanan; Bin Hu; Yuguang Xiong; Teeya Raghunandan; David M Gonzalez; Rafael Dariolli; Sunita L D'Souza; Arjun S Yadaw; Jens Hansen; Gomathi Jayaraman; Bino Mathew; Moara Machado; Seth I Berger; Joseph Tripodi; Vesna Najfeld; Jalaj Garg; Marc Miller; Colleen S Surlyn; Katherine C Michelis; Neelima C Tangirala; Himali Weerahandi; David C Thomas; Kristin G Beaumont; Robert Sebra; Milind Mahajan; Eric Schadt; Dusica Vidovic; Stephan C Schürer; Joseph Goldfarb; Evren U Azeloglu; Marc R Birtwistle; Eric A Sobie; Jason C Kovacic; Nicole C Dubois; Ravi Iyengar Journal: Stem Cell Reports Date: 2021-11-04 Impact factor: 7.765
Authors: Rhiannon B Werder; Joseph E Kaserman; Michael S Packer; Jonathan Lindstrom-Vautrin; Carlos Villacorta-Martin; Lauren E Young; Yvonne Aratyn-Schaus; Francine Gregoire; Andrew A Wilson Journal: Mol Ther Date: 2021-07-02 Impact factor: 12.910