Literature DB >> 34811797

Preparation of multiplexed control materials for cancer mutation analysis by genome editing in GM12878 cells.

Guigao Lin1,2,3, Kuo Zhang1,2,3, Yanxi Han1,3, Rongxue Peng1,3, Jinming Li1,2,3.   

Abstract

BACKGROUND: Quality control materials are necessary for assay development, test validation, and proficiency testing in cancer mutation analysis. Most of the existing controls for somatic mutations only harbor a single variant and are derived from unstable cell lines. This study aimed to establish a method to create stable multianalyte controls in a defined background by genome editing in GM12878 cells, which also can be applied for the reference of next-generation sequencing.
METHODS: GM12878 cells were electroporated with a donor plasmid containing a mutant DNA sequence and a Cas9/sgRNA expressing vector. The genome-edited GM12878 cell was validated with Sanger sequencing, amplification refractory mutation system (ARMS), and next-generation sequencing (NGS).
RESULTS: We have successfully generated a mutant GM12878 cell line harboring the defined variants including single-nucleotide variants (SNVs), small insertions and deletions (indels), and structural variants (SVs). The introduction of intended mutations in GM12878 cell line was confirmed by both ARMS and sequencing methods.
CONCLUSIONS: We developed a method for the preparation of the multiplexed controls for reference mutations in cancer gene by genome editing in GM12878 cells. This methodology can be used to generate other stable cancer reference materials with an unlimited supply.
© 2021 The Authors. Journal of Clinical Laboratory Analysis published by Wiley Periodicals LLC.

Entities:  

Keywords:  CRISPR/Cas9; control materials; genome editing; next-generation sequencing; somatic mutation

Mesh:

Year:  2021        PMID: 34811797      PMCID: PMC8761438          DOI: 10.1002/jcla.24139

Source DB:  PubMed          Journal:  J Clin Lab Anal        ISSN: 0887-8013            Impact factor:   2.352


INTRODUCTION

Next‐generation sequencing (NGS) technology has become a cornerstone in cancer management and has been widely adopted by clinical laboratories. Targeted NGS panels are commonly used to interrogate single‐nucleotide variants (SNVs), insertions and deletions (indels), copy number variants (CNVs), and structural variants (SVs). Accurate mutation signature analysis is essential for guiding cancer therapy, definitive diagnosis, and predicting survival. To ensure that NGS data are reliable and useful for clinical decision making, labs conducting NGS tests should comply with regulatory and professional quality guidelines. , Quality control of NGS workflow is challenging because of its complexity. Control materials, which can be used to evaluate biases and errors in the NGS workflow, are necessary for test validation, quality control, and proficiency testing. The National Institute for Standards and Technology (NIST) has developed several reference normal cell lines (e.g., HapMap cell line GM12878) for whole‐genome variant assessment. In the absence of reference cancer cell lines for whole genome sequencing, the normal samples are often used in the establishment of NGS test performance for somatic mutation analysis.  The World Health Organization (WHO) also developed several gene‐specific reference materials from cancer cell lines.  Most reference materials for cancer mutation testing are derived from tumor cell lines which only contain a single mutation. To meet the demand for implementing multiplexed materials as controls for NGS assays, several approaches, including pooling characterized cancer cell lines and mixing synthetic DNA constructs, were developed. However, the cell line strategy may encounter the problem of sourcing, mixing, and characterizing. In addition, chromosomal instability and genetic drift, which affect the genomic mutation profile, are especially increased in cancer cells throughout cell culture. Synthetic DNA does not resemble patient's sample and cannot be used to monitor the extraction process. Here, we aimed to develop a method to create stable and multiplexed control materials by genome editing in GM12878 cells, which also can be used for the control of matched tumor‐normal sequencing. We previously generated genome‐engineered cell lines containing intended mutations efficiently by harnessing clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR‐associated endonuclease Cas9 (Cas9)‐induced nonhomologous end joining (NHEJ). Results showed that the engineered HEK293T and A549 cells harboring inserted synthetic mutant DNA fragments can serve as valid controls for genetic testing. In this study, a long mutant DNA fragment (covering SNVs, indels and SVs) for targeted NGS test was designed and introduced into the glyceraldehyde‐3‐phosphate dehydrogenase (GAPDH) locus in GM12878 cells.

MATERIALS AND METHODS

Variant selection and donor plasmid

We selected five hotspot mutations (Table 1) in lung cancer that represent common variant types (SNVs, indels, and gene fusions). Each of the SNVs and indels was flanked by genomic sequence of EGFR exon and intron (300 bp both ends) region (Figure 1 and Table S1). For EML4‐ALK variant 1 (E13:A20), a 1000‐ bp region both upstream and downstream of the breakpoint was designed for hybrid‐capture NGS testing (Figure 1 and Table S1). A total of four small mutant DNA fragments were designed head to tail and assembled into a large fragment (Figure 1 and Table S1). In addition, two sgRNA target sequences were added at both ends for cutting. The designed mutant DNA sequence (listed in Table S1) was denovo synthesized (Sangon Biotech) and cloned into pUC57 (GenScript) as a donor and further validated by sequencing.
TABLE 1

The donor plasmid harboring mutations

PlasmidMutations
A

NM_005228.3(EGFR):c.2369C>T (p. Thr790Met),

NM_005228.3(EGFR)c.2573T>G (p. Leu858Arg), NM_005228.3(EGFR):c.2235_2249del15 (p. Glu746_Ala750del), NM_005228.3(EGFR):c.2310_2311insGGT (p. Asp770_Asn771insGly),

EML4ALK variant 1 (E13:A20)

Abbreviations: ALK, anaplastic lymphoma kinase; EGFR, epidermal growth factor receptor; EML4, echinoderm microtubule‐associated protein‐like 4.

FIGURE 1

Schematics of creating multiplexed control materials for cancer mutation testing by integration of a mutant DNA sequence in GM12878 cells. Red arrow refers sgRNA cleavage site. ALK, anaplastic lymphoma kinase; EML4, echinoderm microtubule‐associated protein‐like 4; GAPDH, glyceraldehyde‐3‐phosphate dehydrogenase; NHEJ, nonhomologous end joining

The donor plasmid harboring mutations NM_005228.3(EGFR):c.2369C>T (p. Thr790Met), NM_005228.3(EGFR)c.2573T>G (p. Leu858Arg), NM_005228.3(EGFR):c.2235_2249del15 (p. Glu746_Ala750del), NM_005228.3(EGFR):c.2310_2311insGGT (p. Asp770_Asn771insGly), EML4‐ALK variant 1 (E13:A20) Abbreviations: ALK, anaplastic lymphoma kinase; EGFR, epidermal growth factor receptor; EML4, echinoderm microtubule‐associated protein‐like 4. Schematics of creating multiplexed control materials for cancer mutation testing by integration of a mutant DNA sequence in GM12878 cells. Red arrow refers sgRNA cleavage site. ALK, anaplastic lymphoma kinase; EML4, echinoderm microtubule‐associated protein‐like 4; GAPDH, glyceraldehyde‐3‐phosphate dehydrogenase; NHEJ, nonhomologous end joining

Cas9/sgRNA plasmid

SgRNA sequences targeting human GAPDH intron 7 were designed by using CHOPCHOP.  Pairs of sgRNA oligos for each targeting site were cloned into pCAG‐eCas9‐GFP‐U6‐gRNA (Addgene # 79145) as previously reported.  Target sequences of sgRNAs are listed in Table 2.
TABLE 2

Oligonucleotides used in this study

NameSequence (5′−3′)Intended useProduct length
SgRNA1TCGATGGGTGGAGTCGCGTGsgRNA target sequence/
SgRNA2GCGCAGGGTTAGTCACCGGC/
SgRNA3TAGCGTTGACCCGACCCCAA/
T7E1‐FGGGGGACGCTTTCTTTCCTTT7EI assay633 bp
T7E1‐RTTTCCGGAAGACGGAATGGG
858FATGTCAAGATCACAGATTTTGGGCGAllele specific PCR (L858R) for detection of integrated mutant DNA128 bp
858RCTGGTCCCTGGTGTCAGGAAAA
egfr 19FGCTGGTAACATCCACCCAGASanger sequencing/
egfr 19RGAGAAAAGGTGGGCCTGAGSanger sequencing/
egfr 20FCCTCCTTCTGGCCACCATGCGSanger sequencing/
egfr 20RCATGTGAGGATCCTGGCTCCSanger sequencing/
egfr 21FCGGATGCAGAGCTTCTTCCCSanger sequencing/
egfr 21RAGGCAGCCTGGTCCCTGGTGSanger sequencing/

Abbreviations: ALK, anaplastic lymphoma kinase; EML4, echinoderm microtubule‐associated protein‐like 4; T7E1, T7 Endonuclease I.

Oligonucleotides used in this study Abbreviations: ALK, anaplastic lymphoma kinase; EML4, echinoderm microtubule‐associated protein‐like 4; T7E1, T7 Endonuclease I.

Cell culture and electroporation

GM12878, a lymphoblastoid cell line, is a well‐characterized Tier 1 cell of Encyclopedia of DNA Elements (ENCODE) project with a relatively normal karyotype. GM12878 was obtained from Coriell Institute and cultured in Roswell Park Memorial Institute 1640 medium (RPMI 1640; Gibco) supplemented with 15% fetal bovine serum (FBS; Gibco) and 1% penicillin‐streptomycin (Gibco) at 37°C with 5% CO2. The density of GM12878 cells was maintained at 3.5 × 105 cells/ml before transfection. For each transfection, 0.6 pmol of Cas9/sgRNA plasmid, 0.6 pmol of donor plasmid, and 2,000,000 cells were prepared. The cells were washed twice with PBS and resuspended in 20 μl electroporation buffer (Celetrix), then the plasmids were mixed with the cells and transferred to a 20‐μl electroporation tube (Celetrix). Electroporation was carried out at a voltage and pulse time of 440 V and 30 ms, respectively, using an electroporation machine (CTX‐1500A LE, Celetrix). Following electrotransformation, the cells were transferred to a 24‐well plate in warm medium.

DNA extraction

Genomic DNA was extracted from cells using the QIAamp DNA Mini Kit (QIAGEN) according to the manufacturer's recommendations.

T7 endonuclease I assay

Cells were collected 48 h after electroporation and genomic DNA was isolated. SgRNA targeted region (633 bp) was amplified using high‐fidelity polymerase chain reaction (PCR) master mix (ThermoFisher) and purified with zymoclean Gel DNA Recovery Kit (Zymo). Then, 200 ng of purified PCR products were denatured, annealed, and incubated with T7 Endonuclease I (T7E1) (NEB) following manufacturer's instructions. The cleavage efficiency of sgRNA was analyzed by using 2% agarose gel. PCR primers are listed in Table 2.

Fluorescence‐activated cell sorting

Cells were harvested 48 h after transfection and resuspended in PBS containing 0.5% bovine serum albumin (BSA) and 2% penicillin‐streptomycin for sorting. Single GFP‐positive cells were sorted in 96‐well plates using MoFlo Astrios EQ (Beckman Coulter). Cells were cultured in 50% conditioned media for 4–5 weeks and then transferred to 24‐well plates upon expansion.

PCR screening of positive clones

Allele‐specific PCR (detecting mutation L858R) was carried out for detection of integration of mutant DNA sequences in cell clones. DNA extracted from single clones was amplified using GoTaq Green Master Mix (Promega) and primers (listed in Table 2) following the manufacturer's instructions.

Sanger sequencing

The designed variants in the positive cell clones were validated by Sanger sequencing. Briefly, genomic DNA was first amplified with specific primers (listed in Table 2), then the PCR products were purified, and sequenced by using BigDye Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems) on the ABI 3500DX Genetic Analyzer (Applied Biosystems) according to the manufacturer's instructions.

Amplification refractory mutation system

The designed EGFR variants in positive clones were tested using amplification refractory mutation system (ARMS) with a certified EGFR mutations detection kit (AmoyDx) according to the manufacturer's instructions. Tests were performed on ABI‐7500 real‐time PCR system.

Next‐generation sequencing

Positive cells were further tested by targeted NGS. Essential NGS panel (AmoyDx), an approved hybridization capture‐based NGS kit (hotspot cancer panel), was used. Tests were conducted on Illumina Nextseq 500 system (Illumina) following the manufacturer's instructions.

RESULTS

Plasmid construction

We designed 3 sgRNAs to target the 7th intron of GAPDH gene. The cutting effectiveness of each sgRNA was assessed by the T7E1 assay using the genomic DNA extracted from the electroporated cells. Based on results (Figure S1) of the T7E1 assay, we selected sgRNA2 for further experiments. The donor plasmid harboring the corresponding mutant DNA sequence (Table 1) was constructed and validated via Sanger sequencing.

Generation of modified GM12878 harboring the intended mutations

Following transfection, we plated 960 individual cells after sorting. Most of the cells cannot survive electrotransfection and single cell culture. Only one positive clone (clone 24) was obtained among the 60 expanded and screened clones, which was designated as 12878‐Multiv. Figure 2 showed the representative PCR screening results. We further validated target variants in this positive clone by using ARMS method and Sanger sequencing (Figure 3 and Figure 4).
FIGURE 2

PCR screening of the positive clones in genome‐engineered GM12878 cells

FIGURE 3

Validation of the designed variants in genome‐edited GM12878 cell by Sanger sequencing. EGFR, epidermal growth factor receptor

FIGURE 4

Verification of the intended variants in genome‐edited GM12878 cell (12878‐Multiv) by ARMS. EGFR, epidermal growth factor receptor

PCR screening of the positive clones in genome‐engineered GM12878 cells Validation of the designed variants in genome‐edited GM12878 cell by Sanger sequencing. EGFR, epidermal growth factor receptor Verification of the intended variants in genome‐edited GM12878 cell (12878‐Multiv) by ARMS. EGFR, epidermal growth factor receptor

Variant detection using NGS

Genomic DNA NGS library was captured for target enrichment and sequenced on the Nextseq 500 system (Illumina). The sequencing data were analyzed according to the manufacturer's instructions. The mutations in the genome‐edited cell were successfully detected (Figure 5).
FIGURE 5

Integrative Genomics Viewer (IGV) screenshots of the target variants in genome‐edited GM12878 cell (12878‐Multiv) detected by NGS. (A) variant allele fraction (VAF) = 56.9%; (B) VAF = 64.3%; (C) VAF = 54.9%; (D) VAF = 50.8%; (E and F) VAF = 33.4%; ALK, anaplastic lymphoma kinase; EGFR, epidermal growth factor receptor; EML4, echinoderm microtubule‐associated protein‐like 4

Integrative Genomics Viewer (IGV) screenshots of the target variants in genome‐edited GM12878 cell (12878‐Multiv) detected by NGS. (A) variant allele fraction (VAF) = 56.9%; (B) VAF = 64.3%; (C) VAF = 54.9%; (D) VAF = 50.8%; (E and F) VAF = 33.4%; ALK, anaplastic lymphoma kinase; EGFR, epidermal growth factor receptor; EML4, echinoderm microtubule‐associated protein‐like 4

DISCUSSION

In this study, we created an engineered cell line from GM12878 using CRISPR/Cas9 technology. This cell line can be used as a multiplexed mutant standard for various detection methods including targeted gene sequencing. Paired with GM12878, they also can be used as controls for tumor‐normal sequencing. Because of its high transfection efficiency, HEK 293 are frequently used to produce genome‐edited cell lines as controls for somatic mutation testing. , , , However, HEK293 is a hypotriploid cell line which displays some cytogenetic instability and is not well‐characterized across the genome. On the contrary, GM12878 is stable, and most importantly, it is extensively characterized and serves as benchmark reference materials for NGS testing. The use of electroporation in our study offered transfection efficiencies up to 40% in GM12878, which is acceptable for downstream experiments. One shortcoming of using GM12878 is that it is difficult to grow into single clones after electroporation and FACS. Feed cells may be used. Generation of mutant cells by mutation knock‐in via on homology‐directed repair is another approach, however, the efficiency is relatively low. In addition, only nucleotides adjacent to protospacer adjacent motif (PAM) site can be efficiently edited with this strategy, which may exclude many clinical actional variants. By using CRISPR/Cas9‐induced homology‐independent integration, we can efficiently introduce any long DNA sequence harboring design variants into any locus of chromosome. We did not evaluate the off‐target effects because the artificial mutations were generated by inserting long DNA sequences harboring variants to the chromosome locus of an unrelated gene. In such a way, neither the indels in the integration site nor the insertion in the off‐target sites will have effects on the detection of the designed variants. Nonetheless, the engineered high‐fidelity eCas9 was used in our study to minimize off‐target effects. Furthermore, the methodology developed in this study can be extended to create a wide range of cancer reference materials. The results reported herein should be considered in light of some limitations. Since this is a pilot study, only five hotspot mutations were included in the engineered cell line. In addition, our developed control containing the artificial EML4‐ALK mutation cannot be used for RNA sequencing. Follow‐up studies are necessary to cover more variants in cancer panel. Finally, these control materials also need to be tested across different NGS platforms to establish broad utility. Figure S1 Click here for additional data file. Table S1 Click here for additional data file.
  13 in total

1.  Extensive sequencing of seven human genomes to characterize benchmark reference materials.

Authors:  Justin M Zook; David Catoe; Jennifer McDaniel; Lindsay Vang; Noah Spies; Arend Sidow; Ziming Weng; Yuling Liu; Christopher E Mason; Noah Alexander; Elizabeth Henaff; Alexa B R McIntyre; Dhruva Chandramohan; Feng Chen; Erich Jaeger; Ali Moshrefi; Khoa Pham; William Stedman; Tiffany Liang; Michael Saghbini; Zeljko Dzakula; Alex Hastie; Han Cao; Gintaras Deikus; Eric Schadt; Robert Sebra; Ali Bashir; Rebecca M Truty; Christopher C Chang; Natali Gulbahce; Keyan Zhao; Srinka Ghosh; Fiona Hyland; Yutao Fu; Mark Chaisson; Chunlin Xiao; Jonathan Trow; Stephen T Sherry; Alexander W Zaranek; Madeleine Ball; Jason Bobe; Preston Estep; George M Church; Patrick Marks; Sofia Kyriazopoulou-Panagiotopoulou; Grace X Y Zheng; Michael Schnall-Levin; Heather S Ordonez; Patrice A Mudivarti; Kristina Giorda; Ying Sheng; Karoline Bjarnesdatter Rypdal; Marc Salit
Journal:  Sci Data       Date:  2016-06-07       Impact factor: 6.444

2.  CRISPR/Cas9 Technology-Based Xenograft Tumors as Candidate Reference Materials for Multiple EML4-ALK Rearrangements Testing.

Authors:  Rongxue Peng; Rui Zhang; Guigao Lin; Xin Yang; Ziyang Li; Kuo Zhang; Jiawei Zhang; Jinming Li
Journal:  J Mol Diagn       Date:  2017-07-18       Impact factor: 5.568

Review 3.  Guidelines for Validation of Next-Generation Sequencing-Based Oncology Panels: A Joint Consensus Recommendation of the Association for Molecular Pathology and College of American Pathologists.

Authors:  Lawrence J Jennings; Maria E Arcila; Christopher Corless; Suzanne Kamel-Reid; Ira M Lubin; John Pfeifer; Robyn L Temple-Smolkin; Karl V Voelkerding; Marina N Nikiforova
Journal:  J Mol Diagn       Date:  2017-03-21       Impact factor: 5.568

4.  A novel cell line generated using the CRISPR/Cas9 technology as universal quality control material for KRAS G12V mutation testing.

Authors:  Shiyu Jia; Rui Zhang; Guigao Lin; Rongxue Peng; Peng Gao; Yanxi Han; Yu Fu; Jiansheng Ding; Qisheng Wu; Kuo Zhang; Jiehong Xie; Jinming Li
Journal:  J Clin Lab Anal       Date:  2018-01-30       Impact factor: 2.352

5.  Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)/CRISPR-Associated Endonuclease Cas9-Mediated Homology-Independent Integration for Generating Quality Control Materials for Clinical Molecular Genetic Testing.

Authors:  Guigao Lin; Kuo Zhang; Rongxue Peng; Yanxi Han; Jiehong Xie; Jinming Li
Journal:  J Mol Diagn       Date:  2018-05       Impact factor: 5.568

6.  Rationally engineered Cas9 nucleases with improved specificity.

Authors:  Ian M Slaymaker; Linyi Gao; Bernd Zetsche; David A Scott; Winston X Yan; Feng Zhang
Journal:  Science       Date:  2015-12-01       Impact factor: 47.728

7.  Assuring the quality of next-generation sequencing in clinical laboratory practice.

Authors:  Amy S Gargis; Lisa Kalman; Meredith W Berry; David P Bick; David P Dimmock; Tina Hambuch; Fei Lu; Elaine Lyon; Karl V Voelkerding; Barbara A Zehnbauer; Richa Agarwala; Sarah F Bennett; Bin Chen; Ephrem L H Chin; John G Compton; Soma Das; Daniel H Farkas; Matthew J Ferber; Birgit H Funke; Manohar R Furtado; Lilia M Ganova-Raeva; Ute Geigenmüller; Sandra J Gunselman; Madhuri R Hegde; Philip L F Johnson; Andrew Kasarskis; Shashikant Kulkarni; Thomas Lenk; C S Jonathan Liu; Megan Manion; Teri A Manolio; Elaine R Mardis; Jason D Merker; Mangalathu S Rajeevan; Martin G Reese; Heidi L Rehm; Birgitte B Simen; Joanne M Yeakley; Justin M Zook; Ira M Lubin
Journal:  Nat Biotechnol       Date:  2012-11       Impact factor: 54.908

8.  The mutation rates of EGFR in non-small cell lung cancer and KRAS in colorectal cancer of Chinese patients as detected by pyrosequencing using a novel dispensation order.

Authors:  Guohua Xie; Fang Xie; Ping Wu; Xiangliang Yuan; Yanhui Ma; Yunchuan Xu; Li Li; Ling Xu; Ming Yang; Lisong Shen
Journal:  J Exp Clin Cancer Res       Date:  2015-06-18

9.  CHOPCHOP v3: expanding the CRISPR web toolbox beyond genome editing.

Authors:  Kornel Labun; Tessa G Montague; Maximilian Krause; Yamila N Torres Cleuren; Håkon Tjeldnes; Eivind Valen
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

10.  Preparation of the standard cell lines for reference mutations in cancer gene-panels by genome editing in HEK 293 T/17 cells.

Authors:  Takayoshi Suzuki; Yoshinori Tsukumo; Chie Furihata; Mikihiko Naito; Arihiro Kohara
Journal:  Genes Environ       Date:  2020-02-11
View more
  1 in total

1.  Preparation of multiplexed control materials for cancer mutation analysis by genome editing in GM12878 cells.

Authors:  Guigao Lin; Kuo Zhang; Yanxi Han; Rongxue Peng; Jinming Li
Journal:  J Clin Lab Anal       Date:  2021-11-23       Impact factor: 2.352

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.