| Literature DB >> 30874552 |
Byungjin Hwang1, Wookjae Lee1, Soo-Young Yum2, Yujin Jeon1, Namjin Cho1, Goo Jang3, Duhee Bang4.
Abstract
Determining cell lineage and function is critical to understanding human physiology and pathology. Although advances in lineage tracing methods provide new insight into cell fate, defining cellular diversity at the mammalian level remains a challenge. Here, we develop a genome editing strategy using a cytidine deaminase fused with nickase Cas9 (nCas9) to specifically target endogenous interspersed repeat regions in mammalian cells. The resulting mutation patterns serve as a genetic barcode, which is induced by targeted mutagenesis with single-guide RNA (sgRNA), leveraging substitution events, and subsequent read out by a single primer pair. By analyzing interspersed mutation signatures, we show the accurate reconstruction of cell lineage using both bulk cell and single-cell data. We envision that our genetic barcode system will enable fine-resolution mapping of organismal development in healthy and diseased mammalian states.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30874552 PMCID: PMC6420643 DOI: 10.1038/s41467-019-09203-z
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Targeted deaminase system for lineage tracing. a Schematic overview of the targeted deaminase system. Red arrows indicate the single flanking primer pair used to amplify the targeted regions. The substitution pattern in the target region served as the cell barcode for tree reconstruction. b Pairwise alignment of five representative target sites for sgRNA-1 and sgRNA-3 design. These regions were amplified using a single primer pair and alignment distinguished each target site by the different surrounding sequences. c Editing efficiency of the two selected sgRNAs in the target region for HEK293T and HeLa cells. d Correlation of editing efficiency between the first and second base C in a specific window (4–8) within the sgRNA-3 spacer sequence in HEK293T cells (Spearman’s correlation = 0.6 for HeLa cells). The source data are available in the Source Data file
Fig. 2Cumulative editing and analysis of lineage tracing experiment. a Cumulative editing efficiency in the target regions using sgRNA-1 and sgRNA-3 for the defined time points. b A representation of the tree experiment for sgRNA-3 in HEK293T cells. The single-cells were sorted and expanded to a limited number of cell divisions. The bulk cells were then amplified from a 96-well plate and sequenced for cell lineage analysis. A network graph[7] was constructed from all the pairwise cell barcode information. The “−” represents the mother-child node relationship and the colors of each circle represent the different clade (pink: Clade1, yellow: Clade2, blue: Clade3, green: Clade4). The misplaced nodes are represented by a red arrow. The red solid line connects the incorrectly placed mother-daughter node, and the dotted line indicates the correct mother-daughter node connection. c Time-lapse imaging of the single-cell expansion experiment. At the end of the video, single-cells were picked and sequenced for lineage reconstruction. d An example of a tree expansion experiment for seven single-cells. The bootstrapped confidence score (bootstrapped P-values × 100, n = 1000) are shown at the branching point. Physical isolation was impossible for the left most cell (1–1–1) in (c). The source data are available in the Source Data file
Fig. 3Performance simulations of the targeted deaminase system. Continued editing was allowed until G generations (see Methods for detail). After that, the accuracy (y-axis) was calculated as the percentage of correct order of pairwise relationships between the edits generated within G generations (see Methods). We compared the effect of dropouts on accuracy. a No dropout shown by solid lines and (b) dropout applied shown by dashed lines. The inflection point is indicated by the black arrow. Different colors correspond to different numbers of target sites and the units for the editing rate (base substitution rate) are mutations/site/cell division. The source data are available in the Source Data file