| Literature DB >> 28873399 |
Marina Lizio1,2, Ruslan Deviatiiarov3, Hiroki Nagai4,5, Laura Galan6, Erik Arner1,2, Masayoshi Itoh1,2,7, Timo Lassmann1,2, Takeya Kasukawa1, Akira Hasegawa1, Marian A Ros6, Yoshihide Hayashizaki2,7, Piero Carninci1,2, Alistair R R Forrest1,2,8, Hideya Kawaji1,2,7, Oleg Gusev1,3,7,9, Guojun Sheng4,5.
Abstract
Cap Analysis of Gene Expression (CAGE) in combination with single-molecule sequencing technology allows precision mapping of transcription start sites (TSSs) and genome-wide capture of promoter activities in differentiated and steady state cell populations. Much less is known about whether TSS profiling can characterize diverse and non-steady state cell populations, such as the approximately 400 transitory and heterogeneous cell types that arise during ontogeny of vertebrate animals. To gain such insight, we used the chick model and performed CAGE-based TSS analysis on embryonic samples covering the full 3-week developmental period. In total, 31,863 robust TSS peaks (>1 tag per million [TPM]) were mapped to the latest chicken genome assembly, of which 34% to 46% were active in any given developmental stage. ZENBU, a web-based, open-source platform, was used for interactive data exploration. TSSs of genes critical for lineage differentiation could be precisely mapped and their activities tracked throughout development, suggesting that non-steady state and heterogeneous cell populations are amenable to CAGE-based transcriptional analysis. Our study also uncovered a large set of extremely stable housekeeping TSSs and many novel stage-specific ones. We furthermore demonstrated that TSS mapping could expedite motif-based promoter analysis for regulatory modules associated with stage-specific and housekeeping genes. Finally, using Brachyury as an example, we provide evidence that precise TSS mapping in combination with Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)-on technology enables us, for the first time, to efficiently target endogenous avian genes for transcriptional activation. Taken together, our results represent the first report of genome-wide TSS mapping in birds and the first systematic developmental TSS analysis in any amniote species (birds and mammals). By facilitating promoter-based molecular analysis and genetic manipulation, our work also underscores the value of avian models in unravelling the complex regulatory mechanism of cell lineage specification during amniote development.Entities:
Mesh:
Year: 2017 PMID: 28873399 PMCID: PMC5600399 DOI: 10.1371/journal.pbio.2002887
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Fig 1Developmental stages selected for Cap Analysis of Gene Expression (CAGE) study.
(A) Representative images of chicken embryos used for RNA preparation. (B) Temporal distribution of these selected stages (red dots) during chicken embryogenesis. X-axis: 46 Hamburger and Hamilton (HH) stages represented in numerical order. Y-axis: log scale of incubation time needed to reach these stages (based on reported estimates), e.g., Y value of 2.0 = 100 hours. Numerical values for this plot can be found in S1 Data. (C) Principal coordinates analysis (PCoA) of all 16 developmental samples. Four groups, representing early, mid, mid-late, and late developmental phases, can be recognized. (D) PCoA of all samples. Primary cells are clearly grouped away from each other and from developmental samples. Raw data for PCoA can be accessed at http://fantom.gsc.riken.jp/5/suppl/Lizio_et_al_2017/.
Fig 2Chicken-ZENBU views of transcription start site (TSS) peaks and expression levels.
(A) A robust single TSS peak (arrow) is correctly mapped to the known Ensembl and RefSeq 5′ end of GAPDH. Right panel: zoomed-in view of left panel. (B) Ambiguity in ACTB gene annotation can be resolved using Cap Analysis of Gene Expression (CAGE). (C) The TSS representative of RPL32 gene does not confirm either available annotation, suggesting the incorrectness of both gene models. (D) Bar graph of expression values shows NANOG pluripotency gene present at early stages, then down-regulated at later stages. (E) Late stage–specific expression of GFAP gene (Hamburger and Hamilton stage 41 [HH41] and HH45). Samples in the bar graphs are sorted by developmental stage.
Fig 3Expression profiles of stage- and cell type–specific transcription start sites (TSSs).
Cap Analysis of Gene Expression (CAGE) TSSs associated with pluripotency and germ layer–specific genes show distinct expression patterns during development. Pluripotency-related genes (NANOG, POU5F3, MYC, EOMES) show early stage–specific expression. Ectoderm-, mesoderm-, and endoderm-related genes show opposite expression patterns, being activated at later stages of development. X-axis represents developmental stages; y-axis represents tag per million (TPM) expression values on a logarithmic scale. Numerical values for this plot can be found in S1 Data.
Fig 4Chicken housekeeping genes.
Expression in tags per million (TPM) across all 26 samples for the top 30 housekeeping genes with lowest variance is compared to expression of well-known housekeeping genes GAPDH and ACTB. Genes are listed by variance (low to high). Numerical values for this plot can be found in S1 Data.
Fig 5Shared and unique transcription factor binding site motifs in the vicinity of transcription start sites (TSSs) enriched in sample subgroups.
(A) The top 50 enriched motifs in each subgroup are shown. Embryo group contains early, middle, middle-late, and late subgroups. Primary cells group contains mesenchymal stem cells (MSCs), aortic smooth muscle cells (SMCs), and hepatocytes subgroups. Other tissues group contains leg and wing buds and extraembryonic subgroups. (B) Venn diagram showing overlaps of those top 50 motifs for embryo, primary cells, and other tissues against housekeeping groups. The number of shared motifs between groups is low, suggesting stage- or cell type–specific regulatory programs.
Fig 6Induction of endogenous Brachyury expression mediated by Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)-on.
(A) Chicken-ZENBU view of a 6-kb window on chromosome 3, showing the transcription start site (TSS) location for the Brachyury gene. Small rectangle represents the region shown in (B). (B) Sequences of 4 single guide RNAs (sgRNAs; red underline) located within 300 bp upstream of the Brachyury TSS. Blue: protospacer-adjacent motif (PAM). (C) Electroporated embryos were grown to Hamburger and Hamilton stage 10 (HH10), and sgRNA-expressing territories were assessed by co-electroporated GFP signal. Embryos were then processed for Brachyury in situ hybridization (shown here). (D) Magnified view of mid-region of the embryo, showing ectopic Brachyury+ cells in somites and neural tube.