Literature DB >> 26981420

Genome-wide RNA-seq and ChIP-seq reveal Linc-YY1 function in regulating YY1/PRC2 activity during skeletal myogenesis.

Kun Sun1, Liang Zhou2, Yu Zhao2, Huating Wang3, Hao Sun1.   

Abstract

Little is known how lincRNAs are involved in skeletal myogenesis. Here we describe the discovery and functional annotation of Linc-YY1, a novel lincRNA originating from the promoter of the transcription factor (TF) Yin Yang 1 (YY1). Starting from whole transcriptome shotgun sequencing (a.k.a. RNA-seq) data from muscle C2C12 cells, a series of bioinformatics analysis was applied towards the identification of hundreds of high-confidence novel lincRNAs. Genome-wide approaches were then employed to demonstrate that Linc-YY1 functions to promote myogenesis through associating with YY1 and regulating YY1/PRC2 transcriptional activity in trans. Here we describe the details of the ChIP-seq, RNA-seq experiments, and data analysis procedures associated with the study published by Zhou and colleagues in the Nature Communications Journal in 2015 Zhou et al. (2015) [1]. The data was deposited on NCBI's Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/) with accession number GSE74049.

Entities:  

Keywords:  C2C12 cells; ChIP-seq; Linc-YY1; Myogenesis; RNA-seq

Year:  2016        PMID: 26981420      PMCID: PMC4778671          DOI: 10.1016/j.gdata.2016.01.022

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Direct link to deposited data

http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE74049.

Experimental design, materials, and methods

Cell culture

Mouse C2C12 myoblasts cell line was purchased from American Type Culture Collection (ATCC). The myoblasts were maintain in growth medium (DMEM, 10%FBS and 1% Penicillin/Streptomycin), and induced to myotubes by culturing in differentiation medium (DMEM, 2% horse serum and 1% Penicillin/Streptomycin).

ChIP assays and sequencing experiments

ChIP assays were performed as previously described [2], [3]. About 2 × 107 C2C12 cells and 5 μg of antibodies were used in one immunoprecipitation. The antibodies include YY1 (Santa Cruz Biotechnology), Ezh2 (Active Motif), Eed (Millipore), trimethyl-histone H3-K27 (Millipore), or normal mouse IgG (Santa Cruz Biotechnology) as a negative control. For library construction, we used a protocol as described before [4]. Briefly, the immunoprecipitated DNA (~ 10 ng) were end-repaired, and A-nucleotide overhangs were then added, followed by adapter ligation, PCR enrichment, size selection and purification. The purified DNA library products were evaluated using Bioanalyzer (Agilent) and SYBR qPCR and diluted to 10 nM for sequencing on Illumina Hi-seq 2000 sequencer (YY1) (pair-end with 50 bp) or Illumina Genome Analyzer II sequencer (Ezh2, Eed and H3K27me3) (pair-end with 36 bp). Technical replicates were prepared by sequencing the same library twice. A data analysis pipeline CASAVA 1.8 (Illumina) was employed to perform the initial bioinformatics analysis (base calling). Table 1 lists all the experiments that we had performed. For MB YY1, we performed two biological replicates with the antibody SC-1703 and a third biological replicate with a second antibody AB58066. We also performed two technical replicates for each antibody (run 1 and run 2).
Table 1

Sequencing and mapping information for the experiments.

TypeSampleTreatmentRaw readsPre-processedMapped readsMappability (%)
ChIP-seqEzh2Vector transfected33,033,076NA22,480,37168.1
ChIP-seqEEDVector transfected42,772,454NA32,563,47676.1
ChIP-seqH3K27me3Vector transfected42,897,641NA33,224,32477.5
ChIP-seqIgGVector transfected46,757,140NA35,189,69675.3
ChIP-seqYy1Linc-YY1 transfected120,778,67528,875,46822,368,41177.5
ChIP-seqEzh2Linc-YY1 transfected45,143,782NA35,378,56078.4
ChIP-seqEEDLinc-YY1 transfected45,229,883NA34,536,36176.4
ChIP-seqH3K27me3Linc-YY1 transfected46,306,903NA35,814,70777.3
ChIP-seqIgGLinc-YY1 transfected38,713,272NA27,863,81272.0
RNA-seqsiNCNegative control oligo transfected248,983,238112,409,99794,952,57584.5
RNA-seqsiLinc-YY1siLinc-YY1 oligo transfected275,819,57164,514,69852,448,80481.3

The preprocessing procedure is only performed on samples with high duplication rate designed to remove the duplicated reads while not for others with low duplication rate (shown as ‘NA’).

Read alignment and peak calling

The sequenced reads were mapped to the mouse reference genome (UCSC mm9, non-repeat-masked) using SOAP2 [5] (v2.20) allowing a maximum of two mismatches and only the uniquely aligned reads were kept. The sequencing and mapping information from each dataset were shown in Table 1. The protein-DNA binding peaks were identified using Model-based Analysis for ChIP-seq (MACS [6], v1.4.2) with IgG control sample as background. The p-value cutoff was set as 10− 5 to call high-confidence binding sites.

Whole transcriptome sequencing experiments

Preparation of RNA-seq libraries for sequencing on the Illumina platforms was carried out using the RNA-Seq Sample Preparation Kit (catalog number RS-930-1001) according to the manufacturer's standard protocol. Briefly, purified RNA was fragmented via incubation for 5 min at 94'C with the Illumina supplied fragmentation buffer. The first strand of cDNA was next synthesized by reverse transcription using random oligo primers. Second-strand synthesis was conducted by incubation with RNase H and DNA polymerase I. The resulting double-stranded DNA fragments were subsequently end-repaired and A-nucleotide overhangs were added by incubation with Taq Klenow lacking exonuclease activity. After the attachment of anchor sequences, fragments were PCR amplified using Illumina-supplied primers and loaded onto the Hiseq 2000 or GAIIx flow cell. DNA clusters were generated with an Illumina cluster station with Paired-End Cluster Generation Kit v2 (Illumina), followed by 50 (or 26 bp)*2 cycles of sequencing on sequencer with Sequencing Kit v3 (Illumina). Genome Analyzer Sequencing Control Software (SCS) v2.5, which could perform real-time image analysis and base calling, was used to carry out the image processing and base calling during the chemistry and imaging cycles of a sequencing run. The default parameters within the data analysis software (SCS v2.5) from Illumina were used to filter poor-quality reads. In the default setting, a read would be removed if a chastity of < 0.6 is observed on two or more bases among the first 25 bases.

Transcriptome assembly and novel lincRNA identification

To generate a comprehensive catalog of lincRNAs in muscle cells, we applied an integrated analysis on RNA-seq data generated by Trapnell et al. [7] and our own RNA-seq data from proliferating and differentiating C2C12 cells. The raw sequencing reads were aligned to the mouse reference genome (UCSC mm9) using Tophat (v2.0.4) [7], during which procedure the UCSC gene annotation file downloaded from Cufflinks website (http://cole-trapnelllab.github.io/cufflinks/igenome_table/index.html) was used (the ‘-G’ option). The transcriptome assembly was then performed using Cufflinks (v2.0.4) [7], and a total of 46,627 transcripts were obtained. Then sebnif (v1.2.2) [8] was employed to identify the high-confidence novel lincRNAs. In this procedure, the annotated genes, transcript size, expression level and coding potential were all considered. As a result, a total of 2413 novel lincRNAs were identified. After further annotating each of them with features including K4-K36 domain, EST tag and MyoD binding, a stringent set of 158 lincRNAs were obtained.

Differentially expressed genes analysis

To detect the differentially expressed genes between siNC- and siLinc-YY1-transfected C2C12 cells, the raw RNA-seq data were first preprocessed (adapter trimming and duplicate removing using in-house programs) and then aligned to the reference genome (UCSC mm9) using Tophat (version 2.0.4), during which procedure the UCSC gene annotation file downloaded from Cufflinks website (http://cole-trapnelllab.github.io/cufflinks/igenome_table/index.html) was used (the ‘-G’ option). The sequencing and mapping information were shown in Table 1. Cuffdiff (version 2.0.4) [7] was then applied on the aligned data set against the RefSeq gene annotation, to determine differentially expressed genes with a ‘significant’ status. The GO analysis of the differentially expressed genes was performed using DAVID (http://david.abcc.ncifcrf.gov/).
Specifications
Organism/cell line/tissueMus musculus/C2C12
SexNA
Sequencer or array typeIllumina HiSeq 1500, Illumina Hiseq 2000, Illumina GA IIx
Data formatRaw data: FASTQ filesProcessed data: BEDGRAPH, TXT
Experimental factorsMyoblast vs myotube
Experimental featuresUsing RNA-seq, we identified hundreds of high confidence novel lincRNAs in skeletal muscle. Among these lincRNAs, one near the YY1 transcription factor caused our attention, which we named Linc-YY1. We then used ChIP-seq and RNA-seq to investigate the function of this novel lincRNA during myogenesis.
ConsentNA
Sample source locationManassas, VA, USA
  8 in total

1.  A novel target of microRNA-29, Ring1 and YY1-binding protein (Rybp), negatively regulates skeletal myogenesis.

Authors:  Liang Zhou; Lijun Wang; Leina Lu; Peiyong Jiang; Hao Sun; Huating Wang
Journal:  J Biol Chem       Date:  2012-05-31       Impact factor: 5.157

2.  SOAP2: an improved ultrafast tool for short read alignment.

Authors:  Ruiqiang Li; Chang Yu; Yingrui Li; Tak-Wah Lam; Siu-Ming Yiu; Karsten Kristiansen; Jun Wang
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

3.  Linc-YY1 promotes myogenic differentiation and muscle regeneration through an interaction with the transcription factor YY1.

Authors:  Liang Zhou; Kun Sun; Yu Zhao; Suyang Zhang; Xuecong Wang; Yuying Li; Leina Lu; Xiaona Chen; Fengyuan Chen; Xichen Bao; Xihua Zhu; Lijun Wang; Ling-Yin Tang; Miguel A Esteban; Chi-Chiu Wang; Ralf Jauch; Hao Sun; Huating Wang
Journal:  Nat Commun       Date:  2015-12-11       Impact factor: 14.919

4.  Pax3/7BP is a Pax7- and Pax3-binding protein that regulates the proliferation of muscle precursor cells by an epigenetic mechanism.

Authors:  Yarui Diao; Xing Guo; Yanfeng Li; Kun Sun; Leina Lu; Lei Jiang; Xinrong Fu; Han Zhu; Hao Sun; Huating Wang; Zhenguo Wu
Journal:  Cell Stem Cell       Date:  2012-08-03       Impact factor: 24.633

5.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.

Authors:  Cole Trapnell; Brian A Williams; Geo Pertea; Ali Mortazavi; Gordon Kwan; Marijke J van Baren; Steven L Salzberg; Barbara J Wold; Lior Pachter
Journal:  Nat Biotechnol       Date:  2010-05-02       Impact factor: 54.908

6.  A Novel YY1-miR-1 regulatory circuit in skeletal myogenesis revealed by genome-wide prediction of YY1-miRNA network.

Authors:  Leina Lu; Liang Zhou; Eric Z Chen; Kun Sun; Peiyong Jiang; Lijun Wang; Xiaoxi Su; Hao Sun; Huating Wang
Journal:  PLoS One       Date:  2012-02-01       Impact factor: 3.240

7.  Sebnif: an integrated bioinformatics pipeline for the identification of novel large intergenic noncoding RNAs (lincRNAs)--application in human skeletal muscle cells.

Authors:  Kun Sun; Yu Zhao; Huating Wang; Hao Sun
Journal:  PLoS One       Date:  2014-01-06       Impact factor: 3.240

8.  Model-based analysis of ChIP-Seq (MACS).

Authors:  Yong Zhang; Tao Liu; Clifford A Meyer; Jérôme Eeckhoute; David S Johnson; Bradley E Bernstein; Chad Nusbaum; Richard M Myers; Myles Brown; Wei Li; X Shirley Liu
Journal:  Genome Biol       Date:  2008-09-17       Impact factor: 13.583

  8 in total
  5 in total

1.  Heat shock cognate protein 70 promotes the differentiation of C2C12 myoblast and targets Yin Yang 1.

Authors:  Lei Chen; Tao Guo; Yan Yu; Yeqing Sun; Guangrong Yu; Liming Cheng
Journal:  Ann Transl Med       Date:  2019-10

2.  Dissection of Myogenic Differentiation Signatures in Chickens by RNA-Seq Analysis.

Authors:  Tingting Li; Genxi Zhang; Pengfei Wu; Lian Duan; Guohui Li; Qiuhong Liu; Jinyu Wang
Journal:  Genes (Basel)       Date:  2018-01-11       Impact factor: 4.096

3.  Long noncoding RNA Neat1 modulates myogenesis by recruiting Ezh2.

Authors:  Shanshan Wang; Hao Zuo; Jianjun Jin; Wei Lv; Zaiyan Xu; Yonghui Fan; Jiali Zhang; Bo Zuo
Journal:  Cell Death Dis       Date:  2019-06-26       Impact factor: 8.469

4.  LncMyoD Promotes Skeletal Myogenesis and Regulates Skeletal Muscle Fiber-Type Composition by Sponging miR-370-3p.

Authors:  Peiwen Zhang; Jingjing Du; Xinyu Guo; Shuang Wu; Jin He; Xinrong Li; Linyuan Shen; Lei Chen; Bohong Li; Jingjun Zhang; Yuhao Xie; Lili Niu; Dongmei Jiang; Xuewei Li; Shunhua Zhang; Li Zhu
Journal:  Genes (Basel)       Date:  2021-04-17       Impact factor: 4.096

5.  Male-Biased gga-miR-2954 Regulates Myoblast Proliferation and Differentiation of Chicken Embryos by Targeting YY1.

Authors:  Xiuxue Dong; Yu Cheng; Lingyun Qiao; Xin Wang; Cuiping Zeng; Yanping Feng
Journal:  Genes (Basel)       Date:  2021-08-27       Impact factor: 4.096

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.