Literature DB >> 26484140

High-throughput whole-genome sequencing of E14 mouse embryonic stem cells.

Danny Incarnato1, Francesco Neri1.   

Abstract

Mouse E14 embryonic stem cells (ESCs) are the most used ESC line, often employed for genome-wide studies involving next generation sequencing analysis [1-5]. More than 2 × 10 E9 sequences made on Illumina platform derived from the genome of E14 embryonic stem cells cultured in our laboratory were used to build a database of about 2.7 × 10 E6 single nucleotide variant [6]. The database was validated using other two sequencing datasets from other laboratory and high overlap was observed. The identified variants are enriched on intergenic regions, but several thousands reside on gene exons and regulatory regions, such as promoters, enhancers, splicing site and untranslated regions of RNA, thus indicating high probability of an important functional impact on the molecular biology of these cells. We created a new E14 genome assembly including the new identified variants and used it to map reads from next generation sequencing data generated in our laboratory or in others on E14 cell line. We observed an increase in the number of mapped reads of about 5%. CpG dinucleotide showed the higher variation frequency, probably because it could be a target of DNA methylation. Data were deposited in GEO datasets under reference GSM1283021 and here: http://epigenetics.hugef-research.org/data.php.

Entities:  

Keywords:  ESC; NGS; Whole-genome E14

Year:  2014        PMID: 26484140      PMCID: PMC4535964          DOI: 10.1016/j.gdata.2014.10.023

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Direct link to deposited data

http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSM1283021 http://epigenetics.hugef-research.org/data.php

Experimental design, materials and methods

E14 mouse ES cells were cultured in ESC medium (DMEM high glucose with 15% fetal bovine serum [FBS], NNEA1x, NaPyr1x, 0.1 mM 2-mercaptoethanol, and 1500 U/ml LIF). Genomic DNA was extracted using a DNeasy Blood and Tissue kit (Qiagen). For sequencing of E14 genome, DNA was sonicated for 17′ pulse 30″ON/30″OFF high with Bioruptor Twin (Diagenode). Libraries were generated with DNA Sample Prep Kit (Illumina) and sequenced on Illumina HiScanSQ Platform. Basecalls performed using CASAVA version 1.8 following default parameters. Reads quality was estimated using FastQC tool v0.10.1 (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Nucleotide positions with a quality score under 30 (Phred33 scale) were trimmed using the fastx_trimmer tool from the FASTX Toolkit (http://hannonlab.cshl.edu/fastx_toolkit/). After low-quality positions trimming, reads in which sequencing continued through the 3′ adapter sequence were clipped using the fastx_clipper tool from the FASTX Toolkit. Then, reads were aligned to the mouse genome assembly mm9 using Bowtie [7] v0.12.7 with the following parameters: -q --max /dev/null -v 1 -S --sam-nohead -m 1. Reads with the same mapping positions were collapsed into one using the rmdup tool from SAMtools. Variants calling was performed using the mpileup tool from SAMtools [8]. Next, we used VCFtools [9] v0.1.11 (http://vcftools.sourceforge.net/) to select only SNVs with coverage of ≥10 and a frequency of ≥0.5. Moreover, using custom Perl scripts we discarded sites with more than one variant call at the same place. Finally, using the GATK v2.7-4 (http://www.broadinstitute.org/gatk/) FastaAlternateReferenceMaker function we created the new reference E14 assembly from the mm9 genome assembly. These data can be found at: http://epigenetics.hugef-research.org/data.php.
Specifications
Organism/cell line/tissueMouse E14 embryonic stem cells
SexMale
Sequencer or array typeIllumina HiScanSQ
Data formatRaw and analyzed
Experimental factorsN/A
Experimental featuresWhole genome sequencing of E14 embryonic stem cells
ConsentN/A
Sample source locationTorino, Italy
  9 in total

1.  TET1 and hydroxymethylcytosine in transcription and DNA methylation fidelity.

Authors:  Kristine Williams; Jesper Christensen; Marianne Terndrup Pedersen; Jens V Johansen; Paul A C Cloos; Juri Rappsilber; Kristian Helin
Journal:  Nature       Date:  2011-04-13       Impact factor: 49.962

2.  Dnmt3L antagonizes DNA methylation at bivalent promoters and favors DNA methylation at gene bodies in ESCs.

Authors:  Francesco Neri; Anna Krepelova; Danny Incarnato; Mara Maldotti; Caterina Parlato; Federico Galvagni; Filomena Matarese; Hendrik G Stunnenberg; Salvatore Oliviero
Journal:  Cell       Date:  2013-09-26       Impact factor: 41.582

3.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

4.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

5.  High-throughput single nucleotide variant discovery in E14 mouse embryonic stem cells provides a new reference genome assembly.

Authors:  Danny Incarnato; Anna Krepelova; Francesco Neri
Journal:  Genomics       Date:  2014-07-05       Impact factor: 5.736

6.  Integration of external signaling pathways with the core transcriptional network in embryonic stem cells.

Authors:  Xi Chen; Han Xu; Ping Yuan; Fang Fang; Mikael Huss; Vinsensius B Vega; Eleanor Wong; Yuriy L Orlov; Weiwei Zhang; Jianming Jiang; Yuin-Han Loh; Hock Chuan Yeo; Zhen Xuan Yeo; Vipin Narang; Kunde Ramamoorthy Govindarajan; Bernard Leong; Atif Shahab; Yijun Ruan; Guillaume Bourque; Wing-Kin Sung; Neil D Clarke; Chia-Lin Wei; Huck-Hui Ng
Journal:  Cell       Date:  2008-06-13       Impact factor: 41.582

7.  The variant call format and VCFtools.

Authors:  Petr Danecek; Adam Auton; Goncalo Abecasis; Cornelis A Albers; Eric Banks; Mark A DePristo; Robert E Handsaker; Gerton Lunter; Gabor T Marth; Stephen T Sherry; Gilean McVean; Richard Durbin
Journal:  Bioinformatics       Date:  2011-06-07       Impact factor: 6.937

8.  Myc and max genome-wide binding sites analysis links the Myc regulatory network with the polycomb and the core pluripotency networks in mouse embryonic stem cells.

Authors:  Anna Krepelova; Francesco Neri; Mara Maldotti; Stefania Rapelli; Salvatore Oliviero
Journal:  PLoS One       Date:  2014-02-21       Impact factor: 3.240

9.  Genome-wide analysis identifies a functional association of Tet1 and Polycomb repressive complex 2 in mouse embryonic stem cells.

Authors:  Francesco Neri; Danny Incarnato; Anna Krepelova; Stefania Rapelli; Andrea Pagnani; Riccardo Zecchina; Caterina Parlato; Salvatore Oliviero
Journal:  Genome Biol       Date:  2013-08-29       Impact factor: 13.583

  9 in total
  3 in total

1.  Double Emulsion Picoreactors for High-Throughput Single-Cell Encapsulation and Phenotyping via FACS.

Authors:  Kara K Brower; Margarita Khariton; Peter H Suzuki; Chris Still; Gaeun Kim; Suzanne G K Calhoun; Lei S Qi; Bo Wang; Polly M Fordyce
Journal:  Anal Chem       Date:  2020-09-23       Impact factor: 6.986

2.  Lysines Acetylome and Methylome Profiling of H3 and H4 Histones in Trichostatin A-Treated Stem Cells.

Authors:  Flora Cozzolino; Ilaria Iacobucci; Vittoria Monaco; Tiziana Angrisano; Maria Monti
Journal:  Int J Mol Sci       Date:  2021-02-19       Impact factor: 5.923

3.  Improving read alignment through the generation of alternative reference via iterative strategy.

Authors:  Lina Bu; Qi Wang; Wenjin Gu; Ruifei Yang; Di Zhu; Zhuo Song; Xiaojun Liu; Yiqiang Zhao
Journal:  Sci Rep       Date:  2020-10-30       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.