| Literature DB >> 28850106 |
Shuhei Noguchi1, Takahiro Arakawa1,2, Shiro Fukuda2, Masaaki Furuno1,2, Akira Hasegawa1,2, Fumi Hori1,2, Sachi Ishikawa-Kato1,2, Kaoru Kaida2, Ai Kaiho2, Mutsumi Kanamori-Katayama2, Tsugumi Kawashima1,2, Miki Kojima1,2, Atsutaka Kubosaki2, Ri-Ichiroh Manabe1,2, Mitsuyoshi Murata1,2, Sayaka Nagao-Sato1,2, Kenichi Nakazato2, Noriko Ninomiya2, Hiromi Nishiyori-Sueki1,2, Shohei Noma1,2, Eri Saijyo2, Akiko Saka2, Mizuho Sakai1,2, Christophe Simon2, Naoko Suzuki1,2, Michihira Tagami1,2, Shoko Watanabe1,2, Shigehiro Yoshida2, Peter Arner3,4, Richard A Axton5, Magda Babina6, J Kenneth Baillie7, Timothy C Barnett8,9, Anthony G Beckhouse10, Antje Blumenthal11, Beatrice Bodega12, Alessandro Bonetti1,2, James Briggs13, Frank Brombacher14,15,16, Ailsa J Carlisle7, Hans C Clevers17,18, Carrie A Davis19, Michael Detmar20, Taeko Dohi21, Albert S B Edge22, Matthias Edinger23,24, Anna Ehrlund3,4, Karl Ekwall25, Mitsuhiro Endoh26, Hideki Enomoto27, Afsaneh Eslami28, Michela Fagiolini29, Lynsey Fairbairn7, Mary C Farach-Carson30, Geoffrey J Faulkner31, Carmelo Ferrai32, Malcolm E Fisher7, Lesley M Forrester5, Rie Fujita33, Jun-Ichi Furusawa26, Teunis B Geijtenbeek34, Thomas Gingeras19, Daniel Goldowitz35, Sven Guhl6, Reto Guler14,15,16, Stefano Gustincich36,37, Thomas J Ha35, Masahide Hamaguchi38, Mitsuko Hara39, Yuki Hasegawa1,2, Meenhard Herlyn40, Peter Heutink41, Kelly J Hitchens8,13, David A Hume7, Tomokatsu Ikawa26, Yuri Ishizu1,2, Chieko Kai42,43, Hiroshi Kawamoto26, Yuki I Kawamura21, Judith S Kempfle22, Tony J Kenna44, Juha Kere25,45, Levon M Khachigian46,47, Toshio Kitamura48, Sarah Klein20, S Peter Klinken49, Alan J Knox50, Soichi Kojima39, Haruhiko Koseki26, Shigeo Koyasu26, Weonju Lee51, Andreas Lennartsson25, Alan Mackay-Sim52, Niklas Mejhert3,4, Yosuke Mizuno53, Hiromasa Morikawa38, Mitsuru Morimoto27, Kazuyo Moro26, Kelly J Morris32, Hozumi Motohashi54, Christine L Mummery55, Yutaka Nakachi53,56, Fumio Nakahara48, Toshiyuki Nakamura42, Yukio Nakamura57, Tadasuke Nozaki58, Soichi Ogishima59, Naganari Ohkura38, Hiroshi Ohno26, Mitsuhiro Ohshima60, Mariko Okada-Hatakeyama26,61, Yasushi Okazaki53,56, Valerio Orlando12,62, Dmitry A Ovchinnikov13, Robert Passier55, Margaret Patrikakis46, Ana Pombo32, Swati Pradhan-Bhatt63, Xian-Yang Qin39, Michael Rehli23,24, Patrizia Rizzu41, Sugata Roy2, Antti Sajantila64, Shimon Sakaguchi38, Hiroki Sato42, Hironori Satoh33, Suzana Savvi14,15,16, Alka Saxena2, Christian Schmidl23, Claudio Schneider65, Gundula G Schulze-Tanzil66, Anita Schwegmann14,15,16, Guojun Sheng67, Jay W Shin1,2, Daisuke Sugiyama68, Takaaki Sugiyama42, Kim M Summers7, Naoko Takahashi2, Jun Takai33, Hiroshi Tanaka28, Hideki Tatsukawa69, Andru Tomoiu7, Hiroo Toyoda54, Marc van de Wetering17, Linda M van den Berg34, Roberto Verardo70, Dipti Vijayan71, Christine A Wells72, Louise N Winteringham49, Ernst Wolvetang13, Yoko Yamaguchi73, Masayuki Yamamoto33, Chiyo Yanagi-Mizuochi74, Misako Yoneda42, Yohei Yonekura27, Peter G Zhang35, Silvia Zucchelli36, Imad Abugessaisa1, Erik Arner1,2, Jayson Harshbarger1,2, Atsushi Kondo1,2, Timo Lassmann1,2,75, Marina Lizio1,2, Serkan Sahin1,2, Thierry Sengstag2, Jessica Severin1,2, Hisashi Shimoji2,76, Masanori Suzuki2, Harukazu Suzuki1,2, Jun Kawai2,77, Naoto Kondo1,2, Masayoshi Itoh1,2,77, Carsten O Daub1,2,25, Takeya Kasukawa1, Hideya Kawaji1,2,76,77, Piero Carninci1,2, Alistair R R Forrest1,2,49, Yoshihide Hayashizaki2,77.
Abstract
In the FANTOM5 project, transcription initiation events across the human and mouse genomes were mapped at a single base-pair resolution and their frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled with single-molecule sequencing. Approximately three thousands of samples, consisting of a variety of primary cells, tissues, cell lines, and time series samples during cell activation and development, were subjected to a uniform pipeline of CAGE data production. The analysis pipeline started by measuring RNA extracts to assess their quality, and continued to CAGE library production by using a robotic or a manual workflow, single molecule sequencing, and computational processing to generate frequencies of transcription initiation. Resulting data represents the consequence of transcriptional regulation in each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE profiles, approximately 200,000 and 150,000 peaks for the human and mouse genomes, were identified and annotated to provide precise location of known promoters as well as novel ones, and to quantify their activities.Entities:
Mesh:
Year: 2017 PMID: 28850106 PMCID: PMC5574368 DOI: 10.1038/sdata.2017.112
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Summary of FANTOM5 phase 1 and phase 2 samples.
| Cell lines | 259 | 1 | 9 | 0 | 269 |
| Fractionations | 12 | 0 | 9 | 0 | 21 |
| Primary cells | 537 | 109 | 24 | 31 | 701 |
| Timecourse samples | 35 | 19 | 748 | 572 | 1,374 |
| Tissues | 150 | 237 | 33 | 45 | 465 |
| Quality control samples | 0 | 1 | 0 | 1 | 2 |
| Total | 993 | 367 | 823 | 649 | 2,832 |
Figure 1Data processing scheme.
Data processing scheme from sample preparation to CAGE peak expression and annotation. Sky blue and beige color indicate locations storing the data, the FANTOM5 data archive (Data Citation 1, Data Citations 10) and in DDBJ Sequence Read Archive (Data Citations 2–9) respectively.
Sequence files (CTSS files).
| Cell lines | 261 | 1 | 10 | 0 | 272 |
| Fractionations | 12 | 0 | 9 | 0 | 21 |
| Primary cells | 538 | 110 | 26 | 50 | 724 |
| Timecourse samples | 35 | 20 | 750 | 578 | 1,383 |
| Tissues | 152 | 236 | 36 | 45 | 469 |
| Quality control samples | 0 | 28 | 0 | 122 | 150 |
| Total | 998 | 395 | 831 | 795 | 3,019 |
Figure 2RNA and mapping quality control.
Distribution of RIN score (a), A260/A230 (b), A260/A280 (c), mapped reads (d), and promoter rate (e) for samples used for FANTOM5 expression analysis.
Figure 3Hierarchical clustering of primary cells.
Hierarchical clustering of primary cell samples of mouse based on logarithm of expression (TPM). Color shows anatomical categories of samples.