Literature DB >> 27015116

Large-scale atlas of microarray data reveals the distinct expression landscape of different tissues in Arabidopsis.

Fei He1, Shinjae Yoo2,3, Daifeng Wang4, Sunita Kumari5, Mark Gerstein4, Doreen Ware5,6, Sergei Maslov1,7,8.   

Abstract

Transcriptome data sets from thousands of samples of the model plant Arabidopsis thaliana have been collectively generated by multiple individual labs. Although integration and meta-analysis of these samples has become routine in the plant research community, it is often hampered by a lack of metadata or differences in annotation styles of different labs. In this study, we carefully selected and integrated 6057 Arabidopsis microarray expression samples from 304 experiments deposited to the Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI). Metadata such as tissue type, growth conditions and developmental stage were manually curated for each sample. We then studied the global expression landscape of the integrated data set and found that samples of the same tissue tend to be more similar to each other than to samples of other tissues, even in different growth conditions or developmental stages. Root has the most distinct transcriptome, compared with aerial tissues, but the transcriptome of cultured root is more similar to the transcriptome of aerial tissues, as the cultured root samples lost their cellular identity. Using a simple computational classification method, we showed that the tissue type of a sample can be successfully predicted based on its expression profile, opening the door for automatic metadata extraction and facilitating the re-use of plant transcriptome data. As a proof of principle, we applied our automated annotation pipeline to 708 RNA-seq samples from public repositories and verified the accuracy of our predictions with sample metadata provided by the authors.
© 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

Entities:  

Keywords:  Arabidopsis; automatic reconstruction of missing metadata; expression data integration; global transcriptional landscape; metadata annotation; re-use of public expression data

Mesh:

Substances:

Year:  2016        PMID: 27015116     DOI: 10.1111/tpj.13175

Source DB:  PubMed          Journal:  Plant J        ISSN: 0960-7412            Impact factor:   6.417


  12 in total

1.  METACLUSTER-an R package for context-specific expression analysis of metabolic gene clusters.

Authors:  Michael Banf; Kangmei Zhao; Seung Y Rhee
Journal:  Bioinformatics       Date:  2019-09-01       Impact factor: 6.937

Review 2.  The Plastid and Mitochondrial Peptidase Network in Arabidopsis thaliana: A Foundation for Testing Genetic Interactions and Functions in Organellar Proteostasis.

Authors:  Kristina Majsec; Nazmul H Bhuiyan; Qi Sun; Sunita Kumari; Vivek Kumar; Doreen Ware; Klaas J van Wijk
Journal:  Plant Cell       Date:  2017-09-25       Impact factor: 11.277

3.  Tailoring high-density oligonucleotide arrays for transcript profiling of different Arabidopsis thaliana accessions using a sequence-based approach.

Authors:  Anastassia Boudichevskaia; Hieu Xuan Cao; Renate Schmidt
Journal:  Plant Cell Rep       Date:  2017-05-22       Impact factor: 4.570

4.  Large-scale transcriptome analysis reveals arabidopsis metabolic pathways are frequently influenced by different pathogens.

Authors:  Zhenhong Jiang; Fei He; Ziding Zhang
Journal:  Plant Mol Biol       Date:  2017-05-24       Impact factor: 4.076

5.  Large-Scale Public Transcriptomic Data Mining Reveals a Tight Connection between the Transport of Nitrogen and Other Transport Processes in Arabidopsis.

Authors:  Fei He; Abhijit A Karve; Sergei Maslov; Benjamin A Babst
Journal:  Front Plant Sci       Date:  2016-08-11       Impact factor: 5.753

6.  Differential Coexpression Analysis Reveals Extensive Rewiring of Arabidopsis Gene Coexpression in Response to Pseudomonas syringae Infection.

Authors:  Zhenhong Jiang; Xiaobao Dong; Zhi-Gang Li; Fei He; Ziding Zhang
Journal:  Sci Rep       Date:  2016-10-10       Impact factor: 4.379

7.  Pan- and core- network analysis of co-expression genes in a model plant.

Authors:  Fei He; Sergei Maslov
Journal:  Sci Rep       Date:  2016-12-16       Impact factor: 4.379

8.  Gene co-expression network analysis identifies trait-related modules in Arabidopsis thaliana.

Authors:  Wei Liu; Liping Lin; Zhiyuan Zhang; Siqi Liu; Kuan Gao; Yanbin Lv; Huan Tao; Huaqin He
Journal:  Planta       Date:  2019-01-30       Impact factor: 4.116

9.  Extracting genotype information of Arabidopsis thaliana recombinant inbred lines from transcript profiles established with high-density oligonucleotide arrays.

Authors:  Renate Schmidt; Anastassia Boudichevskaia; Hieu Xuan Cao; Sang He; Rhonda Christiane Meyer; Jochen Christoph Reif
Journal:  Plant Cell Rep       Date:  2017-08-30       Impact factor: 4.570

10.  Identification of regulatory modules in genome scale transcription regulatory networks.

Authors:  Qi Song; Ruth Grene; Lenwood S Heath; Song Li
Journal:  BMC Syst Biol       Date:  2017-12-15
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.