Literature DB >> 27930330

Simultaneous dimension reduction and adjustment for confounding variation.

Zhixiang Lin1, Can Yang2, Ying Zhu3,4, John Duchi1,5, Yao Fu6, Yong Wang7, Bai Jiang1, Mahdi Zamanighomi1, Xuming Xu4, Mingfeng Li4, Nenad Sestan4,8,9, Hongyu Zhao10, Wing Hung Wong11,12.   

Abstract

Dimension reduction methods are commonly applied to high-throughput biological datasets. However, the results can be hindered by confounding factors, either biological or technical in origin. In this study, we extend principal component analysis (PCA) to propose AC-PCA for simultaneous dimension reduction and adjustment for confounding (AC) variation. We show that AC-PCA can adjust for (i) variations across individual donors present in a human brain exon array dataset and (ii) variations of different species in a model organism ENCODE RNA sequencing dataset. Our approach is able to recover the anatomical structure of neocortical regions and to capture the shared variation among species during embryonic development. For gene selection purposes, we extend AC-PCA with sparsity constraints and propose and implement an efficient algorithm. The methods developed in this paper can also be applied to more general settings. The R package and MATLAB source code are available at https://github.com/linzx06/AC-PCA.

Entities:  

Keywords:  confounding variation; dimension reduction; transcriptome

Mesh:

Year:  2016        PMID: 27930330      PMCID: PMC5187682          DOI: 10.1073/pnas.1617317113

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  22 in total

1.  Using control genes to correct for unwanted variation in microarray data.

Authors:  Johann A Gagnon-Bartsch; Terence P Speed
Journal:  Biostatistics       Date:  2011-11-17       Impact factor: 5.899

2.  The sva package for removing batch effects and other unwanted variation in high-throughput experiments.

Authors:  Jeffrey T Leek; W Evan Johnson; Hilary S Parker; Andrew E Jaffe; John D Storey
Journal:  Bioinformatics       Date:  2012-01-17       Impact factor: 6.937

3.  Normalization of RNA-seq data using factor analysis of control genes or samples.

Authors:  Davide Risso; John Ngai; Terence P Speed; Sandrine Dudoit
Journal:  Nat Biotechnol       Date:  2014-08-24       Impact factor: 54.908

4.  Computational prediction of methylation status in human genomic sequences.

Authors:  Rajdeep Das; Nevenka Dimitrova; Zhenyu Xuan; Robert A Rollins; Fatemah Haghighi; John R Edwards; Jingyue Ju; Timothy H Bestor; Michael Q Zhang
Journal:  Proc Natl Acad Sci U S A       Date:  2006-07-03       Impact factor: 11.205

5.  Accounting for non-genetic factors by low-rank representation and sparse regression for eQTL mapping.

Authors:  Can Yang; Lin Wang; Shuqin Zhang; Hongyu Zhao
Journal:  Bioinformatics       Date:  2013-02-17       Impact factor: 6.937

6.  High-throughput classification of yeast mutants for functional genomics using metabolic footprinting.

Authors:  Jess Allen; Hazel M Davey; David Broadhurst; Jim K Heald; Jem J Rowland; Stephen G Oliver; Douglas B Kell
Journal:  Nat Biotechnol       Date:  2003-05-12       Impact factor: 54.908

7.  Spatio-temporal transcriptome of the human brain.

Authors:  Hyo Jung Kang; Yuka Imamura Kawasawa; Feng Cheng; Ying Zhu; Xuming Xu; Mingfeng Li; André M M Sousa; Mihovil Pletikos; Kyle A Meyer; Goran Sedmak; Tobias Guennel; Yurae Shin; Matthew B Johnson; Zeljka Krsnik; Simone Mayer; Sofia Fertuzinhos; Sheila Umlauf; Steven N Lisgo; Alexander Vortmeyer; Daniel R Weinberger; Shrikant Mane; Thomas M Hyde; Anita Huttner; Mark Reimers; Joel E Kleinman; Nenad Sestan
Journal:  Nature       Date:  2011-10-26       Impact factor: 49.962

8.  Comparison of D. melanogaster and C. elegans developmental stages, tissues, and cells by modENCODE RNA-seq data.

Authors:  Jingyi Jessica Li; Haiyan Huang; Peter J Bickel; Steven E Brenner
Journal:  Genome Res       Date:  2014-07       Impact factor: 9.043

9.  DEG 5.0, a database of essential genes in both prokaryotes and eukaryotes.

Authors:  Ren Zhang; Yan Lin
Journal:  Nucleic Acids Res       Date:  2008-10-30       Impact factor: 16.971

10.  Transcriptional landscape of the prenatal human brain.

Authors:  Jeremy A Miller; Song-Lin Ding; Susan M Sunkin; Kimberly A Smith; Lydia Ng; Aaron Szafer; Amanda Ebbert; Zackery L Riley; Joshua J Royall; Kaylynn Aiona; James M Arnold; Crissa Bennet; Darren Bertagnolli; Krissy Brouner; Stephanie Butler; Shiella Caldejon; Anita Carey; Christine Cuhaciyan; Rachel A Dalley; Nick Dee; Tim A Dolbeare; Benjamin A C Facer; David Feng; Tim P Fliss; Garrett Gee; Jeff Goldy; Lindsey Gourley; Benjamin W Gregor; Guangyu Gu; Robert E Howard; Jayson M Jochim; Chihchau L Kuan; Christopher Lau; Chang-Kyu Lee; Felix Lee; Tracy A Lemon; Phil Lesnar; Bergen McMurray; Naveed Mastan; Nerick Mosqueda; Theresa Naluai-Cecchini; Nhan-Kiet Ngo; Julie Nyhus; Aaron Oldre; Eric Olson; Jody Parente; Patrick D Parker; Sheana E Parry; Allison Stevens; Mihovil Pletikos; Melissa Reding; Kate Roll; David Sandman; Melaine Sarreal; Sheila Shapouri; Nadiya V Shapovalova; Elaine H Shen; Nathan Sjoquist; Clifford R Slaughterbeck; Michael Smith; Andy J Sodt; Derric Williams; Lilla Zöllei; Bruce Fischl; Mark B Gerstein; Daniel H Geschwind; Ian A Glass; Michael J Hawrylycz; Robert F Hevner; Hao Huang; Allan R Jones; James A Knowles; Pat Levitt; John W Phillips; Nenad Sestan; Paul Wohnoutka; Chinh Dang; Amy Bernard; John G Hohmann; Ed S Lein
Journal:  Nature       Date:  2014-04-02       Impact factor: 49.962

View more
  14 in total

1.  aPCoA: covariate adjusted principal coordinates analysis.

Authors:  Yushu Shi; Liangliang Zhang; Kim-Anh Do; Christine B Peterson; Robert R Jenq
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

2.  Integrative functional genomic analysis of human brain development and neuropsychiatric risks.

Authors:  Mingfeng Li; Gabriel Santpere; Yuka Imamura Kawasawa; Oleg V Evgrafov; Forrest O Gulden; Sirisha Pochareddy; Susan M Sunkin; Zhen Li; Yurae Shin; Ying Zhu; André M M Sousa; Donna M Werling; Robert R Kitchen; Hyo Jung Kang; Mihovil Pletikos; Jinmyung Choi; Sydney Muchnik; Xuming Xu; Daifeng Wang; Belen Lorente-Galdos; Shuang Liu; Paola Giusti-Rodríguez; Hyejung Won; Christiaan A de Leeuw; Antonio F Pardiñas; Ming Hu; Fulai Jin; Yun Li; Michael J Owen; Michael C O'Donovan; James T R Walters; Danielle Posthuma; Mark A Reimers; Pat Levitt; Daniel R Weinberger; Thomas M Hyde; Joel E Kleinman; Daniel H Geschwind; Michael J Hawrylycz; Matthew W State; Stephan J Sanders; Patrick F Sullivan; Mark B Gerstein; Ed S Lein; James A Knowles; Nenad Sestan
Journal:  Science       Date:  2018-12-14       Impact factor: 47.728

3.  Reverse GWAS: Using genetics to identify and model phenotypic subtypes.

Authors:  Andy Dahl; Na Cai; Arthur Ko; Markku Laakso; Päivi Pajukanta; Jonathan Flint; Noah Zaitlen
Journal:  PLoS Genet       Date:  2019-04-05       Impact factor: 5.917

4.  Transcriptomic Landscape of von Economo Neurons in Human Anterior Cingulate Cortex Revealed by Microdissected-Cell RNA Sequencing.

Authors:  Lixin Yang; Yandong Yang; Jiamiao Yuan; Yan Sun; Jiapei Dai; Bing Su
Journal:  Cereb Cortex       Date:  2019-02-01       Impact factor: 5.357

5.  Core transcriptional signatures of phase change in the migratory locust.

Authors:  Pengcheng Yang; Li Hou; Xianhui Wang; Le Kang
Journal:  Protein Cell       Date:  2019-07-10       Impact factor: 14.870

6.  Aging features of the migratory locust at physiological and transcriptional levels.

Authors:  Siyuan Guo; Pengcheng Yang; Bo Liang; Feng Zhou; Li Hou; Le Kang; Xianhui Wang
Journal:  BMC Genomics       Date:  2021-04-10       Impact factor: 3.969

7.  RA3 is a reference-guided approach for epigenetic characterization of single cells.

Authors:  Shengquan Chen; Guanao Yan; Wenyu Zhang; Jinzhao Li; Rui Jiang; Zhixiang Lin
Journal:  Nat Commun       Date:  2021-04-12       Impact factor: 14.919

8.  A general and flexible method for signal extraction from single-cell RNA-seq data.

Authors:  Davide Risso; Fanny Perraudeau; Svetlana Gribkova; Sandrine Dudoit; Jean-Philippe Vert
Journal:  Nat Commun       Date:  2018-01-18       Impact factor: 14.919

9.  Benchmarking principal component analysis for large-scale single-cell RNA-sequencing.

Authors:  Koki Tsuyuzaki; Hiroyuki Sato; Kenta Sato; Itoshi Nikaido
Journal:  Genome Biol       Date:  2020-01-20       Impact factor: 13.583

10.  scMC learns biological variation through the alignment of multiple single-cell genomics datasets.

Authors:  Lihua Zhang; Qing Nie
Journal:  Genome Biol       Date:  2021-01-04       Impact factor: 17.906

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.