
Genome measures used for quality control are dependent on gene function and ancestry.

Jing Wang, Leon Raskin, David C Samuels, Yu Shyr, Yan Guo.

Abstract

MOTIVATION: The transition/transversion (Ti/Tv) ratio and the heterozygous/nonreference-homozygous (het/nonref-hom) ratio are commonly computed in genetic studies as quality control (QC) measures. These two ratios also help in understanding the patterns of DNA sequence evolution.
RESULTS: To characterize these two genomic measures, we analyzed genotype data released by the 1000 Genomes Project (1000G; N=1092). Two additional datasets (N=581 and N=6) were used to validate the findings from the 1000G dataset. We compared the two ratios across continental ancestries, genome regions and gene functions. We found that the Ti/Tv ratio can be used as a quality indicator for single nucleotide polymorphisms inferred from high-throughput sequencing data. The Ti/Tv ratio varies greatly by genome region and functionality, but not by ancestry. The het/nonref-hom ratio varies greatly by ancestry, but not by genome region or functionality. Furthermore, extreme guanine + cytosine content (either high or low) is negatively associated with the Ti/Tv ratio magnitude. Thus, when performing QC assessment using these two measures, care must be taken to apply the correct thresholds based on ancestry and genome region. Failure to take these considerations into account at the QC stage will bias any following analysis.
CONTACT: yan.guo@vanderbilt.edu
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
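
For readers unfamiliar with the two measures, the following is a minimal sketch of how they might be computed for one sample. It is not the authors' pipeline, which the abstract does not describe. The function names `ti_tv_ratio` and `het_nonref_hom_ratio` are hypothetical, the input is assumed to be biallelic single-base (ref, alt) pairs and VCF-style diploid genotype strings, and any genotype with two different alleles is assumed to count as heterozygous.

```python
# Illustrative only: hypothetical helpers, not taken from the paper.
BASES = set("ACGT")
# Transitions swap purine with purine (A<->G) or pyrimidine with pyrimidine (C<->T).
TRANSITIONS = {("A", "G"), ("G", "A"), ("C", "T"), ("T", "C")}


def ti_tv_ratio(snps):
    """Return transitions / transversions for (ref, alt) single-base pairs.

    Returns None when no transversion is observed (ratio undefined).
    """
    ti = tv = 0
    for ref, alt in snps:
        ref, alt = ref.upper(), alt.upper()
        if ref not in BASES or alt not in BASES or ref == alt:
            continue  # skip indels, ambiguous bases, and non-variants
        if (ref, alt) in TRANSITIONS:
            ti += 1
        else:
            tv += 1
    return ti / tv if tv else None


def het_nonref_hom_ratio(genotypes):
    """Return het / nonref-hom counts for diploid GT strings such as '0/1' or '1|1'.

    Assumption: any call with two different alleles counts as heterozygous.
    Returns None when no nonreference-homozygous call is observed.
    """
    het = hom_alt = 0
    for gt in genotypes:
        alleles = gt.replace("|", "/").split("/")
        if len(alleles) != 2 or "." in alleles:
            continue  # skip missing or non-diploid calls
        a, b = alleles
        if a != b:
            het += 1
        elif a != "0":
            hom_alt += 1  # homozygous for a non-reference allele
    return het / hom_alt if hom_alt else None


if __name__ == "__main__":
    snps = [("A", "G"), ("C", "T"), ("A", "C"), ("G", "A")]
    genotypes = ["0/1", "1|1", "0/0", "0|1", "./.", "1/1"]
    print(ti_tv_ratio(snps))                # 3 transitions, 1 transversion
    print(het_nonref_hom_ratio(genotypes))  # 2 het, 2 nonref-hom
```

Because the abstract reports that Ti/Tv depends on genome region and het/nonref-hom on ancestry, such ratios would in practice be computed per region and per ancestry group rather than compared against a single global threshold.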

Year:  2014        PMID: 25297068      PMCID: PMC4308666          DOI: 10.1093/bioinformatics/btu668

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


