Literature DB >> 28115622

Robust and rapid algorithms facilitate large-scale whole genome sequencing downstream analysis in an integrative framework.

Miaoxin Li1,2,3,4, Jiang Li5, Mulin Jun Li2, Zhicheng Pan3, Jacob Shujui Hsu3, Dajiang J Liu6,7, Xiaowei Zhan8, Junwen Wang2,9,10, Youqiang Song2,5, Pak Chung Sham2,3,4.   

Abstract

Whole genome sequencing (WGS) is a promising strategy to unravel variants or genes responsible for human diseases and traits. However, there is a lack of robust platforms for a comprehensive downstream analysis. In the present study, we first proposed three novel algorithms, sequence gap-filled gene feature annotation, bit-block encoded genotypes and sectional fast access to text lines to address three fundamental problems. The three algorithms then formed the infrastructure of a robust parallel computing framework, KGGSeq, for integrating downstream analysis functions for whole genome sequencing data. KGGSeq has been equipped with a comprehensive set of analysis functions for quality control, filtration, annotation, pathogenic prediction and statistical tests. In the tests with whole genome sequencing data from 1000 Genomes Project, KGGSeq annotated several thousand more reliable non-synonymous variants than other widely used tools (e.g. ANNOVAR and SNPEff). It took only around half an hour on a small server with 10 CPUs to access genotypes of ∼60 million variants of 2504 subjects, while a popular alternative tool required around one day. KGGSeq's bit-block genotype format used 1.5% or less space to flexibly represent phased or unphased genotypes with multiple alleles and achieved a speed of over 1000 times faster to calculate genotypic correlation.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2017        PMID: 28115622      PMCID: PMC5435951          DOI: 10.1093/nar/gkx019

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  38 in total

1.  Exome sequencing identifies novel compound heterozygous mutations of IL-10 receptor 1 in neonatal-onset Crohn's disease.

Authors:  H Mao; W Yang; P P W Lee; M H-K Ho; J Yang; S Zeng; C-Y Chong; T-L Lee; W Tu; Y-L Lau
Journal:  Genes Immun       Date:  2012-04-05       Impact factor: 2.676

2.  Whole-genome sequence variation, population structure and demographic history of the Dutch population.

Authors: 
Journal:  Nat Genet       Date:  2014-06-29       Impact factor: 38.330

3.  Predicting regulatory variants with composite statistic.

Authors:  Mulin Jun Li; Zhicheng Pan; Zipeng Liu; Jiexing Wu; Panwen Wang; Yun Zhu; Feng Xu; Zhengyuan Xia; Pak Chung Sham; Jean-Pierre A Kocher; Miaoxin Li; Jun S Liu; Junwen Wang
Journal:  Bioinformatics       Date:  2016-06-06       Impact factor: 6.937

4.  Finishing the euchromatic sequence of the human genome.

Authors: 
Journal:  Nature       Date:  2004-10-21       Impact factor: 49.962

5.  Rainbow: a tool for large-scale whole-genome sequencing data analysis using cloud computing.

Authors:  Shanrong Zhao; Kurt Prenger; Lance Smith; Thomas Messina; Hongtao Fan; Edward Jaeger; Susan Stephens
Journal:  BMC Genomics       Date:  2013-06-27       Impact factor: 3.969

6.  GEMINI: integrative exploration of genetic variation and genome annotations.

Authors:  Umadevi Paila; Brad A Chapman; Rory Kirchner; Aaron R Quinlan
Journal:  PLoS Comput Biol       Date:  2013-07-18       Impact factor: 4.475

7.  RVTESTS: an efficient and comprehensive tool for rare variant association analysis using sequence data.

Authors:  Xiaowei Zhan; Youna Hu; Bingshan Li; Goncalo R Abecasis; Dajiang J Liu
Journal:  Bioinformatics       Date:  2016-02-15       Impact factor: 6.937

8.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

9.  Predicting mendelian disease-causing non-synonymous single nucleotide variants in exome sequencing studies.

Authors:  Miao-Xin Li; Johnny S H Kwan; Su-Ying Bao; Wanling Yang; Shu-Leong Ho; Yong-Qiang Song; Pak C Sham
Journal:  PLoS Genet       Date:  2013-01-17       Impact factor: 5.917

10.  A specific mutation in TBL1XR1 causes Pierpont syndrome.

Authors:  Charlotte A Heinen; Aldo Jongejan; Peter J Watson; Bert Redeker; Anita Boelen; Olga Boudzovitch-Surovtseva; Francesca Forzano; Roel Hordijk; Richard Kelley; Ann H Olney; Mary Ella Pierpont; G Bradley Schaefer; Fiona Stewart; A S Paul van Trotsenburg; Eric Fliers; John W R Schwabe; Raoul C Hennekam
Journal:  J Med Genet       Date:  2016-01-14       Impact factor: 6.318

View more
  19 in total

1.  Actionable secondary findings from whole-genome sequencing of 954 East Asians.

Authors:  Clara Sze-Man Tang; Saloni Dattani; Man-Ting So; Stacey S Cherny; Paul K H Tam; Pak C Sham; Maria-Mercè Garcia-Barcelo
Journal:  Hum Genet       Date:  2017-11-11       Impact factor: 4.132

2.  Deviation from baseline mutation burden provides powerful and robust rare-variants association test for complex diseases.

Authors:  Lin Jiang; Hui Jiang; Sheng Dai; Ying Chen; Youqiang Song; Clara Sze-Man Tang; Shirley Yin-Yu Pang; Shu-Leong Ho; Binbin Wang; Maria-Mercedes Garcia-Barcelo; Paul Kwong-Hang Tam; Stacey S Cherny; Mulin Jun Li; Pak Chung Sham; Miaoxin Li
Journal:  Nucleic Acids Res       Date:  2022-04-08       Impact factor: 16.971

3.  Uncovering the genetic lesions underlying the most severe form of Hirschsprung disease by whole-genome sequencing.

Authors:  Clara Sm Tang; Xuehan Zhuang; Wai-Yee Lam; Elly Sau-Wai Ngan; Jacob Shujui Hsu; Y U Michelle; S O Man-Ting; Stacey S Cherny; Ngoc Diem Ngo; Pak C Sham; Paul Kh Tam; Maria-Mercè Garcia-Barcelo
Journal:  Eur J Hum Genet       Date:  2018-02-26       Impact factor: 4.246

4.  Rare variants in Protein tyrosine phosphatase, receptor type A (PTPRA) in schizophrenia: Evidence from a family based study.

Authors:  Jibin John; Prachi Kukshal; Aditya Sharma; Triptish Bhatia; V L Nimgaonkar; S N Deshpande; B K Thelma
Journal:  Schizophr Res       Date:  2018-12-27       Impact factor: 4.939

5.  Screening Consanguineous Families for Hearing Loss Using the MiamiOtoGenes Panel.

Authors:  Abhiraami Kannan-Sundhari; Denise Yan; Kolsoum Saeidi; Afsaneh Sahebalzamani; Susan H Blanton; Xue Zhong Liu
Journal:  Genet Test Mol Biomarkers       Date:  2020-09-29

6.  cepip: context-dependent epigenomic weighting for prioritization of regulatory variants and disease-associated genes.

Authors:  Mulin Jun Li; Miaoxin Li; Zipeng Liu; Bin Yan; Zhicheng Pan; Dandan Huang; Qian Liang; Dingge Ying; Feng Xu; Hongcheng Yao; Panwen Wang; Jean-Pierre A Kocher; Zhengyuan Xia; Pak Chung Sham; Jun S Liu; Junwen Wang
Journal:  Genome Biol       Date:  2017-03-16       Impact factor: 13.583

7.  The lysosomal disease caused by mutant VPS33A.

Authors:  Elena V Pavlova; Aleksey Shatunov; Lena Wartosch; Alena I Moskvina; Lena E Nikolaeva; Nicholas A Bright; Karen L Tylee; Heather J Church; Andrea Ballabio; J Paul Luzio; Timothy M Cox
Journal:  Hum Mol Genet       Date:  2019-08-01       Impact factor: 6.150

8.  Polygenic disruption of retinoid signalling in schizophrenia and a severe cognitive deficit subtype.

Authors:  William R Reay; Joshua R Atkins; Yann Quidé; Vaughan J Carr; Melissa J Green; Murray J Cairns
Journal:  Mol Psychiatry       Date:  2018-12-07       Impact factor: 15.992

9.  Chronic demodicosis in patients with immune dysregulation: An unexpected infectious manifestation of Signal transducer and activator of transcription (STAT)1 gain-of-function.

Authors:  Oded Shamriz; Atar Lev; Amos J Simon; Ortal Barel; Elisheva Javasky; Sigal Matza-Porges; Adir Shaulov; Zev Davidovics; Ori Toker; Raz Somech; Abraham Zlotogorski; Vered Molho-Pessach; Yuval Tal
Journal:  Clin Exp Immunol       Date:  2021-07-12       Impact factor: 5.732

10.  Knowledge-based analyses reveal new candidate genes associated with risk of hepatitis B virus related hepatocellular carcinoma.

Authors:  Deke Jiang; Jiaen Deng; Changzheng Dong; Xiaopin Ma; Qianyi Xiao; Bin Zhou; Chou Yang; Lin Wei; Carly Conran; S Lilly Zheng; Irene Oi-Lin Ng; Long Yu; Jianfeng Xu; Pak C Sham; Xiaolong Qi; Jinlin Hou; Yuan Ji; Guangwen Cao; Miaoxin Li
Journal:  BMC Cancer       Date:  2020-05-11       Impact factor: 4.430

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.