Literature DB >> 33680433

CompoundHetVIP: Compound Heterozygous Variant Identification Pipeline.

Dustin B Miller1, Stephen R Piccolo1.   

Abstract

Compound Heterozygous ( CH) variant identification requires distinguishing maternally from paternally derived nucleotides, a process that requires numerous computational tools. Using such tools often introduces unforeseen challenges such as installation procedures that are operating-system specific, software dependencies that must be installed, and formatting requirements for input files. To overcome these challenges, we developed Compound Heterozygous Variant Identification Pipeline (CompoundHetVIP), which uses a single Docker image to encapsulate commonly used software tools for file aggregation ( BCFtools or GATK4), VCF liftover ( Picard Tools), joint-genotyping ( GATK4), file conversion ( Plink2), phasing ( SHAPEIT2, Beagle, and/or Eagle2), variant normalization ( vt tools), annotation ( SnpEff), relational database generation ( GEMINI), and identification of CH, homozygous alternate, and de novo variants in a series of 13 steps. To begin using our tool, researchers need only install the Docker engine and download the CompoundHetVIP Docker image. The tools provided in CompoundHetVIP, subject to the limitations of the underlying software, can be applied to whole-genome, whole-exome, or targeted exome sequencing data of individual samples or trios (a child and both parents), using VCF or gVCF files as initial input. Each step of the pipeline produces an analysis-ready output file that can be further evaluated. To illustrate its use, we applied CompoundHetVIP to data from a publicly available Ashkenazim trio and identified two genes with a candidate CH variant and two genes with a candidate homozygous alternate variant after filtering based on user-set thresholds for global minor allele frequency, Combined Annotation Dependent Depletion, and Gene Damage Index. While this example uses genomic data from a healthy child, we anticipate that most researchers will use CompoundHetVIP to uncover missing heritability in human diseases and other phenotypes. CompoundHetVIP is open-source software and can be found at https://github.com/dmiller903/CompoundHetVIP; this repository also provides detailed, step-by-step examples. Copyright:
© 2021 Miller DB and Piccolo SR.

Entities:  

Keywords:  Genetics; compound heterozygous; genome analysis; phasing; reproducibility; trio

Mesh:

Year:  2020        PMID: 33680433      PMCID: PMC7905494          DOI: 10.12688/f1000research.26848.2

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


  28 in total

1.  Filamin B Loss-of-Function Mutation in Dimerization Domain Causes Autosomal-Recessive Spondylocarpotarsal Synostosis Syndrome with Rib Anomalies.

Authors:  Chi-Fan Yang; Chung-Hsing Wang; Weng Siong H'ng; Chun-Ping Chang; Wei-De Lin; Yuan-Tsong Chen; Jer-Yuarn Wu; Fuu-Jen Tsai
Journal:  Hum Mutat       Date:  2017-02-27       Impact factor: 4.878

2.  Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2007-09-21       Impact factor: 11.025

3.  A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2011-09-08       Impact factor: 6.937

4.  Novel compound heterozygous mutations in a child with Ataxia-Telangiectasia showing unrelated cerebellar disorders.

Authors:  Maria Piane; Anna Molinaro; Annarosa Soresina; Silvia Costa; Marianna Maffeis; Aldo Germani; Lorenzo Pinelli; Roberta Meschini; Alessandro Plebani; Luciana Chessa; Roberto Micheli
Journal:  J Neurol Sci       Date:  2016-10-13       Impact factor: 3.181

5.  The Transcription Factor Tox2 Drives T Follicular Helper Cell Development via Regulating Chromatin Accessibility.

Authors:  Wei Xu; Xiaohong Zhao; Xiaoshuang Wang; Han Feng; Mengting Gou; Wei Jin; Xiaohu Wang; Xindong Liu; Chen Dong
Journal:  Immunity       Date:  2019-11-12       Impact factor: 31.745

6.  The variant call format and VCFtools.

Authors:  Petr Danecek; Adam Auton; Goncalo Abecasis; Cornelis A Albers; Eric Banks; Mark A DePristo; Robert E Handsaker; Gerton Lunter; Gabor T Marth; Stephen T Sherry; Gilean McVean; Richard Durbin
Journal:  Bioinformatics       Date:  2011-06-07       Impact factor: 6.937

7.  Second-generation PLINK: rising to the challenge of larger and richer datasets.

Authors:  Christopher C Chang; Carson C Chow; Laurent Cam Tellier; Shashaank Vattikuti; Shaun M Purcell; James J Lee
Journal:  Gigascience       Date:  2015-02-25       Impact factor: 6.524

8.  HapCUT2: robust and accurate haplotype assembly for diverse sequencing technologies.

Authors:  Peter Edge; Vineet Bafna; Vikas Bansal
Journal:  Genome Res       Date:  2016-12-09       Impact factor: 9.043

9.  An open resource for accurately benchmarking small variant and reference calls.

Authors:  Justin M Zook; Jennifer McDaniel; Nathan D Olson; Justin Wagner; Hemang Parikh; Haynes Heaton; Sean A Irvine; Len Trigg; Rebecca Truty; Cory Y McLean; Francisco M De La Vega; Chunlin Xiao; Stephen Sherry; Marc Salit
Journal:  Nat Biotechnol       Date:  2019-04-01       Impact factor: 54.908

10.  The Ensembl Variant Effect Predictor.

Authors:  William McLaren; Laurent Gil; Sarah E Hunt; Harpreet Singh Riat; Graham R S Ritchie; Anja Thormann; Paul Flicek; Fiona Cunningham
Journal:  Genome Biol       Date:  2016-06-06       Impact factor: 13.583

View more
  1 in total

1.  A Survey of Compound Heterozygous Variants in Pediatric Cancers and Structural Birth Defects.

Authors:  Dustin B Miller; Stephen R Piccolo
Journal:  Front Genet       Date:  2021-03-22       Impact factor: 4.599

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.