Literature DB >> 35591888

A bioinformatics pipeline for estimating mitochondrial DNA copy number and heteroplasmy levels from whole genome sequencing data.

Stephanie L Battle1, Daniela Puiu2, Joost Verlouw3, Linda Broer3, Eric Boerwinkle4, Kent D Taylor5, Jerome I Rotter5, Stephan S Rich6, Megan L Grove4, Nathan Pankratz7, Jessica L Fetterman8, Chunyu Liu9, Dan E Arking1.   

Abstract

Mitochondrial diseases are a heterogeneous group of disorders that can be caused by mutations in the nuclear or mitochondrial genome. Mitochondrial DNA (mtDNA) variants may exist in a state of heteroplasmy, where a percentage of DNA molecules harbor a variant, or homoplasmy, where all DNA molecules have the same variant. The relative quantity of mtDNA in a cell, or copy number (mtDNA-CN), is associated with mitochondrial function, human disease, and mortality. To facilitate accurate identification of heteroplasmy and quantify mtDNA-CN, we built a bioinformatics pipeline that takes whole genome sequencing data and outputs mitochondrial variants, and mtDNA-CN. We incorporate variant annotations to facilitate determination of variant significance. Our pipeline yields uniform coverage by remapping to a circularized chrM and by recovering reads falsely mapped to nuclear-encoded mitochondrial sequences. Notably, we construct a consensus chrM sequence for each sample and recall heteroplasmy against the sample's unique mitochondrial genome. We observe an approximately 3-fold increased association with age for heteroplasmic variants in non-homopolymer regions and, are better able to capture genetic variation in the D-loop of chrM compared to existing software. Our bioinformatics pipeline more accurately captures features of mitochondrial genetics than existing pipelines that are important in understanding how mitochondrial dysfunction contributes to disease.
© The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

Entities:  

Year:  2022        PMID: 35591888      PMCID: PMC9112767          DOI: 10.1093/nargab/lqac034

Source DB:  PubMed          Journal:  NAR Genom Bioinform        ISSN: 2631-9268


  44 in total

1.  dbSNP: the NCBI database of genetic variation.

Authors:  S T Sherry; M H Ward; M Kholodov; J Baker; L Phan; E M Smigielski; K Sirotkin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

Review 2.  Leigh syndrome: One disorder, more than 75 monogenic causes.

Authors:  Nicole J Lake; Alison G Compton; Shamima Rahman; David R Thorburn
Journal:  Ann Neurol       Date:  2015-12-15       Impact factor: 10.422

3.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

4.  Impact of the sequencing method on the detection and interpretation of mitochondrial DNA length heteroplasmy.

Authors:  Kimberly Sturk-Andreaggi; Walther Parson; Marie Allen; Charla Marshall
Journal:  Forensic Sci Int Genet       Date:  2019-11-10       Impact factor: 4.882

5.  Assessing Mitochondrial DNA Variation and Copy Number in Lymphocytes of ~2,000 Sardinians Using Tailored Sequencing Analysis Tools.

Authors:  Jun Ding; Carlo Sidore; Thomas J Butler; Mary Kate Wing; Yong Qian; Osorio Meirelles; Fabio Busonero; Lam C Tsoi; Andrea Maschio; Andrea Angius; Hyun Min Kang; Ramaiah Nagaraja; Francesco Cucca; Gonçalo R Abecasis; David Schlessinger
Journal:  PLoS Genet       Date:  2015-07-14       Impact factor: 5.917

6.  Prevalence of nuclear and mitochondrial DNA mutations related to adult mitochondrial disease.

Authors:  Gráinne S Gorman; Andrew M Schaefer; Yi Ng; Nicholas Gomez; Emma L Blakely; Charlotte L Alston; Catherine Feeney; Rita Horvath; Patrick Yu-Wai-Man; Patrick F Chinnery; Robert W Taylor; Douglass M Turnbull; Robert McFarland
Journal:  Ann Neurol       Date:  2015-03-28       Impact factor: 10.422

7.  Twelve years of SAMtools and BCFtools.

Authors:  Petr Danecek; James K Bonfield; Jennifer Liddle; John Marshall; Valeriu Ohan; Martin O Pollard; Andrew Whitwham; Thomas Keane; Shane A McCarthy; Robert M Davies; Heng Li
Journal:  Gigascience       Date:  2021-02-16       Impact factor: 6.524

8.  MitoScape: A big-data, machine-learning platform for obtaining mitochondrial DNA from next-generation sequencing data.

Authors:  Larry N Singh; Brian Ennis; Bryn Loneragan; Noah L Tsao; M Isabel G Lopez Sanchez; Jianping Li; Patrick Acheampong; Oanh Tran; Ian A Trounce; Yuankun Zhu; Prasanth Potluri; Beverly S Emanuel; Daniel J Rader; Zoltan Arany; Scott M Damrauer; Adam C Resnick; Stewart A Anderson; Douglas C Wallace
Journal:  PLoS Comput Biol       Date:  2021-11-11       Impact factor: 4.475

9.  Correlates of Peripheral Blood Mitochondrial DNA Content in a General Population.

Authors:  Judita Knez; Ellen Winckelmans; Michelle Plusquin; Lutgarde Thijs; Nicholas Cauwenberghs; Yumei Gu; Jan A Staessen; Tim S Nawrot; Tatiana Kuznetsova
Journal:  Am J Epidemiol       Date:  2015-12-24       Impact factor: 4.897

10.  MUMmer4: A fast and versatile genome alignment system.

Authors:  Guillaume Marçais; Arthur L Delcher; Adam M Phillippy; Rachel Coston; Steven L Salzberg; Aleksey Zimin
Journal:  PLoS Comput Biol       Date:  2018-01-26       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.