Mengqin Duan1, Liang Chen1, Qinyu Ge1, Na Lu1, Junji Li1, Xuan Pan2, Yi Qiao1, Jing Tu3, Zuhong Lu4. 1. State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China. 2. Jiangsu Cancer Hospital, Jiangsu Institute of Cancer Research, Nanjing Medical University Affiliated Cancer Hospital, Nanjing 210009, China. 3. State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China. Electronic address: jtu@seu.edu.cn. 4. State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China. Electronic address: zhlu@seu.edu.cn.
Abstract
BACKGROUND: Detecting heteroplasmic variations in the mitochondrial genome can help identify potential pathogenic possibilities, which is significant for disease prevention. The development of next-generation sequencing changed the quantification of mitochondrial DNA (mtDNA) heteroplasmy from scanning limited recorded points to the entire mitochondrial genome. However, due to the presence of nuclear mtDNA homologous sequences (nuMTs), maximally retaining real variations while excluding falsest heteroplasmic variations from nuMTs and sequencing errors presents a dilemma. RESULTS: Herein, we used an improved method for detecting low-frequency mtDNA heteroplasmic variations from whole genome sequencing data, including point variations and short-fragment length alterations, and evaluated the effect of this method. A two-step alignment was designed and performed to accelerate data processing, to obtain and retain the true mtDNA reads and to eliminate most nuMTs reads. After analyzing whole genome sequencing data of K562 and GM12878 cells, ~90% of heteroplasmic point variations were identified in MitoMap. The results were consistent with the results of an amplification refractory mutation system qPCR. Many linkages of the detected heteroplasmy variations were also discovered. CONCLUSIONS: Our improved method is a simple, efficient and accurate way to mine mitochondrial low-frequency heteroplasmic variations from whole genome sequencing data. By evaluating the highest misalignment possibility caused by the remaining nuMTs-like reads and sequencing errors, our procedure can detect mtDNA heteroplasmic variations whose heteroplasmy frequencies are as low as 0.2%.
BACKGROUND: Detecting heteroplasmic variations in the mitochondrial genome can help identify potential pathogenic possibilities, which is significant for disease prevention. The development of next-generation sequencing changed the quantification of mitochondrial DNA (mtDNA) heteroplasmy from scanning limited recorded points to the entire mitochondrial genome. However, due to the presence of nuclear mtDNA homologous sequences (nuMTs), maximally retaining real variations while excluding falsest heteroplasmic variations from nuMTs and sequencing errors presents a dilemma. RESULTS: Herein, we used an improved method for detecting low-frequency mtDNA heteroplasmic variations from whole genome sequencing data, including point variations and short-fragment length alterations, and evaluated the effect of this method. A two-step alignment was designed and performed to accelerate data processing, to obtain and retain the true mtDNA reads and to eliminate most nuMTs reads. After analyzing whole genome sequencing data of K562 and GM12878 cells, ~90% of heteroplasmic point variations were identified in MitoMap. The results were consistent with the results of an amplification refractory mutation system qPCR. Many linkages of the detected heteroplasmy variations were also discovered. CONCLUSIONS: Our improved method is a simple, efficient and accurate way to mine mitochondrial low-frequency heteroplasmic variations from whole genome sequencing data. By evaluating the highest misalignment possibility caused by the remaining nuMTs-like reads and sequencing errors, our procedure can detect mtDNA heteroplasmic variations whose heteroplasmy frequencies are as low as 0.2%.
Authors: Ruoying Chen; Micheala A Aldred; Weiling Xu; Joe Zein; Peter Bazeley; Suzy A A Comhair; Deborah A Meyers; Eugene R Bleecker; Chunyu Liu; Serpil C Erzurum; Bo Hu Journal: Mitochondrion Date: 2021-01-26 Impact factor: 4.160
Authors: Jasmine R Connell; Miles C Benton; Rodney A Lea; Heidi G Sutherland; Larisa M Haupt; Kirsty M Wright; Lyn R Griffiths Journal: J Forensic Sci Date: 2022-07-19 Impact factor: 1.717
Authors: Ieva Keraite; Philipp Becker; Davide Canevazzi; Cristina Frias-López; Marc Dabad; Raúl Tonda-Hernandez; Ida Paramonov; Matthew John Ingham; Isabelle Brun-Heath; Jordi Leno; Anna Abulí; Elena Garcia-Arumí; Simon Charles Heath; Marta Gut; Ivo Glynne Gut Journal: Nat Commun Date: 2022-10-06 Impact factor: 17.694
Authors: Ana R Silverstein; Melanie K Flores; Brendan Miller; Su-Jeong Kim; Kelvin Yen; Hemal H Mehta; Pinchas Cohen Journal: Transl Med Aging Date: 2020-08-21
Authors: Christian R Marshall; Shimul Chowdhury; Ryan J Taft; Mathew S Lebo; Jillian G Buchan; Steven M Harrison; Ross Rowsey; Eric W Klee; Pengfei Liu; Elizabeth A Worthey; Vaidehi Jobanputra; David Dimmock; Hutton M Kearney; David Bick; Shashikant Kulkarni; Stacie L Taylor; John W Belmont; Dimitri J Stavropoulos; Niall J Lennon Journal: NPJ Genom Med Date: 2020-10-23 Impact factor: 8.617