Literature DB >> 27084951

HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing.

Hansi Weissensteiner1, Dominic Pacher2, Anita Kloss-Brandstätter2, Lukas Forer2, Günther Specht3, Hans-Jürgen Bandelt4, Florian Kronenberg2, Antonio Salas5, Sebastian Schönherr6.   

Abstract

Mitochondrial DNA (mtDNA) profiles can be classified into phylogenetic clusters (haplogroups), which is of great relevance for evolutionary, forensic and medical genetics. With the extensive growth of the underlying phylogenetic tree summarizing the published mtDNA sequences, the manual process of haplogroup classification would be too time-consuming. The previously published classification tool HaploGrep provided an automatic way to address this issue. Here, we present the completely updated version HaploGrep 2 offering several advanced features, including a generic rule-based system for immediate quality control (QC). This allows detecting artificial recombinants and missing variants as well as annotating rare and phantom mutations. Furthermore, the handling of high-throughput data in form of VCF files is now directly supported. For data output, several graphical reports are generated in real time, such as a multiple sequence alignment format, a VCF format and extended haplogroup QC reports, all viewable directly within the application. In addition, HaploGrep 2 generates a publication-ready phylogenetic tree of all input samples encoded relative to the revised Cambridge Reference Sequence. Finally, new distance measures and optimizations of the algorithm increase accuracy and speed-up the application. HaploGrep 2 can be accessed freely and without any registration at http://haplogrep.uibk.ac.at.
© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27084951      PMCID: PMC4987869          DOI: 10.1093/nar/gkw233

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Mitochondrial DNA (mtDNA) haplogroup classification has been mandatory in the area of phylogenetic and forensic genetics, and it is now also increasingly applied in medical genetics. For haplogroup classification, HaploGrep (1) provides a fully automated way to determine haplogroups by traversing the underlying phylogenetic tree Phylotree (2). Although HaploGrep has been widely accepted and used by a continuously growing group of researchers, important features to support high quality mtDNA research projects with a profound quality control (QC) (3,4), standardized formats or additional features for data analysis were lacking in the first version. While several new haplogroup classification tools were published subsequently (5–9), addressing issues with the reference sequence (10), QC was widely neglected. QC was already fundamental for Sanger based sequencing, and Next Generation Sequencing (NGS) requires even more attention (11–14). Since the availability of NGS devices have increased significantly during the last years, the processing costs of mtDNA in the laboratory decreased. This resulted in an explosion of newly generated data. Furthermore, the number of future mtDNA NGS sequencing studies is expected to grow continuously. Consortia like the ‘Centers for Common Disease Genomics’ (CCDG) are expected to sequence 150,000 to 200,000 whole genomes within the next four years (http://www.genome.gov/27563453). The 1000 Genomes Project (15) already contributed over 2,500 sequences to the expansion of the phylogenetic knowledge, showing high inter- and intra-population haplogroup diversity (16). Data from such projects highlight also the need for direct data import avoiding manual conversion steps. So far, users had to use mitoSAVE (17) in combination with Microsoft Excel to convert VCF files (18) into HaploGrep's initial input format hsd. Furthermore, the underlying algorithm requires adaptions to account for large datasets containing thousands of samples (19,20), and to account for the growing sequence archive within GenBank, the major source for the underlying phylogenetic tree. While the first release of Phylotree incorporated in HaploGrep was version 10 with 2,192 haplogroups, the current Phylotree version 17 (16) comprises 5,437 haplogroups, refining the human mtDNA tree even further. To detect differences in haplogroup classification results conducted on older or newer versions of Phylotree as well as to reproduce their results, multi-version support of Phylotree is required. Several well established tools for further mtDNA data analysis such as ARLEQUIN (21), PAUP* (22) or MRBAYES (23) exist, but can currently not deal with the provided export format of HaploGrep. There is a need to support additional output formats in order to eliminate error-prone and time-intensive conversions. Phylogenetic research strives for improving the current knowledge of the worldwide mitochondrial phylogenetic tree by exhibiting important new haplogroups, which is usually accompanied by a tree diagram combining old and new groups. Phylogenetic trees, displaying the relation between a large number of haplotypes, cannot be constructed by hand conveniently, and the manual process is prone to errors. Direct support of generating phylogenetic trees will greatly help to standardize the detection of novel haplogroups. To address the aforementioned shortcomings, we developed HaploGrep 2. It includes improvements to the current functionality, new input formats and several quality checks. The modular architecture allows us to adapt it to future needs by simple adding rules and new components without altering the core of the HaploGrep classification algorithm.

MATERIALS AND METHODS

HaploGrep 2 is a web application that communicates through a REST API with the web server. Thus, all computation intensive tasks are executed directly on the server. The haplogroup classification itself is based on pre-calculated phylogenetic weights that correspond to the occurrence per position in Phylotree and reflecting the mutational stability of a variant. In the updated classification algorithm, the weights are now scaled from 1 to 10 in a non-linear way (see Supplementary Table S1). Thus, the rare occurrences of variants in Phylotree will no longer influence the classification toward those haplogroups as much as in the previous version. Once the data is imported, the haplogroup classification is started automatically. Optimizations within the code led to a 20-fold speed-up compared to HaploGrep 1. By storing only the 50 highest ranked haplogroups per sample the memory consumption could be reduced significantly. Furthermore, new dissimilarity metrics for the mtDNA haplogroup classification were introduced. In addition to the already implemented Kulczynski distance (1), the Jaccard index, the Hamming distance and the Kimura 2-parameter distance were included (24) (see Supplementary Table S2 and 3 for performance comparison). Further major improvements included a check for artificial recombination (25) and a check for systematic artefacts and for rare or potential phantom mutations (26). For detecting artificial recombination, we apply two different strategies: the first strategy, proposed by Kong et al. (27), counts the remaining variants that were not assigned to the resulting best haplogroup, and tests whether these variants could be assigned to another haplogroup. For this step, mutational hotspots are excluded (e.g. 315.1C or 16519). The second recombination strategy assumes prior knowledge about the specific placement of the fragments of the polymerase chain reaction products (amplicons). With this information in hand, a check comparing the profiles relative to the fragment ranges can be executed. The user-defined fragments are generated, and the profiles split accordingly. If the distance of both haplogroup fragments exceeds five phylogenetic nodes, the sample is listed as potentially contaminated.

RESULTS

The new HaploGrep 2 web server can be accessed freely without registration at http://haplogrep.uibk.ac.at and works with all current browser versions. In order to simplify the integration within external workflows and pipelines, we further provide a stand-alone version, a command line version, and a Rest API of the web server. This allows simplified integration within external workflows and pipelines, as e.g. already used within the mtDNA-Server (28) (http://mtdna-server.uibk.ac.at) or the MitoMaster project (29,30). Details and further information can be found on the project websites.

Input

HaploGrep 2 supports the direct import of VCF files (18), one of the current standard formats for genetic data. This new import format is implemented by using HTSJDK, a Java API for high-throughput sequencing data formats (http://samtools.github.io/htsjdk/). For validation purposes, we used samples from the 1000 Genomes Project (1000G) Phase 1 and Phase 3. Through VCF support, output files from NGS software like the IonTorrent Suite can now be handled directly within HaploGrep 2. A VCF upload can also be performed by using the new Rest API, resulting in a JSON-formatted string including the haplogroup status and the quality score for each sample.

Quality control using a rule-based engine

The new rule-based engine performs several QCs: (i) check amount of remaining variants, that were not assigned to the resulting best haplogroup, (ii) amount of variants not used for the best ranked haplogroup per sample, (iii) differences between provided and estimated haplogroup, (iv) quality scores for haplogroup assignment, (v) ambiguous haplogroup classification, (vi) check for heteroplasmic variants, which is of major interest in mtDNA NGS studies, (vii) check for ‘N’ nucleobases, indicating sequencing or genotyping problems and (viii) reference sequence problems, triggered by profiles showing explicit variants identical to the rCRS (31) characteristics (e.g. 263A, 1438A, 8860A, etc.). For example, with the help of rule (ii) we could identify problems within the provided 1000G Phase 3 mtDNA data, where variants at prominent positions (e.g. 152, 195, 263) were missing, indicating potential problems when writing adjacent variants to the VCF file. Rule (i) and (viii) helped identifying issues with the reported H2 haplogroup in (32) which was in fact a C4a1a1 haplotype, after remapping the positions to the rCRS. The Genome Reference Consortium initially provided a Yoruba reference sequence (before using the rCRS in GRCh37), which was also a source of errors and gets checked with rule (viii) now. All quality checks generate errors (red) and warnings (yellow) by direct representation within a new ‘Errors & Warnings’ tab. This system allows us to react on new demands by adding new rules to the current rule engine (e.g., nomenclature checks).

Artificial recombination and phantom mutations

The artificial recombinants from forensic databases listed in Bandelt et al. (33), which were not recognized accurately in the initial version of HaploGrep, have now been reassessed within HaploGrep 2. The first check for remaining variants (rule (i)) already flagged six out of the seven cases. This rule check is executed automatically when determining new haplogroups and is included in the rule-engine tab. The range-based recombination strategy (Button ‘Check for Recombination’) successfully found all recombined samples. Therefore HaploGrep 2 marks all cases as potentially recombined. For rare mutations and potential phantom mutations (26,34) (Button ‘Check for Phantom Mutation’), HaploGrep 2 takes phylogenetic knowledge into account. It incorporates all variants with frequency scores from Soares et al. (35), based on 2,196 complete mitochondrial genomes. All remaining variants in a sample are annotated according to this list. If (i) a variant occurs with a frequency less than three times and (ii) at least two samples share this variant, then it is listed as a rare mutation in the report. The reason for this threshold is that known phantom mutations with score 2 are in fact, also present within the rare mutation list. Therefore this filter allows the identification of potential phantom mutations but also mapping/alignment problems, which will become more prominent when applying mapping algorithms regardless of the phylogeny. Also problems with the correct mtDNA nomenclature are represented in this report (cf. the analysis of 1000G Phase 3 data, where 832 of 2,504 samples showed 3107C whereas 13 showed 3109d, likely triggered by the coding 3107N of the void position 3107 in the rCRS).

Distance concordance check

Besides the Kulczynski distance used for haplogroup estimation in the first HaploGrep version, we added three new dissimilarity indices: Hamming distance, Jaccard index as well as the Kimura 2-parameter (Kimura2P) distance based on transition and transversion rates. For validation purposes we applied all four distances to the 1000G Phase 1 data (n = 1,074) as well as to the dataset provided by Li et al. (19), including 2,000 exome sequencing data from a Danish cohort. Table 1 presents the summary of the distance concordance check. While the runtime of the first three distances was almost identical (4.6–4.7 s), the Kimura2P distance showed a 33-fold higher wall-time (158.1 s). The ratio was similar for the Li data (6.4 s for Kulczynksi versus 210 s for Kimura2P). With different results in 21 out of the 1074 samples (1.96%) in the 1000G Phase 1 data, 153 out of 2534 (6.11%) in the 1000G Phase 3 data and 98 (4.9%) out of 2000 in the exome sequencing samples, the user receives additional information about data quality and can check suspect samples. Employing the distance concordance check, coverage problems in the exome data and problems with the VCF file in the 1000G Phase 3 data become visible (see Table 1 and Supplementary Tables S2–4). We therefore provide this combined haplogroup estimation mode (Button ‘Haplogroup Discordance Check’), which lists all samples where at least one metric results in a different result.
Table 1.

HaploGrep 2 runtime and concordance over different metrics

NGS mtDNA datasetSample sizeFull concordance over all metricsHaploGrep 2 Runtime (including QC)
1000G Phase 11,07498.0%5.7 s
Li et al.2,00095.1%7.7 s
1000G Phase 32,50493.9%13.0 s

Full concordance means that all samples are classified into the same haplogroup for the different metrics. The HaploGrep 2 runtime refers to the calculation of the Kulczynski distance including all quality checks from the rule-based engine. The detailed results are provided in Supplementary Tables S2, 3 and 4.

Full concordance means that all samples are classified into the same haplogroup for the different metrics. The HaploGrep 2 runtime refers to the calculation of the Kulczynski distance including all quality checks from the rule-based engine. The detailed results are provided in Supplementary Tables S2, 3 and 4.

Output formats

HaploGrep 2 displays the classification results and all details directly in the browser. Additionally, it provides several new browser- and file-based outputs: Haplogroups Report: summarizes the graphical output, comprising the quality score, remaining polymorphisms, polymorphisms not found in the top ranked haplogroup, the corresponding amino acid changes (36) and the input profile. Multiple Alignment Format: based on the rCRS, a multiple sequence alignment format is generated and can be directly viewed in the browser by using BioJS (37). With the help of this export, the freely available PGDSpider (38) can convert the result into a variety of population genetics formats (e.g. ARLEQUIN (21), MEGA (39), MIGRATE (40), PHYLIP (41) directly or the NEXUS format (42) for MRBAYES (23) or PAUP*(22). VCF: a column-based representation of all samples. For the graphical representation the metadata in the VCF header is skipped, and only the data lines are presented. The complete VCF file can be downloaded for further analysis such as FST computation, linkage disequilibrium (LD), or Principal Component Analysis (PCA), by applying VCFtools (18) directly or to generate PED and MAP files for further analysis with the toolset PLINK (43). FASTA: each sample is exported as sequence entry in one summarizing fasta file, excluding the alignment information.

Phylogenetic tree visualization

For many researchers a graphical representation of all haplogroups is the most attractive feature, as it shows how new haplogroups fit into the existing Phylotree. This is performed automatically by HaploGrep 2, while all classified samples are combined to a resulting (rooted) tree including all related polymorphisms relative to the rCRS. To customize the output, the user can choose whether hot spot mutations should be taken into account and which export format should be used (see Figure 1 for an example). The tree can be opened directly in the browser or can be downloaded either as pdf, svg or png file. The downloaded pdf and svg file can be further adapted by the user with vector graphic tools like Inkscape (freely available at https://inkscape.org/) or Corel Draw. This output format is implemented by using the Apache Batik SVG Toolkit (https://xmlgraphics.apache.org/batik/).
Figure 1.

Excerpt of the 1000G Phase 1 data generated with the new provided ‘Graphical Phylogenetic Tree’. Polymorphisms in the tips of the phylogeny are candidates for new haplogroups, see for instance the samples belonging to haplogroup D4j15, (confirmed to be related (ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/working/20130606_sample_info/20130606_sample_info.xlsx)) or samples HG00699 and HG00421 (not related). Polymorphisms marked in red are not occurring in Phylotree and may require additional attention, whereas mutations in blue are private polymorphisms for this group, already known by Phylotree. The annotation of amino acid changes and mutational hotspots (green) can be defined by the user, thereby hotspots at positions 16182, 16183 and 16519, AC insertion and deletions at 515–524, inserts at 16193 as well as variation around position 310 and point heteroplasmies can be excluded for the phylogenetic reconstruction.

Excerpt of the 1000G Phase 1 data generated with the new provided ‘Graphical Phylogenetic Tree’. Polymorphisms in the tips of the phylogeny are candidates for new haplogroups, see for instance the samples belonging to haplogroup D4j15, (confirmed to be related (ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/working/20130606_sample_info/20130606_sample_info.xlsx)) or samples HG00699 and HG00421 (not related). Polymorphisms marked in red are not occurring in Phylotree and may require additional attention, whereas mutations in blue are private polymorphisms for this group, already known by Phylotree. The annotation of amino acid changes and mutational hotspots (green) can be defined by the user, thereby hotspots at positions 16182, 16183 and 16519, AC insertion and deletions at 515–524, inserts at 16193 as well as variation around position 310 and point heteroplasmies can be excluded for the phylogenetic reconstruction.

Rest API

Many users are utilizing the command line version of HaploGrep, which is often integrated in workflows. The provision of a Rest API can simplify this, by providing the latest version of HaploGrep 2 to end users. We added a new web resource (haplogrep-ws) to HaploGrep 2 in which VCF files can be uploaded, and the results formatted as a JSON string are exported (see http://haplogrep.uibk.ac.at/blog for an example).

Support of multiple Phylotree versions

Phylotree gets updated constantly. To examine whether an updated version of Phylotree affects previously determined haplogroups, users are now able to select the Phylotree version manually. After changing the version of Phylotree within HaploGrep 2, samples are re-classified automatically and haplogroup updates are listed in brackets, thereby also triggering the rule-based engine and emitting a warning if a sample is classified as belonging to different haplogroups.

DISCUSSION

Several tools similar to HaploGrep have been published since its first release in 2011 (5–9). While haplogroup are estimated by most of these tools with similar performance (33), only a few tools allow for adequate QC. With growing phylogenetic knowledge, mitochondrial haplogroups increasingly gain importance in investigating the correctness of mitochondrial sequences or genotypes, making phylogenetic inference indispensable. Since HaploGrep 2 is based on Phylotree, results are highly dependent on these underlying data, and the results should not be accepted blindly. The growing sample sizes in studies require higher speed while simultaneously maintaining accuracy. We therefore implemented new distance metrics, updated the algorithm, and developed a rule-based engine for additional QC. All these new features can help researchers avoiding inadvertent interpretations of putative hits before publication. HeploGrep 2 will be frequently updated to meet new requirements and demands.
  39 in total

1.  Artificial recombination in forensic mtDNA population databases.

Authors:  H J Bandelt; A Salas; S Lutz-Bonengel
Journal:  Int J Legal Med       Date:  2004-10       Impact factor: 2.686

2.  PGDSpider: an automated data conversion tool for connecting population genetics and genomics programs.

Authors:  H E L Lischer; L Excoffier
Journal:  Bioinformatics       Date:  2011-11-21       Impact factor: 6.937

3.  Phantom mutation hotspots in human mitochondrial DNA.

Authors:  Anita Brandstätter; Timo Sänger; Sabine Lutz-Bonengel; Walther Parson; Eliane Béraud-Colomb; Bo Wen; Qing-Peng Kong; Claudio M Bravi; Hans-Jürgen Bandelt
Journal:  Electrophoresis       Date:  2005-09       Impact factor: 3.535

4.  mtDNA data mining in GenBank needs surveying.

Authors:  Yong-Gang Yao; Antonio Salas; Ian Logan; Hans-Jürgen Bandelt
Journal:  Am J Hum Genet       Date:  2009-12       Impact factor: 11.025

5.  Unified framework to evaluate panmixia and migration direction among multiple sampling locations.

Authors:  Peter Beerli; Michal Palczewski
Journal:  Genetics       Date:  2010-02-22       Impact factor: 4.562

6.  Heteroplasmic substitutions in the entire mitochondrial genomes of human colon cells detected by ultra-deep 454 sequencing.

Authors:  Katarzyna Skonieczna; Boris Malyarchuk; Arkadiusz Jawień; Andrzej Marszałek; Zbigniew Banaszkiewicz; Paweł Jarmocik; Marcelina Borcz; Piotr Bała; Tomasz Grzybowski
Journal:  Forensic Sci Int Genet       Date:  2014-10-31       Impact factor: 4.882

7.  mtDNA Variation and Analysis Using Mitomap and Mitomaster.

Authors:  Marie T Lott; Jeremy N Leipzig; Olga Derbeneva; H Michael Xie; Dimitra Chalkia; Mahdi Sarmady; Vincent Procaccio; Douglas C Wallace
Journal:  Curr Protoc Bioinformatics       Date:  2013-12

8.  Variation and association to diabetes in 2000 full mtDNA sequences mined from an exome study in a Danish population.

Authors:  Shengting Li; Soren Besenbacher; Yingrui Li; Karsten Kristiansen; Niels Grarup; Anders Albrechtsen; Thomas Sparsø; Thorfinn Korneliussen; Torben Hansen; Jun Wang; Rasmus Nielsen; Oluf Pedersen; Lars Bolund; Mikkel H Schierup
Journal:  Eur J Hum Genet       Date:  2014-01-22       Impact factor: 4.246

9.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

10.  Concept for estimating mitochondrial DNA haplogroups using a maximum likelihood approach (EMMA).

Authors:  Alexander W Röck; Arne Dür; Mannis van Oven; Walther Parson
Journal:  Forensic Sci Int Genet       Date:  2013-08-12       Impact factor: 4.882

View more
  254 in total

1.  Genomic landscape of human diversity across Madagascar.

Authors:  Denis Pierron; Margit Heiske; Harilanto Razafindrazaka; Ignace Rakoto; Nelly Rabetokotany; Bodo Ravololomanga; Lucien M-A Rakotozafy; Mireille Mialy Rakotomalala; Michel Razafiarivony; Bako Rasoarifetra; Miakabola Andriamampianina Raharijesy; Lolona Razafindralambo; Fulgence Fanony; Sendra Lejamble; Olivier Thomas; Ahmed Mohamed Abdallah; Christophe Rocher; Amal Arachiche; Laure Tonaso; Veronica Pereda-Loth; Stéphanie Schiavinato; Nicolas Brucato; Francois-Xavier Ricaut; Pradiptajati Kusuma; Herawati Sudoyo; Shengyu Ni; Anne Boland; Jean-Francois Deleuze; Philippe Beaujard; Philippe Grange; Sander Adelaar; Mark Stoneking; Jean-Aimé Rakotoarisoa; Chantal Radimilahy; Thierry Letellier
Journal:  Proc Natl Acad Sci U S A       Date:  2017-07-17       Impact factor: 11.205

2.  Analysis of biogeographic ancestry reveals complex genetic histories for indigenous communities of St. Vincent and Trinidad.

Authors:  Jada Benn Torres; Victoria Martucci; Melinda C Aldrich; Miguel G Vilar; Taryn MacKinney; Muhammad Tariq; Jill B Gaieski; Ricardo Bharath Hernandez; Zoila E Browne; Marlon Stevenson; Wendell Walters; Theodore G Schurr
Journal:  Am J Phys Anthropol       Date:  2019-05-24       Impact factor: 2.868

3.  Mitochondrial DNA control region variation in an Iraqi population sample.

Authors:  Suhair M Jabbar; Nihad A M Al-Rashedi
Journal:  Int J Legal Med       Date:  2020-11-05       Impact factor: 2.686

4.  Mitochondrial genomes uncover the maternal history of the Pamir populations.

Authors:  Min-Sheng Peng; Weifang Xu; Jiao-Jiao Song; Xing Chen; Xierzhatijiang Sulaiman; Liuhong Cai; He-Qun Liu; Shi-Fang Wu; Yun Gao; Najmudinov Tojiddin Abdulloevich; Manilova Elena Afanasevna; Khudoidodov Behruz Ibrohimovich; Xi Chen; Wei-Kang Yang; Miao Wu; Gui-Mei Li; Xing-Yan Yang; Allah Rakha; Yong-Gang Yao; Halmurat Upur; Ya-Ping Zhang
Journal:  Eur J Hum Genet       Date:  2017-11-29       Impact factor: 4.246

5.  Female exogamy and gene pool diversification at the transition from the Final Neolithic to the Early Bronze Age in central Europe.

Authors:  Corina Knipper; Alissa Mittnik; Ken Massy; Catharina Kociumaka; Isil Kucukkalipci; Michael Maus; Fabian Wittenborn; Stephanie E Metz; Anja Staskiewicz; Johannes Krause; Philipp W Stockhammer
Journal:  Proc Natl Acad Sci U S A       Date:  2017-09-05       Impact factor: 11.205

6.  Mitochondrial DNA variations in Austronesian-speaking populations living in the New Georgia Islands, the Western Province of the Solomon Islands.

Authors:  Mariko Issiki; Izumi Naka; Ryosuke Kimura; Takuro Furusawa; Kazumi Natsuhara; Taro Yamauchi; Minato Nakazawa; Takafumi Ishida; Ryutaro Ohtsuka; Jun Ohashi
Journal:  J Hum Genet       Date:  2017-11-13       Impact factor: 3.172

7.  Massively parallel sequencing-enabled mixture analysis of mitochondrial DNA samples.

Authors:  Jennifer D Churchill; Monika Stoljarova; Jonathan L King; Bruce Budowle
Journal:  Int J Legal Med       Date:  2018-02-22       Impact factor: 2.686

8.  Ancient DNA reveals a multistep spread of the first herders into sub-Saharan Africa.

Authors:  Mary E Prendergast; Mark Lipson; Elizabeth A Sawchuk; Iñigo Olalde; Christine A Ogola; Nadin Rohland; Kendra A Sirak; Nicole Adamski; Rebecca Bernardos; Nasreen Broomandkhoshbacht; Kimberly Callan; Brendan J Culleton; Laurie Eccles; Thomas K Harper; Ann Marie Lawson; Matthew Mah; Jonas Oppenheimer; Kristin Stewardson; Fatma Zalzala; Stanley H Ambrose; George Ayodo; Henry Louis Gates; Agness O Gidna; Maggie Katongo; Amandus Kwekason; Audax Z P Mabulla; George S Mudenda; Emmanuel K Ndiema; Charles Nelson; Peter Robertshaw; Douglas J Kennett; Fredrick K Manthi; David Reich
Journal:  Science       Date:  2019-05-30       Impact factor: 47.728

9.  Heteroplasmic shifts in tumor mitochondrial genomes reveal tissue-specific signals of relaxed and positive selection.

Authors:  Sneha Grandhi; Colleen Bosworth; Wesley Maddox; Cole Sensiba; Sara Akhavanfard; Ying Ni; Thomas LaFramboise
Journal:  Hum Mol Genet       Date:  2017-08-01       Impact factor: 6.150

10.  Mitochondrial DNA Haplogroups and Susceptibility to Neuroblastoma.

Authors:  Xiao Chang; Marina Bakay; Yichuan Liu; Joseph Glessner; Komal S Rathi; Cuiping Hou; Huiqi Qu; Zalman Vaksman; Kenny Nguyen; Patrick M A Sleiman; Sharon J Diskin; John M Maris; Hakon Hakonarson
Journal:  J Natl Cancer Inst       Date:  2020-12-14       Impact factor: 13.506

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.