| Literature DB >> 34674637 |
Imke Lankheet1, Mário Vicente1,2, Chiara Barbieri3,4, Carina Schlebusch5,6,7.
Abstract
BACKGROUND: Mitochondrial haplogroup assignment is an important tool for forensics and evolutionary genetics. African populations are known to display a high diversity of mitochondrial haplogroups. In this research we explored mitochondrial haplogroup assignment in African populations using commonly used genome-wide SNP arrays.Entities:
Keywords: Africa; HaploGrep; Haplogroup assignment; SNP array; mtDNA
Mesh:
Substances:
Year: 2021 PMID: 34674637 PMCID: PMC8532338 DOI: 10.1186/s12863-021-01000-2
Source DB: PubMed Journal: BMC Genom Data ISSN: 2730-6844
SNP arrays investigated in the current study
| Corresponding number | SNP array | Version SNP data | Number of mitochondrial SNPs |
|---|---|---|---|
| 1 | Affymetrix™ Genome-Wide Human SNP Array 6.0 | January 2017a | 411 |
| 2 | Axiom™ Genome-Wide Human Origins 1 Array | February 2015 | 256 |
| 3 | Axiom™ Genome-Wide PanAFR Genotyping Bundle | January 2017a | 239 |
| 4 | H3Africa Array | November 2018 | 260 |
| 5 | Illumina Infinium Multi-Ethnic AMR/AFR-8 | July 2015a | 373 |
| 6 | Illumina Infinium Multi-Ethnic Global-8 | February 2017a | 522 |
| 7 | Illumina Infinium Omni2.5–8 | February 2018 | 116 |
| 8 | Illumina Infinium Omni5–4 | July 2016a | 111 |
The eight SNP arrays that are compared based on their ability to assign African haplogroups are listed. The version of the SNP array that was used and the number of mitochondrial SNPs are listed for each SNP array. A (a) indicates that this is currently the latest version of the SNP panel. All SNP arrays used the hg19/37 reference genome to refer to SNP positions. The SNP arrays are numbered, and these numbers are used to reference to them in the text
Fig. 1Percentage of assignable mitochondrial haplogroups per SNP array, compared to the full mitochondrial genome. The various SNP arrays are listed on the x-axis. Only clades with a minimum bootstrapping value of 50 were used for the analysis. The two shades represent the level of haplogroup assignment that has been investigated. Darker shades indicate that haplogroups up to the level of three digits (e.g. L0d) have been investigated. Lighter shades indicate that haplogroups up to the level of four digits (e.g. L0d1) have been investigated. Haplogroup assignments from phylogenetic trees based on full mitochondrial genomes were the golden standard. SNP array 5 and 6 show the best performance on African haplogroup assignment
Fig. 2L0-L3 haplogroup assignment performance for eight different SNP arrays. The percentage of haplogroups that could be assigned compared to the full mitochondrial genome is shown for L0-L3. Only clades with a minimum bootstrapping value of 50 were used for the analysis. The different colours indicate the eight SNP arrays. When interested in a specific African haplogroup or a population carrying a specific haplogroup in high frequency, this SNP array analysis can guide researchers in assessing if mitochondrial haplogroup assignment using the particular SNP array will be informative. Up to four-digit haplogroups have been investigated (for example L0, L0d, L0d1). The numbers underneath each haplogroup indicate on how many sequences the analysis for that haplogroup is based
Fig. 3Average haplogroup assignment scores (HaploGrep2). The haplogroup assignment scores from HaploGrep2 were averaged and are shown here for the full mitochondrial sequences as well as each of the individual SNP arrays. The different colours indicate the different haplogroups. The other African haplogroups (L4-L6) are not shown here because of their low sample size
Fig. 4The percentage of correctly assigned African haplogroups by HaploGrep2, using only SNP array data. The percentage of African haplogroups that were correctly assigned by HaploGrep2 using only the SNPs typed on that SNP array is shown. The golden standard is the haplogroup reported in the NCBI GenBank. This analysis does not take into account the haplogroup rank, nor does it take into account the level of haplogroup assignment; whether L0 or L0a2a is assigned, makes no difference for this analysis