| Literature DB >> 21067984 |
Bettina Zimmermann1, Alexander Röck, Gabriela Huber, Tanja Krämer, Peter M Schneider, Walther Parson.
Abstract
The application of quasi-median networks provides an effective tool to check the quality of mtDNA data. Filtering of highly recurrent mutations prior to network analysis is required to simplify the data set and reduce the complexity of the network. The phylogenetic background determines those mutations that need to be filtered. While the traditional EMPOPspeedy filter was based on the worldwide mtDNA phylogeny, haplogroup-specific filters can more effectively highlight potential errors in data of the respective (sub)-continental region. In this study we demonstrate the performance of a new, west Eurasian filter EMPOPspeedyWE for the fine-tuned examination of data sets belonging to macrohaplogroup N that constitutes the main portion of mtDNA lineages in Europe. The effects on the resulting network of different database sizes, high-quality and flawed data, as well as the examination of a phylogenetically distant data set, are presented by examples. The analyses are based on a west Eurasian etalon data set that was carefully compiled from more than 3500 control region sequences for network purposes. Both, etalon data and the new filter file, are provided through the EMPOP database (www.empop.org).Entities:
Mesh:
Substances:
Year: 2010 PMID: 21067984 PMCID: PMC3065003 DOI: 10.1016/j.fsigen.2010.10.003
Source DB: PubMed Journal: Forensic Sci Int Genet ISSN: 1872-4973 Impact factor: 4.882
Fig. 1QM network torsi of the etalon data set. (A) HVS-I: nps 16024–16569; (B) HVS-II: nps 1–576. The nodes correspond to reduced and condensed haplotypes. The most frequent haplogroup and a “+” are given if more than one haplogroups are involved. The number of condensed haplotypes is indicated below. The branches represent mutational events; transitions are shown in green, transversions in red. Small black nodes represent a quasi-median indicating that this haplotype was not observed in the data set. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of the article.)
Fig. 2QM network torsi (nps 16024–16569) of a small high-quality data set (N = 31) from west Eurasia. (A) The torso of the 31 samples. (B) The resulting torso of the 31 samples combined with the etalon (N = 233).
Fig. 3QM network torsi (nps 16024–16400) of a Darginian data set (N = 29). (A) The torso of the 29 haplotypes. (B) The resulting torso of the 29 samples combined with the etalon (N = 231).
Fig. 4QM network torsi of high-quality west Eurasian samples. (A) The torso (nps 16024–16365) of an oversized sample set (N = 786). (B) The torso (nps 16024–16569) of 273 Austrians.
Fig. 5QM network torsi of 190 haplotypes from Thailand (nps 16024–16569), constructed with the new west Eurasian-specific filter EMPOPSpeedyWE (A) and the EMPOPspeedy filter (B).