| Literature DB >> 28422985 |
Endre Neparáczki1,2, Klaudia Kocsy1, Gábor Endre Tóth1, Zoltán Maróti3, Tibor Kalmár3, Péter Bihari4, István Nagy4,5, György Pálfi2, Erika Molnár2, István Raskó6, Tibor Török1.
Abstract
As part of the effort to create a high resolution representative sequence database of the medieval Hungarian conquerors we have resequenced the entire mtDNA genome of 24 published ancient samples with Next Generation Sequencing, whose haplotypes had been previously determined with traditional PCR based methods. We show that PCR based methods are prone to erroneous haplotype or haplogroup determination due to ambiguous sequence reads, and many of the resequenced samples had been classified inaccurately. The SNaPshot method applied with published ancient DNA authenticity criteria is the most straightforward and cheapest PCR based approach for testing a large number of coding region SNP-s, which greatly facilitates correct haplogroup determination.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28422985 PMCID: PMC5396865 DOI: 10.1371/journal.pone.0174886
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Details of NGS data for each sample.
| cemetery/grave no. /sample name | sample source | total no. of reads | no. of reads mapped on rCRS | no. of unique mapped reads | average fragment length | Average coverage | (%) of nucleotides above 5x coverage | estimated contamination (%) | (%) G to A misincorp. at 3' end (MapDamage) | (%) C to T misincorp. at 5' end (MapDamage) |
|---|---|---|---|---|---|---|---|---|---|---|
| Magyarhomoróg/120/anc2 | tooth | 26152 | 12519 | 2994 | 55.97 | 10.2 | 89.62 | 0.00 | 6.11 | 7.81 |
| Orosháza-Görbics tanya/2/ anc3 | femur | 85178 | 50555 | 5516 | 79.37 | 24.8 | 99.89 | 0.43 | 8.13 | 8.89 |
| Szabadkígyós-Pálliget/7/anc4 | tooth | 53176 | 27002 | 5006 | 63.85 | 19.3 | 97.46 | 0.45 | 8.19 | 9.22 |
| Szegvár-Oromdülő/412/anc5 | tooth femur | 242925 | 139136 | 74712 | 68.45 | 302.9 | 100.00 | 1.47 | 15.11 | 16.15 |
| Szegvár-Oromdülő/593/anc6 | tooth | 66798 | 35260 | 6488 | 58.09 | 22.8 | 98.64 | 2.08 | 6.47 | 8.70 |
| Sárrétudvari-Hízóföld/5/anc10 | tooth | 17632 | 6664 | 4284 | 69.94 | 17.9 | 95.11 | 0.00 | 7.40 | 10.29 |
| Sárrétudvari-Hízóföld/118/anc12 | tooth | 36214 | 14708 | 6218 | 63.96 | 24.0 | 98.81 | 0.36 | 8.75 | 11.53 |
| Sárrétudvari-Hízóföld/213/anc13 | tooth | 42326 | 20383 | 9424 | 62.44 | 34.2 | 95.01 | 0.83 | 9.01 | 10.22 |
| Harta-Freifelt/10/anc25 | tooth | 135472 | 66169 | 7938 | 80.75 | 35.8 | 99.93 | 1.75 | 6.00 | 7.10 |
| Karos-III/1 | femur | 30830 | 5968 | 4738 | 81.44 | 22.4 | 98.07 | 0.00 | 13.09 | 11.48 |
| Karos-III/3 | femur | 58982 | 12858 | 7414 | 76.84 | 32.0 | 98.91 | 1.68 | 12.51 | 10.66 |
| Karos-III/4 | femur | 82194 | 22930 | 9316 | 68.71 | 38.3 | 99.98 | 0.33 | 12.93 | 11.98 |
| Karos-III/5 | metatarsus | 60797 | 25043 | 20236 | 85.13 | 100.2 | 100.00 | 3.76 | 7.99 | 6.65 |
| Karos-III/6 | femur | 41054 | 3863 | 1886 | 70.68 | 7.9 | 87.24 | 0.00 | 6.71 | 6.75 |
| Karos-III/8 | femur | 75724 | 26747 | 11426 | 64.26 | 43.9 | 99.67 | 1.79 | 11.37 | 11.08 |
| Karos-III/10 | femur | 67416 | 6371 | 3438 | 68.22 | 14.1 | 89.61 | 0.00 | 9.89 | 9.04 |
| Karos-III/11 | femur tooth | 203927 | 75860 | 57508 | 69.95 | 240.7 | 100.00 | 1.43 | 13.48 | 15.45 |
| Karos-III/12 | femur | 52738 | 5843 | 4742 | 70.76 | 20.0 | 95.80 | 3.96 | 12.40 | 11.19 |
| Karos-III/14 | femur | 61346 | 16134 | 8778 | 69.29 | 35.7 | 99.69 | 1.82 | 14.53 | 14.13 |
| Karos-III/15 | femur | 142977 | 83486 | 24702 | 81.12 | 115.4 | 100.00 | 0.22 | 9.78 | 9.39 |
| Karos-III/16 | femur | 90950 | 23233 | 5334 | 75.65 | 23.6 | 99.14 | 1.34 | 11.34 | 11.12 |
| Karos-III/17 | femur | 43330 | 2626 | 2382 | 79.64 | 11.2 | 87.55 | 0.00 | 10.31 | 9.91 |
| Karos-III/18 | femur | 9184 | 3208 | 3154 | 68.48 | 12.9 | 90.49 | 0.00 | 15.42 | 12.11 |
| Karos-III/19 | tooth | 59102 | 30135 | 5948 | 69.07 | 24.6 | 98.78 | 0.00 | 6.39 | 6.55 |
Data refer to paired-end sequences from UDG treated libraries. The Szegvár-Oromdülő/412/anc5 and Karos-III/11 samples were sequenced twice from tooth and femur with identical results, then these sequence reads were merged, and statistics are given for the merged reads.
Comparison of Haplogroups identified with different PCR based methods and NGS.
| cemetery/grave no. / sample name | HVR-I muations found (position -16000) | HVR-II and coding region mutations studied / method | HVR-II and coding region mutations found | Hg described in the sudy (Hg with Haplogrep) | Haplotype identified by NGS in the present study | unnoticed SNP-s, or | |
|---|---|---|---|---|---|---|---|
| Magyarhomoróg/120/ anc2 | CRS | 73 7028 14766 / RFLP | - | H (H2a2a1) | H84 | none | |
| Orosháza-Görbics tanya/2/ anc3 | 147A 172C 183C 189C 223T 320T 355T | 10238 / RFLP | 10238C | N1a (L3e2b) | N1a1a1a1a | none | |
| Szabadkígyós-Pálliget/7/ anc4 | 223T 356C | 10400 12308 12705 / sequencing | 12308G | U4 (U4a2b) | N1a1a1a1 | 16147A 16172C 16248T 16320T 16355T | |
| Szegvár-Oromdülő/412/ anc5 | CRS | 73 7028 14766 / RFLP | 14766T | H (R0) | K1c1d | 16224C 16311C 73G 7028T | |
| Szegvár-Oromdülő/593/ anc6 | 114A 192T 256T 270T 294T | 12308 / sequencing | 12308G | U5a1 (U5a2a) | U5a2a1b | none | |
| Sárrétudvari-Hízóföld/5/ anc10 | 129A 148T 223T | 10238 / RFLP 12705 / sequencing | 10238C 12705T | I (N1) | I5a1a | 16391A | |
| Sárrétudvari-Hízóföld/118/ anc12 | 126C 182C 183C 189C 294T 296T 298C | 9 bp del / elecrtophoresis | 9 bp del | T (T2f1a) | T2f1a1 | none | |
| Sárrétudvari-Hízóföld/213/ anc13 | 311C | 73 14766 / RFLP 11719 12308 12705 / sequencing | 73G 11719A 14766T | R (R1) | J1c3g | 16069T 16126C | |
| Harta-Freifelt/10/ anc25 | 294T 304C | 73 7028 14766 / RFLP 10310 / sequencing | - | H (H5a4) | H5e1a | none | |
| Karos-III/1 | 183C 189C 217C | HVR-II: nt.190-309 sequenced and coding region 22 SNP-s of the GenoCoRe22 assay determined in all cases | 263G 7028T 9bp del 11719A 14766T | B4 | B4d1 | 207A | |
| Karos-III/3 | 362C | 239C 263G | H6 | H6a1b | none | ||
| Karos-III/4 | 069T 092C 126C 261T | 228A 263G 295T 7028T 11719A 12612G 14766T | J1c7 | J1c7a | none | ||
| Karos-III/5 | 183C 189C 217C | 263G 7028T 9bp del 11719A 14766T | B4 | B4d1 | none | ||
| Karos-III/6 | 189C | 263G 7028T 9bp del 11719A 14766T | B4’5 | B4d1 | 16183C, 16217C | ||
| Karos-III/8 | 051G 189C 362C | 263G 7028T 11467G 11719A 14766T | U2e | U2e1b | 217C, 16129C, 16256T, | ||
| Karos-III/10 | 304C | 263G | H5 | H5e1 | 16189C 16294T | ||
| Karos-III/11 | 189C 223T 278T | 195C 257G 263G 6371T 7028T 11719A 12705T 14766T | X2f | X2f | 16093C | ||
| Karos-III/12 | 183C 189C 223T 290T 319A | 235G 263G 4248C 7028T 11719A 12705T 14766T | A | A12 | none | ||
| Karos-III/14 | 126C 163G 186T 189C 294T | 195G 263G 7028T 11719A 13368A 14766T | T1a | T1a1b | none | ||
| Karos-III/15 | 069T 126C 362C | 263G 295T 7028T 11719A 12612G 14766T | J | J2a1 | 16263 del | ||
| Karos-III/16 | 256T 270T | 263G 7028T 11467G 11719A 14766T | U5a | U5a1a2a | 16399G | ||
| Karos-III/17 | 362C | 239C 263G | H6 | H6a1a | none | ||
| Karos-III/18 | 126C 163G 186T 189C 294T | 214G 263G 7028T 11719A 13368A 14766T | T1a10a | T1a10a | none | ||
| Karos-III/19 | 126C 163G 186T 189C 294T | 214G 263G 7028T 11719A 13368A 14766T | T1a10a | T1a10a | none |
Hg-s determined incorrectly with PCR methods are hightlighted with pink background, while yellow bakground hightlights correct Hg-s with incorrect haplotypes. Erroneously identified SNP-s are labelled with bold italic and lined through. Haplogroups and haplotypes were determined with the HaploGrep 2 version 2.1.0 [28] based on Phylotree 17 [39] from the available SNP-s. For the [7] samples HaploGrep assignment, based on their identified SNP positions is given in parenthesis.
*data from Ph.D thesis of Tömöry 2008.
Fig 1Chromatogram of two HVR-I sequence fragments of the Karos-III/16 sample from [9].
Arrows label double peaks, correct reads according to NGS data are listed above the arrows.
Fig 2Electropherograms of two SNaPshot SBE-II reactions from two extracts of the same Karos-III/6 sample [9].
Characters at the top indicate Hg-s defined by the corresponding peaks. Black characters indicate peaks defining the ancestral allele, read characters indicate peaks defining the derived allele. Arrows point at double peaks. As each dye has a different influence on DNA mobility, positions of identical fragments with different dyes are not the same. Black arrows point at peaks taken into account, while blue arrows indicate neglected peaks, considered to have been derived from contamination. Orange peaks are size standards (GeneScan-120 LIZ, Applied Biosystems).