| Literature DB >> 31220249 |
Paulina Borówka1, Łukasz Pułaski2,3, Błażej Marciniak4,5, Beata Borowska-Strugińska1, Jarosław Dziadek3, Elżbieta Żądzińska1, Wiesław Lorkiewicz1, Dominik Strapagiel4,5.
Abstract
BACKGROUND: Recent advances in ancient DNA studies, especially in increasing isolated DNA yields and quality, have opened the possibility of analysis of ancient host microbiome. However, such pitfalls as spurious identification of pathogens based on fragmentary data or environmental contamination could lead to incorrect epidaemiological conclusions. Within the Mycobacterium genus, Mycobacterium tuberculosis complex members responsible for tuberculosis share up to ∼99% genomic sequence identity, while other more distantly related Mycobacteria other than M. tuberculosis can be causative agents for pulmonary diseases or soil dwellers. Therefore, reliable determination of species complex is crucial for interpretation of sequencing results.Entities:
Keywords: NGS; aTB; ancient DNA; ancient tuberculosis
Mesh:
Substances:
Year: 2019 PMID: 31220249 PMCID: PMC6586198 DOI: 10.1093/gigascience/giz065
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Number of genomic k-mers from MTBC and MOTT members after initial hg19 clearing step matching selected targets, with k-mer length distinction (≥20, ≥25, ≥30, ≥35 bp), with estimation of percentage of k-mers from a given mycobaterial genome matching the M. tuberculosis genome for query length ≥30 and ≥35
| Query k-mer length ≥20 | Query k-mer length ≥25 | Query k-mer length ≥30 | Query k-mer length ≥35 | |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| Alignment target | Genome length (bp) | Sequences mapped to MTB genome (%) | Full genome | Borowka et al. | Bos et al. | Bouwman et al. | Sequences mapped to MTB genome (%) | Full genome | Borowka et al. | Bos et al. | Bouwman et al. | Sequences mapped to MTB genome (%) | Full genome | Borowka et al. | Bos et al. | Bouwman et al. | Sequences mapped to MTB genome (%) | Full genome | Borowka et al. | Bos et al. | Bouwman et al. |
| |
| 3,268,203 | 3.19 | 140,922 | 19,736 | 101 | 1,683 | 4.88 | 215,257 | 4,349 | 85 | 2,240 | 2.61 | 115,138 | 1,430 | 26 | 1,201 | 1.45 | 63,860 | 543 | 6 | 715 |
|
| 5,067,172 | 5.26 | 232,228 | 46,530 | 158 | 2,915 | 2.87 | 126,769 | 2,816 | 103 | 1,890 | 1.39 | 61,160 | 283 | 46 | 1,065 | 0.75 | 33,175 | 62 | 14 | 644 | |
|
| 6,988,209 | 11.19 | 493,570 | 107,339 | 543 | 6,917 | 5.68 | 250,537 | 7,793 | 340 | 2,919 | 2.88 | 127,219 | 1,187 | 162 | 1,610 | 1.64 | 72,286 | 262 | 65 | 944 | |
|
| 6,254,616 | 8.48 | 374,030 | 77,785 | 291 | 5,208 | 5.22 | 230,382 | 5,940 | 131 | 2,774 | 2.69 | 118,483 | 958 | 40 | 1,534 | 1.53 | 67,463 | 236 | 16 | 916 | |
|
| 5,349,645 | 8.98 | 396,255 | 88,582 | 391 | 5,909 | 6.57 | 289,788 | 9,912 | 157 | 3,597 | 3.45 | 152,331 | 1,665 | 97 | 1,951 | 2.03 | 89,593 | 377 | 56 | 1,176 | |
|
| 5,938,797 | 9.33 | 411,677 | 80,142 | 339 | 5,414 | 9.51 | 419,641 | 12,734 | 197 | 4,578 | 5.35 | 235,800 | 3,904 | 93 | 2,702 | 3.15 | 139,050 | 1,450 | 33 | 1,575 | |
|
| 5,910,436 | 9.00 | 396,854 | 76,829 | 413 | 5,597 | 10.69 | 471,493 | 19,780 | 392 | 5,022 | 0.00 | 265,638 | 5,366 | 186 | 2,806 | 3.54 | 156,188 | 1,531 | 71 | 1,706 | |
|
| 4,434,836 | 7.14 | 314,850 | 60,482 | 262 | 4,336 | 8.17 | 360,395 | 11,534 | 207 | 4,105 | 0.00 | 200,395 | 3,233 | 120 | 2,126 | 2.62 | 115,687 | 1,060 | 68 | 1,235 | |
|
| 6,660,144 | 9.48 | 418,304 | 82,499 | 466 | 5,715 | 14.08 | 621,166 | 52,438 | 707 | 6,366 | 7.88 | 347,459 | 16,301 | 266 | 3,465 | 4.49 | 198,076 | 4,208 | 88 | 2,046 | |
|
| 5,805,761 | 8.26 | 364,492 | 71,682 | 339 | 4,800 | 12.26 | 540,893 | 36,626 | 448 | 5,543 | 6.94 | 306,075 | 10,994 | 160 | 3,094 | 4.04 | 178,217 | 3,088 | 61 | 1,886 | |
|
| 6,402,301 | 10.51 | 463,445 | 89,051 | 472 | 6,353 | 15.93 | 702,577 | 39,990 | 596 | 7,181 | 9.54 | 420,814 | 13,458 | 278 | 4,032 | 5.82 | 256,893 | 4,132 | 129 | 2,373 | |
|
| 4,829,781 | 8.07 | 356,159 | 71,620 | 322 | 5,128 | 12.08 | 532,953 | 16,610 | 194 | 5,331 | 7.31 | 322,606 | 4,752 | 110 | 3,271 | 4.58 | 202,232 | 1,475 | 65 | 2,095 | |
|
| 4,235,765 | 7.08 | 312,375 | 52,214 | 274 | 4,137 | 13.05 | 575,862 | 22,641 | 540 | 6,284 | 7.98 | 352,034 | 8,023 | 374 | 3,703 | 4.94 | 217,744 | 2,893 | 254 | 2,322 | |
|
|
| 4,288,871 | 17.53 | 773,238 | 181,627 | 598 | 9,935 | 94.85 | 4,184,378 | 734,742 | 2,306 | 37,814 | 96.27 | 4,245,996 | 725,608 | 2,253 | 35,935 | 96.21 | 4,244,109 | 713,211 | 2,214 | 34,394 |
|
| 4,370,115 | 17.81 | 785,606 | 188,016 | 825 | 10,498 | 96.71 | 4,266,542 | 772,527 | 3,989 | 40,576 | 98.17 | 4,330,722 | 771,950 | 3,873 | 38,507 | 98.12 | 4,328,596 | 758,572 | 3,785 | 36,841 | |
|
| 4,389,314 | 17.87 | 788,161 | 186,939 | 850 | 10,494 | 97.15 | 4,285,645 | 764,554 | 4,038 | 40,740 | 98.63 | 4,350,937 | 764,150 | 3,893 | 38,685 | 98.60 | 4,349,503 | 751,103 | 3,796 | 37,018 | |
|
| 4,345,492 | 17.72 | 781,857 | 184,148 | 592 | 10,161 | 96.31 | 4,248,729 | 750,458 | 2,304 | 39,042 | 97.79 | 4,313,964 | 749,050 | 2,252 | 36,990 | 97.76 | 4,312,566 | 735 993 | 2,213 | 35,367 | |
|
| 4,411,532 | 18.07 | 797,099 | 192,022 | 833 | 10,844 | 98.41 | 4,341,179 | 791,071 | 3,947 | 42,253 | 99.97 | 4,410,355 | 792,717 | 3,851 | 40,180 | 100 | 4,411,458 | 779,771 | 3,777 | 38,435 | |
Number of reads (per individual) used for alignment and statistical processing
| Sample ID | Raw reads | Trimmed reads | Average read length | Non-human reads | |||
|---|---|---|---|---|---|---|---|
| >20 | >25 | >30 | >35 | ||||
| 1_BK4 | 17,507,911 | 17,038,725 | 57.6 | 16,977,024 | 16,902,603 | 16,378,765 | 15,191,086 |
| 4_BK4 | 18,816,573 | 18,215,498 | 51.7 | 18,095,660 | 17,960,494 | 17,086,604 | 15,246,279 |
| 6_BK4 | 16,322,105 | 15,815,995 | 55.0 | 15,551,094 | 15,427,193 | 14,682,610 | 13,220,243 |
| 7_BK4 | 2,231,650 | 2,160,395 | 59.7 | 2,102,955 | 2,095,297 | 2,047,913 | 1,936,435 |
| 9_BK4 | 14,974,057 | 14,503,433 | 53.5 | 14,240,738 | 14,085,752 | 13,149,549 | 11,600,503 |
| 11A_BK4 | 16,432,267 | 16,000,777 | 58.0 | 15,766,313 | 15,695,767 | 15,172,161 | 14,034,604 |
| 11B_BK4 | 18,522,995 | 18,078,222 | 55.7 | 725,913 | 718,941 | 674,747 | 597,601 |
| 12_BK4 | 23,116,936 | 22,273,434 | 55.6 | 21,272,850 | 21,151,065 | 20,156,692 | 18,073,071 |
| 14_BK4 | 17,849,685 | 17,383,629 | 58.8 | 17,310,864 | 17,235,014 | 16,752,835 | 15,595,926 |
| 15_BK4 | 16,062,102 | 15,607,381 | 58.2 | 15,539,859 | 15,460,941 | 14,915,585 | 13,881,414 |
| 17_BK4 | 14,980,797 | 14,496,468 | 58.1 | 14,426,404 | 14,372,805 | 14,078,235 | 13,247,545 |
| 18_BK4 | 24,217,412 | 23,575,201 | 59.1 | 23,370,869 | 23,281,268 | 22,704,123 | 21,306,454 |
| 21_BK4 | 11,890,953 | 11,500,254 | 60.1 | 11,271,958 | 11,237,968 | 11,021,676 | 10,439,448 |
| 22_BK4 | 17,996,717 | 17,498,339 | 58.8 | 17,417,850 | 17,365,274 | 17,013,067 | 16,007,094 |
| 25_BK4 | 17,560,698 | 16,997,518 | 57.7 | 16,888,515 | 16,816,770 | 16,375,850 | 15,237,575 |
| 29_BK4 | 8,994,172 | 8,724,285 | 58.1 | 8,683,928 | 8,642,230 | 8,393,680 | 7,800,006 |
| 31_BK4 | 20,427,813 | 19,941,632 | 58.4 | 19,741,741 | 19,684,774 | 19,309,226 | 18,187,574 |
| 32_BK4 | 35,100,769 | 33,926,405 | 54.9 | 33,754,943 | 33,623,260 | 32,780,233 | 30,194,531 |
| 33_BK4 | 24,501,712 | 23,719,299 | 58.3 | 21,669,095 | 21,595,959 | 21,031,538 | 19,569,420 |
| 34_BK4 | 16,453,473 | 16,047,224 | 57.3 | 14,901,123 | 14,842,998 | 14,421,402 | 13,376,818 |
| 47_BK4 | 18,736,966 | 18,155,651 | 55.6 | 17,998,648 | 17,903,991 | 17,174,561 | 15,478,180 |
| 55_BK4 | 17,435,264 | 16,904,284 | 48.0 | 16,768,595 | 16,530,082 | 14,886,541 | 12,170,210 |
| 65_BK3 | 17,465,925 | 16,921,732 | 50.6 | 16,735,483 | 16,587,671 | 15,466,034 | 13,185,810 |
| 71_BK4 | 17,919,758 | 17,434,181 | 50.4 | 17,086,979 | 17,017,135 | 16,549,441 | 15,441,174 |
| 72_BK4 | 16,355,009 | 15,952,974 | 57.9 | 15,874,302 | 15,812,384 | 15,444,022 | 14,541,576 |
| 73_BK4 | 17,050,731 | 16,578,547 | 57.8 | 16,270,896 | 16,212,738 | 15,778,509 | 14,632,126 |
| 77_BK4 | 14,044,420 | 13,478,859 | 56.0 | 13,390,126 | 13,322,735 | 12,866,845 | 11,763,625 |
| 78_BK4 | 17,004,599 | 16,352,717 | 60.1 | 16,250,859 | 16,164,585 | 15,758,397 | 15,027,226 |
Figure 1:Changes in standardized ratio values in different read length bins and targets (red diamonds indicate outliers in Mycobacterium tuberculosis H37Rv and Borówka et al. targets in bin of reads ≥30 bp long).
Figure 2:Comparison of alignment targets constructed with different assumptions (red diamonds indicate outliers in Mycobacterium tuberculosis H37Rv and Borówka et al. targets in bin of reads ≥35 bp long).