| Literature DB >> 32019968 |
Marília Lima da Conceição1,2,3, Emilyn Costa Conceição4,5, Ismari Perini Furlaneto6, Sandro Patroca da Silva7, Arthur Emil Dos Santos Guimarães6,7, Pedro Gomes8, María Laura Boschiroli9, Lorraine Michelet9, Thomas Andreas Kohl10,11, Katharina Kranzer12,13, Loreno da Costa Francez14, Luana Nepomuceno Gondim Costa Lima7, Isabel Portugal8, João Perdigão15, Karla Valéria Batista Lima6,7.
Abstract
Lack of routine surveillance in countries endemic for bovine tuberculosis (TB) and limited laboratory support contributes to the inability to differentiate the Mycobacterium tuberculosis Complex species, leading to an underestimated burden of the disease. Here, Whole-Genome Sequencing of Mycobacterium bovis isolated from tissues with TB-like lesions obtained from cattle and buffalos at Marajó Island, Brazil, demonstrates that recent transmission of M. bovis is ongoing at distinct sites. Moreover, the M. bovis epidemiology in this setting is herein found to be dominated by an endemic and unique clade composed of strains evolved from a common ancestor that are now genetically differentiated from other M. bovis clades. Additionally, envisioning a rapid strain differentiation and tracing across multiple settings, 28 globally validated strain-specific SNPs were identified, three of which considered as robust markers for the M. bovis Marajó strain. In conclusion, this study contributes with data regarding the identification of a novel M. bovis phylogenetic clade responsible for ongoing transmission events in both cattle and buffalo species in Brazil, provides a framework to investigate the dissemination of this highly prevalent strain and, holds the potential to inform TB control strategies that may help to prevent the spread of bovine and zoonotic TB.Entities:
Mesh:
Year: 2020 PMID: 32019968 PMCID: PMC7000724 DOI: 10.1038/s41598-020-58398-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Mycobacterium bovis isolates from cattle and buffalos in the Marajó Island (Pará, Brazil). For each isolate, the year of isolation, city of origin along with classical genotyping data and mapping statistics are shown.
| Isolate | Year | Infection site | Animal | City | Genotype | Mapping Statistics | ||
|---|---|---|---|---|---|---|---|---|
| SIT/SBb | MIRU-VNTR clusterc | No. of Mapped Reads(%) | Mean Coverage Depth | |||||
| G00001 | 2014 | Lymph nodes | Buffalo 1 | Soure | 997/0822 | 1 | 2915368 (59.0) | 85.3582 |
| G00002a | 2014 | Lymph nodes | Buffalo 2 | Soure | 997/0822 | 4 | 1217486 (39.6) | 34.8304 |
| G00003a | 2015 | Lymph nodes | Cattle 1 | Chaves | 997/0822 | — | 443667 (10.7) | 7.7398 |
| G00004 | 2015 | Lymph nodes | Cattle 2 | Chaves | 997/0822 | 1 | 3889617 (98.8) | 114.7544 |
| G00005 | 2015 | Esophagus | Cattle 3 | Chaves | 997/0822 | 1 | 4452372 (99.7) | 131.8627 |
| G00006 | 2015 | Lymph nodes | Cattle 4 | Chaves | 997/0822 | 1 | 3166441 (99.7) | 93.1683 |
| G00007 | 2015 | Lymph nodes | Cattle 5 | Chaves | 997/0822 | 1 | 3392896 (99.6) | 100.1515 |
| G00008a | 2015 | Abomasum | Cattle 6 | Chaves | 997/0822 | 1 | 796095 (18.7) | 17.0591 |
| G00009 | 2015 | Esophagus | Buffalo 3 | Santa Cruz do Arari | 997/0822 | 1 | 3403743 (98.2) | 100.0624 |
| G00010 | 2015 | Liver | Buffalo 4 | Soure | 997/0822 | 2 | 3847522 (99.7) | 115.2658 |
| G00011 | 2015 | Lymph nodes | Buffalo 5 | Soure | 997/0822 | 1 | 3117214 (98.8) | 91.9547 |
| G00012 | 2015 | Rumen | Cattle 7 | Soure | 997/0822 | 1 | 2873682 (75.6) | 84.5041 |
| G00013 | 2015 | Lymph nodes | Buffalo 6 | Soure | 997/0822 | 3 | 4823761 (99.2) | 143.5417 |
| G00014 | 2015 | Lymph nodes | Buffalo 7 | Soure | 997/0822 | 1 | 4544806 (98.8) | 135.971 |
| G00015 | 2015 | Lymph nodes | Cattle 8 | Soure | 986/0885 | 1 | 5509443 (98.5) | 166.1321 |
| G00016a | 2015 | Lymph nodes | Cattle 9 | Soure | 997/0822 | — | 779376 (14.3) | 13.4106 |
| G00017 | 2015 | Lymph nodes | Buffalo 8 | Soure | 997/0822 | 3 | 5174290 (99.9) | 155.8405 |
| G00018 | 2015 | Lymph nodes | Buffalo 9 | Cachoeira do Arari | 997/0822 | 5 | 4438288 (99.8) | 131.5516 |
| G00019 | 2015 | Liver | Cattle 3 | Chaves | 997/0822 | 1 | 4254501 (99.9) | 124.9458 |
| G00020a | 2015 | Lymph nodes | Cattle 7 | Soure | 997/0822 | 2 | 729323 (13.7) | 15.6735 |
| G00021 | 2015 | Abomasum | Cattle 6 | Chaves | 997/0822 | 1 | 16287213 (98.6) | 476.0113 |
| G00022 | 2015 | Lymph nodes | Cattle 7 | Soure | 997/0822 | 1 | 15229721 (96.0) | 440.3858 |
aGenomes excluded due to low coverage depth and/or low percentage of mapping reads;
bShared International Type and SB-number according to SITVIT2 and Mycobacterium bovis Spoligotyping Database (MBovis.org);
c24-loci MIRU-VNTR profiles: 1 (225322341363454251322312); 2 (225322341363454251322312); 3 (225322341362454251322312); 4 (225322141363474251322312); and 5 (225322341363474251322312).
Figure 1Geographical distribution of genomic clusters, maximum-likelihood phylogenetic and minimum spanning trees (MSTs) of the M. bovis isolates from the Marajó Island, Brazil. The geographic distribution of the three genomic clusters found in the Marajó Island shows that M. bovis transmission is ongoing at multiple cities in the island (A) as a result of clonal expansion of a unique M. bovis clade that has disseminated across both cattle and buffalo species (B). MSTs with node coloring according to host species (C), city of origin (D) and genomic cluster (E) also support the dissemination of this strain across multiple cities and host species with moderate diversification observed mostly between genomic clusters. The maximum-likelihood phylogenetic tree based on 1773 high-quality genome-wide SNPs (B) is shown annotated with the genomic clusters (see legend), host species, MIRU-VNTR and spoligotyping profiles and city of origin of the animal. MIRU-VNTR profiles are represented in a 24-digit numeric string where each-digit represents the number of repeats at a particular locus according to the following order of the loci: 154, 424, 577, 580, 802, 960, 1644, 1955, 2059, 2163b, 2165, 2347, 2401, 2461, 2531, 2687, 2996, 3007, 3171, 3192, 3690, 4052, 4156, 4384 and 4348. Numbers annotated in links between the nodes of the MSTs (C–E) represent the number of segregating SNP sites between nodes. Figure generated using the Interactive Tree of Life v5 online tool (available at https://itol.embl.de), Microsoft PowerPoint 2016 (Version 1707) and Microsoft Excel 2016 (Version 1707), incl. Microsoft Power Map 3D Data Visualization Tool (https://products.office.com/en-us/business/office) and Phyloviz v2.0 (available at https://online2.phyloviz.net).
Figure 2Frequency histogram of SNP pairwise distances across the Marajó Island M. bovis isolates (B) and across the global M. bovis dataset (B). Pairwise distance distribution across the Marajó M. bovis isolates (A) show a restricted distribution when compared with the global dataset (B) with the strains isolated from the Marajó Island showing a SNP pairwise distance of up to 64 SNPs. In panel A, two additional distance peaks around 450 and 1050 bp represent the distance of the Marajó strains towards M. bovis AF2122/97 and M. caprae EPDC01, which were used as reference for mapping and to root the phylogenetic trees, respectively.
Figure 3Global phylogenetic tree of 257 M. bovis isolates (including the 17 Marajó M. bovis isolates) highlighting the monophyletic nature of the M. bovis Marajó clade. The tree is shown annotated with the isolate ID or ENA run accession, clonal complex colored according to the legend in the bottom-left corner, country of isolation, year of isolation, host species, and spoligotyping profile (in this order). Figure generated using the Interactive Tree of Life v5 online tool (available at https://itol.embl.de).
Figure 4Principal Component Analysis (PCA, panel A) and Principal Coordinate Analysis (PCoA) showing the genetic differentiation of the M. bovis Marajó strains upon comparison with additional 240 M. bovis isolates (B and C). PCA demonstrates the clustering of the M. bovis Marajó strains across all three principal components while showing the genetic divergence from the remaining isolates (A). PCoA corroborates this analysis by demonstrating the unique differentiation of the Marajó M. bovis population when compared with other settings (B) or with the known and previously described clonal complexes (C). Points across the PCA and PCoA plots are coloured according to the study population (country of origin) or clonal complex (see legends).
Pairwise FST distance matrix between different M. bovis sub-populations according to Clonal Cluster and comparison with the Marajo clade (unclassified as per the established Clonal Cluster genomic markers). Pairwise FST values are shown in the matrix upper triangle whereas p values are shown in the matrix lower triangle. The pairwise FST distance values highlight the low genetic differentiation of Unclassified isolates in comparison with other genetic clades, likely owing to its paraphyletic nature, with the Marajo clade showing a high genetic differentiation from the African 1–2 and European 1–2 Clonal Clusters but, lower in comparison with the remaining Unclassified isolates. All comparisons were significant at the 0.05 level of statistical significance.
| African 1 | African 2 | European 1 | European 2 | Marajo | Unclassified | |
|---|---|---|---|---|---|---|
| African 1 | 0.501 | 0.601 | 0.513 | 0.877 | 0.118 | |
| African 2 | 0.000 | 0.491 | 0.442 | 0.723 | 0.178 | |
| European 1 | 0.000 | 0.000 | 0.328 | 0.529 | 0.153 | |
| European 2 | 0.000 | 0.000 | 0.000 | 0.533 | 0.163 | |
| Marajo | 0.000 | 0.000 | 0.000 | 0.000 | 0.242 | |
| Unclassified | 0.001 | 0.000 | 0.000 | 0.000 | 0.000 |
Figure 5Boxplots of pairwise FST distances between populations (herein defined as clade or clonal complex), revealing the higher level of genetic differentiation of the Marajó strain when compared with other sub-populations of M. bovis.
Clade-specific SNPs detected for the Marajo strain along with affected genes, functional effect and affected gene product. All strain-specific SNPs are listed with positioning relative to the genome of M. tuberculosis H37Rv and M. bovis AF2122/97 with synonymous variants occurring at essential genes highlighted in bold.
| Genomic Position ( | Genomic Position ( | ORFc | Gene Name | Reference allele | Altered allele | Mutation | Variant Type | Gene Essentialityd | Gene Product |
|---|---|---|---|---|---|---|---|---|---|
| 463574 | 464591 | Rv0386/Mb0393 | — | T | C | T164C | missense | Non-essential | Probable transcriptional regulatory protein (probably LuxR/UhpA-family) |
| 473386 | 474405 | Rv0393/Mb0399 | — | T | C | T606C | synonymous | Non-essential | Conserved 13e12 repeat family protein |
| 770659 | 772428 | Rv0671/Mb0690 | G | A | G78A | synonymous | Non-essential | Possible conserved lipoprotein LpqP | |
| 938123 | 938878 | Rv0842/Mb0865 | — | C | T | C12T | synonymous | Non-essential | Probable conserved integral membrane protein |
| 990495 | 990960 | Rv0890c/Mb0914c | — | T | C | A2102G | missense | Non-essential | Probable transcriptional regulatory protein (probably LuxR-family) |
| 1067162 | 1067629 | Rv0955/Mb0980 | — | T | C | T1085C | missense | Non-essentiale | Probable conserved integral membrane protein |
| 1078797 | 1079264 | Rv0969/Mb0994 | G | T | G55T | stop | Non-essential | Probable metal cation transporter p-type ATPase CtpV | |
| 1192229 | 1193617 | Intergenic | — | A | G | — | — | — | — |
| 1394348 | 1395620 | Rv1250/Mb1282 | — | T | G | T170G | missense | Non-essential | Probable drug-transport integral membrane protein |
| 1513242 | 1515686 | Rv1348/Mb1383 | A | G | A196G | missense | Essential | Iron-regulated transporter IrtA | |
| 1529907 | 1532351 | Rv1358/Mb1393 | — | C | A | C3296A | missense | Non-essential | Probable transcriptional regulatory protein |
| 2245157 | 2228483 | Intergenic | — | C | A | — | — | — | — |
| 2582546 | 2564552 | Intergenic | — | T | C | — | — | — | — |
| 2585942 | 2567948 | Rv2314c/Mb2341c | — | A | G | T1349C | missense | Non-essential | Conserved protein |
| 2744419 | 2716993 | Rv2444c/Mb2471c | C | T | G566A | missense | Essential | Possible ribonuclease E Rne | |
| 3027938 | 2988986 | Rv2714/Mb2733 | — | A | G | A874G | missense | Non-essential | Conserved alanine and leucine rich protein |
| 3100631 | 3061592 | Rv2791c/Mb2814c | — | T | G | A951C | synonymous | Non-essential | Probable transposase |
| 3107666 | 3068627 | Intergenic | — | C | G | — | — | — | — |
| 3284440 | 3245425 | Rv2941/Mb2966 | G | A | G1106A | missense | Non-essential | Fatty-acid-amp ligase fadd28 (fatty-acid-amp synthetase) (fatty-acid-amp synthase) | |
| 3479333 | 3440293 | Rv3111/Mb3138 | A | G | A163G | missense | Non-essentiale | Probable molybdenum cofactor biosynthesis protein c MoaC1 | |
| 3600623 | 3559632 | Rv3224/Mb3251 | — | A | C | A773C | missense | Non-essential | Possible iron-regulated short-chain dehydrogenase/reductase |
| 3606612 | 3565621 | Rv3229c/Mb3258c | G | A | C423T | synonymous | Non-essentiale | Possible linoleoyl-coa desaturase (delta(6)-desaturase) | |
| 3859073 | 3811188 | Rv3439c/Mb3469c | — | G | T | C590A | missense | Non-essential | Conserved hypothetical alanine and proline rich protein |
| 4142051 | 4083811 | Rv3699/Mb3725 | — | A | C | A8C | missense | Non-essential | Conserved protein |
| 4357167 | 4297899 | Rv3878/Mb3908 | T | C | T475C | missense | Non-essential | Esx-1 secretion-associated protein EspJ |
aRelative to the genome position of M. tuberculosis H37Rv, GenBank Accession NC_000962.3;
bRelative to the genome position of M. bovis AF2122/97, GenBank Accession NC_002945.4;
cOpen Reading Frame;
dGene Essentiality according to DeJesus et al. (2017) using saturated Himar1 transposon libraries[22];
eNon-essential gene by DeJesus et al.[22], essential gene by Sassetti et al.[52] and Griffin et al.[53].