| Literature DB >> 31888476 |
Taiana T Silva-Pereira1,2, Cássia Y Ikuta2, Cristina K Zimpel1,2, Naila C S Camargo1,2, Antônio F de Souza Filho2, José S Ferreira Neto2, Marcos B Heinemann2, Ana M S Guimarães3,4.
Abstract
BACKGROUND: Mycobacterium pinnipedii, a member of the Mycobacterium tuberculosis Complex (MTBC), is capable of infecting several host species, including humans. Recently, ancient DNA from this organism was recovered from pre-Columbian mummies of Peru, sparking debate over the origin and frequency of tuberculosis in the Americas prior to European colonization.Entities:
Keywords: Comparative genomics; Genome; Mycobacterium pinnipedii; Mycobacterium tuberculosis complex; Superinfection
Mesh:
Substances:
Year: 2019 PMID: 31888476 PMCID: PMC6937676 DOI: 10.1186/s12864-019-6407-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Circular map of proteome of Mycobacterium pinnipedii genomes compared to the reference genome of Mycobacterium tuberculosis H37Rv. From the outer ring to the inner ring: Genome position, M. tuberculosis H37Rv, M. pinnipedii MP1, M. pinnipedii MP2, M. pinnipedii G01222, M. pinnipedii G01491, M. pinnipedii G01492, M. pinnipedii G01498, M. pinnipedii ATCC BAA-688. Color scale represents protein identity. Contigs were ordered using MAUVE and map was generated in Patric 3.5.18 web resources
Fig. 2Protein interaction network of CDS (coding DNA sequences) with known function containing non-synonymous SNPs (single nucleotide polymorphisms). CDSs with mutations were identified between strains MP1 and MP2 of Mycobacterium pinnipedii. Network was generated using STRING database version 11.0, with default parameters. Green edge: neighborhood evidence; blue edge: co-ocurrence evidence; pink edge: experimental evidence; yellow edge: textmining evidence; black edge: co-expression evidence
Fig. 3Clusters of orthologous proteins of Mycobacterium pinnipedii strains. a Venn diagram of orthologous proteins. b Cluster of Orthologous Groups (COG) classification based on function of the core protein clusters of the M. pinnipedii strains. c COG classification without unknown function class (S). Protein clustering was performed using OrthoMCL as available in KBase plataform. *The number of orthologous proteins shared between these groups are available in Additional file 4: Table S3. Cluster size varied from two to forty-three proteins
Fig. 4Large sequence polymorphisms (LSPs) of Mycobacterium pinnipedii strains. Values indicate number of nucleotides spanning each deleted region (zero means the region is not deleted in the corresponding bacterial strain). Start and end: nucleotide positions according to Mycobacterium tuberculosis H37Rv reference genome. Genes are annotated according to reference genome. *Ancient South American Mycobacterium pinnipedii strains. RD: regions of difference. All listed deletions have been already described in modern M. pinnipedii strains
Fig. 5Phylogenetic tree based on SNPs (single nucleotide polymorphisms) of Mycobacterium tuberculosis Complex (MTBC) using Maximum Likelihood (ML) model. Green box: modern cluster 1; orange box: modern cluster 2; pink box: ancient cluster; black circle: deletions MiD3 and MiD4; red triangle: deletion RD2seal. Mycobacterium tuberculosis H37Rv was used as outgroup. A customized script in Python version 3.6.3 was used to build a matrix of SNPs, which was used to infer a ML tree using RAxML with GTRCAT model and autoMRE for best-scoring ML tree and a maximum of 1000 bootstrap inferences using RaxML software version 7.3
Assembly statistics, genotyping and demographics of modern Mycobacterium pinnipedii strains used in this study
| Strains | Assembly statistics | Genotyping | Demographics | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Available data | Coverage | Number of contigs | N50 | Contig size (range) | Software | Spoligotypes | Source (species, country)a | Ref. | |
| Paired-end reads and draft genome | 228x | 102 | 61,148 | 1020-184,975 | CLC Genomics Workbench 11 | SB0155 | This study | ||
| Paired-end reads and draft genome | 220x | 106 | 60,933 | 1010- 184,294 | CLC Genomics Workbench 11 | SB2455 | This study | ||
| Paired-end reads | 32x | 138 | 38,346 | 1020-121,089 | CLC Genomics Workbench 11 | SB0155 | Argentina | [ | |
| Paired-end reads | 242x | 194 | 45,251 | 1135–150,198 | SPAdes 3.13.0 | SB0155 | Australia | [ | |
| Paired-end reads | 484x | 128 | 64,538 | 1146–193,250 | SPAdes 3.13.0 | SB0155 | Australia | [ | |
| Paired-end reads | 113x | 201 | 42,094 | 1026-122,203 | SPAdes 3.13.0 | SB0155 | Australia | [ | |
| Single reads | 86x | ND | ND | ND | ND | SB0155 | Germany (zoo) | [ | |
| Single reads | 87x | ND | ND | ND | ND | SB0155 | Germany (zoo) | [ | |
| Draft genome | NA | 162 | 113,505 | 500–281,603 | Velvet 1.2.10 | Unknownc | Australia | [ | |
ND: not done. NA: not available. awhere animal species are not listed, it is because they were not informed by the authors. bReads of M. pinnipedii 7739 and 7011 were not assembled because they were available as single reads. These reads were only used in the phylogenomic analysis. cSpoligotype pattern not found in Mbovis.org database (Bincode: 0000001000000000000000010100010001000000000). Sequencing coverage was calculated by the number of bases (after adaptors removal and quality trimming) divided by the average size of an MTBC genome (i.e. 4.3 Mb). Spoligotypes were defined using SpoTyping [40]