Literature DB >> 24874681

Draft Genome Sequence of Mycobacterium triplex DSM 44626.

Mohamed Sassi1, Olivier Croce1, Catherine Robert1, Didier Raoult1, Michel Drancourt2.   

Abstract

We announce the draft genome sequence of Mycobacterium triplex strain DSM 44626, a nontuberculosis species responsible for opportunistic infections. The genome described here is composed of 6,382,840 bp, with a G+C content of 66.57%, and contains 5,988 protein-coding genes and 81 RNA genes.
Copyright © 2014 Sassi et al.

Entities:  

Year:  2014        PMID: 24874681      PMCID: PMC4038886          DOI: 10.1128/genomeA.00499-14

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

Mycobacterium triplex was described on the basis of a unique mycolic-acid pattern and a distinctive 16S rRNA gene hypervariable region (1). Phylogenetic analysis confirmed close relationships with Mycobacterium simiae, Mycobacterium genavense, Mycobacterium lentiflavum, and Mycobacterium sherrisii (2, 3). M. triplex has been initially isolated from sputum (4), lymph nodes (5), and cerebrospinal fluid specimens (1, 6). Only six cases of infection have been reported in immunocompromised patients, including HIV-infected patients (1) and a liver transplant patient (7). We analyzed the whole-genome sequence of M. triplex to facilitate the elucidation of its relationships within the M. simiae complex and the design of tools for its advanced detection and identification. Genomic DNA was isolated from M. triplex strain DSMZ 44626 grown in MGIT Middlebrook liquid culture (Becton Dickinson, Sparks, MD) at 37°C. DNA was sequenced using three high-throughput next-generation sequencing (NGS) technologies: Roche 454 (Roche Diagnostics Corporation, Indianapolis, IN) (8), SOLiD version 4 (Life Technologies, Carlsbad, CA), and MiSeq Illumina (Illumina, Inc., San Diego, CA). Two Roche 454 libraries, a 3.6-kb paired-end and a 1.6-kb shotgun XL+, were constructed, loaded on a picotiter (PTP) plate, and sequenced with the Roche-GS FLX Titanium sequencing kit XLR70. The two runs yielded 90.13 Mb with 265,087 passed filters and an average 348-bp length. The bar-coded paired-end SOLiD library generated 1,338,576 reads of 50- × 35-bp-length reads. Finally, a paired-end Nextera library, fragmented at 942 bp and sequenced on MiSeq in 2 × 151 bp, yielded 269,610 reads with an indexing of 2.15% on the flow cell. The reads that issued from these three sequencing technologies were first assembled separately. The 454 reads were assembled into contigs and scaffolds using Newbler version 2.8 (Roche). Illumina reads, trimmed using Trimmomatic (9), were assembled using the Spades software (10, 11). Contigs obtained were combined by using SSpace (12) and Opera v. 1.2 (13) software; GapFiller v. 1.10 (14) helped to reduce the set. Some manual refinements using CLC Genomics v. 5 software (CLC bio, Aarhus, Denmark) improved the genome. These analyses yielded four scaffolds of 21 contigs containing 6,379,625 bp and an estimated size, including gaps, of 6,382,840 bp, for a 66.57% G+C content. Noncoding genes and miscellaneous features were predicted using RNAmmer (15), ARAGORN (16), Rfam (17), and PFAM (18). Open reading frames (ORFs) were predicted using Prodigal (19) and functional annotation was achieved using BLASTP against the GenBank database (20) and the Clusters of Orthologous Groups (COGs) database (21, 22). The genome was shown to encode at least 81 predicted RNAs, including 3 rRNAs in a single operon, 58 tRNAs, 1 transfer-messenger RNA (tmRNA), and 19 miscellaneous RNAs. Also, 5,988 genes represented a coding capacity of 5,867,145 bp for a 91.9% coding percentage. Among these genes, 765 (12.77%) were found to encode putative proteins and 1.011 (16.88%) were assigned hypothetical proteins. Moreover, 5,940 genes matched a least one sequence in the COGs database with BLASTP default parameters.

Nucleotide sequence accession numbers.

The M. triplex strain DSM 44626 genome sequence has been deposited at ENA under the accession numbers CCAU010000001 to CCAU010000021.
  22 in total

1.  ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences.

Authors:  Dean Laslett; Bjorn Canback
Journal:  Nucleic Acids Res       Date:  2004-01-02       Impact factor: 16.971

2.  Rfam: an RNA family database.

Authors:  Sam Griffiths-Jones; Alex Bateman; Mhairi Marshall; Ajay Khanna; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

3.  Assembling single-cell genomes and mini-metagenomes from chimeric MDA products.

Authors:  Sergey Nurk; Anton Bankevich; Dmitry Antipov; Alexey A Gurevich; Anton Korobeynikov; Alla Lapidus; Andrey D Prjibelski; Alexey Pyshkin; Alexander Sirotkin; Yakov Sirotkin; Ramunas Stepanauskas; Scott R Clingenpeel; Tanja Woyke; Jeffrey S McLean; Roger Lasken; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2013-10       Impact factor: 1.479

4.  Characterization of an SAV organism and proposal of Mycobacterium triplex sp. nov.

Authors:  M M Floyd; L S Guthertz; V A Silcox; P S Duffey; Y Jang; E P Desmond; J T Crawford; W R Butler
Journal:  J Clin Microbiol       Date:  1996-12       Impact factor: 5.948

5.  Brief report: disseminated mycobacteriosis caused by drug-resistant Mycobacterium triplex in a human immunodeficiency virus-infected patient during highly active antiretroviral therapy.

Authors:  A Cingolani; M Sanguinetti; A Antinori; L M Larocca; F Ardito; B Posteraro; G Federico; G Fadda; L Ortona
Journal:  Clin Infect Dis       Date:  2000-07       Impact factor: 9.079

6.  Novel mycobacterium related to Mycobacterium triplex as a cause of cervical lymphadenitis.

Authors:  R Hazra; M M Floyd; A Sloutsky; R N Husson
Journal:  J Clin Microbiol       Date:  2001-04       Impact factor: 5.948

7.  Prodigal: prokaryotic gene recognition and translation initiation site identification.

Authors:  Doug Hyatt; Gwo-Liang Chen; Philip F Locascio; Miriam L Land; Frank W Larimer; Loren J Hauser
Journal:  BMC Bioinformatics       Date:  2010-03-08       Impact factor: 3.169

Review 8.  Mycobacterium triplex pulmonary disease in immunocompetent host.

Authors:  Claudio Piersimoni; Piergiorgio Zitti; Gianna Mazzarelli; Allessandro Mariottini; Domenico Nista; Diego Zallocco
Journal:  Emerg Infect Dis       Date:  2004-10       Impact factor: 6.883

9.  Toward almost closed genomes with GapFiller.

Authors:  Marten Boetzer; Walter Pirovano
Journal:  Genome Biol       Date:  2012-06-25       Impact factor: 13.583

10.  The Pfam protein families database.

Authors:  Marco Punta; Penny C Coggill; Ruth Y Eberhardt; Jaina Mistry; John Tate; Chris Boursnell; Ningze Pang; Kristoffer Forslund; Goran Ceric; Jody Clements; Andreas Heger; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2011-11-29       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.