Literature DB >> 23408795

Complete genome sequence of Corynebacterium pseudotuberculosis biovar ovis strain P54B96 isolated from antelope in South Africa obtained by rapid next generation sequencing technology.

Syed Shah Hassan¹, Luis Carlos Guimarães, Ulisses de Pádua Pereira, Arshad Islam, Amjad Ali, Syeda Marriam Bakhtiar, Dayana Ribeiro, Anderson Rodrigues Dos Santos, Siomar de Castro Soares, Fernanda Dorella, Anne Cybelle Pinto, Maria Paula Cruz Schneider, Maria Silvanira Barbosa, Síntia Almeida, Vinícius Abreu, Flávia Aburjaile, Adriana Ribeiro Carneiro, Louise Teixeira Cerdeira, Karina Fiaux, Eudes Barbosa, Carlos Diniz, Flavia S Rocha, Rommel Thiago Jucá Ramos, Neha Jain, Sandeep Tiwari, Debmalya Barh, Anderson Miyoshi, Borna Müller, Artur Silva, Vasco Azevedo.

Abstract

The Actinobacteria, Corynebacterium pseudotuberculosis strain P54B96, a nonmotile, non-sporulating and a mesophile bacterium, was isolated from liver, lung and mediastinal lymph node lesions in an antelope from South Africa. This strain is interesting in the sense that it has been found together with non-tuberculous mycobacteria (NTMs) which could nevertheless play a role in the lesion formation. In this work, we describe a set of features of C. pseudotuberculosis P54B96, together with the details of the complete genome sequence and annotation. The genome comprises of 2.34 Mbp long, single circular genome with 2,084 protein-coding genes, 12 rRNA, 49 tRNA and 62 pseudogenes and a G+C content of 52.19%. The analysis of the genome sequence provides means to better understanding the molecular and genetic basis of virulence of this bacterium, enabling a detailed investigation of its pathogenesis.

Entities: Chemical Disease Species

Keywords: Antelope; Gram-positive pathogen; Ion Torrent; caseous lymphadenitis/cheesy gland disease; genome sequencing; liver lesion; s: biovar ovis

Year: 2012 PMID： 23408795 PMCID： PMC3569390 DOI： 10.4056/sigs.3066455

Source DB: PubMed Journal: Stand Genomic Sci ISSN： 1944-3277

Introduction

Caseous lymphadenitis (CLA) or cheesy gland [1] is highly prevalent in many regions of the world, resulting in huge and significant economic losses in agribusiness since it is responsible for a decrease in wool production and carcass quality [2]. Mainly small ruminant populations like sheep and goats, and other mammals, such as bovines, pigs, deer, ovines, equines, and even, though rarely, in camels and humans, are the victims of [3-6]. The disease is characterized by the presence of caseous necrosis in external and/or internal lymph nodes [1,7]. Ulcerative lymphangitis, which is confined to the lymph vessels of extremities particularly the hind legs, is a disease caused by this bacterium in the horse [8,9]. The bacterium in some cases of human lymphadenitis, clinical strains are occasionally recovered [10]. The prevalence of CLA in the animals scattered throughout the globe needs effective measures to control the onset of the disease in herds along with the treatment of infected animals. Numerous reports have been published worldwide where mainly small ruminants are the carriers of the . They include South Africa, Brazil, United States of America, Canada, Australia, New Zealand, United Kingdom and Egypt [11-18]. Histopathological examination of antelope carcasses from a South African game reserve, a part of their routine meat inspection, showed tuberculosis-like lesions. These lesions were characterized by the presence of encapsulated necrogranulomatous inflammation similar to CLA within the pulmonary tissues, in bronchial lymph nodes, liver, kidney and some other organs of the antelopes [11]. Diseases caused by the bacterium are presented in various clinical forms as sheep and goats, affected with CLA [19]. Among the affected animal population, the increased prevalence and rapid transmission of the disease necessitates certain measures to control disease dissemination and prevent the nearby wildlife. The analysis of the genome sequence will help us better understand the molecular and genetic basis of virulence of this bacterium.

Classification and Features

is a facultative intracellular pathogen showing pleomorphic forms like coccoids and filamentous rods, with sizes ranging between 0.5-0.6 µm and 1.0-3.0 µm [2]. Cells are described as Gram-positive, non-encapsulated, non-motile, non-sporulating and possessing fimbriae [12,20]. The bacterium was first isolated in 1888 from bovine farcy by Nocard and was first completely described by Preisz, showing its resemblance to diphtheria bacillus. The organism has been previously named and, [8,21]. It is a facultative anaerobe. The best growth temperature and pH are 37o C and 7.0-7.2, respectively [17,22]. After initially growing sparsely, strain P54B96 forms organized clumps on the agar surface, demonstrating dry opaque and concentrically ringed colonies. In liquid media it develops a granular deposit with a surface pellicle [8,22,23]. There exist two biotypes of according to their capability of nitrate reduction. Bacteria capable of performing the reduction of nitrate are classified into biovar equi (nitrate reduction positive; mainly isolated from horses and cattle) while the bacteria which can not perform the reduction of nitrate, pertain to biovar ovis (nitrate reduction negative; frequently isolated from sheep and goats) [2,24]. Corynebacteria possess an unusual structural organization in their cell envelope, similar to the Gram-negative bacteria [25] and belong to a very heterogeneous CMNR ( and ) group that shares characteristics including an outer lipid layer, mycolic acids in the cell wall along with with its derivatives including phospholipids and lipomannans [4]. Marchand et al. (2012) and others reported the presumed mycomembrane, an atypical outer membrane, pore-forming proteins like PorA and PorB, mycoloyltransferases, the so-called fibronectin-binding proteins like cMytA-D and cMytF, several lipoproteins and some unknown putative C-terminal hydrophobic anchored proteins [26]. Analysis of amino acids and amino sugars of cell wall peptidoglycan reveals the presence of meso-diaminopimelic acid (meso-DAP). Major cell wall sugars are arabinose and galactose [17,27]. In addition, high and low molecular mass glucan, arabinomannan and lipoglycan also make part of the cell wall. Trehalose dimycolate (TDM) and trehalose monomycolate (TMM) are soluble cell envelope lipids [28]. Biochemically, all strains produce acid from glucose, maltose, fructose, sucrose and mannose [21,22]. This bacterium is catalase positive and phospholipase D, beta-hemolysis and oxidase negative [23,29]. Figure 1 shows the phylogenetic neighborhood of strain P54B96 in an rpoB gene (β subunit of RNA polymerase) based tree. It has recently been shown that phylogenetic analysis for the identification of as well as other CMNR species based on rpoB gene sequences are more accurate than analyses based on 16S rRNA [42,43]. The rpoB gene sequences of reference strains from the CMNR group were used to construct the phylogenetic tree.

Figure 1

Phylogenetic tree of strain P54B96 representing its position relative to type strains in along with some other type strains of CMNR group. The tree was inferred from 3,537 aligned characters of the rpoB gene sequence using maximum likelihood method and then checked for its agreement with the current classification Table 1. The branch lengths represent the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 bootstrap replicates, indicated when Larger than 60%. Calculations to determine the phylogenetic distances were done by the software MEGA v5 [30].

Table 1

Classification and general features of strain P54B96 according to the MIGS recommendations [31].

MIGS ID	Property	Term	Evidence code
	Classification	Domain Bacteria	TAS [32]
		Phylum Actinobacteria	TAS [33]
		Class Actinobacteria	TAS [34]
		Order Actinomycetales Suborder Corynebacterineae	TAS [34-37]
		Family Corynebacteriaceae	TAS [34,35,37,38]
		Genus Corynebacterium	TAS [35,38,39]
		Species Corynebacterium pseudotuberculosis	TAS [35,40]
		Strain P54B96	TAS [11]
	Gram stain	Positive	TAS [21]
	Cell shape	pleomorphic forms	TAS [21]
	Motility	non-motile	TAS [8]
	Sporulation	non-sporulating	TAS [22]
	Temperature range	mesophilic	TAS [8,22]
	Optimum temperature	37°C	TAS [8,22]
	Salinity	not reported	NAS
MIGS-22	Oxygen requirement	aerobic and facultatively anaerobic	TAS [8,22]
	Carbon source	glucose, fructose, maltose, mannose, and sucrose	TAS [8]
	Energy source	Chemoorganotroph	TAS [8]
MIGS-6	Habitat	Host	TAS [22]
MIGS-15	Biotic relationship	intracellular facultative pathogen	TAS [22]
MIGS-14	Pathogenicity	sheep, goats, horses and cattle, rarely humans	TAS [5,6]
	Biosafety level	2	TAS [22]
	Isolation	liver, lung, mediastinal lymph node lesions of antelope	TAS [11]
MIGS-4	Geographic location	Mpumalanga province, South Africa	TAS [11]
MIGS-5	Sample collection time	2009	TAS [11]
MIGS-4.1MIGS-4.2	Latitude Longitude	not reported not reported
MIGS-4.3	Depth	not reported
MIGS-4.4	Altitude	not reported

Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i. e. a direct report exists in the literature); NAS: Non-traceable Author Statement (i. e. not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [41]. If the evidence code is IDA, then the property was directly observed for a living isolate by one of the authors or an expert mentioned in the acknowledgements.

Genome sequencing and annotation

Genome project history

This organism was selected for sequencing on the basis of its phylogenetic position. The genome project is deposited in the Genomes OnLine Database [44] and the complete genome sequence is available in GenBank (CP003385.1). Sequencing, finishing and annotation were performed by the Rede Paraense de Genômica e Proteômica (RPGP), Pará, Brazil. A summary of the project information is shown in Table 2.

Table 2

Genome sequencing project information

MIGS ID	Property	Term
MIGS-31	Finishing quality	Finished
MIGS-28	Libraries used	Fragments (mean size 112 bp)
MIGS-29	Sequencing platforms	Semiconductor Ion Torrent PGM
MIGS-31.2	Sequencing coverage	35-fold
MIGS-30	Assemblers	CLC Genome Workbench 4.7.2, Velvet
MIGS-32	Gene calling method	Glimmer v3.02
	INSDC ID	CP003385 (chromosome)
	GenBank Date of Release	April 05, 2012
	GOLD ID	Gc02176
	NCBI project ID	77871
	Database: IMG-GEBA	2512564058
MIGS-13	Source material identifier	BHI broth, P54B96
	Project relevance	Animal Pathogen, Medical

Growth conditions and DNA isolation

P54B96 was grown in brain-heart-infusion broth (BHI-HiMedia Laboratories Pvt. Ltda, India) in shake culture at140 rpm and at 37oC. Extraction of chromosomal DNA was performed by using 50 mL of 48–72 h culture of , centrifuged at 4oC and 2000× g for 20 min. Re-suspension of cell pellets was done in 1 mL Tris/EDTA/NaCl [10 mM Tris/HCl (pH7.0), 10 mM EDTA (pH8.0), and 300 mM NaCl] for re-centrifugation under the same conditions. The pellets were re-suspended in 1 mL TE/lysozyme [25 mM Tris/HCl (pH8.0), 10 mM EDTA (pH8.0), 10 mM NaCl, and 10 mg lysozyme/mL]. The sample was then incubated at 37oC for 30 min and then 30 µL of 30% (w/v) sodium N- lauroyl-sarcosine (Sarcosyl) was added to it, incubated for 20 min at 65oC, followed by incubation for 5 min at 4oC. Purification of DNA with phenol/chloroform/isoamylalcohol (25:24:1) was followed by precipitation with ethanol. DNA concentration was determined by spectrophotometer, and the DNA was visualized in ethidium bromide-stained 0.7% agarose gel.

Genome sequencing and assembly

The complete genome sequence of P54B96 was obtained using the Ion Torrent PGM (Life Technologies) Sequencing Platform. A total, of 562,812 reads were generated, each with a mean size of 112 nts usable sequence (35-fold coverage). Furthermore, a hybrid de novo assembly approach was applied using 376,642 Ion filtered reads (19-fold coverage). This was carried out after quality filtering process during which reads representing an average Phred quality of less than 20, were removed. This strategy allowed closing gaps without bench work time cost [45]. For homopolymer correction, an inherent problem of the Ion Torrent [46], CLCBio Genome Workbench 4.7.2 was used. Having detected a high number of frameshifts, manual curation was required prior to analysis to prevent false-positive identification of pseudogenes. The genome of P54B96 strain consists of 2,337,657 bp circular chromosome and the average G+C content of the chromosome is 52.2%. The genome was predicted to contain 2,084 coding sequences (CDS), four rRNA operons, 49 tRNA and 62 pseudogenes.

Genome annotation

For automatic annotation, different programs were used. These include; Glimmer: gene predictor [47], RNAmmer: rRNA predictor [48]; tRNA-scan-SE: tRNA predictor [49]; and Tandem Repeat Finder: repetitive DNA predictor [50]. Functional annotation was performed by similarity analyses, using public databases of National Center for Biotechnology Information (NCBI) non-redundant database, Pfam and InterProScan software [51], which integrates multiple domain and protein family databases. Manual annotation was performed using Artemis [52].

Metabolic network analysis

The metabolic Pathway/Genome Database (PGDB) was computationally generated using Pathway Tools software version 15.0 [53] and MetaCyc version 15.0 [54], based on annotated EC numbers and a customized enzyme name mapping file. There has been no manual curation in the database and it may contain errors, similar to a Tier 3 BioCyc PGDB [55].

Genome properties

The genome is 2,337,657 bp long and comprises one main circular chromosome with a 52.19% GC content. A total of 2,207 genes were predicted, among which 2,146 were protein coding genes, and 61 RNAs; 62 pseudogenes were also identified. Of the whole genome, 69.01% comprise genes that were assigned with putative functions, while the remaining genes were annotated as hypothetical proteins. The properties and statistics of the genome are listed in Table 3. The distributions of genes into COGs functional categories is presented in Figure 2 and Table 4, followed by a cellular overview diagram in Figure 3 and a summary of metabolic network statistics shown in Table 5.

Table 3

Genome Statistics

Attribute	Value	% of Total
Genome size (bp)	2,337,657	100.00%
DNA coding region (bp)	2,005,391	85.79%
DNA G+C content (bp)	1,219,912	52.19%
Number of replicons	1
Extrachromosomal elements	0
Total genes	2,145	100.00%
RNA genes	61	2.76%
rRNA operons	4
Protein-coding genes	2,084	97.16%
Pseudo genes	62	2.81%
Genes with function prediction	1,511	68.46%
Genes in paralog clusters	425	19.26%
Genes assigned to COGs	1,552	70.32%
Genes assigned Pfam domains	1,596	72.32%
Genes with signal peptides	651	29.50%
Genes with transmembrane helices	584	26.46%
CRISPR repeats	0

Figure 2

Table 4

Number of genes associated with the general COG functional categories

Code	Value	%age	Description
J	140	6.72	Translation, ribosomal structure and biogenesis
A	1	0.1	RNA processing and modification
K	121	5.8	Transcription
L	88	4.2	Replication, recombination and repair
B	0	0.0	Chromatin structure and dynamics
D	21	1.0	Cell cycle control, cell division, chromosome partitioning
Y	0	0.0	Nuclear structure
V	25	1.2	Defense mechanisms
T	54	2.6	Signal transduction mechanisms
M	87	4.2	Cell wall/membrane biogenesis
N	1	0.1	Cell motility
Z	0	0.0	Cytoskeleton
W	0	0.0	Extracellular structures
U	27	1.3	Intracellular trafficking and secretion
O	77	3.7	Posttranslational modification, protein turnover, chaperones
C	90	4.3	Energy production and conversion
G	113	5.4	Carbohydrate transport and metabolism
E	177	8.5	Amino acid transport and metabolism
F	73	3.5	Nucleotide transport and metabolism
H	102	4.9	Coenzyme transport and metabolism
I	57	2.7	Lipid transport and metabolism
P	122	5.9	Inorganic ion transport and metabolism
Q	26	1.3	Secondary metabolites biosynthesis, transport and catabolism
R	169	8.1	General function prediction only
S	136	6.5	Function unknown
-	655	31.4	Not in COGs

Figure 3

Schematic cellular overview of all pathways of the P54B96 metabolism. Nodes represent metabolites, with shape indicating class of metabolite. Lines represent reactions.

Table 5

Metabolic Network Statistics

Attribute	Value
Total genes	2,145
Enzymes	500
Enzymatic reactions	764
Metabolic pathways	152
Metabolites	622

Graphical circular map of the genome. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew. Schematic cellular overview of all pathways of the P54B96 metabolism. Nodes represent metabolites, with shape indicating class of metabolite. Lines represent reactions.

45 in total

1. Artemis: sequence visualization and annotation.

Authors: K Rutherford; J Parkhill; J Crook; T Horsnell; P Rice; M A Rajandream; B Barrell
Journal: Bioinformatics Date: 2000-10 Impact factor: 6.937

2. Structure of the cell envelope of corynebacteria: importance of the non-covalently bound lipids in the formation of the cell wall permeability barrier and fracture plane.

Authors: Virginie Puech; Mohamed Chami; Anne Lemassu; Marie-Antoinette Lanéelle; Bettina Schiffler; Pierre Gounon; Nicolas Bayan; Roland Benz; Mamadou Daffé
Journal: Microbiology (Reading) Date: 2001-05 Impact factor: 2.777

3. InterProScan--an integration platform for the signature-recognition methods in InterPro.

Authors: E M Zdobnov; R Apweiler
Journal: Bioinformatics Date: 2001-09 Impact factor: 6.937

4. The Pathway Tools software.

Authors: Peter D Karp; Suzanne Paley; Pedro Romero
Journal: Bioinformatics Date: 2002 Impact factor: 6.937

5. Improved microbial gene identification with GLIMMER.

Authors: A L Delcher; D Harmon; S Kasif; O White; S L Salzberg
Journal: Nucleic Acids Res Date: 1999-12-01 Impact factor: 16.971

6. Characterization of United Kingdom isolates of Corynebacterium pseudotuberculosis using pulsed-field gel electrophoresis.

Authors: K M Connor; M M Quirie; G Baird; W Donachie
Journal: J Clin Microbiol Date: 2000-07 Impact factor: 5.948

7. Postal survey of ovine caseous lymphadenitis in the United Kingdom between 1990 and 1999.

Authors: S H Binns; M Bailey; L E Green
Journal: Vet Rec Date: 2002-03-02 Impact factor: 2.695

8. Biochemical disclosure of the mycolate outer membrane of Corynebacterium glutamicum.

Authors: Christophe H Marchand; Christophe Salmeron; Roland Bou Raad; Xavier Méniche; Mohamed Chami; Muriel Masi; Didier Blanot; Mamadou Daffé; Marielle Tropis; Emilie Huc; Pierre Le Maréchal; Paulette Decottignies; Nicolas Bayan
Journal: J Bacteriol Date: 2011-11-28 Impact factor: 3.490

Review 9. Caseous lymphadenitis in small ruminants.

Authors: L H Williamson
Journal: Vet Clin North Am Food Anim Pract Date: 2001-07 Impact factor: 3.357

10. Prevalence of and carcass condemnation from maedi-visna, paratuberculosis and caseous lymphadenitis in culled sheep from Quebec, Canada.

Authors: Julie Arsenault; Christiane Girard; Pascal Dubreuil; Danielle Daignault; Jean René Galarneau; Julie Boisclair; Carole Simard; Denise Bélanger
Journal: Prev Vet Med Date: 2003-05-30 Impact factor: 2.670

5 in total

1. Whole-Genome Sequence of Corynebacterium pseudotuberculosis 262 Biovar equi Isolated from Cow Milk.

Authors: Carlos Leonardo de A Araújo; Larissa M Dias; Adonney A O Veras; Jorianne T C Alves; Ana Lídia Q Cavalcante; Christopher G Dowson; Vasco Azevedo; Rommel T J Ramos; Artur Silva; Adriana R Carneiro
Journal: Genome Announc Date: 2016-03-24

2. In silico identification of essential proteins in Corynebacterium pseudotuberculosis based on protein-protein interaction networks.

Authors: Edson Luiz Folador; Paulo Vinícius Sanches Daltro de Carvalho; Wanderson Marques Silva; Rafaela Salgado Ferreira; Artur Silva; Michael Gromiha; Preetam Ghosh; Debmalya Barh; Vasco Azevedo; Richard Röttger
Journal: BMC Syst Biol Date: 2016-11-04

3. Exploration of Nitrate Reductase Metabolic Pathway in Corynebacterium pseudotuberculosis.

Authors: Sintia Almeida; Cassiana Sousa; Vinícius Abreu; Carlos Diniz; Elaine M S Dorneles; Andrey P Lage; Debmalya Barh; Vasco Azevedo
Journal: Int J Genomics Date: 2017-02-20 Impact factor: 2.326

4. Genome Sequence of Corynebacterium ulcerans Strain FRC11.

Authors: Leandro de Jesus Benevides; Marcus Vinicius Canário Viana; Diego César Batista Mariano; Flávia de Souza Rocha; Priscilla Carolinne Bagano; Edson Luiz Folador; Felipe Luiz Pereira; Fernanda Alves Dorella; Carlos Augusto Gomes Leal; Alex Fiorini Carvalho; Siomar de Castro Soares; Adriana Carneiro; Rommel Ramos; Edgar Badell-Ocando; Nicole Guiso; Artur Silva; Henrique Figueiredo; Vasco Azevedo; Luis Carlos Guimarães
Journal: Genome Announc Date: 2015-03-12

5. First genome sequencing and comparative analyses of Corynebacterium pseudotuberculosis strains from Mexico.

Authors: Doglas Parise; Mariana T D Parise; Marcus V C Viana; Adrian V Muñoz-Bucio; Yazmin A Cortés-Pérez; Beatriz Arellano-Reynoso; Efrén Díaz-Aparicio; Fernanda A Dorella; Felipe L Pereira; Alex F Carvalho; Henrique C P Figueiredo; Preetam Ghosh; Debmalya Barh; Anne C P Gomide; Vasco A C Azevedo
Journal: Stand Genomic Sci Date: 2018-10-10

5 in total