Literature DB >> 32351981

Whole-Genome Sequencing of Mexican Strains of Anaplasma marginale: An Approach to the Causal Agent of Bovine Anaplasmosis.

Fernando Martínez-Ocampo1, Rosa Estela Quiroz-Castañeda2, Itzel Amaro-Estrada2, Edgar Dantán-González1, Jesús Francisco Preciado de la Torre2, Sergio Rodríguez-Camarillo2.   

Abstract

Anaplasma marginale is the main etiologic agent of bovine anaplasmosis, and it is extensively distributed worldwide. We have previously reported the first genome sequence of a Mexican strain of A. marginale (Mex-01-001-01). In this work, we report the genomic analysis of one strain from Hidalgo (MEX-14-010-01), one from Morelos (MEX-17-017-01), and two strains from Veracruz (MEX-30-184-02 and MEX-30-193-01). We found that the genome average size is 1.16-1.17 Mbp with a GC content close to 49.80%. The genomic comparison reveals that most of the A. marginale genomes are highly conserved and the phylogeny showed that Mexican strains cluster with Brazilian strains. The genomic information contained in the four draft genomes of A. marginale from Mexico will contribute to understanding the molecular landscape of this pathogen.
Copyright © 2020 Fernando Martínez-Ocampo et al.

Entities:  

Year:  2020        PMID: 32351981      PMCID: PMC7178543          DOI: 10.1155/2020/5902029

Source DB:  PubMed          Journal:  Int J Genomics        ISSN: 2314-436X            Impact factor:   2.326


1. Introduction

Bovine anaplasmosis is an infectious, tick-borne disease caused mainly by Anaplasma marginale; typical signs include anemia, fever, abortion, weight loss, decreased milk production, jaundice, and potentially death. Although a sick bovine may recover when antibiotics are administered, it usually remains as a carrier for life, being a risk of infection for susceptible cattle. Anaplasma marginale is an obligate intracellular Gram-negative bacterium with a genetic composition that is highly diverse among geographical isolates [1]. Currently, there are no fully effective vaccines against bovine anaplasmosis; therefore, the economic losses due to the disease are present. Whole-genome sequencing (WGS) is an applicable tool for many pathogenic bacterial studies since 1995, when the first bacterial genomes were determined [2, 3]. Vaccine formulation became a hard task for pathogens as diverse as Anaplasma marginale, and almost all efforts have been directed toward Outer Membrane Proteins (Omp), Type IV Secretion System (T4SS), and Major Surface Proteins (Msp) [4-8]. Up to date, there are several genomes reported from A. marginale, but only one is from a Mexican strain [9]. New data could be useful for focusing in alternative antigens that induce specific and protective responses against bovine anaplasmosis. In this work, we present draft genomes from four Anaplasma marginale Mexican strains. In addition, a first approach for comparative analyses between them and Brazilian, Australian, and North American strains is shown. In order to advance in the identification of potential vaccine molecules, pathogenicity, transmission and infection mechanisms, and genetic diversity of Anaplasma marginale, further analyses are necessary.

2. Materials and Methods

2.1. Strain Origin

Each of the four Mexican strains were isolated from infected blood of animals from Atitalaquia, Hidalgo (MEX-14-010-01); Puente de Ixtla, Morelos (MEX-17-017-01); Tlapacoyan, Veracruz (MEX-30-184-02); and Veracruz, Veracruz (MEX-30-193-01). The infected blood was collected and kept at -80°C until its use.

2.2. Genome Sequencing, Assembly, and Annotation

We used 200 μl of bovine blood for each isolate to extract genomic DNA using the UltraClean DNA BloodSpin kit (Mo Bio Laboratories). The library preparation was performed by the University of Arizona Genetics Core, using a DNA TruSeq library construction kit (Illumina). Two micrograms of genomic DNA for each isolate was sequenced with MiSeq platform (Illumina). The NextSeq instrument from Illumina uses sequencing-by-synthesis (SBS) chemistry. The Illumina adapter sequences were removed from paired-end reads using ILLUMINACLIP trimming step of the Trimmomatic (version 0.36) program with default settings [10]. Low-quality bases were removed using the dynamictrim algorithm of SolexaQA++ (version 3.1.7.1) suite [11] with a Phred quality score Q < 13. The resulting paired-end reads were de novo assembled using the SPAdes (version 3.11.1) program [12] with the following options: (i) only runs assembly module (--only-assembler), (ii) reduce number of mismatches (--careful), and (iii) k-mer lengths between 21 and 127. Based on the G+C content of each contig assembled using a Python script (https://github.com/FernandoMtzMx/GC_content_MultiFasta) (A. marginale genomes reported in databases have a G+C content between 46 and 52%), contigs of four Mexican strains were differentiated from contigs that belong to other organisms (i.e., bovine genomes). Also, we aligned the sequences of each contig assembled with the nucleotide collection (nr/nt) database and Anaplasma marginale as the organism name using BLASTN suite [13]. Contigs with an alignment coverage higher than 50% and an identity higher than 70% belong to A. marginale genomes were considered “reasonably good” alignments [14]. The features of four draft genomes were evaluated using the QUAST (version 4.6.2) program [15]. The draft genomes of four Mexican strains were annotated automatically using the RAST (version 2.0) server [16], and the 16S rRNA gene sequences were obtained using the RNAmmer (version 1.2) server [17].

2.3. Genomic Comparison

The Blast Ring Image Generator (BRIG) (v0.95) program [18] was used to determine the genome comparison between the Mexican A. marginale strains and six strains from Australia, Brazil, and the United States. The circular comparative genomic map was constructed by BRIG using the GenBank files (gbk format) with standard default parameters and NCBI local blast-2.9.0+ suite.

2.4. Phylogenetic Analysis

The 16S rRNA, gyrA, gyrB, groEL, and rpoB sequences of housekeeping genes were obtained from the genomes of MEX-01-001-01 (Aguascalientes, Aguascalientes), MEX-14-010-01 (Atitalaquia, Hidalgo), MEX-17-017-01 (Puente de Ixtla, Morelos), MEX-30-184-02 (Tlapacoyan, Veracruz), and MEX-30-193-01 (Veracruz, Veracruz). The gene sequence datasets of Mexican strains were compared to 13 downloaded gene sequence datasets of A. marginale, A. centrale, A. ovis, A. phagocytophilum, and Ehrlichia canis and E. ruminantium (as outgroup), which were obtained from the GenBank database (https://www.ncbi.nlm.nih.gov/) using the nucleotide BLAST suite (https://blast.ncbi.nlm.nih.gov/Blast.cgi). Multiple alignments between all gene sequence datasets were made using the MUSCLE (v3.8.31) program [19]. Alignment sequences per genome were concatenated using a Python script. The jModelTest (v2.1.10) program [20] was used to select the best model of nucleotide substitution using the Akaike information criterion (AIC). Phylogenetic tree was inferred based on a maximum likelihood method using the PhyML (v3.1) program [21] with 1000 bootstrap replicates. The phylogenetic tree was visualized and edited using the FigTree (v1.4.3) program (http://tree.bio.ed.ac.uk/software/figtree/).

2.5. Genome Synteny Analysis

The presence of large-scale evolutionary events, such as rearrangement and inversion of genomic segments, was detected by aligning the ordered contigs of the four Mexican strains and the genomes of Florida, Dawn, Gypsy Plains, Jaboticabal and Palmeira strains (GenBank accession numbers CP001079.1, CP006847.1, CP006846.1, CP023731.1, and CP023730.1, respectively), against the reference genome of A. marginale St. Maries (GenBank accession number CP000030.1), when using the “Align with progressiveMauve” algorithm of the Mauve program (v2.4.0) [22].

3. Results

3.1. Data Availability

The Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession VTSO00000000 (MEX-14-010-01), VTCX00000000 (MEX-17-017-01), VTCY00000000 (MEX-30-184-02), and VTCZ00000000 (MEX-30-193-01).

3.2. Genome Features

We obtained four random datasets of 1,418,888 (MEX-14-010-01); 567,482 (MEX-17-017-01); 671,050 (MEX-30-184-02); and 940,598 (MEX-30-193-01) paired-end reads of 300 bp which were reported in the GenBank database. The number of reads obtained after Trimmomatic and dynamictrim analyses are as follows: MEX-30-184-02, 669,396; MEX-30-193-01, 940,392; MEX-17-017-01, 567,434; and MEX-14-010-01, 1,418,550; the assembly coverage is 27X, 154X, 58X, and 22X, respectively. The GC content for MEX-14-010-01, MEX-17-017-01, and MEX-30-184-02 strains is 49.79% and 49.80% for MEX-30-193-01. The percentage of GC values for Mexican strains is very similar to that for the reference strain A. marginale St. Maries (49.80%). Other genomic characteristics of each strain are shown in Table 1.
Table 1

Genomic statistics of the four draft genomes of Mexican strains.

StrainMEX-14-010-01MEX-17-017-01MEX-30-184-02MEX-30-193-01
Genome size (bp)1,172,3271,172,7161,176,6811,167,111
Contigs46414041
N50 length (bp)85,14765,20361,32865,442
Coverage~22X~58X~27X~154X
G+C content49.7949.7949.7949.80
Genes1,1901,2031,2051,178
CDS1,1501,1631,1651,138
According to the RNAmmer server, all 16S rRNA Mexican strains have a length of 1,491 bp, which have 100% alignment coverage, and between 99 and 100% are identified with the 16S rRNA gene sequence of A. marginale strain St. Maries (GenBank accession CP000030). The number of rRNAs and tRNAs in the four strains is 3 and 37, respectively. The number of protein-coding genes for MEX-14-010-01, MEX-17-017-01, MEX-30-184-02, and MEX-30-193-01 is 1150, 1163, 1165, and 1138, respectively. In Table 2, the information derived from the SEED subsystem of the RAST server for each strain is shown.
Table 2

RAST annotation categories of Mexican strain genomes.

StrainMEX-14-010-01MEX-17-017-01MEX-30-184-02MEX-30-193-01
Cofactors, vitamins, prosthetic groups, pigments124128124120
Cell wall and capsule37373737
Virulence, disease, and defense17181817
Potassium metabolism3333
Miscellaneous4444
Phages, prophages, transposable elements, plasmids1111
Membrane transport13131413
Iron acquisition and metabolism3333
RNA metabolism52505250
Nucleosides and nucleotides42464243
Protein metabolism185189188185
Cell division and cell cycle24242624
Regulation and cell signaling1111
DNA metabolism51524649
Fatty acids, lipids, and isoprenoids54565456
Nitrogen metabolism5555
Dormancy and sporulation1111
Respiration61616069
Stress response26262626
Amino acids and derivatives51515151
Sulfur metabolism4444
Phosphorus metabolism9999
Carbohydrates41434543

3.3. Genomic Comparison and Phylogeny

We compared the four Mexican draft genomes of A. marginale with Brazilian, Australian, and North American strains. In Figure 1, the comparative genomics is shown. Although most of the genomes are highly conserved, the Dawn and Gypsy Plains strains showed some differences from the Mexican, North American, and Brazilian strains. We randomly selected fourteen ORFs found in the genomic annotation predicted as membrane proteins, and then, we located them in the genomes; as observed, most of these proteins are conserved in all genomes (Figure 2).
Figure 1

Phylogeny obtained based on the concatenated sequences of five housekeeping genes (16S rRNA, gyA, gyrB, groEL, and rpoB). The tree shows that Mexican strains of the A. marginale cluster with Brazilian strains, specifically, MEX-01-001-01 and MEX-17-017-01 strains that cluster with A. marginale Jaboticabal. The phylogenetic tree was obtained using the PhyML (v3.1) program with the maximum likelihood method and 1000 bootstrap replicates. GenBank accession numbers of the genomes are shown in square brackets. Bootstrap values are displayed in the nodes.

Figure 2

Circular comparative genomic map between five Mexican strains with six strains of A. marginale from Australia, Brazil, and the United States. The innermost ring (black) represents the A. marginale St. Maries genome (length of 1,197,687 bp). Rings 2-11 represent the A. marginale genomes of Florida, Dawn, Gypsy Plains, Jaboticabal, Palmeira, MEX-01-001-01, MEX-14-010-01, MEX-17-017-01, MEX-30-184-02, and MEX-30-193-01 strains, respectively. The outermost ring highlights 10 genes (red arrows and numbers) that are highly conserved in the genomes. The annotated function on the RAST server of the 10 proteins is shown on the left side.

3.4. Genome Synteny Analysis

The genome synteny of 11 A. marginale genomes of Australian, Brazilian, Mexican, and North American strains shows that the first 100,000 bases have a rearrangement of several small fragments (Figure 3). In addition, the genome synteny of A. marginale shows that the Australian, Brazilian, and North American strains have a highly conserved genome structure, while the genomes of Mexican strains show some rearrangement and inversion of genomic segments (Figure 3). In general, the structure of A. marginale genomes shares a high percentage of coverage and is widely conserved in different geographical regions of the world.
Figure 3

Genome synteny of A. marginale genomes of Australian, Brazilian, Mexican, and North American strains. In general, a highly conserved structure is observed in all genomes, with the exception of Mexican strains that show rearrangements and inversions along the genome sequences.

4. Discussion

So far, only one draft genome of a Mexican strain of A. marginale has been reported [9]. In this work, we present the genomic information of other four strains: MEX-14-010-01, MEX-17-017-01, MEX-30-184-02, and MEX-30-193-01. The genomic analysis reveals that their size (ranging from 1,167,111 bp to 1,176,681 bp) and a GC content (about 49.79%) are very similar to other A. marginale strains reported in GenBank such as the reference genome of the St. Maries strain, with a genome size of 1,197,690 bp and a GC content of 49.80%. The number of Genes and CDS is very similar in the four strains. In fact, in the genome annotation, using the different subsystem classification of RAST server, we identified genes related to cell wall and capsule, virulence, disease and defense, membrane transport, and protein and DNA metabolism, among others. In the virulence, disease, and defense categories, we found genes associated with the cobalt-zinc-cadmium resistance, fluoroquinolone resistance, cooper homeostasis, and beta lactamase. Also, we identified genes of Mycobacterium virulence operon involved in protein synthesis (SSU and LSU ribosomal proteins) and Mycobacterium virulence operon involved in DNA transcription. Mycobacterium operon is present in several species, including Mycobacterium tuberculosis, Streptococcus pneumoniae, Bartonella bovis, and Streptococcus suis, among other animal and plant pathogens [23-25]. In the stress response category, we found genes associated with oxidative stress, cold shock, heat shock, periplasmic stress response, and detoxification. For most of the obligate intracellular bacteria, the presence of peptidoglycan is not necessarily needed to maintain the integrity of the bacterial cell. In A. marginale, there are no reports of the analysis or isolation of its peptidoglycan [26]; however, we identified genes associated with the cell wall and capsule, specifically with the peptidoglycan biosynthesis. An interesting feature of A. marginale genomes is the role of the genes that we found in nitrogen metabolism. In alphaproteobacteria, the role of nitrogen metabolism may be essential for full virulence [27]. The phylogeny analysis indicates that Mexican strains are more related to Brazilian strains than to North American ones. The genomic comparison of the strains reveals the high percent of identity between A. marginale genomes as observed in the genome synteny analysis, where most of the strains are highly conserved in its structure and the Mexican strains have some rearrangements and inversions in certain genomic sequences. The report of four draft genomes of A. marginale found in Mexico represents a first approach to unveil information that could help to develop new strategies for the design of vaccines against bovine anaplasmosis and new diagnostic methods. Still, more genomic analyses are needed to complete the molecular landscape of this pathogen.

5. Conclusions

We present here, the genomic report and analyses of four Mexican strains of A. marginale, the causal agent of bovine anaplasmosis. So far, only one genome of a Mexican strain has been reported; with this contribution, we compare our results with information of strains from the USA, Brazil, and Australia and provide more information of this pathogen.
  27 in total

1.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood.

Authors:  Stéphane Guindon; Olivier Gascuel
Journal:  Syst Biol       Date:  2003-10       Impact factor: 15.683

2.  Mauve: multiple alignment of conserved genomic sequence with rearrangements.

Authors:  Aaron C E Darling; Bob Mau; Frederick R Blattner; Nicole T Perna
Journal:  Genome Res       Date:  2004-07       Impact factor: 9.043

3.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

4.  Identification of Anaplasma marginale outer membrane protein antigens conserved between A. marginale sensu stricto strains and the live A. marginale subsp. centrale vaccine.

Authors:  Joseph T Agnes; Kelly A Brayton; Megan LaFollett; Junzo Norimine; Wendy C Brown; Guy H Palmer
Journal:  Infect Immun       Date:  2010-12-28       Impact factor: 3.441

Review 5.  Brucella, nitrogen and virulence.

Authors:  Severin Ronneau; Simon Moussa; Thibault Barbier; Raquel Conde-Álvarez; Amaia Zuniga-Ripa; Ignacio Moriyon; Jean-Jacques Letesson
Journal:  Crit Rev Microbiol       Date:  2014-12-04       Impact factor: 7.624

6.  QUAST: quality assessment tool for genome assemblies.

Authors:  Alexey Gurevich; Vladislav Saveliev; Nikolay Vyahhi; Glenn Tesler
Journal:  Bioinformatics       Date:  2013-02-19       Impact factor: 6.937

7.  BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons.

Authors:  Nabil-Fareed Alikhan; Nicola K Petty; Nouri L Ben Zakour; Scott A Beatson
Journal:  BMC Genomics       Date:  2011-08-08       Impact factor: 3.969

8.  SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data.

Authors:  Murray P Cox; Daniel A Peterson; Patrick J Biggs
Journal:  BMC Bioinformatics       Date:  2010-09-27       Impact factor: 3.169

9.  Subdominant Outer Membrane Antigens in Anaplasma marginale: Conservation, Antigenicity, and Protective Capacity Using Recombinant Protein.

Authors:  Deirdre R Ducken; Wendy C Brown; Debra C Alperin; Kelly A Brayton; Kathryn E Reif; Joshua E Turse; Guy H Palmer; Susan M Noh
Journal:  PLoS One       Date:  2015-06-16       Impact factor: 3.240

10.  Assembly of a pan-genome from deep sequencing of 910 humans of African descent.

Authors:  Rachel M Sherman; Juliet Forman; Valentin Antonescu; Daniela Puiu; Michelle Daya; Nicholas Rafaels; Meher Preethi Boorgula; Sameer Chavan; Candelaria Vergara; Victor E Ortega; Albert M Levin; Celeste Eng; Maria Yazdanbakhsh; James G Wilson; Javier Marrugo; Leslie A Lange; L Keoki Williams; Harold Watson; Lorraine B Ware; Christopher O Olopade; Olufunmilayo Olopade; Ricardo R Oliveira; Carole Ober; Dan L Nicolae; Deborah A Meyers; Alvaro Mayorga; Jennifer Knight-Madden; Tina Hartert; Nadia N Hansel; Marilyn G Foreman; Jean G Ford; Mezbah U Faruque; Georgia M Dunston; Luis Caraballo; Esteban G Burchard; Eugene R Bleecker; Maria I Araujo; Edwin F Herrera-Paz; Monica Campbell; Cassandra Foster; Margaret A Taub; Terri H Beaty; Ingo Ruczinski; Rasika A Mathias; Kathleen C Barnes; Steven L Salzberg
Journal:  Nat Genet       Date:  2018-11-19       Impact factor: 38.330

View more
  1 in total

1.  Mexican Strains of Anaplasma marginale: A First Comparative Genomics and Phylogeographic Analysis.

Authors:  Edgar Dantán-González; Rosa Estela Quiroz-Castañeda; Hugo Aguilar-Díaz; Itzel Amaro-Estrada; Fernando Martínez-Ocampo; Sergio Rodríguez-Camarillo
Journal:  Pathogens       Date:  2022-08-02
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.