Literature DB >> 27660670

Genome sequence of Candidatus Arsenophonus lipopteni, the exclusive symbiont of a blood sucking fly Lipoptena cervi (Diptera: Hippoboscidae).

Eva Nováková1, Václav Hypša1, Petr Nguyen2, Filip Husník1, Alistair C Darby3.   

Abstract

Candidatus Arsenophonus lipopteni (Enterobacteriaceae, Gammaproteobacteria) is an obligate intracellular symbiont of the blood feeding deer ked, Lipoptena cervi (Diptera: Hippoboscidae). The bacteria reside in specialized cells derived from host gut epithelia (bacteriocytes) forming a compact symbiotic organ (bacteriome). Compared to the closely related complex symbiotic system in the sheep ked, involving four bacterial species, Lipoptena cervi appears to maintain its symbiosis exclusively with Ca. Arsenophonus lipopteni. The genome of 836,724 bp and 24.8 % GC content codes for 667 predicted functional genes and bears the common characteristics of sequence economization coupled with obligate host-dependent lifestyle, e.g. reduced number of RNA genes along with the rRNA operon split, and strongly reduced metabolic capacity. Particularly, biosynthetic capacity for B vitamins possibly supplementing the host diet is highly compromised in Ca. Arsenophonus lipopteni. The gene sets are complete only for riboflavin (B2), pyridoxine (B6) and biotin (B7) implying the content of some B vitamins, e.g. thiamin, in the deer blood might be sufficient for the insect metabolic needs. The phylogenetic position within the spectrum of known Arsenophonus genomes and fundamental genomic features of Ca. Arsenophonus lipopteni indicate the obligate character of this symbiosis and its independent origin within Hippoboscidae.

Entities:  

Keywords:  Arsenophonus; Hippoboscidae; Symbiosis; Tsetse

Year:  2016        PMID: 27660670      PMCID: PMC5027103          DOI: 10.1186/s40793-016-0195-1

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

Symbiosis has for long been recognized as one of the crucial drivers of evolution. In insects, numerous symbiotic relationships, mainly with bacteria, enabled the hosts to exploit various environments and/or life strategies, and supposedly started adaptive radiations in some groups. The mechanisms of such evolutionary processes include for example contribution to the host immunity, modification of the reproductive strategy, or provision of essential compounds to the hosts relying on nutritionally compromised resources. Blood feeding (hematophagous) insects provide an illuminating example of a life strategy shift coupled with symbiosis. Since blood meal lacks some of the B vitamins, hematophagous insects rely on their supply by symbiotic bacteria. The relationships between bacteria and hematophagous insect displays considerable degree of variability spanning from less intimate associations with entire gut microbial community, e.g. triatomine bugs [1, 2], to highly specialized interactions with few or single obligate symbiont(s), e.g. lice, bed bugs, tsetse flies, louse flies and bat flies [3-7]. With the recent advancement of genomic approaches and genetic manipulations, symbioses in these insect groups, often important disease vectors, have become of a high interest. Here we describe fundamental biological characteristics and genome properties of the obligate symbiont of a deer ked, (Hippoboscidae). In comparison to multipartite symbiotic systems of closely related hosts from families Hippoboscidae (i.e. [7]) and Glossinidae (i.e. Glossina sp. [6]), harbors a single unaccompanied obligate symbiont from the genus . The genome of Candidatus Arsenophonus lipopteni has been sequenced for two reasons. The first was to extend our knowledge on occurrence and genomics of the obligate symbionts across the spectrum of hematophagous hosts involved in strictly bilateral symbiosis, e.g. bed bugs [5], head lice [4], leaches [8, 9]. This is a necessary prerequisite for the future analysis of the origins and evolution of this kind of symbioses. In addition, we intend to use the sequence in a broader comparative framework focused on evolution of bacterial symbiosis, particularly on its role in B vitamin provision to various ecological types of the hosts.

Organism information

Classification and features

Ca. has an obligate association with its host, , and is therefore uncultivable. In order to localize the bacteria within the host, Fluorescent In Situ Hybridization and Transmission Electron Microscopy was performed on dissected gut tissue as described in detail in [7]. For FISH, the tissue was fixed and hybridized in tubes with eubacterial (EUB338, Flc-GCTGCCTCCCGTAGGA; [10]) and Ca. specific probes (ArL, Cy3-CTGACTAACGCTTGCACC; this study). The later was designed in a variable region of 16S rRNA gene taking the target sequence accessibility into account [11]. The distribution of Ca. (Fig. 1) in the host body closely resembles that of Ca. melophagi and T, the obligate symbionts of the blood sucking flies and Glossina sp., respectively [7]. Highly pleomorphic cells of the Gram negative non-sporulating bacteria from the family are primarily found in the modified part of the gut wall (bacteriome) formed by the specialized enlarged epithelial cells (bacteriocytes, Fig. 1c, 1d). Additional key features of Ca. are provided as a standardized summary in Table 1.
Fig. 1

Visualization of the bacteria in the host tissue using FISH and TEM (D). The symbiotic organ (bacteriome) localized in the midgut section harboring Ca. Arsenophonus lipopteni targeted with green (Flc) labeled eubacterial probe (a) and red (Cy3) labeled specific probe (b). Detail of the host cells (bacteriocytes) filled with the symbionts (c). The blue signal is DAPI stained DNA. Four cells of Ca. Arsenophonus lipopteni under TEM (d). The white arrow points to bacterial outer membrane and the red bordered arrow shows the cytoplasmatic cell membrane

Table 1

Classification and general features of Ca. Arsenophpnus lipopteni

MIGS IDPropertyTermEvidence codea
ClassificationDomain Bacteria TAS[33]
Phylum Proteobacteria TAS[34]
Class Gammaproteobacteria TAS[35]
Order “EnterobacterialesTAS[36]
Family Enterobacteriaceae TAS[37]
Genus Arsenophonus TAS[38]
Species Ca. Arsenophonus lipopteni IDA
Strain: CBIDA
Gram stainNegativeTAS[38]
Cell shapePleomorphicNAS
MotilityNon-motileTAS[38]
SporulationNon-sporulatingTAS[38]
Temperature rangeNot determinedIDA
Optimum temperatureNot determinedIDA
pH range; OptimumNot determinedIDA
Carbon sourceNot determinedIDA
MIGS-6HabitatInsect host; bacteriome of L. cervi IDA
MIGS-6.3SalinityNot determinedIDA
MIGS-22Oxygen requirementNot determinedIDA
MIGS-15Biotic relationshipSymbioticIDA
MIGS-14PathogenicityNon-pathogenNAS
MIGS-4Geographic locationCeske Budejovice, Czech RepublicIDA
MIGS-5Sample collection dateJune 2010IDA
MIGS-4.1Longitude14.43IDA
MIGS-4.2Latitude48.97IDA
MIGS-4.4Altitude399 mIDA

aEvidence codes, IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [39]

Visualization of the bacteria in the host tissue using FISH and TEM (D). The symbiotic organ (bacteriome) localized in the midgut section harboring Ca. Arsenophonus lipopteni targeted with green (Flc) labeled eubacterial probe (a) and red (Cy3) labeled specific probe (b). Detail of the host cells (bacteriocytes) filled with the symbionts (c). The blue signal is DAPI stained DNA. Four cells of Ca. Arsenophonus lipopteni under TEM (d). The white arrow points to bacterial outer membrane and the red bordered arrow shows the cytoplasmatic cell membrane Classification and general features of Ca. Arsenophpnus lipopteni aEvidence codes, IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [39] Apart from the functional characterization, the genome sequence of Ca. was also utilized to assess the relationship of this bacterium to other symbionts. Since the sequence compositional shift compromises phylogenetic usage of 16S rDNA, leading to topological artifacts with long branched symbiotic taxa clustering together [12], we carried out a phylogenetic analysis of a multi-gene matrix and used advanced Bayesian approaches. The matrix was generated for all available genomes (incl. Ca.), five other symbionts, eight non-symbiotic members of , and two outgroups. A set of 70 orthologous genes was determined as an intersection of COGs shared by these bacteria (generated using the MicrobesOnline database; [13]) with “SICO” gene list [14]. The genes were retrieved from the finished assembly using Blastp searches [15] and processed as described previously [7]. The resulting matrix contained 22618 unequivocally aligned positions. Phylobayes [16], a tool specifically developed to overcome the difficulty with heterogeneous composition of sequences, was used for the tree reconstruction. The analysis was run in 2 chains under the GTR + CAT model with amino acids recoded according to the Dayhoff6 option. When the convergence was not reached after 20,000 cycles, the program was stopped and majority rule consensus was calculated after discarding 4,000 cycles burn-in. The results confirm Ca. membership in the genus . All species (including Ca. Rieisa pediculicola) formed a well-supported monophyletic branch clustering as a sister group to (Fig. 2). Despite the length of branches for the obligate symbionts with highly modified genomes, this arrangement was assigned a high posterior probability. Although the six included lineages certainly do not form a monophyletic group within the known spectrum [17], the results indicate that Ca. evolved independently from Ca. melophagi housed in related Hippoboscidae host [7].
Fig. 2

Phylogenomic reconstruction of Ca. Arsenophonus lipopteni position. The length of the double crossed branches was scaled to 1:4. The numbers indicate posterior probability for each node. The “N-S” in the brackets following the taxon name designates the non-symbiotic bacteria included into the dataset

Phylogenomic reconstruction of Ca. Arsenophonus lipopteni position. The length of the double crossed branches was scaled to 1:4. The numbers indicate posterior probability for each node. The “N-S” in the brackets following the taxon name designates the non-symbiotic bacteria included into the dataset

Genome sequencing information

Genome project history

The host specimens were collected from wild populations during summer 2010 in the Czech Republic. Finished genome sequence has been deposited in GenBank under acc. No. CP013920 on January 11, 2016. A summary on the sequencing project is provided in Table 2.
Table 2

Project information

MIGS IDPropertyTerm
MIGS 31Finishing qualityFinished
MIGS-28Libraries used2 × 100 bp paired end
MIGS 29Sequencing platformsIllumina
MIGS 31.2Fold coverage40
MIGS 30AssemblersA5
MIGS 32Gene calling methodRAST, PGAP, PROKKA
Locus TagAUT07
Genbank IDCP013920
Genbank Date of Release1-25-2016
GOLD IDGp0127464
BIOPROJECTPRJNA306001
MIGS 13Source Material IdentifierHost tissue
Project relevanceEvolution of bacterial symbiosis
Project information

Growth conditions and genomic DNA preparation

Since the bacterium is uncultivable, the host tissue was used for DNA extraction. The gut tissue containing the symbiotic organs were dissected from 6 flies in 1× phosphate buffered saline, homogenized with a sterile mortar and pestle and extracted using QiaAmp DNA Micro Kit (QIAGEN, United Kingdom). The DNA quality was assessed using Agilent 2100 Bioanalyzer (Agilent Technologies).

Genome sequencing and assembly

The paired end 100 bp long reads were generated on one lane of Illumina HiSeq2000 run at Yale Center for Genome Analysis. A5 assembly pipeline with the default settings was used to assemble the reads [18]. Of the 109,640 resulting contigs, the longest contig (836,730 bp) with 40× fold coverage formed a circular molecule with 99 bp overlap at the ends. This contig corresponds to the Ca. genome. Pilon v1.12 [19] was used to check assembly quality and to improve base calls and small indels.

Genome annotation

The finished genome was annotated using a combination of following tools: RAST [20], PGAP, and Prokka v1.10 [21]. The annotation was then manually curated and checked for the presence of gene remnants. The final annotation is available in GenBank (CP013920). Metabolic pathways were reconstructed in the RAST server [20] and gene absence was verified using BlastP searches. Proteins were assigned to the clusters of orthologous groups using COGnitor [22], and the presence of signal peptides was detected using SignalP [23]. Pfam domains were predicted using HMMER [24] against the Pfam-A database [25]. Transmembrane predictions were done using TMHMM Server v. 2.0. The search for CRISPR repeats was performed in Geneious [26].

Genome properties

The finished genome consists of 836,724 nucleotides in a single circular chomosome with a low GC content of 24.9 %. The total number of predicted functional genes (667) relative to the genome size implies a lower coding density (75.8 %). The average gene length of 1,001 bp however does not suggest that the genome underwent an extreme economization typical for the obligate symbionts, e.g. str. Cc, Ca., Ca., Ca. [27]. Over 99 % of protein coding genes have been assigned to particular COGs and Pfam domains (Tables 3 and 4). Signal peptides and transmembrane helices have been identified for 5 and 124 protein coding genes respectively (Table 3). The noncoding RNA genes consist of tmRNA, RNAseP, Alpha RBS, cspA, 35 tRNAs, and 3 rRNA genes (altogether 42 RNA genes). The three ribosomal genes are however not organized into a single operon, a phenomenon previously described for at least 9 unrelated bacterial clades, including gammaproteobacterial symbionts of the genus and Candidatus Blochmannia, and attributed to their host-dependent lifestyle [28].
Table 3

Statistics for finished genome assebly of Ca. Arsenophonus lipopteni

AttributeValue% of Totala
Genome size (bp)836,724100.00
DNA coding (bp)633,82275.80
DNA G + C (bp)208,10324.90
DNA scaffolds1100.00
Total genes683100.00
Protein coding genes62591.50
RNA genes426.10
Pseudo genes162.30
Genes assigned to COGs62299.52
Genes assigned Pfam domains625100.00
Genes with signal peptides50.80
Genes with transmembrane helices12419.8.00
CRISPR repeats00.00

aThe total is based on either the size of the genome in bp or the total number of genes

Table 4

Number of protein coding genes assigned to the COG categories

Cat. codeValuePercentage of totalDescription
J12820.48Translation, ribosomal structure and biogenesis
A10.16Processing and modification
K172.72Transcription
L406.4Replication, recombination and repair
B00Chromatin structure and dynamics
D152.4Cell cycle control, cell division, chromosome partitioning
Y00Nuclear structure
V71.12Defense mechanisms
T101.6Signal transduction mechanisms
M7011.2Cell wall/membrane biogenesis
N10.16Cell motility
Z00Cytoskeleton
W00Extracellular structures
U121.92Intracellular trafficking and secretion
O426.72Posttranslational modification, protein turnover, chaperones
C223.52Energy production and conversion
G264.16Carbohydrate transport and metabolism
E345.44Aminoacid transport and metabolism
F243.84Nucleotide transport and metabolism
H497.84Coenzyme transport and metabolism
I264.16Lipid transport and metabolism
P243.84Inorganic ion transport and metabolism
Q10.16Secondary metabolites biosynthesis, transport and catabolism
R91.44General function prediction only
S132.08Function unknown
-518.16Assigned to more than one category
-30.48Not in COGs
Statistics for finished genome assebly of Ca. Arsenophonus lipopteni aThe total is based on either the size of the genome in bp or the total number of genes Number of protein coding genes assigned to the COG categories The genome properties described above coupled with 16 pseudogenes identified in the genome suggest rather recent establishment of the obligate symbiosis resulting in significant but recent gene/function loss without removal of presently non-coding regions. Regarding the coding capacity for B vitamins and related cofactors, the genome of Ca. appears to be highly economized. Similar to Ca. melophagi, the bacteria cannot synthesize thiamine (B1), niacin (B3), panthothenic acid (B5) and folic acid (B9). In addition, the genome does not code for heme biosynthesis. Other basic genome characteristics are summarized in Table 3.

Conclusions

Compared to the closely related complex symbiotic system in the sheep ked, , appears to maintain symbiosis exclusively with Ca.. The growing number of genome sequences available for the symbionts and the hematophagous hosts involved in strictly bilateral symbiosis (e.g. [29, 30]) will help elucidating some common requirements on B vitamins, or possibly highlight diverse needs of insects digesting blood of various vertebrates. Ca. possesses complete gene sets for biosynthesis of three B vitamins, riboflavin (B2), pyridoxine (B6) and biotin (B7). While the metabolic capacity is directly assessed from genomic data, the presence of any vitamin efflux systems cannot be easily elucidated due to yet poorly understood mechanisms for vitamin export [31]. However, based on recent findings from other hematophagous systems, it has become more clear that the nutritional interaction does not rely on biosynthesis of all B vitamins as originally suggested by Puchta [32]. For instance, similar to all the other genomes, biosynthetic capacity for thiamin is compromised in Ca.. The genome however possesses ABC thiamin transporter genes (thiP, thiQ, tbpA) implying the content of thiamin or thiamin pyrophosphate, compared to e.g. biotin or riboflavin, in the host blood might be sufficient for the insect metabolic needs (Novakova, unpublished data). Within the spectrum of known genomes ranging from 0.57 Mb of Ca. to 3.5 Mb of , representing various symbiotic types, the genomic sequence of Ca. clearly reflects characteristics common for obligate mutualists. Furthermore, the phylogenetic reconstruction suggests an independent origin of this obligate association within Hippoboscidae.
  34 in total

Review 1.  Extreme genome reduction in symbiotic bacteria.

Authors:  John P McCutcheon; Nancy A Moran
Journal:  Nat Rev Microbiol       Date:  2011-11-08       Impact factor: 60.633

2.  SignalP 4.0: discriminating signal peptides from transmembrane regions.

Authors:  Thomas Nordahl Petersen; Søren Brunak; Gunnar von Heijne; Henrik Nielsen
Journal:  Nat Methods       Date:  2011-09-29       Impact factor: 28.547

3.  Molecular evidence for polyphyletic origin of the primary symbionts of sucking lice (phthiraptera, anoplura).

Authors:  Václav Hypsa; Jaroslav Krízek
Journal:  Microb Ecol       Date:  2007-03-08       Impact factor: 4.552

4.  Prokka: rapid prokaryotic genome annotation.

Authors:  Torsten Seemann
Journal:  Bioinformatics       Date:  2014-03-18       Impact factor: 6.937

Review 5.  Bacterial symbionts of the triatominae and their potential use in control of Chagas disease transmission.

Authors:  C Ben Beard; Celia Cordon-Rosales; Ravi V Durvasula
Journal:  Annu Rev Entomol       Date:  2002       Impact factor: 19.686

Review 6.  Bacterial symbioses of the medicinal leech Hirudo verbana.

Authors:  Michael C Nelson; Joerg Graf
Journal:  Gut Microbes       Date:  2012-05-10

7.  Genome sequence of the tsetse fly (Glossina morsitans): vector of African trypanosomiasis.

Authors: 
Journal:  Science       Date:  2014-04-25       Impact factor: 47.728

8.  Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data.

Authors:  Matthew Kearse; Richard Moir; Amy Wilson; Steven Stones-Havas; Matthew Cheung; Shane Sturrock; Simon Buxton; Alex Cooper; Sidney Markowitz; Chris Duran; Tobias Thierer; Bruce Ashton; Peter Meintjes; Alexei Drummond
Journal:  Bioinformatics       Date:  2012-04-27       Impact factor: 6.937

9.  Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement.

Authors:  Bruce J Walker; Thomas Abeel; Terrance Shea; Margaret Priest; Amr Abouelliel; Sharadha Sakthikumar; Christina A Cuomo; Qiandong Zeng; Jennifer Wortman; Sarah K Young; Ashlee M Earl
Journal:  PLoS One       Date:  2014-11-19       Impact factor: 3.240

10.  MicrobesOnline: an integrated portal for comparative and functional genomics.

Authors:  Paramvir S Dehal; Marcin P Joachimiak; Morgan N Price; John T Bates; Jason K Baumohl; Dylan Chivian; Greg D Friedland; Katherine H Huang; Keith Keller; Pavel S Novichkov; Inna L Dubchak; Eric J Alm; Adam P Arkin
Journal:  Nucleic Acids Res       Date:  2009-11-11       Impact factor: 16.971

View more
  6 in total

1.  Skipping the Insect Vector: Plant Stolon Transmission of the Phytopathogen 'Ca. Phlomobacter fragariae' from the Arsenophonus Clade of Insect Endosymbionts.

Authors:  Jessica Dittmer; Thierry Lusseau; Xavier Foissac; Franco Faoro
Journal:  Insects       Date:  2021-01-22       Impact factor: 2.769

2.  Arsenophonus and Sodalis replacements shape evolution of symbiosis in louse flies.

Authors:  Eva Šochová; Filip Husník; Eva Nováková; Ali Halajian; Václav Hypša
Journal:  PeerJ       Date:  2017-12-11       Impact factor: 2.984

3.  Predictive Genomic Analyses Inform the Basis for Vitamin Metabolism and Provisioning in Bacteria-Arthropod Endosymbioses.

Authors:  Laura R Serbus; Brian Garcia Rodriguez; Zinat Sharmin; A J M Zehadee Momtaz; Steen Christensen
Journal:  G3 (Bethesda)       Date:  2017-06-07       Impact factor: 3.154

4.  Microbiomes of North American Triatominae: The Grounds for Chagas Disease Epidemiology.

Authors:  Sonia M Rodríguez-Ruano; Veronika Škochová; Ryan O M Rego; Justin O Schmidt; Walter Roachell; Václav Hypša; Eva Nováková
Journal:  Front Microbiol       Date:  2018-06-13       Impact factor: 5.640

5.  To B or Not to B: Comparative Genomics Suggests Arsenophonus as a Source of B Vitamins in Whiteflies.

Authors:  Diego Santos-Garcia; Ksenia Juravel; Shiri Freilich; Einat Zchori-Fein; Amparo Latorre; Andrés Moya; Shai Morin; Francisco J Silva
Journal:  Front Microbiol       Date:  2018-09-25       Impact factor: 5.640

6.  Insect-Symbiont Gene Expression in the Midgut Bacteriocytes of a Blood-Sucking Parasite.

Authors:  Filip Husnik; Vaclav Hypsa; Alistair Darby
Journal:  Genome Biol Evol       Date:  2020-04-01       Impact factor: 3.416

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.