Literature DB >> 32022853

The Genome of Setaria digitata: A Cattle Nematode Closely Related to Human Filarial Parasites.

Kanchana S Senanayake1, Jonas Söderberg2, Aleksei Põlajev2, Maja Malmberg2,3, Eric H Karunanayake1, Kamani H Tennekoon1, Sameera R Samarakoon1, Erik Bongcam-Rudloff2, Adnan Niazi2.   

Abstract

Here we present the draft genome sequence of Setaria digitata, a parasitic nematode affecting cattle. Due to its similarity to Wuchereria bancrofti, the parasitic nematode that causes lymphatic filariasis in humans, S. digitata has been used as a model organism at the genomic level to find drug targets which can be used for the development of novel drugs and/or vaccines for human filariasis. Setaria digitata causes cerebrospinal nematodiasis in goats, sheep, and horses posing a serious threat to livestock in developing countries. The genome sequence of S. digitata will assist in finding candidate genes to use as drug targets in both S. digitata and W. bancrofti. The assembled draft genome is ∼90 Mb long and contains 8,974 genomic scaffolds with a G+C content of 31.73%.
© The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  zzm321990 Setaria digitatazzm321990 ; cattle parasite; cerebrospinal nematodiasis; filariasis; genome; helminth; setariosis

Year:  2020        PMID: 32022853      PMCID: PMC7061230          DOI: 10.1093/gbe/evaa017

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

Setaria digitata is a parasitic nematode found in the peritoneal cavity of cattle. Although nonpathogenic to their natural hosts, there have been instances where the larvae were observed in the cerebrospinal cavity (Tung et al. 2003), causing fatal paralysis. Transmission of infective larvae to aberrant hosts such as goats, sheep, cows and horses can cause a serious and often fatal neuropathological disorder commonly identified as cerebrospinal setariosis (Mohanty et al. 2000; Tung et al. 2003; Bazargani et al. 2008; Kaur et al. 2015; Shin et al. 2017). Due to its similarity to Wuchereria bancrofti with respect to morphology and histology (Decruse and Raj 1990) as well as antigenic properties (Dissanayake and Ismail 1980), S. digitata has been used as a model organism for the study of W. bancrofti in view of developing a vaccine against lymphatic filariasis (Madathiparambil et al. 2011; Perumal et al. 2016). A few genes from S. digitata have been characterized with the aim of identifying possible drug targets for W. bancrofti (Murugananthan et al. 2010; Rodrigo et al. 2013; Nagaratnam et al. 2014; Rodrigo et al. 2014). Although W. bancrofti is the major causative organism for lymphatic filariasis accounting for 90% of the cases (WHO 1992) and an estimated 120 million people in 72 countries were infected in 2010 (Dissanayake and Ismail 1980), very little is known about the molecular biology, biochemistry and immune mechanisms of this parasite. The necessity of a model organism to study the biology of W. bancrofti has arisen because W. bancrofti grow in human lymph vessels and are therefore not easily isolated, and cannot be cultured in laboratory conditions. This is an impediment for testing new treatments for lymphatic filariasis in vitro (Murugananthan et al. 2010). Because the draft genome of the W. bancrofti is already publicly available (Small et al. 2019), the S. digitata genome will greatly facilitate comparative studies at the genetic level between the two nematodes and allow identification of genes that can be used as drug targets, which can then be tested in the laboratory. Furthermore, availability of the S. digitata genome will enable drug development and vaccine production to eliminate or control S. digitata infections in abnormal hosts which cause serious economic loss in places where sheep and goat farming is a common livelihood (Tung et al. 2003; Nakano et al. 2007; Bazargani et al. 2008).

Materials and Methods

Collection of Adult S. digitata Worms

Adult S. digitata were collected from freshly slaughtered cattle at a nearby abattoir and immediately transported to the laboratory in sterile Hank’s Buffered Salt Solution (Fisher Scientific, UK, Gibco Cat# 15420614). Adult worms were washed six times in sterile phosphate buffered saline (PBS) and used for genomic DNA extraction.

Extraction of Genomic DNA

Genomic DNA of adult S. digitata worms was extracted by phenol/chloroform extraction method and ethanol precipitated as described earlier (Nayak et al. 2012). Briefly, adult worms were lysed in 500 μl of lysis buffer (Tris–HCl 20 mM, ethylenediaminetetraacetic acid 50 mM, pH 8.0, SDS 0.5%, NaCl 100 mM, β mercaptoethanol 1%, v/v) containing 0.1 mg/ml proteinase K and incubated at 37 °C for 2 h. Then RNase (5 μg/ml) treatment was carried out for 2 h at 56 °C and DNA was isolated by phenolchloroform–isoamyl alcohol extraction and ethanol precipitation. Total DNA amounts were quantified using the Qubit (Version 2.0) (ThermoFisherScientific, Waltham, MA) using the dsDNA assay. In parallel the integrity of the DNA was checked by gel electrophoresis on a 1.5% agarose gel for a qualitative assessment of DNA integrity.

Genome Sequencing

Sequencing libraries were constructed from the extracted DNA using the NexteraXT kit (Illumina, San Diego, CA) with small modifications; 10 cycles of amplification were used instead of 12, the samples were indexed as 10 different samples, libraries were normalized manually based on concentration measurements from Agilent High Sensitivity DNA Kit (2100 Bioanalyzer, Agilent Technologies, Palo Alto, CA). After library preparation the samples were pooled and sequencing was performed using the v3 600 cycles kit on the Illumina MiSeq platform (Illumina) at National Veterinary Institute in Uppsala.

Genome Assembly and Annotation

The raw sequences were first checked for adapters with Nesoni v0.13 (https://github.com/Victorian-Bioinformatics-Consortium/nesoni, last accessed February 11, 2020), trimmed at Q30 using prinseq v0.20.4 (http://prinseq.sourceforge.net, last accessed February 11, 2020), and checked for bacterial contamination with RAMBO-K v1.21 (Tausch et al. 2015) against the Refseq bacterial genomes to filter any remnants of contamination. These decontaminated sequences were assembled using SPAdes v3.6 (Bankevich et al. 2012) and corrected for local misassemblies and small INDELs using Pilon v1.13 (Walker et al. 2014). The genome annotation was performed using MAKER2 (Holt and Yandell 2011) in a two-step annotation workflow. Briefly, GeneMark-ES (Lomsadze et al. 2005) and CEGMA (Parra et al. 2007) were used to produce HMM profiles from the assembly. MAKER2 was then run a first time with these HMM profiles. The first annotation results were themselves converted into HMM profiles, and MAKER2 was run a second time using those updated HMM profiles. Annotation completeness was analyzed using CEGMA v2.5 (Parra et al. 2007) and BUSCO v2.0 (Simao et al. 2015). Protein sequences of the predicted genes were searched against the Swiss-Prot and TrEMBL databases (release 2019_07) using BlastP program (e-value ≤ 10−5). BLAST hits with query coverage <40% were filtered out. Functional annotation and classification of the annotated genes was performed using eggNOG mapper (Huerta-Cepas et al. 2016). Proteinortho v5.16 (Lechner et al. 2011) was used to find orthologous genes in the genomes with 30% identity, 50% coverage, and e-value ≤ 10−5.

Phylogenetic Analysis

The genomes of 40 nematodes were downloaded including 12 genomes from NCBI and 28 from WormBase version WS250 (Harris et al. 2010). At first, we identified “complete” BUSCO genes across the genomes of 41 nematodes using the Arthropoda database provided by BUSCO. Complete BUSCOs that were only present in <80% genomes were filtered out. Next, we aligned protein sequences of each BUSCO gene using MAFFT version 7.4 alignment program (Katoh et al. 2002). The resulting alignments were trimmed for spurious sequences or poorly aligned regions using trimAl version 1.4.1 (Capella-Gutierrez et al. 2009) with the “-automated1” set of parameters. A matrix of aligned sequences was created by concatenating all the trimmed alignments. To infer a species tree, the matrix was provided to IQ-tree software version 1.6.9 (Nguyen et al. 2015), run with ultrafast bootstrap replicates (N = 1,000) to determine the support for bipartitions and internode certainty.

Results and Discussion

Genome Assembly and Gene Annotation

In total 4 Gb of DNA sequence data were generated which is made up of 14,370,809 paired-end reads with a maximum read length of 301 bp and a minimum read length of 35 bp with a 44x raw sequence coverage. The GC content of the raw sequences was 34%. The genome was assembled into 8,974 scaffolds which is 89.88 Mb in length, with L50 of 882 bp and an overall G + C content of 31.7%. The genome assembly covered over 87% of the total genome size, estimated using k-mer counts from the sequencing data. Additional genomic characteristics are shown in Table 1. The size and the GC content of the S. digitata genome were similar to Loa loa and Brugia malayi which cause filariasis in humans. Moreover, the number of predicted genes estimated among those filarial nematode genomes was comparable.
Table 1

Summary of Assembly Statistics of the Setaria digitata Genome and Published Genomes of Several Nematodes That Cause Filariasis in Humans

Genomic Features Setaria digitata a Loa loa b Brugia malayi c Wuchereria bancrofti d
Sequencing technologyIllumina454WGSe454
DNA (Mb)89.891.393.688.4
DNA coding (Mb)19.1915.5312.7814.2
G+C content (%)31.7330.9730.2128.8
DNA scaffolds8,9745,76424,2865,105
N50 (bp)24,961174,38841,30856,670
L50 (bp)882130226351
Avg. N’s per 100 kb5.144210.817035.7546.14
Protein-coding genes20,56815,44011,46011,068
Avg. gene length (bp)3073298928154307
Avg. exon per gene8.66.87.159.2
Avg. exon length (bp)109164158141
Genes known in Swiss-Prot6,0096,1575,2635,855
Genes with signal peptides711
Genes with TM helices3,646
tRNA17412497

PRJEB13338 (present study).

PRJNA37757.

PRJNA27801.

PRJNA275548.

Whole-genome shotgun.

Summary of Assembly Statistics of the Setaria digitata Genome and Published Genomes of Several Nematodes That Cause Filariasis in Humans PRJEB13338 (present study). PRJNA37757. PRJNA27801. PRJNA275548. Whole-genome shotgun. Gene prediction with MAKER2 annotation tool identified 20,568 protein-coding genes in the assembled genome (Table 1). The quality of the gene model data set was analyzed using the Swiss-Prot, KEGG, and Pfam databases. Putative function was assigned to 6,009 protein-coding genes based on the Swiss-Prot curated annotations, whereas 11,541 of the total genes were found in the TrEMBL database. In total, 8,560 (41.6%) of annotated genes have well-defined PFAM protein domains. Moreover, KEGG terms were assigned to 20% of the predicted genes. In total, the annotated regions comprise 21% of the genome with an average of 8.6 exons per gene and mean transcript length of 3,073 bp. We estimated the repeat content up to ∼13.5% of the total genome. However, repeats annotation analysis could detect only 4.2% of the repetitive sequences, of which 2.9% were simple repeats. Furthermore, the quality of S. digitata genome annotation was assessed using CEGMA and BUSCO pipelines. Both CEGMA and BUSCO used conserved gene sets comprising 248 and 982 genes, respectively, to analyze the completeness of the genome. The analysis showed a high level of genome completeness (CEGMA 91.5%; BUSCO 85.5%) in the genome. Because BUSCO comprised a larger set of conserved genes compared with CEGMA, only the results of BUSCO analysis are shown in Table 2.
Table 2

BUSCO Statistics of the Completeness of the Genome Based on 982 Nematode-Conserved Genes

BUSCOGenes PresentPercentage (%)
Complete BUSCOs (C)83985.5
Complete and single-copy BUSCOs (S)80582
Complete and duplicated BUSCOs (D)343.5
Fragmented BUSCOs (F)969.8
Missing BUSCOs (M)474.7
BUSCO Statistics of the Completeness of the Genome Based on 982 Nematode-Conserved Genes

Comparative Genomics and Phylogenetic Analysis

Comparative genomics of S. digitata with other filariasis causing parasitic nematodes L. loa, B. malayi, and W. bancrofti, revealed that 8,369 (40%) of S. digitata protein-coding genes were orthologous among the genomes, of which 4,643 genes were single-copy orthologs (supplementary table 1, Supplementary Material online). Setaria digitata shared 7,493 genes with L. loa, 7,070 with W. bancrofti, and 6,612 with B. malayi. In total, 5,087 genes were shared among the four nematodes. In addition, the functional classification of protein-coding genes classified 9,768 genes into different COGs (Cluster of Orthologs Groups) (fig. 1, supplementary table 2, Supplementary Material online). Of the entire S. digitata gene set, 4,315 genes had KEGG orthologs linked to 375 KEGG biological pathways (supplementary table 3, Supplementary Material online).
. 1.

—(A) Classification of protein-coding genes in COG functional groups. (B–D) Correlation of Setaria digitata proteins length with other nematode species. (B) Setaria digitata with Loa loa, (C) S. digitata with Brugia malayi, (D) S. digitata with Wuchereria bancrofti. The Pearson’s correlation coefficients are calculated and shown on each plot.

—(A) Classification of protein-coding genes in COG functional groups. (B–D) Correlation of Setaria digitata proteins length with other nematode species. (B) Setaria digitata with Loa loa, (C) S. digitata with Brugia malayi, (D) S. digitata with Wuchereria bancrofti. The Pearson’s correlation coefficients are calculated and shown on each plot. A majority of filarial parasites, for instance, W. bancrofti, harbor an endobacterium, Wolbachia. Loss of Wolbachia in Wolbachia-dependent worm hampers growth and fertility of the host. Unlike many filarial species, both S. digitata and L. loa are devoid of Wolbachia endosymbiont (Voronin et al. 2015). However, the surface structure of S. digitata is similar to that of W. bancrofti (Madathiparambil et al. 2011). BLAST-based search of the assembled S. digitata genome against Wolbachia protein sequences did not reveal any large transfers of Wolbachia DNA. Protein length correlation analysis of the single-copy orthologs showed a high level of correlation between the species (fig. 1). Setaria digitata showed similar correlation (R = 0.9) with L. loa, W. bancrofti, and B. malayi. Strong correlations with the protein-coding genes of W. bancrofti and L. loa indicate coverage of protein-coding sequences identified in S. digitata is of considerably higher quality. Thus, genomic comparisons will greatly facilitate the identification of genes involved in development and potential drug targets. To examine the evolution of filarial parasites in the context of other nematodes, we estimated a phylogeny across 41 nematode genomes available in NCBI and WormBase databases and S. digitata genome assembly created in this study (fig. 2). Maximum-Likelihood tree was generated based on 507 BUSCO genes present in at least 80% (n ≥ 32) genomes. The tree was similar to 12S rDNA-based tree published earlier (Yatawara et al. 2007). Although the genus Caenorhabditis formed well-supported monophyletic group, S. digitata was grouped with the clade mainly consists of Onchocerca and Brugia species.
. 2.

—Phylogenetic analysis of the nematode genomes. Consensus tree between 41 nematode genomes, which includes the Setaria digitata assembly. Total 118,531 parsimony-informative sites out of 184,820 aligned positions were used to generate the tree under a single LG + G4 model. To the right of each node are bootstrap support values.

—Phylogenetic analysis of the nematode genomes. Consensus tree between 41 nematode genomes, which includes the Setaria digitata assembly. Total 118,531 parsimony-informative sites out of 184,820 aligned positions were used to generate the tree under a single LG + G4 model. To the right of each node are bootstrap support values.

Conclusion

Here we present the genome sequence of S. digitata, a parasitic nematode found in the peritoneal cavity of cattle. Comparative genomic analysis of S. digitata with W. bancrofti, L. loa, and B. malayi genomes revealed similarity in several genomic features such as genome size, GC and gene content. The genome of S. digitata presented here is comparable with the genomic architecture of other filarial parasites. Moreover, its morphological similarities to other filarial parasites such as W. bancrofti, and its better accessibility, support the use of S. digitata as a model organism for finding effective treatment for lymphatic filariasis. The availability of the genome of S. digitata will be a critical resource to not only elucidate the biology of filarial worms, but also benefit programs aimed at the treatment and elimination of these parasites. This draft genome will allow a more extensive manual curation of the annotation of the genome of S. digitata. Comparative analysis with the existent W. bancrofti draft genome is expected to lead to the identification of genes susceptible to new candidate drugs in both species that can be ultimately tested using S. digitata worms in laboratory conditions. Click here for additional data file.
  27 in total

1.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

Authors:  Kazutaka Katoh; Kazuharu Misawa; Kei-ichi Kuma; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

2.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

3.  CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes.

Authors:  Genis Parra; Keith Bradnam; Ian Korf
Journal:  Bioinformatics       Date:  2007-03-01       Impact factor: 6.937

4.  Heterologous expression of uncharacterized parasitic nematode-specific growth factor-like protein of Setaria digitata in Pichia pastoris expression systems.

Authors:  W W Rodrigo; R S Dassanayake; S J Weerasena; E H Karunanayake
Journal:  Trop Biomed       Date:  2013-06       Impact factor: 0.623

5.  Setaria digitata infections in cattle: parasite load, microfilaraemia status and relationship to immune response.

Authors:  M C Mohanty; P K Sahoo; A K Satapathy; B Ravindran
Journal:  J Helminthol       Date:  2000-12       Impact factor: 2.170

6.  Antigen of Setaria digitata: cross-reaction with surface antigens of Wuchereria bancrofti microfilariae and serum antibodies of W. bancrofti-infected subjects.

Authors:  S Dissanayake; M M Ismail
Journal:  Bull World Health Organ       Date:  1980       Impact factor: 9.408

7.  Cerebrospinal setariosis with Setaria marshalli and Setaria digitata infection in cattle.

Authors:  Kwong-Chung Tung; Cheng-Hung Lai; Hong-Kean Ooi; Chen-Hsiung Yang; Jiunn-Shiow Wang
Journal:  J Vet Med Sci       Date:  2003-09       Impact factor: 1.267

8.  Proteinortho: detection of (co-)orthologs in large-scale analysis.

Authors:  Marcus Lechner; Sven Findeiss; Lydia Steiner; Manja Marz; Peter F Stadler; Sonja J Prohaska
Journal:  BMC Bioinformatics       Date:  2011-04-28       Impact factor: 3.169

9.  Gene identification in novel eukaryotic genomes by self-training algorithm.

Authors:  Alexandre Lomsadze; Vardges Ter-Hovhannisyan; Yury O Chernoff; Mark Borodovsky
Journal:  Nucleic Acids Res       Date:  2005-11-28       Impact factor: 16.971

10.  RAMBO-K: Rapid and Sensitive Removal of Background Sequences from Next Generation Sequencing Data.

Authors:  Simon H Tausch; Bernhard Y Renard; Andreas Nitsche; Piotr Wojciech Dabrowski
Journal:  PLoS One       Date:  2015-09-17       Impact factor: 3.240

View more
  1 in total

1.  Detection of DNA of filariae closely related to Mansonella perstans in faecal samples from wild non-human primates from Cameroon and Gabon.

Authors:  Colette Marie Gaillard; Sebastien David Pion; Hadjira Hamou; Constant Sirima; Charlotte Bizet; Thomas Lemarcis; Jules Rodrigues; Amandine Esteban; Martine Peeters; Eitel Mpoudi Ngole; Illich Mombo; Florian Liégeois; Coralie Martin; Michel Boussinesq; Sabrina Locatelli
Journal:  Parasit Vectors       Date:  2020-06-16       Impact factor: 3.876

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.