
Genome Skimming: A Rapid Approach to Gaining Diverse Biological Insights into Multicellular Pathogens.

Dee R Denver, Amanda M V Brown, Dana K Howe, Amy B Peetz, Inga A Zasada.

Year:  2016        PMID: 27490201      PMCID: PMC4973915          DOI: 10.1371/journal.ppat.1005713

Source DB:  PubMed          Journal:  PLoS Pathog        ISSN: 1553-7366            Impact factor:   6.823



Introduction

Genomic data acquisition is now trivial for biologists. Yet, moving from millions of sequence reads to an assembled and annotated genome continues to pose a daunting challenge. The first animal genome sequenced arose from the free-living model nematode Caenorhabditis elegans [1]. This venture provided an unprecedented foundation for new insights into genome function and ‘omics tool development. However, the C. elegans endeavor has been tough to repeat, even with the advent of new high-throughput DNA sequencing technologies. For example, the first plant-parasitic nematode (PPN) genomes were published ten years after the C. elegans genome [2,3], and only five publication-quality PPN genomes are presently available [4-6]. Fig 1 overviews the course of a typical genome project. Millions of DNA sequences are initially collected in a matter of days, thanks to new DNA sequencing technologies. Early analytical phases (quality control and initial assembly) are also quick and usually straightforward. However, the subsequent computational stages (refining the assembly, gene prediction, and annotation) present significant bioinformatics bottlenecks. These lengthy in silico steps require multiple iterative stages of analysis, finally leading to a finished genome deemed “good enough” for publication. These latter stages often take years.
Fig 1

Genome skimming schematic.

Boxes progressing diagonally from top left to bottom right show steps typical of conventional genome projects. Grey boxes show steps shared by genome skimming and conventional genome projects. Red boxes, arrows, and Xs show conventional genome project steps eliminated in the genome skimming approach. Green boxes show analyses specific to our genome skimming strategy.

The term “genome skimming” was recently coined [7-9] to describe shallow sequencing approaches aiming to uncover conserved ortholog sequences for phylogenomic studies. Here, we overview a genome skimming strategy applied to six PPN species but expand the scope beyond phylogenetics and toward diverse questions relating to pathogen function and biology. We demonstrate our strategy’s utility in rapidly revealing insights and new hypotheses relating to nematode genome structure, effector genes, and endosymbionts.

Genome Assembly Results

We applied our genome skimming strategy (Fig 1; see S1 Text) to six PPN species: Anguina agrostis, Globodera ellingtonae, Pratylenchus neglectus, P. penetrans, P. thornei, and Xiphinema americanum. Five of these species are in the “top ten” list of nematode plant pathogens [10]. Our approach begins like most genome projects by creating a single unrefined assembly for each PPN that provides a reference set of sequences for subsequent study. The lengthy downstream bioinformatics steps of typical genome projects, however, were simply not done. After completing single-pass assemblies, we examined the basic properties of the assembled contigs (Table 1). Assemblies yielded between ~10,000 and ~50,000 contigs per PPN, with average n-fold DNA sequence coverage values ranging from 7.7X to 30.4X. With an average coarse genome size estimate of 107.1 Mb and average GC content of 40.5%, these six PPN assemblies are consistent with known nematode genome size ranges [11,12]. We note that our smallest estimate (38.5 Mb) came from X. americanum, whose relative in the family Longidoridae, Longidorus kuiperi, also has a small genome size estimate of 56.5 Mb [13]. The N50 statistic, a common length-weighted measure of contig size (the contig length at or above which half of the assembled bases reside; see S1 Text for more detail), averaged 8,863 bp across the six PPN species analyzed. Since nematode genes average ~2–3 kb in length [1,11,12], the contigs resulting from our single-pass assembly are sufficiently long to be useful database resources for BLAST [14].
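Table 1

Genome skimming summary information and effector gene hits.

Aa, Anguina agrostis; Ge, Globodera ellingtonae; Pn, Pratylenchus neglectus; Pp, P. penetrans; Pt, P. thornei; Xa, Xiphinema americanum.

Genomics Summary

| | Aa | Ge | Pn | Pp | Pt | Xa |
|---|---|---|---|---|---|---|
| Number of nematodes | 17,000 | 37,000 | 48,000 | 14,700 | 79,000 | 1,000 |
| μg DNA yield | 1.6 | 5.4 | 9.0 | 1.459.4 (Pp and Pt values fused in source) | | 1.5 |
| Number of reads | 9,133,652 | 10,453,612 | 11,109,554 | 10,653,645 | 8,517,724 | 7,937,548 |
| Bases sequenced (Gbp) | 2.5 | 2.8 | 3.0 | 2.9 | 2.3 | 2.2 |
| Insert size (bp, mean +/- SD) | 525 +/- 114 | 530 +/- 99 | 552 +/- 130 | 496 +/- 153 | 560 +/- 57 | 556 +/- 78 |
| Maximum RAM for assembly (Gb) | 60.1 | 57.7 | 60.8 | 94.7 | 53.8 | 57.0 |
| Assembly time (min) | 16 | 18 | 26 | 27 | 24 | 18 |
| % of reads assembled | 61.8 | 61.5 | 69.2 | 16.1 | 61.7 | 32.6 |
| Number of contigs | 35,380 | 18,033 | 13,212 | 37,555 | 47,845 | 31,176 |
| Contig lengths sum (Mbp) | 154.2 | 100.8 | 129.8 | 56.3 | 163.2 | 38.5 |
| N50 (bp) | 7,409 | 11,355 | 26,618 | 1,309 | 5,673 | 936 |
| Largest contig (bp) | 97,848 | 172,336 | 333,542 | 39,629 | 105,621 | 48,513 |
| Average coverage (X) | 9.76 | 30.4 | 15.1 | 7.7 | 8.6 | 16.3 |
| % G+C | 39.0 | 36.7 | 43.9 | 38.5 | 40.0 | 44.6 |

Effector Gene Hits

| Effector | Hits (one + per species with a hit) |
|---|---|
| Annexin | ++++++ |
| β-1,4-Endoglucanase | ++++++ |
| Cellulose Binding Protein | + |
| Chorismate Mutase | + |
| Fatty Acid & Retinol Binding Protein | +++++ |
| Peroxiredoxin | ++++++ |
| Pectate Lyase | +++ |
| SPRYSEC | + |
| Transthyretin-like Protein | +++++ |
| Venom-like Allergen Protein | +++++ |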
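The summary rows of Table 1 (number of contigs, contig length sum, N50, largest contig, % G+C) can be computed directly from a set of assembled contigs. The following is a minimal Python sketch under stated assumptions, not the pipeline used in this study (see S1 Text for that); the function names and the toy contigs are illustrative and do not come from the paper.

```python
from typing import Dict, List


def n50(lengths: List[int]) -> int:
    """Return the N50: the contig length L such that contigs of
    length >= L together hold at least half of all assembled bases."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0  # empty input


def gc_percent(sequences: List[str]) -> float:
    """Percent G+C over unambiguous bases (A, C, G, T) only."""
    gc = at = 0
    for seq in sequences:
        s = seq.upper()
        gc += s.count("G") + s.count("C")
        at += s.count("A") + s.count("T")
    return 100.0 * gc / (gc + at) if (gc + at) else 0.0


def summarize(contigs: Dict[str, str]) -> Dict[str, float]:
    """Collect the per-assembly summary values for a name -> sequence map."""
    lengths = [len(seq) for seq in contigs.values()]
    return {
        "num_contigs": len(lengths),
        "length_sum_bp": sum(lengths),
        "n50_bp": n50(lengths),
        "largest_bp": max(lengths, default=0),
        "gc_percent": gc_percent(list(contigs.values())),
    }


# Toy contigs, made up for illustration (not data from the study).
toy = {"c1": "ATGCGC" * 5, "c2": "ATAT" * 5, "c3": "GGCC" * 2, "c4": "AT"}
stats = summarize(toy)
```

With the toy input, the four contigs total 60 bp and the single 30 bp contig already holds half of the bases, so the N50 is 30 bp, compared with a mean contig length of 15 bp.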

Characterizing Genomic Variation

Early genome sequencing initiatives focused on model organisms such as C. elegans, in which sequenced DNA came from highly inbred lab populations. Modern pathogen genomics, however, often requires analysis of natural populations in which numerous factors can lead to deviations from the genomic uniformity of an inbred lab culture. For example, pathogens may display population-level genetic variation, within-individual heterozygosity, and other deviations (e.g., polyploidy or interspecies hybridization). These pose potential challenges but also opportunities for discovery. Interspecies hybridization and associated genome admixture is of increasing relevance to natural parasite populations [15]. Meloidogyne incognita, the world’s most devastating PPN species, evolved through between-species hybridization, as evidenced by recent phylogenomic analyses and the complex ploidy state of its nuclear genome [2,16]. The extent of hybridization among PPN species, however, remains unclear. We developed a simple BLASTN-based method to quickly screen for evidence of genomic variation, using a list of 65 conserved single-copy orthologs found in the genomes of C. elegans and G. rostochiensis (S1 Table) and our PPN genome assemblies. G. rostochiensis orthologs were used as queries against our G. ellingtonae contig database; single hits were found for all orthologs in the latter species, suggesting a high degree of genomic uniformity in the sample sequenced for this species. For the other 5 PPN species, however, more variable results were observed (Fig 2A). The median number of homologs detected per ortholog was 1 for two species (A. agrostis, X. americanum), with small variances in copy number among the 65 genes (0.56 for A. agrostis, 1.25 for X. americanum). This small variation likely reflects some small genetic variation among the nematodes sequenced and/or the occurrence of lineage-specific duplicates for some of the orthologs.
The median number of homologs detected per ortholog was 2 for all three Pratylenchus species. For the P. penetrans sample, it was known that nematodes from many field populations were combined in the sample used for the Illumina run, and thus, this genomic diversity is reflected in the high variance in ortholog copy number calculated for this species (4.46). The sequenced DNA samples for P. neglectus and P. thornei, however, each came from single nematode populations. The variances for these two species (0.67 and 0.95, respectively) were similar to those calculated for A. agrostis and X. americanum. The median value of two copies per ortholog for P. neglectus and P. thornei, combined with their low variance, suggests possible tetraploidy in these species. This hypothesis is supported by cytological evidence collected nearly 50 years ago [17] suggesting tetraploidy for P. neglectus and diploidy for P. penetrans.
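The copy-number screen described above reduces to counting hits per query ortholog and summarizing those counts. The sketch below shows one way this could be done in Python, assuming BLASTN results in the standard 12-column tabular format (`-outfmt 6`). The e-value cutoff, the choice to count distinct contigs as copies, and the use of the sample variance are assumptions made for illustration; the study's actual criteria are described in S1 Text.

```python
import statistics
from collections import defaultdict
from typing import Dict, Iterable, List


def copies_per_ortholog(blast_lines: Iterable[str], queries: List[str],
                        max_evalue: float = 1e-10) -> Dict[str, int]:
    """Count distinct contigs hit by each query ortholog.

    Expects BLAST tabular rows (-outfmt 6): qseqid, sseqid, ..., with the
    e-value in column 11. Orthologs without a passing hit get a count of 0.
    """
    hits = defaultdict(set)
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        query, subject, evalue = fields[0], fields[1], float(fields[10])
        if evalue <= max_evalue:  # hypothetical cutoff
            hits[query].add(subject)
    return {q: len(hits[q]) for q in queries}


# Toy BLAST rows, made up for illustration (not data from the study).
toy_rows = [
    "orthoA\tcontig1\t95.0\t300\t15\t0\t1\t300\t10\t309\t1e-80\t400",
    "orthoA\tcontig7\t93.0\t300\t21\t0\t1\t300\t50\t349\t1e-70\t380",
    "orthoA\tcontig7\t90.0\t120\t12\t0\t1\t120\t900\t1019\t1e-20\t150",
    "orthoB\tcontig2\t96.0\t250\t10\t0\t1\t250\t5\t254\t1e-75\t390",
    "orthoC\tcontig3\t80.0\t60\t12\t0\t1\t60\t5\t64\t0.5\t30",
]
counts = copies_per_ortholog(toy_rows, ["orthoA", "orthoB", "orthoC"])
values = list(counts.values())
median_copies = statistics.median(values)
variance = statistics.variance(values)  # sample variance (an assumption)
```

Counting distinct contigs is only a proxy for copy number: two copies on one contig are counted once, and a single gene split across two contigs is counted twice, so the resulting medians and variances are best read as a screen rather than a measurement.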
Fig 2

Box and blob plots.

(A) Box plots reporting results for numbers of homologs detected for 65 highly conserved orthologs in 5 PPN species analyzed. Results for G. ellingtonae are not included because this species was found to encode a single homolog for all 65 orthologs. (B) and (C) Blob plot results for X. americanum and P. penetrans, respectively. Colors indicate BLAST matches to different species of bacteria.


Finding Effector Genes

Discovery and functional characterization of effector genes, whose products directly engage in attacks on host defenses, is a central aim of any pathogen genome project. Protein sequences for 10 effectors, well characterized in other PPN species (S2 Table), were used as TBLASTN queries to screen our PPN contig databases for homologous matches. Our search revealed 42 matches (out of 60 possible) distributed across the PPN genomes (Table 1). As expected, more hits were observed in the 5 tylenchid PPN species analyzed (ranging from 6 to 8) compared to the very distantly related X. americanum, in which only 3 hits were observed. These 3 genes (annexin, β-1,4-endoglucanase, peroxiredoxin) were found in all 5 of the other species studied; a previous study revealed evidence for an expressed endoglucanase effector in X. index [18], a congener of X. americanum. The 3 X. americanum hit e-values (averaging 7.1 E-30) and hit lengths (averaging 459 bp) were larger and shorter, respectively, than the averages for these 3 genes in the other 5 species (1.0 E-42, 632 bp). The addition of a simple single BLAST step to our genome skimming strategy quickly revealed the presence of numerous putative effector genes in the PPN species, though follow-up experimentation and analysis remain necessary to evaluate whether or not bona fide effectors are encoded by the DNA sequences identified.
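The effector screen above amounts to turning per-species TBLASTN hits into a presence/absence matrix like the lower part of Table 1. A minimal Python sketch follows, assuming the hits have already been parsed into (effector, e-value) pairs. The e-value cutoff, the species labels `sp1` and `sp2`, and the toy hits are hypothetical and are not taken from the study.

```python
from typing import Dict, Iterable, List, Tuple

Hit = Tuple[str, float]  # (effector name, e-value)


def presence_matrix(hits_by_species: Dict[str, Iterable[Hit]],
                    effectors: List[str],
                    max_evalue: float = 1e-5) -> Dict[str, Dict[str, bool]]:
    """Return effector -> species -> True if any hit passes the cutoff."""
    matrix = {e: {sp: False for sp in hits_by_species} for e in effectors}
    for species, hits in hits_by_species.items():
        for effector, evalue in hits:
            if effector in matrix and evalue <= max_evalue:
                matrix[effector][species] = True
    return matrix


def render(matrix: Dict[str, Dict[str, bool]], species: List[str]) -> str:
    """Format the matrix as tab-separated text with '+' marking a hit."""
    lines = ["\t".join(["Effector"] + species)]
    for effector, row in matrix.items():
        cells = ["+" if row[sp] else "" for sp in species]
        lines.append("\t".join([effector] + cells))
    return "\n".join(lines)


# Toy hits, made up for illustration (not data from the study).
effectors = ["annexin", "chorismate mutase"]
toy_hits = {
    "sp1": [("annexin", 1e-40), ("chorismate mutase", 1e-12)],
    "sp2": [("annexin", 1e-30), ("chorismate mutase", 0.3)],
}
matrix = presence_matrix(toy_hits, effectors)
hits_per_species = {sp: sum(matrix[e][sp] for e in effectors) for sp in toy_hits}
total_hits = sum(hits_per_species.values())
table_text = render(matrix, ["sp1", "sp2"])
```

A "+" in such a matrix records only that a sequence similar to the query was found; as noted above, it does not establish that the matched sequence encodes a functional effector.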

Discovering Endosymbionts

Bacterial endosymbionts, such as Wolbachia spp., are well-known and widespread components of diverse arthropods. Genome sequencing efforts in filarial nematode species revealed the presence of Wolbachia, which functions as an obligate mutualist in these pathogens of animals and humans [19,20]. We combined “Blob plot” approaches [21] with BLAST to uncover bacterial genomes associated with our PPN species. For the X. americanum analysis, evidence for its known endosymbiont Xiphinematobacter sp. [22] was observed as expected (Fig 2B). This genome-skimming result led to the hypothesis that the contigs in this blob constituted the Xiphinematobacter sp. genome. Follow-up bioinformatics, functional genomics, and fluorescence in situ hybridization (FISH) microscopy work supported this hypothesis and suggested that the endosymbiont functions as a nutritional mutualist with its nematode host [23]. A second interesting case was P. penetrans, in which 1,593 contigs matched bacterial DNA of diverse origins. Although many of these sequences had high %GC and were likely environmental contaminants (Fig 2C), two bacterial blobs of higher %AT contained contigs matching DNA of the known endosymbionts Wolbachia sp. and Cardinium sp. The only PPN previously reported to harbor Wolbachia is Radopholus similis [24]. A P. penetrans contig matched the 16S rDNA gene for Wolbachia in R. similis at 98% identity. Further bioinformatic and FISH work is underway to validate and build upon these initial endosymbiosis hypotheses arising from the P. penetrans genome skimming data.
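A blob plot places each contig by its GC content and read coverage and colors it by the taxon of its best BLAST match, so that contigs from a symbiont or contaminant cluster apart from the host. The Python sketch below builds the underlying per-contig table and a per-taxon summary. It is an illustration under stated assumptions and not the Blobology software [21]: the inputs (contig sequences, per-contig coverage, and a contig-to-taxon map from BLAST) are assumed to be precomputed, and all names and toy values are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, NamedTuple


class BlobPoint(NamedTuple):
    contig: str
    gc_percent: float
    coverage: float
    taxon: str


def gc_percent(seq: str) -> float:
    """Percent G+C over unambiguous bases (A, C, G, T) only."""
    s = seq.upper()
    gc = s.count("G") + s.count("C")
    acgt = gc + s.count("A") + s.count("T")
    return 100.0 * gc / acgt if acgt else 0.0


def blob_points(contigs: Dict[str, str], coverage: Dict[str, float],
                taxon_hits: Dict[str, str],
                default_taxon: str = "no hit") -> List[BlobPoint]:
    """One point per contig: GC%, coverage, and best-hit taxon label."""
    return [
        BlobPoint(name, gc_percent(seq), coverage.get(name, 0.0),
                  taxon_hits.get(name, default_taxon))
        for name, seq in contigs.items()
    ]


def summarize_by_taxon(points: List[BlobPoint]) -> Dict[str, Dict[str, float]]:
    """Contig count, mean GC%, and mean coverage for each taxon label."""
    groups = defaultdict(list)
    for p in points:
        groups[p.taxon].append(p)
    return {
        taxon: {
            "n_contigs": len(ps),
            "mean_gc": sum(p.gc_percent for p in ps) / len(ps),
            "mean_cov": sum(p.coverage for p in ps) / len(ps),
        }
        for taxon, ps in groups.items()
    }


# Toy inputs, made up for illustration (not data from the study).
contigs = {"c1": "ATATATGC", "c2": "GCGCGCAT", "c3": "GGCCGGCC", "c4": "ATATATAT"}
coverage = {"c1": 10.0, "c2": 3.0, "c3": 5.0, "c4": 40.0}
taxon_hits = {"c2": "Bacteria", "c3": "Bacteria"}
points = blob_points(contigs, coverage, taxon_hits)
by_taxon = summarize_by_taxon(points)
```

Plotting `gc_percent` against `coverage` for these points, colored by `taxon`, would give the scatter that Fig 2B and 2C summarize; the grouping step is kept separate so the same table can feed either a plot or a quick text summary.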

Conclusions

Genome skimming provides a rapid and affordable avenue for biological inquiry and hypothesis generation that avoids the time delays that accompany most genomic endeavors. A single-pass assembly followed by BLAST-based and other simple analyses revealed evidence for potential genomic hybridization, effector genes, and endosymbionts in the PPN genomes studied. Although genome skimming provides an effective approach to hypothesis generation, follow-up work remains necessary for hypothesis evaluation. Genome skimming alone will not suffice for biological questions requiring gene prediction and annotation (e.g., patterns of gene family expansion, instances of horizontal gene transfer). Nonetheless, our genome skimming pilot experiment provided quick and exciting biological insights and community genomic resources, essentially doubling the number of PPN species for which published genome sequence resources are available. How might our understanding of nematode pathogens change if genome skimming were applied to 600 PPN species instead of 6?

S1 Text. Materials and Methods. (PDF)

S1 Table. Conserved Orthologs Used in Genomic Variation Analysis. (DOCX)

S2 Table. Effector Protein Sequences Used in BLAST Analysis. (DOCX)

1.  Occurrence of novel verrucomicrobial species, endosymbiotic and associated with parthenogenesis in Xiphinema americanum-group species (Nematoda, Longidoridae).

Authors:  T T Vandekerckhove; A Willems; M Gillis; A Coomans
Journal:  Int J Syst Evol Microbiol       Date:  2000-11       Impact factor: 2.747

2.  Sequence and genetic map of Meloidogyne hapla: A compact nematode genome for plant parasitism.

Authors:  Charles H Opperman; David M Bird; Valerie M Williamson; Dan S Rokhsar; Mark Burke; Jonathan Cohn; John Cromer; Steve Diener; Jim Gajan; Steve Graham; T D Houfek; Qingli Liu; Therese Mitros; Jennifer Schaff; Reenah Schaffer; Elizabeth Scholl; Bryon R Sosinski; Varghese P Thomas; Eric Windham
Journal:  Proc Natl Acad Sci U S A       Date:  2008-09-22       Impact factor: 11.205

3.  Genome skimming by shotgun sequencing helps resolve the phylogeny of a pantropical tree family.

Authors:  Pierre-Jean G Malé; Léa Bardon; Guillaume Besnard; Eric Coissac; Frédéric Delsuc; Julien Engel; Emeline Lhuillier; Caroline Scotti-Saintagne; Alexandra Tinaut; Jérôme Chave
Journal:  Mol Ecol Resour       Date:  2014-04-02       Impact factor: 7.090

Review 4.  Genome sequence of the nematode C. elegans: a platform for investigating biology.

Authors: 
Journal:  Science       Date:  1998-12-11       Impact factor: 47.728

5.  Genomic insights into the origin of parasitism in the emerging plant pathogen Bursaphelenchus xylophilus.

Authors:  Taisei Kikuchi; James A Cotton; Jonathan J Dalzell; Koichi Hasegawa; Natsumi Kanzaki; Paul McVeigh; Takuma Takanashi; Isheng J Tsai; Samuel A Assefa; Peter J A Cock; Thomas Dan Otto; Martin Hunt; Adam J Reid; Alejandro Sanchez-Flores; Kazuko Tsuchihara; Toshiro Yokoi; Mattias C Larsson; Johji Miwa; Aaron G Maule; Norio Sahashi; John T Jones; Matthew Berriman
Journal:  PLoS Pathog       Date:  2011-09-01       Impact factor: 6.823

6.  The Wolbachia genome of Brugia malayi: endosymbiont evolution within a human pathogenic nematode.

Authors:  Jeremy Foster; Mehul Ganatra; Ibrahim Kamal; Jennifer Ware; Kira Makarova; Natalia Ivanova; Anamitra Bhattacharyya; Vinayak Kapatral; Sanjay Kumar; Janos Posfai; Tamas Vincze; Jessica Ingram; Laurie Moran; Alla Lapidus; Marina Omelchenko; Nikos Kyrpides; Elodie Ghedin; Shiliang Wang; Eugene Goltsman; Victor Joukov; Olga Ostrovskaya; Kiryl Tsukerman; Mikhail Mazur; Donald Comb; Eugene Koonin; Barton Slatko
Journal:  PLoS Biol       Date:  2005-03-29       Impact factor: 8.029

Review 7.  Hybridization in Parasites: Consequences for Adaptive Evolution, Pathogenesis, and Public Health in a Changing World.

Authors:  Kayla C King; Rike B Stelkens; Joanne P Webster; Deborah F Smith; Michael A Brockhurst
Journal:  PLoS Pathog       Date:  2015-09-03       Impact factor: 6.823

8.  Blobology: exploring raw genome data for contaminants, symbionts and parasites using taxon-annotated GC-coverage plots.

Authors:  Sujai Kumar; Martin Jones; Georgios Koutsovoulos; Michael Clarke; Mark Blaxter
Journal:  Front Genet       Date:  2013-11-29       Impact factor: 4.599

9.  The genome and life-stage specific transcriptomes of Globodera pallida elucidate key aspects of plant parasitism by a cyst nematode.

Authors:  James A Cotton; Catherine J Lilley; Laura M Jones; Taisei Kikuchi; Adam J Reid; Peter Thorpe; Isheng J Tsai; Helen Beasley; Vivian Blok; Peter J A Cock; Sebastian Eves-van den Akker; Nancy Holroyd; Martin Hunt; Sophie Mantelin; Hardeep Naghra; Arnab Pain; Juan E Palomares-Rius; Magdalena Zarowiecki; Matthew Berriman; John T Jones; Peter E Urwin
Journal:  Genome Biol       Date:  2014-03-03       Impact factor: 13.583

10.  The complex hybrid origins of the root knot nematodes revealed through comparative genomics.

Authors:  David H Lunt; Sujai Kumar; Georgios Koutsovoulos; Mark L Blaxter
Journal:  PeerJ       Date:  2014-05-06       Impact factor: 2.984
