| Literature DB >> 30976027 |
Emilie Lefoulon1, Natalie Vaisman2,3, Horacio M Frydman2,4, Luo Sun5, Lise Voland5, Jeremy M Foster5, Barton E Slatko5.
Abstract
Symbiosis is a major force of evolutionary change, influencing virtually all aspects of biology, from population ecology and evolution to genomics and molecular/biochemical mechanisms of development and reproduction. A remarkable example is Wolbachia endobacteria, present in some parasitic nematodes and many arthropod species. Acquisition of genomic data from diverse Wolbachia clades will aid in the elucidation of the different symbiotic mechanisms(s). However, challenges of de novo assembly of Wolbachia genomes include the presence in the sample of host DNA: nematode/vertebrate or insect. We designed biotinylated probes to capture large fragments of Wolbachia DNA for sequencing using PacBio technology (LEFT-SEQ: Large Enriched Fragment Targeted Sequencing). LEFT-SEQ was used to capture and sequence four Wolbachia genomes: the filarial nematode Brugia malayi, wBm, (21-fold enrichment), Drosophila mauritiana flies (2 isolates), wMau (11-fold enrichment), and Aedes albopictus mosquitoes, wAlbB (200-fold enrichment). LEFT-SEQ resulted in complete genomes for wBm and for wMau. For wBm, 18 single-nucleotide polymorphisms (SNPs), relative to the wBm reference, were identified and confirmed by PCR. A limit of LEFT-SEQ is illustrated by the wAlbB genome, characterized by a very high level of insertion sequences elements (ISs) and DNA repeats, for which only a 20-contig draft assembly was achieved.Entities:
Mesh:
Year: 2019 PMID: 30976027 PMCID: PMC6459864 DOI: 10.1038/s41598-019-42454-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Workflow overview of LEFT-SEQ (Large Enriched Fragment Targeted Sequencing) library preparation.
Figure 2Overview of the bioinformatics pipeline.
Information of produced genomes.
| Number of SMRT cell | 2 | 3 | 3 (barcoded) | 3 (barcoded) | 2 | 12 |
| Number of reads | 40,241 | 76,216 | 45,984 | 28,840 | 81,233 | 290,028 |
| Number of contigs | 3 | 1 | 1 | 1 | 42 | 20 |
| Size of the largest contig | 416,288 | 1,080,939 | 1,273,588 | 1,273,527 | 186,312 | 249,386 |
| Total length (bp) | 1,082,170 | 1,080,939 | 1,273,588 | 1,273,527 | 1,466,139 | 1,492,731 |
| N50 | 415,815 | 1,080,939 | 1,273,588 | 1,273,527 | 54,550 | 145,461 |
| L50 | 2 | 1 | 1 | 1 | 7 | 4 |
| Number of reads mapped to | 23,742 | 45,107 | NA | NA | 61,930 | 246,276 |
| % reads mapped to | 59 | 59 | NA | NA | 76 | 84.9 |
| Number of reads mapped to produced | 23,733 | 45,103 | 44,734 | 27,842 | 71,203 | 249,032 |
| Number of reads mapped to produced | 59 | 59 | 97.28 | 96.5 | 87 | 85.8 |
| Average depth | 52X | 78X | 71X | 44X | 75X | 266X |
| bases with coverage <20X | 35,393 | 1,586 | 28,128 | 63,096 | 129,224 | 24,200 |
| % bases with coverage <20X | 3.2 | 0.14 | 2.2 | 4.9 | 8.5 | 1.6 |
Summary of de novo assembly using canu processed in the current study (statistics using QUAST[36]). Lines 1–7, summary statistics; lines 8 to 11, summary of mapping using ngmlr; lines 12 to 14, coverage statistics across the produced genomes using the SAMtools depth[38]. Abbreviations: bp: bases pair; wb: Wolbachia; pop: population.
Figure 3Evaluation of LEFT-SEQ enrichment method. The percentage of the reads mapped across different reference or draft genomes is reported for the three different samples without (pale grey) or with the enrichment method (dark grey). Host and Wolbachia symbionts are indicated with animal symbols.
Figure 4Box-and-whisker plot showing insert size for the three samples with the LEFT-SEQ method and the unenriched control. The different samples are indicated with color (blue for B. malayi, yellow for D. mauritiana and red for A. albopictus) and symbol. Small circles are outlier values. Additional statistics are indicated above the boxplot: number of analyzed reads, the median and the mean.