Literature DB >> 32616635

Nearly Complete Genome Sequence of Brugia pahangi FR3.

John Mattick1, Silvia Libro2, Benjamin C Sparklin1, Matthew Chung1, Robin E Bromley1, Suvarna Nadendla1, Xuechu Zhao1, Sandra Ott1, Lisa Sadzewicz1, Luke J Tallon1, Michelle L Michalski3, Jeremy M Foster2, Julie C Dunning Hotopp4,5,6.   

Abstract

Brugia pahangi is a zoonotic parasite that is closely related to human-infecting filarial nematodes. Here, we report the nearly complete genome of Brugia pahangi, including assemblies of four autosomes and an X chromosome, with only seven gaps. The Y chromosome is still not completely assembled.
Copyright © 2020 Mattick et al.

Entities:  

Year:  2020        PMID: 32616635      PMCID: PMC7330237          DOI: 10.1128/MRA.00479-20

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

We sequenced the zoonotic parasitic roundworm Brugia pahangi FR3, which was originally obtained from a green leaf monkey in Kuala Lumpur and is distributed by the NIAID Filariasis Research Reagent Resource Center, known as FR3 (1). The 107,643,863 Illumina HiSeq 2500 paired-end 150-bp reads were generated from KAPA Hyper libraries using Brugia pahangi FR3 genomic DNA acquired from BEI Resources. Genomic DNA for PacBio RS II sequencing was obtained from 16 adult male and 16 adult female frozen worms obtained from FR3, which were homogenized in Qiagen G2 buffer with RNase A, and purified using Qiagen gravity-flow Genomic-tips with 80 U of proteinase K and DNA precipitation by centrifugation in the presence of 20 μg of glycogen. PacBio RS II data (P6C4 chemistry; 3,267,281 reads; average read length, 8,695 bp; N50, 25.4 kbp; maximum read length, 139 kbp) were generated from Sage Blue Pippin size-selected (>15 kbp) SMRTbell v1.0 libraries constructed using Covaris gTUBE-fragmented DNA. For Oxford Nanopore Technologies (ONT) sequencing, 185 adult female worms acquired from TRS Labs (Athens, GA, USA) were ground in liquid nitrogen, and DNA was extracted with a single phenol-chloroform DNA extraction, with spooling from an ethanol precipitation. Rapid libraries were constructed (SQK-RAD004) four times, from 4 μg, 1.8 μg, 0.9 μg, and 0.2 μg DNA. The latter three libraries were sequenced with an R9.4 MinION flow cell (FLO-MIN106), replacing the loading beads with water. The 4-μg library was modified using 1.5 μl of DNA fragmentation mixture and 3.5 μl of 10 mM Tris-Cl (pH 8.0) and 0.02% Triton X-100 (2) and was sequenced with a different R9.4 MinION flow cell, compared with the previous runs. This collectively resulted in 727,358 reads, which were called using Guppy v3.1.5 (average read length, 10,109 bp; N50, 22 kbp; maximum read length, 828 kbp). Illumina reads after quality control and trimming with FastQC v0.11.7 (3) and Trimmomatic v0.38 (4), respectively, and untrimmed PacBio and MinION reads were assembled with Canu v1.8 (5) multiple times but using different subsets of MinION experiments (excluding and including the 4-μg sample) and varying the estimated genome size as 90 Mbp and 100 Mbp. Default parameters were used except as otherwise noted. These assemblies were manually examined, including unique joins leading to a manual merger of assemblies. This manually merged assembly was indel corrected using Illumina reads and Pilon v1.22 (mindepth, 5; K, 85; minmq, 0; minqual, 35; fix indels) (6). Contigs were ordered and oriented according to the Brugia malayi reference genome (7), using NUCmer v3.1 to identify matches between the two genomes and to order B. pahangi contigs based on their B. malayi counterparts (Fig. 1). The genome spans 80.8 Mbp with seven gaps (N50, 11.2 Mbp; L50, 3), has a GC content of 29% with four autosomes and an X chromosome, and was deemed 96.4% complete with BUSCO v3 (8), with 206× Illumina, 182× PacBio, and 72× ONT sequencing depths. The remaining 142 contigs, spanning 16 Mbp, likely contain the mitochondrial genome, haplotypes, and pseudoautosomal and Y chromosome fragments.
FIG 1

Synteny between Brugia pahangi and Brugia malayi chromosomes. The current Brugia pahangi genome was aligned to the Brugia malayi V4 genome (7) using NUCmer v3.1 (default options). The resulting delta file was converted into tabular form using the show-coords function with the option -qlTHb. Contigs >4.5% of the length of the matching B. malayi chromosome were identified in R (v. 3.4.4 [9]; libraries: RColorBrewer, ggplot2, data.table) and plotted with ggplots by chromosome. (A) Chromosome 1; (B) chromosome 2; (C) chromosome 3; (D) chromosome 4; (E) chromosome X. This nearly complete B. pahangi genome spans most of the B. malayi genome with some rearrangements. The orthologous regions of the genomes vary in percent identity but typically show >90% identity, with more nucleotide variation at the ends of chromosomes, relative to the middle of the same chromosomes, using the B. malayi chromosomes as references.

Synteny between Brugia pahangi and Brugia malayi chromosomes. The current Brugia pahangi genome was aligned to the Brugia malayi V4 genome (7) using NUCmer v3.1 (default options). The resulting delta file was converted into tabular form using the show-coords function with the option -qlTHb. Contigs >4.5% of the length of the matching B. malayi chromosome were identified in R (v. 3.4.4 [9]; libraries: RColorBrewer, ggplot2, data.table) and plotted with ggplots by chromosome. (A) Chromosome 1; (B) chromosome 2; (C) chromosome 3; (D) chromosome 4; (E) chromosome X. This nearly complete B. pahangi genome spans most of the B. malayi genome with some rearrangements. The orthologous regions of the genomes vary in percent identity but typically show >90% identity, with more nucleotide variation at the ends of chromosomes, relative to the middle of the same chromosomes, using the B. malayi chromosomes as references.

Data availability.

This genome has been deposited in GenBank under accession number JAAVKF000000000. The version described in this paper is the first version, JAAVKF010000000. The raw data have been deposited in the SRA under the accession numbers SRX4135331, SRX4135330, SRX4135329, SRX4135328, SRX4135327, SRX4135326, SRX4135325, SRX4135324, SRX4135323, SRX4135322, SRX4135321, SRX4135320, SRX4135319, SRX4135318, SRX4135317, SRX4135316, SRX4135315, SRX4135314, SRX4135313, and SRX4135312 (PacBio), SRX7658407, SRX7658383, SRX7658378, SRX7658377, SRX7658352, SRX7658349, SRX7658341, SRX7658327, SRX7658323, SRX7658322, SRX7658317, and SRR10997235 (Illumina), and SRR11565851, SRR11472020, SRR11565849, and SRR11565826 (MinION).
  7 in total

1.  BUSCO: Assessing Genome Assembly and Annotation Completeness.

Authors:  Mathieu Seppey; Mosè Manni; Evgeny M Zdobnov
Journal:  Methods Mol Biol       Date:  2019

Review 2.  The NIH-NIAID Filariasis Research Reagent Resource Center.

Authors:  Michelle L Michalski; Kathryn G Griffiths; Steven A Williams; Ray M Kaplan; Andrew R Moorhead
Journal:  PLoS Negl Trop Dis       Date:  2011-11-29

3.  Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement.

Authors:  Bruce J Walker; Thomas Abeel; Terrance Shea; Margaret Priest; Amr Abouelliel; Sharadha Sakthikumar; Christina A Cuomo; Qiandong Zeng; Jennifer Wortman; Sarah K Young; Ashlee M Earl
Journal:  PLoS One       Date:  2014-11-19       Impact factor: 3.240

4.  MinION-based long-read sequencing and assembly extends the Caenorhabditis elegans reference genome.

Authors:  John R Tyson; Nigel J O'Neil; Miten Jain; Hugh E Olsen; Philip Hieter; Terrance P Snutch
Journal:  Genome Res       Date:  2017-12-22       Impact factor: 9.043

5.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

Authors:  Sergey Koren; Brian P Walenz; Konstantin Berlin; Jason R Miller; Nicholas H Bergman; Adam M Phillippy
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

6.  Nearly Complete Genome Sequence of Brugia malayi Strain FR3.

Authors:  Alan Tracey; Jeremy M Foster; Michael Paulini; Alexandra Grote; John Mattick; Yu-Chih Tsai; Matthew Chung; James A Cotton; Tyson A Clark; Adam Geber; Nancy Holroyd; Jonas Korlach; Silvia Libro; Sara Lustigman; Michelle L Michalski; Matthew B Rogers; Alan Twaddle; Julie C Dunning Hotopp; Matthew Berriman; Elodie Ghedin
Journal:  Microbiol Resour Announc       Date:  2020-06-11

7.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

  7 in total
  2 in total

1.  X-treme loss of sequence diversity linked to neo-X chromosomes in filarial nematodes.

Authors:  John Mattick; Silvia Libro; Robin Bromley; Wanpen Chaicumpa; Matthew Chung; Darren Cook; Mohammad Behram Khan; Nikhil Kumar; Yee-Ling Lau; Shailja Misra-Bhattacharya; Ramakrishna Rao; Lisa Sadzewicz; Atiporn Saeung; Mohd Shahab; Benjamin C Sparklin; Andrew Steven; Joseph D Turner; Luke J Tallon; Mark J Taylor; Andrew R Moorhead; Michelle Michalski; Jeremy M Foster; Julie C Dunning Hotopp
Journal:  PLoS Negl Trop Dis       Date:  2021-10-27

2.  Long-read RNA sequencing of human and animal filarial parasites improves gene models and discovers operons.

Authors:  Nicolas J Wheeler; Paul M Airs; Mostafa Zamanian
Journal:  PLoS Negl Trop Dis       Date:  2020-11-16
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.