Literature DB >> 30938318

Complete Genome Sequence of the Telford Type S Strain of Mycobacterium avium subsp. paratuberculosis.

Rudiger Brauning1, Karren Plain2, Milan Gautam3, Tonia Russell4, C Carolina Correa4, Patrick Biggs5, Richard Whittington2, Alan Murray6, Marian Price-Carter7.   

Abstract

Mycobacterium avium subsp. paratuberculosis is the causative agent of Johne's disease (JD). Here, we report the complete genome sequence of Telford 9.2, a well-characterized representative strain of the M. avium subsp. paratuberculosis S subtype that is endemic in New Zealand and Australian sheep.
Copyright © 2019 Brauning et al.

Entities:  

Year:  2019        PMID: 30938318      PMCID: PMC6424202          DOI: 10.1128/MRA.00004-19

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Mycobacterium avium subsp. paratuberculosis is the causative agent of Johne’s disease (JD), a chronic, generally subclinical but sometimes fatal granulomatous enteritis of ruminants (1). M. avium subsp. paratuberculosis subtype S (also called either subtype I or subtype III) has been isolated primarily from sheep but also from other ruminant species (reviewed in reference 2). Only draft genomes (3–5) of M. avium subsp. paratuberculosis type S are currently available. Here, we announce the complete genome sequence of Telford 9.2, an IS1311 type S IS900 restriction fragment length polymorphism (RFLP) type S1 strain. This is a clonal culture (passage level 5, including its primary isolation from sheep feces) of an isolate from a clinically infected sheep from New South Wales, Australia. It has been used as inoculum in an experimental model for clinical JD in sheep (6, 7), characterized genetically (8), and is representative of the M. avium subsp. paratuberculosis type endemic in Australian and New Zealand (NZ) sheep (9–11). For Illumina and PacBio sequencing, bacterial stock was inoculated into either supplemented Middlebrook 7H9 (12) (Illumina) or M7H9C (PacBio) (13) liquid medium, cultured for 3 to 4 weeks, and then cultivated on modified Middlebrook 7H10 solid medium (12), harvested, and stored at −80°C. Genomic DNA was prepared for both PacBio and Illumina sequencing by isopropanol precipitation and 70% ethanol wash of cetyltrimethylammonium bromide (CTAB)/phenol-chloroform-extracted cellular material after stepwise enzymatic digestion with lysozyme, RNase A, and proteinase K. For PacBio sequencing, the DNA was also digested with mutanolysin prior to proteinase K digestion and subjected to extra cleanup and concentration on Ampure PB magnetic beads. A PacBio library was constructed and sequenced at the Ramaciotti Centre in Sydney, Australia, using P6-C4 chemistry, and sequenced to a coverage depth of 80× on the PacBio RS II platform on a single-molecule real-time (SMRT) cell. It was improved with Illumina MiSeq 250-bp paired-end (PE) reads generated by sequencing two cultures of the Telford 9.2 reference strain. MiSeq-indexed libraries were created at New Zealand Genomics Limited using Nextera XT DNA kits (Illumina, San Diego, CA). Average coverage was 120× from PacBio data and 135× from Illumina data. There were 2.5 million Illumina PE reads (909 Mbp) and 150,000 PacBio reads prefilter (555 Mbp; N50 value, 10.5 kbp). PacBio reads went through default filtering steps in SMRTPipe v1.87.139483, which reduced read numbers to 63,000 (491 Mbp; N50 value, 10.8 kbp), and were assembled using PacBio Hierarchical Genome Assembly Process v3 (HGAP3; SMRT analysis v2.3.0) into a single contig (Telford1) of the size expected for a complete M. avium subsp. paratuberculosis genome and with a GC content of 69.2%, which is typical of M. avium subsp. paratuberculosis (3–5, 14). The PacBio-based assembly was improved by removing a 9-kbp overlap between the start and the end of the genome, orienting the genome with the start position at the beginning of the dnaA gene and mapping Illumina reads onto the PacBio assembly to detect and repair small-scale variations, as described in Table 1.
TABLE 1

Discrepancies between Illumina and PacBio data

Position before fixVariant typeAccepted solutionPacBio alleleIllumina alleleApplied fix
780880bIndelIlluminaTTGInsertion
931746SNPPacBioCGna
1112469IndelIlluminaGCCCCCGCCCCCCInsertion
1302183IndelIlluminaAGGGGAGGGGGInsertion
1969375IndelIlluminaGCCCCCGCCCCCCInsertion
2128150IndelIlluminaACCCCCACCCCCCInsertion
2276090IndelIlluminaCGGGGGCGGGGGGInsertion
2577759SNPPacBioGAna
2635929IndelIlluminaGCCCCGCCCCCInsertion
2642118SNPPacBioCTna
2705636IndelIlluminaGCCCCCGCCCCCCInsertion
3024648IndelIlluminaTTCInsertion
3201490SNPPacBioCGna
3201602SNPPacBioAGna
3211597IndelIlluminaCGGGGGGGCGGGGGGGGInsertion
3450836IndelPacBioCATCGTCGCGCCGTGCTGGGCGGCCAGCGCGTCGCCGACCAGGCTGCGCGCCGGCTCGACGCGCCGCGCGGCCCGCAGCGCCTGCTGGGCna
4313098SNPIlluminaNGBase change
4314018IndelIlluminaGTTTGTTDeletion
4318473IndelIlluminaACADeletion
4319018IndelIlluminaACADeletion
4319236IndelIlluminaGTTTGTTDeletion
4319286IndelIlluminaCGGGGCGGGDeletion
4320148IndelIlluminaACGCGCGCACGCGCDeletion
4371898SNPPacBioGTna
4416918IndelPacBioCCGTTCGGCGCCGAGCGTCACGCCAGCGTGGCGCTCGCGGGCCGGCGCCACGCTGGCGTGACGCCGna
4421523IndelIlluminaGCCCCGCCCCCInsertion
4572001SNPPacBioGAna
4594338IndelIlluminaACCCCACCCCCInsertion

Illumina reads were mapped onto the PacBio assembly using BWA-MEM (17) v0.7.17-r1188 with parameter “-M,” and then variants (SNPs and indels) were detected (SAMtools [18] v1.3 with parameters “view -q 30 -F 256,” SAMtools v1.3 with parameters “mpileup -t DP,AD,” BCFtools v0.1.16 with parameters “call –cv,” BCFtools v0.1.16 with parameters “view -M2”). For each variant, a read depth greater than 10 was required, and a visual check of mapq values as well as the reference and alternative allele counts was performed. As a result of this analysis, for SNPs the PacBio alleles were accepted, for short indels the Illumina alleles were accepted, and for longer indels the PacBio alleles were accepted. All variants were verified by comparing 200 bp of flanking sequence (centered on the variants) to very closely related map strains (3, 4) using the “map to a reference” function in Geneious (19) and also comparing this fragment to M. avium subsp. paratuberculosis strains included in NCBI taxid 1770 using the NCBI BLAST service with default settings. SNP, single nucleotide polymorphism; na, no action.

For the indel at position 780880, the Telford1 sequence differed from closely related strains in both PacBio and Illumina alleles; Sanger sequencing confirmed the Illumina call.

Discrepancies between Illumina and PacBio data Illumina reads were mapped onto the PacBio assembly using BWA-MEM (17) v0.7.17-r1188 with parameter “-M,” and then variants (SNPs and indels) were detected (SAMtools [18] v1.3 with parameters “view -q 30 -F 256,” SAMtools v1.3 with parameters “mpileup -t DP,AD,” BCFtools v0.1.16 with parameters “call –cv,” BCFtools v0.1.16 with parameters “view -M2”). For each variant, a read depth greater than 10 was required, and a visual check of mapq values as well as the reference and alternative allele counts was performed. As a result of this analysis, for SNPs the PacBio alleles were accepted, for short indels the Illumina alleles were accepted, and for longer indels the PacBio alleles were accepted. All variants were verified by comparing 200 bp of flanking sequence (centered on the variants) to very closely related map strains (3, 4) using the “map to a reference” function in Geneious (19) and also comparing this fragment to M. avium subsp. paratuberculosis strains included in NCBI taxid 1770 using the NCBI BLAST service with default settings. SNP, single nucleotide polymorphism; na, no action. For the indel at position 780880, the Telford1 sequence differed from closely related strains in both PacBio and Illumina alleles; Sanger sequencing confirmed the Illumina call. Telford1 has a sequence length of 4,907,428 bases, 4,377 coding sequences as predicted with the NCBI Prokaryotic Genome Annotation Pipeline (15), and an in silico IS1311 type S IS900 RFLP type S1 type (16).

Data availability.

The genome assembly is available at GenBank under accession number CP033688 and the BioProject accession number PRJNA504678; raw data are available under SRA accession numbers SRX4997502 (Illumina) and SRX4997501 (PacBio), and in silico typing results can be found at https://doi.org/10.6084/m9.figshare.7635977.
  4 in total

Review 1.  The Many Hosts of Mycobacteria 8 (MHM8): A conference report.

Authors:  Michelle H Larsen; Karen Lacourciere; Tina M Parker; Alison Kraigsley; Jacqueline M Achkar; Linda B Adams; Kathryn M Dupnik; Luanne Hall-Stoodley; Travis Hartman; Carly Kanipe; Sherry L Kurtz; Michele A Miller; Liliana C M Salvador; John S Spencer; Richard T Robinson
Journal:  Tuberculosis (Edinb)       Date:  2020-02-11       Impact factor: 3.131

2.  Complete Genome Sequence of a Type III Ovine Strain of Mycobacterium avium subsp. paratuberculosis.

Authors:  John P Bannantine; Darrell O Bayles; Franck Biet
Journal:  Microbiol Resour Announc       Date:  2021-03-11

3.  Molecular characterisation of Mycobacterium avium subsp. paratuberculosis in Australia.

Authors:  Rachel Hodgeman; Rachel Mann; Keith Savin; Noel Djitro; Simone Rochfort; Brendan Rodoni
Journal:  BMC Microbiol       Date:  2021-04-01       Impact factor: 3.605

4.  Whole-Genome Analysis of Mycobacterium avium subsp. paratuberculosis IS900 Insertions Reveals Strain Type-Specific Modalities.

Authors:  Cyril Conde; Marian Price-Carter; Thierry Cochard; Maxime Branger; Karen Stevenson; Richard Whittington; John P Bannantine; Franck Biet
Journal:  Front Microbiol       Date:  2021-05-10       Impact factor: 5.640

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.