Literature DB >> 35234497

Whole-Genome Sequencing and De Novo Assembly of 67 Staphylococcus pseudintermedius Strains Isolated from the Skin of Healthy Dogs.

Norma Fàbregas1, Daniel Pérez2, Joaquim Viñes1, Rocío Fonticoba2, Anna Cuscó1, Lourdes Migura-García3, Lluís Ferrer2, Olga Francino4.   

Abstract

We have de novo assembled 67 Staphylococcus pseudintermedius genomes, with median values of 2.6 Mbp size and 99.43% completeness, 2,386 coding sequences, 19 complete rRNAs, 59 tRNAs, and 4 noncoding RNAs. We released 51 single-contig complete genomes and 16 genomes with a circular main contig using Nanopore sequencing.

Entities:  

Year:  2022        PMID: 35234497      PMCID: PMC8928782          DOI: 10.1128/mra.00039-22

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Staphylococcus pseudintermedius is a commensal of the skin of dogs and the main causative agent of canine pyoderma (1), but populations inhabiting the skin of healthy dogs are largely unknown. The current paradigm indicates that infection arises when the skin barrier is altered by predisposing factors (2). We previously demonstrated that Nanopore sequencing allows de novo assembly of the entire genome of S. pseudintermedius (3). Here, we retrieved 67 S. pseudintermedius genomes from isolates from the skin of healthy dogs using long-read Nanopore sequencing. Samples were obtained by rubbing sterile swabs on four skin sites from nine healthy dogs—perinasal, perioral, inguinal, and perianal. After culture in blood agar at 37°C for 24 h, colonies grown with the morphology of S. pseudintermedius (small silver colonies) were seeded in 3 mL of brain heart infusion (BHI) at 37°C for 16 h. DNA was extracted with a ZymoBIOMICS DNA miniprep kit (Zymo Research). DNA quality and quantity were determined using a NanoDrop 2000 instrument and a Qubit double-stranded DNA (dsDNA) broad range (BR) assay kit (Fisher Scientific). The sequencing libraries were prepared with the rapid barcoding sequencing kit (SQK-RBK004; Oxford Nanopore Technologies [ONT]). Up to 12 barcoded samples were loaded in a MinION FLO-MIN106 9.4.1 flow cell and sequenced in a MinION Mk1B or Mk1C device (ONT). The fast5 files were base-called and demultiplexed, and adapters were trimmed with Guppy 5.0.11 (4) (ONT) (--dna_r9.4.1_450bps_sup.cfg) (--config configuration.cfg --barcode_kits SQK-RBK004 --trim_barcodes; min_score threshold default 60). Reads with a quality score lower than 10 were discarded. Run summary statistics were obtained with Nanoplot 1.38.1 (5) (--N50 --fastq). Isolates were confirmed as S. pseudintermedius by EPI2ME WIMP workflow (6). Genomes were de novo assembled using Flye 2.8.3 (7) (--nano-raw --plasmids --trestle), except HSP279 and HSP281, which were assembled with Flye 2.9 (--nano-hq). Contigs were polished with medaka 1.4.3 (8) (medaka_consensus; -m r941_min_sup_g507). Genome completeness and contamination were assessed with CheckM 1.1.3 (lineage_wf) (9). Circlator 1.5.5 was used to rotate the genomes, fixing the start with the dnaA gene (10) (fixstart --min_id 70). Genomes were annotated with the NCBI Prokaryotic Genome Annotation Pipeline 5.2 and 5.3 (11). Multilocus sequence types (MLSTs) were assigned with PubMLST (https://pubmlst.org/; accessed November 2021) (12). Nanopore sequencing allowed successful de novo assembly and polishing of 67 S. pseudintermedius genomes (Table 1). The average read N50 value was 5,270.77 bp with 133,226.86 reads per sample. The median values were 154× coverage (33 to 398×), 99.43% completeness (96.9 to 99.4%) as in previous hybrid assemblies (13), 2.6 Mbp for contig N50 value and genome size (2.5 to 2.9 Mbp), 37.6% GC content (37.2 to 37.8%), 2,386 coding DNA sequences (CDS) (2,247 to 2,805), 19 complete rRNAs (6 to 8 5S, 5 to 7 26S, and 5 to 8 23S), 59 tRNAs (58 to 61), and 4 noncoding RNAs (ncRNAs).
TABLE 1

Characteristics and accession numbers of high-quality genome assemblies from 67 Staphylococcus pseudintermedius isolates from 4 different skin sites of 9 healthy dogs

IDMLSTSourceYr of isolationCountry of isolationAssembly ID (accession version no.)Assembly levelGenome accession no.Genome assemblyNo. of contigsN50 contigs (bp)Genome size of all contigs (bp)No. of CDS (total)Cov (×)Comp (%)Cont (%)GC content (%)
HSP14745bCf, I, H2020Spain GCA_019997105.1 Compl CP083230 Circular12,596,7382,596,7382,37919997.960.5737.6
HSP14945bCf, I, H2020Spain GCA_019997905.1 Compl CP083229 Circularc12,592,2382,592,2382,3849097.820.5737.6
HSP15145bCf, Pa, H2020Spain GCA_019997185.1 Compl CP083231 Circularc12,596,5282,596,5282,38013497.960.5737.6
HSP15245bCf, Pa, H2020Spain GCA_019997145.1 Compl CP083228 Circular12,596,4992,596,4992,3867896.920.5737.6
HSP153402bCf, Pa, H2020Spain GCA_019997125.1 Compl CP083227 Circularc12,596,5632,596,5632,38711397.680.5737.6
HSP15445bCf, Pa, H2020Spain GCA_019997265.1 Compl CP083226 Circular12,596,6242,596,6242,3765397.960.5737.6
HSP15545bCf, Pa, H2020Spain GCA_019998705.1 Compl CP083225 Circular12,596,6282,596,6282,37115898.770.5737.6
HSP15645bCf, P, H2020Spain GCA_019997565.1 Compl CP083224 Circularc12,596,6952,596,6952,37413298.110.5737.6
HSP15745Cf, P, H2020Spain GCA_019998065.1 Compl CP083223 Circularc12,596,6462,596,6462,38115198.250.5737.6
HSP15845bCf, P, H2020Spain GCA_019997985.1 Compl CP083222 Circular12,596,6302,596,6302,39112097.850.5737.6
HSP15945bCf, P, H2020Spain GCA_019998395.1 Compl CP083221 Circular12,596,3162,596,3162,3779097.660.5737.6
HSP16045bCf, P, H2020Spain GCA_019998665.1 Compl CP083220 Circular12,596,4812,596,4812,3883897.570.5737.6
HSP16645bCf, Pn, H2020Spain GCA_019997325.1 Compl CP083219 Circular12,596,7892,596,7892,37514897.820.5737.6
HSP1992189Cf, Pa, H2020Spain GCA_019998005.1 Compl CP083218 Circular12,574,1552,574,1552,3803399.290.0037.6
HSP2042190Cf, Pa, H2020Spain GCA_019997445.1 Compl CP083217 Circular12,470,6022,470,6022,2603398.300.0037.8
HSP2102191Cf, P, H2020Spain GCA_019998825.1 Compl CP083216 Circular12,472,6612,472,6612,24919199.430.0037.8
HSP2112175Cf, P, H2020Spain GCA_019998805.1 Compl CP083215 Circular12,472,6532,472,6532,24913699.430.0037.8
HSP2122175Cf, P, H2020Spain GCA_019998765.1 Compl CP083214 Circular12,472,7542,472,7542,25715899.430.0037.8
HSP2132175Cf, P, H2020Spain GCA_019998525.1 Compl CP083213 Circular12,472,6622,472,6622,24739899.430.0037.8
HSP2142192Cf, Pn, H2020Spain GCA_019998185.1 Compl CP083212 Circular12,524,3842,524,3842,31911999.430.0037.8
HSP2162176Cf, Pn, H2020Spain GCA_019998745.1 Compl CP083211 Circular12,611,8972,611,8972,4039799.430.5737.6
HSP2242177Cf, P, H2020Spain GCA_019998685.1 Compl CP083210 Circular12,711,3972,711,3972,5018899.430.5737.6
HSP2252177Cf, P, H2020Spain GCA_019998785.1 Compl CP083209 Circular12,752,8982,752,8982,57128999.430.5737.6
HSP2262177Cf, P, H2020Spain GCA_019997165.1 Compl CP083208 Circular12,750,6312,750,6312,56722799.430.5737.6
HSP2272177Cf, P, H2020Spain GCA_019996845.1 Compl CP083207 Circular12,750,5632,750,5632,56821299.430.5737.6
HSP2282177Cf, P, H2020Spain GCA_019997925.1 Compl CP083206 Circular12,750,5602,750,5602,56819599.430.5737.6
HSP2322178Cf, Pa, H2020Spain GCA_020685925.1 Contig JAJEKF000000000 Circular22,598,2042,607,1922,40322099.430.0037.6
HSP235551Cf, Pn, H2020Spain GCA_020685885.1 Contig JAJEKT000000000 Circular22,870,8622,873,6042,79131299.431.1437.2
HSP236551Cf, Pn, H2020Spain GCA_020685905.1 Contig JAJEKS000000000 Circular22,876,0472,878,7922,79012699.431.1437.3
HSP237551Cf, Pn, H2020Spain GCA_020685685.1 Contig JAJEKR000000000 Circular22,829,6052,835,0872,72715799.431.1437.3
HSP23845bCf, Pn, H2020Spain GCA_019997285.1 Compl CP083205 Circular12,646,6442,646,6442,45327296.880.5737.5
HSP23945bCf, Pn, H2020Spain GCA_019997805.1 Compl CP083204 Circular12,647,9822,647,9822,45026598.060.5737.5
HSP240551Cf, P, H2020Spain GCA_019998365.1 Compl CP083203 Circular12,870,8582,870,8582,76016999.431.1437.2
HSP241551Cf, P, H2020Spain GCA_020685845.1 Contig JAJEKQ000000000 Circular22,870,8532,873,5942,78418199.431.1437.2
HSP242551Cf, P, H2020Spain GCA_020685645.1 Contig JAJEKP000000000 Circular22,870,8532,876,3392,79418299.431.1437.2
HSP243551Cf, P, H2020Spain GCA_020685745.1 Contig JAJEKO000000000 Circular22,870,7492,876,2352,77926699.431.1437.2
HSP244551Cf, P, H2020Spain GCA_020685805.1 Contig JAJEKN000000000 Circular22,870,8632,876,3492,80520499.431.1437.2
HSP2452178Cf, Pa, H2020Spain GCA_020685785.1 Contig JAJEKM000000000 Circular32,598,1962,609,4782,39919899.430.0037.6
HSP2462178Cf, Pa, H2020Spain GCA_020685705.1 Contig JAJEKL000000000 Circular22,597,9382,600,9812,39713799.430.0037.6
HSP2492178Cf, Pa, H2020Spain GCA_020685725.1 Contig JAJEKK000000000 Circular22,597,9342,600,9772,39614899.430.0037.6
HSP2502178Cf, I, H2020Spain GCA_020685625.1 Contig JAJEKJ000000000 Circular22,598,1822,601,2322,40517599.430.0037.6
HSP2512178Cf, I, H2020Spain GCA_020685865.1 Contig JAJEKI000000000 Circular22,598,1802,602,6492,40015899.430.0037.6
HSP2522178Cf, I, H2020Spain GCA_019997885.1 Compl CP083202 Circular12,598,1842,598,1842,38017699.430.0037.6
HSP2532178Cf, I, H2020Spain GCA_020685765.1 Contig JAJEKH000000000 Circular22,598,1922,604,2782,39820699.430.0037.6
HSP2551026Cf, Pn, H2020Spain GCA_019997765.1 Compl CP083201 Circular12,599,0102,599,0102,37713198.860.0037.6
HSP2581026Cf, P, H2020Spain GCA_019997205.1 Compl CP083200 Circular12,599,0122,599,0122,37012098.300.0037.6
HSP2591026Cf, P, H2020Spain GCA_019997005.1 Compl CP083199 Circular12,599,0312,599,0312,38315598.860.0037.6
HSP2601026Cf, P, H2020Spain GCA_019997245.1 Compl CP083198 Circular12,599,0242,599,0242,3757199.240.0037.6
HSP2611026Cf, P, H2020Spain GCA_019997785.1 Compl CP083197 Circular12,599,0412,599,0412,37214599.430.0037.6
HSP2621026Cf, P, H2020Spain GCA_019997825.1 Compl CP083196 Circular12,599,0362,599,0362,38214799.430.0037.6
HSP2631026Cf, I, H2020Spain GCA_019997065.1 Compl CP083195 Circular12,599,0192,599,0192,37112599.430.0037.6
HSP2641026Cf, I, H2020Spain GCA_019997705.1 Compl CP083194 Circular12,599,0172,599,0172,38415598.860.0037.6
HSP2651026Cf, I, H2020Spain GCA_019997085.1 Compl CP083193 Circular12,599,0162,599,0162,36910999.430.0037.6
HSP2661026Cf, I, H2020Spain GCA_019997845.1 Compl CP083192 Circular12,599,0102,599,0102,37120499.430.0037.6
HSP2671026Cf, I, H2020Spain GCA_019997625.1 Compl CP083191 Circular12,598,9882,598,9882,3749199.430.0037.6
HSP2742179Cf, P, H2020Spain GCA_019998045.1 Compl CP083190 Circular12,587,5132,587,5132,39423097.820.0037.6
HSP2762179Cf, P, H2020Spain GCA_019998025.1 Compl CP083189 Circular12,587,5262,587,5262,38621998.670.0037.6
HSP2772179Cf, P, H2020Spain GCA_019997945.1 Compl CP083188 Circular12,587,5352,587,5352,39131698.300.0037.6
HSP2782179Cf, Pa, H2020Spain GCA_019997225.1 Compl CP083187 Circular12,587,5402,587,5402,38619698.300.0037.6
HSP279UnknownbCf, Pa, H2020Spain GCA_020693965.1 Compl CP085724 Circularc12,575,5042,575,5042,43613998.560.0037.8
HSP280UnknownbCf, Pa, H2020Spain GCA_019997045.1 Compl CP083186 Circularc12,575,8272,575,8272,37413799.410.0037.8
HSP281UnknownbCf, Pa, H2020Spain GCA_020693985.1 Compl CP085723 Circular12,575,3902,575,3902,44127499.130.0037.8
HSP282UnknownbCf, Pa, H2020Spain GCA_019998285.1 Compl CP083185 Circularc12,575,8512,575,8512,38117099.150.0037.8
HSP283294Cf, I, H2020Spain GCA_019997965.1 Compl CP083184 Circular12,607,0962,607,0962,39915499.430.0037.6
HSP2842180Cf, I, H2020Spain GCA_019996945.1 Compl CP083183 Circular12,568,7342,568,7342,36111399.431.1437.6
HSP2852181Cf, Pn, H2020Spain GCA_020685605.1 Contig JAJEKG000000000 Circular22,642,4102,646,7392,46712799.430.0037.5
HSP2862181Cf, Pn, H2020Spain GCA_020686685.1 Contig JAJEKF000000000 Circular42,642,4152,650,5682,46314499.430.0037.5

ID, identifier; MLST, multilocus sequence type; Cf, Canis lupus familiaris; P, perioral; Pa, perianal; I, inguinal; Pn, perinasal; H, healthy; Compl, complete genome; CDS (total), total number of coding sequences; Cov, coverage; Comp, completeness; Cont, contamination.

MLST alleles with mutations that have not been reported.

Genomes where the dnaA gene is not detected start with a random ATG gene.

Characteristics and accession numbers of high-quality genome assemblies from 67 Staphylococcus pseudintermedius isolates from 4 different skin sites of 9 healthy dogs ID, identifier; MLST, multilocus sequence type; Cf, Canis lupus familiaris; P, perioral; Pa, perianal; I, inguinal; Pn, perinasal; H, healthy; Compl, complete genome; CDS (total), total number of coding sequences; Cov, coverage; Comp, completeness; Cont, contamination. MLST alleles with mutations that have not been reported. Genomes where the dnaA gene is not detected start with a random ATG gene. A total of 51 bacterial isolates were assembled into single-circular-contig complete genomes; 16 isolates were assembled into a main circular contig corresponding to the bacterial chromosome and smaller contigs containing replicon proteins supporting their plasmid origin. Together with our previous study (3), we have announced 95 S. pseudintermedius genomes from the skin of healthy dogs, contributing to a better understanding of S. pseudintermedius pathogenesis.

Data availability.

The standardized strain descriptions and accession numbers are presented in Table 1; the genome assemblies, genomic data, and raw data are publicly available in GenBank under BioProject PRJNA685966 and genome accession numbers CP083183 to CP083231, CP085723, CP085724, and JAJEKF000000000 to JAJEKU000000000.
  11 in total

1.  Assembly of long, error-prone reads using repeat graphs.

Authors:  Mikhail Kolmogorov; Jeffrey Yuan; Yu Lin; Pavel A Pevzner
Journal:  Nat Biotechnol       Date:  2019-04-01       Impact factor: 54.908

2.  Genotypic relatedness of staphylococcal strains isolated from pustules and carriage sites in dogs with superficial bacterial folliculitis.

Authors:  Lauren R Pinchbeck; Lynette K Cole; Andrew Hillier; Joseph J Kowalski; Päivi J Rajala-Schultz; Tammy L Bannerman; Steven York
Journal:  Am J Vet Res       Date:  2006-08       Impact factor: 1.156

Review 3.  Staphylococcus pseudintermedius in the dog: taxonomy, diagnostics, ecology, epidemiology and pathogenicity.

Authors:  Jeanette Bannoehr; Luca Guardabassi
Journal:  Vet Dermatol       Date:  2012-04-19       Impact factor: 1.589

4.  CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.

Authors:  Donovan H Parks; Michael Imelfort; Connor T Skennerton; Philip Hugenholtz; Gene W Tyson
Journal:  Genome Res       Date:  2015-05-14       Impact factor: 9.043

5.  Circlator: automated circularization of genome assemblies using long sequencing reads.

Authors:  Martin Hunt; Nishadi De Silva; Thomas D Otto; Julian Parkhill; Jacqueline A Keane; Simon R Harris
Journal:  Genome Biol       Date:  2015-12-29       Impact factor: 13.583

6.  Performance of neural network basecalling tools for Oxford Nanopore sequencing.

Authors:  Ryan R Wick; Louise M Judd; Kathryn E Holt
Journal:  Genome Biol       Date:  2019-06-24       Impact factor: 13.583

7.  Whole-Genome Sequencing and De Novo Assembly of 61 Staphylococcus pseudintermedius Isolates from Healthy Dogs and Dogs with Pyoderma.

Authors:  O Francino; D Pérez; J Viñes; R Fonticoba; S Madroñero; G Meroni; P Martino; S Martínez; A Cusco; N Fàbregas; L Migura-García; L Ferrer
Journal:  Microbiol Resour Announc       Date:  2021-04-22

8.  NanoPack: visualizing and processing long-read sequencing data.

Authors:  Wouter De Coster; Svenn D'Hert; Darrin T Schultz; Marc Cruts; Christine Van Broeckhoven
Journal:  Bioinformatics       Date:  2018-08-01       Impact factor: 6.937

9.  Open-access bacterial population genomics: BIGSdb software, the PubMLST.org website and their applications.

Authors:  Keith A Jolley; James E Bray; Martin C J Maiden
Journal:  Wellcome Open Res       Date:  2018-09-24

10.  Hybrid Assembly from a Pathogenic Methicillin- and Multidrug-Resistant Staphylococcus pseudintermedius Strain Isolated from a Case of Canine Otitis in Spain.

Authors:  Joaquim Viñes; Anna Cuscó; Olga Francino
Journal:  Microbiol Resour Announc       Date:  2020-01-02
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.