| Literature DB >> 31709015 |
Dhiviya Prabaa Muthuirulandi Sethuvel1, Balaji Veeraraghavan1, Karthick Vasudevan1, Naveen Kumar Devanga Ragupathi1, Dhivya Murugan1, Kamini Walia2, Shalini Anandan1.
Abstract
Shigella is ranked as the second leading cause of diarrheal disease worldwide. Though infection occurs in people of all ages, most of the disease burden constitutes among the children less than 5 years in low and middle income countries. Recent increasing incidence of drug resistant strains make this as a priority pathogen under the antimicrobial resistance surveillance by WHO. Despite this, only limited genomic studies on drug resistant Shigella exists. Here we report the first complete genome of clinical S. flexneri serotype 2a and S. sonnei strains using a hybrid approach of both long-read MinION (Oxford Nanopore Technologies) and short-read Ion Torrent 400 bp sequencing platforms. The utilization of this novel approach in the present study helped to identify the complete plasmid sequence of pSS1653 with structural genetic information of AMR genes such as sulII, tetA, tetR, aph(6)-Id and aph(3'')-Ib. Identification of AMR genes in mobile elements in this human-restricted enteric pathogen is a potential threat for dissemination to other gut pathogens. The information on Shigella at genome level could help us to understand the genome dynamics of existing and emerging resistant clones.Entities:
Keywords: Hybrid assembly; Insertion sequences; Pathogenicity island; Plasmids; Prophages
Year: 2019 PMID: 31709015 PMCID: PMC6836418 DOI: 10.1186/s13099-019-0334-5
Source DB: PubMed Journal: Gut Pathog ISSN: 1757-4749 Impact factor: 4.181
Fig. 1a Circular representation of unnamed plasmid 1, pSS1653 carrying AMR genes identified in S. sonnei (red color indicates AMR genes, green color indicates mobile elements, other CDS shown in blue color). The direction of arrows indicates the orientation of open reading frames (ORFs). b Circular representation of genetic arrangement of unnamed plasmid 2 identified in S. sonnei (blue and green color denotes CDS and reference sequence respectively). The direction of arrows indicates the orientation of open reading frames (ORFs). c Circular representation of genetic arrangement of unnamed plasmid 2 identified in S. sonnei (dark blue and light blue color denotes CDS and reference sequence respectively). The direction of arrows indicates the orientation of open reading frames (ORFs)
Genomic features and Predicted insertion sequence elements of S. flexneri (FC906) and S. sonnei (FC1653) by hybrid assembly approach
| Genomic features | ||
| Length | 4,655,489 bp | 4,807,231 bp |
| Coverage | 278× | 174× |
| Contigs | 1 (chromosome) | 4 (1 chromosome, 3 plasmid) |
| Total genes | 4770 | 4932 |
| Total CDS | 4637 | 4806 |
| Total pseudo genes | 735 | 857 |
| tRNAs | 104 | 95 |
| rRNAs | 22 | 22 |
| GC content | 50.8 | 51.0 |
| Virulence factor (VFDB) | 66 | 57 |
| Resistance (CARD) | 90 | 86 |
| Sequence type | 245 | 152 |
aComplete/partial/pseudogene/unknown
Prophage content of S. flexneri (FC906) analyzed using PHAST tool
| Region | Length (Kb) | CDS | GC content (%) | Position | Completenessa | Possible phagesb | Accession number |
|---|---|---|---|---|---|---|---|
| 1 | 22.5 | 9 | 45.02 | 609,158–631,696 | Incomplete | Bacillus_Blue (3) | NC_031056 |
| 2 | 19 | 19 | 51.73 | 1,558,706–1,577,795 | Intact | Enterobacteria_UAB_Phi20 (10) | NC_031019 |
| 3 | 35.4 | 30 | 47.11 | 1,631,678–1,667,092 | Intact | Shigella_SfII (11) | NC_021857 |
| 4 | 70 | 64 | 51.98 | 2,016,293–2,086,335 | Intact | Phage_Gifsy_1 (15) | NC_010392 |
| 5 | 46.6 | 38 | 49.23 | 2,213,449–2,260,088 | Intact | Salmonella_118970_sal3 (5) | NC_031940 |
| 6 | 36.1 | 27 | 48.89 | 2,408,721–2,444,902 | Incomplete | Salmonella_SJ46 (2) | NC_031129 |
| 7 | 15.7 | 24 | 50.80 | 2,540,849–2,556,550 | Questionable | Geobac_E2 (3) | NC_009552 |
| 8 | 55.4 | 67 | 49.55 | 2,749,711–2,805,160 | Intact | Salmonella_SJ46 (7) | NC_031129 |
| 9 | 14 | 22 | 48.79 | 2,947,521–2,961,586 | Incomplete | Enterobacteria_933 W (3) | NC_000924 |
| 10 | 6.9 | 7 | 49.11 | 3,195,314–3,202,254 | Incomplete | Shigella_SfIV (1) | NC_022749 |
| 11 | 30.9 | 43 | 52.60 | 3,290,710–3,321,696 | Intact | Enterobacteria_mEp460 (13) | NC_019716 |
| 12 | 29.9 | 26 | 47.50 | 3,416,333–3,446,293 | Questionable | Shigella_SfII (3) | NC_021857 |
| 13 | 17.5 | 27 | 51.59 | 3,449,797–3,467,310 | Intact | Stx2_c_1717 (6) | NC_011357 |
| 14 | 44.5 | 27 | 51.58 | 3,573,186–3,617,688 | Intact | Enterobacteria_P1 (3) | NC_005856 |
| 15 | 9.6 | 11 | 50.59 | 4,048,439–4,058,064 | Questionable | Aeromonas_vB_AsaM_56 (4) | NC_019527 |
aPrediction of prophage region, intact (score > 90), questionable (score 70–90) or incomplete (score < 70)
bPhage with the highest number of proteins most similar to those in the region
Prophage content of S. sonnei (FC1653) analyzed using PHAST tool
| Region | Length (Kb) | CDS | GC content (%) | Position | Completenessa | Possible phagesb | Accession number |
|---|---|---|---|---|---|---|---|
| 1 | 31.9 | 44 | 50.44 | 2,093,301–2,125,236 | Intact | Aggregatibacter_S1249 (4) | NC_013597 |
| 2 | 16 | 10 | 51.97 | 2,127,662–2,143,700 | Questionable | Escherichia_Av_05 (2) | NC_025830 |
| 3 | 13.7 | 19 | 49.02 | 2,656,501–2,670,224 | Questionable | Enterobacteria_phiP27 (4) | NC_003356 |
| 4 | 9.9 | 9 | 50.68 | 2,850,233–2,860,210 | Incomplete | Bacillus_G (2) | NC_023719 |
| 5 | 14.7 | 21 | 48.72 | 2,985,856–3,000,602 | Incomplete | Shigella_Ss_VASD (3) | NC_028685 |
| 6 | 24.8 | 33 | 51.37 | 3,194,522–3,219,407 | Intact | Enterobacteria_mEp460 (6) | NC_019716 |
| 7 | 48.7 | 52 | 48.90 | 3,303,427–3,352,149 | Intact | Salmonella_BPS15Q2 (7) | NC_031939 |
| 8 | 22.3 | 12 | 51.21 | 3,615,693–3,638,085 | Incomplete | Escherichia_Av_05 (2) | NC_025830 |
| 9 | 14.4 | 13 | 50.42 | 3,748,403–3,762,850 | Incomplete | Cronobacter_vB_CsaM_GAP32 (2) | NC_019401 |
| 10 | 54.6 | 66 | 50.87 | 3,872,085–3,926,723 | Intact | Enterobacteria_lambda (20) | NC_001416 |
| 11 | 10.4 | 12 | 50.08 | 4,139,568–4,149,969 | Questionable | Stx2_c_1717 (3) | NC_011357 |
| 12 | 26.9 | 30 | 51.76 | 4,213,478–4,240,403 | Intact | Enterobacteria_phiP27 (13) | NC_003356 |
| 13 | 19 | 24 | 49.54 | 4,260,910–4,279,909 | Questionable | Enterobacteria_SfI (7) | NC_027339 |
| 14 | 9.5 | 11 | 50.95 | 4,294,012–4,303,528 | Incomplete | Gordon_OneUp (2) | NC_030917 |
| 15 | 7.9 | 10 | 49.00 | 4,404,915–4,412,855 | Incomplete | Shigella_Sf6 (1) | NC_005344 |
aPrediction of prophage region, intact (score > 90), questionable (score 70–90) or incomplete (score < 70)
bPhage with the highest number of proteins most similar to those in the region
Fig. 2BLAST comparison of Shigella pathogenicity islands identified in S. flexneri (FC906) against reference sequence using Easyfig. a SHI-1 pathogenicity island, b SHI-2 pathogenicity island, c Shigella resistance locus (SRL) carrying antimicrobial resistance genes. Vertical blocks between the two sequences indicate the shared similarity regions shaded according to BLASTn (the pink shading indicate the matches in the same direction and red for inverted matches)
Fig. 3BLAST comparison of Shigella pathogenicity island (SHI-2) identified in S. sonnei against reference sequence using Easyfig. Vertical blocks between the two sequences indicate the shared similarity regions shaded according to BLASTn (the pink shading indicate the matches in the same direction and red for inverted matches)