| Literature DB >> 32404118 |
Berenice Talamantes-Becerra1, Jason Carling2, Andrzej Kilian2, Arthur Georges3.
Abstract
BACKGROUND: This study demonstrates the use of reduced-representation genotyping to provide preliminary identifications for thermophilic bacterial isolates. The approach combines restriction enzyme digestion and PCR with next-generation sequencing to provide thousands of short-read sequences from across the bacterial genomes. Isolates were obtained from compost, hot water systems, and artesian bores of the Great Artesian Basin. Genomic DNA was double-digested with two combinations of restriction enzymes followed by PCR amplification, using a commercial provider of DArTseq™, Diversity Arrays Technology Pty Ltd. (Canberra, Australia). The resulting fragments which formed a reduced-representation of approximately 2.3% of the genome were sequenced. The sequence tags obtained were aligned against all available RefSeq bacterial genome assemblies by BLASTn to identify the nearest reference genome.Entities:
Keywords: Bacterial identification; DArTseq; Genotyping-by-sequencing; Great Artesian Basin; Reduced-representation sequencing; Thermophiles
Mesh:
Substances:
Year: 2020 PMID: 32404118 PMCID: PMC7222431 DOI: 10.1186/s12866-020-01800-z
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Microbial growth for samples from hot water systems and commercial compost. Incubation temperature was 62.5 °C, culture media LB broth agar
| Source | Sample name | Temperature (T°C) | Microbial growth | No. Bacterial isolates |
|---|---|---|---|---|
| Domestic hot water systems | ||||
| DPS1 | 60 °C | (+)(A), (+)(B) | 2 | |
| DPS2 | 61.1 °C | (+)(A), (+)(B) | 2 | |
| DPS3 | 62.6 °C | (+) | 1 | |
| DPS4 | 62.6 °C | (+)(A), (+)(B) | 2 | |
| DPS5 | 79.6 °C | (+)(A), (+)(B), (+)(C) | 3 | |
| DPS6 | 57 °C | (+)(A), (+)(B), (+)(C) | 3 | |
| DPS7 | 57.6 °C | (−) | ||
| HTR | 84 °C | (++) | 1 | |
| DHW | 60 °C | (+)(A), (+)(B) | 2 | |
| Commercial compost | ||||
| DMW | na | (+) | 1 | |
| MPCC | na | (+) | 1 | |
| NFOSA | na | (+)(A), (+)(B), (+)(C) | 3 | |
| MMBA | na | (+)(A), (+)(B) | 2 | |
| MFBB | na | (+)(A), (+)(B), (+)(C), (+)(D) | 4 | |
| MPCB | na | (+) | 1 | |
| CBSP | na | (+)(A), (+)(B), (+)(C) | 3 | |
| Total bacterial isolates | 31 | |||
na = not applicable; (−) = no growth was observed; (+) = growth was observed; (++) = strong growth was observed; (A), (B), (C) name assigned if more than one microorganism was observed
Microbial growth for water and mud samples from the Great Artesian Basin. Incubation temperature was 62.5 °C. Culture media: LB broth agar (pH 5, pH 6.8, pH 8) and PBT pH 6.0
| LB Broth | PBT | No. Bacterial isolates | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Location | Source | Temperature (°C) | pH | Sample name | pH 5.0 | pH 6.8 | pH 8.0 | pH 6.0 | |
| Birdsville | water | 98 °Ca | nd | Birdsville Bore | nd | (+) | (+) | (−) | 2 |
| bore drain | 98 °Ca | nd | Birdsville mud | nd | (+)(A), (+)(B) | (+) | (+)(A), (+)(B), (+)(C) | 6 | |
| Clifton hills | water | 80 °C | 8.0 | CHfil | (−) | (−) | (+)(A) | (−) | 1 |
| bore drain | 80 °C | nd | CHMUD | (+) | (++) | (++)(A), (++)(B) | (++) | 4 | |
| Mount Gason | water | 80 °C | 8.0 | MtGfil | nd | (+)(A), (+)(B) | (+)(A) | (+)(A) | 4 |
| bore drain | 80 °C | nd | Mt.GODS / Mt. GMUD | (+)(A), (+)(B) | (+)(A), (+)(B), (+)(C), (+)(D), (+)(E), (+)(F), (+)(G) | (+)(A), (+)(B), (+)(C), | (+) | 13 | |
| Mirra Mita | water | 79 °C | 8.0 | MMfil | nd | (+) | (−) | (−) | 1 |
| bore drain | 79 °C | nd | MMMUD1 | (−) | (+) | (−) | (+) | 2 | |
| bore drain | 68 °C | nd | MMMUD2 | (++) | (++) | (−) | (−) | 2 | |
| bore drain | 74 °C | nd | MMMUD3 | (−) | (++) | (+)(A) | (+) | 3 | |
| bore drain | 62 °C | nd | MMMUD4 | (−) | (+)(A), (+)(B) | (+)(A), (+)(B) | (+) | 5 | |
| bore drain | 66 °C | nd | MMMUD5 | (−) | (+)(A), (+)(B) | (−) | (−) | 2 | |
| bore drain | 50 °C | nd | MMMUD6 | (−) | (+)(A), (+)(B) | (−) | (−) | 2 | |
| bore drain | 39 °C | nd | MMMUD7 | (−) | (++) | (−) | (−) | 1 | |
| bore drain | 74 °C | nd | MMMUD8 | (+) | (−) | (−) | (−) | 1 | |
| Mungerannie station | water | 78 °C | 7.2 | Mgnhotfil / MgnCCG | (+) | (+) | (+) | (−) | 3 |
| bore drain | 60 °C | nd | MCWH | (−) | (+)(A), (+)(B) | (−) | (−) | 2 | |
| Mulka | soil | 38 °C | nd | MR | (−) | (+)(A) | (+)(A) | (−) | 2 |
| Kopperamanna | water | 60 °C | 8.5 | Efil | (−) | (+)(A), (+)(B) | (−) | (++) | 3 |
| soil | 38 °C | nd | ECO3 | (−) | (−) | (−) | (++) | 1 | |
| Etadunna station | water | 77.9 °C | 8.5 | Kanufil | nd | (+) | (−) | (−) | 1 |
| Dulkaninna | water | 47.8 °C | 8.5 | Dulfil | nd | (+)(+) | (−) | (−) | 1 |
| Clayton station | water | 34 °C | 8.5 | Clfil | nd | (+)(A), (+)(B) | (−) | (−) | 2 |
| bore drain | 34 °C | nd | CLB | (−) | (+) | (−) | (−) | 1 | |
| Lake Harry | water | 46 °C | 8.5 | LHfil | nd | (−) | (−) | (−) | |
| bore drain | 46 °C | nd | LHMUD | (−) | (+)(A), (+)(B) | (−) | (+) | 3 | |
| Total bacterial isolates | 68 | ||||||||
(a) Based on published temperature (Habermehl and Pestov 2002 [14])
nd = not determined; (−) = no growth was observed; (+) = growth was observed; (++) = strong growth was observed; (A), (B), (C) name assigned if more than one microorganism was observed
Fig. 1Extract of report generated by bioinformatics pipeline Currito3.1 DNA Fragment Analyser [32] for sample MMMud_3_LB_ph8. This image shows the first candidate closest match to the sample. The circular graph plotted in Circos [33] shows the BLAST alignment position against the reference genome. The outer black circle represents the candidate reference genome with size indicated in megabases (Mb); the middle blue circle shows aligned sequenced fragments obtained by complexity-reduced genotyping and the Inner green / red circle shows the percentage identity of the alignments, in which values below 95% are red and values equal or above 95% are green. The bar plots show the sequences obtained with and without BLAST alignments against the best reference, in which the X axis shows the sequences classification of sequences with and without hits to a reference or plasmid and the Y axis shows the total number of sequences. The histogram shows the percentage identity of BLAST alignments against the candidate reference, where the X axis shows the BLAST alignment percentage identity highest to lowest and the Y axis has the number of aligned sequences
Bacterial identification results for isolates obtained from hot water systems and commercial compost, showing nearest matches based on BLAST alignment of complexity-reduced genotyping fragments. Average BLASTn percentage identity, nucleotide sequence distance values to best matches and genome coverage percentage are shown
| Source | Sample name | Closest match | Average % identity | Nucleotide sequence distance (NSD) | Genome coverage (%) |
|---|---|---|---|---|---|
| Domestic hot water systems | |||||
| DHWa | 99.27 | 0.00446 | 2.28 | ||
| DHWb | 98.84 | 0.00719 | 2.12 | ||
| DSP1a | 98.18 | 0.01177 | 1.80 | ||
| DPS1b | 98.11 | 0.01174 | 1.80 | ||
| DSP2a | 98.58 | 0.00838 | 3.25 | ||
| DSP2b | 98.58 | 0.00855 | 3.23 | ||
| DSP3 | 98.61 | 0.00837 | 3.18 | ||
| DSP4a | 98.55 | 0.00871 | 3.29 | ||
| DSP4b | 98.60 | 0.00849 | 3.27 | ||
| DSP5a | 98.29 | 0.01041 | 4.32 | ||
| DSP5b | 98.33 | 0.01023 | 4.29 | ||
| DPS5c | 98.35 | 0.01006 | 4.27 | ||
| DSP6a | 98.51 | 0.00876 | 3.29 | ||
| DSP6b | 98.37 | 0.00982 | 2.06 | ||
| DSP6c | 98.35 | 0.00993 | 2.08 | ||
| HTR | 98.29 | 0.01025 | 2.02 | ||
| Commercial compost | |||||
| DMW1 | 99.66 | 0.00189 | 4.54 | ||
| CBSPa | 99.39 | 0.00345 | 2.03 | ||
| CBSPb | 99.84 | 0.00099 | 2.10 | ||
| CBSPc | 99.83 | 0.00105 | 2.19 | ||
| MFBBa | 98.97 | 0.00648 | 1.14 | ||
| MFBBb | 99.59 | 0.00248 | 1.63 | ||
| MFBBc | 99.66 | 0.00212 | 1.10 | ||
| MFBBd | 98.65 | 0.00804 | 1.14 | ||
| MMBAa | 99.00 | 0.00671 | 1.28 | ||
| MMBAb | 98.92 | 0.00674 | 1.12 | ||
| MPCB | 99.58 | 0.00254 | 1.59 | ||
| MPCC | 99.56 | 0.00267 | 1.73 | ||
| NFOSA1 | 99.83 | 0.00106 | 2.11 | ||
| NFOSA2 | 98.69 | 0.00809 | 1.85 | ||
| NFOSA3 | 98.85 | 0.00725 | 1.14 | ||
Bacterial identification results for isolates from the Great Artesian Basin, showing nearest matches based on BLAST alignment of complexity-reduced genotyping fragments. Average BLAST percentage identity, nucleotide sequence distance values to best matches and genome coverage percentage are shown
| Location | Sample name | Closest match | Average % identity | Nucleotide sequence distance (NSD) | Genome coverage (%) |
|---|---|---|---|---|---|
| Birdsville | |||||
| B_fil_LB_pH6.8_a | 98.42 | 0.00937 | 2.15 | ||
| B_fil_LB_pH6.8_b | 98.33 | 0.00985 | 2.16 | ||
| B_mud_LB_pH6.8_a | 94.92 | 0.02571 | 1.43 | ||
| B_mud_LB_pH6.8_b | 94.96 | 0.02496 | 1.44 | ||
| B_mud_LB_pH8 | 99.40 | 0.00389 | 2.08 | ||
| B_mud_PBT_pH6.0_a | 96.97 | 0.01787 | 1.43 | ||
| B_mud_PBT_pH6.0_b | 98.60 | 0.00858 | 1.79 | ||
| B_mud_PBT_pH6.0_c | 97.78 | 0.01344 | 3.01 | ||
| Clifton hills | |||||
| CHfil_LB_pH8 | 99.08 | 0.00577 | 3.32 | ||
| CHMud_LB_pH5 | 98.33 | 0.00964 | 1.26 | ||
| CHMud_LB_pH6.8 | 98.53 | 0.01127 | 2.16 | ||
| CHMud_LB_pH8 | 98.43 | 0.00903 | 1.33 | ||
| CHMud_PBT_pH6.0 | 98.19 | 0.01060 | 1.62 | ||
| Mount Gason | |||||
| MtGfil_LB_pH6.8_a | 98.42 | 0.00951 | 2.32 | ||
| MtGfil_LB_pH6.8_b | 95.49 | 0.02693 | 1.88 | ||
| MtGfil_LB_pH8 | 98.21 | 0.01065 | 2.36 | ||
| MtGfil_PBT_pH6.0 | 94.89 | 0.02937 | 2.19 | ||
| Mt_GMud_LB_pH5 | 97.39 | 0.01570 | 3.51 | ||
| Mt_GMud_LB_pH6.8_a | 97.89 | 0.01101 | 2.02 | ||
| Mt_GMud_LB_pH6.8_b | 99.34 | 0.00434 | 3.45 | ||
| Mt_GMud_LB_pH6.8_c | 99.06 | 0.00561 | 3.37 | ||
| Mt_GMud_LB_pH8 | 99.37 | 0.00424 | 3.44 | ||
| Mt_GMud_PBT_pH6.0 | 96.60 | 0.02055 | 2.48 | ||
| Mt_GODS_LB_pH5 | 99.01 | 0.00600 | 3.60 | ||
| Mt_GODS_LB_pH6.8_a | 98.41 | 0.00921 | 2.18 | ||
| Mt_GODSa_LB_pH6.8_b | 97.18 | 0.01851 | 2.97 | ||
| Mt_GODSb_LB_pH6.8_c | 98.36 | 0.00918 | 2.31 | ||
| Mt_GODSc_LB_pH6.8_d | 98.31 | 0.00937 | 2.27 | ||
| Mt_GODSc_LB_pH8_a | 96.52 | 0.02288 | 3.22 | ||
| Mt_GODSa_LB_pH8_b | 98.25 | 0.00969 | 2.33 | ||
| Mirra Mita | |||||
| MMfil_LB_25/11/15 | 98.10 | 0.01448 | 3.84 | ||
| MMMud_1_LB_pH6.8 | 98.15 | 0.01108 | 1.52 | ||
| MMMud_1_PBT_pH6.0 | 97.74 | 0.01651 | 1.12 | ||
| MMMud_2_LB_pH5 | 96.10 | 0.02497 | 1.74 | ||
| MMMud_2_LB_pH6.8 | 94.41 | 0.03313 | 2.31 | ||
| MMMud_3_LB_pH6.8 | 98.06 | 0.01171 | 1.55 | ||
| MMMud_3_LB_pH8 | 98.14 | 0.01122 | 1.49 | ||
| MMMud_3_PBT_pH6.0 | 99.05 | 0.00721 | 1.49 | ||
| MMMud_4_LB_pH6.8_a | 99.26 | 0.00500 | 2.81 | ||
| MMMud_4_LB_pH6.8_b | 99.65 | 0.00358 | 1.87 | ||
| MMMud_4_LB_pH8_a | 99.49 | 0.00417 | 1.87 | ||
| MMMud_4_LB_pH8_b | 98.41 | 0.00908 | 2.25 | ||
| MMMud_4_PBT_pH6.0 | 98.10 | 0.01148 | 1.38 | ||
| MMMud_5_LB_pH6.8_a | 99.60 | 0.00221 | 3.60 | ||
| MMMud_5_LB_pH6.8_b | 99.29 | 0.00443 | 2.39 | ||
| MMMud_6_LB_pH6.8_a | 99.25 | 0.00438 | 3.58 | ||
| MMMud_6_LB_pH6.8_b | 85.43 | 0.09046 | 8.38 | ||
| MMMud_7_LB_pH5 | 95.90 | 0.02818 | 2.53 | ||
| MMMud_8_LB_pH5 | 97.59 | 0.01704 | 2.12 | ||
| Mungerannie station | |||||
| MgnHotfil_LB_pH6.8 | 99.40 | 0.00403 | 1.88 | ||
| Mgn_CCG_LB_pH5 | 95.60 | 0.02500 | 1.68 | ||
| Mgn_CCG_LB_pH8 | 98.93 | 0.00522 | 3.54 | ||
| MCWH_LB_pH6.8 | 99.04 | 0.00473 | 3.52 | ||
| MCWH_LB_pH8 | 94.13 | 0.06061 | 2.92 | ||
| Mulka | |||||
| MR_LB_pH6.8 | 99.86 | 0.00100 | 1.73 | ||
| MR_LB_pH8 | 99.64 | 0.00212 | 1.41 | ||
| Kopperamanna | |||||
| Efil_LB_pH6.8_a | 98.40 | 0.00959 | 2.30 | ||
| Efil_LB_pH6.8_b | 95.82 | 0.02610 | 2.71 | ||
| Efil_PBT_pH6.0 | 94.90 | 0.02852 | 2.23 | ||
| ECO3_PBT_pH6.0 | 98.08 | 0.01121 | 2.05 | ||
| Etadunna station | |||||
| Kanufil_LB_pH6.8 | 99.39 | 0.00394 | 1.94 | ||
| Dulkaninna | |||||
| Dufil_LB_pH6.8 | 94.92 | 0.02867 | 2.23 | ||
| Clayton station | |||||
| Clfil_LB_pH6.8 | 98.43 | 0.00933 | 2.30 | ||
| Clfil_LB_pH6.8 | 94.36 | 0.03240 | 2.29 | ||
| CLB_LB_pH6.8 | 97.34 | 0.01678 | 1.30 | ||
| Lake Harry | |||||
| LH_Mud_LB_pH6.8_a | 99.57 | 0.00379 | 1.88 | ||
| LH_Mud_LB_pH6.8_b | 98.34 | 0.01092 | 1.73 | ||
| LH_Mud_PBT_pH6.0 | 99.04 | 0.00608 | 3.57 | ||
Whole genome sequencing assembly statistics and nearest relative based on progressiveMAUVE [35, 36]
| Group | Sample name | GenBank accession | Reads | contigs | Largest contig | Total length | N50 | GC (%) | Nearest relative based on MAUVE |
|---|---|---|---|---|---|---|---|---|---|
| MR_LB_pH8 | SDLB00000000 | 3,295,260 | 153 | 166,175 | 3,592,399 | 61,995 | 48.89 | ||
| B_mud_LB_pH8 | SDLA00000000 | 275,247 | 60 | 765,875 | 3,434,851 | 188,577 | 52.02 | ||
| NFOSA3 | SDLE00000000 | 2,067,951 | 111 | 273,724 | 3,334,687 | 72,143 | 42.13 | ||
| DSP4a | SDLD00000000 | 885,071 | 188 | 271,123 | 3,273,238 | 58,160 | 52.32 | ||
| CHMud_LB_pH8 | SDLG00000000 | 636,456 | 67 | 226,012 | 2,712,590 | 137,053 | 41.81 | ||
| Efil_LB_pH6.8 | SDLH00000000 | 777,211 | 53 | 596,376 | 2,794,302 | 321,229 | 41.90 | ||
| MMMud_3_LB_pH8 | SDLC00000000 | 1,174,246 | 1196 | 233,455 | 4,372,943 | 60,623 | 55.27 | ||
| MCWH_LB_pH8 | SDLF00000000 | 1,349,927 | 124 | 345,578 | 3,934,072 | 177,957 | 56.20 |
Comparison of bacterial identification methods showing percentage identity for complexity-reduced genotyping based on BLASTn alignment; whole-genome sequencing best matches using progressiveMauve [35, 36] alignment tool; and the best matches obtained with 16S rRNA gene alignment, including multiple results per sample with equal highest bitscore and percentage identity
| Group | Sample name | Complexity-reduced genotyping best match | % ID | Whole genome sequencing best match result | Mauve mean similarity profile value | 16 s rRNA in silico best match result | % ID |
|---|---|---|---|---|---|---|---|
| MR_LB_pH8 | 99.64 | strain KCTC3902 | 13,618.31 | 100.00 | |||
PA-3 GEPA3 | 100.00 | ||||||
| 100.00 | |||||||
| B_mud_LB_pH8 | PSS1 | 99.40 | PSS1 | 12,550.22 | PSS1 | 100.00 | |
FW23 | 100.00 | ||||||
| NFOSA3 | 98.85 | 8003.04 | 100.00 | ||||
| DSP4a | strain N-3 | 98.55 | strain N-3 | 4299.05 | 100.00 | ||
| 100.00 | |||||||
| 100.00 | |||||||
Sah69 | 100.00 | ||||||
| 100.00 | |||||||
| 100.00 | |||||||
| CHMud_LB_pH8 | 98.43 | 7800.76 | 100.00 | ||||
| Efil_LB_pH6.8_a | strain AB04 | 98.40 | strain AB04 | 10,221.31 | 99.66 | ||
| MMMud_3_LB_pH8 | 98.14 | 3684.00 | strain G1w1 | 99.76 | |||
| 99.76 | |||||||
| MCWH_LB_pH8 | PM1 N690 | 94.13 | 423 | 3758.04 | PM1 N690 | 99.35 |
Fig. 2Sampling locations of 11 water-bores of The Great Artesian Basin showing groundwater temperatures derived from Habermehl and Pestov (2002) [14]. Inset map shows larger position of larger map within Australia