Literature DB >> 32404118

Discovery of thermophilic Bacillales using reduced-representation genotyping for identification.

Berenice Talamantes-Becerra1, Jason Carling2, Andrzej Kilian2, Arthur Georges3.   

Abstract

BACKGROUND: This study demonstrates the use of reduced-representation genotyping to provide preliminary identifications for thermophilic bacterial isolates. The approach combines restriction enzyme digestion and PCR with next-generation sequencing to provide thousands of short-read sequences from across the bacterial genomes. Isolates were obtained from compost, hot water systems, and artesian bores of the Great Artesian Basin. Genomic DNA was double-digested with two combinations of restriction enzymes followed by PCR amplification, using a commercial provider of DArTseq™, Diversity Arrays Technology Pty Ltd. (Canberra, Australia). The resulting fragments which formed a reduced-representation of approximately 2.3% of the genome were sequenced. The sequence tags obtained were aligned against all available RefSeq bacterial genome assemblies by BLASTn to identify the nearest reference genome.
RESULTS: Based on the preliminary identifications, a total of 99 bacterial isolates were identified to species level, from which 8 isolates were selected for whole-genome sequencing to assess the identification results. Novel species and strains were discovered within this set of isolates. The preliminary identifications obtained by reduced-representation genotyping, as well as identifications obtained by BLASTn alignment of the 16S rRNA gene sequence, were compared with those derived from the whole-genome sequence data, using the same RefSeq sequence database for the three methods. Identifications obtained with reduced-representation sequencing agreed with the identifications provided by whole-genome sequencing in 100% of cases. The identifications produced by BLASTn alignment of 16S rRNA gene sequence to the same database differed from those provided by whole-genome sequencing in 37.5% of cases, and produced ambiguous identifications in 50% of cases.
CONCLUSIONS: Previously, this method has been successfully demonstrated for use in bacterial identification for medical microbiology. This study demonstrates the first successful use of DArTseq™ for preliminary identification of thermophilic bacterial isolates, providing results in complete agreement with those obtained from whole-genome sequencing of the same isolates. The growing database of bacterial genome sequences provides an excellent resource for alignment of reduced-representation sequence data for identification purposes, and as the available sequenced genomes continue to grow, the technique will become more effective.

Entities:  

Keywords:  Bacterial identification; DArTseq; Genotyping-by-sequencing; Great Artesian Basin; Reduced-representation sequencing; Thermophiles

Mesh:

Substances:

Year:  2020        PMID: 32404118      PMCID: PMC7222431          DOI: 10.1186/s12866-020-01800-z

Source DB:  PubMed          Journal:  BMC Microbiol        ISSN: 1471-2180            Impact factor:   3.605


Background

Thermophiles continue to generate interest owing to the thermostability of their enzymes, which have been adapted for use in scientific and industrial processes The proteins of thermophilic bacteria generally exhibit higher thermostability compared to those of mesophiles, in part because they tend to have stronger hydrophobic interactions amongst their amino acids than in other bacteria [1]. The ability to withstand such extreme temperatures has made the enzymes from thermophilic bacteria of particular interest for commercial, industrial and scientific applications [2-5] in areas such as pharmaceutical [6], food [7, 8] and detergent industries [9]. The classic environments in which thermophilic microorganisms occur are primarily geothermal in nature [10]. The Great Artesian Basin in South Australia has the temperature and chemical properties which are suitable for thermophiles [11-13]. Specifically, there are bores in this region with water temperatures of 90 °C or more [14, 15], some of them with open, running bore drains known to contain communities of thermophilic microorganisms [16]. Thermophilic bacteria are also found in other environments such as compost [17-20] and hot water systems [21, 22]. The isolation and discovery of thermophilic bacteria is a continuing area of research interest around the world. The identification of novel thermophilic isolates is now routinely achieved through DNA sequencing methods. Jain et al. (2018), in a high throughput analysis of 90 thousand bacterial genomes, discussed the importance of accurate estimation of genetic relatedness in species delimitation. In this context, ANI (Average Nucleotide Identity) has been considered one of the standard tools for this task. ANI is calculated as the average nucleotide identity from the set of orthologous genes identified between any two genomes. Organisms belonging to the same species are typically considered to show ANI values of ≥95% in pairwise comparisons [23]. Here we aim to assess reduced-representation sequencing as an alternative method for preliminary identification of isolates derived from sampling locations across Australia. A standard approach to the identification of novel bacterial strains or species utilises partial or complete 16S rRNA gene sequence as a preliminary identification method to screen for potentially novel strains or species among a set of isolates. Candidates identified from the 16S rRNA gene sequencing subsequently undergo whole-genome sequencing. The use of 16S rRNA gene sequencing for bacterial identification is well established, although it has two potential limitations: firstly, in some cases it is necessary to attempt more than one set of PCR primers in order to achieve amplification from bacterial genomes of unknown taxonomic affinity, and secondly, the potential for limited resolution of identifications obtained from the 16S rRNA gene sequence. For this study, we have tested a novel approach of reduced-representation sequencing for the first stage identification of bacterial isolates to identify 99 isolates from a variety of thermal sources. Additionally, we have compared the preliminary identification outcomes obtained from 16S rRNA gene sequence and reduced-representation sequencing with identifications derived from whole-genome sequence on a subset of bacterial isolates. Our method used DArTseq™ (Canberra, Australia) [24], one of several available methods for generating representative sequences from the genome. It uses restriction enzyme digestion followed by PCR and Illumina short-read sequencing to amplify and sequence thousands of restriction fragments as genomic representations. DArTseq™ has been successfully used for a broad range of applications, for breeding of plants and animals [25], for assessment of genetic diversity [26-28] and for ecological genetics [29, 30]. This study represents the first usage of DArTseq™ for identification of thermophilic bacterial isolates.

Results

In-silico analysis of control E. coli O157 (EDL 933) IRMM449 certified reference standard

Reduced-representation sequence assays were performed as a control experiment on the reference standard genomic DNA of E. coli O157 (EDL 933) IRMM449 [31], with 6 technical replicates for each combination of restriction enzymes. Correct identification results were produced for all assays at the species and strain level using the Currito3.1 DNA Fragment Analysis Software [32] which was developed for this project. The mean genome coverage obtained for each method was 2.64% for PstI with HpaII and 2.34% for PstI with MseI. The mean BLASTn percentage alignment values obtained against the genome sequence of E. coli O157 (EDL 933) IRMM449, GenBank accession number CP008957.1 [31] were 99.9915 and 99.9974%, respectively. The average number of restriction fragments obtained in the sequence output for each method was 2433 and 1836 fragments, respectively. Finally, the average nucleotide sequence distance (NSD) value obtained for each method was 0.000103 and 0.000040 respectively. For the PstI with HpaII enzyme combination, the average NSD values showed less than 1 bp of difference per 10,000 bp aligned.

Isolation of the strains

A total of 99 bacterial isolates were obtained from 27 different sampling sources. Microbial growth results of 31 isolates from hot water systems and commercial composts are shown in Table 1 and microbial growth results of 68 bacterial isolates from artesian bore water and bore drains are shown in Table 2.
Table 1

Microbial growth for samples from hot water systems and commercial compost. Incubation temperature was 62.5 °C, culture media LB broth agar

SourceSample nameTemperature (T°C)Microbial growthNo. Bacterial isolates
Domestic hot water systems
DPS160 °C(+)(A), (+)(B)2
DPS261.1 °C(+)(A), (+)(B)2
DPS362.6 °C(+)1
DPS462.6 °C(+)(A), (+)(B)2
DPS579.6 °C(+)(A), (+)(B), (+)(C)3
DPS657 °C(+)(A), (+)(B), (+)(C)3
DPS757.6 °C(−)
HTR84 °C(++)1
DHW60 °C(+)(A), (+)(B)2
Commercial compost
DMWna(+)1
MPCCna(+)1
NFOSAna(+)(A), (+)(B), (+)(C)3
MMBAna(+)(A), (+)(B)2
MFBBna(+)(A), (+)(B), (+)(C), (+)(D)4
MPCBna(+)1
CBSPna(+)(A), (+)(B), (+)(C)3
Total bacterial isolates31

na = not applicable; (−) = no growth was observed; (+) = growth was observed; (++) = strong growth was observed; (A), (B), (C) name assigned if more than one microorganism was observed

Table 2

Microbial growth for water and mud samples from the Great Artesian Basin. Incubation temperature was 62.5 °C. Culture media: LB broth agar (pH 5, pH 6.8, pH 8) and PBT pH 6.0

LB BrothPBTNo. Bacterial isolates
LocationSourceTemperature (°C)pHSample namepH 5.0pH 6.8pH 8.0pH 6.0
Birdsvillewater98 °CandBirdsville Borend(+)(+)(−)2
bore drain98 °CandBirdsville mudnd(+)(A), (+)(B)(+)(+)(A), (+)(B), (+)(C)6
Clifton hillswater80 °C8.0CHfil(−)(−)(+)(A)(−)1
bore drain80 °CndCHMUD(+)(++)(++)(A), (++)(B)(++)4
Mount Gasonwater80 °C8.0MtGfilnd(+)(A), (+)(B)(+)(A)(+)(A)4
bore drain80 °CndMt.GODS / Mt. GMUD(+)(A), (+)(B)(+)(A), (+)(B), (+)(C), (+)(D), (+)(E), (+)(F), (+)(G)(+)(A), (+)(B), (+)(C),(+)13
Mirra Mitawater79 °C8.0MMfilnd(+)(−)(−)1
bore drain79 °CndMMMUD1(−)(+)(−)(+)2
bore drain68 °CndMMMUD2(++)(++)(−)(−)2
bore drain74 °CndMMMUD3(−)(++)(+)(A)(+)3
bore drain62 °CndMMMUD4(−)(+)(A), (+)(B)(+)(A), (+)(B)(+)5
bore drain66 °CndMMMUD5(−)(+)(A), (+)(B)(−)(−)2
bore drain50 °CndMMMUD6(−)(+)(A), (+)(B)(−)(−)2
bore drain39 °CndMMMUD7(−)(++)(−)(−)1
bore drain74 °CndMMMUD8(+)(−)(−)(−)1
Mungerannie stationwater78 °C7.2Mgnhotfil / MgnCCG(+)(+)(+)(−)3
bore drain60 °CndMCWH(−)(+)(A), (+)(B)(−)(−)2
Mulkasoil38 °CndMR(−)(+)(A)(+)(A)(−)2
Kopperamannawater60 °C8.5Efil(−)(+)(A), (+)(B)(−)(++)3
soil38 °CndECO3(−)(−)(−)(++)1
Etadunna stationwater77.9 °C8.5Kanufilnd(+)(−)(−)1
Dulkaninnawater47.8 °C8.5Dulfilnd(+)(+)(−)(−)1
Clayton stationwater34 °C8.5Clfilnd(+)(A), (+)(B)(−)(−)2
bore drain34 °CndCLB(−)(+)(−)(−)1
Lake Harrywater46 °C8.5LHfilnd(−)(−)(−)
bore drain46 °CndLHMUD(−)(+)(A), (+)(B)(−)(+)3
Total bacterial isolates68

(a) Based on published temperature (Habermehl and Pestov 2002 [14])

nd = not determined; (−) = no growth was observed; (+) = growth was observed; (++) = strong growth was observed; (A), (B), (C) name assigned if more than one microorganism was observed

Microbial growth for samples from hot water systems and commercial compost. Incubation temperature was 62.5 °C, culture media LB broth agar na = not applicable; (−) = no growth was observed; (+) = growth was observed; (++) = strong growth was observed; (A), (B), (C) name assigned if more than one microorganism was observed Microbial growth for water and mud samples from the Great Artesian Basin. Incubation temperature was 62.5 °C. Culture media: LB broth agar (pH 5, pH 6.8, pH 8) and PBT pH 6.0 (a) Based on published temperature (Habermehl and Pestov 2002 [14]) nd = not determined; (−) = no growth was observed; (+) = growth was observed; (++) = strong growth was observed; (A), (B), (C) name assigned if more than one microorganism was observed Sediment and water samples inoculated into culture media showed different growth depending on the source. Sediment samples collected from flowing bore drains from the Great Artesian Basin, showed heavy microbial growth at various pH values ranging from between 5.0 to 8.0. Microbial colonies within the inoculated agar plates sometimes showed differing morphologies, indicating the possible presence of more than one bacterial strain. Colonies with distinct morphologies were isolated individually (Table 2).

Species identification

Results obtained from the analysis of the reduced-representation sequences using Currito3.1 DNA Fragment Analysis Software [32] provided preliminary identification and similarity information for all isolates. An example of the report produced by Currito3.1 DNA Fragment Analysis Software [32] for the isolate MMMud_3_LB_pH8 is shown in Fig. 1. The full list of identification results of isolates obtained from hot water systems and commercial composts are shown in Table 3 and the identification results of all isolates from artesian bore water and bore drains are shown in Table 4. Colonies possessing different morphologies within a single plate were isolated and given suffix a, b, c, and d, in some cases these isolates may be duplicates. For each isolate, the nearest sequenced genome from the NCBI RefSeq database is given, along with the average BLASTn percentage identity based on the nearest genome. Percentages obtained ranged from 85.43 to 99.84%. In many instances the BLASTn percentage identity against the nearest genome was > 98.00%, indicating that the isolates belonged to the same species. From this set, 8 isolates were found to have a BLASTn percentage identity against the nearest genome of < 95.00%, indicating potential new species [23].
Fig. 1

Extract of report generated by bioinformatics pipeline Currito3.1 DNA Fragment Analyser [32] for sample MMMud_3_LB_ph8. This image shows the first candidate closest match to the sample. The circular graph plotted in Circos [33] shows the BLAST alignment position against the reference genome. The outer black circle represents the candidate reference genome with size indicated in megabases (Mb); the middle blue circle shows aligned sequenced fragments obtained by complexity-reduced genotyping and the Inner green / red circle shows the percentage identity of the alignments, in which values below 95% are red and values equal or above 95% are green. The bar plots show the sequences obtained with and without BLAST alignments against the best reference, in which the X axis shows the sequences classification of sequences with and without hits to a reference or plasmid and the Y axis shows the total number of sequences. The histogram shows the percentage identity of BLAST alignments against the candidate reference, where the X axis shows the BLAST alignment percentage identity highest to lowest and the Y axis has the number of aligned sequences

Table 3

Bacterial identification results for isolates obtained from hot water systems and commercial compost, showing nearest matches based on BLAST alignment of complexity-reduced genotyping fragments. Average BLASTn percentage identity, nucleotide sequence distance values to best matches and genome coverage percentage are shown

SourceSample nameClosest matchAverage % identityNucleotide sequence distance (NSD)Genome coverage (%)
Domestic hot water systems
DHWaGeobacillus sp. 899.270.004462.28
DHWbGeobacillus sp. 898.840.007192.12
DSP1aGeobacillus lituanicus strain N-398.180.011771.80
DPS1bGeobacillus lituanicus strain N-398.110.011741.80
DSP2aGeobacillus lituanicus strain N-398.580.008383.25
DSP2bGeobacillus lituanicus strain N-398.580.008553.23
DSP3Geobacillus lituanicus strain N-398.610.008373.18
DSP4aGeobacillus lituanicus strain N-398.550.008713.29
DSP4bGeobacillus lituanicus strain N-398.600.008493.27
DSP5aGeobacillus lituanicus strain N-398.290.010414.32
DSP5bGeobacillus lituanicus strain N-398.330.010234.29
DPS5cGeobacillus lituanicus strain N-398.350.010064.27
DSP6aGeobacillus lituanicus strain N-398.510.008763.29
DSP6bGeobacillus sp. MAS1 T26098.370.009822.06
DSP6cGeobacillus sp. MAS1 T26098.350.009932.08
HTRGeobacillus sp. MAS1 T26098.290.010252.02
Commercial compost
DMW1Geobacillus thermoleovorans strain ID-199.660.001894.54
CBSPaGeobacillus thermodenitrificans strain G11MC1699.390.003452.03
CBSPbGeobacillus thermodenitrificans strain G11MC1699.840.000992.10
CBSPcGeobacillus thermodenitrificans strain G11MC1699.830.001052.19
MFBBaGeobacillus galactosidasius strain DSM 1875198.970.006481.14
MFBBbGeobacillus thermodenitrificans strain T1299.590.002481.63
MFBBcGeobacillus thermodenitrificans strain JSC_T9a99.660.002121.10
MFBBdGeobacillus galactosidasius strain DSM 1875198.650.008041.14
MMBAaGeobacillus galactosidasius strain DSM 1875199.000.006711.28
MMBAbGeobacillus galactosidasius strain DSM 1875198.920.006741.12
MPCBGeobacillus thermodenitrificans strain T1299.580.002541.59
MPCCGeobacillus thermodenitrificans strain G11MC1699.560.002671.73
NFOSA1Geobacillus thermodenitrificans strain G11MC1699.830.001062.11
NFOSA2Geobacillus sp. 898.690.008091.85
NFOSA3Geobacillus galactosidasius strain DSM 1875198.850.007251.14
Table 4

Bacterial identification results for isolates from the Great Artesian Basin, showing nearest matches based on BLAST alignment of complexity-reduced genotyping fragments. Average BLAST percentage identity, nucleotide sequence distance values to best matches and genome coverage percentage are shown

LocationSample nameClosest matchAverage % identityNucleotide sequence distance (NSD)Genome coverage (%)
Birdsville
B_fil_LB_pH6.8_aAnoxybacillus ayderensis strain AB0498.420.009372.15
B_fil_LB_pH6.8_bAnoxybacillus ayderensis strain AB0498.330.009852.16
B_mud_LB_pH6.8_aAnoxybacillus suryakundensis strain DSM 2737494.920.025711.43
B_mud_LB_pH6.8_bAnoxybacillus suryakundensis strain DSM 2737494.960.024961.44
B_mud_LB_pH8Geobacillus vulcani PSS1 N68599.400.003892.08
B_mud_PBT_pH6.0_aAnoxybacillus gonensis strain G296.970.017871.43
B_mud_PBT_pH6.0_bGeobacillus sp. 898.600.008581.79
B_mud_PBT_pH6.0_cAnoxybacillus kamchatkensis strain G1097.780.013443.01
Clifton hills
CHfil_LB_pH8Anoxybacillus flavithermus AK199.080.005773.32
CHMud_LB_pH5Anoxybacillus sp. 10398.330.009641.26
CHMud_LB_pH6.8Geobacillus sp. 46C-IIa98.530.011272.16
CHMud_LB_pH8Anoxybacillus sp. 10398.430.009031.33
CHMud_PBT_pH6.0Anoxybacillus sp. BCO1 LR6898.190.010601.62
Mount Gason
MtGfil_LB_pH6.8_aAnoxybacillus ayderensis strain AB0498.420.009512.32
MtGfil_LB_pH6.8_bAnoxybacillus sp. BCO1 LR6895.490.026931.88
MtGfil_LB_pH8Anoxybacillus ayderensis strain AB0498.210.010652.36
MtGfil_PBT_pH6.0Anoxybacillus kamchatkensis strain G1094.890.029372.19
Mt_GMud_LB_pH5Anoxybacillus flavithermus AK197.390.015703.51
Mt_GMud_LB_pH6.8_aAnoxybacillus sp. BCO1 LR6897.890.011012.02
Mt_GMud_LB_pH6.8_bAnoxybacillus kamchatkensis strain G1099.340.004343.45
Mt_GMud_LB_pH6.8_cAnoxybacillus flavithermus AK199.060.005613.37
Mt_GMud_LB_pH8Anoxybacillus kamchatkensis strain G1099.370.004243.44
Mt_GMud_PBT_pH6.0Anoxybacillus ayderensis strain AB0496.600.020552.48
Mt_GODS_LB_pH5Geobacillus thermoleovorans strain ID-199.010.006003.60
Mt_GODS_LB_pH6.8_aAnoxybacillus ayderensis strain AB0498.410.009212.18
Mt_GODSa_LB_pH6.8_bGeobacillus thermoleovorans strain ID-197.180.018512.97
Mt_GODSb_LB_pH6.8_cAnoxybacillus ayderensis strain AB0498.360.009182.31
Mt_GODSc_LB_pH6.8_dAnoxybacillus ayderensis strain AB0498.310.009372.27
Mt_GODSc_LB_pH8_aAnoxybacillus kamchatkensis strain G1096.520.022883.22
Mt_GODSa_LB_pH8_bAnoxybacillus ayderensis strain AB0498.250.009692.33
Mirra Mita
MMfil_LB_25/11/15Anoxybacillus flavithermus AK198.100.014483.84
MMMud_1_LB_pH6.8Geobacillus subterraneus PSS2 N67198.150.011081.52
MMMud_1_PBT_pH6.0Geobacillus sp. MAS1 T26097.740.016511.12
MMMud_2_LB_pH5Geobacillus subterraneus PSS2 N67196.100.024971.74
MMMud_2_LB_pH6.8Anoxybacillus kamchatkensis strain G1094.410.033132.31
MMMud_3_LB_pH6.8Geobacillus subterraneus PSS2 N67198.060.011711.55
MMMud_3_LB_pH8Geobacillus subterraneus PSS2 N67198.140.011221.49
MMMud_3_PBT_pH6.0Geobacillus jurassicus NBRC 10782999.050.007211.49
MMMud_4_LB_pH6.8_aAnoxybacillus gonensis strain G299.260.005002.81
MMMud_4_LB_pH6.8_bAnoxybacillus gonensis strain G2 AG-199.650.003581.87
MMMud_4_LB_pH8_aAnoxybacillus gonensis strain G2 AG-199.490.004171.87
MMMud_4_LB_pH8_bAnoxybacillus ayderensis strain AB0498.410.009082.25
MMMud_4_PBT_pH6.0Geobacillus subterraneus PSS2 N67198.100.011481.38
MMMud_5_LB_pH6.8_aGeobacillus thermoleovorans strain ID-199.600.002213.60
MMMud_5_LB_pH6.8_bGeobacillus sp. 899.290.004432.39
MMMud_6_LB_pH6.8_aGeobacillus thermoleovorans strain ID-199.250.004383.58
MMMud_6_LB_pH6.8_bGeobacillus kaustophilus strain Et7/4 LG5285.430.090468.38
MMMud_7_LB_pH5Geobacillus thermoleovorans strain ID-195.900.028182.53
MMMud_8_LB_pH5Geobacillus vulcani PSS1 N68597.590.017042.12
Mungerannie station
MgnHotfil_LB_pH6.8Geobacillus vulcani PSS1 N68599.400.004031.88
Mgn_CCG_LB_pH5Anoxybacillus gonensis strain G2 AG-195.600.025001.68
Mgn_CCG_LB_pH8Anoxybacillus flavithermus AK198.930.005223.54
MCWH_LB_pH6.8Anoxybacillus flavithermus AK199.040.004733.52
MCWH_LB_pH8Brevibacillus thermoruber PM1 N69094.130.060612.92
Mulka
MR_LB_pH6.8Geobacillus kaustophilus GBlys99.860.001001.73
MR_LB_pH8Geobacillus thermodenitrificans strain OS2799.640.002121.41
Kopperamanna
Efil_LB_pH6.8_aAnoxybacillus ayderensis strain AB0498.400.009592.30
Efil_LB_pH6.8_bGeobacillus thermoleovorans strain ID-195.820.026102.71
Efil_PBT_pH6.0Anoxybacillus kamchatkensis strain G1094.900.028522.23
ECO3_PBT_pH6.0Geobacillus thermoleovorans strain KCTC 357098.080.011212.05
Etadunna station
Kanufil_LB_pH6.8Geobacillus vulcani PSS1 N68599.390.003941.94
Dulkaninna
Dufil_LB_pH6.8Anoxybacillus kamchatkensis strain G1094.920.028672.23
Clayton station
Clfil_LB_pH6.8Anoxybacillus ayderensis strain AB0498.430.009332.30
Clfil_LB_pH6.8Anoxybacillus ayderensis strain AB0494.360.032402.29
CLB_LB_pH6.8Anoxybacillus sp. BCO1 LR6897.340.016781.30
Lake Harry
LH_Mud_LB_pH6.8_aAnoxybacillus gonensis strain G2 AG-199.570.003791.88
LH_Mud_LB_pH6.8_bGeobacillus jurassicus NBRC 10782998.340.010921.73
LH_Mud_PBT_pH6.0Geobacillus thermoleovorans strain ID-199.040.006083.57
Extract of report generated by bioinformatics pipeline Currito3.1 DNA Fragment Analyser [32] for sample MMMud_3_LB_ph8. This image shows the first candidate closest match to the sample. The circular graph plotted in Circos [33] shows the BLAST alignment position against the reference genome. The outer black circle represents the candidate reference genome with size indicated in megabases (Mb); the middle blue circle shows aligned sequenced fragments obtained by complexity-reduced genotyping and the Inner green / red circle shows the percentage identity of the alignments, in which values below 95% are red and values equal or above 95% are green. The bar plots show the sequences obtained with and without BLAST alignments against the best reference, in which the X axis shows the sequences classification of sequences with and without hits to a reference or plasmid and the Y axis shows the total number of sequences. The histogram shows the percentage identity of BLAST alignments against the candidate reference, where the X axis shows the BLAST alignment percentage identity highest to lowest and the Y axis has the number of aligned sequences Bacterial identification results for isolates obtained from hot water systems and commercial compost, showing nearest matches based on BLAST alignment of complexity-reduced genotyping fragments. Average BLASTn percentage identity, nucleotide sequence distance values to best matches and genome coverage percentage are shown Bacterial identification results for isolates from the Great Artesian Basin, showing nearest matches based on BLAST alignment of complexity-reduced genotyping fragments. Average BLAST percentage identity, nucleotide sequence distance values to best matches and genome coverage percentage are shown A total of 16 bacterial isolates were collected from nine domestic hot water systems. Temperatures at which these were collected ranged between 60 °C to 84 °C. A total of 15 bacterial isolates were obtained from seven commercial garden compost sources. The composting temperatures were not recorded, although the range of temperatures occurring during the high-temperature phase of the composting process has been reported as between 40 °C to 78 °C [34]. All isolates derived from omestic hot water systems and compost were identified as belonging to the genus Geobacillus. A total of 18 bacterial isolates were obtained from filtered water samples of artesian bores in The Great Artesian Basin. The range of temperatures at which the water samples were collected was between 34 °C to 98 °C. The pH values ranged from 7.2 to 8.5. From the total, 15 bacterial isolates belonged to the genus Anoxybacillus and three isolates belonged to the genus Geobacillus. A total of 50 bacterial isolates were obtained from artesian bore drains in the Great Artesian Basin. Artesian bore drain sediments produced the greatest diversity of strains in this study. The temperatures at which sediment was collected ranged between 34 °C to 98 °C. A total of 26 bore drain bacterial isolates belonged the genus Anoxybacillus, 23 isolates belonged to the genus Geobacillus and one isolate belonged to the genus Brevibacillus.

Whole-genome sequencing

To test the accuracy of preliminary identifications produced by reduced-representation sequencing, eight samples were selected for whole-genome sequencing. The eight genome assemblies were chosen to form three groups, A, B and C, based on the similarity to the nearest sequenced reference, as determined by the reduced-representation sequence analysis. Group A included isolates for which the identifications showed high similarity (99.4–99.64%) to existing sequenced genomes. Group B included isolates with moderate relatedness (98.14–98.85%) to their nearest identified genome assemblies, and group C contained isolates more distantly related (< 94.13%) to any of the sequenced genome assemblies, representing potential new species. The assembly statistics for the draft genomes of the eight bacterial isolates are shown in Table 5. The identification results obtained from progressiveMauve [35, 36] alignment of the eight draft genomes were compared with those derived from the reduced-representation sequence tags, and with the identifications based on 16S rRNA gene sequence alignments. These results are shown in Table 6. The results presented in this table show that identifications obtained with reduced-representation sequencing agreed with the identifications provided by whole-genome sequencing in 100% of cases. The identifications produced by BLASTn alignment of 16S rRNA gene sequence to the same database differed from those provided by whole-genome sequencing in 37.5% of cases, and produced ambiguous identifications in 50% of cases.
Table 5

Whole genome sequencing assembly statistics and nearest relative based on progressiveMAUVE [35, 36]

GroupSample nameGenBank accessionReadscontigsLargest contigTotal lengthN50GC (%)Nearest relative based on MAUVE
AMR_LB_pH8SDLB000000003,295,260153166,1753,592,39961,99548.89Geobacillus thermodenitrificans strain KCTC3902
B_mud_LB_pH8SDLA00000000275,24760765,8753,434,851188,57752.02Geobacillus vulcani PSS1
BNFOSA3SDLE000000002,067,951111273,7243,334,68772,14342.13Geobacillus galactosidasius strain DSM 18751
DSP4aSDLD00000000885,071188271,1233,273,23858,16052.32Geobacillus lituanicus strain N-3
CHMud_LB_pH8SDLG00000000636,45667226,0122,712,590137,05341.81Anoxybacillus sp. 103
Efil_LB_pH6.8SDLH00000000777,21153596,3762,794,302321,22941.90Anoxybacillus ayderensis strain AB04
MMMud_3_LB_pH8SDLC000000001,174,2461196233,4554,372,94360,62355.27Geobacillus subterraneus PSS2
CMCWH_LB_pH8SDLF000000001,349,927124345,5783,934,072177,95756.20Anoxybacillus flavithermus strain B4168
Table 6

Comparison of bacterial identification methods showing percentage identity for complexity-reduced genotyping based on BLASTn alignment; whole-genome sequencing best matches using progressiveMauve [35, 36] alignment tool; and the best matches obtained with 16S rRNA gene alignment, including multiple results per sample with equal highest bitscore and percentage identity

GroupSample nameComplexity-reduced genotyping best match% IDWhole genome sequencing best match resultMauve mean similarity profile value16 s rRNA in silico best match result% ID
AMR_LB_pH8Geobacillus thermodenitrificans strain OS2799.64

Geobacillus thermodenitrificans

strain KCTC3902

13,618.31Geobacillus thermodenitrificans strain KCTC3902100.00

Geobacillus sp.

PA-3 GEPA3

100.00
Geobacillus thermodenitrificans NG80–2100.00
B_mud_LB_pH8

Geobacillus vulcani

PSS1

99.40

Geobacillus vulcani

PSS1

12,550.22

Geobacillus vulcani

PSS1

100.00

Geobacillus sp.

FW23

100.00
BNFOSA3Geobacillus galactosidasius strain DSM 1875198.85Geobacillus galactosidasius strain DSM 187518003.04Geobacillus galactosidasius strain DSM 18751100.00
DSP4a

Geobacillus lituanicus

strain N-3

98.55

Geobacillus lituanicus

strain N-3

4299.05Geobacillus stearothermophilus strain FHS-PHGT51100.00
Geobacillus stearothermophilus strain DSM 458100.00
Geobacillus stearothermophilus strain GS27100.00

Geobacillus sp.

Sah69

100.00
Geobacillus stearothermophilus ATCC 12980100.00
Geobacillus stearothermophilus ATCC 7953100.00
CHMud_LB_pH8Anoxybacillus sp. 10398.43Anoxybacillus sp. 1037800.76Anoxybacillus kamchatkensis strain G10100.00
Efil_LB_pH6.8_a

Anoxybacillus ayderensis

strain AB04

98.40

Anoxybacillus ayderensis

strain AB04

10,221.31Anoxybacillus kamchatkensis strain G1099.66
MMMud_3_LB_pH8Geobacillus subterraneus PSS298.14Geobacillus subterraneus PSS23684.00

Geobacillus icigianus

strain G1w1

99.76
Geobacillus subterraneus PSS2 N67199.76
CMCWH_LB_pH8

Brevibacillus thermoruber

PM1 N690

94.13

Brevibacillus thermoruber

423

3758.04

Brevibacillus thermoruber

PM1 N690

99.35
Whole genome sequencing assembly statistics and nearest relative based on progressiveMAUVE [35, 36] Comparison of bacterial identification methods showing percentage identity for complexity-reduced genotyping based on BLASTn alignment; whole-genome sequencing best matches using progressiveMauve [35, 36] alignment tool; and the best matches obtained with 16S rRNA gene alignment, including multiple results per sample with equal highest bitscore and percentage identity Geobacillus thermodenitrificans strain KCTC3902 Geobacillus sp. PA-3 GEPA3 Geobacillus vulcani PSS1 Geobacillus vulcani PSS1 Geobacillus vulcani PSS1 Geobacillus sp. FW23 Geobacillus lituanicus strain N-3 Geobacillus lituanicus strain N-3 Geobacillus sp. Sah69 Anoxybacillus ayderensis strain AB04 Anoxybacillus ayderensis strain AB04 Geobacillus icigianus strain G1w1 Brevibacillus thermoruber PM1 N690 Brevibacillus thermoruber 423 Brevibacillus thermoruber PM1 N690

Discussion

The results showed a complete agreement of the progressiveMauve [35, 36] whole-genome identifications and those obtained from the reduced-representation sequence alignments for all eight bacterial isolates. ProgressiveMauve [35, 36] can perform comparative genome alignment of two or more genomes, identifying and aligning conserved genomic DNA regions. ProgressiveMauve [35, 36] identifies locally colinear blocks (LCBs), which are blocks of unbroken sequence homology between genomes. The progressiveMauve [35, 36] algorithm uses an iterative process to identify and refine the boundaries of LCBs identified between genomes. ProgressiveMauve [35, 36] alignment is able to recognize homologous regions in the presence of multiple complex rearrangements and provides a valuable tool for analysis of sequence homology between species and strains [35, 36]. Similarity profile values, corresponding to the average level of nucleotide sequence conservation within regions of local alignment, are calculated by progressiveMauve [35, 36] to be inversely proportional to the average alignment column entropy within the region [35]. To determine a pairwise similarity value between genomes, each of the eight genome assemblies was aligned against the complete set of available RefSeq Anoxybacillus Geobacillus and Brevibacillus assemblies in a pairwise manner. The mean similarity profile value from each progressiveMauve [35, 36] pairwise alignment was calculated. These values provide a comparative measure of the similarity of each pair of genomes, averaged over all aligned sequence regions between the genome pair. In each case the same species was identified as the closest match, and in seven of eight cases, the same assembly was identified. For the isolate MCWH_LB_pH8 the two methods identified different assemblies of Brevibacillus thermoruber. In all eight cases the progressiveMauve [35, 36] whole-genome identifications and the reduced-representation sequence results identified a single closest matching candidate assembly. The identification results from the 16S rRNA gene sequences sometimes identified multiple closest matching assemblies of equal bitscore and percentage identity. Out of the eight isolates, three did not produce the same species identification results between progressiveMauve [35, 36] and 16S rRNA gene alignment. Additionally, four out of the eight isolates did not identify a single best candidate assembly, based on bitscore and percentage identity. The genome sequences showed that the best 16S rRNA gene alignment did not always match the results from whole-genome sequencing; however, misalignments may indicate an intensive horizontal gene transfer or genome rearrangements rather than phylogenetic diversity. The sequence fragments produced by reduced representation sequencing represented coverage of approximately 2.3% of the genome, derived from up to 2500 individual fragments, depending on the combination of restriction enzymes used. This number of fragments can be sequenced to a read depth of 40x using 100,000 reads per assay, and the sequence barcoding system allows for multiplexing of up to 2300 assays. The volume of sequencing required to achieve full coverage of the reduce-representation fragments is very low in comparison to the sequencing output of current next-generation sequencers. A single lane of a HiSeq 2500 v4 flow-cell could be used to sequence assays for 2300 isolates with a read depth of >40x across the available fragments. Alternatively, for lower throughput processing, 250 assays could be processed in a MiSeq v3 flow-cell at a 40x read depth. The assay cost per sample, including library construction and sequencing would be expected to be 7 dollars (USD) [37].

Comparison of identification methods

DNA sequence-based bacterial identification has relied almost exclusively on partial or complete 16S ribosomal RNA gene sequencing [38-42]. In spite of the ubiquitous use of 16S sequence data, the limitations of this approach are well established [43]. One of the first problems identified with this technique was the difficulty of primer design, necessitating attempts at creation of ‘universal’ primers, ideally capable of amplifying a portion of the 16S rRNA gene from any bacterial isolate [44]. In practice, multiple primer pairs may sometimes need to be trialled to obtain successful amplification from any given isolate. Another issue relates to the limited resolution of the identification information provided by 16S rRNA gene sequencing [43]. The original rationale for the choice of the 16S rRNA gene for use in bacterial identification is based on the need for a balance between sequence conservation versus sequence diversity. Sequence similarity between taxa must be sufficient for priming and PCR amplification, but sequence variability must be sufficient to provide resolution between taxa for identification purposes. In practice the resolution provided by 16S rRNA sequence data can be insufficient for species delimitation as they may be identical between species [45]. Effectively, the sequence similarity of the 16S locus may not be a surrogate for the similarity of the genome as a whole [46]. Obtaining a whole-genome sequence is clearly the best option to identify bacterial isolates and determine their nearest relatives; however, the costs involved mean that it is generally not practical to do this for all of bacterial isolates in a study. An alternative identification method of sequencing complexity-reduced genomic representations could provide a potential replacement, avoiding some of the limitations. This study has shown that reduced-representation sequencing can provide fine scale identification information, most importantly, with complete agreement to whole-genome sequence information in terms of identification for the bacterial isolates tested in this study. Reduced-representation sequences can be produced for any organism, without need for prior sequence information, and with no prior knowledge of taxonomic affinities required.

Conclusions

This study clearly demonstrates the accuracy of the identifications based on reduced-representation sequencing. The eight isolates were selected for whole-genome sequencing to test the accuracy of the preliminary identifications. In each case the closest matching genomes identified by reduced-representation sequencing agreed completely with the identifications provided by whole-genome sequencing. The identifications provided by 16S rRNA gene sequence alignment, were in agreement for some of the isolates but differed from the whole-genome-based results for others. The 16S rRNA gene results also identified multiple accessions or even multiple species with equal distance in 50% of cases, failing to identify a single best candidate from the sequence database. The method of reduced-representation sequencing has been successfully applied in identification of bacterial isolates in a medical microbiology context [37] and this is the first successful use for identification of thermophilic bacterial isolates. The genome coverage obtained in this study ranged between 1.10 to 4.54%. This coverage is derived from short fragments obtained from across the entire genome, rather than longer consecutive regions, as shown in the circular graph plotted by Circos [33] from Fig. 1. This potentially allows detection of horizontal transfer between taxa; however, in practice many horizontal gene transfer events may be undetectable in the results produced from this method. Further work will need to be done to clarify the limits of horizontal gene transfer detection and the implications for bacterial identifications. Reduced-representation sequencing is equally well suited for use with small numbers of isolates or with large batch processing of thousands of assays. The growing database of available bacterial genome sequences provides an excellent resource for alignment of reduced-representation sequences for identification purposes, and as the available genomes continue to grow, the technique will become more effective with time.

Materials and methods

Sampling

Hot water systems

Water samples were collected from nine domestic hot water systems with at least five years of operation in the region of the ACT, Australia. A volume of 1.5 L was collected into sterile containers, temperature was recorded, and bottles were transported without refrigeration. The water samples were filtered with sterile membranes of 0.20 μm pore size (Nalgene™ Rapid-Flow™, PES Membrane Cat. No. 6.302336, type 565, ThermoFisher Scientific, Australia), connected to a vacuum pump, to capture any bacteria present.

Commercial composts

Samples were collected from seven commercial composts. The composts were made from various combinations of animal manure and plant matter. Samples taken from bagged compost were placed in 50 ml sterile falcon tubes and transferred to the laboratory for inoculation.

Artesian bores in the great Artesian Basin, South Australia

A total of 10 water samples were collected from the tap at the bore head into sterile bottles from selected bore locations on the Birdsville Track, South Australia (Fig. 2 and Supplementary Material 1). A volume of 1.5 L was taken, temperature was recorded, and water samples were filtered to collect any bacteria present, using the same method described above for hot water systems. In addition, sediment samples were collected from each of the bore drains at various distances from the bore head along the temperature gradient, especially where colonies of microorganisms were visible. These were collected into 50 ml sterile falcon tubes.
Fig. 2

Sampling locations of 11 water-bores of The Great Artesian Basin showing groundwater temperatures derived from Habermehl and Pestov (2002) [14]. Inset map shows larger position of larger map within Australia

Sampling locations of 11 water-bores of The Great Artesian Basin showing groundwater temperatures derived from Habermehl and Pestov (2002) [14]. Inset map shows larger position of larger map within Australia

Medium composition and cultivation

Four types of solid culture media were prepared using two different nutrient recipes and a series of pH values. The first nutrient recipe contained LB medium, and was prepared by dissolving 20 g Lb Broth (Sigma-Aldrich L3022) and 5 g Gelzan™ CM Gelrite® solidifying agent (G1910 Sigma-Aldrich) in 900 ml of miliQ water, then filling up with miliQ water to 1000 ml. Culture media were adjusted for final pH values of 5.0, 6.8 and 8.0. The second nutrient recipe combined 1.0 g L− 1 yeast extract, 1.0 g L− 1 tryptone, with a basal medium containing 1.3 g (NH4)2SO4, 0.47 g K2HPO4.3H2O, 0.25 g MgSO4.7H2O, 0.07 g CaCl2 and 1 ml of trace element solution [47]. This was prepared by dissolving all components with 5 g Gelzan™ CM Gelrite® solidifying agent in 900 ml of miliQ water, then filling up with miliQ water to 1000 ml and adjusting to pH 6.0. All media were autoclaved at 121 °C for 20 min. The four variations of culture media and pH were inoculated with each of the compost, filter strips, and sediment samples, and all cultures were incubated at 62.5 °C. Filter membranes were removed from the filter, cut into strips with a sterile scalpel blade, and placed inverted on the surface of the solid media. Similarly, approximately 1 g of compost/sediment was scattered on the surface of the media. Humidity conditions for the culture plates were controlled by adding sterile wet gauzes inside of a sealed plastic box that contained all Petri dishes. Bacterial growth was observed within a range of time from 48 to 72 h. Individual colonies identified were isolated by at least three passages of subculturing from single cell derived colonies.

Library preparation and sequencing

DNA extractions were performed for all bacterial isolates using the chloroform-isoamyl alcohol method [37]. The library preparation was done following the DArTseq™ methods, in which the DNA was digested with pairs of restriction enzymes, in this case, PstI with HpaII and PstI with MseI respectively, and PCR adapters were ligated to the fragments. Two adapters were used, one corresponding to each restriction enzyme. The adapter design included Illumina flow-cell specific sequences required for bridge PCR in cluster generation, as well as a barcode region to enable sample multiplexing. The adapters were designed such that only fragments with differing restriction sites at each end were capable of cluster generation. Digestion/ligation was followed by PCR amplification according to Georges et al. (2018). Final PCR products were stored at 4 °C. A post PCR quality control was performed by agarose gel electrophoresis. An equal volume of PCR product from each sample was pooled and then purified with a QIAGEN QIAquick PCR Purification Kit Cat. ID: 28106 (QUIAGEN, Chatstone, Victoria, Australia). For each sample, libraries made with both complexity-reduction methods were pooled together for sequencing. Clustering was done according to Illumina HiSeq SR Cluster Kit V4 recipe v9.0 and HiSeq SR Flow Cell v4 (Illumina Inc., San Diego CA, US). For sequencing, the Flow Cell was loaded according to the Illumina protocols on a HiSeq 2500 sequencer, using HiSeq SBS kit v4 for a total of 77 cycles [30]. A control was also sequenced, using the genomic DNA of Eschericha coli O157 (EDL 933) IRMM449 Sigma-Aldrich (Castle Hill, NSW, Australia) certified reference standard, GenBank [47] accession number AE005174.2, genome size of 5,639,399 bp [31]. This control was processed for library construction, sequencing and analysis using methods identical to those used for all other bacterial isolates.

Data analysis

Data from the sequencer in the form of FastQ files was processed according to the methods described in Talamantes-Becerra et al. (2019). Briefly, sequences were filtered by PHRED quality score, barcode sequences were removed and identical sequences were recognised and collapsed into ‘fastQcol’ files, which contain each unique sequence present in the original FastQ file, along with the respective read counts and the mean quality score at each base [48]. The reverse adapters which were present on sequences derived from fragments shorter than 69 bp were identified and trimmed, resulting in sequences from 30 bp to 69 bp. Data for each of the complexity reduction methods was processed with the analytical pipeline Currito3.1 DNA Fragment Analysis Software [32], which we developed specifically for analysing reduced-representation sequences from bacteria. The details of this software pipeline are described in Talamantes-Becerra et al. (2019). Briefly, a BLASTn alignment [49] of the trimmed sequence tags from each sample against all complete bacterial genome assemblies from the NCBI RefSeq database is performed, to identify the best candidate bacterial genomes for each sample. The BLASTn parameters were used were: word size 12, bitscore 50, evalue 0.000001, percentage identity 80, percentage query cover 80%. Candidate matching genome assemblies are selected according to the number of sequence tags obtaining a best or equal best BLASTn hit to each reference, as measured by bit score. After identifying candidate genomes, the trimmed sequences from each sample are aligned by BLASTn against the top three closest identified genomes individually. The Currito3.1 [32] pipeline uses the NSD calculation, shown in the following equation to determine the best matching candidate genome for each sample, based on the BLASTn alignments described above. NSD is a DNA sequence distance measurement considering identities (I), substitutions (S) and gap openings (G) across all aligned sequences to produce a global distance value [50]. Closer relatedness to a reference genome is associated with lower NSD values.

Confirmation of strain identification results by whole-genome sequencing

The genomic DNA of selected isolates was purified using the Agencourt AMPure XP (Beckman Coulter Inc., Brea, CA, US) genomic DNA purification beads protocol. The volume of Agencourt AMPure XP beads used for purification was 0.4 x genomic DNA volume. Briefly, genomic DNA clean-up was done as follows: selected volumes of AMPure XP beads and genomic DNA was mixed by pipette, then incubated for 5 min at room temperature. Sample tubes were placed onto the magnetic plate for 2 min, and supernatant was discarded carefully leaving 5 μL. Then 200 μL of freshly prepared 70% ethanol was added to the beads, incubating for 30 s before removal of the supernatant over the magnetic plate. This wash step was repeated. After the second wash, ethanol was completely removed and beads were allowed to dry. Samples were removed from the magnetic plate, eluted in 35 μL of EB buffer (10 mM Tris-Cl, pH 8.5), and mixed by pipette 10 times. Sample tubes were located again onto the magnetic plate for 1 min and elution buffer was transferred into a new tube. The whole-genome sequencing service was provided by MicrobesNG, IMI – School of Biosciences, University of Birmingham, United Kingdom. Libraries for whole-genome sequencing were prepared with the Nextera XT Library Prep Kit (Illumina, San Diego, USA), then quantified with the Microlab STAR automated liquid handling system. All libraries were quantified and pooled for sequencing with the Kapa Biosystems Library Quantification Kit for Illumina on a Roche light cycler 96 qPCR machine and were sequenced on the Illumina HiSeq2500 using a 250 bp paired end protocol. Sequencing depth for all bacterial isolates was 30X. To process the sequence data, Trimmomatic 0.30 [51] was used for trimming reads, SPAdes version 3.7 [52] was used for de novo assembly, and Prokka 1.11 [53] was used for annotation of contigs. The resulting draft genome assemblies for the 8 isolates were assessed using the software tool Kraken [54], which utilises DNA sequence K-mer alignments to determine taxonomic affinities. The Kraken results placed all isolates within the same genera reported by the Currito3.1 [32] analytical pipeline. The draft genome assemblies were then aligned against all available Anoxybacillus, Geobacillus and Brevibacillus assemblies totalling 31, 72 and 63 genome assemblies respectively, from GenBank [55] database up to December 2018, to identify the closest matching genome assembly for each isolate. For alignment of the whole-genome assemblies, the software progressiveMauve [35, 36] was used. For each of the eight draft genomes, the pairwise alignment which resulted in the highest mean similarity profile value was considered the closest matching genome. The identifications obtained in this way were used to test the accuracy of the identifications obtained using the method under current investigation, based on reduced-representation sequencing. In addition to the genome alignments, the complete 16S rRNA gene CDS sequence was also used to identify the closest genome assembly from amongst the same set of available RefSeq [55] assemblies. For each of the eight isolates, the 16S rRNA gene sequence was obtained from the draft genome assembly using the annotation produced by Prokka 1.11 [53]. The complete 16S rRNA gene sequences were aligned by BLASTn against each of the RefSeq Anoxybacillus, Geobacillus and Brevibacillus genome assemblies in order to find the best or equal best matches for the 16S rRNA gene, along with percentage identity values. Additional file 1.
  42 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  Circos: an information aesthetic for comparative genomics.

Authors:  Martin Krzywinski; Jacqueline Schein; Inanç Birol; Joseph Connors; Randy Gascoyne; Doug Horsman; Steven J Jones; Marco A Marra
Journal:  Genome Res       Date:  2009-06-18       Impact factor: 9.043

Review 3.  Exploration of extremophiles for high temperature biotechnological processes.

Authors:  Skander Elleuche; Christian Schäfers; Saskia Blank; Carola Schröder; Garabed Antranikian
Journal:  Curr Opin Microbiol       Date:  2015-06-09       Impact factor: 7.934

4.  Identification of bacterial isolates from a public hospital in Australia using complexity-reduced genotyping.

Authors:  Berenice Talamantes-Becerra; Jason Carling; Karina Kennedy; Michelle E Gahan; Arthur Georges
Journal:  J Microbiol Methods       Date:  2019-03-17       Impact factor: 2.363

Review 5.  Towards a sustainable biobased industry - Highlighting the impact of extremophiles.

Authors:  Anna Krüger; Christian Schäfers; Carola Schröder; Garabed Antranikian
Journal:  N Biotechnol       Date:  2017-05-13       Impact factor: 5.079

6.  Geobacillus galactosidasius sp. nov., a new thermophilic galactosidase-producing bacterium isolated from compost.

Authors:  Annarita Poli; Giusi Laezza; Reyhan Gul-Guven; Pierangelo Orlando; Barbara Nicolaus
Journal:  Syst Appl Microbiol       Date:  2011-06-08       Impact factor: 4.022

7.  Exploration and isolation of novel thermophiles in frozen enrichment cultures derived from a terrestrial acidic hot spring.

Authors:  Hiroyuki D Sakai; Norio Kurosawa
Journal:  Extremophiles       Date:  2016-02-10       Impact factor: 2.395

8.  progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement.

Authors:  Aaron E Darling; Bob Mau; Nicole T Perna
Journal:  PLoS One       Date:  2010-06-25       Impact factor: 3.240

9.  High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries.

Authors:  Chirag Jain; Luis M Rodriguez-R; Adam M Phillippy; Konstantinos T Konstantinidis; Srinivas Aluru
Journal:  Nat Commun       Date:  2018-11-30       Impact factor: 14.919

10.  Kraken: ultrafast metagenomic sequence classification using exact alignments.

Authors:  Derrick E Wood; Steven L Salzberg
Journal:  Genome Biol       Date:  2014-03-03       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.