Literature DB >> 35735095

A genome-wide association study of the occurrence of genetic variations in Edwardsiella piscicida, Vibrio harveyi, and Streptococcus parauberis under stressed environments.

HyeongJin Roh1.   

Abstract

Bacterial mutation and genetic diversity in aquaculture have led to increasing phenotypic variances, which can weaken or invalidate strategies for controlling diseases. However, few studies have monitored the degree of mutation in fish bacterial pathogens caused by environmental pressure within a short period. In this study, transcriptomic sequences from Edwardsiella piscicida, Vibrio harveyi and Streptococcus parauberis under stressed environments were used for investigating the emergence of variants. In detail, a sub-inhibitory concentration of formalin and phenol for E. piscicida, sea water at 30°C for V. harveyi and flounder serum for S. parauberis were used as stressed environments, and significant single-nucleotide polymorphisms (SNPs) and/or mutation sites were investigated after culture in the ordinary liquid media (control) and the stressed environment through a genome-wide association study. As results, several SNPs or mutations during incubation were observed under different environments in E. piscicida and/or V. harveyi in the genes relevant to flagella, fimbria type 3 secretion systems, and outer and inner membranes that have been directly exposed to external environments. In particular, given that flagella and fimbriae are considered important factors in differentiating the serotypes in some bacterial pathogens, it can be speculated that different environmental pressures are the source of phenotypic or serotypic differentiation from the same origin. On the other hands, S. parauberis did not exhibit notable changes for 4 h when inoculated in the serum from olive flounder. The results presented in this study provide examples of possible molecular evolution in pathogens relevant to the aquaculture industry as a response to different environmental pressure.
© 2022 The Author. Journal of Fish Diseases published by John Wiley & Sons Ltd.

Entities:  

Keywords:  GWAS; SNP; bacteria mutation; environmental pressure; genetic variance

Mesh:

Year:  2022        PMID: 35735095      PMCID: PMC9541752          DOI: 10.1111/jfd.13668

Source DB:  PubMed          Journal:  J Fish Dis        ISSN: 0140-7775            Impact factor:   2.580


INTRODUCTION

Bacterial diseases are the major threats to farmed fish in aquaculture, and multiple studies have been conducted to develop methods for controlling them (e.g., Roh et al., 2016; Seo et al., 2021; Zhang et al., 2021). However, the damage caused by bacterial diseases in aquaculture has continued, and the ability to alter their genetic makeup much faster than eukaryotes is regarded as a major obstacle (Figueroa et al., 2019; Kim et al., 2021; Roh et al., 2019). For example, the emergence and spread of multi‐drug‐resistant bacteria in aquaculture where bacteria are required to survive against antibiotics can be originated from the pressure to evolve by modifying genes that help them adapt to the newly encountered situation (Algammal et al., 2022; Nadella et al., 2022). In addition, because most bacterial pathogens in aquaculture can be horizontally transmitted through water where many hydrophilic pollutants and chemicals are easily dissolved, different water environments can accelerate the occurrence of bacterial mutations. It is therefore very important to understand the process or patterns of genetic changes among genomes differentiated from the same strain. In recent years, the importance of interpretation and analysis of massive data has received an enormous amount of attention as technological advances are made in the field of sequencing technology, and studies investigating the genotypic characteristics between strains at the genome level have been conducted (e.g., Le et al., 2021; Roh et al., 2019, 2020; Roh & Kim, 2021). Because big data from next‐generation sequencing (NGS) may contain valuable multidimensional results, these can be interpreted from multiple perspectives. However, in numerous studies, massive sequencing results have mainly been interpreted in a manner consistent with their purposes, which means some results might be important but are overlooked or underestimated (Lee et al., 2021; Montánchez et al., 2019; Yoon et al., 2020). Hence, making the most of released sequences in an open database such as a sequence read archive (SRA) and Genbank is becoming more important, given the release of a tremendous amount of sequencing results every day. Among the methods of bacterial genomic study, the pan‐genome method that profiles genomic differences by the existence of certain genes has been widely used and contributes to understanding the relationship between phenotypic and genotypic characteristics of bacterial pathogens to a certain extent (Kim et al., 2020; Roh & Kim, 2021; Rouli et al., 2015). However, because pan‐genome analysis is based on the existence of genes classified as core, accessory, and singleton genes, rather than the sequence differences which are annotated as the same gene (Blom et al., 2016; Zekic et al., 2018), it is not suitable for analysing the occurrence of different single‐nucleotide polymorphisms (SNPs) or mutations by strain. Even though emerging mutations and/or SNPs in certain genes or locations might be an earlier response than the differences in accessory genes, studies on SNPs and deletion and/or mutations caused by different environmental niches have been underestimated. To overcome these shortcomings, genome‐wide association studies (GWASs) offer a valuable alternative. GWAS is a method for excavating significant SNPs associated with phenotypic differences by profiling all genetic variants between phenotypic groups, not at the level of genes but at each nucleotide (Lees & Bentley, 2016). The advent of GWAS in bacteria has provided insight and understanding into SNPs associated with phenotypic characteristics (Chen & Shapiro, 2015). To estimate the occurrence of SNPs or mutated sequences when the same strain is cultured under different environments through GWAS, simultaneous deep sequencing of raw files from a large number of bacterial genomes is necessary (Wang et al., 2015). Thus far, few studies have examined genomic DNA sequences from the same strain cultured in different environments, although some studies have investigated transcriptomic responses for major bacterial pathogens in aquaculture (Vibrio harveyi, Edwardsiella piscicida and Streptococcus parauberis) (Lee et al., 2021; Montánchez et al., 2019; Yoon et al., 2020). However, the information on nucleic acid sequences in transcriptomics that may contain promising pieces of evidence for the emergence of mutated sequences can be valuable themselves. V. harveyi, E. piscicida and S. parauberis are well‐known bacterial pathogens in aquaculture, and both raw sequencing data (FastQ file) and experimental designs have been published (Lee et al., 2021; Montánchez et al., 2019; Yoon et al., 2020). In addition, their transcriptome responses have been known to change remarkably under sea water when exposed to high incubation temperature, pollutants or host serum environments, compared with cultures in liquid media at an optimal growth temperature (Lee et al., 2021; Montánchez et al., 2019; Yoon et al., 2020). These transcriptomic changes are evidence that bacterial pathogens are strongly influenced by environmental factors. Based on these studies, the purposes of this study were to (1) analyse RNA‐seq raw files from several fish bacterial pathogens in various environments with a focus on mutations and SNPs, (2) identify genes that exhibit significant SNPs and mutations by different groups and (3) speculate on the microbiological meaning of these changes.

MATERIALS AND METHODS

Samples and FastQ files information

In total, 41 FastQ files for transcriptomic studies on E. piscicida, V. harveyi and S. parauberis have been downloaded from a SRA using the SRA‐tool kit (Ver. 2.11.3‐ubuntu64). The SRR accessions used in this study are summarized in Table 1 (Lee et al., 2021; Montánchez et al., 2019; Yoon et al., 2020). To observe the emergence of genetic variants stimulated by sub‐lethal chemicals in E. piscicida, a study on E. piscicida CK108 transcriptomics that compared a control cultured in ordinary liquid media (brain and heart infusion) and treatments cultured in a media supplemented with either 0.0039% formalin or 0.146% phenol were used in this study (Yoon et al., 2020). In the case of V. harveyi, this study focussed on what kind of genetic variances occurred over time in the sea water with high water temperature compared with V. harveyi grown in sufficient nutrients and at the optimal temperature. In detail, the transcriptomic results of ATCC®14126™ grown in the marine broth at 26°C overnight were used as the control, while groups of V. harveyi transferred in sea water (SW) at 30°C for 12 h, 3 and 6 days were regarded as treatment groups for GWAS (Montánchez et al., 2019). Likewise, the possibilities of genetic variances in S. parauberis under liquid media and serum environment with the elapsed time were investigated in this study. In more detail, transcriptomic results of S. parauberis SPOF3K cultured in liquid media (brain and heart infusion supplemented with 1% NaCl) and serum from olive flounder (Paralichthys olivaceus; ~200 g) for 1, 2 and 4 h were utilized (Lee et al., 2021). The summaries of the experiment designs and comparison groups for GWAS are presented in Table 1.
TABLE 1

Bacteria species, comparison groups for genome‐wide association studies, experimental conditions, SRR accession numbers, and references used in this study

SpeciesGWAS comparisonExperimental conditionsSRR accessionReference
Edwardsiella piscicida

Control vs. formalin group

Control vs. phenol group

E. piscicida CK108 was incubated in liquid media (brain heart infusion) for 24 h at 27°C with 200 rpm shaking as the control (n = 3). The formalin (n = 4) and phenol treated groups (n = 4) were cultured under the same conditions but supplemented by formalin (final concentration = 0.0039%; 0.38 mM) and phenol (final concentration = 0.146%; 1. 3 mM)SRR11652662‐SRR11652676Yoon et al. (2020)
Vibrio harveyi

Control vs. 12 h seawater incubation at 30°C (12 h)

Control vs. 3 days seawater incubation at 30°C (3 days)

Control vs. 6 days seawater incubation at 30°C (6 days)

V. harveyi ATCC® 14126™ were aerobically cultured overnight in liquid media (marine broth) at 26°C as the control group (n = 3). The same V. harveyi strain which had been cultured for 12–16 h incubation in the same culturable environment as the control was diluted by sterile seawater (1:20; 20 folds dilution) and incubated at 30°C for 12 h, 3, and 6 daysSRR7058340‐SRR7058351Montánchez et al. (2019)
Streptococcus parauberis

1 h broth vs. 1 h serum

2 h broth vs. 2 h serum

4 h broth vs. 4 h serum

S. parauberis SPOF3K grown in liquid media (brain and heart infusion with 1% NaCl) at 26°C for 18 h with 180 rpm shaking was incubated in fresh liquid media or serum achieved from Olive flounder (~200 g). The tri‐replicated cultures in both broth (control) and serum (treatment) were sampled at 1, 2, and 4 h post‐incubationSRR13349544‐SRR13349561Lee et al. (2021)
Bacteria species, comparison groups for genome‐wide association studies, experimental conditions, SRR accession numbers, and references used in this study Control vs. formalin group Control vs. phenol group Control vs. 12 h seawater incubation at 30°C (12 h) Control vs. 3 days seawater incubation at 30°C (3 days) Control vs. 6 days seawater incubation at 30°C (6 days) 1 h broth vs. 1 h serum 2 h broth vs. 2 h serum 4 h broth vs. 4 h serum

Sequence analysis

The adapter and raw quality of reads in all FastQ files used in this study were trimmed using Trim Galore! (Ver. 0.6.7+ galaxy0) under the conditions of a Phred quality score threshold = 20, maximum allowed error rate = 0.1, and discarding reads that became shorter than 20 bp (Krueger, 2021). The genome sequences of E. piscicida CK41 (Genbank accession: CP047671.1), V. harveyi ATCC 14126 (Genbank accession: NZ_BCUF01000001.1) and S. parauberis SPOF3K (CP025420.1) downloaded from GenBank were indexed, and the trimmed FastQ files were mapped onto each indexed genome using Bowtie2 (Ver. 2.4.1) (Langmead & Salzberg, 2012). The SAM or BAM file was converted to a sorted BAM file using the sort option of samtools (Ver. 1.9) (Li et al., 2009). Based on the comparison groups of GWAS in E. piscicida, V. harveyi and S. parauberis described in Table 1, multiple sorted BAM files were combined to generate a bcf file in the conditions of ‐Ob, ‐m and‐‐ploidy 1 using the mpileup option of bcftools (Danecek et al., 2016). Likewise, all sorted BAM files for each species were merged, and its bcf file was used to observe correlation missing genotypes based on the identity‐by‐missingness (IBM) between samples. All bcf files were converted to vcf files for further analyses using bcftools, and variant sequence frequency (VSF) was calculated for minor variant sequences from each comparison of vcf file.

SNP calling, genome‐wide association analysis and bioinformatics

All genome‐wide association analyses used in this study were conducted according to the guidelines of second‐generation PLINK (Ver. 1.9) (Chang et al., 2015). In brief, SNPs and/or mutations were identified with a threshold of 0.01 minor allele frequency (MAF), and ped and map files were generated using vcftools (Danecek et al., 2011). The label information and ped and/or map files were then merged, and bed, fam and bim files were used to locate missing SNPs and SNPs that exhibited significant differences by treatment with‐‐allow‐no‐sex mode. To analyse the correlation of missing genotypes in each bacterial species, the bcf file containing all sorted BAM files of the same species was used. Conversely, for GWAS, the merging file containing only the sorted BAM files relevant to a GWAS comparison was used. A p‐value less than .05 for each GWAS comparison was considered a significant SNP.

Data analysis, visualization and statistics

The results of the correlation of missing genotypes between samples based on IBM were employed to identify the differences among all aligned sequences in each bacterial species, and hierarchical clustering with the complete method was performed using the ape package in R (Paradis & Schliep, 2019). The location and annotation of coding DNA sequences in each reference genome were predicted using RastTk pipeline (Brettin et al., 2015). All nucleic acid sequences that exceeded 0.01 MAF and SNPs belonging to genes that possessed the first to third highest numbers of significant SNPs were visualized in a Manhattan plot using qqman and ggplot2 library in R (Ver. 4.0.5) (Turner, 2014; Wickham, 2016). The gene with at least one significant SNP was mapped onto KEGG pathways using the KEGG automatic annotation server to trace the biological pathways that harboured genes with SNPs identified in each GWAS comparison (Moriya et al., 2007). For this, a single‐directional best hit mode was used, and several data sets of reference bacterial species (eic and etr; E. piscicida, the same or closest species to E. piscicida (eic and etr), V. harveyi (vch, vcj, vco, vcm, vvu, vvy, vvm, vpa, vha, vex, vsp and van), and S. parauberis (spy, spn, sag, smu, stc, ssa, ssb, sgo, sez, sub, sds, sga, smb and std) were selected as the reference databases for the KEGG annotation. The ratio for the number of genes in featured pathways where more than three genes have been harboured was visualized using the Enrichment map in Cytoscape (Ver. 3.7.2) (Merico et al., 2010; Shannon et al., 2003). The significant difference in the number of genes that have at least one SNP in each KEGG pathway has been identified by the chi‐square analysis using a gmodels package in R (Warnes et al., 2015).

RESULTS

SNPs and GWAS for E. piscicida

Although one control sample of E. piscicida (E. piscicida WT_3) was located relatively far from the other two samples, most samples were well‐clustered in accordance with the phenol and formalin treatments (Figure S1a). The number of SNPs that exceeded 0.01 MAF was 1721 and 2731 in the formalin and phenol groups, respectively, and 514 and 512 were significant SNPs (p < .05) (Figure S1b). The number of significant SNPs belonging to each gene was then counted based on the location of significant SNPs and genes in E. piscicida, and 15 genes (malS, TcfC, MrdA, ClC, nrfE, SDM, L‐Asc family, dppA, fucP, tolC, TREM2, LcrD, DgcZ, MetC and FimD; all abbreviations are shown in Table 2) were found to contain more than 3 significant SNPs in either the formalin or the phenol comparison (Table 2). Among these genes, MrdA, TcfC and fucP had 6, 4 and 4 significant SNPs, which were the first to third highest genes between the control and formalin groups (Figure 1A). For the control and phenol comparison, both genes (MrdA and LcrD) contained highest number of significant SNPs (n = 6), and fucP was the third highest (n = 5) (Figure 1A). Similar or identical numbers of SNPs were observed in 8 of 15 genes in both formalin and phenol groups; however, seven genes (L‐Asc family, dppA, fucP, tolC, DgcZ, nrfE and FimD) were primarily observed in the formalin comparison group (Figure 1B). On the other hand, a smaller number of SNPs from these 15 genes was identified in the comparison between formalin and phenol groups (Table S1). To estimate the biological meaning of the genes where at least one SNPs or mutation was observed, all genes with the significant SNP(s) were annotated using the KEGG database; the numbers of genes mapped onto each pathway are depicted in Table S2. The proportion of the number of genes in each pathway out of all genes confirmed in both formalin and phenol comparison was calculated, and the pathways where more than three genes were assigned to at least one group are visualized in Figure 2. Five of 28 KEGG pathways (metabolic, galactose metabolism, bacterial chemotaxis, flagellar assembly and two‐component systems) contained a significantly higher number of genes in phenol groups than in formalin groups (Figure 2).
TABLE 2

Location, a reference code, A2 (major allele sequence)/A1 (minor allele sequence), chi‐square and p‐value of significant SNPs from GWAS for Edwardsiella piscicida in the formalin and phenol group, and mapping information (MI) including depth (DP) and variant sequence frequency (VSF) of each SNP

Gene (abbreviation)SNP siteRefWT vs. formalinWT vs. phenol
GWASMIGWASMI
A2/A1CHISQPFDRDPVSF (%)A2/A1CHISQPFDRDPVSF (%)
Periplasmic alpha‐amylase (malS)370,815TT/C80.0050.0401310T/C80.0050.0861111
370,888TT/C60.0140.100917T/C100.0020.086148
371,452GG/A80.0050.040138G/A80.0050.0861011
TcfC E‐set like domain‐containing protein a (TcfC)1,238,362AA/G80.0050.040617G/A40.0460.185250
1,239,622G G/A 4.8 0.028 0.103 47 6 1418
1,240,084AA/G60.0140.100425G/A40.0460.185250
1,240,241AA/G100.0020.017225G/A40.0460.185250
1,238,381G80G/A40.0460.185250
1,239,178T443T/C60.0140.152814
Peptidoglycan D,D‐transpeptidase (MrdA)1,394,009A A/C 4.8 0.028 0.103 142 2 A/C4.80.0280.185298
1,394,010A A/T 4.8 0.028 0.103 144 2 A/T4.80.0280.185298
1,394,012A A/G 4.8 0.028 0.103 145 2 A/G4.80.0280.185298
1,394,013C C/G 4.8 0.028 0.103 146 2 C/G4.80.0280.185298
1,394,014C C/A 4.8 0.028 0.103 147 2 C/A4.80.0280.185299
1,394,016T T/A 4.8 0.028 0.103 147 2 T/A 4.8 0.028 0.185 30 8
Chloride channel protein (ClC)1,437,777GG/A4.80.0280.1031111G/A4.80.0280.185149
1,437,628CC/T4.80.0280.185275
1,437,717GG/A4.80.0280.185285
Cytochrome c‐type heme lyase subunit (nrfE)1,539,560A A/G 10 0.002 0.017 43 4 G/A40.0460.185533
1,539,936GG/A60.0140.1006504100
1,540,797TT/C100.0020.017206T/C60.0140.152425
SAM‐dependent methyltransferase (SDM)1,601,113AA/G100.0020.0172414A/G80.0050.0861230
1,601,399TT/C100.0020.017334T/C100.0020.0861214
1,601,414AA/G100.0020.017265A/G100.0020.0861213
PTS system, ascorbate‐specific IIC component (L‐Asc family)2,073,595T T/C 4.8 0.028 0.103 90 6 2233
2,074,422TT/C4.80.0280.103245433
2,074,672CC/T4.80.0280.1032110729
2,074,142T410T/C60.0140.152540
2,074,671T210T/C60.0140.152714
Dipeptide ABC transporter (dppA)2,194,891TT/C4.80.0280.103304T/C4.80.0280.185206
2,194,894TT/C4.80.0280.103304T/C4.80.0280.185216
2,195,101CC/T4.80.0280.1031713833
Fucose permease (fucP)2,690,807AA/T100.0020.017197T/A40.0460.185450
2,690,834GG/A100.0020.0172072100
2,691,021TT/C40.0460.12633320
2,691,140TA/T40.0460.1264251100
TolC family protein (tolC) a 2,726,025TT/C100.0020.01715132100
2,726,173TT/C4.80.0280.1031120550
2,726,409AA/G4.80.0280.103286625
Ig‐like domain‐containing protein (TREM2) a 2,735,523G G/A 8 0.005 0.040 39 3 A/G40.0460.185250
2,738,408AA/G60.0140.1004332100
2,738,491TT/C60.0140.1005251100
2,734,265A249A/G4.80.0280.185929
2,736,314A150A/G60.0140.152717
2,736,408T100T/C60.0140.152825
Type III secretion inner membrane channel protein (LcrD)3,326,526T T/C 10 0.002 0.017 230 1 C/T40.0460.185333
3,327,336A A/G 4.8 0.028 0.103 196 2 A/G4.80.0280.1851517
3,327,402A A/G 4.8 0.028 0.103 107 2 A/G4.80.0280.1852125
3,326,852T520T/C40.0460.185250
3,327,486C1250C/A40.0460.185250
3,328,029T820T/C40.0460.185250
Diguanylate cyclase (DgcZ)3,401,083T T/C 4.8 0.028 0.103 106 2 T/C4.80.0280.185235
3,402,611T T/C 4.8 0.028 0.103 175 1 294
3,402,641T T/C 4.8 0.028 0.103 197 1 334
Cystathionine beta‐lyase (MetC)3,518,551TT/G100.0020.017714T/G80.0050.086525
3,518,851CT/C40.0460.126367T/C40.0460.185467
3,518,926TT/C40.0460.12663340
3,519,395G170G/A40.0460.185350
Type 1 fimbriae anchoring protein (FimD)3,733,406TT/C4.80.0280.1032451210
3,734,033TT/C60.0140.1005251100
3,734,698TT/C40.0460.12625010

Note: Bold letters indicated SNPs with FDR <0.25 and DP ≥30.

Gene was annotated by non‐redundant (NR) protein sequence database (blastp).

FIGURE 1

Manhattan plot of genome‐wide association analysis (GWAS) between WT and formalin ((a)‐a) or phenol group ((a)‐b). Circos plot for the genes with more than 3 significant SNPs in one group. The thickness of green and pink coloured lines denotes the number of Edwardsiella piscicida SNPs in phenol and formalin groups, the abbreviation of each gene was written in brackets (b). The brown and green dots (a) indicate the genes that contained the first to the third highest number of significant SNPs (p < .05)

FIGURE 2

The ratio for the number of genes with at least one significant SNP between formalin and phenol groups belonging to KEGG pathways in Edwardsiella piscicida. The thickness of edges between nodes indicates similarity based on the number of shared genes. The yellow background with the asterisk in the node indicates significantly different counts based on chi‐square analysis (*p < .05; **p < .01; ***p < .001)

Location, a reference code, A2 (major allele sequence)/A1 (minor allele sequence), chi‐square and p‐value of significant SNPs from GWAS for Edwardsiella piscicida in the formalin and phenol group, and mapping information (MI) including depth (DP) and variant sequence frequency (VSF) of each SNP Note: Bold letters indicated SNPs with FDR <0.25 and DP ≥30. Gene was annotated by non‐redundant (NR) protein sequence database (blastp). Manhattan plot of genome‐wide association analysis (GWAS) between WT and formalin ((a)‐a) or phenol group ((a)‐b). Circos plot for the genes with more than 3 significant SNPs in one group. The thickness of green and pink coloured lines denotes the number of Edwardsiella piscicida SNPs in phenol and formalin groups, the abbreviation of each gene was written in brackets (b). The brown and green dots (a) indicate the genes that contained the first to the third highest number of significant SNPs (p < .05) The ratio for the number of genes with at least one significant SNP between formalin and phenol groups belonging to KEGG pathways in Edwardsiella piscicida. The thickness of edges between nodes indicates similarity based on the number of shared genes. The yellow background with the asterisk in the node indicates significantly different counts based on chi‐square analysis (*p < .05; **p < .01; ***p < .001)

SNPs and GWAS for V. harveyi

In general, the clustering result of missing genotype correlations based on the IBM did not show any specific patterns among the samples, although the samples from V. harveyi incubated in the SW at 30°C for 6 d tended to be closer than other groups (Figure S2a). The numbers of SNPs higher than 0.01 MAF from the three comparisons (between Con and SW at 30°C for 12 h, 3 and 6 days) were 2955, 6678 and 2015, respectively, and 143, 574 and 143 sites in 12 h, 3 and 6 days groups, respectively, were significant sites (p < .05) (Figure S2b). The upper genes with the first to the third‐largest number of significant SNPs were identified for each sampling time point, and six genes (fliN, FlaD, RcpC, DECR, PrkC and Eis) were selected in the 12‐h group. Likewise, seven genes (COX1, RMD‐MFP, RND‐IMT, mcpB, FliN, lapD and PrkC) and six genes (FliN, RcpC, OMP, PrkC, DUF4150 and TKT) were regarded as those that had the largest number of significant SNPs in the 3 and 6 days groups, respectively (Figure 3A). Among these genes, seven (RND‐MFP, RND‐IMT, mcpB, COX1, lapD, Eis and TKT) exhibited 3 or more significant SNPs in a gene from at least one group (Table 3). In particular, five genes (RND‐MFP, RND‐IMT, mcpB, COX1 and lapD) had SNPs in only the 3 days group, and GNAT family N‐acetyltransferase and Transketolase contained SNPs in the 12 h or 6 days groups (Figure 3B). Because numerous genes were dominantly enriched into KEGG pathways in the 3 days group, compared with other groups, 20 of 28 KEGG pathways had a statistically different number of genes that harboured significant SNPs between the groups (Figure 4; Table S2). In particular, KEGG pathways such as flagella assembly, ABC transporters, two‐component system, biosynthesis of cofactors, microbial metabolism in diverse environments, metabolic pathways, biosynthesis of secondary metabolites, and biofilm formation—Vibrio cholera had a higher number of genes in the 3 days group (p < .001).
FIGURE 3

Manhattan plot of genome‐wide association analysis (GWAS) in Vibrio harveyi with 12 h ((a)‐a), 3 days ((a)‐b) and 6 days ((a)‐c) seawater at 30°C incubation. Circos plot for the genes that have the first to third highest SNPs in one group. The thickness of the green and pink coloured lines denotes the number of V. harveyi SNPs at each time point (12 h, 3 and 6 days) (b). The coloured dots (brown, green and blue; a) indicate the genes that contained the first to the third highest number of significant SNPs in each time point (p < .05)

TABLE 3

Location, a reference sequence, A2 (major allele sequence)/A1 (minor allele sequence), chi‐square, and p‐value of significant SNPs from GWAS for Vibrio harveyi incubated in seawater at 30°C for 12 h, 3 and 6 days, and mapping information (MI) including depth (DP) and variant sequence frequency (VSF) of each SNP

Gene (abbreviation)SNP site contig)RefCon vs. 12 hCon vs. 3 daysCon vs. 6 days
GWASMIGWASMIGWASMI
A2/A1CHISQPFDRDPVSF (%)A2/A1CHISQPFDRDPVSF (%)A2/A1CHISQPFDRDPVSF (%)
RND efflux system, membrane fusion protein (RND‐MFP)218,971 (2)T660 T/C 8 0.005 0.092 51 2 640
219,094 (2)A353 A/T 8 0.005 0.092 32 7 373
219,098 (2)T350 T/C 8 0.005 0.092 32 3 360
RND efflux system, inner membrane transporter (RND‐IMT)51,394 (8)C250C/T80.0050.092520157
51,397 (8)A254A/T80.0050.092620160
51,789 (8)T210T/G80.0050.0921011370
Methyl‐accepting chemotaxis sensor/transducer protein (mcpB)83,161 (15)T/A310 A/G 8 0.005 0.092 87 20 1350
83,178 (15)T240 T/A 6 0.014 0.158 68 22 1100
83,798 (15)G/A370 A/G 8 0.005 0.092 44 6 910
putative flagellar motor switch protein (fliN)7823 (63)T T/C 8 0.005 0.162 30 10 T/C60.0140.158922 T/C 8 0.005 0.111 86 21& INDEL; TG (6)
7824 (63)G G/T 8 0.005 0.162 30 3 G/T60.0140.158914 G/T 8 0.005 0.111 85 7
8309 (63)AA/G60.0140.1621217G/A40.0460.158550A/G80.0050.1111812
Cytochrome O ubiquinol oxidase subunit I (COX1)5731 (71)A702 A/G 8 0.005 0.092 42 5 821
5967 (71)T472 T/C 8 0.005 0.092 31 7 6213
6420 (71)A805 A/C 8 0.005 0.092 47 11 818
6731 (71)A620 A/G 8 0.005 0.092 32 3 602
diguanylate cyclase/phosphodiesterase (GGDEF & EAL domains) with PAS/PAC sensor(s) (lapD)11,269 (77)A290A/C60.0140.1581114360
11,718 (77)G210G/A60.0140.158525260
12,403 (77)G220G/A80.0050.092914420
Serine/threonine protein kinase, regulator of stationary phase (PrkC)21 (128)TA/T40.0460.16210202100T/A60.0140.144560
930 (128)AA/G60.0140.1621417550 A/G 6 0.014 0.144 42 11
53 (128)C150C/T80.0050.0926172010
54 (128)T150T/C80.0050.092617200
57 (128)T150T/G80.0050.092617200
Flagellin protein (FlaD)325 (132)A G/A 6 0.014 0.162 81 59 524712756
328 (132)G A/G 6 0.014 0.162 80 58 514612754
688 (132)G A/G 6 0.014 0.162 59 63 A/G 4.444 0.035 0.158 35 50 10151

Note: Bold letters indicated SNPs with FDR <0.25 and DP ≥30.

FIGURE 4

The ratio for the number of genes with at least one significant SNP in each time point of sea water at 30°C incubation (12 h, 3 and 6 days) belonging to KEGG pathways in Vibrio harveyi. The thickness of edges between nodes indicates similarity based on the number of shared genes. The yellow background with the asterisk in the node indicates significantly different counts based on chi‐square analysis (*p < .05; **p < .01; ***p < .001)

Manhattan plot of genome‐wide association analysis (GWAS) in Vibrio harveyi with 12 h ((a)‐a), 3 days ((a)‐b) and 6 days ((a)‐c) seawater at 30°C incubation. Circos plot for the genes that have the first to third highest SNPs in one group. The thickness of the green and pink coloured lines denotes the number of V. harveyi SNPs at each time point (12 h, 3 and 6 days) (b). The coloured dots (brown, green and blue; a) indicate the genes that contained the first to the third highest number of significant SNPs in each time point (p < .05) Location, a reference sequence, A2 (major allele sequence)/A1 (minor allele sequence), chi‐square, and p‐value of significant SNPs from GWAS for Vibrio harveyi incubated in seawater at 30°C for 12 h, 3 and 6 days, and mapping information (MI) including depth (DP) and variant sequence frequency (VSF) of each SNP Note: Bold letters indicated SNPs with FDR <0.25 and DP ≥30. The ratio for the number of genes with at least one significant SNP in each time point of sea water at 30°C incubation (12 h, 3 and 6 days) belonging to KEGG pathways in Vibrio harveyi. The thickness of edges between nodes indicates similarity based on the number of shared genes. The yellow background with the asterisk in the node indicates significantly different counts based on chi‐square analysis (*p < .05; **p < .01; ***p < .001)

SNPs and GWAS for S. parauberis

In the case of S. parauberis, the dendrogram analysis based on missing genotype correlation revealed that the culture conditions and incubation time, especially the difference between broth and serum environments, were greatly influenced by their clustering. Although one sample (broth 4 h rep3) was abnormally closer than serum samples, most could be primarily divided by the culturable environments (broth and serum). The samples from broth environments tended to cluster well by incubation time, but not in the serum environment (Figure S3a). The number of SNPs exceeding 0.01 MAF in the comparison with each time point of broth and serum groups were 54, 43 and 65 at 1, 2 and 4 h, respectively, which was much smaller for E. piscicida and V. harveyi (Figures S3b; Figure 5). Similarly, no gene with more than two significant SNPs in all sampling time points was observed (Table 4).
FIGURE 5

Manhattan plot of genome‐wide association analysis (GWAS) in Streptococcus parauberis for 1 h (a), 2 h (b) and 4 h (c) incubation

TABLE 4

Location, a reference sequence, A2 (major allele sequence)/A1 (minor allele sequence), chi‐square, and p‐value of significant SNPs from GWAS for Streptococcus parauberis incubated in broth and serum for 1, 2, and 4 h, and mapping information (MI) including depth (DP) and variant sequence frequence (VSF) of each SNP

Gene (abbreviation)SNP siteRef1 h2 h4 h
GWASMIGWASMIGWASMI
A2/A1CHISQPFDRDPVSF (%)A2/A1CHISQPFDRDPVSF (%)A2/A1CHISQPFDRDPVSF (%)
11,965T40A/T4.000.0460.18625030
34,946T13070T/C6.000.0140.172425
37,322TT/A6.000.0140.116171040
tRNA‐Cys‐GCA (NSUN6)116,816A5848G/A8.000.0050.10123703650
135,377TT/C8.000.0050.118131030
138,394A3040T/A4.000.0460.172250
603,666A150A/T4.000.0460.18625090
677,993T13050C/T4.000.0460.172333
Replication initiator protein A (DnaA)1,099,212TAATAA/TAAA6.000.0140.111206030
Hypothetical protein1,111,230C50C/G6.000.0140.18642550
6‐phospho‐beta glucosidase (lacG)1,288,372A4060A/G8.000.0050.1521010
Hypothetical protein1,309,621C430C/T8.000.0050.1011010150
Phage tail length tape‐measure protein T (yomI)1,460,848TT/C6.000.0140.114253020
Phage terminase arge subunit (A)1,471,143T30T/C4.000.0460.186250T/C6.000.0140.172333
1,526,293TA/T4.000.0460.1682501020
1,526,442AG/A4.000.0460.16825020
Hypothetical protein1,662,165G6040G/A8.000.0050.152425
Hypothetical protein1,706,945CTTTTTCTTTTT/CTTT8.000.0050.11149050
Tn5252, relaxase (Tn5252)1,885,284TT/C6.000.0140.116174020
Hypothetical protein2,037,162TT/C6.000.0140.113331010
PTS system, cellobiose‐specific IIC component (celB)2,040,252T7030T/C4.000.0460.172250
Phosphoribosylformylglycinamidine synthase (purL)2,089,390ATTT4030ATT/ATTT4.000.0460.172250

Note: Bold letters indicated SNPs with FDR <0.25 and DP ≥30.

Manhattan plot of genome‐wide association analysis (GWAS) in Streptococcus parauberis for 1 h (a), 2 h (b) and 4 h (c) incubation Location, a reference sequence, A2 (major allele sequence)/A1 (minor allele sequence), chi‐square, and p‐value of significant SNPs from GWAS for Streptococcus parauberis incubated in broth and serum for 1, 2, and 4 h, and mapping information (MI) including depth (DP) and variant sequence frequence (VSF) of each SNP Note: Bold letters indicated SNPs with FDR <0.25 and DP ≥30.

DISCUSSION

Although GWAS for fish bacterial pathogens is relatively underrated compared with fish GWAS for the purpose of quantitative trait locus (QTL) marker identification, a few studies have applied GWAS for fish bacterial pathogens, which has contributed to identifying genotypic characteristics in the same species (Correa et al., 2015; Holborn et al., 2018; Le et al., 2021; Rasmussen et al., 2016). Different phenotypic characteristics between strains in the same species can be created by a genetic pressure to change nucleic sequences and genes to adapt to specific environments, which has been widely observed in both prokaryotes and eukaryotes (e.g., Kim et al., 2021; Kjærner‐Semb et al., 2021). This is one of the main reasons why the first step in most GWAS is to classify the phenotypic characteristics or measurements (which can be discrete units [e.g., high, intermediate and low virulence] and/or continuous values) (Chen & Shapiro, 2015). Given that bacterial genomes tend to have a much stronger linkage disequilibrium than eukaryotes, bacteria are more flexible in relation to altering their genetic information under different environments, and this can accompany mutation and sequence alternation within short time periods when they are in different culturable environments (Chen & Shapiro, 2015). It is not surprising that bacteria mutation rate can be influenced by death, population dynamics and stressful environments, and generally speaking 1–100 nucleotides in 10 million to 1 billion bases can be substituted per generation depending on environments and conditions (Chevallereau et al., 2019; Frenoy & Bonhoeffer, 2018; Westra et al., 2017). Given that bacterial doubling time is approximately 20 min in favourable environments, although the time span for bacterial replication differs owing to multiple factors (e.g., species, temperature and nutrients) (Jaruszewicz‐Błońska & Lipniacki, 2017), more than one‐day incubation in different conditions can be enough time to make a mutation in different sites in the genome and change the major nucleic acid sequences in their population group. In addition, the fact that relatively more frequent mutated, missed or substituted genes for a certain gene compared with others indirectly suggests the importance of the gene for adapting to the surrounding environment. The advance in transcriptomic sequencing technologies can provide valuable information for profiling the sequence dynamics from the survived bacteria even in the same cultures; however, most studies have focussed on mRNA expression in different environments using transcriptomic results (Le et al., 2021; Lee et al., 2021; Montánchez et al., 2019; Yoon et al., 2020). Hence, this study attempted to observe SNPs and/or mutations in the same strain with different environments and exposure times through raw sequencing files that had been used for transcriptomic studies (Lee et al., 2021; Montánchez et al., 2019; Yoon et al., 2020). In general, GWAS have been performed using genomic information rather than the sequencing for transcriptomes. However, sequences from RNA‐seq can provide an alternative source of host genetic information. For example, Berthouly‐Salazar et al. (2016) have used transcriptomes easier to obtain from a non‐model plant for investigating SNPs and genotyping as an alternative strategy, and Chandhini and Rejish Kumar (2019) also found that transcriptomes obtained from NGSs, such as genomic results, can serve as resources to excavate SNPs and other molecular makers. Moreover, as transcripts have been produced from only living bacteria, the result of transcriptomes has the advantage that it mainly comes from survived bacteria adapted to the changed environment regardless of the elapsed incubation time. On the other hand, one drawback of utilizing raw RNA‐seq files for GWAS is that the sequencing depth can fluctuate because of the differences in bacterial mRNA expression between groups, which can cause different depth levels and negatively impact the results. Nevertheless, strategies for deep sequencing of bacterial cultures can identify the occurrence of mutations even in a minority of the major population, and this provides potential insight into bacterial evolution and adaptation with the respect to genomes. For example, the VSF and DP scores in Tables 2, 3, 4 indicate the diversity of sequence and depth rate at each position. The higher VSF with deeper DP can be interpreted as the more reliable SNPs or mutated position because it meant the more different nucleotides and sequences were mapped onto a reference genome (Danecek et al., 2011). Also, as a small number of samples used in this study can lead to the risk of false‐negative (type 2 error) if Bonferroni correction would be applied, the p‐value less than .05 without Bonferroni correction, which thresholds also used in some studies, has been used in this study (Asif et al., 2021; Kap et al., 2016; Lu et al., 2015; Zeng et al., 2015). Instead, the FDR value has been described in Tables 2, 3, 4 for evaluating the reliability of each SNP. Hence, if we keep this in mind for the interpretation of results, RNA‐seq can demonstrate the overall genomic changes under different environments in pathogenic bacteria in aquaculture. When E. piscicida cultured in the liquid media (broth) and a minimal sub‐inhibitory concentration of formalin and phenol were compared (Yoon et al., 2020), TcfC and MrdA were the top three genes that harboured multiple SNPs and/or mutated sequences (Figure S4). In general, Tcf (typhi colonization factor) has been known to exhibit chaperone‐usher fimbriae and is classified in α fimbrial clade, and its regions are composed of six operons including chaperone (TcfA), major fimbriae subunit (TcfB), usher (TcfC) and adhesion (TcfD) (Azriel et al., 2017; Leclerc et al., 2016; Yue et al., 2012). Given that Tcf has been reported as being part of the type six secretion systems and atypical fimbriae in some species, particularly Salmonella, it can be an important virulence factor and supporting bacterial colonization in E. piscicida (Azriel et al., 2017; Folkesson et al., 1999). Former studies (e.g., Azriel et al., 2017; Leclerc et al., 2016) have reported that Tcf is the factor involved in host specificity, such as serotypes in Salmonella enterica. Switching the gene sequences involved in serotypes is noteworthy because the mutation in a Tcf region of E. piscicida suggested one strain can be differentiated into several serotypes under different culturable environments. Similarly, type 3 secretion system (T3SS) is one of the well‐known virulence factors that contribute to invading and surviving in host cells through flagellar‐like needles with injecting effector proteins, and T3SS in E. piscicida has been found to be significantly up‐regulated under the sub‐inhibitory concentration of formalin (0.38 mM) (Hou et al., 2017; Yoon et al., 2020). Notably, the group that had mutated or variant nucleic acid sequences in LcrD was broth cultured E. piscicida rather than formalin and phenol groups. Specifically, there was little change in the nucleotide sequence in an environment that stimulate the expression of T3SS, but a higher percentage of sequence changes was observed in broth media that does not require the high expression of T3SS. Given that numerous studies (e.g., Almaguer‐Chávez et al., 2011; Barrick et al., 2009) have observed the decreasing virulence of bacteria when pathogenic bacteria are sub‐cultured continuously, the sequence change of T3SS might affect the virulence of E. piscicida. Barrick et al. (2009) noted that Escherichia coli has continuously evolved through long‐term subcultures with deletions, mutations, inversions and duplications in its genome that resulted in the loss of 1.2% of the original chromosome. This implies that the observation of mutations and/or SNPs in T3SS that has been known to the important virulence factor and factors exposed to the external environment in E. piscicida from the broth group can be the process of evolution to be more adaptable in the liquid media. Further study about the influence of T3SS mutation in a nutrient‐enriched environment is necessary from the perspective of bacterial pathogenicity and environmental adaptation. In this context, the plethora of mutations in MrdA can be interpreted similarly to T3SS. Peptidoglycan is one of the major components of bacterial cell walls, and D‐,D‐transpeptidase is well‐known for its strong involvement in crossing‐linking peptidoglycan strands (Hugonnet et al., 2016; Triboulet et al., 2013). The peptidoglycan layer can help to maintain bacteria structures in the face of internal and outer pressures, and physiological impacts can exert on peptidoglycan cross‐linking (Pidgeon et al., 2019). In particular, because D‐,D‐transpeptidase as penicillin‐binding proteins can be easily inhibited by β‐lactam antibiotics, several studies have reported that L‐,D‐transpeptidase, which is highly resistant to β‐lactams, can be replaced when bacteria need to adapt β‐lactam antibacterial agents (e.g., Pidgeon et al., 2019; Triboulet et al., 2013). Based on all these studies and the results of this study, it is speculated that D‐,D‐transpeptidase in E. piscicida is an important gene that exhibits multiple alternations and/or changes depending on environmental changes and differences. In addition, the number of genes with significant SNPs or mutations belonging to certain KEGG pathways (e.g., bacterial chemotaxis, flagellar assembly and galactose metabolisms) differed between the sub‐inhibitory concentration of formalin and phenol groups. All these results support the claim that the E. piscicida genome can be flexibly changed by the external environment and mutations or SNPs do not occur with the same probability among all genes but instead are concentrated on specific genes. The comparison of mutation or SNP sites in V. harveyi, although under the same environment (sea water at 30°C), indicated the different incubation times significantly affected the patterns of mutation. The number of SNPs that exceeded 0.01 MAF identified in the 3 days group was 6678, whereas 12 h and 6 days groups had less than half that number. This difference would have resulted in the 3 days group in most KEGG pathways having genes with at least one significant SNP or mutation site compared to others. The genes involved in flagella and flagellin components exhibited several mutations during the samplings, and these patterns were more notable as time elapsed (Figure S5). Namely, the sequence of the control group did not necessarily mean that the sequence of mutations or SNPs in the regions where mutations frequently occurred was not observed. For instance, although putative flagellar motor switch protein and flagellin protein in the control group of V. harveyi also have contained a piece of minor variants, the number of mutated sequences is much higher enough to switch the major sequence after incubation in high‐temperature sea water (Figure S5). In addition, the variances of sequences were not identical in each group, taking into account that several ‘T’ were identified in the 325 position (contig 63) from only one sample belonging to the 6 days group. The mutation and differences of flagellin and flagella relevant genes may be the reason for the alternation of phenotypic characteristics. The flagellar motor switch proteins are known to promote motility and biosynthesis in flagella, and their inactivation can cause decreasing velocity (swimming ability) in highly viscous media (1% methylcellulose) and infectivity in Borrelia burgdorferi (Li et al., 2010). In addition, Kim et al. (2014) reported that Vibrio vulnificus which has a few nucleotide deletions of flagellin(s) affected the significant reduction in cytotoxicity, motility and adhesion. All these results suggest that although there may be genes and regions where mutations in bacteria occur frequently and randomly, environmental differences can create greater diversity and phenotypic characteristics. In the case of a resistance nodulation division (RND) where the system is responsible for transporting dyes, antibiotics, host molecules and detergents located between the inner and outer membrane in gram‐negative bacteria, some mutations were identified in the 3 days group (Blair and Piddock, 2009). However, given that these mutations were not consistent until the 6 days group, some sequences in certain genes exhibited greater variation depending on sampling time points, although not all mutated sequences are necessarily meant to become major populations continuously. Regarding the comparison of S. parauberis, the correlation of missing genotypes between samples was well‐clustered in broth and serum groups. However, no gene contained several significant mutations or SNPs, as observed in E. piscicida and V. harveyi. Because S. parauberis cultured in broth and serum for 4 h did not exhibit less than two times the high number of bacteria compared to 0 h (Lee et al., 2021), it is suggested that the incubation times were not sufficient to initiate significant SNPs or mutation. However, missing genotypes might be affected by gene expression because RNA‐seq, files not genome sequencing results, were used in this study. According to the results of principal investigation analysis based on transcriptomic results, despite the big difference in transcriptomic responses as incubation time elapsed, closer clustering distance by incubation time rather than a different environment suggested that environmental pressure can be a much stronger factor in initiating the missing sequence or deletions. In conclusion, this study first investigated SNPs or occurring mutations in major fish bacterial pathogens from the same strain, and successfully identified genes (especially, flagella, fimbriae and flagellin relevant genes) that have a much higher number of mutated sequences than other genes under different environments. Given that multiple studies have already demonstrated fimbriae that are directly exposed to the external environment are the important gene for differentiating bacterial serotypes, these genes can be more flexible in changing their structures which can be linked to alternation in their phenotypic characteristics. In addition, other genes that contained several significant mutations and SNPs based on GWAS can be potential biomarkers for tracking the origin and sources of the same bacterial species. Although further studies are necessary to investigate the meaning of mutations in each gene and species, the data presented in this study provide valuable information and insight into the evolution of bacterial pathogens in aquaculture.

CONFLICT OF INTEREST

The author declares no competing interest exists. Figure S1 Dendrogram for the correlation of missing genotypes in each E. piscicida based on identity‐by‐missingness (IBM) (a). Venn diagram for the number of SNPs with more than 0.01 minor allele frequency (MAF) in the comparison between wild type (WT) and treatments (formalin and phenol) (b). Figure S2. Dendrogram for the correlation of missing genotypes in each group of V. harveyi based on identity‐by‐missingness (IBM) (a). Venn diagram for the number of SNPs with more than 0.01 minor allele frequency (MAF) in the comparison between control and sea water at 30°C incubation for 12 h, 3, and 6 days (b). Figure S3. Dendrogram for the correlation of missing genotypes in each group of S. parauberis based on identity‐by‐missingness (IBM) (a). Venn diagram for the number of SNPs with more than 0.01 minor allele frequency (MAF) in the comparison between each time point of broth (control) and serum (treatment) incubation for 1, 2 and 4 h (b). Figure S4. The reference and portion of mapped sequences where significant SNPs were identified through GWAS from E. piscicida cultured in control (Wild‐type; broth), formalin, and phenol. Figure S5. The reference and portion of mapped sequences where significant SNPs were identified through GWAS from V. harveyi incubated in control (broth at 26°C) and sea water at 30°C for 12 h, 3 and 6 days, respectively. Click here for additional data file. Table S1 Location, a reference code, A2 (major allele sequence)/A1 (minor allele sequence), chi‐square, and p‐value of significant SNPs from GWAS for E. piscicida between formalin and phenol group, and mapping information (MI) including depth (DP) and variant sequence frequency (VSF) of each SNP. Click here for additional data file. Table S2 The number of gene(s) that have at least one significant SNP(s) in each KEGG pathway in E. piscicida or V. harveyi group comparison Click here for additional data file.
  50 in total

Review 1.  Current status of pan-genome analysis for pathogenic bacteria.

Authors:  Yeji Kim; Changdai Gu; Hyun Uk Kim; Sang Yup Lee
Journal:  Curr Opin Biotechnol       Date:  2019-12-28       Impact factor: 9.740

2.  Multiple insertions of fimbrial operons correlate with the evolution of Salmonella serovars responsible for human disease.

Authors:  A Folkesson; A Advani; S Sukupolvi; J D Pfeifer; S Normark; S Löfdahl
Journal:  Mol Microbiol       Date:  1999-08       Impact factor: 3.501

3.  EDGAR 2.0: an enhanced software platform for comparative gene content analyses.

Authors:  Jochen Blom; Julian Kreis; Sebastian Spänig; Tobias Juhre; Claire Bertelli; Corinna Ernst; Alexander Goesmann
Journal:  Nucleic Acids Res       Date:  2016-04-20       Impact factor: 16.971

4.  Regulation and production of Tcf, a cable-like fimbriae from Salmonella enterica serovar Typhi.

Authors:  Jean-Mathieu Leclerc; Eve-Lyne Quevillon; Yoan Houde; Kiran Paranjape; Charles M Dozois; France Daigle
Journal:  Microbiology       Date:  2016-03-04       Impact factor: 2.777

5.  Multi-drug resistance, integron and transposon-mediated gene transfer in heterotrophic bacteria from Penaeus vannamei and its culture environment.

Authors:  Ranjit Kumar Nadella; Satyen Kumar Panda; Madhusudana Rao Badireddy; Pani Prasad Kurcheti; Ram Prakash Raman; Mukteswar Prasad Mothadaka
Journal:  Environ Sci Pollut Res Int       Date:  2022-01-23       Impact factor: 4.223

6.  Diversification of the Salmonella fimbriae: a model of macro- and microevolution.

Authors:  Min Yue; Shelley C Rankin; Ryan T Blanchet; James D Nulton; Robert A Edwards; Dieter M Schifferli
Journal:  PLoS One       Date:  2012-06-12       Impact factor: 3.240

7.  RASTtk: a modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes.

Authors:  Thomas Brettin; James J Davis; Terry Disz; Robert A Edwards; Svetlana Gerdes; Gary J Olsen; Robert Olson; Ross Overbeek; Bruce Parrello; Gordon D Pusch; Maulik Shukla; James A Thomason; Rick Stevens; Veronika Vonstein; Alice R Wattam; Fangfang Xia
Journal:  Sci Rep       Date:  2015-02-10       Impact factor: 4.379

8.  Candidate Gene Association Analysis of Neuroblastoma in Chinese Children Strengthens the Role of LMO1.

Authors:  Jie Lu; Ping Chu; Huanmin Wang; Yaqiong Jin; Shujing Han; Wei Han; Jun Tai; Yongli Guo; Xin Ni
Journal:  PLoS One       Date:  2015-06-01       Impact factor: 3.240

9.  Genetic toggle switch controlled by bacterial growth rate.

Authors:  Joanna Jaruszewicz-Błońska; Tomasz Lipniacki
Journal:  BMC Syst Biol       Date:  2017-12-02
View more
  1 in total

1.  A genome-wide association study of the occurrence of genetic variations in Edwardsiella piscicida, Vibrio harveyi, and Streptococcus parauberis under stressed environments.

Authors:  HyeongJin Roh
Journal:  J Fish Dis       Date:  2022-06-23       Impact factor: 2.580

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.