Literature DB >> 31482696

The complete genome sequence of the thermophilic bacterium Laceyella sacchari FBKL4.010 reveals the basis for tetramethylpyrazine biosynthesis in Moutai-flavor Daqu.

Dounan Li1,2, Wei Huang2,3, Chunxiao Wang2,3, Shuyi Qiu1,2,3.   

Abstract

The genus Laceyella consists of a thermophilic filamentous bacteria. The pure isolate of Laceyella sacchari FBKL4.010 was isolated from Moutai-flavor Daqu, Guizhou Province, China. In this study, the whole genome was sequenced and analyzed. The complete genome consists of one 3,374,379-bp circular chromosome with 3,145 coding sequences (CDSs), seven clustered regularly interspaced short palindromic repeat (CRISPR) regions of 12 CRISPRs. Moreover, we identified that the genome contains genes encoding key enzymes such as proteases, peptidases, and acetolactate synthase (ALS) of the tetramethylpyrazine metabolic pathway. Metabolic pathways relevant to tetramethylpyrazine synthesis were also reconstructed based on the Kyoto Encyclopedia of Genes and Genomes (KEGG) PATHWAY database. Annotation and syntenic analyses using antiSMASH 4.0 also revealed the presence of two gene clusters in this strain that differ from known tetramethylpyrazine synthesis clusters, with one encoding amino acid dehydrogenase (ADH) and the other encoding transaminase in tetramethylpyrazine metabolism. The results of this study provide flavor and genomic references for further research on the flavor-producing functions of strain FBKL4.010 in the Moutai liquor-making process.
© 2019 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.

Entities:  

Keywords:  zzm321990Laceyella saccharizzm321990; Moutai-flavor Daqu; antiSMASH analysis; complete genome; liquor-making; tetramethylpyrazine

Mesh:

Substances:

Year:  2019        PMID: 31482696      PMCID: PMC6925174          DOI: 10.1002/mbo3.922

Source DB:  PubMed          Journal:  Microbiologyopen        ISSN: 2045-8827            Impact factor:   3.139


INTRODUCTION

Chinese Moutai liquor has been made for 800 years and is distilled from the product of fermentation using a wild microbial starter, Moutai‐flavor Daqu. In contrast to other Daqus, this Daqu uses only wheat as raw material and is an important starter for the whole fermentation process (Wang, Shi, & Gong, 2008). In general, the maximum temperature while making Moutai‐flavor can reach 65°C. This temperature creates good growth or sporulation conditions for many thermophilic microbes that are believed to be important in the formation of the soy sauce flavor during the liquor‐making process (Li, Lian, Ding, Nie, & Zhang, 2014; Wang et al., 2017; Zhang et al., 2012). Among these microbes, Bacillus sp. such as Bacillus subtilis and Bacillus licheniformis have been considered as a source of the soy sauce flavor in Moutai liquor. However, the roles of other thermophilic microbes in the formation of the soy sauce flavor have not been confirmed (Li & Qiu, 2017). Laceyella sacchari is a chemoheterotrophic, heat‐resistant, and gram‐positive microbe in the family Thermoactinomycetaceae, genus Laceyella. Members of the genus Laceyella originally belonged to the genus Thermoactinomyces, which was first discovered and described as a single species by Tsiklinsky in 1899 (Yoon, Kim, Shin, & Park, 2005). After the use of new polyphasic taxonomy methods, the genus Laceyella was divided from the genus Thermoactinomyces and described as a new genus (Matsuo et al., 2006; Yoon et al., 2005). Laceyella sacchari often inhabits high‐temperature environments (Xian, Ming, & Li, 2015) and has great potential for application in the modern food industry (Hanphakphoom, Maneewong, Sukkhum, Tokuyama, & Kitpreechavanich, 2014; Shukla & Singh, 2015). According to the whole‐genome analysis of the phylogenetically related strain, Thermoactinomyces daqus H‐18, which was isolated from high‐temperature Daqu (approximately 3.44Mb), 60 coding sequences (CDSs) of peptidases and amino acid dehydrogenases (ADHs) were identified (Yao et al., 2014). In the tetramethylpyrazine metabolic pathway, peptidases degrade proteins into amino acids and ADH further turns amino acids into ammonium (an important flavor precursor) (Wu & Xu, 2014; Zhu, Xu, & Fan, 2010). This result revealed that some thermophilic microbes in the family Thermoactinomycetaceae in Daqu may play a role similar to that of Bacillus sp., which is relevant to the formation of soy sauce flavor. In a recent study by our research group, the family Thermoactinomycetaceae was identified as a novel dominant microbial group (accounting for 34.40% of the total microbial biomass) in Moutai‐flavor Daqu through metagenome sequencing (Wang, Ban, Zhou, Hu, & Qiu, 2016). A pure isolate designated FBKL4.010 was obtained from Moutai‐flavor Daqu and identified as L. sacchari using gene phylogenetic analysis based on the 16S rRNA sequence. Thus, the aims of this study were (a) to explore the main function of the strain FBKL4.010 under a simulated environment of Moutai‐flavor Daqu fermentation and (b) to confirm key genes involved in flavor formation on the genome of FBKL4.010.

MATERIALS AND METHODS

Target strain

The strain FBKL4.010 was isolated from Moutai‐flavor Daqu and identified as L. sacchari in our previous study.

GC‐MS analysis

In the solid‐state fermentation experiment, we used pure wheat as a solid‐state medium to simulate the fermentation of Moutai‐flavor Daqu. The wheat was moistened in 60% boiling water for 5–6 hr and saccharified in a 60°C water bath for 3–4 hr. Finally, we sterilized the medium at 115°C for 30 min. FBKL4.010 was cultured in sterilized wheat medium at its optimum growth temperature (45°C). After 5 days of cultivation in 250‐ml bottles under aerobic, standing conditions, we selected fermentative material as the subject for gas chromatography–mass spectrometry (GC‐MS) analysis. Fermentation samples (5 g per sample) were weighed into 20‐ml vials sealed with Teflon‐coated septa and incubated in a 60°C water bath for 15 min prior to a 30‐min solid‐phase microextraction (SPME) extraction. The SPME fiber assembly, 50/30 μm DVB/CAR/PDMS, stableflex, manual holder was obtained from Supelco. Samples of the medium alone were used as the blank group. Substances were separated with a ZB‐5MSI column (30 m × 0.25 mm × 0.25 μm). The injection temperature was 250°C (splitless injection mode), and high‐purity He (99.999%) was used as the carrier gas with a 1.0 ml/min flow rate. The column pressure was set at 7.62 psi, and the solvent delay time was 1 min. The temperature program was as follows: the oven initial temperature was maintained at 40°C for 6 min, increased to 135°C at 4°C/min, and maintained for 2 min, and then increased to 180°C at 8°C/min and maintained for 5 min. The ion source temperature was 250°C; the MS quad temperature was 150°C; the ionization voltage was 70 eV; and the emission current was 34.6 μA. The scan range was 20–450 amu.

DNA extraction and genomic sequencing

Genomic DNA of FBKL4.010 was purified using the DNeasy Blood & Tissue Kit (QIAGEN®). About 155 μg DNA was obtained taken up in the water, and 80.7 μg was used for sequencing (the concentration of DNA was 180.54 ng/µl, with an OD260/280 value of 2.06). Whole‐genome sequencing was performed on the Illumina MiSeq‐Dx Sequencer using a PE400 library (400‐bp inserts) in the sequencing mode of paired‐end 2 × 251 bp and PacBio RS II Sequencer using a S20K library (20‐kbp inserts) with P6‐C4 chemistry in the standard sequencing mode. Raw reads from the Illumina MiSeq platform were examined and subjected to quality control using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Low‐quality reads were removed using SOAPec v2.0 (Luo et al., 2012), and high‐quality reads were obtained based on the kmer frequency (value = 17). Subsequently, the FBKL4.010 genome was assembled using SPAdes v3.9.0 (Bankevich et al., 2012). The data from the PacBio platform were assembled into a number of scaffolds and contigs using Canu v1.4 (Koren et al., 2017). Finally, we generated a mixed assembly using SPAdes v3.9.0 (Bankevich et al., 2012) and a multicollinearity analysis using MUMmer v3.1 (Kurtz et al., 2004) to close the gaps among the contigs.

Genomic functional annotation and discovery of tetramethylpyrazine‐relevant genes

The protein‐coding genes were predicted using Glimmer 3.02 (Delcher, Harmon, Kasif, White, & Salzberg, 1999) and the National Center for Biotechnology Information (NCBI) Prokaryotic Genome Annotation Pipeline with GeneMarkS+ v4.3 (Tatusova et al., 2016). In addition, annotation was further refined with blast searches to the NCBI Reference Sequence database (RefSeq) (Release on March 20, 2019) and the nonredundant protein sequence database (NCBI‐nr) (Pruitt, Tatusova, Klimke, & Maglott, 2009) using blast+ v2.8.1 (Mount, 2007). The functional criteria included an E‐value of less than 1e‐6, the sequence identity more than 30%, and the minimal alignment length percentage of larger than 70%. Finally, the annotated information of the best hit was assigned to the corresponding protein‐coding gene. The discovery of tetramethylpyrazine‐relevant genes and reconstruction of relevant metabolic pathways was performed using the Kyoto Encyclopedia of Genes and Genomes (KEGG) PATHWAY and Mapper databases (Kanehisa, Sato, Kawashima, Furumichi, & Tanabe, 2016). The prediction of gene clusters and syntenic analyses were performed using antiSMASH 4.0, bacterial version (Tilmann et al., 2015). The circular graph of the genome was generated using CGView (Copyright 2004) (Stothard & Wishart, 2005).

RESULTS AND DISCUSSION

Flavor component characteristics of FBKL4.010

According to the results of the GC‐MS analysis, we found that the strain FBKL4.010 produces a flavor precursor, acetoin, and various pyrazines, including 2,5‐dimethyl pyrazine, 2,3,5‐trimethylpyrazine, tetramethylpyrazine, and 2‐ethyl‐3,5,6‐trimethylpyrazine, under a solid‐state system using pure wheat (Figure 1). Notably, the most abundant flavor component was tetramethylpyrazine (red frame in Figure 1), which accounted for 52.68% of the total flavor component content. In previous GC‐MS analyses of B. subtilis MTDB‐03, B. licheniformis FBKL1.0199 and FBKL1.0201 isolated from Moutai‐flavor Daqu, similar flavor components were also identified, with the tetramethylpyrazine content exceeded 40.00% of the total flavor component for each strain (Wang et al., 2017; Yang et al., 2011). In the previous study, no strains in the genus Laceyella with the ability to produce tetramethylpyrazine have been described. Therefore, the identification of a strain in the genus Laceyella that plays a role in the formation of soy sauce flavor, similar to that of Bacillus sp. under the simulated environment of Moutai‐flavor Daqu fermentation, is notable.
Figure 1

The chromatogram of flavor components under solid‐state fermentation conditions from the strain FBKL4.010. Number 1 represents the precursor acetoin; numbers 2, 3, 4, and 5 represent 2,5‐dimethyl pyrazine, 2,3,5‐trimethylpyrazine, tetramethylpyrazine, and 2‐ethyl‐3,5,6‐trimethylpyrazine, respectively

The chromatogram of flavor components under solid‐state fermentation conditions from the strain FBKL4.010. Number 1 represents the precursor acetoin; numbers 2, 3, 4, and 5 represent 2,5‐dimethyl pyrazine, 2,3,5‐trimethylpyrazine, tetramethylpyrazine, and 2‐ethyl‐3,5,6‐trimethylpyrazine, respectively

Genome characteristics

Further research was conducted to identify key genes relevant to tetramethylpyrazine metabolism using complete genome analysis to study the tetramethylpyrazine metabolic pathway in FBKL4.010 under solid fermentation conditions. The results showed that the complete genome sequence of strain FBKL4.010 consisted of a single, circular 3,374,379‐bp chromosome (Figure 2) based on the mixed assembly and multicollinearity analysis. The GenBank accession no. is CP025943.1. The average GC content of the whole sequence is 49.19%. The analysis with the Prokaryotic Genome Annotation Pipeline (GeneMarkS+ v4.3) (Tatusova et al., 2016) resulted in the annotation of 3,145 CDSs, 12 clustered regularly interspaced short palindromic repeats (CRISPRs) (distributed in 7 regions) and 135 RNAs, including 97 tRNAs, 34 rRNAs, and 4 other ncRNAs in the whole‐genome sequence (Table 1). These genome features are similar to other members in the genus Laceyella such as L. sacchari 1–1 (GenBank WGS accession number is ASZU00000000.1), which has a genome size of 3.32 Mbp and 48.90% G + C content (ANI value with FBKL4.010 was 97.81%) (Kaur, Arora, Kumar, & Mayilraj, 2014), and Laceyella sediminis RHA1 (GenBank WGS accession number is NZ_PVTZ00000000), which has a genome size of 3.38 Mbp and 48.90% G + C content, 75 tRNAs, 39 rRNAs, and 4 ncRNAs (ANI value with FBKL4.010 was 95.57%) (Whitman et al., 2015). FBKL4.010 can be classified into L. sacchari due to the high ANI values observed above the 95% threshold.
Figure 2

Circular graph of Laceyella sacchari FBKL4.010. The innermost circle represents the GC Skew +(green) and GC Skew-(purple), followed by the second circle representing the GC content (black); the outside rings depict the position of RNAs or clusters of orthologous group (COG) annotations of various structures such as CDSs in the genome of the strain

Table 1

Genome features of strain FBKL4.010

FeaturesChromosome
Genome size (bp)3,374,379
G + C content49.19%
Average genome coverage202× (or fold)
Plasmid0
CDS (total)3,199
CDS (coding)3,145
tRNAs97
rRNAs34
ncRNAs4
CRISPRs12
Putative genes3,334
Prophage (incomplete)2
GenBank accession no. CP025943.1
Circular graph of Laceyella sacchari FBKL4.010. The innermost circle represents the GC Skew +(green) and GC Skew-(purple), followed by the second circle representing the GC content (black); the outside rings depict the position of RNAs or clusters of orthologous group (COG) annotations of various structures such as CDSs in the genome of the strain Genome features of strain FBKL4.010

Tetramethylpyrazine‐relevant genes and reconstruction of relevant metabolic pathways

Tetramethylpyrazine, which often present in Chinese liquor‐making systems, is considered to be a major source of soy sauce flavor (Xu, Wu, Fan, & Zhu, 2011) and is a component of Moutai‐flavor liquor (Huo et al., 2017). Tetramethylpyrazine is generated by the Maillard reaction during the Daqu‐making process and is transferred to enter the liquor by distillation (Wu, 2007). With the rapid development of the Chinese liquor industry, research on tetramethylpyrazine has become a major focus (Zhu et al., 2010). In general, the metabolic pathways of acetoin and ammonium (NH3) are two important aspects of microbes producing tetramethylpyrazine in the solid‐state system of Chinese liquor‐making. The acetoin biosynthetic pathway of in bacteria such as B. subtilis has been well studied (Xiao, Hou, Xin, Xi, & Zhao, 2014; Xiao, Liu, Qin, & Xu, 2007; Xu et al., 2011; Zhu, Xu, & Fan, 2009). The formation of NH3 under solid substrate conditions has also been analyzed in recent research (Wu & Xu, 2014). All of the metabolic pathways and key enzymes relevant to tetramethylpyrazine production in B. subtilis under solid‐state fermentation conditions are shown in Figure 3. However, not all genes encoding key enzymes in the pathway were identified in the strain in this study, as indicated in Figure 3 with different colors. However, annotations for two metabolic pathways, the acetoin and NH3 metabolic pathways, were identified in L. sacchari FBKL4.010.
Figure 3

The Bacillus subtilis tetramethylpyrazine metabolic pathway under solid‐state fermentation conditions in Chinese liquor‐making (Wu & Xu, 2014). Bold arrows with solid lines are the major reactions taking place during tetramethylpyrazine formation. Arrows with dotted lines indicate branched metabolic pathways. Red circles and green circles indicate the confirmed and unconfirmed genes encoding key enzymes in this study, respectively. ALS, acetolactate synthase; ALDC, α‐acetolactate decarboxylase; NOD, the enzyme catalytic oxidative decarboxylation; BDH, 2,3‐butanediol dehydrogenase; ADH, amino acid dehydrogenase; AcCoA, acetyl coenzyme A; EMP, Embden–Meyerhof pathway; TCA, tricarboxylic acid cycle. The letter S indicates nonenzymatic spontaneous condensation reactions

The Bacillus subtilis tetramethylpyrazine metabolic pathway under solid‐state fermentation conditions in Chinese liquor‐making (Wu & Xu, 2014). Bold arrows with solid lines are the major reactions taking place during tetramethylpyrazine formation. Arrows with dotted lines indicate branched metabolic pathways. Red circles and green circles indicate the confirmed and unconfirmed genes encoding key enzymes in this study, respectively. ALS, acetolactate synthase; ALDC, α‐acetolactate decarboxylase; NOD, the enzyme catalytic oxidative decarboxylation; BDH, 2,3‐butanediol dehydrogenase; ADH, amino acid dehydrogenase; AcCoA, acetyl coenzyme A; EMP, Embden–Meyerhof pathway; TCA, tricarboxylic acid cycle. The letter S indicates nonenzymatic spontaneous condensation reactions Gene functional annotations of L. sacchari FBKL4.010 allowed us to identify genes encoding key enzymes that are relevant to acetoin metabolism, such as the ilvB and ilvN genes, which encode the large and small subunits of acetolactate synthase (ALS), respectively (GenBank accession no. of protein sequences: AUS09165.1 and AUS09164.1). The bacterial ALS holoenzyme is a tetramer of two identical large subunits and two identical small subunits encoded by two genes of the large subunit and small subunit, respectively (Hershey, Schwartz, Gale, & Abell, 1999). This study is the first time that these genes have been identified in Laceyella sacchari, and they are similar to those of L. sediminis (Chen et al., 2012), and Thermoactinomycetaceae with 98% (large SU), 95% (small SU) amino acid sequence identities (GenBank accession no. of protein sequences: WP_106343376.1 and WP_022739119.1). However, in our study, key genes encoding acetolactate decarboxylase (ALDC) and 2,3‐butanediol dehydrogenase (BDH) could not be identified in the genome of strain FBKL4.010 (green circles in Figure 3). It was supposed that an unknown alternative route of acetoin biosynthesis exists in this strain. Functional annotations also pinpointed genes in NH3 metabolic pathways that encoded proteases, peptidases, and carboxypeptidases. It has been shown that these enzymes are often involved in protein and peptide degradation and in the degradation of proteins into free amino acids (the detailed information is shown in Table 2). Some of these enzymes have good thermostability and thermophilic characteristics (Arndt et al., 2002), indicating that they may play a role in the high‐temperature Daqu‐making process, such as Moutai‐flavor Daqu.
Table 2

Information for proteases and peptidases relevant to the degradation of proteins and peptides in Laceyella sacchari FBKL4.010

FBKL4.010 encoding protein IDLocus tagsDescription of specific proteases and peptidasesSimilarity to described protein and peptide degrading enzymes

AUS09177.1

AUS09178.1

C1X05_10310

C1X05_10315

HslVU is an ATP‐dependent protease consisting of HslU and HslV subunits. HslU markedly stimulates the proteolytic activity of HslV which can slowly degrade specific hydrophobic peptides and polypeptides (Kang et al., 2014; Yoo et al., 1996)

98.95% with the HslU subunit in Laceyella sediminis

99.44% with the HslV subunit HslV in Laceyella sacchari

AUS09873.1C1X05_14280M32 carboxypeptidases are thermostable metalloproteases consisting of two members isolated from the thermophilic bacteria Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP). These enzymes hydrolyze peptides and sequentially release amino acids from the C‐terminus, with a broad specificity toward a wide range of C‐terminal substrates (Lee et al., 1996; Niemirowicz, Parussini, Fernán, & Cazzulo, 2007)96.79% with the M32 Carboxypeptidase in Laceyella sediminis

AUS07652.1

AUS08399.1

C1X05_01430

C1X05_05860

Methionine aminopeptidases (MetAPs) are organized into two classes (types I and II), with type I MetAPs present in Escherichia coli. MetAP can remove N‐terminal methionine residues from polypeptide chains and catalyze the release of several hydrophobic amino acids in addition to methionine (Mitra, Sheppard, Wang, Bennett, & Holz, 2009; Walker & Bradshaw, 2013)

99.60% with the type I methionyl aminopeptidase in Laceyella sediminis

98.80% with the type I methionyl aminopeptidase in Laceyella sediminis

AUS09934.1C1X05_14610Thermitase is a thermostable endoprotease with the ability to convert various food‐relevant substrates into low‐molecular‐weight peptides. This enzyme can be produced by strains in Laceyella sp. and Thermoactinomyces sp. (Jørgensen, Madsen, Vrang, Hansen, & Johnsen, 2013)99.74% with a thermitase member of the peptidase S8 family in Laceyella sediminis
Information for proteases and peptidases relevant to the degradation of proteins and peptides in Laceyella sacchari FBKL4.010 AUS09177.1 AUS09178.1 C1X05_10310 C1X05_10315 98.95% with the HslU subunit in Laceyella sediminis 99.44% with the HslV subunit HslV in Laceyella sacchari AUS07652.1 AUS08399.1 C1X05_01430 C1X05_05860 99.60% with the type I methionyl aminopeptidase in Laceyella sediminis 98.80% with the type I methionyl aminopeptidase in Laceyella sediminis In addition, the genome of strain FBKL4.010 contains genes that encoded ADHs, such as alanine, glutamate, leucine, and proline dehydrogenases (GenBank accession no. of protein sequences: AUS08132.1, AUS08496.1, AUS08677.1, AUS08555.1, AUS09758.1, and AUS08130.1). In a previous study, ADH and aminotransferase were shown to play a role in enzymatic transamination and transdeamination. Free amino acids are transformed into NH3 via catalysis by these enzymes (Wu & Xu, 2014). The identification of all genes encoding enzymes that are relevant to tetramethylpyrazine synthesis indicated the possible presence of the complete tetramethylpyrazine metabolic pathways in L. sacchari FBKL4.010. Finally, we confirmed that all tetramethylpyrazine‐relevant protein sequences encoded in the genome of FBKL4.010 showed the highest protein identity (≥95%) with other members of the family Thermoactinomycetaceae such as L. sediminis and T. vulgaris according to the annotation pipeline using GeneMarkS + v4.3 (Table A1).
Table A1

Protein sequences relevant to tetramethylpyrazine metabolism in Laceyella sacchari FBKL4.010

FBKL4.010 encoding protein IDLocus tagsMost similar protein sequence IDSimilar family/genus/speciesIdentity %Protein name
AUS10426.1 C1X05_08485WP_106342963.1 Laceyella sediminis 99Alpha‐amylase (EC:3.2.1.1)
AUS09165.1 C1X05_10200WP_106343376.1 Laceyella sediminis 98Biosynthetic‐type acetolactate synthase large subunit (EC:2.2.1.6)
AUS09164.1 C1X05_10195WP_022739119.1 Thermoactinomycetaceae 95Acetolactate synthase small subunit (EC:2.2.1.6)
AUS09177.1 C1X05_10310WP_106342640.1 Laceyella sediminis 99HslU‐HslV ATP‐dependent protease ATPase subunit
AUS09178.1 C1X05_10315WP_022736267.1 Laceyella sacchari 99HslU‐HslV ATP‐dependent protease peptidase proteolytic subunit
AUS09873.1 C1X05_14280WP_106342307.1 Laceyella sediminis 97Carboxypeptidase M32 (EC:3.4.17.-)
AUS07652.1 C1X05_01430WP_106342798.1 Laceyella sediminis 99Type I methionyl aminopeptidase (EC: 3.4.11.18)
AUS08399.1 C1X05_05860WP_106341940.1 Laceyella sediminis 99Type I methionyl aminopeptidase (EC: 3.4.11.18)
AUS09934.1 C1X05_14610WP_106342352.1 Laceyella sediminis 99Thermitase‐like peptidase S8 (EC: 3.4.21.-)
AUS10364.1 C1X05_04375WP_106343256.1 Laceyella sediminis 98D‐alanyl‐D‐alanine carboxypeptidase (EC: 3.4.17.14)
AUS08130.1 C1X05_04340WP_102991771.1 Laceyella sp.100Proline dehydrogenase (EC: 1.5.5.2)
AUS08132.1 C1X05_04350WP_106343258.1 Laceyella sediminis 99Alanine dehydrogenase NAD‐binding and catalytic domains (EC:1.4.1.1)
AUS08494.1 C1X05_06400WP_054095413.1 Thermoactinomyces vulgaris 98Glycine cleavage system aminomethyltransferase (EC: 2.1.2.10)
AUS08496.1 C1X05_06410WP_054095412.1 Thermoactinomyces vulgaris 99Aminomethyl‐transferring glycine dehydrogenase (EC:1.4.4.2)
AUS08495.1 C1X05_06405WP_106342890.1 Laceyella sediminis 99Glycine dehydrogenase (EC:1.4.4.2)
AUS09283.1 C1X05_10935WP_029071817.1 Laceyella sacchari 99L‐serine ammonia‐lyase, iron‐sulfur‐dependent, subunit alpha (EC: 4.3.1.17)
AUS09284.1 C1X05_10940WP_054096012.1 Thermoactinomycetaceae 99L‐serine ammonia‐lyase, iron‐sulfur‐dependent, subunit beta (EC: 4.3.1.19)
AUS10223.1 C1X05_16190WP_022737378.1 Thermoactinomyces vulgaris 99

Serine hydroxymethyl transferase

(EC: 2.1.2.1)

AUS08578.1 C1X05_06835WP_022736215.1 Thermoactinomyces vulgaris 99Alanine dehydrogenase (EC:1.4.1.1)
AUS08555.1 C1X05_06715WP_054095392.1 Thermoactinomycetaceae 99Leucine dehydrogenase (EC:1.4.1.9)
AUS09758.1 C1X05_13650WP_029071896.1 Thermoactinomyces vulgaris 99Leucine dehydrogenase (EC:1.4.1.9)
AUS08677.1 C1X05_07360WP_022738428.1 Thermoactinomyces vulgaris 99Glutamate dehydrogenase (EC:1.4.1.3)
AUS08910.1 C1X05_08670WP_106342055.1 Laceyella sediminis 99Glutamate synthase large subunit (EC: 1.4.1.13)
AUS08911.1 C1X05_08675WP_054096311.1 Thermoactinomyces vulgaris 99Glutamate synthase (EC: 1.4.1.14)
In this study, metabolic pathways relevant to tetramethylpyrazine were also reconstructed based on the KEGG PATHWAY and Mapper databases. In the process of reconstructing the metabolic pathways for acetoin, we inferred that a complete Embden–Meyerhof pathway (EMP) exists in the genome of FBKL4.010. Pyruvate is an important precursor that is generated by EMP pathway. Thus, the metabolic pathway from pyruvate to acetoin was reconstructed (Figure A1). Although genes encoding ALS in the genome of FBKL4.010 genome were annotated in this pathway, those genes encoding α‐acetolactate decarboxylase (ALDC) and 2,3‐butanediol dehydrogenase (BDH) could not be confirmed. These results indicate that FBKL4.010 could possibly generate acetolactate. However, how the strain produces acetoin needs to be further tested.
Figure A1

The reconstructed acetoin metabolic pathway in the strain FBKL4.010 based on the KEGG PATHWAY database. EMP represents Embden–Meyerhof pathway. Red marked genes represent genes related to tetramethylpyrazine synthesis on FBKL4.010 genome. The locus tags of marked genes and the ANI value with the most similar genes were shown next to the marked genes

Additionally, metabolic pathways of NH3 production were reconstructed according to the information obtained for FBKL4.010. Two major metabolic pathways of NH3 may exist in strain FBKL4.010. One pathway may start from L‐glutamate and L‐glutamine and finally generate NH3 under the catalysis of glutamate dehydrogenase and glutamate synthase (Glevarec et al., 2004) (Figure A2). Another pathway may start at serine and glycine and generate NH3 under the catalysis of serine hydroxymethyltransferase, glycine dehydrogenase, and aminomethyl‐transferring glycine dehydrogenase (Alhasawi, Castonguay, Appanna, Auger, & Appanna, 2015; Zhang, Wu, & Chen, 2018) (Figure A3). From the above finding of several encoding genes relevant to protein, peptide degradation, and the production of free amino acids, it can be inferred that complete metabolic pathways of NH3 production are present in strain FBKL4.010. Strain FBKL4.010 may produce NH3 as an important precursor of tetramethylpyrazine via this metabolic pathway during the simulated fermentation of Moutai‐flavor Daqu.
Figure A2

The reconstructed result of the ammonium metabolic pathway in the strain FBKL4.010 based on the KEGG PATHWAY database. Red marked genes represent genes related to tetramethylpyrazine synthesis on FBKL4.010 genome. The locus tags of marked genes and the ANI value with the most similar genes were shown next to the marked genes

Figure A3

The reconstructed result of the ammonium metabolic pathway in the strain FBKL4.010 based on the KEGG PATHWAY database. Red marked genes represent genes related to tetramethylpyrazine synthesis on FBKL4.010 genome. The locus tags of marked genes and the ANI value with the most similar genes were shown next to the marked genes

Two gene clusters differing from known tetramethylpyrazine synthesis clusters, encoding ADH and aminotransferase involved in tetramethylpyrazine metabolism, were also predicted in our study. Cluster 1 (from C1X05_04335 to C1X05_04380 loci) consists of ten genes (the PutA and L‐AlaDH genes encoding ADH and the vanY gene encoding D‐alanyl‐D‐alanine carboxypeptidase). Cluster 2 (from C1X05_06405 to C1X05_06425 loci) contains two aminotransferase‐encoding genes (the gcvP and gcvT genes) (Figure 4). With the identification of two new gene clusters, we found that the distribution of some genes related to tetramethylpyrazine biosynthesis was regular. These results provide a guide for identifying other unconfirmed genes in tetramethylpyrazine metabolism from the whole‐genome sequence of strain FBKL4.010. Furthermore, the order of tetramethylpyrazine‐relevant genes in clusters 1 and 2 showed some similarity with other closely related strains, such as T. vulgaris NRRL F‐5595 and Paenibacillus larvae DSM 25719 (Figure 5), and the similarity of the gene clusters was 48% and 32%, respectively. However, we did not find a similar alignment with genes in B. subtilis or other species that produce tetramethylpyrazine. This result may indicate that tetramethylpyrazine‐relevant genes of the strain FBKL4.010 have been notably reordered.
Figure 4

Two gene clusters related to key enzymes in the tetramethylpyrazine metabolic pathways. Lines among arrows represent intergenic spacers in clusters. The length of arrows and lines roughly reflects the gene and intergenic spacer length but not in actual proportions. Abbreviations: putA, proline dehydrogenase; L‐alaDH, alanine dehydrogenase NAD‐binding and catalytic domains; vanY, D‐alanyl‐D‐alanine carboxypeptidase; gcvP, glycine dehydrogenase; gcvT, aminomethyl‐transferring glycine dehydrogenase

Figure 5

A synteny analysis to compare the alignment of the tetramethylpyrazine‐related genes in two gene clusters between the strain FBKL4.010 and other closely related strains by antiSMASH 4.0. Each arrow represents a single gene. Homologous genes share a single color across strains. Vertical lines with corresponding colors connect homologous and conserved genes between the two clusters. Labels next to the arrows indicate gene position in the whole genome

Two gene clusters related to key enzymes in the tetramethylpyrazine metabolic pathways. Lines among arrows represent intergenic spacers in clusters. The length of arrows and lines roughly reflects the gene and intergenic spacer length but not in actual proportions. Abbreviations: putA, proline dehydrogenase; L‐alaDH, alanine dehydrogenase NAD‐binding and catalytic domains; vanY, D‐alanyl‐D‐alanine carboxypeptidase; gcvP, glycine dehydrogenase; gcvT, aminomethyl‐transferring glycine dehydrogenase A synteny analysis to compare the alignment of the tetramethylpyrazine‐related genes in two gene clusters between the strain FBKL4.010 and other closely related strains by antiSMASH 4.0. Each arrow represents a single gene. Homologous genes share a single color across strains. Vertical lines with corresponding colors connect homologous and conserved genes between the two clusters. Labels next to the arrows indicate gene position in the whole genome

CONCLUSION

In conclusion, this is the first report on the complete genome of the tetramethylpyrazine‐producing species L. sacchari. The whole‐genome sequence analysis of L. sacchari FBKL4.010 showed that a set of protein‐coding genes related to tetramethylpyrazine synthesis is present in the genome of this strain. The results of this study provide an opportunity to further understand the tetramethylpyrazine synthesis pathway of L. sacchari in the process of liquor‐making.

CONFLICT OF INTERESTS

None declared.

AUTHOR CONTRIBUTIONS

Shuyi Qiu, Dounan Li, Chunxiao Wang, and Wei Huang conceived the study. Shuyi Qiu, Dounan Li, Chunxiao Wang, and Wei Huang were involved in formal analysis. Shuyi Qiu acquired the funding. Shuyi Qiu and Dounan Li administrated the project. Shuyi Qiu provided resources. Shuyi Qiu and Dounan Li wrote the original draft of the manuscript. Shuyi Qiu, Dounan Li, Chunxiao Wang, and Wei Huang wrote, reviewed, and edited the manuscript. All authors read and approved the manuscript for publication.

ETHICS STATEMENT

None required.
  32 in total

1.  Crystal structure of a novel carboxypeptidase from the hyperthermophilic archaeon Pyrococcus furiosus.

Authors:  Joseph W Arndt; Bing Hao; Vijay Ramakrishnan; Timothy Cheng; Sunney I Chan; Michael K Chan
Journal:  Structure       Date:  2002-02       Impact factor: 5.006

2.  Improved microbial gene identification with GLIMMER.

Authors:  A L Delcher; D Harmon; S Kasif; O White; S L Salzberg
Journal:  Nucleic Acids Res       Date:  1999-12-01       Impact factor: 16.971

Review 3.  [The family Thermoactinomycetaceae--a review].

Authors:  Wendong Xian; Hong Ming; Wenjun Li
Journal:  Wei Sheng Wu Xue Bao       Date:  2015-01-04

4.  Two metallocarboxypeptidases from the protozoan Trypanosoma cruzi belong to the M32 family, found so far only in prokaryotes.

Authors:  Gabriela Niemirowicz; Fabiola Parussini; Fernán Agüero; Juan J Cazzulo
Journal:  Biochem J       Date:  2007-01-15       Impact factor: 3.857

5.  The ATP-dependent CodWX (HslVU) protease in Bacillus subtilis is an N-terminal serine protease.

Authors:  M S Kang; B K Lim; I S Seong; J H Seol; N Tanahashi; K Tanaka; C H Chung
Journal:  EMBO J       Date:  2001-02-15       Impact factor: 11.598

6.  Versatile and open software for comparing large genomes.

Authors:  Stefan Kurtz; Adam Phillippy; Arthur L Delcher; Michael Smoot; Martin Shumway; Corina Antonescu; Steven L Salzberg
Journal:  Genome Biol       Date:  2004-01-30       Impact factor: 13.583

7.  Genome sequencing and annotation of Laceyella sacchari strain GS 1-1, isolated from hot spring, Chumathang, Leh, India.

Authors:  Navjot Kaur; Amit Arora; Narender Kumar; Shanmugam Mayilraj
Journal:  Genom Data       Date:  2013-11-27

8.  Genomic Encyclopedia of Bacterial and Archaeal Type Strains, Phase III: the genomes of soil and plant-associated and newly described type strains.

Authors:  William B Whitman; Tanja Woyke; Hans-Peter Klenk; Yuguang Zhou; Timothy G Lilburn; Brian J Beck; Paul De Vos; Peter Vandamme; Jonathan A Eisen; George Garrity; Philip Hugenholtz; Nikos C Kyrpides
Journal:  Stand Genomic Sci       Date:  2015-05-17

9.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors:  Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal:  Gigascience       Date:  2012-12-27       Impact factor: 6.524

10.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

View more
  2 in total

1.  Comprehensive evaluation of physicochemical properties and antioxidant activity of B. subtilis-fermented polished adlay subjected to different drying methods.

Authors:  Anyan Wen; Likang Qin; Haiying Zeng; Yi Zhu
Journal:  Food Sci Nutr       Date:  2020-03-14       Impact factor: 2.863

2.  Isolation, identification, and whole-genome sequencing of high-yield protease bacteria from Daqu of ZhangGong Laojiu.

Authors:  Yanbo Liu; Junying Fu; Linlin Wang; Zhijun Zhao; Huihui Wang; Suna Han; Xiyu Sun; Chunmei Pan
Journal:  PLoS One       Date:  2022-04-26       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.