| Literature DB >> 30233508 |
Ruidan Qu1, Xiaoyu Zhu1, Min Tian1, Yingjie Liu1, Wenjuan Yan1, Jian Ye1, Hongliang Gao1, Jing Huang1.
Abstract
Recently, an enzyme named protein-glutaminase (PG) has been identified as a new type of enzyme with significant potential for deamidation of food proteins. The enzyme is shown to be expressed as a pre-pro-protein with a putative signal peptide of 21 amino acids, a pro-sequence of 114 amino acids, and a mature PG of 185 amino acids. The microbial enzyme PG specifically catalyzes deamidation of proteins without protein hydrolysis pretreatment and only reacts with glutamine residues in the side-chains of proteins or long peptides. All these attributes suggest that it has a great potential for food industrial applications. However, until recently, there have been relatively few studies of the PG-producing strains. A strain named Chryseobacterium proteolyticum QSH1265 which can produce PG was isolated from a soil sample collected in Songjiang, Shanghai, China. Its enzyme activity was about 0.34 ± 0.01 U/mL when using carboxybenzoxy-Gln-Gly as a substrate. The strain can produce acid from D-glucose, maltose, L-arabinose sucrose, glycerol, and mannitol but not fructose, and it is also positive for indole production and urease. Here we describe the complete genome sequence of this strain via PacBio RSII sequencing. The C. proteolyticum QSH1265 genome consists of a circular chromosome with total length of 4,849,803 bp without any plasmids. All of 4563 genes were predicted including 4459 genes for protein-coding and 104 RNA-relative genes with an average G+C content of 36.16%. The KEGG and COG annotation provide information for the specific function of proteins encoded in the genome, such as proteases, chromoproteins, stress proteins, antiporters, etc. A highly conserved hypothetical protein shares a promoter with the gene encoding the protein-glutaminase enzyme. The genome sequence and preliminary annotation provide valuable genetic information for further study of C. proteolyticum.Entities:
Keywords: Chryseobacterium proteolyticum; annotation; deamidation; protein-glutaminase; third-generation sequencing
Year: 2018 PMID: 30233508 PMCID: PMC6132073 DOI: 10.3389/fmicb.2018.01975
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Classification and general features of C. proteolyticum QSH1265 according to the MIGS recommendations.
| MIGS | Property | Term | Evidence code |
|---|---|---|---|
| Classification | Domain | TAS | |
| Phylum | TAS | ||
| Class | TAS | ||
| Order | TAS | ||
| Family | TAS | ||
| Genus | TAS | ||
| Species | TAS | ||
| Strain: | TAS | ||
| Gram stain | Gram-negative | TAS | |
| Cell shape | Rod | TAS | |
| Motility | Non-motility | TAS | |
| Sporulation | None | TAS | |
| Temperature range | Mesophile | TAS | |
| Optimum temperature | 36–37°C | TAS | |
| pH range | pH 5.0–11.0 | TAS | |
| Carbon source | Glucose, sucrose, mannitol | TAS | |
| MIGS-6 | Habitat | Soil, fresh water, fish, diary food | IDA |
| MIGS-22 | Oxygen requirement | Aerobic | TAS |
| MIGS-23 | soil | ||
| MIGS-15 | Biotic relationship | Free-living | IDA |
| MIGS-14 | Pathogenicity | Safe | TAS |
| MIGS-4 | Geographic location | Shanghai, China | This study |
| MIGS-5 | Sample collection time | 2014 | This study |
| MIGS-4.1 | Longitude | 121.24 | This study |
| MIGS-4.2 | Latitude | 31.0 | This study |
| MIGS-4.3 | Depth | 0.1–0.5 m above the surface | This study |
| MIGS-4.4 | Altitude | 98 m | This study |
Genome sequencing project information for C. proteolyticum QSH1265.
| MIGS | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | 2 PacBio SMRT cells |
| MIGS-29 | Sequencing platforms | PacBio RS II Illumina Miseq |
| MIGS-30 | Assembler | HGAP |
| MIGS-32 | Genome database release GenBank ID Bioproject accession Data type | GenBank SRR7156726 PRJNA471029 Genome sequencing |
Nucleotide content and gene count levels of genome.
| Attribute | Value | % of total |
|---|---|---|
| Size (bp) | 4,849,803 | 100.00 |
| G+C content (bp) | 1,753,689 | 36.16 |
| Coding regions (bp) | 4,324,467 | 89.17 |
| Total genes | 4,563 | 100.00 |
| RNA genes | 104 | 2.28 |
| Protein-coding genes | 4,459 | 97.72 |
| Genes assigned to COGs | 3,759 | 82.38 |
| Genes with signal peptides | 489 | 10.97 |
| Genes with transmembrane helices | 906 | 20.32 |
Number of genes associated with the 25 general COG functional categories.
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 145 | 3.25 | Translation, ribosomal structure, and biogenesis |
| A | 0 | 0.00 | RNA processing and dynamics |
| K | 221 | 4.96 | Transcription |
| L | 121 | 2.71 | Replication, recombination, and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 16 | 0.36 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.00 | Nuclear structure |
| V | 58 | 1.30 | Defense mechanisms |
| T | 127 | 2.85 | Signal transduction mechanisms |
| M | 178 | 3.99 | Cell wall/membrane/envelope biogenesis |
| N | 5 | 0.11 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 0 | 0.00 | Extracellular structure |
| U | 33 | 0.74 | Intracellular trafficking, secretion, and vesicular transport |
| O | 94 | 2.11 | Posttranslational modification, protein turnover, chaperones |
| C | 153 | 3.43 | Energy production and conversion |
| G | 175 | 3.92 | Carbohydrate transport and metabolism |
| E | 235 | 5.27 | Amino acid transport and metabolism |
| F | 57 | 1.28 | Nucleotide transport and metabolism |
| H | 114 | 2.56 | Coenzyme transport and metabolism |
| I | 118 | 2.65 | Lipid transport and metabolism |
| P | 152 | 3.41 | Inorganic ion transport and metabolism |
| Q | 92 | 2.06 | Secondary metabolites biosynthesis, transport, and catabolism |
| R | 399 | 8.95 | General function prediction only |
| S | 182 | 4.08 | Function unknown |
| – | 1784 | 40.01 | Not in COGs |
Sequence structure and general information of motifs in the whole genome.
| Motif string | CenterPos | Type | Fraction | nDetectd | nGenome | MeanScore | meanCoverage |
|---|---|---|---|---|---|---|---|
| GGANNNNNTCG | 2 | m6A | 0.9815 | 902 | 919 | 212.969 | 178.49 |
| CGANNNNNTCC | 2 | m6A | 0.9793 | 900 | 919 | 208.0344 | 179.42 |