| Literature DB >> 31824458 |
Kamyar Mogodiniyai Kasmaei1, John Sundh2.
Abstract
Feruloyl esterases (FAEs) can reduce the recalcitrance of lignocellulosic biomass to enzymatic hydrolysis, thereby enhancing biorefinery potentials or animal feeding values of the biomass. In addition, ferulic acid, a product of FAE activity, has applications in pharmaceutical and food/beverage industries. It is therefore of great interest to identify new FAEs to enhance understanding about this enzyme family. For this purpose, we used whole-genome shotgun metagenomics and genome binning to explore rumens of dairy cows, large intestines of horses, sediments of freshwater and forest topsoils to identify novel prokaryotic FAEs and trace the responsible microorganisms. A number of prokaryotic genomes were recovered of which, genomes of Clostridiales order and Candidatus Rhabdochlamydia genus showed FAE coding capacities. In total, five sequences were deemed as putative FAE. The BLASTP search against non-redundant protein database of NCBI indicated that these putative FAEs represented novel sequences within this enzyme family. The phylogenetic analysis showed that at least three putative sequences shared evolutionary lineage with FAEs of type A and thus could possess specific activities similar to this type of FAEs, something that is not previously found outside fungal kingdom. We nominate Candidatus Rhabdochlamydia genus as a novel FAE producing taxonomic unit.Entities:
Keywords: biorefinery; de novo assembly; lignocellulosic biomass; phylogenetic analysis; sequence motif; taxonomic classification
Year: 2019 PMID: 31824458 PMCID: PMC6879456 DOI: 10.3389/fmicb.2019.02673
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Protocol used to annotate predicted proteins as putative feruloyl esterases (FAEs).
Assembly statistics of Cow, Horse, Sediment, and Soil datasets.
| Cow | 157,074 | 180,624 | 42,765 | 2,853 | 128,498 | 1,662 |
| Horse | 92,420 | 251,279 | 19,700 | 3,548 | 73,073 | 1,709 |
| Sediment | 25,588 | 70,080 | 7,787 | 2,485 | 21,338 | 1,622 |
| Soil | 45,259 | 514,904 | 13,027 | 2,546 | 37,505 | 1,627 |
FIGURE 2Prokaryotic composition (family level) of Cow, Horse, Sediment, and Soil ecosystems based on classification of reads by Kaiju. Families shown make up at least 75% of the community (together with the unclassified and unassigned sequences). “unclassified” sequences have no classification by Kaiju and “unassigned” sequences are those not assigned to a non-viral species.
FIGURE 3Number of binned genomes per taxon (order level) recovered from Cow, Horse, Sediment, and Soil datasets.
BLASTP scores of putative proteins annotated as members of IPR011118, IPR010126, and IPR002921 entries of InterPro database.
| ELP382 | Cow.1 | IPR002921 | A2QSY5 | 31 | 3E-03 | 26 | Hypothetical protein (WP_103986834.1) | 245 | 2E-65 | 32 |
| KFM882 | Cow.4 | IPR010126 | A1CC33 | 32 | 8E-04 | 21 | Hypothetical protein (WP_092995296.1) | 861 | 0E + 00 | 97 |
| GEF307 | Cow.5 | IPR010126 | A1CC33 | 52 | 1E-10 | 27 | Esterase (WP_122274278.1) | 319 | 5E-106 | 55 |
| JPJ405 | Cow.7 | IPR002921 | A1CC33 | 24 | 3E-01 | 35 | VWA domain-containing protein (WP_042172771.1) | 184 | 3E-46 | 31 |
| OJJ032 | Cow.9 | IPR002921 | B8NIB8 | 30 | 6E-03 | 24 | Hypothetical protein (WP_093044389.1) | 453 | 2E-145 | 44 |
| BPL864 | Cow.11 | IPR010126 | Q0CDX2 | 49 | 2E-09 | 31 | Hypothetical protein (CCX69434.1) | 327 | 5E-105 | 40 |
| Cow.15 | IPR002921 | Q0CBM7 | 32 | 1E-03 | 26 | Hypothetical protein (WP_120429016.1) | 223 | 4E-58 | 30 | |
| IDJ033 | Cow.15 | IPR002921 | Q9P979 | 29 | 9E-03 | 30 | Hypothetical protein (WP_081669054.1) | 155 | 4E-37 | 34 |
| EMM549 | Cow.16 | IPR010126 | B8M9H9 | 38 | 7E-06 | 29 | Hypothetical protein (WP_092995296.1) | 488 | 6E-171 | 86 |
| GMA315 | Cow.28 | IPR010126 | G2QND5 | 53 | 7E-11 | 25 | Hypothetical protein (WP_081861271.1) | 457 | 2E-159 | 74 |
| GMA314 | Cow.28 | IPR010126 | A1CC33 | 48 | 2E-09 | 35 | Hypothetical protein (WP_081861271.1) | 433 | 5E-151 | 79 |
| BEP156 | Horse.7 | IPR002921 | Q0CBM7 | 39 | 3E-06 | 31 | Lipase family protein (WP_087378587.1) | 166 | 3E-44 | 31 |
| Horse.7 | IPR002921 | Q2UNW5 | 37 | 3E-05 | 22 | Hypothetical protein (WP_073288296.1) | 149 | 2E-33 | 36 | |
| BLI323 | Horse.8 | IPR010126 | G2QND5 | 56 | 3E-11 | 31 | Hypothetical protein (WP_028520965.1) | 87 | 4E-17 | 65 |
| NAH160 | Horse.14 | IPR011118 | B8NPT0 | 103 | 1E-26 | 26 | Tannase/FAE family α/β hydrolase (WP_106055381.1) | 209 | 2E-58 | 32 |
| FOA763 | Horse.16 | IPR010126 | Q9Y871 | 223 | 2E-69 | 45 | FAE (WP_101478763.1) | 1023 | 0E + 00 | 100 |
| FOA089 | Horse.16 | IPR010126 | Q9Y871 | 245 | 1E-77 | 46 | polyhydroxybutyrate depolymerase (RAR66513.1) | 1036 | 0E + 00 | 100 |
| FOA043 | Horse.16 | IPR010126 | Q9Y871 | 81 | 4E-19 | 26 | Carbohydrate-binding protein CenC (WP_101478973.1) | 1009 | 0E + 00 | 100 |
| IIC869 | Horse.17 | IPR010126 | Q9Y871 | 27 | 2E-02 | 34 | Hypothetical protein (WP_025834368.1) | 322 | 1E-103 | 58 |
| CCB829 | Horse.19 | IPR010126 | Q9Y871 | 52 | 2E-10 | 26 | Poly(3-hydroxybutyrate) depolymerase (CDA95053.1) | 266 | 2E-85 | 46 |
| OCA543 | Horse.20 | IPR010126 | Q9HGR3 | 42 | 8E-07 | 25 | Hypothetical protein (WP_117574921.1 or WP_118573219.1) | 676 | 0E + 00 | 60 |
| Horse.22 | IPR002921 | Q0CBM7 | 25 | 1E-01 | 32 | DUF2974 domain-containing protein (WP_073565233.1) | 59 | 3E-06 | 36 | |
| Horse.24 | IPR002921 | Q9P979 | 22 | 1E + 00 | 24 | Lipase Class 3 (WP_014271472.1) | 53 | 3E-04 | 27 | |
| KKH120 | Horse.24 | IPR002921 | A2QSY5 | 32 | 2E-03 | 26 | Lipase class 3 (WP_014271472.1) | 108 | 4E-23 | 29 |
| KKH742 | Horse.24 | IPR002921 | Q0CBM7 | 31 | 1E-03 | 28 | Hypothetical protein (WP_124756111.1) | 136 | 2E-33 | 33 |
| KKH437 | Horse.24 | IPR002921 | Q0CBM7 | 25 | 2E-01 | 24 | VWA domain-containing protein (WP_042172771.1) | 210 | 6E-53 | 28 |
| HEH134 | Horse.25 | IPR002921 | B8NIB8 | 34 | 7E-05 | 29 | Hypothetical protein (PWM34645.1) | 163 | 3E-44 | 43 |
| MMI830 | Horse.30 | IPR010126 | G2QND5 | 63 | 9E-14 | 26 | Phospholipase/carboxylesterase (EGG54990.1) | 285 | 1E-83 | 42 |
| BKD217 | Horse.31 | IPR010126 | Q9HGR3 | 41 | 1E-06 | 27 | Hypothetical protein (WP_093122987.1) | 200 | 5E-58 | 39 |
| JJL430 | Horse.32 | IPR002921 | Q0CVS2 | 26 | 1E-01 | 39 | Lipase Class 3 (CDC29637.1) | 127 | 4E-27 | 36 |
| GNF552 | Horse.43 | IPR010126 | G2QND5 | 46 | 1E-08 | 28 | Poly(3-hydroxybutyrate) depolymerase (CDD18994.1) | 341 | 1E-113 | 58 |
| NEB278 | Sediment.2 | IPR002921 | Q0CBM7 | 52 | 2E-10 | 31 | Lipase family protein (RPJ12008.1) | 656 | 0E + 00 | 95 |
| Sediment.5 | IPR002921 | B8NIB8 | 26 | 8E-02 | 24 | Hypothetical protein (PWU16597.1) | 950 | 0E + 00 | 78 | |
| DAH257 | Soil.2 | IPR011118 | B8NPT0 | 179 | 9E-53 | 30 | Tannase/FAE family α/β hydrolase (RZM34741.1) | 491 | 5E-166 | 50 |
| DAH259 | Soil.2 | IPR010126 | G2QND5 | 33 | 3E-04 | 33 | Hypothetical protein (OLB12881.1) | 362 | 5E-121 | 59 |
Location of the serine active site motif (PS00120) in putative feruloyl esterases (FAEs) (see Supplementary Sequence File S1 for complete sequences).
| IDJ731 | 613 | 162–171 | VLLTGYSRGA |
| BEP310 | 684 | 197–206 | IFITGHSRGA |
| LLA035 | 305 | 195–204 | VYLTGHSLGG |
| KKH736 | 336 | 183–192 | LYIIGHSLGS |
| ELA265 | 600 | 365–374 | LEITGHSLGG |
Taxonomic classification of binned genomes with feruloyl esterase (FAE) coding capacities.
| Cow.15 | IDJ731 | 1.7 | 69.56 | 3.52 | o_Clostridiales |
| Horse.7 | BEP310 | 1.7 | 90.88 | 0.79 | g_Ruminococcus |
| Horse.22 | LLA035 | 1.9 | 72.11 | 2.57 | f_Lachnospiraceae |
| Horse.24 | KKH736 | 1.1 | 69.13 | 0 | f_Ruminococcaceae |
| Sediment.5 | ELA265 | 0.8 | 49.81 | 1.19 | g_Candidatus Rhabdochlamydia |
FIGURE 4Phylogenetic relationships of feruloyl esterases (FAEs) predicted in this study (in bold) with reference FAEs, using maximum likelihood (A) and neighbor joining (B) methods. Confidence values in panels (A) and (B) represent Shimodaira–Hasegawa support values (1000 resampling) and bootstrap values (1000 bootstraps), respectively. Leaf nodes in green are the outgroup. Leaf naming is “