| Literature DB >> 35598016 |
Rihab Ameri1, José Luis García2,3, Amel Bouanane Derenfed4, Nathalie Pradel5, Sawssan Neifar1, Sonia Mhiri1, Monia Mezghanni1, Nadia Zaraî Jaouadi1, Jorge Barriuso2,3, Samir Bejar6.
Abstract
BACKGROUND: Omics approaches are widely applied in the field of biology for the discovery of potential CAZymes including whole genome sequencing. The aim of this study was to identify protein encoding genes including CAZymes in order to understand glycans-degrading machinery in the thermophilic Caldicoprobacter algeriensis TH7C1T strain.Entities:
Keywords: Biotechnological applications; CAZymes, CGC; Caldicoprobacter algeriensis TH7C1T
Mesh:
Substances:
Year: 2022 PMID: 35598016 PMCID: PMC9124407 DOI: 10.1186/s12934-022-01818-0
Source DB: PubMed Journal: Microb Cell Fact ISSN: 1475-2859 Impact factor: 6.352
Fig. 1Graphical Circular genome map of Caldicoprobacter algeriensis TH7C1T generated using CGView. From outside to inside, ring 1 represents the 45 assembled contigs. Ring 2 shows the GC skew with green indicating positive values and purple indicating negative values. The GC content is represented by the inner most ring
Comparison of genome features between C. algeriensis, C. faecalis, C. oshimai and C. guelmensis
| Total Sequence Length (bp) | 2,535,023 | 2,579,145 | 2,693,766 | 2,398,524 |
| Number of contigs | 45 | 87 | 427 | 20 |
| Longest Sequences (bp) | 255,969 | 158,877 | – | 505,220 |
| N50 (bp) | 121,172 | 78,525 | 10,681 | 394,366 |
| GC content (%) | 44.9 | 44.9 | 45.4 | 43.7 |
| Number of CDSs | 2425 | 2337 | 2618 | 2150 |
| Average Protein Length | 297.1 | – | – | – |
| Number of RNAs | 53 | 62 | 71 | 60 |
Fig. 2An overview of the RAST annotation and subsystems distribution for the C. algeriensis genome
Comparison of predicted CAZymes of C. algeriensis with those available in CAZy database using DIAMOND tool in dbCAN
| Query ID | Best Blast hit ID | % identical | Length | Query Start | Query End | Subject Start | Subject End | E Value |
|---|---|---|---|---|---|---|---|---|
| NODE_1_98 | AFM44649.1|GH3|3.2.1.37|3.2.1.37 | 69.6 | 782 | 4 | 782 | 9 | 788 | 0.0e+00 |
| NODE_1_142 | QBQ55251.1|CBM50| | 41.6 | 878 | 1 | 867 | 19 | 889 | 4.3e−173 |
| NODE_2_26 | CAN65674.1|GH28| | 51.4 | 552 | 73 | 618 | 111 | 642 | 2.2e−158 |
| NODE_2_32 | AWP04987.1|GT13| | 37.9 | 939 | 10 | 854 | 100 | 1037 | 6.2e−172 |
| NODE_2_34 | AUS96821.1|GH4| | 83.7 | 436 | 3 | 438 | 2 | 437 | 4.1e−225 |
| NODE_2 _43 | QAV32893.1|GH3| | 65.8 | 719 | 6 | 717 | 4 | 722 | 4.0e−278 |
| NODE_2_48 | QTM98178.1|GH35| | 40.1 | 770 | 8 | 770 | 2 | 757 | 1.1e−172 |
| NODE_2_73 | QEX52072.1|GH29| | 70.4 | 422 | 1 | 422 | 3 | 424 | 2.3e−196 |
| NODE_2_91 | QBQ55156.1|CBM50| | 36.6 | 658 | 164 | 820 | 62 | 707 | 2.3e−112 |
| NODE_2_93 | ADH60031.1|GH67| | 76.9 | 684 | 5 | 686 | 10 | 690 | 0.0e+00 |
| NODE_2_112 | ADH61394.1|GH127| | 58.5 | 650 | 5 | 651 | 8 | 648 | 1.7e−235 |
| NODE_2_119 | AZR73767.1|GH1| | 63.7 | 443 | 6 | 447 | 6 | 445 | 2.3e−178 |
| NODE_2_195 | QTH41126.1|GH30_1| | 62.0 | 447 | 1 | 445 | 2 | 447 | 1.0e−175 |
| NODE_2_205 | QGG57161.1|GH2| | 63.2 | 560 | 1 | 560 | 1 | 550 | 4.9e−221 |
| NODE_3_3 | AQQ10327.1|CBM38|GH32| | 47.2 | 557 | 11 | 538 | 201 | 751 | 9.7e−152 |
| NODE_3_5 | BBH23071.1|GH151| | 37.5 | 677 | 4 | 672 | 2 | 670 | 1.1e−147 |
| NODE_3_18 | AEE96970.1|GH109| | 78.1 | 375 | 3 | 377 | 2 | 374 | 1.2e−180 |
| NODE_3_20 | AEE96984.1|GH148|GH30| | 62.0 | 900 | 28 | 922 | 1290 | 2181 | 0.0e+00 |
| NODE_3_28 | QHT63269.1|GH28| | 44.3 | 422 | 19 | 438 | 31 | 450 | 1.1e−103 |
| NODE_3_33 | AEE96972.1|GH106| | 53.6 | 1009 | 1 | 1007 | 1 | 999 | 0.0e+00 |
| NODE_3_34 | ALS29147.1|GH140| | 76.0 | 437 | 3 | 439 | 8 | 442 | 1.3e−213 |
| NODE_3_ 39 | AEM77894.1|GH2| | 60.4 | 835 | 1 | 834 | 1 | 823 | 0.0e+00 |
| NODE_3_88 | AHO16406.1|GH13_11| | 61.0 | 367 | 9 | 371 | 3 | 367 | 7.4e−123 |
| NODE_3_137 | SNX53392.1|GH42| | 72.2 | 684 | 5 | 688 | 3 | 686 | 0.0e+00 |
| NODE_3_151 | AEE96723.1|GH3| | 56.8 | 368 | 119 | 486 | 101 | 468 | 1.1e−120 |
| NODE_4_38 | SMX54901.1|GH38| | 59.6 | 1026 | 1 | 1024 | 1 | 1022 | 0.0e+00 |
| NODE_4_77 | CCO21038.1|GH11|3.2.1.8| 3.2.1.8 | 52.3 | 622 | 5 | 625 | 24 | 634 | 3.7e−195 |
| NODE_4_111 | QNB44994.1|CBM34|GH13_39| | 60.7 | 646 | 6 | 650 | 5 | 649 | 2.6e−247 |
| NODE_4_112 | QCX33357.1|GT5| | 63.8 | 475 | 2 | 476 | 1 | 472 | 6.2e−182 |
| NODE_4_113 | QCX33360.1|CBM48|GH13_9| | 70.6 | 622 | 8 | 629 | 4 | 620 | 2.1e−278 |
| NODE_5_24 | AXM88146.1|GH65| | 52.0 | 782 | 1 | 777 | 1 | 778 | 1.1e−241 |
| NODE_5_25 | QUH28797.1|GH13_18| | 56.8 | 555 | 18 | 572 | 8 | 553 | 7.1e−185 |
| NODE_5_27 | AUS95782.1|GH51| | 73.7 | 490 | 1 | 490 | 1 | 490 | 4.9e−235 |
| NODE_5_125 | AZN38808.1|PL33_1| | 48.1 | 626 | 1 | 626 | 1 | 623 | 4.6e−185 |
| NODE_6_132 | QAT62855.1|GT8| | 64.6 | 268 | 1 | 268 | 1 | 268 | 2.6e−103 |
| NODE_7_1 | AEF18092.1|GH106| | 65.9 | 1005 | 1 | 1001 | 1 | 1001 | 0.0e+00 |
| NODE_7_17 | AWZ17133.1|GH13_30| | 41.4 | 1205 | 4 | 1159 | 406 | 1579 | 1.2e−247 |
| NODE_8_2 | AJO67863.1|GH3| | 71.2 | 712 | 20 | 731 | 11 | 721 | 0.0e+00 |
| NODE_8_11 | AEE96371.1|GT51| | 43.5 | 839 | 6 | 830 | 13 | 822 | 1.7e−183 |
| NODE_8_27 | ABB14581.1|GT51| | 40.4 | 774 | 38 | 801 | 11 | 757 | 2.1e−153 |
| NODE_8_48 | QCW79542.1|GH101| | 54.6 | 504 | 12 | 515 | 583 | 1068 | 1.4e−155 |
| NODE_9_39 | QSL64261.1|GT58| | 39.1 | 504 | 15 | 480 | 202 | 683 | 4.7e−105 |
| NODE_9_41 | AEE95239.1|GH38| | 61.6 | 921 | 19 | 938 | 42 | 961 | 0.0e+00 |
| NODE_9_43 | AEE97446.1|GH151| | 53.7 | 659 | 4 | 657 | 6 | 662 | 1.5e−215 |
| NODE_9_45 | QGT51146.1|GT28| | 55.1 | 365 | 1 | 364 | 1 | 365 | 3.1e−117 |
| NODE_9_60 | ACX65278.1|GH33| | 56.9 | 334 | 1 | 328 | 1 | 323 | 1.5e−110 |
| NODE_10_1 | ADU28965.1|CBM4|GH16_3| | 43.2 | 844 | 5 | 729 | 500 | 1333 | 8.0e−177 |
| NODE_10_ 30 | QVK18514.1|GH0| | 30.4 | 828 | 50 | 855 | 61 | 856 | 3.9e−110 |
| NODE_10_34 | AXR85444.1|GH9|3.2.1.151|3.2.1.4|3.2.1.6|3.2.1.151|3.2.1.4|3.2.1.6 | 54.9 | 539 | 153 | 685 | 1 | 533 | 4.1e−171 |
| NODE_10_35 | AEE95681.1|GH10| | 80.6 | 340 | 35 | 374 | 39 | 378 | 1.5e−175 |
| NODE_10_41 | QSQ07709.1|GH94|GT84| | 60.2 | 2866 | 1 | 2865 | 1 | 2857 | 0.0e+00 |
| NODE_11_1 | AEE95836.1|GH116| | 73.2 | 880 | 1 | 879 | 1 | 876 | 0.0e+00 |
| NODE_14_60 | AEV67980.1|GT4| | 70.3 | 391 | 2 | 392 | 1 | 391 | 1.4e−163 |
| NODE_17_1 | QCT01556.1|CBM4|CBM54| GH16_3| | 71.5 | 256 | 199 | 454 | 440 | 695 | 1.1e−111 |
| NODE_17_20 | AXG41146.1|GH6| | 35.3 | 842 | 6 | 828 | 12 | 840 | 5.5e−128 |
| NODE_17_35 | AFM44649.1|GH3|3.2.1.37 |3.2.1.37 | 68.6 | 784 | 4 | 784 | 9 | 789 | 0.0e+00 |
| NODE_18_30 | AXG41146.1|GH6| | 38.4 | 852 | 6 | 782 | 12 | 848 | 1.3e−128 |
| NODE_18_36 | BBH86227.1|GH2| | 60.4 | 1038 | 10 | 1041 | 11 | 1025 | 0.0e+00 |
| NODE_19_14 | AEE95407.1|GH2| | 73.7 | 582 | 1 | 582 | 1 | 571 | 2.1e−269 |
| NODE_21_5 | SNV81985.1|GT28| | 56.5 | 352 | 2 | 353 | 3 | 353 | 1.3e−117 |
| NODE_22_2 | BAZ22920.1|GT4| | 60.8 | 357 | 1 | 357 | 451 | 807 | 1.3e−131 |
| NODE_23_13 | AUS96695.1|GH31| | 78.6 | 774 | 1 | 774 | 1 | 773 | 0.0e+00 |
| NODE_25_1 | QUI22575.1|CBM50| | 38.5 | 589 | 7 | 593 | 34 | 621 | 9.5e−124 |
| NODE_25_5 | QMV43391.1|GH95| | 54.6 | 799 | 13 | 804 | 8 | 787 | 1.2e−259 |
| NODE_26_23 | AEE96522.1|GH50| | 51.8 | 467 | 1 | 464 | 65 | 528 | 4.2e−143 |
| NODE_30_9 | AEM77732.1|GH3| | 74.1 | 509 | 1 | 508 | 17 | 525 | 3.5e−220 |
| NODE_31_9 | VUW72999.1|AA1| | 51.0 | 451 | 166 | 612 | 182 | 620 | 8.4e−115 |
Fig. 3Schematic representation of the predicted 33 CAZyme Gene Clusters (CGCs) showing organization of genes in each cluster. CAZymes genes are colored red, TC (Transporters Classification) are colored green, TF (Transcription Factor) are colored blue. Non-signature genes, which can be inserted between signature genes, are colored gray
Targeted substrates predicted for CAZymes genes clusters
| CGC | Predicted CAZyme family | Targeted substrates |
|---|---|---|
| 1 | GH3 | Xylan |
| 2 | CE19 | Pectin |
| 3 | GH4 | Melibiose, raffinose |
| 4 | GH3-GH35 | Xylan, galactan |
| 5 | CE1 | Xylan |
| 6 | GH67 | Arabinoxylan |
| 7 | CBM25 | Starch |
| 8 | GH30-1 | Beta glucan, xylan |
| 9 | GH2 | Hemicellulose |
| 10 | CBM66-GH32-GH151 | Fructans |
| 11 | GH109-GH109 | – |
| 12 | GH3 | Chitin, xylan |
| 13 | GH109-GH109 | – |
| 14 | GH38 | Alpha mannan |
| 15 | GH13_39/CBM34-GT5-GH13_9/CBM48 | Starch, glycogen |
| 16 | GH109-GH65-GH13_18-GH51 | Arabinan |
| 17 | PL33-1 | |
| 18 | GH106 | Rhamnose |
| 19 | AA1 | – |
| 20 | GH3 | Xyloglucan |
| 21 | GT51 | – |
| 22 | GT51 | – |
| 23 | GT2 | – |
| 24 | GH109 | – |
| 25 | GH5-25/GH9-GH10-GT84/GH94 | Cellulose, hemicellulose |
| 26 | GH109 | – |
| 27 | CBM50 | – |
| 28 | GH109 | – |
| 29 | GH109-GH2 | Hemicellulose |
| 30 | GH2 | Hemicellulose |
| 31 | GT4 | – |
| 32 | GT2-GT4-GT4-GT2 | Polysaccharides, |
| 33 | GH23 | peptidoglycan |