| Literature DB >> 35918655 |
Merlin Bardin1, Pierre Rousselot-Pailley1, Thierry Tron1, Viviane Robert2.
Abstract
BACKGROUND: DIRs are mysterious protein that have the ability to scavenge free radicals, which, are highly reactive with molecules in their vicinity. What is even more fascinating is that they carry out from these highly unstable species, a selective reaction (i.e., stereoenantioselective) from a well-defined substrate to give a very precise product. Unfortunately, to date, only three products have been demonstrated following studies on DIRs from the plant world, which until now was the kingdom where these proteins had been demonstrated. Within this kingdom, each DIR protein has its own type of substrate. The products identified to date, have on the other hand, a strong economic impact: in agriculture for example, the biosynthesis of (+)-gossypol could be highlighted (a repellent antifood produced by the cotton plant) by the DIRs of cotton. In forsythia plant species, it is the biosynthesis of (-)-pinoresinol, an intermediate leading to the synthesis of podophyllotoxine (a powerful anicancerous agent) which has been revealed. Recently, a clear path of study, potentially with strong impact, appeared by the hypothesis of the potential existence of protein DIR within the genomes of prokaryotes. The possibility of working with this type of organism is an undeniable advantage: since many sequenced genomes are available and the molecular tools are already developed. Even easier to implement and working on microbes, of less complex composition, offers many opportunities for laboratory studies. On the other hand, the diversity of their environment (e.g., soil, aquatic environments, extreme environmental conditions (pH, temperature, pressure) make them very diverse and varied subjects of study. Identifying new DIR proteins from bacteria means identifying new substrate or product molecules from these organisms. It is the promise of going further in understanding the mechanism of action of these proteins and this will most likely have a strong impact in the fields of agricultural, pharmaceutical and/or food chemistry.Entities:
Keywords: Bacteria; Bacteria candidate; Bioinformatic analysis; Dirigent protein (DIRs); High product quality; Plant; Prokaryote; Radical; Stereoselectivity
Mesh:
Substances:
Year: 2022 PMID: 35918655 PMCID: PMC9344732 DOI: 10.1186/s12859-022-04832-6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1Mechanism of functioning of DIRs interacting with mono- or bis-quinone methides intermediates. A In lignans forming-DIR, Example of FiDIR1 for (1)-pinoresinol forming DP and atDIR6 for (+d)-pinoresinol forming DP. B In terpenoids forming-DIR GhDIR4 and C in Pterocarpan forming-DIR as GePTS1 (adapted from [10]). FiDIR1, Forsythia intermedia (−)-pinoresinol–forming DIR, AtDIR6, A. thaliana (+)-pinoresinol–forming DIR, GhDIR4, Gossipium.hirsutum-gossypol–forming DIR, GePTS1, Glycyrrhiza. echinata pterocarpan synthase 1
Fig. 2HMM Dirigent domain profile of Pfam PF03018 DIR family from available sequenced genomes
Summary of information on the 49 bacteria having a DIRL protein
| n° | Bacteria species | Coloration of Gram | UniprotAccession number of DIRL | Secondary metabolism clusters via AntiSMASH version 6.0 | Protein size (AA) | Identity % with AtDIR6 (clustalW) (%) | Similarity % with AtDIR6 (%) | Peptide signal prediction size (AA) by The SignalP 5.0 tool | S–S bridge prediction position/probability Score by AlphaFold | N-Glycosylation prediction in prokaryotes via Glycopp with score > 0,8 | Bacterial family number via phylogenie results |
|---|---|---|---|---|---|---|---|---|---|---|---|
| + | 23 | 166 | 22 | 30 | 29 | 84–98 | 0 | I | |||
| 2 | + | 22 | 167 | 22 | 27 | 30 | 85–99 | 0 | I | ||
| 3 | + | 29 | 167 | 22 | 33 | 30 | 85–99 | 0 | i | ||
| 5 | + | 46 | 164 | 21 | 28 | 34 | no S–S bridge | 0 | NO | ||
| 6 | + | 28 | 160 | 20 | 29 | 32 | 87–101 | i | |||
| 7 | + | 0 | 166 | 20 | 27 | 29 | 84–98 | 0 | i | ||
| 8 | − | 47 | 154 | 20 | 39 | 27 | 81–92 | IV | |||
| 9 | + | 30 | 161 | 19 | 30 | 29 | 89–100 | 0 | NO | ||
| 11 | − | 9 | 147 | 19 | 31 | 19 | 76–87 | iii | |||
| 12 | − | 0 | 148 | 19 | 32 | 21 | 75–86 | IV | |||
| 13 | − | 1 | 155 | 18 | 31 | 25 | 82–93 | 0 | NO | ||
| 14 | + | 38 | 161 | 18 | 29 | 35 | 87–101 | 0 | NO | ||
| 15 | − | 10 | 146 | 18 | 30 | 18 | 75–86 | 0 | III | ||
| 16 | − | 44 | 137 | 18 | 27 | ND | 64–75 | IV | |||
| − | 41 | 151 | 17 | 27 | 19 | 78–89 | IV | ||||
| − | 41 | 137 | 17 | 27 | ND | 64–75 | IV | ||||
| 19 | + | 51 | 206 | 17 | 33 | 30 | 106–118 | 0 | V | ||
| 20 | − | 29 | 162 | 17 | 25 | ND | 89–100 | IV | |||
| 21 | + | 29 | 163 | 17 | 32 | 27 | no S–S bridge | 0 | II | ||
| 22 | + | 32 | 205 | 17 | 32 | 30 | 105–112 | V | |||
| + | 45 | 163 | 16 | 28 | 33 | no S–S bridge | 0 | II | |||
| 24 | + | 33 | 205 | 16 | 28 | 30 | 105–117 | 0 | V | ||
| 25 | + | 42 | 205 | 16 | 27 | 30 | 105–117 | 0 | V | ||
| 26 | + | 4 | 168 | 16 | 28 | 33 | 96–109 | 0 | NO | ||
| 28 | + | 36 | 161 | 16 | 25 | 29 | 89–100 | NO | |||
| 29 | − | 9 | 201 | 16 | 27 | 27 | only one Cys | 0 | NO | ||
| 31 | + | 17 | 205 | 15 | 28 | ND | 104–116 | 0 | V | ||
| 32 | + | 28 | 205 | 15 | 29 | 30 | 105–117 | V | |||
| 33 | + | 29 | 167 | 15 | 21 | 29 | 86–99 | I | |||
| 34 | − | 6 | 154 | 15 | 22 | 19 | 83–94 | 0 | NO | ||
| 35 | − | 28 | 162 | 15 | 25 | 35 | 89–100 | IV | |||
| 36 | + | 5 | 123 | 15 | 25 | ND | 41–52 | 0 | NO | ||
| 37 | + | 2 | 205 | 15 | 27 | 32 | 105–117 | 0 | V | ||
| 39 | − | 56 | 146 | 15 | 25 | ND | 74–85 | IV | |||
| 40 | − | 48 | 145 | 15 | 23 | ND | 72–83 | IV | |||
| 42 | + | 43 | 203 | 14 | 25 | 30 | 105–117 | 0 | V | ||
| 43 | + | 166 | 14 | 26 | 33 | no S–S bridge | 0 | II | |||
| 44 | + | 7 | 258 | 14 | 27 | ND | 158–170 | 0 | V | ||
| 45 | + | 32 | 187 | 13 | 23 | 36 | 112–124 | 0 | V | ||
| 46 | + | 41 | 122 | 13 | 18 | ND | 47–59 | 0 | V | ||
| 47 | + | 42 | 205 | 12 | 24 | 30 | 105–117 | 0 | V | ||
| 48 | + | 31 | 161 | 12 | 22 | 31 | 87–101 | 0 | V | ||
| 49 | − | 34 | 96 | 12 | 20 | ND | no S–S bridge | IV |
2021 additions to the list published by Dabravolski in 2020 are highlighted in bold. The DIRLs belonging to a predicted operon structure via AntiSmash logiciel are the 15 bacteria underlined
Fig. 3Phylogenetic tree constructed by Seaview server according to the Neighbor-Joining method, corrected by ML method. In green, branches linked to the 8 best characterized plant DIRs (PTS1 and DRR206 from Pisum sativum and FiDIR1 from fosythia intermedia are from family DIR-a1. AtDIR5 and atDIR6 from Arabidopsis thaliana are from family DIR-a2. GmDRR1 from Glycine max, GePTS1 from Glycyrrhiza echinanta and Gossipium Hirsitum GhDIR4 are from family DIR-b/d. AtDIR10 from A. thaliana is from family DIR-e); in black, branches connecting the 49 bacterial DIRLs; groups of similar DIRLs are squared and numbered in 5 groups DIRL I to IV. Branch Distance scale is indicated. Bacteria which do not belong to family I to V and not clustered are not squared
Fig. 5Schematic representation of a part the genome of Streptomyces formicae, annotated manually in order to obtain genomic information around the genes encoding a potential DIRL
Fig. 4:3D model of AtDIR6 (5LAL) and 3D model predictions of DPLs representative of family I to V. A superposition of the 6 models is presented far right