| Literature DB >> 32545314 |
Ria T Villafana1, Sephra N Rampersad1.
Abstract
The variability and phylogeny among TRI5, TRI8 and TRI11 nucleotide and translated protein sequences of isolates from Trinidad belonging to Fusarium incarnatum-equiseti species complex (FIESC) were compared with FIESC reference sequences. Taxa appeared to be more divergent when DNA sequences were analyzed compared to protein sequences. Neutral and non-neutral mutations in TRI protein sequences that may correspond to variability in the function and structure of the selected TRI proteins were identified. TRI5p had the lowest amino acid diversity with zero predicted non-neutral mutations. TRI5p had potentially three protein disorder regions compared to TRI8p with five protein disorder regions. The deduced TRI11p was more conserved than TRI8p of the same strains. Amino acid substitutions that may be non-neutral to protein function were only detected in diacetoxyscirpenol (DAS) and fusarenon-X (FUS-X) producers of the reference sequence subset for TRI8p and TRI11p. The deduced TRI5 and TRI8 amino acid sequences were mapped to known 3D-structure models and indicated that variations in specific protein order/disorder regions exist in these sequences which affect the overall structural conservation of TRI proteins. Assigning single or combination non-neutral mutations to a particular toxicogenic phenotype may be more representative of potential compared to using genotypic data alone, especially in the absence of wet-lab, experimental validation.Entities:
Keywords: Fusarium; non-neutral mutations; sequence diversity; trichothecene
Year: 2020 PMID: 32545314 PMCID: PMC7354511 DOI: 10.3390/toxins12060386
Source DB: PubMed Journal: Toxins (Basel) ISSN: 2072-6651 Impact factor: 4.546
DNA polymorphism statistics for aligned TRI5, TRI8 and TRI11 nucleotide sequences of Fusarium incarnatum-equiseti (FIESC) isolates from Trinidad and other reference FIESC sequences.
| DNA Alignment Statistic | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Trinidad | Reference Sequences | Total | Trinidad | Reference Sequences | Total | Trinidad | Reference Sequences | Total | |
|
| 32 | 8 | 40 | 32 | 10 | 42 | 32 | 12 | 44 |
| No. of sites analyzed | 394 | 394 | 394 | 470 | 470 | 470 | 705 | 705 | 705 |
| No. of singleton variable sites | 0 | 16 | 13 | 0 | 62 | 33 | 0 | 93 | 60 |
| Total number of singleton mutations, | 0 | 18 | 14 | 0 | 65 | 36 | 0 | 118 | 86 |
| Total number of mutations, | 55 | 56 | 86 | 108 | 102 | 176 | 78 | 237 | 264 |
| Number of polymorphic sites, | 52 | 53 | 81 | 100 | 97 | 158 | 77 | 203 | 217 |
| Average no. nucleotide differences, | 17.091 | 24.214 | 20.023 | 29.734 | 28.089 | 40.11 | 26.938 | 64.061 | 49.969 |
| Nucleotide diversity, | 0.04498 | 0.06372 | 0.05269 | 0.06326 | 0.05976 | 0.08534 | 0.03881 | 0.09244 | 0.07211 |
| Number of haplotypes, | 7 | 6 | 13 | 7 | 8 | 15 | 4 | 12 | 16 |
| Haplotype (gene) diversity, H | 0.792 | 0.893 | 0.864 | 0.738 | 0.956 | 0.847 | 0.651 | 1 | 0.817 |
| Fu and Li’s | 1.72888 | 0.4508 | 0.57684 | 1.87228 | −1.1548 | 0.29559 | 1.90862 | −0.63889 | −0.82709 |
| Fu and Li’s | 1.72973 | 0.5375 | 0.41238 | 1.63219 | −1.28824 | 0.19375 | 2.08398 | −0.79709 | −0.91004 |
| Recombination | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Figure 1TRI5 gene and protein phylogenetic trees. The trees were constructed using the maximum likelihood (ML) method. All positions containing gaps and missing data were eliminated and the trees are drawn to scale. The tree with the highest log likelihood is shown. Bootstrap support after 1000 replicates of associated taxa is shown next to the branches. The 75% majority consensus trees are shown; (A): phylogenetic tree for TRI5 gene sequences was based on the Kimura 2-parameter model; (B): phylogenetic tree for TRI5 protein sequences was constructed based on the maximum likelihood method using on the JTT + G matrix-based model.
Figure 2TRI8 gene and protein phylogenetic trees constructed using the maximum likelihood method. All positions containing gaps and missing data were eliminated and the trees are drawn to scale. The tree with the highest log likelihood is shown. Bootstrap support after 1000 replicates of associated taxa is shown next to the branches; (A): phylogenetic tree for TRI8 gene sequences was based on the Kimura 2-parameter model; (B): phylogenetic tree for TRI8 protein sequences was constructed based on the maximum likelihood method using on the JTT matrix-based model.
Figure 3TRI11 gene and protein phylogenetic trees constructed using the maximum likelihood method. All positions containing gaps and missing data were eliminated and the trees are drawn to scale. The tree with the highest log likelihood is shown. Bootstrap support after 1000 replicates of associated taxa is shown next to the branches; (A): phylogenetic tree for TRI11 gene sequences was based on the Kimura 2-parameter model; (B): phylogenetic tree for TRI11 protein sequences was constructed based on the maximum likelihood method using on the JTT + G matrix-based model.
Comparison of mutation effect prediction for TRI5 protein alignment using PROVEAN and SIFT software.
| Mutation Data | |
|---|---|
| 32 | |
| No. of neutral mutations | 28 |
| No. of non-neutral mutations | 4 |
|
|
|
| A54L | LN995579 (15511); LN995580 (11348) |
| Y56E | ACZ56393 (13381),ACZ56398 (31160),ACZ56401 (FRC R-06979), AXP09554 (64414) |
| I57T | TRI5_5,TRI5_6,TRI5_7,TRI5_8,TRI5_9,TRI5_11,TRI5_13 (Trinidad strains) |
| I57Q | ACZ56393 (13381),ACZ56398 (31160),ACZ56401 (FRC R-06979), AXP09554 (64414) |
Comparison of mutation effect prediction for TRI8 protein alignment using PROVEAN and SIFT software.
| Mutation Data | |
|---|---|
| 73 | |
| No. of neutral mutations | 62 |
| No. of non-neutral mutations | 11 |
|
|
|
| Y20H | TRI8_5, TRI8_9 (Trinidad strains) |
| H41Y | TRI8_5,TRI8_6,TRI8_7,TRI8_8,TRI8_9,TRI8_11,TRI8_13 (Trinidad strains) |
| R62P | TRI8_26, TRI8_29, TRI8_30, TRI8_31, TRI8_32, TRI8_33, TRI8_34 (Trinidad strains) |
| S112W | TRI8_ 5, TRI8_9 (Trinidad strains) |
| R115H | LN995594 (11363) |
| A125T | LN995587 (15511) |
| Y133C | TRI8_15, TRI8_19 (Trinidad strains) |
| P134S | TRI8_6, TRI8_7, TRI8_8, TRI8_13 (Trinidad strains) |
| E135D | LN995588 (11348) |
| L140F | TRI8_ 1, TRI8_12, TRI8_16, TRI8_17, TRI8_18, TRI8_23, TRI8_28 (Trinidad strains) |
| N187D | LN995587 (15511), LN995588 (11348), LN995591 (10395) |
Comparison of mutation effect prediction for TRI11 protein alignment using PROVEAN and SIFT software.
| Mutation Data | |
|---|---|
| N, total no. of mutations | 28 |
| No. of neutral mutations | 22 |
| No. of non-neutral mutations | 6 |
|
|
|
| E4V | GQ915566 (FRC R-06979) |
| G47D | LN995598 (11401), LN995599 (11407) |
| P50N | LN995603 (11363) |
| P50V | TRI11_5,TRI11_6,TRI11_7,TRI11_8,TRI11_9,TRI11_11,TRI11_13 (Trinidad strains) |
| G75S | LN995596 (15511), LN995597 (11348), LN995598 (11401), LN995599 (11407), LN995600 (10395), LN995601 (11294), LN995602 (11345), LN995603 (11363) |
| P117S | LN995596 (15511) |
Figure 4Sequence display for secondary structure entities in PDB model 2PS5 (https://www.rcsb.org/pdb/explore/remediatedSequence.do?structureId=2PS5) with TRI5 protein sequence amplified mapped onto the model sequence.
Figure 5A sequence logo of the TRI5 amino acid sequence alignment generated by WebLogo v.3.
Figure 6The protein features of PDB model 2PS5 (https://www.rcsb.org/pdb/protein/P13513) showing regions of protein order and predicted disorder that correspond to the TRI5 amino acid sequence alignment.
Figure 7Sequence display for secondary structure entities in PDB model 3GUU (https://www.rcsb.org/pdb/explore/remediatedSequence.do?structureId=3GUU) with TRI8 protein sequence amplified mapped onto the model sequence.
Figure 8Sequence display for secondary structure entities in PDB model 3ZPX (https://www.rcsb.org/pdb/explore/remediatedSequence.do?structureId=3ZPX) with TRI8 protein sequence mapped onto the model sequence.
Figure 9A sequence logo of the TRI8 amino acid sequence alignment generated by WebLogo v.3.
Figure 10The protein features of PDB model 3GUU (https://www.rcsb.org/pdb/protein/W3VKA4) showing regions of protein order and predicted disorder that correspond to the TRI8 amino acid sequence alignment.