| Literature DB >> 26964738 |
L Karaki1,2,3,4, P Da Silva1,2,4, F Rizk3, C Chouabe4,5, N Chantret6,7, V Eyraud1,2,4, F Gressent1,2,4, C Sivignon1,2,4, I Rahioui1,2,4, D Kahn4,8, C Brochier-Armanet4,8, Y Rahbé9,10,11, C Royer1,2,4.
Abstract
BACKGROUND: Albumin 1b peptides (A1b) are small disulfide-knotted insecticidal peptides produced by Fabaceae (also called Leguminosae). To date, their diversity among this plant family has been essentially investigated through biochemical and PCR-based approaches. The availability of high-quality genomic resources for several fabaceae species, among which the model species Medicago truncatula (Mtr), allowed for a genomic analysis of this protein family aimed at i) deciphering the evolutionary history of A1b proteins and their links with A1b-nodulins that are short non-insecticidal disulfide-bonded peptides involved in root nodule signaling and ii) exploring the functional diversity of A1b for novel bioactive molecules.Entities:
Keywords: Cystine-knot peptides; Insect-plant interaction; Insecticidal protein; Legumes; Multigenic protein family evolution
Mesh:
Substances:
Year: 2016 PMID: 26964738 PMCID: PMC4785745 DOI: 10.1186/s12870-016-0745-0
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Fig. 1Peptide sequence features of the PA1 protein. All original Uniprot features of preproprotein PA1 (P62931, ALB1F_PEA) are displayed: Signal peptide shown in green (canonically interrupted by a short intron); mature PA1b toxin and PA1a proprotein are displayed as red arrows; processed propeptides are in yellow boxes; cysteine-pairing is represented by the yellow arrows. The β-strands are boxed in blue and the 3- -helix in red. PA1b pertains to the Albumin I (IPR012512) Interpro family, which shows no relationship to other Interpro families
Fig. 2Organization of pa1 genes on Medicago truncatula genome. Positions of genes are indicated on chromosomes (scale in Mbp). The Medicago truncatula physical scaffold map is that of the genome assembly version 3.5 (including chromosome size and assembly quality map). Genes and their relative positions (%) on the chromosomes are those of assembly 4.1; a fictitious chromosome, called “0”, harbors the unplaced genes. AC146565_12.1, AC146565_18.1, AC146565_34.1 and AJ574790.1 are from genome version 3.5 and are not present at 100 % match in assembly v4. The orphan EST “TA24778_3880” is also reported
Fig. 3Phylogeny, CXC pattern and tissue EST expression of the Medicago truncatula PA1 paralogues. Unrooted Bayesian tree of Medicago truncatula PA1 family (53 sequences, 366 nucleic acid positions). The tree is presented according to rooted phylogeny shown on Fig. 4. Numbers at branch correspond respectively to posterior probabilities calculated with MrBayes, and to bootstrap values estimated by PhyML. The scale bar represents the average number of substitutions per site. Six strongly supported clusters were boxed and highlighted with colored backgrounds. The seven chemically and functionally synthetized sequences (AG41, EG41, GL44, AS40, AS37, DS37 and QT41) are indicated on the tree. Among them, AG41, EG41, AS40, and AS37 (boxed in purple), and showed toxicity against insect cells, whereas GL44, DS37 and QT41 (boxed in light green) did not show toxicity to insect cells. In the tissue expression part, the color-scale represent the expression value scales between 0 and >75 EST per cluster. Two internal sub-clusters were defined for running site model in cluster 3 and 6, and are named respectively 3a, 3b, and 6a, 6b (Table 2). Red branches are those that were tested for positive selection (Table 2)
Fig. 4Phylogeny of PA1 homologues identified in the complete genomes of Fabaceae species. Bayesian tree of PA1 homologues identified in Cajanus cajan (Phaseoleae), Glycine max (Phaseoleae), Phaseolus vulgaris (Phaseoleae), Pisum sativum (Fabeae), and Medicago truncatula (Trifolieae) (91 sequences, 327 nucleotide positions). No homologues were detected in Lotus corniculatus (Loteae) and Trifolium pratense (Fabeae), and only one in Cicer arietinum (Fabeae), which was not included in this tree (see text and Additional file 6: Table S1). The tree was rooted with a sequence from Styphnolobium japonicum (Sophoreae), according to the current phylogeny of Papilionoideae [45, 47]. Numbers at branch correspond to posterior probabilities calculated with MrBayes and bootstrap values estimated by PhyML. The scale bar represents the average number of substitutions per site. The six Medicago truncatula sequence clusters identified previously (Fig. 3) are recovered and indicated with the same colors. Labels on the left identify sustained nodes in the phylogeny of legumes, and labels on the right identify the protein species with substantial experimental data (protein names, Uniprot identifiers & references therein, and PDB identifiers when available). Pea proteins are included as reference peptides (P. sativum genome not available yet)
Results of selection footprints analysis (PAML site, branch, and branch-site models). Clusters are defined in the general Medicago-only phylogenetic analysis described in Fig. 3. A further subdivision of cluster 3 and 6 into two internal sub-clusters (denoted a and b) was defined for site models tests. Branch tested are colour-coded in red in Fig. 3. In each table cell are reported the significance of the model comparison (p value), position and ω values of the amino-acid found to be under positive selection in the ‘site’ and ‘branch-site’ analyses after manual curation (see Additional file 4: Table S4 for global alignment positioning)
| Cluster | Site model (a) | Branch model (b) | Branch-site model (b) | ||
|---|---|---|---|---|---|
| cluster_1 |
| ns (c) | ns | ||
| pos = 83 | ω =3.02 +/− 0.78 | ||||
| pos = 179 | ω =2.97 +/− 0.83 | ||||
| cluster_2 | no (c) | - (c) | - | ||
| cluster_3 |
| ns |
| ||
| pos = 27 | ω =3.27 +/− 0.59 | pos = 74 | ω =19.02 | ||
| pos = 82 | ω =19.02 | ||||
| cluster_3a |
| ||||
| cluster_3b |
| ||||
| pos = 27 | ω =5.54 +/− 1.89 | ||||
| pos = 82 | ω =5.66 +/− 1.79 | ||||
| cluster_4 | ns | - | - | ||
| cluster_5 | no | - | - | ||
| cluster_6 |
| ns |
| ||
| pos = 43 | ω =8.42 +/− 0.95 | pos = 43 | ω =19.23 | ||
| pos = 76 | ω =8.39 +/− 1.07 | pos = 92 | ω =19.23 | ||
| pos = 120 | ω =8.42 +/− 0.95 | pos = 128 | ω =19.23 | ||
| pos = 183 | ω =19.23 | ||||
| cluster_6a |
| ||||
| pos = 43 | ω =7.074 +/− 1.62 | ||||
| cluster_6b |
| ||||
| pos = 92 | ω =10.11 +/− 1.25 | ||||
| pos = 94 | ω =10.20 +/− 0.86 | ||||
(a) Probability associated with the LRT between the model M8 and the model M8a
(b) Probability associated with the LRT between the model for which branches in red are considered as foreground branches and the null model (cf. Fig. 3 for branch partition and Method section for models details)
(c) ns not significant, no no sites validated after manual curation, - no partition tested
Affinity to the PA1b binding site and insect cell toxicity of synthetic peptides
(−) scores a negative result (no toxicity nor binding in the toxin range assayed)
PA1b the referent molecule and AG41, the Mtr A1b with the highest toxicity were respectively highlighted in grey and pink. The nomenclature for the Mtr A1b (AS37…) was arbitrarily defined as the first and last amino acid in the sequence and the total length. Cysteine architecture is highlighted in yellow. Sites under positive selection in their respective branches are reported as grey background (see PAML analysis, Table 2)
aVariant amino-acids in AG41 compared to PA1b sequence are boldfaced
bThe Ki of PA1b and the synthetic peptides was determined by ligand binding using 125I-PA1b, according to [12]
cLC50 calculated from biological assays performed on cultured Sf9 cells according to Rahioui et al. [32]
Fig. 5Electrophysiology of two A1b isoforms (PA1b-F and AG41) on insect Sf9 cells. Concentration dependent blockage of Sf9 cells ramp membrane current by AG41 and PA1b. a membrane currents recorded in response to voltage ramps of 1.5 s duration applied from −100 to 90 mV in the absence (control) and presence of increasing concentrations of AG41 (upper traces) and PA1b (lower traces). b mean concentration-response data for AG41 (○) and PA1b (●) inhibition of ramp membrane current, measured at +50 mV. Each point represents the mean ± S.E. of n = 8–9 independent experiments. Hill equations were fitted to the data, with 100 % blockage taken as the fixed maximum effect, yielding EC50 values of 14.6 ± 0.4 nM (n = 9, Hill coefficient 1.4 ± 0.1) and 415.3 ± 75.6 nM (n = 8, Hill coefficient 1.7 ± 0.2) for AG41 and PA1b, respectively
Fig. 6Structures of two A1b isoforms (PA1b-F and AG41). a Ribbon representation of PA1b (PDB code: 1P8b). b Superposition of the backbones of PA1b (green) and AG41 (blue). c, d, e, f Lipophilic potentials calculated with the MOLCAD option of SYBYL at the Connolly surfaces of (c) PA1b and (d) AG41. Figures (c and d) are the same orientation as Figures (a and b), using a common hydrophobic scale. Hydrophobic and hydrophilic areas are displayed in brown and blue, respectively. Green surfaces represent an intermediate hydrophobicity. A 180 ° rotation according with respect to a vertical axis is applied from the upper (c and d) figures to the lower (e and f) figures
Fig. 7Heat map of all micro-array data available at Mt-Gene Expression Atlas. All (20) A1 genes available in arrays were mapped against their tissue expression, and displayed as a heat-colored cells (mean tissue expression) superimposed with their individual data cloud (showing experiment availability for each tissue –relatively few for flower, petiole etc. but many for root, nodule etc.–). Original probeset data (Y-axis) were mapped to their corresponding gene in the V4 assembly; three genes are represented by more than one probeset (p1-3; the * star points to a non-100 % match between the corresponding probeset and the V4 genome −92 % nucleic match-). Full Gene/probeset-ID mapping is reported in Additional file 6: Table S1