| Literature DB >> 29983641 |
Karlheinz Mann1, Nicolas Cerveau2, Meike Gummich3, Monika Fritz3, Matthias Mann1, Daniel J Jackson2.
Abstract
BACKGROUND: The shells of various Haliotis species have served as models of invertebrate biomineralization and physical shell properties for more than 20 years. A focus of this research has been the nacreous inner layer of the shell with its conspicuous arrangement of aragonite platelets, resembling in cross-section a brick-and-mortar wall. In comparison, the outer, less stable, calcitic prismatic layer has received much less attention. One of the first molluscan shell proteins to be characterized at the molecular level was Lustrin A, a component of the nacreous organic matrix of Haliotis rufescens. This was soon followed by the C-type lectin perlucin and the growth factor-binding perlustrin, both isolated from H. laevigata nacre, and the crystal growth-modulating AP7 and AP24, isolated from H. rufescens nacre. Mass spectrometry-based proteomics was subsequently applied to to Haliotis biomineralization research with the analysis of the H. asinina shell matrix and yielded 14 different shell-associated proteins. That study was the most comprehensive for a Haliotis species to date.Entities:
Keywords: Biomineralization; Mantle transcriptome; Nacre; Prismatic layer; Proteome; Shell organic matrix
Year: 2018 PMID: 29983641 PMCID: PMC6003135 DOI: 10.1186/s12953-018-0139-3
Source DB: PubMed Journal: Proteome Sci ISSN: 1477-5956 Impact factor: 2.480
Major proteins (≥ 0.2% of total in at least two fractions) of the Haliotis laevigata shell
| Protein | Accession | Abundance (% of total)a | Predicted domainsb and other features | Refc | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NAS | NBS | NCS | NAI | NBI | NCI | PAS | PBS | PAI | PBI | ||||
| Actin(s) | Tri_131427, | 0.05 | 0.14 | 0.03 | 0.1 | 0.26 | 0.34 | 0.01 | 0.10 | 0.17 | 0.32 | ||
| Similar to tyramine beta-hydroxylase/temptin | idb_10968, | 0.26 | 0.17 | 0.22 | 0.09 | 0.06 | 0.05 | – | – | – | – | SSP, IDR (C-term); R/G | |
| Uncharacterized | Comp128817_c0_seq1_3, idb_42198 | – | 0.01 | 0.02 | – | – | – | 0.01 | – | 0.48 | 0.30 | hirudin_antistatin, IDR; P | |
| Uncharacterized | Comp49273_c0_seq1_2, idb_46434 | 0.01 | 0.02 | 0.11 | – | 0.01 | 0.05 | 0.45 | 0.09 |
| 0.49 | SSP, IDR; N/Q/S | |
|
|
| 0.29 | 0.22 | 0.01 |
|
|
|
|
| 0.03 | 0.02 | SSP, reelin, DOMON, IDR | |
|
|
| 0.02 | 0.03 | 0.01 | 0.01 | 0.05 | 0.13 | 0.04 | – |
|
| SSP | |
|
|
| 0.31 | 0.40 |
| – | – | 0.30 |
| 0.53 |
|
| SSP, Growth_fac _rcpt/IGFBP; IDR | |
|
|
| 0.41 | 0.54 | 0.45 |
|
|
|
|
| 0.72 | 0.60 | SSP, Kunitz_ BPTI; R/C/G/L | [ |
| Similar to aragonite protein AP24 | CLC_1642, | 0.23 | 0.17 | 0.19 | 0.41 | 0.25 | 0.34 | – | – | 0.09 | 0.10 | TM; IDR | [ |
|
|
| 0.03 | 0.06 | 0.03 | – | 0.05 | 0.06 | 0.01 | – |
|
| SSP, VWA, chitin-bd_II (2×) | |
|
|
| 0.06 | 0.11 | 0.04 |
|
|
| 0.01 | – | 0.03 | 0.02 | TM; IDR | |
|
|
|
|
|
| 0.21 | 0.59 | 0.75 |
|
| 0.40 | 0.09 | SSP,TM; IDR, G/M/P; repeats (Additional file | |
| Similar to tyrosinase | CLC_123, idb_32947 | 0.05 | 0.02 | 0.02 | 0.81 | 0.79 | 0.68 | 0.03 | – | 0.14 | 0.13 | SSP; tyrosinase_ Cu-bd, IDR; G; repeats (Additional file | |
| Lustrin A (in several fragments) | CLC_1320 etc | 0.15 | 0.41 | 0.26 | 0.61 |
| 0.77 | 0.09 | 0.01 | 0.04 | 0.01 | SSP, Cys_repeats, IDR; C/P; repeats (PPA)7 | [ |
| Similar to ependymin-related protein 1 | CLC_160 | – | – | – | – | – | – | – | – | 0.73 |
| SSP, ependymin | [ |
| Similar to ependymin-related protein 1 | CLC_1876 | – | – | – | – | – | – | – | – | 0.33 |
| Ependymin; L/S | [ |
| Similar to glycine-, alanine- and asparagine-rich protein (GAAP) | Tri_107535, CLC_21 | 0.08 | 0.07 | 0.03 | 0.78 | 0.47 |
| 0.74 | 0.26 | 0.53 | 0.51 | IDR; A/G/S; repeats (Additional file | [ |
| Similar to glutamine-rich protein (QRP) | CLC_253 | – | – | – | 0.80 | 0.46 | 0.28 | – | – | – | – | IDR; Q; repeats (Additional file | [ |
| Uncharacterized/hasina | CLC_303 | 0.09 | 0.07 | 0.04 | 0.64 | 0.62 |
| 0.01 | – | 0.09 | 0.08 | Chitin-bd_II (3×), ConA-like; IDR, repeats (Additional file | [ |
|
|
|
|
|
|
|
|
| 0.13 | 0.06 | 0.66 | 0.75 | SSP, IDR; A/L/P; repeats: aa26–52 (GPPPGA[A,V]LR)3 | [ |
|
|
|
|
|
|
|
|
|
|
|
|
| SSP; IDR, N/D/G; repeats (Additional file | [ |
|
|
| 0.85 | 0.87 |
|
|
|
| 0.16 | 0.23 | 0.04 | 0.02 | SSP; IDP, Q/G/P; repeats (Additional file | |
|
|
|
|
|
| 0.51 | 0.69 | 0.66 | 0.06 | 0.02 | 0.41 | 0.38 | IDR; G/P/S; repeats (Additional file | |
| Uncharacterized | idb_16318 | 0.45 | 0.09 | 0.28 | 0.17 | 0.04 | 0.08 | – | – | – | – | P/V | |
|
|
| 0.03 | 0.07 | 0.12 | – | – | 0.03 |
| 0.67 | 0.84 |
| IDP; A/Q/S/T; repeats (Additional file | |
|
|
| 0.57 | 0.61 | 0.12 | 0.86 |
|
| 0.05 |
| 0.63 | 0.34 | SSP, | [ |
|
|
| 0.01 | 0.01 | – | – |
|
| 0.02 |
|
|
| SSP, ependymin, | [ |
|
|
| 0.16 | 0.17 | 0.45 | 0.04 | 0.05 | 0.15 | 0.63 | 0.21 |
|
| SSP; IDP; Q/G/P; repeats (Additional file | |
| Similar to shell protein 4/aplysianin-A | idb_20988 | 0.01 | 0.01 | 0.01 | 0.22 | 0.32 | 0.42 | – | – | 0.81 | 0.71 | amine_oxidase | [ |
| Similar to ependymin-related protein 1 | idb_22001 | 0.10 | 0.06 | 0.10 | 0.02 | 0.01 | 0.16 | – | – | 0.48 |
| Ependymin; T | [ |
|
|
| 0.06 | – | 0.04 |
|
|
| 0.01 | 0.01 | 0.09 | 0.06 | pI 3.3, IDP, D; repeats (Additional file | |
|
|
| 0.01 | 0.03 | 0.14 | – | 0.03 | 0.06 |
|
| 0.48 |
| pI 3.9; IDP, A/S/T; repeats (Additional file | |
|
|
| 0.05 | 0.06 | 0.05 | – | – | – | 0.20 | 0.03 |
|
| SSP, ependymin, V | [ |
| Uncharacterized | idb_25730 | 0.05 | 0.10 | 0.02 | 0.21 | 0.41 | 0.13 | – | – | – | – | VWA, TSP1, chitin-bd_II (2×), ConA_like; G/T; repeats (Additional file | |
|
|
| 0.11 | 0.19 | 0.14 | 0.84 |
|
| 0.02 | 0.02 |
|
| SSP, peroxidase_ 3; IDR | |
| Uncharacterized/similar to zinc transporter | idb_26030 | 0.35 | 0.25 | 0.05 |
| 0.60 | 0.32 | – | – | – | – | SSP, TM, zinc/iron_per-mease, IDR | |
| Uncharacterized | idb_26568, idb_26567 | 0.01 | 0.03 | 0.07 | – | – | 0.01 | 0.44 | 0.07 | 0.39 | 0.25 | SSP, IDP, N/Q/P/S; repeats (Additional file | |
| Uncharacterized | idb_26836 | – | – | 0.10 | – | – | – | 0.94 | 0.34 | 0.23 | 0.33 | IDP; S/T; repeats (Additional file | |
|
|
| 0.05 | 0.11 | 0.39 | 0.01 | 0.01 | 0.07 |
|
|
|
| SSP, IDP; A/Q/S/T; repeats (Additional file | |
|
|
| – | – | – | – | – | – |
|
| 0.57 | 0.26 | pI 4.1, IDP, A/Q/S/T; repeats (Additional file | |
|
|
| 0.02 | 0.10 | 0.11 | – | – | – |
|
| 0.80 | 0.84 | pI 4.5, IDP,A/G/S/T; repeats (Additional file | |
|
|
| 0.02 | 0.08 | 0.14 | – | – | 0.06 |
|
|
|
| SSP; IDP; Q/P/S; repeats (Additional file | |
| Uncharacterized protein 2 (UP2) | idb_34528 | 0.74 | 0.82 |
| 0.09 | 0.21 | 0.40 | – | – | 0.07 | 0.4 | SSP/TM; IDP, A/L/P; repeats | [ |
| Uncharacterized | idb_3591 | 0.59 | 0.62 | 0.90 | 0.23 | 0.32 | 0.35 | 0.01 | 0.01 | 0.03 | 0.01 | SSP; IDP, G/S | |
|
|
| 0.01 | 0.04 | 0.17 | – | 0.01 | 0.05 |
|
| 0.99 | 0.74 | IDP, G/S/T; repeats (Additional file | |
| Similar to ependymin-1/2 | idb_40080 | – | – | – | – | – | – | – | – | 0.56 | 0.59 | Ependymin; T/V | [ |
| Uncharacterized/similar to basic proline-rich protein/methionine-rich protein (MRP; aa162–270) | idb_4071 | 0.60 | 0.44 | 0.50 | 0.09 | 0.17 | 0.18 | 0.01 | – | 0.09 | 0.13 | pI 5.1; IDP; Q/P; repeats (Additional file | |
| Uncharacterized | idb_43368 | 0.01 | 0.02 | – | 0.40 | 0.44 |
| 0.51 | 0.23 | 0.05 | 0.05 | pI 3.3, IDP, A/Q/G/S; repeats (Additional file | |
|
|
| 0.01 | 0.03 | 0.12 | 0.01 | 0.03 | 0.09 |
|
| 0.99 |
| IDP, G/S/T; repeats | |
|
|
| 0.71 | 0.86 |
| 0.71 | 0.86 |
| 0.24 | 0.12 | 0.24 | 0.18 | SSP; IDP, 46 AQ-repeats in C-term, A/Q/G; pI 4.8 (Additional file | |
|
|
| – | – | – | – | – | – |
|
| 0.77 | 0.49 | pI 4.0, IDP, repeats; A/Q/S/T (Additional file | |
| Uncharacterized | idb_5218 | 0.98 |
| 0.97 | 0.33 | 0.46 | 0.32 | – | – | 0.02 | 0.01 | ||
|
|
| 0.01 | 0.02 | – | – | 0.02 | – | 0.05 | 0.02 |
|
| Ependymin; T | [ |
| Uncharacterized | idb_66139 | 0.14 | 0.03 | – |
| 0.82 | 0.34 | – | – | 0.02 | 0.06 | Repeats: [HQVXL]2 in aa56–65 | |
|
|
|
|
|
|
|
|
| 0.09 | 0.07 | 0.48 | 0.45 | SSP; IDP, P/S; repeats (Additional file | |
|
|
|
|
|
|
|
|
|
| 0.52 | 0.22 | 0.19 | TM; A/L | [ |
| Uncharacterized | Tri_127820 | 0.45 | 0.12 | 0.16 |
| 0.82 | 0.87 |
| 0.32 | – | – | SSP; IDR, G/L/P | |
| Similar to carbonic anhydrase | Tri_130845, idb_813 | – | – | – | 0.02 | 0.02 | 0.01 | – | – | 0.40 | 0.50 | SSP, αCA_2; Q/G | |
|
|
| 0.13 | 0.68 | 0.92 | 0.03 | 0.13 | 0.33 |
|
|
|
| SSP, IDR; A/Q/L | [ |
|
|
|
|
|
| 0.83 |
|
| 0.01 |
| 0.25 | 0.18 | SSP/TM; IDR, A/Q; aa176–203 similar to [G-MGA] 7, aa96–123 [QQQA]7 | |
|
|
|
|
| 0.24 |
|
| 0.87 |
| 0.02 | 0.04 | 0.20 | SSP | [ |
|
|
| 0.07 | 0.07 | 0.04 |
|
|
| 0.02 | 0.01 |
|
| SSP | [ |
|
|
| 0.07 | 0.05 | 0.05 | 0.70 |
|
|
|
| 0.04 | 0.02 | RmlC-like_jelly_ roll_fold, IDR, A/S/T; repeats (Additional file | |
|
|
| 0.50 |
|
| 0.04 | 0.11 | 0.42 | 0.41 | 0.10 |
|
| SSP, ependymin, | [ |
|
|
| 0.03 | 0.01 |
| – | – | – | – | – |
|
| SSP, ependymin | [ |
| Uncharacterized | Tri_35519 | 0.18 | 0.15 | 0.09 | 0.13 | 0.14 | 0.17 | – | – | 0.61 | 0.48 | ConA_like, TM | |
| Uncharacterized | Tri_45070 | 0.34 | 0.40 | 0.58 | 0.08 | 0.12 | 0.17 | 0.01 | 0.01 | 0.01 | 0.01 | IDP; P/S/T | |
|
|
| 0.05 | 0.14 | 0.13 |
|
|
|
| 0.83 |
|
| pI 3.5, IDP; A/D/G; repeats (Additional file | [ |
| Uncharacterized/similar to ferric-chelate reductase 1 | Tri_61496 | 0.12 | 0.12 | 0.09 | 0.46 | 0.29 | 0.63 | – | – | 0.17 | 0.12 | Reeler, TM, IDR; S | |
| Uncharacterized/ | Tri_63049 | 0.43 | 0.58 | 0.24 | 0.35 | 0.34 | 0.55 | – | – | 0.02 | 0.02 | Reeler, IDR; T | [ |
| Uncharacterized | Tri_64952 | 0.20 | 0.01 | 0.22 |
| 0.61 | 0.68 | – | – | 0.01 | – | IDR; R/G/S | |
| Carbonic anhydrase | Tri_72839 | 0.01 | – | 0.01 | 0.40 | 0.21 | 0.63 | – | – | – | 0.01 | SSP, carbonic_ anhydrase_a; IDR | |
|
|
|
|
|
| 0.77 |
|
| 0.02 | – | 0.06 | 0.04 | TM; IDP, A/G/P | |
| Uncharacterized | Tri_81308 | 0.26 | 0.32 | 0.17 | 0.13 | 0.16 | 0.12 | – | – | 0.02 | 0.02 | ||
| Perlustrin | PLS_HALLA | 0.14 | 0.49 | – | – |
| – | – | – | – | – | IGFBP_N; C | [ |
|
|
|
|
| 0.05 |
|
|
| 0.03 | 0.04 |
| 0.14 | CLECT | [ |
|
|
|
|
|
|
|
|
| 0.02 |
| 0.80 | 0.23 | WAP; C/G/P | [ |
For more detailed annotations see Additional file 4: Table S2 and Additional file 5: Table S3. SSP predicted signal sequence peptide, TM predicted transmembrane segment, IDR predicted intrinsically disordered sequence regions, IDP predicted intrinsically disordered protein (predicted disorder < 90%), N nacre, P prismatic layer, S acid-soluble, I acid-insoluble: A, B, C, shell cleaning protocols as detailed in Methods. Amino acids constituting > 10% of the overall amino acid composition are indicated by their standard one-letter abbreviation
acalculated from MaxQuant iBAQ intensities; the values are rounded to the second decimal
bdomain abbreviations are those of InterProScan (http://www.ebi.ac.uk/interpro/)
csimilar protein previously identified in Haliotis shell proteome. A complete list of accepted identifications is contained in tables S2 and S3 (Additional files 4 and 5). The quantitatively most important major proteins (abundance > 1.0 in at least two fractions) and abundance percentages > 1.0 are in bold. Figure S2 is contained in Additional file 27
Fig. 1SDS-PAGE separation of nacre and prismatic layer organic matrix proteins. S, acid-soluble; I, acid-insoluble. 200 μg of matrix were applied to each lane. Nacre acid-soluble matrix and prismatic layer acid-insoluble matrix were cut into sections for in-gel digestion as indicated. At the left the masses of marker proteins are shown in kDa (Novex Sharp pre-stained, Invitrogen)
Fig. 2Distribution of Haliotis laevigata shell protein between different fractions. Venn diagrams showing the distribution of accepted identifications between the different fractions obtained by extraction of different shell layers with different shell washing protocols. I, acetic acid-insoluble fraction; S, acetic acid-soluble fraction. a, b and c, shell washing protocols applied before matrix extraction as described in the Material and Methods section. Correctly proportioned two and three circle Venn diagrams were drawn using Venn Diagram Plotter (https://omics.pnl.gov/software/venn-diagram-plotter)
Fig. 3Perlustrin alignment and spectra. a Alignment of a nacre protein 70% identical to mature H. laevigata perlustrin isolated from nacre matrix and sequenced on the protein level using automated Edman chemistry [16]. A predicted signal sequence peptide is in red. Sequence regions confirmed by MS/MS-derived peptide sequences are in green. b MS/MS spectrum of a selected sequence-unique peptide of comp70759_c0_seq1_2. This peptide of a mass of 1714.8131 Da was identified with a Posterior Error Probability (PEP) of 5.2e-19 and a mass error of 0.3 ppm. c MS/MS spectrum of a selected sequence-unique peptide of P82595. This peptide showing one miss-cleavage was identified with a PEP of 0.019 and a mass error of 0.3 ppm. Y-ions are shown in red, b-ions are in blue, and fragments with neutral loss are in orange. A few fragment non-standard but advanced annotations with the help of the MaxQuant Expert system [44] are shown in black. For the sake of clarity most advanced annotations are not shown. The mass spectrometer model used, Velos or Elite, is contained in the raw-file name on top of the y-axis of the spectra
Fig. 4AP7 alignment and spectra, a Alignment of H. laevigata Tri_24151 to H. rufescens AP7 (Q9BP37_HALRU; [17]). Predicted signal sequence peptides are in red. Sequence regions confirmed by MS/MS-derived peptide sequences are in green. Cysteines proposed to be part of the metal binding site [98] are underlined. The N-terminal mineral-interacting domain [97] is shown in italics. b MS/MS spectrum of a selected sequence-unique peptide most probably representing the N-terminus of this protein and confirming the secretion signal peptide prediction. This doubly charged peptide was identified with a mass error of 0.5 ppm and a Posterior Error Probability (PEP) of 1.7e-42. Y-ions are shown in red, b-ions are in blue, and fragments with neutral loss are in orange. Ion a3 was identified using the advanced annotation option of the MaxQuant viewer (Expert system [44]). c MS/MS spectrum of a selected sequence-unique peptide from the insert sequence region not present in H. rufescens AP7. The doubly charged peptide was identified with a mass error of 0.01 ppm and a PEP of 2.7e-36. The mass spectrometer model used, Velos or Elite, is contained in the raw-file name on top of the y-axis of the spectra
Fig. 5Sequence alignment of KCP_HALAI to related major H. laevigata sequences. Predicted signal sequence peptides are underlined. Sequence regions confirmed by identified peptides are shown in green
Low-abundance proteins predicted to be related to chitin binding and modification
| Protein | Accession no. | Predicted domains | Shell layer |
|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| Uncharacterized | CLC_18633 | Chitin-bd_N; TM | N, P |
| Similar to chitinase-3 | CLC_2296 | SSP; chitinase_II, chitin-bd_II (2×) | N, P |
| Uncharacterized | CLC_2347, idb_28940 | ARM_like, chitin-bd_II (2×); ConA_like | N |
| Uncharacterized/IgGFc-binding protein | CLC_3878, idb_2768, idb_2772, Tri_120377, Tri_120379 | SSP; chitin-bd_II (4×), Sushi, galectin_CRP, FA58C_3 | N, P |
| Similar to shell matrix protein (PSM_MYTCA | idb_13357 (aa561–780), idb_13358 | chitin-binding_II (2×); IDP | N, P |
| Similar to IgGFc-binding protein | idb_1745 | SSP; chitin-bd_II (23×) | N, P |
| Uncharacterized | idb_2023, CLC_2607, idb_2021 | IG, chitin-bd_II | N, P |
| Similar to chitinase-3 | idb_32310 | SSP; chitinase_II, chitin-bd_II (2×) | N, P |
| Uncharacterized | idb_44571 | chitin-bd_II (4×); TM | N, P |
| Similar to endochitinase | idb_53451 | glyco_hydro_18, chitin-bd_II | N |
| Similar to chitin deacetylase | idb_6290 | SSP; glyco_hydro/deAcase_b/a-brl/NodB (2×) | N, P |
| Uncharacterized | idb_982 | SSP | N; P |
| Uncharacterized | Tri_109450 | SSP; chitin-bd_II (2×) | N, P |
| Uncharacterized | Tri_7902 | chitin-bd_II (3×) | N, P |
| Uncharacterized | idb_54309, Comp22563_c0_seq1_3, idb_57746 | SSP, chitin-bd_II (3×) | P |
| Uncharacterized | Comp99505_c0_seq1_5 | TM; chitinase_II | P |
| Uncharacterized | CLC_413 | chitinase_II | P |
| Uncharacterized | idb_32090 | TM; chitin-bd_II (3×) | P |
| Uncharacterized | idb_5844 | TM; SEA, chitin-bd_II (3×), Ig-like_fold | P |
| Uncharacterized | Tri_50040 | SSP; ConA-like, chitin-bd | P |
| Uncharacterized | Tri_95672 | SSP; ConA-like,, chitin-bd_II (3×) | P |
For more detailed annotations see additional Additional file 4: Table S2 and Additional file 5: Table S3. SSP predicted signal sequence peptide, TM predicted transmembrane segment, IDP predicted intrinsically disordered protein (predicted disorder > 90%), N nacre, P prismatic layer. Domain abbreviations are those of InterProScan (http://www.ebi.ac.uk/interpro/). The two first entries were close to the threshold for major proteins (bold print)
Fig. 6BLASTp comparisons of the Haliotis laevigata shell proteome against 799 biocalcifying proteins derived from 6 bivalves, 3 gastropods, 1 brachiopod, 1 sea urchin and 1 coral. Individual lines spanning the ideogram connect proteins that share significant similarity (e values <10e− 6). Transparent red lines connect proteins with the lowest quartile of similarity (with a threshold of 10e− 6), orange lines with the next highest quartile of similarity, blue lines with the next highest quartile of similarity and green lines with the highest quartile of similarity. The percentage of each biomineralizing proteome that shared similarity with the H. laevigata proteome is indicated. The table provides further information for those candidates that share sequence similarity. The tree is a consensus that was manually constructed based on previous phylogenetic studies (see Material and methods section)