| Literature DB >> 21040589 |
Caroline Joubert1, David Piquemal, Benjamin Marie, Laurent Manchon, Fabien Pierrat, Isabelle Zanella-Cléon, Nathalie Cochennec-Laureau, Yannick Gueguen, Caroline Montagnani.
Abstract
BACKGROUND: The shell of the pearl-producing bivalve Pinctada margaritifera is composed of an organic cell-free matrix that plays a key role in the dynamic process of biologically-controlled biomineralization. In order to increase genomic resources and identify shell matrix proteins implicated in biomineralization in P. margaritifera, high-throughput Expressed Sequence Tag (EST) pyrosequencing was undertaken on the calcifying mantle, combined with a proteomic analysis of the shell.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21040589 PMCID: PMC3091754 DOI: 10.1186/1471-2164-11-613
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary statistics for pyrosequencing and annotation of P. margaritifera mantle ESTs
| Feature | Number | Percentage |
|---|---|---|
| Total number of ESTs sequenced | 276 738 | - |
| Average lenght of ESTs (bp) | 234 | - |
| Number of assembled EST | 219 205 | 79.2% |
| Number of contigs | 19 257 | - |
| Number of singletons | 57 533 | 20.8% |
| Number of unique sequences | - | |
| Ratio of singletons per unique sequences | - | 74.9% |
| Number of contigs containing 2 ESTs | 8 717 | 45.3% |
| Number of contigs containing 3 ESTs | 3 419 | 17.8% |
| Number of contigs containing 4 ESTs | 1 779 | 9.2% |
| Number of contigs containing 5 ESTs | 1 119 | 5.8% |
| Number of contigs containing > 6 ESTs | 4 223 | 21.9% |
| Number of annotated unique sequences: | ||
| - Known protein | 13 064 | 44.3% |
| - Uncharacterized | 6 010 | 20.4% |
| - Predicted | 4 795 | 16.3% |
| - Unknown, Unnamed | 2 880 | 9.8% |
| - Hypothetical protein | 2 730 | 9.3% |
| Number of annotated contigs | 10 007 | 52.2% |
| Number of annotated singletons | 19 472 | 33.8% |
Figure 1Gene Ontology annotation of annotated unique sequences (singletons and contigs) from . Distribution (%) of gene ontology terms among the annotated unique sequences. GO ontologies are represented as general function categories. Among the 10 004 unique sequences with GO assignation, 5976 (59.7%) are involved in biological processes (A), 6855 (68.5%) have molecular functions (B) and 5737 (57.3%) are cellular components (C). Specialized terms of the binding category repartition (D) shows 17% of sequences implicated in ion binding.
Figure 2Pif-177 and . (A). Schematic representation of Pif-177 and Pmarg-Pif protein structures. Pif-97 consists of 525 amino acids (from a.a. 23 to 547) and Pmarg-Pif-97 consists of 536 amino acids (from a.a. 23 to 558). Pif-80 consists of 460 amino acids (from a.a. 548 to 1007) and Pmarg-Pif-80 consists of 456 amino acids (from a.a. 559 to 1014). Purple box is the signal peptide. Blue VWA box is von-Willebrand type A domain. Green CHIT box is a chitin-binding domain. Yellow box is aragonite-binding protein. The four-amino-acid motifs (DD-R/K-R/K) are indicated by asterisks, and arrows indicate motifs at conserved positions between the two sequences. Red box, in the aragonite-binding protein, indicates the cluster of acidic amino acid residues. (LVKEIERRKSDDK-K/I-S-F/L-DD) consensus motif repeated three times in Pmarg-Pif is boxed under aragonite-binding protein. (B) Alignments of the Pif-177 and Pmarg-Pif amino acid sequences of the von Willebrand type A domain and chitin-binding domain. Residues shaded in black are identical residues, and grey positions indicate conserved residues. Sequences are preceded and followed by numbers indicating the domain position in the entire sequence. Percentage of identity are indicated.
Figure 3Localization of pmarg-. Paraffin-embedded sections of oyster tissues were hybridized with antisense or sense single stranded cDNA probes labeled with digoxigenin and revealed using alkaline phosphatase-conjugated antibodies. Positive cells are stained in dark blue, sense probes showed no hybridization (data not shown). Stained cells enlargements are shown in A and B insets where scale bars are indicated. The expression partition limit is symbolized by a *. if: inner fold; mf: middle fold; of: outer fold; pg: periostracal groove; oe: outer epithelium; ie: inner epithelium; oec: outer epithelial cell.
Protein identification in the shell matrix of P. margaritifera by a proteomic approach.
| Protein | Matching peptides | Mascot score | Homolog protein | Accession no. | Identity | E-Value | Species |
|---|---|---|---|---|---|---|---|
| 26 | 1402 | Pif177* | C7G0B5 | 72% | 0.0 | ||
| 16 | 759 | N66* | Q9NL38 | 81% 67% | 0.0 1 e-123 | ||
| 9 | 519 | PFMG10 | Q45TK0 | 56% | 4 e-52 | ||
| 6 | 278 | Linkine | B5KFE5 | 100% | 4 e-62 | ||
| 6 | 183 | Perline* | Q14WA6 | 96% | 2 e-85 | ||
| 4 | 340 | Shematrin-8 | B5KFD0 | 96% 68% | 1 e-142 | ||
| 4 | 285 | Pfty-1* | A1IHF0 | 59% | 1 e-146 | ||
| 4 | 248 | Shematrin-5 | Q1MW92 | 54% | 1 e-68 | ||
| 3 | 181 | MSI60* | O02402 | 78% | 1 e-120 | ||
| 3 | 126 | MPN88 | B7X6S0 | 47% | 2 e-87 | ||
| 3 | 121 | KRMP-11 | A7X103 | 88% | 1 e-30 7 | ||
| 2 | 189 | Shematrin-9 | B5KFD1 | 98% | 4 e-99 | ||
| 2 | 149 | Prismalin-14* | Q6F4C6 | 68% | 5 e-40 | ||
The trypsin-digest peptides were separated on nano-LC, prior to nanoESI-qQ-TOF analysis. The MS/MS spectra were used for searching against the pyrosequencing based EST mantle library with Mascot software. We only considered proteins that presented at least 2 matching peptides. Mascot protein scores are indicated together with the number of unique peptides that matched to the sequence. The homologies were determined by BlastP interrogations against UniProtKB/Swiss-Prot protein database (January 2010) using the UniProt on-line tool http://www.uniprot.org, setting parameters to default. We only show here the best matches for the contigs which protein sequences are the homologues of already known mollusk shell proteins. We notice that all matches are from Pinctada genus origin.
* indicates proteins which the occurrence in shell matrix was previously demonstrated by direct biochemical characterization.