| Literature DB >> 23341956 |
Zhiyi Bai1, Hanfeng Zheng, Jingyun Lin, Guiling Wang, Jiale Li.
Abstract
The triangle sail mussel Hyriopsis cumingii (Lea) is the most important mussel species used for commercial freshwater pearl production in China. Mussel color is an important indicator of pearl quality. To identify genes involved in the nacre coloring, we conducted RNA-seq and obtained 541,268 sequences (298 bp average size) and 440,034 sequences (293 bp average size) in secreting purple and white nacre libraries (P- and W-libraries), respectively. The 981,302 Expressed Sequence Tags (ESTs) were assembled into 47,812 contigs and 289,386 singletons. In BLASTP searches of the deduced protein, 22,495 were proteins with functional annotations. Thirty-three genes involved in pearl or shell formation were identified. Digital expression analysis identified a total of 358 differentially expressed genes, and 137 genes in the P-library and 221 genes in the W-library showed significantly higher expression. Furthermore, a set of SSR motifs and SNPs between the two samples was identified from the ESTs, which provided the markers for genetic linkage, QTL analysis and future breeding. These EST sequences provided valuable information to further understand the molecular mechanisms involved in the formation, color determination and evolution of the pearl or shell.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23341956 PMCID: PMC3544910 DOI: 10.1371/journal.pone.0053617
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Grafting diagram of host mussels and donor mussels.
Summary of sequencing results.
| Feature | Number |
| Read number | 981,302 |
| -P-library | 541,268 |
| -W-library | 440,034 |
| Number of assembled reads | 691,916 |
| - P-library | 340,491 |
| - W-library | 351425 |
| Average read length (bp) | 296 |
| - P-library | 298 |
| - W-library | 293 |
| Number of contigs | 47,812 |
| Average length of contigs(bp) | 634 |
| -Max. | 10,137 |
| -Min. | 42 |
| Average reads of contigs(bp) | 14.5 |
| -Max. | 16,495 |
| -Min. | 2 |
Figure 2Gene Ontology classification of deduced protein sequences.
Figure 3COG classification of deduced protein sequences.
Genes involved in the biomineralization process.
| Putative function | Gene name (Query) | Accession No. | E-value | Species |
|
| Chitin synthase | Contig12395_52 | 0.0 |
|
| Chitin binding peritrophin-A | Contig10542_8 | 2e-06 |
| |
| Chitin deacetylase 5 | Contig25082_26 | 7e-60 |
| |
| Chitin deacetylase 1 precursor | Contig28908_5 | 1e-18 |
| |
| Chitinase | Contig19449_4 | 3e-63 |
| |
| Chit3 protein | p-G9PKTWR01BW1TH_9 | 2e-39 |
| |
|
| Pif 177 | Contig14390_10 | 3e-18 |
|
| Nacrein B3 | Contig43912 | 5e-09 |
| |
| Nacrein B4 | Contig23367 | 6e-10 |
| |
| Tyrosinase | Contig46530_5 | 5e-13 |
| |
| Dermatopontin 2 | Contig40684_4 | 5e-45 |
| |
| Perlucin 6 | p-G9NRHGJ02H3KTQ_2 | 3e-13 |
| |
| PfN23 | Contig24025_3 | 3e-21 |
| |
|
| Calreticulin | Contig20109_3 | 0 |
|
| Calmodulin | Contig41460_17 | 3e-15 |
| |
| Calcium-ATPase | w-G90OYPM02IM8L9_3 | 4e-10 |
| |
| Calcitonin receptor | Contig255_4 | 7e-30 |
| |
| Voltage-dependent L-type calcium channel alpha-1 subunit | w-G9CKNNJ01BTOZ8_3 | 1e-57 |
| |
| Calcium homeostasis endoplasmic reticulum protein | Contig15522_24 | 1e-78 |
| |
| Histidine rich calcium binding protein | Contig24603_3 | 9e-18 |
| |
| Calcium-binding atopy-related autoantigen 1 | Contig28072_6 | 7e-42 |
| |
| Calcium and integrin-binding protein 1 | p-G9PKTWR01B78VK_1 | 8e-17 |
| |
| Calcium binding protein | Contig44288_9 | 2e-06 |
| |
| Multicopper oxidase | contig4154_8 | 2e-152 |
| |
| Mg2+ and Co2+ transporter | Contig3824_6 | 1e-24 |
| |
| Ferritin | p-G9NRHGJ02F62Y3_6 | 0 |
| |
| Cadmium metallothionein | Contig3396_4 | 9e-28 |
| |
| Matrix metalloproteinase | Contig5483_17 | 6e-43 |
| |
| Astacin-like protein | p-G9NRHGJ02F543V_9 | 2e-05 |
| |
|
| Dentin matrix protein | p-G9PKTWR01A8WP0_9 | 3e-27 |
|
| Fras1 related extracellular matrix protein | Contig20564_10 | 1e-33 |
| |
| Perlecan | Contig21930_8 | 3e-90 |
| |
| Laminin | p-G9NRHGJ02IXPYS_3 | 1e-59 |
| |
| Carbonic anhydrase | Contig25806_12 | 6e-29 |
|
Figure 4Schematic of deduced proteins contained tandem-arranged repeat units (1–5) enriched Asp residues, (6–9) enriched Lys residues, (10–18) enriched Gly residues.
Each motif and copy number was shown on the right from up to down. Bars = 50 amino acids.