| Literature DB >> 20937131 |
Raul Bettencourt1, Miguel Pinheiro, Conceição Egas, Paula Gomes, Mafalda Afonso, Timothy Shank, Ricardo Serrão Santos.
Abstract
BACKGROUND: Bathymodiolus azoricus is a deep-sea hydrothermal vent mussel found in association with large faunal communities living in chemosynthetic environments at the bottom of the sea floor near the Azores Islands. Investigation of the exceptional physiological reactions that vent mussels have adopted in their habitat, including responses to environmental microbes, remains a difficult challenge for deep-sea biologists. In an attempt to reveal genes potentially involved in the deep-sea mussel innate immunity we carried out a high-throughput sequence analysis of freshly collected B. azoricus transcriptome using gills tissues as the primary source of immune transcripts given its strategic role in filtering the surrounding waterborne potentially infectious microorganisms. Additionally, a substantial EST data set was produced and from which a comprehensive collection of genes coding for putative proteins was organized in a dedicated database, "DeepSeaVent" the first deep-sea vent animal transcriptome database based on the 454 pyrosequencing technology.Entities:
Mesh:
Year: 2010 PMID: 20937131 PMCID: PMC3091708 DOI: 10.1186/1471-2164-11-559
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of assembly and EST data
| Number of Reads | 582,650 |
| Total Bases | 181 Mb |
| Average read length after MIRA | 312 |
| Number of contigs | 75,407 |
| Average contig length | 509 |
| Range contig length | 40-3,400 |
| Number of singletons | 3,071 |
| Number of Contigs with 2 reads | 29,206 |
| Number of Contigs with > 2 reads | 43,130 |
| Contigs with BLASTx matches (E-value ≤ 10-6) | 18,407 |
| *Remaining contigs with additional matches (E-value ≤ 10-2) | 3,616 |
| Contigs determined by ESTscan | 17,402 |
| **Total number of transcripts | 39,425 |
| **Total number of putatively translated amino-acids sequences | 42,073 |
*contigs without BLASTx matches at an E-value cut-off of 10-6 were queried again with BLASTx with an E-value cut-off of 10-2
** The difference between the number of transcripts and total number of amino-acid sequences is due to the possibility of a contig having more than one annotated protein hit.
Figure 1. (A) Size distribution of 454 sequences after assembly and contig joining. (B) Distribution of number of read per contig in normalized library. The number of contigs presenting the indicated amount of reads is plotted as a histogram.
Figure 2Classification of the annotated amino-acid sequences. Amino-acid sequences were grouped into different functional sub-categories within the Cellular Component, Molecular Function and Biological process Gene Ontology (GO) organizing principles.
B. azoricus genes putatively involved in immune response and inflammatory reactions.
| Function | Gene Ontology n° | Gene Ontology description |
|---|---|---|
| Peptidoglycan Recognition protein (PGRP) | GO: 0008745 | N-acetylmuramoyl-L-alanine amidase activity |
| Chitin binding protein | GO:0008061; GO:0006030 | Chitin binding; chitin metabolic process |
| Galectin 4-like protein | GO:0005529 | Sugar binding |
| Rhamnose-binding lectin | GO:0005529 | Sugar binding |
| Thrombospondin-like glycoprotein | GO:0007155; GO:0033627 | Cell adhesion; cell adhesion mediated by integrin |
| Glycoside hydrolase, Chitinase-like | GO:0005975 | Carbohydrate metabolic process |
| Mannose-6-phosphate receptor | GO:0005537 | Mannose binding |
| Contactin associated protein 2 | GO:0007155; GO:0005515 | Cell adhesion; protein binding |
| Tissue inhibitor of metalloproteinase | GO:0008191; GO:0005578 | Metalloendopeptidase inhibitor activity; proteinaceous extracellular matrix |
| Serpin (serine protease inhibitor) | GO:0004867 | Serine-type endopeptidase inhibitor activity |
| α2-Macroblobulin (thioester-containing protein) | GO:0004866 | Endopeptidase inhibitor activity |
| Syndecan binding protein | GO:0007265; GO:0005137 | Ras protein signal transduction; interleukin-5 receptor binding |
| Fibrinogen (pattern recognition receptor) | GO:0007165; GO:0005102 | signal transduction; receptor binding |
| Ficolin (opsonin, contain fibrinogen and collagen-like domains) | GO:0007165;GO:0005102 GO:0008228 | signal transduction; receptor binding; opsonization |
| Scavenger receptor cysteine-rich protein (SRCR) | GO:0005044 | Scavenger receptor activity |
| LBP/BPI (LPS binding, Crassostrea homologue) | GO:0008289 | Lipid binding |
| Toll-interleukin receptor | GO:0045087; GO:0007165 | Innate immune response; signal transduction |
| Myd88 | GO:0004888 | Transmembrane receptor activity |
| TRAF (TNF receptor-associated factor) | GO:0007165; GO:0042981 | Signal transduction; regulation of apoptosis |
| IRAK | GO:0019221; GO:0051092 | Cytokine-mediated signaling pathway; regulation of NF-κB |
| MAPK | GO:0004672; GO:0006468 | Protein kinase activity; protein amino acid phosphorylation |
| p38 | GO:0004672; GO:0051403 | Protein kinase activity; stress-activated MAPK cascade |
| Notch homologue | GO:0007411; GO: 0007219 | Axon guidance; Notch signaling pathway |
| EGF receptor | GO:0007173; GO:0007165 | Epidermal growth factor receptor signaling Pathway; signal transduction |
| TNF receptor | GO:0007165; GO:0042981 | Signal transduction; regulation of apoptosis |
| Fibropellin homologue | GO:0005509; GO:0005515 | Calcium ion binding; protein binding |
| Laminin_EGF | GO:0005539 | Glycosaminoglycan binding |
| Cadherin (EGF domain containing) | GO:0016020; GO:0007156 | Membrane;homophilic cell adhesion |
| Integrin (fibronectin receptor) | GO:0007155; GO:0007229 | Cell adhesion; integrin mediated signaling pathway |
| Nuclear Factor κB inhibitor | IPR015681 (InterPro) | Regulation of NF-κB activity |
| STAT | GO:0004871; GO:0045449 | Signal transducer activity; regulation of transcription |
| SH2 motif(Src homology 2 | GO:0007165; GO:0018108 | Signal transduction; peptidyl-tyrosine phosphorylation |
| P53 | GO:0006915; GO:0034984 | Apoptosis; cellular response to DNA damage stimulus |
| AP-1 (Proto-oncogene c-jun) | GO:0003700; GO:0045449 | Transcription factor activity; regulation of transcription |
| Tal (Crassostrea homologue) | GO:0045449; GO:0030528 | Regulation of transcription; transcription regulator activity |
| Defensin (big defensin) | GO:0006952 | Defense response |
| Cytolysin | GO:0009404 | Toxin metabolic process |
| Apolipoprotein (plasminogen) | GO:0007596; GO:0004252 | Blood coagulation; serine-type endopeptidase activity |
| TNF (LPS-induced, α factor) | GO:0006955; GO:0006952 | Immune response; defense response |
| Interferon | GO:0042742 | Defense response to bacterium; regulation of innate immune response |
| TGF | GO:0006954; GO:0006917 | Inflammatory response; induction of apoptosis |
| Glutatione peroxidase | GO:0004602; GO:0006979 | Glutathione peroxidase activity; response to oxidative stress |
| Prostaglandin synthase/cyclooxygenase | GO:0006979; GO:0004601 | Response to oxidative stress; peroxidase activity |
| Fibronectin | GO:0001968 | Fibronectin binding |
| Metalloproteinase | GO:0004222; GO:0005578 | Metalloendopeptidase activity; proteinaceous extracellular matrix |
| Metallothionein | GO:0046872 | Metal ion binding |
| Ferritin | GO:0006879 | Cellular iron ion homeostasis |
| Tenascin | GO:0009611; GO:0007155 | Cell adhesion; response to wounding |
| Glucose-regulated protein 94 | GO:0006950 | Response to stress |
Figure 3Categorization of putative immune genes. A proposed categorization of immune genes is illustrated, according to Gene Ontology terminology, into four functional classes of innate immunity constituents from B. azoricus: immune recognition, signal transduction, transcription and effector molecules.
Figure 4Semi-quantitative Reverse Transcription-PCR (RT-PCR) of candidate genes. Normalized cDNA obtained from reverse transcription of mRNA was used as template for PCR amplifications. Aliquots were taken from PCR reactions at 20, 25 and 30 cycles and analyzed by agarose gel electrophoresis.
Figure 5Quantitative expression of putative immune and stress-related genes. The quantitative expression of putative genes from vent mussel gills tissues was assessed by qPCR. Data were transferred to Excel files and plotted as histograms of fold expression of putative genes from non-normalized cDNA library. Results are mean ± SD (N = 3). Vertical bars represent the relative expression levels of putative transcripts using the 28S as control and normalization gene.
Comparison between Mytibase and DeepSeaVent database
| DeepSeaVent and Mytibase comparison | ||||
|---|---|---|---|---|
| E-value | Bit-score | Matched proteins with InterPro annotation | ||
| 10-5 | 90 | 5,261 | ||
| 10-5 | 120 | 4,120 | ||
| 10-5 | 200 | 1,923 | ||
| Cell killing | 12 | 6 | 50.0 | |
| Immune system process | 56 | 3 | 5.4 | |
| Death | 14 | 2 | 14.3 | |
| Multicellular organismal process | 39 | 6 | 15.4 | |
| Cellular component biogenesis | 318 | 175 | 55.0 | |
| Cell wall organization or biogenesis | 9 | 1 | 11.1 | |
| Virion | 63 | 1 | 1.6 | |
| Macromolecular complex | 1495 | 798 | 53.4 | |
| Virion part | 63 | 1 | 1.6 | |
| Structural molecule activity | 942 | 533 | 56.6 | |
| Transporter activity | 570 | 101 | 17.7 | |
| Electron carrier activity | 350 | 54 | 15.4 | |
| Enzyme regulator activity | 161 | 22 | 13.7 | |
| Molecular transducer activity | 110 | 6 | 5.5 | |
The fixed E-value 10-5 was kept for three different bit-scores (90,120 and 200). The number of contigs with GO ontology assignments shared between both databases is shown together with respective shared percentage
Figure 6Flowgram representing data processing pipeline for .
Figure 7Number of contigs presenting a bacterial match after BLASTx query.
Forward and reverse primer sequences used in semi quantitative RT-PCR analyses.
| Candidate gene | GenBank acc. no | 5'-3' forward primer | 5'-3' reverse primer |
|---|---|---|---|
| Galectin | CTCCGGCGGGAGGGAATCCA | AGTGGAAGCTGGGGTTCCGAGG | |
| Carcinolectin | CGGATACAGTGGCACGGCAG | TGATACCAACGAGCACCAGCAC | |
| Aggrecan | TGCAAGCGGATACCCGGTAAA | ATCAACGCAGAGTGGCCGAG | |
| C-lectin | AGGCTTGGGATAGGCACATGGA | ACGATTCACCCGAACAGAGTTGG | |
| LBP/BPI | GCTTCACTGATACTGCTTGCCC | CCACGGTGGAGCAGCATGGA | |
| SRCR | TGATTCGATACCAAGGACCCAAAGGT | TGTCAACTCCGGCTATTCCAGGT | |
| PGRP | TCACACGGAAGGAGGAGCGT | AGGGCTGCCTTGGATGGTGT | |
| Thrombospondin | TGCTGCGACCCATTTCTGTGA | GTGAGGAGTTCCACTGGTGAGGG | |
| Rhamnose-binding lectin | ACAATGGGTTGATTTGTTTGCCGA | CCGGGGGCCTGAAAGTTGGT | |
| Lipoprotein receptor | CAGAGCCATCCACTTTGGCGG | AGGTCTACACCTTCCAGCAGCA | |
| Integrin | ACGGCCGGGGAGAAGTTGAA | CGCAGTCACACGTTCCACAGAC | |
| Serpin | AGGGTTGTGCGTGAAGTGGA | TCTCAAAGCGAGGCTGCCAGA | |
| α-macroglobulin receptor | CATTACGGCCGGGGCAAAGG | TGCTGGCTCTCTCAGCTCGG | |
| Tenascin | CATTACGGCCGGGGGTTGTA | AGTCGGAACAGTCCTTTGGGT | |
| EGF | GGGACACATTGCGAAACGGC | TTGCCCCGTAAATCCAGGCA | |
| GTPase | ATTACGGCCGGGGGACACAC | TTCGGCATCCTGGCACTTCG | |
| TNF factor | GGGATTAGGCAACACCCAAGCC | CCGCCACAGTACAGCCAACC | |
| VEGF receptor | AGCTGCATGGAGACTTGAACCAGA | AGGTGGGGTGGTACTTGCTCC | |
| Fas ligand | CGATTCGCTAGGACCGGGGA | AGTCATTGGCGGTACTCCACACA | |
| Toll | AGGAGGACTCGGATGACACAGC | ACTCCGGAACTTGGAGAGCACG | |
| MyD88 | CTGCCACACCCAACAACGCA | TCGAGACTGAGGTTCTCGCACA | |
| TRAF | CCCAACGACAGCCTTCTTTGACG | ATTACGGCCGGGGGCTTGTT | |
| IRAK | GAGTGGCCATTACGGCCGGG | GCTTGCATCGATCTGGCGGGT | |
| TRAF 6 | CACCTATTTCCGCTTCCCGCC | TGGAGGGTGGTGGTGCTCTT | |
| NF-κB | CCAAATGATGCACCTGCTCTTTTCAGT | CATTACGGCCGGGGAAGGGA | |
| Tal | GTTGACGCCATCGCTCTCGG | GCCATTACGGCCGGGGTTTA | |
| Jun | CGCCAACACCGACACAGTTCA | AACCCCCGGGGAGTGTTGTT | |
| AP-1 | TGCAGCTACACGGTTTCTGGC | TCGGCAACAACACTCCCC | |
| I-κB | TGAGGCAGCACTGAACGGAC | CGCAGAGTGTGCCAACAGCA | |
| STAT long form | ACGGCCGGGGTAAAGCTGAA | ACAAATCCAGCCACATGCCCA | |
| STAT (SH2 motif) | AGCGTCAAACACGACAGACGA | AGACCACGCCCTGTTTCAGC | |
| Cytolysin | CGGTTGCTGTGTAGCCGCAT | GGCGTCCAGAGACCGGAGTT | |
| Glutathione peroxidase | TTAACGGCGTCGTCGCTTGG | TGGCTTCTCTCTGAGGAACAACTG | |
| TIMP | TGTCCCATGGGTCTGGAACGG | TCAGCCTGTTCCTCTTGGCATT | |
| Thiolester- macroglobulin | CTGGCTCTCTCAGCTCGGCA | GGGCACTCTCCGGTCTTGGT | |
| Metallothionein 1B | TCGGCACTGTCCACACAAAACC | CAACCGGAAGCGGATGTGGC | |
| Big Defensin | CCGGGGGCGATTGCCTTTC | ACCAAGGCCCAAAATGCAGC | |
| Defensin | AACGCAGAGTGGGCCATACG | TCACTGGTGCGAACCGTTTGT | |
| Ferritin | TCAACGCAGAGTGGGGCCAT | GCGGTTCAGAAGTTGTTGTCACG | |
| Catalase | CATGTTAGCAGGCACTCCAGACC | TACGGCCGGGGGAAAAAGGT |
Based on sequences retrieved from the DeepSeaVent database, primers were designed to confirm, in RT-PCR experiments, the physical counterpart of B. azoricus genes putatively associated with immunity and inflammatory reactions.