| Literature DB >> 20707909 |
Li-xin Xiang1, Ding He, Wei-ren Dong, Yi-wen Zhang, Jian-zhong Shao.
Abstract
BACKGROUND: Systematic research on fish immunogenetics is indispensable in understanding the origin and evolution of immune systems. This has long been a challenging task because of the limited number of deep sequencing technologies and genome backgrounds of non-model fish available. The newly developed Solexa/Illumina RNA-seq and Digital gene expression (DGE) are high-throughput sequencing approaches and are powerful tools for genomic studies at the transcriptome level. This study reports the transcriptome profiling analysis of bacteria-challenged Lateolabrax japonicus using RNA-seq and DGE in an attempt to gain insights into the immunogenetics of marine fish.Entities:
Mesh:
Year: 2010 PMID: 20707909 PMCID: PMC3091668 DOI: 10.1186/1471-2164-11-472
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Distribution of non-redundant consensus sequences
| Consensus Length (bp) | Total Number | Percentage |
|---|---|---|
| 100-500 | 143865 | 84.65% |
| 500-1000 | 14313 | 8.42% |
| 1000-1500 | 5475 | 3.22% |
| 1500-2000 | 2849 | 1.68% |
| ≥2000 | 3448 | 2.03% |
| Total | 169950 | 100.00% |
| Total bp | 57817772 | |
| N50 | 598 | |
| Mean | 340 | |
N50 = median length of all non-redundant consensus sequences
Mean = average length of all consensus sequences
Figure 1GO annotations of non-redundant consensus sequences. Best hits were aligned to the GO database, and 16,469 transcripts were assigned to at least one GO term. Most consensus sequences were grouped into three major functional categories, namely biological process, cellular component, and molecular function.
Figure 2COG annotations of putative proteins. All putative proteins were aligned to the COG database and can be classified functionally into at least 25 molecular families.
Figure 3KEGG categories of non-redundant consensus sequences. All non-redundant consensus sequences were annotated using KEGG Automatic Annotation Server for pathway information, and about 24,496 consensus sequences were annotated. The categories GIP and EIP stand for genetic information processing and environmental information processing, respectively.
Annotation of non-redundant consensus sequences
| Database | Number of annotated | Percentage of annotated |
|---|---|---|
| Swissprot | 37544 | 76.64% |
| Nr | 36935 | 75.40% |
| GO | 16469 | 33.62% |
| KEGG | 24496 | 50.01% |
| COG | 9545 | 19.48% |
All 48,987 CD-containing consensus sequences generated by ESTscan were annotated though Swissprot, Nr, GO, KEGG, and COG databases.
Immune-relevant genes/homologues in L. japonicus
| Putative Gene Catalogs | Consensus Number | Homologs Number in other species | Putative Gene Catalogs | Consensus Number | Homologs Number in other species |
|---|---|---|---|---|---|
| TLR (1-9,13,14,18,21,23) | 44 | 17 | TCR (α,β chain) | 6 | 4 |
| scavenger receptor | 19 | 11 | BCL (2,3,6,7,9,10,11) and relevant | 44 | 19 |
| Mannose receptor | 49 | 26 | Immunoglobulin and relevant | 52 | 32 |
| MBL | 13 | 9 | CD (2-6,8,9,22,40,79,81,97,99,154,166, 226,276) | 47 | 29 |
| NOD (1,2) | 7 | 5 | BLNK | 5 | 3 |
| C-type lectin (4,6,7,9-12) | 36 | 26 | AICDA | 3 | 2 |
| TOLLIP | 1 | 1 | CBLB | 7 | 4 |
| LPS-binding/anchor protein | 19 | 3 | DOCK2 | 4 | 2 |
| HMG (A,B,2L1,20A) | 8 | 6 | DRAM | 1 | 1 |
| LRR-containing proteins | 168 | 74 | Glomulin | 2 | 2 |
| NALP (1,3,4,6,9,12,13) | 21 | 8 | HDAC (1-8,10,11) | 40 | 23 |
| PGRP | 1 | 1 | GALNAC4S-6ST | 5 | 3 |
| BTK | 2 | 2 | KLF (1-4,6-9,11-13) | 29 | 15 |
| Fibrinogen | 4 | 4 | IGBP1 | 1 | 1 |
| HLA | 1 | 1 | |||
| C1 (q,r,s,t),C2-C7,C9,C10 | 40 | 32 | JAG (1,2) | 37 | 6 |
| Complement factor (B,H,I,properdin, DAF) | 22 | 11 | KI13B | 17 | 1 |
| Complement receptor (1,2,5) | 13 | 8 | LAG | 1 | 1 |
| MASP | 4 | 3 | NCK | 7 | 5 |
| RGC | 2 | 2 | Homeobox protein (NKX) | 4 | 3 |
| PAWR | 3 | 3 | |||
| IL (1,3,8,10-12,15,16,27,34) and relevant | 34 | 17 | Protein kinase C δ | 6 | 3 |
| ILR (1-3,6,7,10,12,15,17,18,20-22,31) and relevant | 48 | 31 | Prolactin receptor | 5 | 2 |
| IFN | 3 | 2 | RAG (1,2) | 3 | 2 |
| IFN-induced proteins and relevant | 59 | 27 | RGS (1-5,7-9,11,12,14,16,18-20) | 47 | 23 |
| IFN (α,β,ω) receptor | 2 | 1 | SFTPD/SP-D | 1 | 1 |
| IFN-related regulator | 31 | 16 | TYK2 | 1 | 1 |
| CRLF (1,3) | 3 | 2 | VAV oncogene family (1-3) | 13 | 6 |
| EBI 2 | 5 | 2 | WAS proteins | 23 | 11 |
| FND (3,C) | 12 | 4 | WWP | 2 | 2 |
| TTN/TITIN | 5 | 2 | |||
| CXCL(6,12,14),CXC,CXCR(1-5,7) | 20 | 17 | MHC I/II | 14 | 5 |
| CCL(2-4,7,14,16,19,C11b),CCR(2-9,11) | 33 | 26 | PSMA (1-7)/B/C/D/E(2-4) | 93 | 63 |
| CX3CR | 1 | 1 | Legumain | 5 | 3 |
| XCL(1),XCR(1,1a) | 6 | 3 | ITCH | 5 | 2 |
| Chemokine receptor-like | 4 | 2 | RFX5 | 4 | 1 |
| CFLAR | 1 | 1 | TAPBP | 3 | 2 |
| EGF | 125 | 50 | ASPP (1,2) | 24 | 4 |
| FAM | 15 | 8 | ICAM2 | 1 | 1 |
| FGF | 23 | 14 | RAC1 | 3 | 1 |
| FLT3/CD135 | 2 | 2 | AGT | 4 | 2 |
| GATA (1-3,5,6) | 14 | 8 | CDC (2,5-7,16,20,23,27,42,A) | 23 | 15 |
| GDF | 3 | 3 | CDK-inhibitor (1-3) | 9 | 8 |
| GFI-1/-1B | 13 | 3 | CTLA4 | 1 | 1 |
| GPR44 | 4 | 2 | |||
| Inhibin β | 1 | 1 | CD (33,34,44,45,48,83,151,302) | 10 | 8 |
| Integrin α/β | 89 | 38 | Apolipoprotein A | 2 | 2 |
| LIFR | 5 | 3 | HELLS | 1 | 1 |
| Lipoteichoic acid | 1 | 1 | IMPDH/IMDH | 4 | 3 |
| MAF | 2 | 2 | AP-1 | 20 | 15 |
| MIF | 3 | 2 | c-FOS | 2 | 2 |
| PCGF (1-3, 5, 6, A) | 8 | 6 | EGR | 11 | 5 |
| SCYE1/MCA | 3 | 2 | ELK4 | 1 | 1 |
| SDF2 | 2 | 2 | IKKA/CHUK, IKKB | 8 | 2 |
| TGF A/B(1-3) | 10 | 9 | JAK (1,2) | 11 | 6 |
| TNF and TNR | 54 | 38 | JUN oncogene | 11 | 6 |
| Thrombopoietin | 1 | 1 | MALT | 5 | 2 |
| VEGF (A,C,D) | 5 | 4 | NFAT | 32 | 12 |
| VEGF receptor (1-3) | 17 | 8 | NF κ-B inhibitor proteins | 11 | 4 |
| PPM | 33 | 14 | |||
| ASB (1-3,5-8,12,13) | 26 | 14 | RIPK (1-3,5) | 7 | 5 |
| CRADD | 2 | 1 | SLC20A-1/2 | 6 | 4 |
| DEDD | 4 | 2 | Transcription factor Sp and SPT | 21 | 18 |
| EIF 2-α kinase 2 | 7 | 2 | TANK-binding kinase | 9 | 5 |
| FADD | 1 | 1 | TBX | 12 | 7 |
| Heme oxygenase | 4 | 2 | TMED (1-5,7-9,A) | 12 | 10 |
| HTR (5A,5B) | 22 | 4 | TRIM (1-3,7-9) | 20 | 8 |
| IRAK (1-4) | 7 | 6 | Antimicrobial peptide | 2 | 2 |
| CSF | 12 | 5 | Co-chaperone | 4 | 3 |
| Myd88 | 2 | 1 | Ferritin | 1 | 1 |
| NR2C | 6 | 4 | Hepcidin | 1 | 1 |
| PEA15 | 2 | 1 | HSP (7,13,70-72,74,B) | 35 | 20 |
| PELI (1-3) | 21 | 6 | Lysozyme | 4 | 3 |
| PHLA (1-3) | 4 | 3 | Macroglobulin | 9 | 6 |
| PPARA | 2 | 2 | Microtubule-associated | 109 | 45 |
| REL | 1 | 1 | Nitric oxide synthase | 4 | 3 |
| RHOA | 3 | 1 | Proteolipid protein | 1 | 1 |
| RIP | 2 | 2 | Prohibitin | 3 | 2 |
| SARM1 | 8 | 2 | Selenoprotein | 22 | 11 |
| SIGIRR | 2 | 2 | ABCF | 8 | 7 |
| SMAD (1-7,9) | 33 | 16 | CAMP | 13 | 4 |
| SOCS (1-7) | 18 | 8 | CARD | 5 | 3 |
| TICAM1 | 1 | 1 | CEBP | 7 | 5 |
| TIRAP | 1 | 1 | DMBT | 6 | 4 |
| TRADD | 1 | 1 | PTAFR | 1 | 1 |
| TRAF (2-7) | 10 | 7 | Protein S100-A | 5 | 4 |
| TANK | 2 | 2 | UCRP/CRP | 1 | 1 |
| UBE | 47 | 25 | CYBB | 1 | 1 |
| STAT | 21 | 7 | Caspase (1-3,6-9) | 18 | 13 |
| NF κ | 15 | 4 | Adenosine receptor A (2a,2b) | 11 | 7 |
| MAPK (1-3,5-9,11-15) | 47 | 29 | Apoptotic relevant | 13 | 6 |
| MAP2K4(JUKK1) | 2 | 2 | IRG1 | 4 | 2 |
| MAP3K (2-9,11,14,15) | 35 | 18 | Stress response protein | 22 | 6 |
| MAP3K 7-interacting protein (TAB) | 4 | 4 | WSC (1,2,4) | 8 | 3 |
| JNK | 5 | 4 | OSGI2 | 1 | 1 |
Putative genes/homologues were confirmed based on their similarity to immune-relevant genes in other vertebrate species as predicted by BLAST. Cutoff E-value was set to 1e-10 during BLAST analysis. Characterization of the putative genes is presented in Additional file 1, Table S1.
Figure 4Putative TLR signal pathway. Putative TLR signal pathway of L. japonicus was constructed based on knowledge of TLR signalling in mammalian species. However, most interactions have to be confirmed experimentally.
Figure 5Putative TCR signal pathway. Putative TCR signal pathway of L. japonicus was constructed based on knowledge of TCR signalling in mammalian species. However, most interactions require experimental confirmation.
Figure 6Distribution of tags and gene expression between experimental and control groups. The black and gray columns indicate the distribution of tags and gene expression in bacteria- and mock-challenged groups, respectively. The distribution of tags matches the distribution of genes for both groups. Furthermore, an increase in tags or gene expression is accompanied by a decrease in the frequencies of tags or genes.
Figure 7Differential expression analyses of tags and consensus sequences by DGE. The expression level for each tag and consensus was included in the volcano plot (Figures A and B, respectively). 'Not DETs' and 'Not DEGs' indicate 'not detected expression tags' and 'not detected expression genes', respectively. For Figures A and B, the x-axis contains Log10 of transcript per million of the bacteria-challenged group and the y-axis indicates Log10 of transcript per million of the mock-challenged group. Limitations are based on P < 0.01, and the absolute value of Log2 (B/A) is greater than 1.
Figure 8Volcano plot of differentially expressed consensus sequences. For every consensus, the ratio of expression levels in the bacteria-challenged group over that in the mock-challenged group was plotted against the -log error rate. The horizontal line indicates the significance threshold (0.01 FDR), and the vertical lines indicate the twofold change threshold.
Representatives of putative immune relevant genes/homologs as predicted by DGE
| Catalogs | Consensus ID | Putative Identification | Fold Change |
|---|---|---|---|
| TLR and relevant | 3365 | TLR 1 | -6.45 |
| 8296 | TLR 21 | -374.00 | |
| Mannose receptor | 158882 | Macrophage mannose receptor 1 | -2.94 |
| NOD | 117946 | NOD protein 2 | +4.25 |
| C-type lectin | 36609 | Collectin-12 | +3.69 |
| HMG | 3367 | HMG AT-hook 1 | -218.00 |
| LRR-containing proteins | 128723 | F-box/LRR protein 14 | +5.87 |
| 5434 | F-box/LRR protein 20 | -2.35 | |
| 36369 | F-box/LRR protein | -249.00 | |
| 3391 | LRR-containing protein 68 | -6.14 | |
| 39099 | LRR flightless-interacting protein 2 | +2.32 | |
| NALP | 62330 | NALP 3 | -2.04 |
| 43979 | NALP 6 | -2.47 | |
| 719 | NALP 12 | -312.00 | |
| 14950 | NALP 13 | -3.77 | |
| BTK | 37665 | BTK | -249.00 |
| C1 | 39636 | C1q-like protein 4 | +5.03 |
| Complement factor | 58617 | Complement factor H-related protein 3 | -2.55 |
| IL, ILR, and relevant | 7183 | IL6R subunit β | -3.06 |
| 4828 | IL7R subunit α | -218.00 | |
| 29 | IL10R α chain | +265.00 | |
| 8308 | IL12R β-2 chain | -7.09 | |
| 70266 | Putative IL17R E-like | +2.23 | |
| 9160 | IL18R | -2.33 | |
| 36378 | IL18R accessory protein | -2.96 | |
| 67818 | IL31RA | -2.38 | |
| 4392 | IL enhancer-binding factor 3 | -4.29 | |
| IFN and relevant | 34 | similar to IFN-inducible protein Gig1 | -3.76 |
| 39770 | IFN-induced helicase C domain-containing protein 1 | -2.15 | |
| 39343 | IFN-induced GTP-binding protein Mx | +2.22 | |
| 42748 | IFN-induced transmembrane protein 2 | -8.97 | |
| 41016 | IFN regulatory factor 4 | -5.98 | |
| 43856 | IFN regulatory factor 8 | -6.14 | |
| CXCR | 41529 | CXC chemokine receptor 5 | -2.59 |
| CCL | 71546 | C-C motif chemokine 19 | +2.20 |
| XCR | 70282 | XC Chemokine receptor 1 | +4.03 |
| EGF | 8053 | SNED | -312.00 |
| FAM | 37053 | Protein FAM3C | -2.35 |
| FLT | 9012 | FLT3 ligand cytokine receptor | -405.00 |
| GATA | 42892 | GATA-binding factor 2 | -15.43 |
| GFI | 6300 | Zinc finger protein GFI-1 | +6.58 |
| 16357 | Zinc finger protein GFI-1B | -2.01 | |
| Integrin | 169765 | Integrin α-3 | +298.00 |
| PCGF | 36691 | PCGF 2 | -2.54 |
| TGF | 34228 | TGF β-1-induced transcript 1 protein | -4.02 |
| Thrombopoietin | 8606 | Thrombopoietin receptor | -4.19 |
| TNF and TNR | 167272 | TNF α-induced protein 8-like protein 2 | -6.38 |
| 36673 | TNR 1A | -405.00 | |
| 2890 | TNR 12A | -281.00 | |
| 9356 | EMILIN 1 | -2.40 | |
| ASB | 71979 | ASB2 | -5.04 |
| DEDD | 1437 | DEDD | -3.40 |
| Heme oxygenase | 71817 | Heme oxygenase | +2.48 |
| CSF | 50449 | Macrophage CSF 1 receptor 1 | -2.09 |
| 11081 | Macrophage CSF 1 receptor 2 | -218.00 | |
| 49226 | Granulocyte CSF receptor | -3.09 | |
| NR2C | 42918 | NR2C 2-associated protein | +4.22 |
| SMAD | 42078 | SMAD 1 | -437.00 |
| 42615 | SMAD 5 | -405.00 | |
| TRAF | 42729 | TRAF7 | -2.27 |
| 70242 | TANK | -2.11 | |
| UBE | 2069 | UBE E2C | +2.07 |
| 9613 | UBE E2H | -2.42 | |
| NF κ | 1073 | NF κ-A | +3.18 |
| MAPK | 68093 | MAPK 3 | -9.45 |
| 70484 | MAPK 6 | -312.00 | |
| MAP2K | 37130 | MAP2K4 | -10.39 |
| MAP3K | 38157 | MAP3K3 | -3.26 |
| 7280 | MAP3K4 | -4.92 | |
| CD | 60890 | CD2 | +2.81 |
| 70094 | CD22 | +4.10 | |
| 4032 | CD40 (TNR5) | -343.00 | |
| 26107 | CD79B | -2.48 | |
| DOCK2 | 4848 | DOCK2 | -2.54 |
| RAG | 36208 | RGA2 | -6.39 |
| RGS | 905 | RGS12 | -2.60 |
| TYK2 | 9722 | Non-receptor TYK2 | -2.11 |
| VAV oncogene family | 9761 | VAV2 | -343.00 |
| MHC I/II | 65343 | MHC II transactivator | -218.00 |
| PSM | 115644 | PSMB3 | +3.06 |
| 5004 | PSMC2 | +2.71 | |
| 42169 | PSMC6 | +3.39 | |
| 208 | PSMD3 | -2.05 | |
| 7477 | PSMD8 | +5.29 | |
| TAPBP | 10433 | TAPBP | -281.00 |
| RAC | 36540 | RAC1 | -2.72 |
| CDC | 70253 | CDC 16 | +2.78 |
| 55784 | CDC 42 | +331.00 | |
| CDK-inhibitor | 43078 | CDK 4 inhibitor D | +2.58 |
| AP-1 | 41196 | AP-1 complex subunit σ-2 | +4.03 |
| 10356 | AP1 subunit mu-1 | +2.05 | |
| JAK | 818 | JAK2 | -312.00 |
| MALT | 21139 | MALT protein | -6.14 |
| PPM | 365 | Protein phosphatase 1F | -2.14 |
| RIPK | 1908 | RIPK1 | -7.88 |
| TMED | 8436 | TMED2 | -3.18 |
| Hepcidin | 56643 | Hepcidin | +6.10 |
| HSP | 37584 | HSP 71 | -2.96 |
| Lysozyme | 51 | Lysozyme g | +2.18 |
| Macroglobulin | 2234 | α-2-macroglobulin | +3.31 |
| Microtubule-associated | 12594 | EMAP 3 | -249.00 |
| Proteolipid protein | 38379 | Proteolipid protein 2 | -3.60 |
| Prohibitin | 70061 | Prohibitin-2 | -7.09 |
| Selenoprotein | 80124 | Selenoprotein K | +3.19 |
| ABCF | 3906 | ABCF 2 | -3.49 |
| CARD | 1588 | CARD 11 | -3.08 |
| Caspase | 42397 | Caspase 1 | -2.05 |
| 31546 | Caspase 8 | -5.07 | |
| Stress response protein | 44158 | Stress response protein NST1 | -2.15 |
Limitations of all differentially expressed genes are based on P ≤ 0.01. The absolute value of Log2Ratio≥1 and FDR≤0.05 showed that the genes were significantly altered after bacterial challenge. The cutoff E-value was set to 1e-10 during the gene annotation process. The absolute value of "Fold Change" means magnitude of up- or down-regulation for each gene/homolog after bacteria challenge; "+" indicates up-regulation, "-" indicates down-regulation.