| Literature DB >> 31640560 |
Phongthana Pasookhush1, Charles Hindmarch2, Paisarn Sithigorngul1,3, Siwaporn Longyant1,3, William G Bendena4, Parin Chaivisuthangkura5,6.
Abstract
BACKGROUND: Macrobrachium rosenbergii, is one of a major freshwater prawn species cultured in Southeast Asia. White tail disease (WTD), caused by Macrobrachium rosenbergii nodavirus (MrNV), is a serious problem in farm cultivation and is responsible for up to 100% mortality in the post larvae stage. Molecular data on how M. rosenbergii post-larvae launches an immune response to an infection with MrNV is not currently available. We therefore compared the whole transcriptomic sequence of M. rosenbergii post-larvae before and after MrNV infection.Entities:
Keywords: Crustacean immunity; Differential expression analysis; M. rosenbergii nodavirus; Macrobrachium rosenbergii; RNAseq; de novo transcriptome assembly
Mesh:
Year: 2019 PMID: 31640560 PMCID: PMC6805343 DOI: 10.1186/s12864-019-6102-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
The number of read pairs prior and after trimming
| Sample | Raw paired-end reads | Trimmed paired-end reads |
|---|---|---|
| Control 1 | 46,497,004 | 44,697,960 (96.13%) |
| Control 2 | 43,309,253 | 41,651,586 (96.17%) |
| Control 3 | 40,571,354 | 38,896,140 (95.87%) |
| Control 4 | 42,111,175 | 40,566,032 (96.33%) |
| Control 5 | 47,098,377 | 45,404,272 (95.63%) |
| Control 6 | 38,519,779 | 36,568,513 (94.93%) |
| Infected 1 | 44,325,031 | 42,803,756 (96.57%) |
| Infected 2 | 46,902,950 | 45,238,551 (96.45%) |
| Infected 3 | 49,289,659 | 47,443,959 (96.26%) |
| Infected 4 | 39,687,853 | 38,299,205 (96.50%) |
| Infected 5 | 33,036,002 | 31,758,938 (96.13%) |
| Infected 6 | 50,947,705 | 48,935,511 (96.05%) |
| Total reads | 522,296,142 | 501,900,423 (96.09%) |
Summary of the transcriptome assembly and quality assessment
| Prior to de novo assembly | |
| Length of raw reads (bp) | 75 |
| Total number of raw reads | 522,296,142 |
| Length of trimmed reads (bp) | 65 |
| Total number of trimmed reads | 501,900,423 (96.09%) |
| Total number of normalized reads | 51,971,920 (10.36%) |
| After de novo assembly | |
| Total number of contigs | 109,616 |
| Total number of unigenes | 96,362 |
| Mean length of unigenes (bp) | 776.73 |
| N50 length of unigenes (bp) | 1308 |
| Highest ExN50 value (% Ex) | 94 |
| Fragment mapping rate (%) | 96.85% |
| BUSCO completeness (%) | 83.4% |
Fig. 1Top 10 species distribution of Blastx results from different databases. a UniProt database b Non-redundant arthropod database c Non-redundant arthropod database with only crustacean species
Fig. 2GO distribution (level 2) of annotated unigenes based on UniProt database. GO assignments were divided into three categories including cellular process (CC, yellow), molecular function (MF, blue), and biological process (BP, green)
Fig. 3EggNOG classifications of annotated unigenes based on UniProt database. EggNOG functional annotations were divided into 23 categories. The EggNOG categories are shown on the horizontal axis as alphabets with category names on the right
Fig. 4KEGG orthology distribution of annotated unigenes based on UniProt database. KEGG orthology were categorized into five major categories. The names and distribution ratios of each category are shown in pie chart at the top right corner
Summary of the transcriptome assembly, annotation and differential abundance analysis
| Total number of contigs | 109,616 |
|---|---|
| Total number of unigenes | 96,362 |
| NCBI Nr annotated | 32,523 (33.75%) |
| UniProt annotated | 25,761 (26.73%) |
| GO annotated | 18,291 (18.98%) |
| EggNOG annotated | 20,130 (20.88%) |
| KEGG annotated | 19,715 (20.46%) |
| Differentially abundant gene (total) | 5538 (FDR < 0.05, LogFC < ±1) |
| Up-regulated gene | 2413 |
| Down-regulated gene | 3125 |
List of DEG transcripts involved in immune system
| Unigene | Functional annotation | Organisms | FC |
|---|---|---|---|
| Antiviral protein | |||
| DN14192_c1_g1_i2 | antiviral protein |
| 2.48 |
| Antimicrobial protein | |||
| DN46855_c0_g1_i1 | anti-lipopolysaccharide factor |
| 2.25 |
| DN3291_c0_g1_i1 | anti-lipopolysaccharide factor 1 |
| 2.73 |
| DN34234_c0_g1_i1 | anti-lipopolysaccharide factor 3 |
| 4.20 |
| DN9599_c0_g1_i3 | crustin 7, partial |
| 6.59 |
| DN584_c0_g1_i2 | crustin 6, partial |
| 3.48 |
| DN2919_c0_g1_i1 | crustin 5 |
| 3.05 |
| DN13113_c0_g1_i1 | i-type lysozyme-like protein 2 |
| −2.45 |
| DN5315_c0_g1_i1 | crustin A |
| −8.94 |
| DN25544_c0_g1_i1 | crustin 4 |
| −2.25 |
| Pattern recognition proteins (PRPs) | |||
| DN27838_c0_g1_i1 | C-type lectin |
| 3.29 |
| DN3149_c0_g1_i3 | C-type lectin 1 |
| 3.46 |
| DN25495_c0_g1_i4 | C-type lectin 2 |
| 4.32 |
| DN39427_c0_g1_i1 | C-type lectin 4 |
| −2.91 |
| DN8487_c0_g1_i1 | C-type lectin H |
| −3.05 |
| DN79_c0_g1_i9 | C-type lectin-like domain-containing protein PtLP |
| 2.58 |
| DN9664_c0_g1_i1 | C-type lectin-like protein |
| −4.08 |
| DN1597_c0_g1_i2 | down syndrome cell adhesion molecule |
| −2.03 |
| DN1969_c0_g1_i12 | ficolin |
| 2.55 |
| DN458_c0_g1_i6 | ficolin-like protein 2 |
| 3.32 |
| DN2732_c0_g1_i1 | lectin |
| 2.41 |
| DN22168_c0_g1_i1 | lectin 1 |
| 2.36 |
| DN1016_c0_g1_i2 | lectin 2 |
| 3.56 |
| DN2249_c0_g1_i2 | lectin 3 |
| 2.28 |
| DN248_c1_g1_i2 | lectin B isoform 2, partial |
| 2.13 |
| DN59056_c0_g1_i1 | lectin D, partial |
| 2.45 |
| DN11184_c0_g1_i1 | lectin E |
| 2.68 |
| DN15806_c0_g1_i1 | mannose-binding protein |
| −6.02 |
| DN51_c0_g1_i3 | tachylectin |
| 2.17 |
| Toll-IMD signaling pathway | |||
| DN1501_c0_g4_i1 | spatzle protein, partial |
| −17.88 |
| DN6896_c0_g2_i1 | toll-receptor 9 |
| 3.81 |
| DN48150_c0_g1_i1 | Nuclear factor NF-kappa-B p110 subunit |
| 4.69 |
| RNAi pathway | |||
| DN14942_c0_g1_i2 | dicer-2 |
| 3.66 |
| DN13890_c0_g1_i2 | argonaute-3 |
| 2.16 |
| Prophenol oxidase system (ProPO) | |||
| DN24054_c0_g1_i2 | prophenoloxidase, partial |
| 2.01 |
| DN9200_c0_g1_i1 | prophenoloxidase-activating enzyme 2a |
| 2.99 |
| DN27498_c0_g1_i2 | prophenoloxidase activating factor 1 |
| 2.31 |
| DN6266_c0_g1_i1 | prophenoloxide activating enzyme III |
| 2.23 |
| Serine proteinase cascade | |||
| DN449_c1_g1_i1 | serine proteinase |
| 2.51 |
| DN11492_c0_g1_i8 | serine proteinase stubble |
| −6.73 |
| DN833_c0_g1_i2 | serine proteinase inhibitor 6 |
| −2.48 |
| DN2466_c0_g1_i2 | alpha-2-macroglobulin |
| −2.68 |
| DN17466_c0_g1_i2 | pacifastin heavy chain |
| 2.14 |
| Ubiquitin proteasome pathway | |||
| DN49814_c0_g5_i1 | E3 ubiquitin-protein ligase Ubr3 |
| 2.22 |
| DN8249_c1_g1_i1 | E3 ubiquitin-protein ligase RNF216-like, partial |
| 3.89 |
| DN10760_c0_g1_i3 | ubiquitin |
| −4.89 |
| DN1685_c0_g1_i9 | RING finger protein nhl-1-like |
| 3.63 |
| Antioxidant system | |||
| DN6006_c0_g2_i1 | Microsomal glutathione S-transferase 1 |
| 2.01 |
| DN19734_c0_g1_i1 | glutathione peroxidase 3 |
| 4.08 |
| DN8125_c0_g1_i2 | selenium independent glutathione peroxidase |
| −2.20 |
| DN27870_c0_g2_i3 | copper/zinc superoxide dismutase isoform 3 |
| −13.45 |
| DN29677_c0_g1_i1 | thioredoxin |
| −2.16 |
| Blood coagulation | |||
| DN49188_c0_g1_i1 | transglutaminase |
| 2.08 |
| DN2823_c0_g1_i2 | hemicentin-1-like isoform X2 |
| 5.31 |
| Apoptosis | |||
| DN370_c0_g1_i2 | caspase |
| 2.64 |
| DN12951_c0_g1_i2 | caspase 4 |
| 3.76 |
| DN21090_c0_g1_i2 | inhibitor of apoptosis protein |
| 2.79 |
| Phagocytosis | |||
| DN57596_c0_g2_i1 | Ras-related protein Rab-37 |
| 2.66 |
| DN13418_c0_g1_i1 | Rab32 |
| 2.46 |
| DN19755_c0_g3_i1 | rac GTPase-activating protein 1-like |
| 2.95 |
| DN13060_c0_g1_i1 | VLIG2 |
| 3.92 |
| DN147_c0_g2_i3 | VLIG1 |
| 2.89 |
| DN18093_c0_g1_i3 | ADP-ribosylation factor |
| 2.19 |
| DN27087_c0_g1_i2 | interferon regulatory factor |
| 3.32 |
| Other immune genes | |||
| DN8990_c0_g1_i3 | Ferritin |
| 2.93 |
| DN39999_c0_g1_i6 | calcium/calmodulin-dependent protein kinase type II alpha chain isoform X6 |
| 2.19 |
| DN47919_c0_g1_i9 | integrin, partial |
| 2.53 |
| DN17544_c0_g2_i1 | integrin alpha 8 |
| 2.55 |
| DN11203_c0_g4_i1 | integrin alpha 4, partial |
| 2.04 |
| DN2727_c0_g1_i3 | Cathepsin B |
| 2.69 |
| DN928_c0_g1_i2 | cathepsin C |
| 2.50 |
| DN12284_c0_g1_i1 | cathepsin L |
| 2.66 |
| DN1236_c0_g1_i6 | crustacyanin-like lipocalin |
| −2.10 |
| DN18816_c0_g2_i1 | crustacyanin A, partial |
| −2.48 |
Fig. 5Principle component analysis of twelve samples (84,092 transcripts). PC 1 and 2 are principle component 1 and 2, respectively. Blue triangles are MrNV-infected group (n = 6), whereas red circles are control group (n = 6)
Fig. 6Heatmap of top 100 differentially expressed transcripts. The heatmap was generated using trimmed mean of M-values (TMM). Sample clustering was done using Pearson’s correlation. The Z-score scale is shown in the top-right corner ranging from − 2 (blue) to 2 (yellow)
Comparison of fold change in transcript abundance between qRT-PCR and RNAseq
| Gene symbol | qRT-PCR | RNAseq | ||
|---|---|---|---|---|
| Fold change ± SD |
| Fold change | Corrected P | |
|
| 5.86 ± 0.72 | 0.001 | 2.25 | 0.011 |
|
| −5.72 ± 1.09 | 0.003 | −13.45 | 0.001 |
|
| 8.97 ± 1.19 | 0.001 | 2.48 | 0.007 |
|
| −3.26 ± 1.08 | 0.053 | −17.88 | 2.90E-09 |
|
| 12.05 ± 0.91 | 7.04E-05 | 2.64 | 1.03E-06 |
|
| 6.81 ± 1.32 | 0.006 | 3.66 | 4.31E-11 |
|
| 16.07 ± 3.91 | 0.005 | 5.31 | 0.003 |
|
| 5.45 ± 0.90 | 0.004 | 2.19 | 0.008 |
|
| 9.38 ± 1.43 | 0.002 | 2.01 | 8.65E-06 |
Fig. 7Comparison of fold change in gene expression using either qRT-PCR or RNAseq. a Heatmap representing transcript per million (TPM) expression from RNAseq and relative expression from qRT-PCR. The Z-score scales are shown in the top-right corner ranging from blue to yellow. b Regression plot demonstrating the direct correlation between the average log2 FC expression values from both RNAseq and qRT-PCR. c The qRT-PCR validation results of nine selected genes including anti-lipopolysaccharide factor 1 (ALF1), Spatzle (Spz), copper/zinc superoxide dismutase 3 (CuZnSOD3), caspase (CASP), antiviral protein (Anv), dicer (DICER), hemicentin-1-like (HMCN1), ADP ribosylation factors (ARF), and prophenoloxidase (ProPO) with elongation factor1-alpha (EF1-alpha) as an internal reference
Fig. 8The overviews of analysis pipeline. The pipeline is divided into three major parts including raw data pre-processing, transcriptome assembly, and post-precessing of the transcriptome which indicated by area of different colors. Boxes represent datasets. Rounded boxes represent analyses. The software for each analysis is indicated at the top of the box whereas the database for homology search are listed under the box
Primers used in the qRT-PCR experiment
| Primer name | Sequence | Efficiency (%) | R2 |
|---|---|---|---|
| ALF1-F | 5′-CTG GTG ACG GAA GAA GAA GC-3′ | 98.13 | 0.9975 |
| ALF1-R | 5′–CTT AAC CAG GCC ATT CCT CA–3’ | ||
| Spz-F | 5′-CGA CGGA ATA CCC GAC CTA CA-3’ | 92.27 | 0.9926 |
| Spz-R | 5′-TGT CGG TTT TGC AGA CGT AG-3’ | ||
| CuZnSOD3-F | 5′-GGG AGA CCT AGG GAA CAT CC-3’ | 95.33 | 0.9920 |
| CuZnSOD3-R | 5′-GTG GAT GAC CAC GGC TCT AT-3’ | ||
| CASP-F | 5′-CTG CCC TGA ATT CCT CTC TG-3’ | 105.24 | 0.9885 |
| CASP-R | 5′-CGA AGG TGG TAT GGA GCA AT-3’ | ||
| Anv-F | 5′-AAT GGT GGT ATC AGC CTT GC-3’ | 94.53 | 0.9885 |
| Anv-R | 5′-TTA GAG GGT CGA CCA TGA GG-3’ | ||
| DICER-F | 5′-CAC TCG AGC ATC CTG TTT CA-3’ | 107.80 | 0.9969 |
| DICER-R | 5′-ACC AAT CCC CAT CCA ATG TA-3’ | ||
| HMCN1-F | 5′-TAA GGC AAC CGA CCA CTA CC-3’ | 107.85 | 0.9940 |
| HMCN1-R | 5′-GAC GTA GAG ACT GGC GGA AG-3’ | ||
| ARF-F | 5′-CCC ATT ACA GTG GTC CTG CT-3’ | 95.22 | 0.9928 |
| ARF-R | 5′-CAG AAC CCT TCC CTT CAC AA-3’ | ||
| ProPO-F | 5′-AAC AAC CTG AGA ACC GGA TG-3’ | 93.46 | 0.9899 |
| ProPO-R | 5′-CGG CAG GGT TGG CAT AAT CT-3’ | ||
| EF1a-F | 5′-TGC GCT GTG TTG ATT GTA GC-3’ | 103.19 | 0.9834 |
| EF1a-R | 5′-ACA ATG AGC TGC TTG ACA CC-3’ |