| Literature DB >> 24373586 |
Ruijia Wang, Luyang Sun, Lisui Bao, Jiaren Zhang, Yanliang Jiang, Jun Yao, Lin Song, Jianbin Feng, Shikai Liu, Zhanjiang Liu1.
Abstract
BACKGROUND: The application of RNA-seq has accelerated gene expression profiling and identification of gene-associated SNPs in many species. However, the integrated studies of gene expression along with SNP mapping have been lacking. Coupling of RNA-seq with bulked segregant analysis (BSA) should allow correlation of expression patterns and associated SNPs with the phenotypes.Entities:
Mesh:
Year: 2013 PMID: 24373586 PMCID: PMC3890627 DOI: 10.1186/1471-2164-14-929
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of Illumina sequencing of the catfish liver transcriptome with extreme phenotypes after ESC infection
| Number of reads | 132,406,228 | 116,652,262 | 151,256,664 | 400,315,154 |
| Read length (bp) | 100 bp | 100 bp | 100 bp | 100 bp |
| Number of reads after trimming | 123,437,613 | 110,344,258 | 140,669,530 | 374,451,401 |
| Percentage kept after trimming | 93.2% | 94.6% | 93.0% | 93.5% |
| Average read length after trimming | 95.8 bp | 96.0 bp | 95.8 bp | 95.9 bp |
Summary of assembly of the catfish liver transcriptome with infection of ESC generated by Illumina sequencing and assembled with Trinity
| Number of non-redundant contigs | 232,338 |
| Large contigs (≥1,000 bp) | 51,601 |
| Length of the largest contig | 18,759 bp |
| N50 size | 1900 bp |
| Average length of non-redundant contigs | 825 bp |
| % reads mapped to the final reference | 91.68% |
| Number of contigs with hits | 55,130 |
| Unigene matches | 22,126 |
| Known gene matches | 15,599 |
| Unknown hypothetical gene matches | 6,527 |
Contigs were used as queries for BLAST searches against Zebrafish and NR protein databases. The cut-off value for gene identity was set at e-value ≤ 1e-5.
Analysis of differentially expressed genes (fold change ≥2, p ≤ 0.05) after infection with
| Number of genes differentially expressed in resistant fish | 224 | 130 | 94 | 63 | 42 |
| Number of genes differentially expressed in susceptible fish | 1,240 | 771 | 469 | 520 | 233 |
| Regulated genes only in resistant fish | 89 | 57 | 32 | 43 | 35 |
| Regulated genes only in susceptible fish | 1,093 | 682 | 411 | 464 | 200 |
Comparison of gene expression between resistant fish and susceptible fish after infection with
| Differentially expressed genes between resistant and susceptible fish | 1,255 |
| Gene expressed higher in resistant fish | 528 |
| >100 fold | 5 |
| 50-100 fold | 18 |
| 10-50 fold | 86 |
| 5-10 fold | 94 |
| 2-5 fold | 325 |
| Genes expressed lower in resistant fish | 727 |
| >100 fold | 2 |
| 50-100 fold | 10 |
| 10-50 fold | 86 |
| 5-10 fold | 159 |
| 2-5 fold | 470 |
These genes are listed in Additional file 1: Table S1.
Figure 1Volcano plot of genes differentially expressed between resistant and susceptible fish. The dots located in the positive area stand for genes expressed higher in resistant fish, and dots located in the negative area stand for genes expressed higher in susceptible fish. As shown in graphic symbol, different color were used to scale different expression fold changes; purple stands for expression fold changes higher than 100-fold; red stands for expression fold changes from 50–100 fold; light blue stands for expression fold changes from 10–50 fold; green stands for expression fold changes from 5–10 fold; blue stands for expression fold changes from 2–5 fold; and gray stands for gene expressed insignificantly (p-value > 0.05 or fold change smaller than 2).
Identification of SNPs and significant SNPs (allele frequencies statistically different between the resistant and susceptible groups) from the assembled catfish liver transcriptome
| Total number of SNPs | 513,371 |
| Number of contigs containing SNPs | 31,646 |
| Number of significant SNPs | 56,419 (10.99%) |
| Number of contigs containing significant SNPs | 11,249 |
| Number of contigs with significant hits to genes | 5,480 |
| Number of genes containing significant SNPs | 4,304 |
| Number of genomic scaffolds containing significant SNPs | 2,096 |
Figure 2Genes harbouring significant SNPs, plotted by the log(BFR) versus rank of RPKM. Genes which have BFR larger than 16 (log2 =4) were highlighted by red and their gene names were labeled.
Allele ratio combination in two families
| AA x AG | 7:1 | 3:1 | 5:3 |
| AG x AG | 3:1 | 5:3 | 1:1 |
| AA x AA | - | 7:1 | 3:1 |
Figure 3Genes harbouring significant SNPs, plotted by their combined allele ratios versus rank of RPKM. Red dots represent genes with BFR ≥ 4 and combined allele ratio ≤ 9; brown dots represent genes with BFR ≥ 4 and allele ratio ≥ 14; green dots represent genes with BFR < 4 and allele ratio ≥ 14, blue stands for genes with allele ratio from 9 to 14, or genes with BFR < 4 and allele ratio ≤ 9. The three threshold lines of combined allele ratio of 1:7 (maximal possible ratio at the DNA level for any polymorphic SNP with two families), 1:9 (3X maximal possible ratio at the DNA level for SNPs polymorphic in both families); and 1:14 (2X maximal possible ratio at the DNA level for any polymorphic SNP with two families) were drawn for references.
Figure 4The distribution of high BFR genes (BFR ≥ 4) on linkage groups. Blue stands for the number of genes with 5 ≤ BFR < 10, red stands for the number of genes with BFR ≥ 10.
Figure 5Distribution of genes containing SNPs of high BFR in linkage group 6.
Figure 6Distribution of genes containing SNPs of high BFR in linkage group 15.
Figure 7Distribution of genes containing SNPs of high BFR in linkage group 17.
Figure 8Genes harbouring significant SNPs, plotted by their combined allele ratios versus the rank of RPKM. Red dots stand for genes with the preferentially expressed allele expressed higher in resistant group and their parental origin unknown; Solid red triangles stand for genes with the preferentially expressed allele expressed higher in resistant group and their parental origin being channel catfish; Unfilled red triangles stand for genes with the preferentially expressed allele expressed higher in susceptible group and their parental origin being channel catfish; Solid blue triangles stand for genes with the preferentially expressed allele expressed higher in resistant group and their parental origin being blue catfish; Unfilled blue triangles stand for genes with the preferentially expressed allele expressed higher in susceptible group and their parental origin being blue catfish; and green dots stand for genes with the preferentially expressed allele expressed higher in susceptible group and their parental origin unknown.
Figure 9Relationship between RPKM, BFR and p-value. The BFR is shown on the left-Y axis (red) and P-values are shown on right-Y axis (blue), both against the rank of RPKM on X-axis. Note there is correlation between p-values and RPKM, but not the BFR.
Differentially expressed genes after infection with high bulk frequency ratios and low combined allele ratio
| Metalloreductase STEAP2 | 21.2 | 3 | Regulates iron ion homeostasis and involved in endocytosis |
| Tumor suppressor candidate 5 homolog | 34.0 | 2.3 | Inhibits breast tumor formation in |
| Protein G7c-like | 17.8 | 3.2 | Effect the susceptibility to lung tumors |
| Acidic chitinase-like | 9.8 | 1.4 | Participates in the defense against nematodes, fungi and bacteria. Plays a role in T-helper cell type 2 (Th2) immune response. |
| Mannan-binding lectin serine peptidase 2 | 4.1 | 2.2 | Lectin complement pathway actication |
| Inter-alpha-trypsin inhibitor heavy chain H4 | 4.3 | 1.3 | Type II acute-phase protein (APP) involves in inflammatory responses to trauma. May also play a role in liver development or regeneration. |
| D-amino acid oxidase | 4.3 | 1.8 | Regulation of the glutamatergic neurotransmission; may play a role in the glutamatergic mechanisms of schizophrenia |
| MAWD binding protein like | 4.4 | 1.4 | Inhibits proliferation and invasion in gastric cancer |
| Cytokeratin-like | 4.5 | 1.5 | Mediate epithelial innate defense through their antimicrobial properties |
| Spermidine/spermine N1-acetyltransferase | 4.7 | 1 | Involves in polyamine homeostasis |
| Stonustoxin subunit alpha-like | 4.7 | 1.9 | Induces hemolytic activities, displays edema-inducing activities, increases vascular permeability and interferes irreversibly with neuromuscular function. |
| UDP-glucose 4-epimerase | 4.8 | 1.8 | Catalyzes the epimerization of UDP-glucose to UDP-galactose and the epimerization of UDP-N-acetylglucosamine to UDP-N-acetylgalactosamine. |
| Major histocompatibility complex class I UDA | 5.1 | 3.1 | Play an important role in immune response and antigen processing and presentation |
| NLR-C8 | 5.9 | 1.4 | Involves in the gram negative bacteria recognition. Against the intracellular pathogen. |
| MARCKS-like 1a | 6.1 | 2.5 | Most prominent cellular substrate for protein kinase C. It can bind calmodulin, actin, and synapsin. |
| 3-oxoacid CoA transferase 1a | 7.0 | 1.3 | Key enzyme for ketone body catabolism. Also plays and important roles in the energy metabolism of spermatozoa. |
| MHC class II beta chain | 17.9 | 2.4 | Involves in antigen processing and presentation of peptide or polysaccharide antigen via MHC class II |