| Literature DB >> 19402907 |
Matthew C Hale1, Cory R McCormick, James R Jackson, J Andrew Dewoody.
Abstract
BACKGROUND: Next-generation sequencing technologies have been applied most often to model organisms or species closely related to a model. However, these methods have the potential to be valuable in many wild organisms, including those of conservation concern. We used Roche 454 pyrosequencing to characterize gene expression in polyploid lake sturgeon (Acipenser fulvescens) gonads.Entities:
Mesh:
Year: 2009 PMID: 19402907 PMCID: PMC2688523 DOI: 10.1186/1471-2164-10-203
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Number of reads and nucleotides produced by five cDNA libraries (L1–L5). Significant BLAST hit criteria were an e-value ≤ 1 × 10-3 and a bit score > 40.
| L1 | L2 | L3 | L4 | L5 | Total | |
| Tissue source | adolescent male | adolescent female | unk. juvenile | adolescent male | adolescent female | n/a |
| Status | normalized | normalized | native | native | native | n/a |
| N. reads | 3,811 | 13,414 | 11,178 | 10,296 | 8,361 | 47,060 |
| N. reads after trimming | 1,234 | 1,385 | 8,700 | 5,061 | 4,361 | 20,741 |
| N. nucleotides | 286,822 | 270,099 | 1,247,400 | 870,023 | 869,912 | 3,544,256 |
| Mean read length | 232 | 195 | 143 | 162 | 199 | 186 |
| Median read length | 251 | 218 | 133 | 170 | 231 | 201 |
| Mode read length | 243 | 261 | 100 | 75 | 240 | 184 |
| Significant BLAST hits | 241 | 262 | 1,926 | 996 | 1,585 | 5,010 |
Figure 1Length of contigs as averaged (± SD) between the normalized and native libraries. Note the logarithmic axes.
Contig summary statistics from the PCAP analyses of all five libraries (L1–L5).
| L1 | L2 | L3 | L4 | L5 | |
| number of contigs | 170 | 110 | 572 | 578 | 401 |
| mean depth of contig | 2.8 | 3.7 | 4.5 | 4.3 | 4.9 |
| mean length of contig | 197 | 200 | 187 | 232 | 274 |
| number of reads in contigs | 481 | 397 | 2,578 | 3,041 | 2,374 |
| number of singletons | 753 | 988 | 6,122 | 7,834 | 2,390 |
Figure 2Depth of contigs as averaged (± SD) between the normalized and native libraries.
Figure 3Taxonomic distribution of the BLAST results. (a) The percentage of BLAST hits that are vertebrate, invertebrate, bacteria, plant, and fungi. (b) The percentage of vertebrate blast hits that are mammals, birds, herps, and fish. (c) The percentage of BLAST fish hits that are zebrafish, trout, puffer fish, sole, flounder, sturgeon, and other fish species.
Of 115 different Molecular Functions identified in the Gene Ontology analysis, 16 differed significantly in expression between normalized and native libraries.
| Molecular Function | Total counts normalized | % of counts | Total counts native | % of counts | P-value |
| actin binding | 7 | 1.13 | 24 | 0.30 | * |
| ATP binding | 101 | 16.29 | 152 | 1.88 | *** |
| cytochrome c oxidase | 78 | 12.58 | 155 | 1.92 | *** |
| DNA binding | 23 | 3.71 | 76 | 0.94 | *** |
| GTP binding | 6 | 0.97 | 22 | 0.27 | * |
| isomerase activity | 4 | 0.65 | 8 | 0.10 | ** |
| kinase activity | 4 | 0.65 | 5 | 0.06 | ** |
| motor activity | 4 | 0.65 | 0 | 0.00 | *** |
| oxygen binding | 12 | 1.94 | 2 | 0.02 | *** |
| protein binding | 178 | 28.71 | 1348 | 16.71 | ** |
| ribosome biogenesis | 0 | 0.00 | 811 | 10.05 | *** |
| RNA binding | 33 | 5.32 | 1482 | 18.37 | *** |
| structural constituent of ribosome | 0 | 0.00 | 1791 | 22.20 | *** |
| sugar binding | 2 | 0.32 | 6 | 0.07 | ** |
| ubiquinol-cytochrome-c reductase activity | 22 | 3.55 | 6 | 0.07 | *** |
| zinc ion binding | 136 | 21.94 | 725 | 8.99 | *** |
P-value significance; * = 0.05, ** = 0.01, *** = 0.001.
List of genes found within the Biological Process category of the Gene Ontology assignment with a significant difference in expression between normalized and native libraries.
| Biological Process | Total counts normalized libraries | % counts | Total counts native libraries | % counts | P-value |
| apoptosis | 6 | 2.08 | 8 | 0.18 | *** |
| biosynthetic process | 14 | 4.84 | 2 | 0.04 | *** |
| cell adhesion | 8 | 2.77 | 11 | 0.24 | *** |
| DNA repair | 14 | 4.84 | 9 | 0.2 | *** |
| glycoloysis | 7 | 2.42 | 8 | 0.18 | *** |
| immune response | 7 | 2.42 | 6 | 0.13 | *** |
| metabolic process | 26 | 9.0 | 4 | 0.09 | *** |
| muscle cell differentiation | 95 | 32.9 | 4 | 0.09 | *** |
| positive regulation of apoptosis | 3 | 1.04 | 6 | 0.13 | ** |
| protein folding | 15 | 5.19 | 25 | 0.56 | *** |
| protein transport | 4 | 1.38 | 17 | 0.38 | ** |
| proteolysis | 6 | 2.08 | 21 | 0.47 | ** |
| regulation of cell shape | 0 | 0 | 610 | 13.55 | *** |
| translation | 37 | 12.80 | 1830 | 40.66 | ** |
| transport | 40 | 13.84 | 211 | 4.69 | *** |
P-value significance; * = 0.05, ** = 0.01, *** = 0.001.
List of genes found within the Cellular Component category of the Gene Ontology assignment with a significant difference in expression between normalized and native libraries.
| Cellular Component | Total counts normalized libraries | % counts | Total counts native libraries | % counts | P-value |
| actin filament | 93 | 12.77 | 24 | 0.65 | *** |
| cytoplasm | 187 | 25.69 | 225 | 6.10 | *** |
| cytosol | 46 | 6.32 | 42 | 1.14 | *** |
| cytosolic large ribosomal subunit | 0 | 0.00 | 193 | 5.23 | ** |
| cytosolic small subunit | 0 | 0.00 | 810 | 21.97 | *** |
| ER | 16 | 2.20 | 43 | 1.17 | * |
| golgi apparatus | 9 | 1.24 | 20 | 0.54 | * |
| membrane | 110 | 15.11 | 239 | 6.48 | *** |
| mitochrondrion | 111 | 15.25 | 66 | 1.79 | *** |
| perinuclear region of cytoplasm | 15 | 2.06 | 19 | 0.52 | *** |
| plasma membrane | 4 | 0.55 | 207 | 5.61 | ** |
| ribosome | 25 | 3.43 | 645 | 17.49 | *** |
P-value significance; * = 0.05, ** = 0.01, *** = 0.001.
Subset of those contigs constructed from all five cDNA libraries combined (20,741 individual reads).
| Contig ID | Length | Depth | # SNPs | bp per SNP | Ts/Tv ratio | Top BLAST hit | GENBANK ID | Bit score | e-value |
| Contig18.1 | 421 | 28 | 5 | 84.2 | 0.7 | CXXC finger 6 | 53 | 7.00E-06 | |
| Contig19.1 | 379 | 27 | 8 | 47.4 | 0.5 | Ribosomal S20 | 181 | 2.00E-44 | |
| Contig26.1 | 630 | 24 | 2 | 315.0 | 0.0 | ORF2 | 89 | 1.00E-22 | |
| Contig61.1 | 465 | 13 | 1 | 465.0 | 0.0 | Calmodulin Complex | 202 | 7.00E-51 | |
| Contig65.1 | 553 | 13 | 7 | 79.0 | 0.8 | Endonuclease-reverse transcriptase | 80 | 6.00E-14 | |
| Contig176.1 | 311 | 7 | 14 | 22.2 | 0.4 | Polyprotein | 89 | 1.00E-16 | |
| Contig278.1 | 485 | 5 | 3 | 161.7 | 0.0 | Tc1-like transporase | 121 | 2.00E-26 |
The contigs listed below include only those with a significant BLAST hit and at least one SNP and with a Ts/Tv ratio < 1.0 (see Methods). In other words, the SNPs described in this table are a subset of the 877 we identified, but includes all of those associated with a particular gene and with a Ts/Tv ratio < 1.0. See additional file 5 for information from those contigs that produced a Ts/Tv ratio ≥ 1.0.
Figure 4Frequency plot of Ts/Tv ratios for 60 contigs with a significant BLAST hit. Points represent the number of contigs with that specific Ts/Tv ratio. Ts/Tv ratios greater than 1 indicate an excess of transitions, those lower than 1 indicate an excess of transversions. A line of best fit is plotted through the data.
Figure 5Rarefaction curves illustrating gene discovery (number of genes) as a function of effort (number of sequencing reads). (a) Rarefaction analysis comparing empirical gene discovery between normalized and native libraries. (b) Rarefaction analysis comparing simulated gene discovery between normalized and native libraries.