| Literature DB >> 20433739 |
Bo-Young Lee1, Aimee E Howe, Matthew A Conte, Helena D'Cotta, Elodie Pepey, Jean-Francois Baroiller, Federica di Palma, Karen L Carleton, Thomas D Kocher.
Abstract
BACKGROUND: Large collections of expressed sequence tags (ESTs) are a fundamental resource for analysis of gene expression and annotation of genome sequences. We generated 116,899 ESTs from 17 normalized and two non-normalized cDNA libraries representing 16 tissues from tilapia, a cichlid fish widely used in aquaculture and biological research.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20433739 PMCID: PMC2874815 DOI: 10.1186/1471-2164-11-278
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
cDNA libraries from Oreochromis niloticus.
| Lib name | Tissue | Description | Reads |
|---|---|---|---|
| Br3 | Brain | normalized | 11,520 |
| D0-4 | Whole embryos | normalized | 9,120 |
| D5-15 | Whole larvae | normalized | 9,216 |
| D16-40 | Whole juveniles | normalized | 9,216 |
| Gi1 | Gill | non-normalized | 6,144 |
| Gi2 | Gill | normalized | 4,992 |
| GOHD | Mixed gonad | normalized | 5,123 |
| Ht2 | Heart | normalized | 4,992 |
| Ki3 | Kidney | normalized | 6,144 |
| Li6 | Liver | normalized | 4,992 |
| Oe1 | Olfactory epithelium | normalized | 9,216 |
| Ov1 | Ovary | normalized | 9,216 |
| Re3 | Retina | non-normalized | 3,840 |
| Re4 | Retina | normalized | 15,027 |
| Sk1 | Skin | normalized | 4,992 |
| Sm1 | Skeletal muscle | normalized | 4,992 |
| Sp1 | Spleen | normalized | 4,992 |
| St1 | Stomach | normalized | 4,992 |
| Te2 | Testis | normalized | 8,928 |
Figure 1A single high density filter of retinal cDNA libraries hybridized with a rhodopsin probe. The left side of the filter contains ~9,000 clones from an un-normalized library. The right half of the filter contains ~9,000 clones from the normalized library. Normalization has greatly reduced the redundancy of the library.
Clustering statistics for each library.
| Lib name | HQ ESTs | Contigs | Singletons | Total | Coverage | Discovery |
|---|---|---|---|---|---|---|
| Br3 | 10,051 | 1,245 | 6,935 | 8,180 | 0.1862 | 1.229 |
| D0-4 | 7,891 | 991 | 5,517 | 6,508 | 0.1753 | 1.213 |
| D5-15 | 8,001 | 1,045 | 5,171 | 6,216 | 0.2231 | 1.287 |
| D16-40 | 8,101 | 773 | 6,294 | 7,067 | 0.1276 | 1.146 |
| Gi1 | 5,032 | 576 | 2,875 | 3,451 | 0.3142 | 1.458 |
| Gi2 | 4,164 | 357 | 3,206 | 3,563 | 0.1443 | 1.169 |
| GOHD | 3,936 | 367 | 2,806 | 3,173 | 0.1939 | 1.240 |
| Ht2 | 4,529 | 633 | 2,954 | 3,587 | 0.2080 | 1.263 |
| Ki3 | 5,131 | 526 | 3,871 | 4,397 | 0.1431 | 1.167 |
| Li6 | 4,360 | 459 | 3,306 | 3,765 | 0.1365 | 1.158 |
| Oe1 | 7,983 | 797 | 5,866 | 6,663 | 0.1654 | 1.198 |
| Ov1 | 7,988 | 772 | 6,221 | 6,993 | 0.1246 | 1.142 |
| Re3* | 2,650 | 374 | 906 | 1,280 | 0.5170 | 2.070 |
| Re4* | 11,298 | 2,162 | 5,306 | 7,468 | 0.3390 | 1.309 |
| Sk1 | 4,406 | 714 | 2,130 | 2,844 | 0.3545 | 1.549 |
| Sm1 | 4,434 | 465 | 3,363 | 3,828 | 0.1367 | 1.158 |
| Sp1 | 4,389 | 964 | 2,059 | 3,023 | 0.3112 | 1.452 |
| St1 | 4,498 | 695 | 2,791 | 3,486 | 0.2250 | 1.290 |
| Te2 | 8,057 | 1,134 | 5,228 | 6,362 | 0.2104 | 1.266 |
* Includes forward and reverse reads for some clones
Figure 2The redundancy of each library at different depths of sequencing. The x-axis is the number of sequence reads from each library. The y-axis indicates the number reads required to discover a sequence that does not cluster with the existing sequences for that library. Results after each round of sequence are shown for Br3 (squares) and Ret4 (diamonds). Other libraries are shown with circles. The point in the upper left is the non-normalized Ret3 library.
Figure 3Size distribution of the contigs in 100 bp bins.
Figure 4Number of contigs by the number of libraries in which that sequence is expressed.
Figure 5Distribution of sequence starts and stops on Uniprot entries. a, c - distributions for unigenes. b, d - distributions for unassembled ESTs.
Distribution of ESTs and Unigenes on Uniprot entries. The criteria for completeness of the EST was whether it reached within 10 amino acids of the end of the Uniprot entry. Values indicate the proportions within each size class.
| ESTs | < 250aa | 251-500aa | 501-750aa | 751-1000aa | > 1000aa |
|---|---|---|---|---|---|
| 5' & 3' | 0.31 | 0.00 | 0.00 | 0.00 | 0.00 |
| 5' only | 0.34 | 0.36 | 0.13 | 0.10 | 0.07 |
| 3' only | 0.20 | 0.21 | 0.27 | 0.27 | 0.23 |
| incomplete | 0.15 | 0.43 | 0.60 | 0.63 | 0.70 |
| total | 18,369 | 13,510 | 4,432 | 1,892 | 2,926 |
| sum | 41,129 | ||||
| 5' & 3' | 0.38 | 0.04 | 0.00 | 0.00 | 0.00 |
| 5' only | 0.30 | 0.32 | 0.15 | 0.11 | 0.08 |
| 3' only | 0.18 | 0.24 | 0.29 | 0.28 | 0.24 |
| incomplete | 0.14 | 0.40 | 0.56 | 0.61 | 0.68 |
| total | 5,421 | 6,277 | 2,740 | 1,238 | 2,046 |
| sum | 17,722 | ||||