| Literature DB >> 16887034 |
Marina Govoroun1, Florence Le Gac, Yann Guiguen.
Abstract
BACKGROUND: Within the framework of a genomics project on livestock species (AGENAE), we initiated a high-throughput DNA sequencing program of Expressed Sequence Tags (ESTs) in rainbow trout, Oncorhynchus mykiss.Entities:
Mesh:
Year: 2006 PMID: 16887034 PMCID: PMC1564016 DOI: 10.1186/1471-2164-7-196
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of the numbers of sequenced and released ESTs in the different AGENAE trout cDNA libraries.
| Libraries | pooled-tissue | Testis | Ovary | Total |
| Number of sequenced Clones | 65 664 | 13 824 | 5 376 | 84 864 |
| Number of Sequences | 88 704 | 13 824 | 5 376 | 107 904 |
| Including 5' | 65 664 | 13 824 | 5 376 | 84 864 |
| Including 3' | 23 040 | 0 | 0 | 23 040 |
| Published sequences | 79 037 | 12 499 | 4 936 | 96 472 |
Figure 1Percentage of novel EST clusters found as a function of the number of clones sequenced in the rainbow trout pooled-tissue cDNA library. N = Normalization. S1 = First subtraction. S2 = Second subtraction.
Top 20 most redundant EST clusters.
| Cluster Name | Cluster depth | Best swissprot hit | Hit description | Over-expressed in Agenae libraries |
| BX072800.1.p.om.3 | 1463 | Zona pellucida sperm-binding protein 3 precursor | Yes (Ovary) | |
| CR361581.1.p.om.3 | 1460 | NULL | Yes (Ovary) | |
| CA352033.1.p.om.3 | 1434 | Actin, alpha sarcomeric/cardiac (Actin alpha 2) | No | |
| BX073087.1.p.om.3 | 431 | Prolactin precursor (PRL) | No | |
| CA342041.1.p.om.3 | 402 | Hemoglobin beta-4 subunit | No | |
| CA369471.1.p.om.3 | 389 | ES1 protein homolog, mitochondrial precursor (Protein KNP-I) | Yes (Pooled-Tissue) | |
| CA368365.1.p.om.3 | 342 | Trypsin I precursor | Yes (Pooled-Tissue) | |
| BX074651.1.p.om.3 | 314 | Somatotropin 2 precursor (Growth hormone 2) | No | |
| CA341574.1.p.om.3 | 264 | 60S ribosomal protein L12 | No | |
| CA343108.1.p.om.3 | 255 | Myosin regulatory light chain 2, skeletal muscle isoform (G2) | No | |
| BX073970.1.p.om.3 | 224 | Somatotropin precursor (Growth hormone) | No | |
| CA341678.1.p.om.3 | 197 | Sarcoplasmic/endoplasmic reticulum calcium ATPase 1 | No | |
| CA341906.1.p.om.3 | 178 | 60S ribosomal protein L11 | No | |
| CA341950.1.p.om.3 | 173 | 60S ribosomal protein L13 | No | |
| CA341578.1.p.om.3 | 170 | Nitrogen regulation protein NR(II) | No | |
| CA345253.1.p.om.3 | 161 | Glutathione S-transferase P | No | |
| CA341772.1.p.om.3 | 159 | 60S ribosomal protein L18a | No | |
| CA342754.1.p.om.3 | 157 | 60S ribosomal protein L13a (Transplantation antigen P198 homolog) | No | |
| CA342547.1.p.om.3 | 155 | Collagen alpha 1(I) chain precursor | No | |
| CA378499.1.p.om.3 | 153 | NULL | No |
The 20 most redundant EST clusters in all rainbow trout cDNA libraries are listed with their Sigenae cluster name and the number of ESTs within each cluster (cluster depth). When a homology search using blastx was carried out against the Swissprot database returned a significant homology (blast score > 100), the accession number of this best putative homolog is given along with its associated description. When a cluster contains an over-representation of ESTs found only in one Agenae library, the name of this library is given in the last column.
Top 20 most redundant Agenae specific EST clusters.
| Cluster Name | Cluster depth | Best swissprot hit | Hit_description | Over-expressed in a specific Agenae library |
| CR361581.1.p.om.3 | 1460 | NULL | Yes (Ovary) | |
| CR361588.1.p.om.3 | 139 | NULL | Yes (Ovary) | |
| BX304386.1.p.om.3 | 63 | Zinc finger protein 318 (Testicular zinc finger protein) | Yes (Ovary) | |
| BX871489.1.p.om.3 | 59 | NULL | Yes (Ovary) | |
| BX310674.1.p.om.3 | 49 | NULL | No | |
| CR361690.1.p.om.3 | 48 | NADH-ubiquinone oxidoreductase 51 kDa subunit | Yes (Ovary) | |
| BX300811.1.p.om.3 | 48 | VEG136 protein (Fragment) | Yes (Pooled-Tissue) | |
| BX304242.1.p.om.3 | 40 | Very low-density lipoprotein receptor precursor | No | |
| BX310459.1.p.om.3 | 37 | Complement C1q-like protein 3 precursor (Gliacolin) | No | |
| CR361738.1.p.om.3 | 36 | NULL | Yes (Ovary) | |
| BX300472.1.p.om.3 | 36 | NULL | No | |
| CR361612.1.p.om.3 | 33 | Chondroitin beta-1, 4-N-acetylgalactosaminyltransferase 2 | Yes (Ovary) | |
| CR361716.1.p.om.3 | 33 | Sodium- and chloride-dependent creatine transporter 1 | Yes (Ovary) | |
| BX860653.1.p.om.3 | 33 | NULL | Yes (Ovary) | |
| BX321717.1.p.om.3 | 32 | Carnitine O-palmitoyltransferase I, mitochondrial liver isoform | No | |
| BX304069.1.p.om.3 | 31 | Baculoviral IAP repeat-containing protein 6 | No | |
| BX321042.1.p.om.3 | 30 | Metastasis-associated protein MTA2 | Yes (Pooled | |
| BX299242.1.p.om.3 | 28 | Regulator of G-protein signaling 2 (RGS2) | Yes (Pooled | |
| BX317551.1.p.om.3 | 28 | Endonuclease III (DNA-(apurinic or apyrimidinic site) lyase) | Yes (Ovary) | |
| BX298697.1.p.om.3 | 28 | NULL | Yes (Pooled-Tissue) |
The 20 most redundant EST clusters composed of only ESTs originated from Agenae rainbow trout cDNA libraries are listed with their Sigenae cluster name and the number of ESTs within each cluster (cluster depth). When homology search using blastx against the Swissprot database returned a significant homology (blast score > 100) the accession number of this best putative homolog is given along with its associated description. When a cluster contains an over-representation of ESTs found in only one Agenae library, the name of this library is given in the last column.
Figure 2Histogram of cluster sizes. Repartition in the different cluster size classes of the complete collection of trout clusters (black squares) and of Agenae specific clusters (open squares), both expressed in proportion of all clusters, based on Sigenae rainbow trout EST clustering version 3. The table represented the number of clusters in each cluster size class.
Figure 3Diagram showing the number and the relative proportion (%) of shared and unique clusters. Shared clusters contain ESTs from different projects (AGENAE, USDA, Others) and unique clusters only contain ESTs originating from one project. Clusters are made of one (called singletons and represented under brackets) or more ESTs assembled together following clustering analysis. USDA libraries were 01 to 10RT – NCCCWA libraries and 115RT – NCCCWA library.
Tissue representation in the pooled-tissue cDNA library.
| Tissues | Protein | Species | References | TBLASTN | BLASTX | Sequence ID |
| Testis | Testis Creatine kinase | [23] | 0 | / | tcaa0001c.e.17 ( | |
| Ovary | Factor in germ line alpha (Figa) | [24] | 89, 7e-19 | 89, 9e-17 | tcad0001a.a.19 ( | |
| Adipose tissue | Perilipin (PLIN) | [25] | 197, 7e-51 | 197, 2e-49 | tcbk0010c.i.14 ( | |
| Kidney | Collectrin | [26] | 141, 1e-34 | 141, 1e-32 | tcay0014b.i.05 ( | |
| Fetal gonads | DMRT1 | [27] | 0 | / | tcad0009a.n.11 ( | |
| Liver | Liver-basic fatty acid binding protein | [28] | 222, 1e-59 | 222, 3e-57 | tcay0037b.e.14 ( | |
| Gills | FHL5 | [29] | 399, 1e-112 | 404, 1e-117 | tcad0002a.a.17 ( | |
| Pituitary | Growth Hormone factor 1 (Pit-1) | [30] | 156, 4e-40 | 156, 2e-38 | tcay0007b.j.13 ( | |
| Blood | ERMAP | [31] | 163, 1e-40 | 163, 9e-43 | tcbk0052c.l.01 ( | |
| Brain | Brain cell membrane protein 1 (BCMP1) | [32] | 269, 3e-73 | 247, 2e-64 | tcbk0029c.j.07 ( | |
| Intestine | Intestinal mucin-like peptide | [33] | 217, 2e-56 | 194, 1e-48 | tcac0004c.g.02 ( | |
| Muscle | Muscle LIM protein | [34] | 315, 4e-97 | 270, 2e-71 | tcba0018c.g.07 ( |
Representative "tissue specific" protein homologs in the pooled-tissue cDNA library. Clone identity (clone ID) is given in the sequence ID column with the GenBank accession numbers in brackets. When more than one sequence was carried out on one clone (5' and 3' EST sequences), the second accession number is noted -X. For rainbow trout EST matching a rainbow trout gene only a blastN strategy was used.