Susan E Douglas, Leah C Knickle, Jennifer Kimball, Michael E Reith.
Abstract
BACKGROUND: An essential first step in the genomic characterisation of a new species, in this case Atlantic halibut (Hippoglossus hippoglossus), is the generation of EST information. This forms the basis for subsequent microarray design, SNP detection and the placement of novel markers on genetic linkage maps.
Year: 2007 PMID: 17547761 PMCID: PMC1924502 DOI: 10.1186/1471-2164-8-144
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Characteristics of Atlantic halibut normalised cDNA libraries
| Library type | Library | Recombination efficiency | # Clones | Avg. insert size (kb) | Redundancy: 2 | Redundancy: 3 | Redundancy: 4 | Redundancy: 5 to 15 |
| Tissue | Gill | 95 | 933 | 1.4 | 42 | 5 | 0 | 1 |
| | Head Kidney | 94 | 932 | 1.4 | 23 | 2 | 0 | 0 |
| | Intestine | 91 | 910 | 1.3 | 45 | 9 | 0 | 0 |
| | Liver | 95 | 906 | 1.3 | 69 | 13 | 4 | 3 |
| | Ovary | 98 | 1609 | 1.5 | 154 | 27 | 6 | 4 |
| | Skin | 97 | 945 | 1.4 | 32 | 6 | 1 | 1 |
| | Spleen | 91 | 898 | 1.4 | 29 | 4 | 0 | 0 |
| | Testis | 95 | 1532 | 1.5 | 84 | 11 | 1 | 0 |
| Larval | Hatching | 96 | 953 | 1.6 | 67 | 9 | 1 | 3 |
| | Mouth-opening | 97 | 950 | 1.4 | 39 | 5 | 3 | 5 |
| | Midway to Metamorphosis | 97 | 915 | 1.6 | 41 | 0 | 1 | 0 |
| | Premetamorphosis | 94 | 929 | 1.4 | 47 | 3 | 0 | 0 |
| | Postmetamorphosis | 96 | 915 | 1.5 | 30 | 5 | 1 | 3 |
The recombination efficiency represents the percentage of clones containing inserts. For each library, the table gives the number of clones sequenced, the average insert size in kb (based on analysis of 96 clones), and the number of clones represented more than once (redundancy), broken down by how many times each was found.
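The per-library figures above can be summarised with a short script. The following is a minimal sketch, assuming that each redundancy column counts distinct sequences recovered that many times (one reading of the caption, not something the source states outright). The names `LIBRARIES` and `redundancy_summary` are illustrative only.

```python
# (library, clones sequenced, [entries seen 2x, 3x, 4x, 5-15x]), copied from the table above.
LIBRARIES = [
    ("Gill", 933, [42, 5, 0, 1]),
    ("Head Kidney", 932, [23, 2, 0, 0]),
    ("Intestine", 910, [45, 9, 0, 0]),
    ("Liver", 906, [69, 13, 4, 3]),
    ("Ovary", 1609, [154, 27, 6, 4]),
    ("Skin", 945, [32, 6, 1, 1]),
    ("Spleen", 898, [29, 4, 0, 0]),
    ("Testis", 1532, [84, 11, 1, 0]),
    ("Hatching", 953, [67, 9, 1, 3]),
    ("Mouth-opening", 950, [39, 5, 3, 5]),
    ("Midway to Metamorphosis", 915, [41, 0, 1, 0]),
    ("Premetamorphosis", 929, [47, 3, 0, 0]),
    ("Postmetamorphosis", 915, [30, 5, 1, 3]),
]


def redundancy_summary(libraries):
    """Map library name -> (redundant entries, redundant entries / clones sequenced)."""
    summary = {}
    for name, clones, counts in libraries:
        redundant = sum(counts)  # entries found two or more times
        summary[name] = (redundant, redundant / clones)
    return summary


summary = redundancy_summary(LIBRARIES)
total_clones = sum(clones for _, clones, _ in LIBRARIES)
most_redundant = max(summary, key=lambda name: summary[name][1])

print(f"Total clones sequenced: {total_clones}")
print(f"Highest redundancy ratio: {most_redundant}")
```

Dividing by the number of clones sequenced keeps the larger ovary and testis libraries comparable with the others.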
Figure 1. Representation of sequencing read lengths of Atlantic halibut Expressed Sequence Tags (ESTs). Read lengths were binned in 100 base pair (bp) increments. Most of the ESTs fall into the 700–800 bp bin.
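Binning read lengths in 100 bp increments, as described in the caption, can be sketched as follows. The function name `bin_read_lengths`, the half-open bin convention (700 to 799 bp in the "700" bin) and the example lengths are all assumptions made for illustration; none of them come from the study.

```python
from collections import Counter


def bin_read_lengths(lengths, bin_width=100):
    """Count reads per bin, keyed by the bin's lower edge (700 covers 700-799 bp)."""
    return Counter((length // bin_width) * bin_width for length in lengths)


# Hypothetical read lengths for illustration only; not data from the study.
example_lengths = [120, 455, 702, 731, 768, 799, 800, 812]

bins = bin_read_lengths(example_lengths)
for lower in sorted(bins):
    print(f"{lower}-{lower + bin_width - 1 if (bin_width := 100) else lower} bp: {bins[lower]}")
```

Whether a read of exactly 800 bp belongs to the 700–800 bin or the next one depends on the convention chosen; the sketch places it in the next bin.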
Most highly represented clones found in each cDNA library
| Library | Gene | Accession # | # hits |
| Gill | lectin-like protein | | 7 |
| Liver | epoxide hydrolase | | 5 |
| | 14 kDa apolipoprotein | | 16 |
| Ovary | ribosomal protein L3 | | 5 |
| | transaldolase 1 | | 5 |
| | fish eggshell protein | | 5 |
| | unknown EST | | 6 |
| Skin | lectin-like protein | | 7 |
| Hatch | NAD(P)H dehydrogenase quinone 1 | | 5 |
| | lipocalin family | | 6 |
| Mouth-opening | parvalbumin beta | | 5 |
| | myosin heavy chain | | 5 |
| | 14 kDa apolipoprotein | | 6 |
| | apolipoprotein AI precursor | | 8 |
| | tropomyosin | | 13 |
| Post-metamorphosis | alpha actin 1 | | 5 |
| | myosin light chain 2 | | 6 |
| | parvalbumin beta | | 10 |
Genes that were represented five or more times in a given cDNA library are listed.
Classification of Atlantic halibut unique sequences
| Classification | Classification method | GO source | Number of sequences |
| Unclassified | No BLAST hit | | 1016 |
| Ribosomal RNA | rRNA hit | | 28 |
| Unassigned protein | BLAST hit >e-10 to unknown protein | | 824 |
| Unknown EST | BLAST hit >e-10 to unknown EST | | 802 |
| Functionally annotated protein | BLAST hit >e-10 to known protein | | 5040 |
| | Informative terms | | 4786 |
| | Domain name-containing protein | | 254 |
| | Gene Ontology (107) | | 3878 |
| | | AutoFACT | 1640 |
| | | Goblet | 1736 |
| | | InterPro | 605 |
| | KEGG (185) | | 578 |
| | COG (22) | | 1191 |
| Total | | | 7710 (5040 functionally annotated) |
Numbers of sequences associated with each AutoFACT classification. The method of classifying functionally annotated proteins is indicated, and the numbers of categories identified by GO, KEGG and COG are given in brackets. GO classifications were obtained using Goblet and InterPro information as well as AutoFACT.
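The totals in the classification table can be cross-checked with a few lines of arithmetic. This is a minimal sketch using only the counts listed above; the variable names are illustrative, and reading the surplus across GO sources as overlap is an assumption, not a statement from the source.

```python
# Top-level classes from the table above; these should sum to the reported total.
TOP_LEVEL = {
    "Unclassified": 1016,
    "Ribosomal RNA": 28,
    "Unassigned protein": 824,
    "Unknown EST": 802,
    "Functionally annotated protein": 5040,
}

# Breakdown of the functionally annotated proteins.
ANNOTATED_BREAKDOWN = {
    "Informative terms": 4786,
    "Domain name-containing protein": 254,
}

# Sequences with a GO classification, by source, and the reported GO total.
GO_SOURCES = {"AutoFACT": 1640, "Goblet": 1736, "InterPro": 605}
GO_TOTAL = 3878

total_sequences = sum(TOP_LEVEL.values())
annotated_total = sum(ANNOTATED_BREAKDOWN.values())
annotated_percent = round(100 * TOP_LEVEL["Functionally annotated protein"] / total_sequences, 1)

# Assumption: a surplus here would mean some sequences were classified by more than one source.
go_surplus = sum(GO_SOURCES.values()) - GO_TOTAL

print(total_sequences, annotated_total, annotated_percent, go_surplus)
```

On these figures, roughly two thirds of the unique sequences carry a functional annotation.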
Figure 2. Classification of Atlantic halibut unique sequences according to Gene Ontology (GO) category: cellular component.
Figure 3. Classification of Atlantic halibut unique sequences according to Gene Ontology (GO) category: molecular function.
Figure 4. Classification of Atlantic halibut unique sequences according to Gene Ontology (GO) category: biological process.
Classification of Atlantic halibut unique sequences according to COG
| Category | # | % |
| Amino acid transport and metabolism | 72 | 6.05 |
| Carbohydrate transport and metabolism | 61 | 5.12 |
| Coenzyme transport and metabolism | 22 | 1.85 |
| Inorganic ion transport and metabolism | 27 | 2.27 |
| Lipid transport and metabolism | 42 | 3.53 |
| Nucleotide transport and metabolism | 43 | 3.61 |
| Secondary metabolites biosynthesis, transport and catabolism | 13 | 1.09 |
| Energy production and conversion | 128 | 10.75 |
| Total metabolism and energy | 408 | 34.26 |
| Cell cycle control, cell division, chromosome partitioning | 10 | 0.84 |
| Chromatin structure and dynamics | 14 | 1.18 |
| Replication, recombination and repair | 41 | 3.44 |
| RNA processing and modification | 6 | 0.50 |
| Transcription | 51 | 4.28 |
| Translation, ribosomal structure and biogenesis | 188 | 15.79 |
| Posttranslational modification, protein turnover, chaperones | 211 | 17.72 |
| Total nucleic acid processes | 521 | 43.74 |
| Cell wall/membrane/envelope biogenesis | 6 | 0.50 |
| Cytoskeleton | 40 | 3.36 |
| Total cell structure | 46 | 3.86 |
| Intracellular trafficking, secretion, and vesicular transport | 37 | 3.11 |
| Signal transduction mechanisms | 33 | 2.77 |
| Defense mechanisms | 4 | 0.34 |
| General function prediction only | 132 | 11.08 |
| Function unknown | 10 | 0.84 |
| Total | 1191 | 100.00 |
Categories were broadly grouped into metabolism and energy, nucleic acid processes and cell structure. The number (#) and percent (%) of ESTs in each category are shown.
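The percent column is each category's count divided by the COG total of 1191. A minimal sketch of that calculation follows, using three rows copied from the table; the helper name `percent_of_total` is illustrative and not from the source.

```python
COG_TOTAL = 1191  # total number of COG-classified sequences from the table above

# A few rows copied from the table above: category -> number of ESTs.
cog_counts = {
    "Energy production and conversion": 128,
    "Translation, ribosomal structure and biogenesis": 188,
    "Posttranslational modification, protein turnover, chaperones": 211,
}


def percent_of_total(count, total):
    """Share of the total expressed as a percentage, rounded to two decimals."""
    return round(100 * count / total, 2)


cog_percent = {name: percent_of_total(n, COG_TOTAL) for name, n in cog_counts.items()}

for name, pct in cog_percent.items():
    print(f"{name}: {pct:.2f}%")
```

The same helper can be applied to any row or subtotal to confirm that the count and percent columns agree.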
Most commonly represented KEGG classifications of Atlantic halibut unique sequences
| Category | # | % |
| Oxidative phosphorylation | 101 | 17.5 |
| Ribosome | 58 | 10.0 |
| Purine & pyrimidine metabolism | 41 | 7.1 |
| Proteasome | 27 | 4.7 |
| Cell communication | 21 | 3.6 |
| SNARE interactions and vesicular transport | 17 | 2.9 |
| Arginine and proline metabolism | 16 | 2.8 |
| Transcription factors | 16 | 2.8 |
| Complement and coagulation cascades | 13 | 2.2 |
| Glycan structures | 12 | 2.1 |
| Arachidonic acid metabolism | 11 | 1.9 |
Categories with more than ten sequences out of a total of 578 are listed. The number (#) and percent (%) of ESTs in each category are shown.