| Literature DB >> 18042283 |
Flavia Frabetti1, Raffaella Casadei, Luca Lenzi, Silvia Canaider, Lorenza Vitale, Federica Facchin, Paolo Carinci, Maria Zannotti, Pierluigi Strippoli.
Abstract
BACKGROUND: All standard methods for cDNA cloning are affected by a potential inability to effectively clone the 5' region of mRNA. The aim of this work was to estimate mRNA open reading frame (ORF) 5' region sequence completeness in the model organism Danio rerio (zebrafish).Entities:
Mesh:
Substances:
Year: 2007 PMID: 18042283 PMCID: PMC2222617 DOI: 10.1186/1745-6150-2-34
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Figure 1The mRNA 5' ORF extension pipeline. Schematic flow of the approach to automated search for mRNA with a putatively incomplete coding sequence: building of a RefSeq mRNA database and of a Danio rerio EST database, highthroughput BLAST comparison between the two sequence sets, final elaboration integrating BLAST results and mRNA/EST sequences.
Exemplificative zebrafish genes with extended cDNA 5' region and deduced protein.
| Error typea | GenBank EST# Zebrafishb | Genomic clone # | Product length new/old (no. of new amino acids) | Kozak sequence old (top)/new (bottom). Consensec: GCC | GenBank EST# Non-zebrafish | |
| ND | CN505709 | - | 196/163 (33, +20%) | ATG | - | |
| CK681469 | CTG | |||||
| CN018643d | ||||||
| 1 | CN505408 | BX465229 | 264/206 (58, +28%) | GAG | pp DT261717e | |
| CK363344 | BX005137 | CGG | pp DT134309 | |||
| BI710727d | pp DT116366 | |||||
| pp DT263287 | ||||||
| 1, 2 | CN176149 | BX323876 | 139/106 (33, +31%) | AGC | - | |
| CN180261 | TCA | |||||
| CO929886d |
a (1) extended exon 1; (2) new exon; ND: not determined owing to unavailability of genomic sequence.
b GenBank sequences matching extended coding sequence from the new start codon in EST (Expressed Sequence Tag) division.
c The two most conserved positions (Kozak, 1999; Kozak, 2002) are underlined; start codon, in bold font.
d Only three representative sequences are listed, out of a total of 24 for selt1a, 4 for unc119.2, and 26 for nppa, showing consistent coding sequence extension (see Table online).
e pp = Pimephales promelas fish.
Figure 2ClustalW alignment of atrial natriuretic peptide (ANP) sequences from different species. Sequence for zebrafish (BRARE) ANP is derived from the extended nppa cDNA sequence we present here; methionine in position 34 is the previously described first amino acid. Other ANP amino acid sequences are from: white sturgeon (Acipenser transmontanus, ACITR), killifish or mummichog (Fundulus heteroclitus, FUNHE), Japanese eel (Anguilla japonica, ANGJA), sheep (Ovis aries, SHEEP).