| Literature DB >> 17925018 |
Claytus Davis1, Zeev Barvish, Inna Gitelman.
Abstract
BACKGROUND: The transcribed sequences of a cell, the transcriptome, represent the trans-acting fraction of the genetic information, yet eukaryotic cDNA libraries are typically made from only the poly-adenylated fraction. The non-coding or translated but non-polyadenylated RNAs are therefore not represented. The goal of this study was to develop a method that would more completely represent the transcriptome in a useful format, avoiding over-representation of some of the abundant, but low-complexity non-translated transcripts.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17925018 PMCID: PMC2134933 DOI: 10.1186/1471-2164-8-363
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Random directional cDNA synthesis. A. The first strand cDNA primer (DRP1) contains: a 5' phosphorylated buffer sequence devoid of A bases (1), an AscI site (2), a 5 nt random sequence (3), and a 3' T residue (4). By design, priming should begin only at an A in the RNA template. B. The non-phosphorylated lone-linker LL1 consists of the the two complimentary oligonucleotides LL1F and LL1R. It has one blunt end and one non-adhesive staggered end. LL1 can therefore ligate only to one strand of the cDNAs and in only one orientation. The remaining nick in the second strand is removed by preincubating the cDNAs before the first PCR reaction at 72°C for one minute to strip off the non-ligated strand of the linker and regenerate the sequence by extension from the 3' end of the cDNA (lower grey).
Figure 2Characterization of equalized cDNA libraries. 257 cDNA sequences from the Danio rerio and Drosophila melanogaster self-subtracted cDNA libraries were used as GenBank and Ensembl database query sequences. Diagnostic properties of near identity (>97%) matches were collected and scored. The labeled columns represent the following library characteristics: mRNA – the fraction of clones matching known or hypothetical mRNA sequences. mRNA (novel) – the fraction of clones representing the first reported matches to hypothetical mRNA sequence or previously unknown splice variants of known mRNA transcription units. hnRNA – the fraction of clones matching sequence that is likely removed during mRNA processing, eg. intron. ncRNA – the fraction of clones matching known or hypothetical non-coding RNA transcription units. interspersed repetitive – the fraction of clones matching more than 50 dispersed loci in the genome with probabilities of e < 10-9. gDNA – the fraction of clones matching genomic DNA outside of any known or predicted transcription unit. no match – the fraction of clones that did not match any known cDNA and could not be assigned an origin in the genome. sense – the fraction of matches in the sense orientation of known or hypothetical transcription units. antisense – the fraction of matches in the antisense orientation of known or hypothetical transcription units.