Abstract
Despite recent technological advances, the study of the human transcriptome is still in its early stages. Here we provide an overview of the complex human transcriptomic landscape, present the bioinformatics challenges posed by the vast quantities of transcriptomic data, and discuss some of the studies that have tried to determine how much of the human genome is transcribed. Recent evidence has suggested that more than 90% of the human genome is transcribed into RNA. However, this view has been strongly contested by groups of scientists who argued that many of the observed transcripts are simply the result of transcriptional noise. In this review, we conclude that the full extent of transcription remains an open question that will not be fully addressed until we decipher the complete range and biological diversity of the transcribed genomic sequences.
Year: 2012 PMID: 22916334 PMCID: PMC3422666 DOI: 10.3390/genes3030344
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1. Composition of the human transcriptome. (a) Venn diagram of the number of loci containing mRNA transcripts (green), long ncRNAs (blue), and small ncRNAs (red); (b) Base pair coverage of the transcriptome by the three categories of transcripts.
Number of known annotated transcripts and human gene loci collected from Ensembl, NCBI’s RefSeq, UCSC Genome Browser, and Cabili et al.’s lincRNA catalog. A single locus typically contains multiple transcripts, particularly for mRNAs.
| Annotation | mRNA | Long ncRNA | Small ncRNA |
|---|---|---|---|
| Transcripts | 111,451 | 89,981 | 11,366 |
| Loci | 20,944 | 40,765 | 11,195 |
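The caption's remark that a single locus typically contains multiple transcripts, particularly for mRNAs, can be checked with simple arithmetic on the table. The sketch below is only an illustration: the counts are copied from the table, while the dictionary layout and the function name `transcripts_per_locus` are assumptions made for this example.

```python
# Counts copied from the table above (transcripts and loci per category).
counts = {
    "mRNA": {"transcripts": 111_451, "loci": 20_944},
    "Long ncRNA": {"transcripts": 89_981, "loci": 40_765},
    "Small ncRNA": {"transcripts": 11_366, "loci": 11_195},
}


def transcripts_per_locus(table):
    """Return the mean number of annotated transcripts per locus for each category."""
    return {name: row["transcripts"] / row["loci"] for name, row in table.items()}


ratios = transcripts_per_locus(counts)
for category, ratio in ratios.items():
    print(f"{category}: {ratio:.2f} transcripts per locus")
# mRNA: 5.32 transcripts per locus
# Long ncRNA: 2.21 transcripts per locus
# Small ncRNA: 1.02 transcripts per locus
```

These are averages over all loci in a category, so they say nothing about how unevenly transcripts are distributed among individual loci.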
Figure 2. The size of the transcriptome, computed as the fraction of the total number of base pairs in the human genome covered by the assembled transcripts, for 16 normal human tissues included in the Illumina Body Map [98]. Each RNA-seq data set was mapped to the genome with TopHat [78] and assembled with Cufflinks [41]. Note that, except for adrenal tissue, in which transcripts cover 5.3% of the human genome, every reconstructed transcriptome is smaller than the currently annotated transcriptome.
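A coverage fraction of this kind can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes the assembled transcripts are available as exon intervals in half-open coordinates, and it counts only exonic bases (whether the published figure includes introns is not stated here). The function names, the toy intervals, and the toy genome size are all hypothetical.

```python
from collections import defaultdict


def merge_intervals(intervals):
    """Merge overlapping or adjacent half-open intervals [start, end)."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Extends or lies inside the previous interval: grow it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return merged


def covered_fraction(exons, genome_size):
    """Fraction of genome_size covered by at least one exon.

    exons: iterable of (chrom, start, end) with half-open coordinates.
    Each base is counted once, however many transcripts overlap it.
    """
    by_chrom = defaultdict(list)
    for chrom, start, end in exons:
        by_chrom[chrom].append((start, end))
    covered = sum(
        end - start
        for intervals in by_chrom.values()
        for start, end in merge_intervals(intervals)
    )
    return covered / genome_size


# Toy data (hypothetical): two overlapping exons on chr1, one exon on chr2.
toy_exons = [("chr1", 0, 100), ("chr1", 50, 150), ("chr2", 10, 60)]
toy_fraction = covered_fraction(toy_exons, genome_size=1000)
print(f"{toy_fraction:.1%}")  # 20.0%
```

Merging intervals before summing matters because overlapping isoforms at the same locus would otherwise be counted several times and inflate the estimate.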