| Literature DB >> 15059247 |
Svetlana A Shabalina1, Nikolay A Spiridonov.
Abstract
For decades, researchers have focused most of their attention on protein-coding genes and proteins. With the completion of the human and mouse genomes and the accumulation of data on the mammalian transcriptome, the focus now shifts to non-coding DNA sequences, RNA-coding genes and their transcripts. Many non-coding transcribed sequences are proving to have important regulatory roles, but the functions of the majority remain mysterious.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15059247 PMCID: PMC395773 DOI: 10.1186/gb-2004-5-4-105
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Ratios of the protein-coding, non-coding, and untranscribed sequences in bacterial, yeast, nematode and mammalian genomes. Estimations of the transcribed and protein-coding parts of genomes are based on the sequence length of annotated genes [3,12,13,73]. Estimation of the transcribed portion of the human genome is based on the sequence length occupied by the annotated genes on chromosomes 6, 7, 14, 20, and 22 [5].
General features of bacterial, yeast, nematode and mammalian genomes
| Species | Genome size (Mbp) | Repetitive sequences (%) | Transcriptional units | Protein-coding genes | Introns | References |
| 4.6 | 0.7 | 5,471 | 4,288 | [ | ||
| 12 | 3.2 | 6,682 | 6,183 | 233 | [ | |
| 100.3 | 16.5 | 19,646 | 18,808 | 99,237 | [ | |
| 104 | 22.4 | 20,469 | 19,507 | 94,832 | [ | |
| 2,500 | 40 | 33,409 | 22,011 | 191,500 | [ | |
| 2,900 | 44 | 25,003 | 22,808 | 177,000 | [ |