| Literature DB >> 18384671 |
Hirokazu Chiba1, Riu Yamashita, Kengo Kinoshita, Kenta Nakai.
Abstract
BACKGROUND: Interspecies sequence comparison is a powerful tool to extract functional or evolutionary information from the genomes of organisms. A number of studies have compared protein sequences or promoter sequences between mammals, which provided many insights into genomics. However, the correlation between protein conservation and promoter conservation remains controversial.Entities:
Mesh:
Year: 2008 PMID: 18384671 PMCID: PMC2335122 DOI: 10.1186/1471-2164-9-152
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Distribution of alignment scores of human and mouse promoters. The distribution for the orthologous gene pairs is depicted by the solid line, and the distribution for the negative control pairs is shown by the dashed line. The x-axis is shown in a logarithmic scale.
GO categories with high promoter conservation. Terms of biological process are labeled as P, cellular component as C, molecular function as F.
| P:development | 649 | 0 |
| P:regulation of transcription | 602 | 1.67E-15 |
| F:transcription factor activity | 263 | 3.44E-15 |
| P:transcription | 640 | 4.11E-14 |
| P:nervous system development | 154 | 1.99E-10 |
| P:organ development | 213 | 2.30E-10 |
| P:signal transduction | 994 | 5.19E-10 |
| F:DNA binding | 628 | 3.19E-08 |
| P:morphogenesis | 212 | 9.78E-08 |
| P:cell surface receptor linked signal transduction | 363 | 2.23E-06 |
| P:negative regulation of metabolism | 107 | 1.02E-05 |
| F:receptor binding | 221 | 1.90E-05 |
| P:cell-cell signaling | 176 | 2.27E-05 |
| F:cytoskeletal protein binding | 137 | 4.97E-05 |
| P:negative regulation of biological process | 327 | 6.87E-05 |
| F:ion channel activity | 98 | 9.87E-05 |
| C:extracellular matrix | 111 | 0.000119 |
| C:actin cytoskeleton | 85 | 0.000164 |
| P:cell differentiation | 173 | 0.000179 |
| P:cell adhesion | 242 | 0.000182 |
| P:cellular morphogenesis | 111 | 0.000607 |
| F:ion transporter activity | 237 | 0.001493 |
| P:protein amino acid phosphorylation | 213 | 0.001593 |
| P:ion transport | 239 | 0.001825 |
| F:protein kinase activity | 220 | 0.002033 |
| P:intracellular signaling cascade | 431 | 0.006872 |
| P:chromosome organization and biogenesis | 105 | 0.007832 |
| C:plasma membrane | 608 | 0.008026 |
GO categories with low promoter conservation. Terms of biological process are labeled as P, cellular component as C, molecular function as F.
| C:mitochondrion | 398 | 5.31E-09 |
| F:oxidoreductase activity | 309 | 2.07E-08 |
| C:lysosome | 77 | 9.94E-08 |
| C:ribosome | 114 | 7.54E-07 |
| P:lipid metabolism | 260 | 1.04E-06 |
| P:carboxylic acid metabolism | 225 | 4.43E-06 |
| F:structural constituent of ribosome | 130 | 5.76E-06 |
| P:amino acid metabolism | 112 | 0.000102 |
| P:electron transport | 151 | 0.000236 |
| P:catabolism | 260 | 0.000251 |
| P:carbohydrate metabolism | 220 | 0.000278 |
| C:peroxisome | 49 | 0.000623 |
| P:protein biosynthesis | 283 | 0.00063 |
| F:nuclease activity | 60 | 0.000772 |
| P:response to biotic stimulus | 318 | 0.000893 |
| C:nucleolus | 63 | 0.004455 |
| P:immune response | 270 | 0.005437 |
| F:iron ion binding | 111 | 0.0055 |
| F:peptidase activity | 227 | 0.005592 |
| P:proteolysis | 259 | 0.006844 |
Figure 2Distribution of alignment scores of human and mouse promoters for several categories with significant tendencies. For the high conservation tendency, actin cytoskeleton (A) and extracellular matrix (B), for the low conservation tendency, lysosome (C) and ribosome (D). For each of A-D, the solid line shows the distribution of the alignment scores for genes with the specific GO term, and the dashed line shows the distribution for the control gene set (see Materials and Methods for details).
GO categories with high protein conservation. Terms of biological process are labeled as P, cellular component as C, molecular function as F.
| F:GTPase activity | 88 | 0 |
| F:GTP binding | 160 | 0 |
| P:intracellular transport | 350 | 0 |
| P:small GTPase mediated signal transduction | 126 | 1.11E-16 |
| F:RNA binding | 290 | 1.33E-15 |
| C:cytosol | 171 | 3.70E-11 |
| P:RNA processing | 198 | 3.25E-10 |
| C:Golgi apparatus | 216 | 5.63E-10 |
| P:intracellular signaling cascade | 431 | 2.57E-09 |
| C:spliceosome complex | 37 | 6.46E-09 |
| P:transcription | 640 | 1.76E-08 |
| P:regulation of transcription | 602 | 2.02E-08 |
| F:ATP binding | 520 | 2.85E-08 |
| C:actin cytoskeleton | 85 | 5.37E-08 |
| P:vesicle-mediated transport | 190 | 7.02E-08 |
| P:cytoskeleton organization and biogenesis | 155 | 9.26E-08 |
| F:cytoskeletal protein binding | 137 | 1.44E-07 |
| P:secretory pathway | 102 | 7.91E-07 |
| C:nucleoplasm | 107 | 1.22E-06 |
| C:ribosome | 114 | 1.36E-06 |
| P:protein biosynthesis | 283 | 1.56E-06 |
| P:ubiquitin cycle | 235 | 2.86E-06 |
| F:ion channel activity | 98 | 7.08E-05 |
| P:protein amino acid phosphorylation | 213 | 0.000101 |
| F:ATPase activity | 130 | 0.000143 |
| C:endomembrane system | 163 | 0.000154 |
| F:protein kinase activity | 220 | 0.000178 |
| P:nervous system development | 154 | 0.000293 |
| F:transcription factor activity | 263 | 0.000465 |
| C:microtubule cytoskeleton | 115 | 0.000595 |
| C:vesicle | 86 | 0.000732 |
| F:structural molecule activity | 307 | 0.000801 |
| F:structural constituent of ribosome | 130 | 0.000843 |
| F:ubiquitin-protein ligase activity | 144 | 0.001969 |
| C:organelle membrane | 242 | 0.004122 |
| P:cell cycle | 340 | 0.006445 |
GO categories with low protein conservation. Terms of biological process are labeled as P, cellular component as C, molecular function as F.
| P:response to biotic stimulus | 318 | 4.08E-49 |
| P:immune response | 270 | 1.16E-44 |
| C:extracellular space | 179 | 3.49E-37 |
| P:response to stress | 446 | 5.05E-26 |
| F:oxidoreductase activity | 309 | 2.35E-12 |
| F:receptor activity | 391 | 1.11E-11 |
| F:receptor binding | 221 | 2.15E-11 |
| P:lipid metabolism | 260 | 5.95E-11 |
| P:electron transport | 151 | 7.64E-10 |
| C:lysosome | 77 | 6.38E-08 |
| F:peptidase activity | 227 | 6.15E-07 |
| P:cell proliferation | 258 | 1.65E-06 |
| P:cell adhesion | 242 | 2.16E-06 |
| C:mitochondrion | 398 | 3.00E-05 |
| P:proteolysis | 259 | 4.53E-05 |
| C:extracellular matrix | 111 | 5.52E-05 |
| C:peroxisome | 49 | 8.02E-05 |
| F:nuclease activity | 60 | 8.50E-05 |
| C:plasma membrane | 608 | 0.000291 |
| P:apoptosis | 244 | 0.001373 |
| P:carboxylic acid metabolism | 225 | 0.002527 |
| P:response to abiotic stimulus | 148 | 0.004444 |
| P:positive regulation of biological process | 275 | 0.004599 |
| P:response to chemical stimulus | 129 | 0.004626 |
| P:lipid biosynthesis | 101 | 0.005576 |
| P:cell-cell signaling | 176 | 0.006021 |
| P:sensory perception | 111 | 0.008042 |
Figure 3Distribution of percentage identities of human and mouse protein sequences. For the high conservation tendency, actin cytoskeleton (A) and ribosome (D), for the low conservation tendency, extracellular matrix (B) and lysosome (C). For each of A-D, the solid line shows the distribution of the identities for genes with the specific GO term, and the dashed line shows the distribution for the control gene set (see Materials and Methods for details).
Summary of GO categories that show significant conservation tendencies for both protein and promoter sequences.
| F:receptor binding (221) * | P:regulation of transcription (602) * | ||
| P:cell-cell signaling (176) * | F:transcription factor activity (263) * | ||
| C:extracellular matrix (111) * | P:transcription (640) * | ||
| P:cell adhesion (242) | P:nervous system development (154) * | ||
| C:plasma membrane (608) | F:cytoskeletal protein binding (137) * | ||
| F:ion channel activity (98) * | |||
| C:actin cytoskeleton (85) * | |||
| P:protein amino acid phosphorylation (213) * | |||
| F:protein kinase activity (220) * | |||
| P:intracellular signaling cascade (431) * | |||
| P:proteolysis (259) * | P:protein biosynthesis (283) * | ||
| F:peptidase activity (227) * | F:structural constituent of ribosome (130) * | ||
| P:immune response (270) | C:ribosome (114) * | ||
| P:response to biotic stimulus (318) | |||
| F:nuclease activity (60) * | |||
| C:peroxisome (49) * | |||
| P:electron transport (151) * | |||
| P:carboxylic acid metabolism (225) | |||
| P:lipid metabolism (260) * | |||
| C:lysosome (77) * | |||
| F:oxidoreductase activity (309) * | |||
| C:mitochondrion (398) * | |||
In each cell, the GO categories are ordered by promoter conservation. The number of genes for each term is shown in parentheses. GO annotations associated with human genes were used to make this table. '*' represent GO terms that show a significant tendency not only for the human annotation but also for the mouse annotation.
Figure 4Scatter plot of protein conservation and promoter conservation for two subsets of ribosomal proteins. Crosses represent cytoplasmic ribosomal proteins (58 genes). Dots represent mitochondrial ribosomal proteins (41 genes). The conspicuous outlier corresponding to (56, 302) does not seem to be an actual ribosomal protein, and might have been erroneously annotated by an electronic procedure (see Additional file 9).