| Literature DB >> 18984624 |
Ashwini Bhasi1, Philge Philip, Vinu Manikandan, Periannan Senapathy.
Abstract
We have developed ExDom, a unique database for the comparative analysis of the exon-intron structures of 96 680 protein domains from seven eukaryotic organisms (Homo sapiens, Mus musculus, Bos taurus, Rattus norvegicus, Danio rerio, Gallus gallus and Arabidopsis thaliana). ExDom provides integrated access to exon-domain data through a sophisticated web interface which has the following analytical capabilities: (i) intergenomic and intragenomic comparative analysis of exon-intron structure of domains; (ii) color-coded graphical display of the domain architecture of proteins correlated with their corresponding exon-intron structures; (iii) graphical analysis of multiple sequence alignments of amino acid and coding nucleotide sequences of homologous protein domains from seven organisms; (iv) comparative graphical display of exon distributions within the tertiary structures of protein domains; and (v) visualization of exon-intron structures of alternative transcripts of a gene correlated to variations in the domain architecture of corresponding protein isoforms. These novel analytical features are highly suited for detailed investigations on the exon-intron structure of domains and make ExDom a powerful tool for exploring several key questions concerning the function, origin and evolution of genes and proteins. ExDom database is freely accessible at: http://66.170.16.154/ExDom/.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18984624 PMCID: PMC2686582 DOI: 10.1093/nar/gkn746
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Summary of ExDom contents
| Organism | Unique Pfam domains | Total Pfam Domains | Proteins mapped to their gene structures | Proteins with multiple domains | Proteins with isoforms | Domain architecture variations among protein isoforms | Domains with introns | Average number of introns in domains | Intra-genomic common domains | Inter-genomic common domains |
|---|---|---|---|---|---|---|---|---|---|---|
| 3298 | 39 927 | 13 220 | 5860 (44.3%) | 1366 (10.4%) | 1072 (78.5%) | 25 824 (64.7%) | 3.5 | 2057 (62.4%) | 3154 (95.6%) | |
| 3089 | 24 596 | 10 465 | 4259 (40.7%) | 321 (3.1%) | 247 (77%) | 17 343 (70.5%) | 3.6 | 1739 (56.3%) | 3060 (99%) | |
| 1739 | 6145 | 3275 | 1140 (34.8%) | 2 (0.06%) | 1 (50%) | 4651 (75.7%) | 3.4 | 727 (41.8%) | 1734 (99.7%) | |
| 2107 | 10 218 | 4844 | 1904 (39.3%) | 35 (0.7%) | 27 (77.1%) | 7549 (73.9%) | 3.5 | 972 (46.1%) | 2099 (99.6%) | |
| 892 | 3008 | 1379 | 573 (41.6%) | 3 (0.2%) | 3 (100%) | 2291 (76.2%) | 3.3 | 301 (33.7%) | 886 (99.3%) | |
| 824 | 2393 | 1293 | 410 (31.7%) | – | – | 1723 (72%) | 3.4 | 272 (33%) | 812 (98.5%) | |
| 1353 | 10 393 | 5914 | 2258 (38.2%) | 203 (3.5%) | 155 (76.4%) | 6065 (58.4%) | 2.7 | 856 (63.3%) | 1067 (78.9%) |
aCommon domains found in different proteins of the same organism.
bCommon domains found in proteins of at least one other organism.
Figure 1.Schematic overview of the data extraction and integration involved in ExDom database development.
Figure 2.ExDom Plot for RASA1 protein.
Figure 3.Results of the sample query for the protein domain ‘SCP2’ in ExDom. (A) Compact View of the ExDom Plots of 14 protein hits. (B) Detailed View of the ExDom Plots of the four SCP2 proteins—SCP2 (chicken), SCP2 (human), Scp2 (mouse) and Scp2 (rat). (C) Domain-Exon Summary page. (D) Protein Sequence Alignment Viewer with multiple amino acid sequence alignment of the SCP2 domain in 14 protein hits. (E) Nucleotide Sequence Alignment Viewer with spliced coding nucleotide sequence alignment of the SCP2 domain in 14 protein hits. (F) Isoform display of human SCP2 protein. (G) Tertiary Structure Comparative Viewer displaying adh_short domain in HSD17B4 (human) and rat Hsd17b4 (rat).