| Literature DB >> 23509765 |
Pabitra Mohan Behera1, Deepak Kumar Behera, Aparajeya Panda, Anshuman Dixit, Payodhar Padhi.
Abstract
The expressed sequence tags (ESTs) are major entities for gene discovery, molecular transcripts, and single nucleotide polymorphism (SNPs) analysis as well as functional annotation of putative gene products. In our quest for identification of novel diabetic genes as virtual targets for type II diabetes, we searched various publicly available databases and found 7 reported genes. The in silico EST analysis of these reported genes produced 6 consensus contigs which illustrated some good matches to a number of chromosomes of the human genome. Again the conceptual translation of these contigs produced 3 protein sequences. The functional and structural annotations of these proteins revealed some important features which may lead to the discovery of novel therapeutic targets for the treatment of diabetes.Entities:
Mesh:
Year: 2013 PMID: 23509765 PMCID: PMC3582052 DOI: 10.1155/2013/704818
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Information content for Homo sapiens.
| Sl. no. | Database name | Release | Date | Information content |
|---|---|---|---|---|
| 1 | dbEST | 040112 | April 01, 2012 | ESTs 8315296 |
|
| ||||
| 2 | TIGR Gene Indices | 17.0 | July 28, 2006 | ESTs 7233257 |
| HTs 234976 | ||||
|
| ||||
| 3 | UniGene | — | December 23, 2011 | mRNAs 209412 |
| Models 212 | ||||
| HTC 20115 | ||||
| 3′ ESTs 1693253 | ||||
| 5′ ESTs 4027153 | ||||
| Unknown ESTs 927242 | ||||
| Total sequences 6877387 | ||||
UniGene information on human diabetes (mRNA and ESTs).
| Sl. no. | Name of the gene | Source | mRNA | ESTs |
|---|---|---|---|---|
| 1 | Glucokinase (GCK) |
| 12 | 46 |
| 2 | Arginine vasopressin receptor 2 (AVPR2) |
| 14 | 10 |
| 3 | Aquaporin 2 (AQP2) |
| 07 | 61 |
| 4 | Islet cell autoantigen 1 (ICA1) |
| 10 | 217 |
| 5 | SRY (sex determining region Y) box 13 (SOX13) |
| 09 | 180 |
| 6 | Ras-related associated with diabetes (RRAD) |
| 06 | 160 |
| 7 | Ankyrin repeat domain 23 (ANKRD23) |
| 06 | 141 |
ESTs reported from different genes.
| Sl. no. | GB accession no. | Description | Tissue type | EST type | Code* |
|---|---|---|---|---|---|
| 1 | Glucokinase (GCK) | ||||
| DA640823.1 | Clone LIVER2005873 | Liver | 5′ read | P | |
| DA637293.1 | Clone LIVER2000237 | Liver | 5′ read | P | |
| DA638310.1 | Clone LIVER2002033 | Liver | 5′ read | P | |
| CK823298.1 | Clone IMAGE:6136115 | Pancreas | 5′ read | P | |
| BM966889.1 | Clone IMAGE:6136115 | Pancreas | 5′ read | P | |
| BM966913.1 | Clone IMAGE:6135860 | Pancreas | 5′ read | P | |
| BQ101045.1 | Clone IMAGE:6135541 | Pancreas | 5′ read | P | |
|
| |||||
| 2 | Arginine vasopressin receptor 2 (AVPR2) | ||||
| BG830436.1 | Clone IMAGE:4908956 | Pancreas | 5′ read | — | |
| BI160709.1 | Clone IMAGE:5018991 | Pancreas | 5′ read | P | |
| BI161076.1 | Clone IMAGE:5019146 | Pancreas | 5′ read | — | |
| BI161438.1 | Clone IMAGE:5019572 | Pancreas | 5′ read | — | |
|
| |||||
| 3 | Islet cell autoantigen 1 (ICA1) | ||||
| CB134411.1 | Clone L14ChoiCK0-18-B12 | Liver | 5′ read | P | |
| BX497434.1 | Clone DKFZp779M2033 | Liver | 5′ read | A | |
| BX646846.1 | Clone DKFZp779C0346 | Liver | 5′ read | P | |
| AW583029.1 | Clone IMAGE:5637830 | Pancreas | 5′ read | P | |
| CK904151.1 | Clone IMAGE:5672417 | Pancreas | 5′ read | A | |
| BE736046.1 | Clone IMAGE:3639903 | Pancreas | 5′ read | P | |
| BI715368.1 | — | Pancreas | 5′ read | P | |
| BI962895.1 | Clone IMAGE:5671189 | Pancreas | 5′ read | — | |
| BI966135.1 | Clone IMAGE:5672382 | Pancreas | 5′ read | A | |
| BM021952.1 | Clone IMAGE:5672417 | Pancreas | 5′ read | A | |
| BU579558.1 | Clone IMAGE:6121832 | Pancreas | 5′ read | P | |
| BU951015.1 | Clone IMAGE:6132285 | Pancreas | 5′ read | P | |
|
| |||||
| 4 | SRY (sex determining region Y)-box 13 (SOX13) | ||||
| BE563236.1 | Clone IMAGE:3689361 | Pancreas | 5′ read | — | |
| BE904395.1 | Clone IMAGE:3898347 | Pancreas | 5′ read | — | |
| BE905187.1 | Clone IMAGE:3901107 | Pancreas | 5′ read | P | |
|
| |||||
| 5 | Ras-related associated with diabetes (RRAD) | ||||
| BG250011.1 | Clone IMAGE:4470428 | Liver | 5′ read | — | |
| BG250978.1 | Clone IMAGE:4472119 | Liver | 5′ read | — | |
| BG252988.1 | Clone IMAGE:4474056 | Liver | 5′ read | P | |
| BM967357.1 | Clone IMAGE:6136533 | Pancreas | 5′ read | P | |
|
| |||||
| 6 | Ankyrin repeat domain 23 (ANKRD23) | ||||
| CB159821.1 | Clone L18POOL1n1-19-D04 | Liver | 5′ read | — | |
| BM127096.1 | Clone IMAGE:5675155 | Pancreas | 5′ read | — | |
| BQ227733.1 | Clone IMAGE:6018368 | Pancreas | 5′ read | — | |
| BU073912.1 | — | Pancreas | 5′ read | — | |
P: presence of similarity to proteins after translation and A: contains a polyadenylation signal.
The ESTs and their corresponding contigs obtained from CAP3 Server.
| Sl. no. | Gene name | ESTs | No. of contigs | ||
|---|---|---|---|---|---|
| Liver | Pancreas | Liver | Pancreas | ||
| 1 | GCK | DA640823.1 | BM966889.1 | 1 | 1 |
| DA637293.1 | |||||
| DA638310.1 | |||||
|
| |||||
| 2 | AVPR2 | — | BI160709.1 | — | 1 |
| BI161076.1 | |||||
| BI161438.1 | |||||
|
| |||||
| 3 | ICA1 | BX497434.1 | BI715368.1 | 1 | 1 |
| BX646846.1 | BI962895.1 | ||||
|
| |||||
| 4 | SOX13 | — | BE563236.1 | — | 1 |
| BE904395.1 | |||||
| BE905187.1 | |||||
|
| |||||
| 5 | RRAD | BG250011.1 | — | 0 | — |
| BG250978.1 | |||||
| BG252988.1 | |||||
|
| |||||
| 6 | ANKRD23 | — | BQ227733.1 | — | 0 |
| BU073912.1 | |||||
BLAT output showing the alignment of contigs versus human genome sorted by score.
| Query | Score | Start | End | Qsize | Identity | Chromosome | Strand |
|---|---|---|---|---|---|---|---|
| Glucokinase (GCK) | |||||||
|
| |||||||
| Contig1 | 567 | 1 | 570 | 570 | 100.00% | 7 | − |
| Contig1 | 24 | 206 | 230 | 570 | 100.00% | 1 | − |
| Contig1 | 21 | 429 | 449 | 570 | 100.00% | 4 | − |
| Contig1 | 20 | 386 | 405 | 570 | 100.00% | 5 | + |
| Contig1 | 20 | 105 | 124 | 570 | 100.00% | 3 | + |
| Contig2 | 27 | 713 | 740 | 938 | 100.00% | 3 | − |
| Contig2 | 21 | 778 | 798 | 938 | 100.00% | 1 | − |
| Contig2 | 21 | 860 | 880 | 938 | 100.00% | X | + |
| Contig2 | 20 | 519 | 538 | 938 | 100.00% | 4 | + |
|
| |||||||
| Arginine vasopressin receptor 2 (AVPR2) | |||||||
|
| |||||||
| Contig3 | 26 | 434 | 460 | 911 | 100.00% | 2 | − |
| Contig3 | 25 | 516 | 541 | 911 | 100.00% | 2 | − |
| Contig3 | 21 | 336 | 356 | 911 | 100.00% | 1 | − |
|
| |||||||
| Islet cell autoantigen 1 (ICA1) | |||||||
|
| |||||||
| Contig4 | 586 | 4 | 593 | 593 | 100.00% | 7 | − |
| Contig4 | 21 | 570 | 590 | 593 | 100.00% | 2 | + |
| Contig4 | 20 | 104 | 123 | 593 | 100.00% | 1 | − |
| Contig4 | 20 | 105 | 124 | 593 | 100.00% | 5 | + |
| Contig5 | 963 | 7 | 992 | 1001 | 99.20% | 7 | − |
| Contig5 | 127 | 841 | 1001 | 1001 | 91.00% | 16 | + |
| Contig5 | 40 | 801 | 850 | 1001 | 90.00% | 2 | − |
| Contig5 | 20 | 545 | 564 | 1001 | 100.00% | 1 | − |
|
| |||||||
| Sex determining region Y-box 13 (SOX13) | |||||||
|
| |||||||
| Contig6 | 1283 | 22 | 1340 | 1551 | 99.30% | 1 | + |
| Contig6 | 35 | 869 | 905 | 1551 | 97.30% | 7 | − |
| Contig6 | 32 | 869 | 905 | 1551 | 97.10% | 6 | − |
Figure 1Graphical representation of protein sequences obtained from ESTScan2 translations and edited in BioEdit.
The InterProScan annotations for three hypothetical proteins.
| Sl. no. | InterProScan | Proteins | ||
|---|---|---|---|---|
| applications | GCK liver | GCK pancreas | ICA1 liver | |
| 1 | GENE3D | G3DSA: 3.30.420.40 | G3DSA: 3.40.367.20 | G3DSA: 3.20.1270.60 |
| 2 | PANTHER | PTHR19443 | PTHR19443 | PTHR10164 |
| 3 | PFAM | PF00349 | PF03727 | PF06456 |
| 4 | PRINTS | — | PR00475 | — |
| 5 | PROFILE | — | — | PS50870 |
| 6 | SMART | — | — | SM01015 |
| 7 | SUPER FAMILY | SSFS3067 | SSFS3067 | SSFS3067 |
Figure 23D representation of homology models of three hypothetical proteins. (a) the homology model of hypothetical protein 1, (b) the homology model of hypothetical protein 2, and (c) the homology model of hypothetical protein 3.