| Literature DB >> 16845075 |
Sarita Ranjan1, Jayshree Seshadri, Vaibhav Vindal, Sailu Yellaboina, Akash Ranjan.
Abstract
Gene regulatory circuits are often commonly shared between two closely related organisms. Our web tool iCR (identify Conserved target of a Regulon) makes use of this fact and identify conserved targets of a regulatory protein. iCR is a special refined extension of our previous tool PredictRegulon- that predicts genome wide, the potential binding sites and target operons of a regulatory protein in a single user selected genome. Like PredictRegulon, the iCR accepts known binding sites of a regulatory protein as ungapped multiple sequence alignment and provides the potential binding sites. However important differences are that the user can select more than one genome at a time and the output reports the genes that are common in two or more species. In order to achieve this, iCR makes use of Cluster of Orthologous Group (COG) indices for the genes. This tool analyses the upstream region of all user-selected prokaryote genome and gives the output based on conservation target orthologs. iCR also reports the Functional class codes based on COG classification for the encoded proteins of downstream genes which helps user understand the nature of the co-regulated genes at the result page itself. iCR is freely accessible at http://www.cdfd.org.in/icr/.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16845075 PMCID: PMC1538900 DOI: 10.1093/nar/gkl202
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Architecture of iCR. iCR is a CGI application which collects input from user using html forms (A). B represents a Perl script that gathers the input from A launches the Search Engine (C) which looks up genome sequences and their annotations (D), and returns the potential targets as an output which is further classified based on COG/Class or Genome. The classified output is returned as HTML output (F).
Output of iCR showing the conserved targets of LexA regulon in Fermicutes
| COG | Class | Genome | Score | Position | Site | Gene | Synonym |
|---|---|---|---|---|---|---|---|
| COG1974 | K | NC_004193 | 4.6875 | −77 | AGAACGAGTGTTTG | lexA | OB1669 |
| COG1974 | K | NC_003030 | 4.77125 | −84 | AGAACATAAGTTTG | lexA | CAC1832 |
| COG1974 | K | NC_002745 | 4.88271 | −71 | CGAACAAATGTTTG | lexA | SA1174 |
| COG1974 | K | NC_004557 | 4.82946 | −80 | AGAACATAAGTTTG | lexA | CTC01298 |
| COG1974 | K | NC_003366 | 4.83493 | −70 | AGAACATAAGTTTG | lexA | CPE1161 |
| COG1974 | K | NC_002570 | 4.72756 | −77 | AGAACTTATGTTTG | lexA | BH2356 |
| COG1974 | K | NC_000964 | 4.81601 | −118 | CGAACCTATGTTTG | lexA | BSU17850 |
| COG1974 | K | NC_003923 | 4.88303 | −71 | CGAACAAATGTTTG | lexA | MW1226 |
| COG1974 | K | NC_003212 | 4.82162 | −79 | CGAACCTTTGTTTG | — | LIN1340 |
| COG1974 | K | NC_002758 | 4.88182 | −138 | CGAACAAATGTTTG | lexA | SAV1339 |
| COG1974 | K | NC_003210 | 4.81541 | −79 | CGAACCTTTGTTTG | — | LMO1302 |
| COG0468 | L | NC_002570 | 4.64423 | −121 | CGAATAAATGTTCG | recA | BH2383 |
| COG0468 | L | NC_003212 | 4.67474 | −138 | CGAATAAATGTTCG | recA | LIN1435 |
| COG0468 | L | NC_003210 | 4.66915 | −138 | CGAATAAATGTTCG | recA | LMO1398 |
| COG0468 | L | NC_003923 | 4.40442 | −143 | AGCACGTTTGTTCG | recA | MW1168 |
| COG0468 | L | NC_002758 | 4.40302 | −80 | AGCACGTTTGTTCG | recA | SAV1285 |
| COG0468 | L | NC_003030 | 4.90549 | −48 | AGAACAAATGTTCG | recA | CAC1815 |
| COG0468 | L | NC_003366 | 5.01207 | −34 | AGAACTTATGTTCG | recA | CPE1673 |
| COG0468 | L | NC_004461 | 4.42484 | −143 | AGTACGTTTGTTCG | — | SE0963 |
| COG0468 | L | NC_000908 | 4.18494 | −236 | TGAACTGTTGTATG | recA | MG339 |
| COG0468 | L | NC_002745 | 4.40405 | −143 | AGCACGTTTGTTCG | recA | SA1128 |
| COG0468 | L | NC_004557 | 4.9426 | −54 | AGAACAGATGTTCG | recA | CTC01289 |
| COG0556 | L | NC_000964 | 4.7767 | −122 | CGAACTTTAGTTCG | uvrB | BSU35170 |
| COG0556 | L | NC_003923 | 4.8228 | −105 | CGAACAAACGTTTG | uvrB | MW0720 |
| COG0556 | L | NC_002745 | 4.82248 | −105 | CGAACAAACGTTTG | uvrB | SA0713 |
| COG0556 | L | NC_003030 | 4.93204 | −29 | CGAACAAATGTTTG | uvrB | CAC0502 |
| COG0556 | L | NC_002758 | 4.82157 | −103 | CGAACAAACGTTTG | uvrB | SAV0758 |
| COG0556 | L | NC_004193 | 4.65391 | −69 | CGAATACTTGTTCG | — | OB2488 |
| COG0556 | L | NC_003212 | 4.62091 | −158 | CGAAAATATGTTCG | uvrB | LIN2632 |
| COG0556 | L | NC_003210 | 4.61721 | −160 | CGAAAATATGTTCG | uvrB | LMO2489 |
| COG0556 | L | NC_004461 | 4.90087 | −128 | CGAACAAATGTTTG | — | SE0541 |
| COG0389 | L | NC_003366 | 4.82409 | −26 | TGAACATATGTTTG | dinP | CPE1566 |
| COG0389 | L | NC_003923 | 4.77999 | −49 | GGAACACGTGTTCG | — | MW1251 |
| COG0389 | L | NC_002758 | 4.33641 | −6 | AGAACATTTGTTCT | — | SAV1364 |
| COG0389 | L | NC_002745 | 4.81919 | −49 | AGAACACGTGTTCG | — | SA1196 |
| COG0389 | L | NC_003210 | 4.72978 | −33 | AGAACGCTTGTTCG | — | LMO1975 |
| COG0389 | L | NC_004461 | 4.32424 | −75 | AGAACAAATGTTCT | — | SE1046 |
| COG0389 | L | NC_003212 | 4.73647 | −33 | AGAACGCTTGTTCG | — | LIN2082 |
| COG0389 | L | NC_004557 | 4.82946 | −40 | AGAACATAAGTTTG | — | CTC00437 |
| COG0389 | L | NC_000964 | 4.37402 | −68 | CGAACATAAGTTCT | yqjW | BSU23710 |
| COG0199 | J | NC_004368 | 4.29015 | −280 | TGAACGTATGTACG | — | GBS0071 |
| COG0199 | J | NC_002662 | 4.9713 | −280 | CGAACGTATGTTCG | rpsN | L0391 |
| COG0199 | J | NC_003028 | 4.22998 | −280 | TGAACGTATGTACG | — | SP0222 |
| COG0199 | J | NC_003098 | 4.22977 | −280 | TGAACGTATGTACG | rpsN | SPR0202 |
| COG0199 | J | NC_002737 | 4.41534 | −278 | CGAACGTATGTACG | rpsN | SPY0064 |
| COG0199 | J | NC_003485 | 4.41477 | −278 | CGAACGTATGTACG | rpsN | SPYM18_0065 |
| COG0199 | J | NC_004432 | 4.29794 | −140 | CGAAATTGTGTATG | — | MYPE10040 |
| COG0199 | J | NC_004070 | 4.41397 | −278 | CGAACGTATGTACG | rpsN.1 | SPYM3_0053 |
| COG0199 | J | NC_004116 | 4.2898 | −280 | TGAACGTATGTACG | — | SAG0071 |
| COG1396 | K | NC_003485 | 4.21446 | −8 | AGAAACCATGTTAG | — | SPYM18_0038 |
| COG1396 | K | NC_003923 | 4.32136 | −263 | GGAACAAGTGTACG | — | MW1228 |
| COG1396 | K | NC_004070 | 4.21434 | −8 | AGAAACCATGTTAG | — | SPYM3_0031 |
| COG1396 | K | NC_002570 | 4.4873 | −118 | GGAACGGGCGTTTG | — | BH0096 |
| COG1396 | K | NC_003028 | 4.47861 | −127 | TGAACAAATGTTGG | — | SP1115 |
| COG1396 | K | NC_002737 | 4.21453 | −8 | AGAAACCATGTTAG | — | SPY0037 |
| COG1396 | K | NC_004193 | 4.36271 | −253 | TGAACAGGAGTTAG | — | OB3501 |
| COG1396 | K | NC_003366 | 4.35319 | −58 | TGAACATTTGATTG | — | CPE2564 |
| COG0098 | J | NC_003028 | 4.38376 | −109 | AGAAGTGGTGTTCG | — | SP0227 |
| COG0098 | J | NC_004116 | 4.25066 | −110 | TGAAGTGGTGTTTG | rpsE | SAG0075 |
| COG0098 | J | NC_002737 | 4.23373 | −110 | TGAAGTGGTGTTTG | rpsE | SPY0069 |
| COG0098 | J | NC_004368 | 4.25082 | −110 | TGAAGTGGTGTTTG | rpsE | GBS0075 |
| COG0098 | J | NC_003098 | 4.38367 | −109 | AGAAGTGGTGTTCG | rpsE | SPR0206 |
| COG0098 | J | NC_004070 | 4.23345 | −110 | TGAAGTGGTGTTTG | rpsE | SPYM3_0057 |
| COG0098 | J | NC_004350 | 4.24208 | −109 | TGAAGTGGTGTTTG | rs5 | SMU.2009 |
| COG0098 | J | NC_003485 | 4.23361 | −110 | TGAAGTGGTGTTTG | rpsE | SPYM18_0069 |
| COG0457 | R | NC_004557 | 4.34686 | −240 | GGAAGAAGAGTTTG | — | CTC02554 |
| COG0457 | R | NC_002570 | 4.38934 | −268 | CGAAGCAACGTTTG | — | BH3054 |
| COG0457 | R | NC_004557 | 4.39536 | −233 | AGAACAATTGTATG | — | CTC01089 |
| COG0457 | R | NC_002745 | 4.37946 | −17 | AGAAATGAGGTTCG | — | SA1448 |
| COG0457 | R | NC_003923 | 4.3797 | −17 | AGAAATGAGGTTCG | — | MW1570 |
| COG0457 | R | NC_003098 | 4.47855 | −97 | TGAACAAATGTTGG | rggD | SPR1022 |
| COG0457 | R | NC_002758 | 4.3788 | −86 | AGAAATGAGGTTCG | — | SAV1620 |
Note: Gene, Synonym column is as per NCBI ptt table. Class codes—K involved in transcription, L in DNA replication, recombination and repair, J represents orthologs involved in translation, ribosomal structure and biogenesis and so on.