| Literature DB >> 23122428 |
Mousami Srivastava1, Pankaj Khurana, Ragumani Sugadev.
Abstract
BACKGROUND: The tissue-specific Unigene Sets derived from more than one million expressed sequence tags (ESTs) in the NCBI, GenBank database offers a platform for identifying significantly and differentially expressed tissue-specific genes by in-silico methods. Digital differential display (DDD) rapidly creates transcription profiles based on EST comparisons and numerically calculates, as a fraction of the pool of ESTs, the relative sequence abundance of known and novel genes. However, the process of identifying the most likely tissue for a specific disease in which to search for candidate genes from the pool of differentially expressed genes remains difficult. Therefore, we have used 'Gene Ontology semantic similarity score' to measure the GO similarity between gene products of lung tissue-specific candidate genes from control (normal) and disease (cancer) sets. This semantic similarity score matrix based on hierarchical clustering represents in the form of a dendrogram. The dendrogram cluster stability was assessed by multiple bootstrapping. Multiple bootstrapping also computes a p-value for each cluster and corrects the bias of the bootstrap probability.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23122428 PMCID: PMC3532198 DOI: 10.1186/1756-0500-5-617
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Different tissue specific Unigene libraries employed in DDD
| 1 | Adipose | 2 | 10983, 16445 | 8299, 1646 |
| 2 | Adrenal gland | 4 | 6791, 6792, 16377, 18302 | 4582, 1425, 2756, 10026 |
| 3 | Tongue | 4 | 12982, 18362, 18389, 18479 | 1116, 7730, 23564, 7420 |
| 4 | Bladder | 1 | 18307 | 8220 |
| 5 | Blood | 6 | 7037, 7038, 8975, 11923, 6824, 9724 | 1524, 1721, 4215, 6553, 7241, 9352 |
| 6 | Bone | 3 | 1124, 821, 16433 | 6209, 1337, 2615 |
| 7 | Bone marrow | 8 | 6975, 6976, 15949, 15950, 16412, 931, 10409, 10410 | 2459, 3424, 1379, 2707, 3623, 5336, 1798, 5231 |
| 8 | Brain | 36 | 1749, 16376, 16390, 18317, 18318, 18352, 18353, 18415, 18466, 14591, 14592, 186, 17380, 742, 16380, 18310, 18311, 16382, 16383, 18322, 1918, 5655, 6811, 6812, 8570, 19377, 14298, 13711, 13053, 536, 7209, 16384, 18319 | 1600, 1361, 13592, 2579, 15152, 43177, 24735, 23791, 6112, 31569, 15839, 1461, 1334, 18126, 5838, 4298, 3364, 41751, 40272, 2802, 3192, 16612, 7369, 8197, 2790, 3145, 19115, 44785, 1609, 7033, 3931, 4897, 5194, 2565, 25752 |
| 9 | Eye | 26 | 13915, 17747, 7316, 7315, 10273, 10287, 13901, 19465, 10274, 10281, 10280, 10288, 10279, 10284, 19471, 10285, 10286, 12093, 302, 303, 433, 10966, 10282, 10283, 16572, 13902 | 3739, 4253, 1836, 1294, 4005, 3543, 2785, 1334, 1595, 1479, 1115, 6010, 1469, 6719, 3043, 8344, 1199, 7816, 9190, 1732, 2174, 4531, 6279, 1185, 6097, 2946 |
| 10 | Heart | 8 | 15951, 16399, 18354, 18410, 16421, 18503, 16379, 16381 | 5307, 3284, 2667, 8670, 4000, 8502, 7220, 4698 |
| 11 | Stomach | 12 | 10299, 10301, 10302, 10305, 10306, 10310, 10311, 10324, 10325, 18488, 18529, 16432 | 1793, 2409, 1453, 2790, 1692, 1984, 6125, 5913, 1422, 8604, 2574, 3137 |
| 12 | Testis | 5 | 1752, 16441, 18476, 18517, 19376 | 6624, 2983, 46964, 44057, 40315 |
| 13 | Thalamus | 3 | 16437, 18348, 18349 | 3154, 29651, 23010 |
| 14 | Thymus | 6 | 16440, 18518, 18519, 18520, 13049, 18375 | 2365, 1044, 31967, 37541, 3477, 15983 |
| 15 | Thyroid | 3 | 889, 16408, 7004 | 1357, 4827, 3342 |
| 16 | Breast | 4 | 894, 895, 18305, 18475 | 1786, 6346, 2538, 8256 |
| 17 | Cartilage | 2 | 8936, 8940 | 4310, 3858 |
| 18 | Cervix | 2 | 18425, 18506 | 2674, 2619 |
| 19 | Ear | 2 | 371, 18222 | 12666, 3396 |
| 20 | Intestine | 13 | 840, 841, 842, 882, 16385, 18350, 18489, 16387, 16400, 17427, 18473, 16425, 18486 | 1499, 1704, 1759, 11996, 1817, 8191, 2619, 1351, 5536, 8199, 16855, 3403, 2545 |
| 21 | Epidermis | 4 | 20865, 21612, 7269, 21098 | 1627, 13186, 10681, 1135 |
| 22 | Lung | 11 | 10395, 16406, 16413, 16438, 18355, 18363, 18521, 10398, 11912, 18522, 18537 | 12545, 2448, 6839, 3327, 2565, 16156, 2677, 11510, 15695, 32590, 19278 |
| 23 | Liver | 11 | 1365, 12531, 12532, 12535, 12549, 12550, 13859, 16392, 18416, 18525, 18893 | 2302, 1607, 2315, 1136, 2425, 1561, 7537, 6856, 6550, 8424, 31921, |
| 24 | Mammary gland | 3 | 6982, 16420, 16436 | 4561, 3502, 3371 |
| 25 | Lymph | 8 | 2709, 2710, 2711, 3718, 3719, 3720, 10312, 8613 | 3434, 3963, 1000, 9867, 1556, 1828, 7590, 1949 |
| 26 | Teeth | 1 | 12639 | 1576 |
| 27 | Medulla | 1 | 9725 | 9919 |
| 28 | Muscle | 4 | 530, 16391, 45, 18501 | 4271, 2154, 2485, 8276 |
| 29 | Ovary | 7 | 887, 6998, 18421, 18527, 5444, 12637, 12638 | 2341, 3678, 2300, 2543, 10294, 1050, 1031 |
| 30 | Nose | 2 | 358, 13908 | 1702, 24528 |
| 31 | Placenta | 14 | 13037, 16442, 507, 740, 6999, 10403, 10404, 10405, 10424, 10425, 16422, 17682, 18468, 18484, 13000 | 11885, 3501, 1370, 1172, 20941, 1862, 1166, 4260, 4517, 1271, 1529, 1032, 16852, 15843, 7344 |
| 32 | Pancreas | 6 | 16423, 422, 16960, 8840, 3884, 9821 | 4308, 1799, 13791, 60665, 1234, 17183 |
| 33 | Prostate | 8 | 888, 17392, 18469, 19880, 16424, 924, 928 | 1014, 1283, 16483, 41945, 2355, 7609, 1114, 1051 |
| 34 | Uterus | 6 | 1753, 16443, 18523, 18531, 18544, 12528 | 1645, 5121, 30124, 2486, 19168, 3510 |
| 35 | Spleen | 2 | 16431, 18474 | 2717, 33972 |
| 36 | Salivary gland | 1 | 16430 | 2336 |
| 37 | Kidney | 7 | 16393, 16395, 16410, 18374, 18377, 18524, 16429 | 5486, 5150, 3565, 17079, 2561, 15730, 1170 |
| 37 | Pituitary | 4 | 6828, 6829, 13019, 13737 | 3481, 1444, 7327, 1677 |
| 38 | immune cells | 6 | 1317, 17555, 17556, 8892, 12072, 12798 | 11447, 15272, 13685, 6112, 9223, 4548 |
| 39 | Hair | 4 | 21096, 21100, 21099, 21101 | 1269, 1288, 1298, 1543 |
| 40 | Alimentary canal | 2 | 18496, 18418 | 8371, 2586 |
| 41 | Cancerous lung | 8 | 1533, 10419, 537, 914, 14132, 14133, 14134, 14135 | 1377, 4401, 4173, 1850, 4365, 2121, 2847, 1560 |
Differentially expressed normal lung tissue specific genes (DDD1) were identified from UniGene libraries representing 39 human normal tissues (251 libraries, 1903301 ESTs, S. No. 1–21, 23–40) and counterpart normal lung tissue (11 libraries, 125630 ESTs, S. No. 22 only). Differentially expressed lung tissue specific genes (DDD2) were identified from UniGene libraries representing lung cancer libraries (8 libraries, 22694 ESTs, S. No. 41 only) and counterpart normal lung tissue (11 libraries, 125630 ESTs, S. No. 22 only).
Figure 1Go semantic similarity score between the set of normal lung tissue specific genes from TiSGeD (28-horizontal, x-axis) and the differentially expressed lung cancer genes from DDD2 (145-vertical, y-axis). The intensity of the color corresponds to the magnitude of the similarity. Red represents low semantic similarity below the median level whereas the green represents high semantic similarity above the median level.
Figure 2Average correlation distances with hierarchical clustering based on GO semantic similarity score matrix calculated between normal lung tissue specific genes from TiSGed and differentially expressed lung cancer gene from DDD2. Values in red represent AU (Approximately unbiased) p-value and green represents BP (Bootstrap probability) Clusters with AU larger than 95% are highlighted by red rectangle boxes. AU p-value, which is computed by multiscale bootstrap resampling, is a better approximation to unbiased p-value than BP value computer by normal bootstrap resampling.
Lung cancer signature biomarker clusters
| Panel 1 / Cluster 4 | Lung cancer metastasis
diagnostic markers | UCHL1 | 0 | 0.0002 | - | 0 | 0.0011 | + |
| | | LTF | 0.0224 | 0.0018 | +12.44 | 0.0224 | 0.0002 | -112 |
| Panel 2 / Cluster 5 | Chemotherapy/ drug
resistance related lung
cancer biomarkers | TUBA1B | 0 | 0.0002 | - | 0 | 0.0013 | + |
| | | RPSA | 0.0001 | 0.0004 | -4 | 0.0001 | 0.0027 | +27 |
| | | RPL9 | 0.0002 | 0.0006 | -3 | 0.0002 | 0.002 | +10 |
| | | TMSB4X | 0.0004 | 0.001 | -2.5 | 0.0004 | 0.0016 | +4 |
| | | COPB1 | 0.0007 | 0.0002 | +3.5 | 0.0007 | 0 | - |
| | | API5 | 0.0007 | 0.0003 | +2.3 | 0.0007 | 0 | - |
| | | NT5C2 | 0.0008 | 0.0003 | +2.6 | 0.0008 | 0 | - |
| | | CPN | 0.0009 | 0.0001 | +9 | 0.0009 | 0 | - |
| | | PRKAR1A | 0.0017 | 0.0006 | +2.83 | 0.0017 | 0 | - |
| Panel 3 / Cluster 6 | Hypoxia related lung
cancer biomarkers | FTL | 0.0001 | 0.0011 | -11 | 0.0001 | 0.0065 | +65 |
| | | COL1A2 | 0.0001 | 0.0006 | -6 | 0.0001 | 0.0023 | +23 |
| | | GAPDH | 0.0001 | 0.001 | -10 | 0.0001 | 0.0011 | +11 |
| | | IGKC | 0.0002 | 0.0009 | -4.5 | 0.0002 | 0.0016 | +8 |
| | | ALDOA | 0.0002 | 0.0006 | -3 | 0.0002 | 0.0014 | +7 |
| | | COL1A1 | 0.0001 | 0.0004 | -4 | 0.0001 | 0.0009 | +9 |
| | | FN1 | 0.0025 | 0.0012 | +2.08 | 0.0025 | 0.0007 | -3.57 |
| | | TGM2 | 0.0026 | 0.0008 | +3.25 | 0.0026 | 0.0007 | -3.71 |
| | | FOS | 0.0015 | 0.0007 | +2.14 | 0.0015 | 0.0002 | -7.5 |
| | | CTNNA1 | 0.0034 | 0.0008 | +4.25 | 0.0034 | 0.0003 | -11.33 |
| | | FOSB | 0.0024 | 0.0003 | +8 | 0.0024 | 0.0002 | -12 |
| | | APLP2 | 0.0109 | 0.0044 | +2.47 | 0.0109 | 0.0008 | -13.63 |
| | | NCOA4 | 0.0016 | 0.0005 | +3.2 | 0.0016 | 0.0001 | -16 |
| | | HIF1A | 0.0011 | 0.0004 | +2.75 | 0.0011 | 0 | - |
| | | AZIN1 | 0.001 | 0.0005 | +2 | 0.001 | 0 | - |
| | | EHF | 0.001 | 0.0001 | +10 | 0.001 | 0 | - |
| | | TICAM2 | 0.001 | 0.0003 | +3.33 | 0.001 | 0 | - |
| | | NAMPT | 0.0008 | 0.0002 | +4 | 0.0008 | 0 | - |
| | | TNFRSF1A | 0.0008 | 0.0004 | +2 | 0.0008 | 0 | - |
| | | DMBT1 | 0.0008 | 0.0001 | +8 | 0.0008 | 0 | - |
| | | CSNK1A1 | 0.0007 | 0.0003 | +2.33 | 0.0007 | 0 | - |
| Panel 4 / Cluster 7 | Lung cancer specific
extra cellular matrix
biomarkers | TFPI2 | 0 | 0.0004 | - | 0 | 0.002 | + |
| | | RPL10 | 0.0001 | 0.0007 | -7 | 0.0001 | 0.0014 | +14 |
| | | SFTPA1 | 0.0004 | 0 | + | 0.0001 | 0.0011 | +11 |
| | | KIAA1324 | 0.0013 | 0.0002 | +6.5 | 0.0013 | 0 | - |
| | | CRISP3 | 0.0009 | 0.0001 | +9 | 0.0009 | 0 | - |
| NET1 | 0.0007 | 0.0002 | +3.5 | 0.0007 | 0 | - | ||
(−) represents down regulation and (+) represents up regulation. Except SFTPA1, remaining all the up regulated genes in normal condition observed to be down regulated in cancerous condition and vice versa.