| Literature DB >> 26288567 |
Archana Jayaraman1, Kaiser Jamil2, Haseeb A Khan3.
Abstract
There is a need to identify novel targets in Acute Lymphoblastic Leukemia (ALL), a hematopoietic cancer affecting children, to improve our understanding of disease biology and that can be used for developing new therapeutics. Hence, the aim of our study was to find new genes as targets using in silico studies; for this we retrieved the top 10% overexpressed genes from Oncomine public domain microarray expression database; 530 overexpressed genes were short-listed from Oncomine database. Then, using prioritization tools such as ENDEAVOUR, DIR and TOPPGene online tools, we found fifty-four genes common to the three prioritization tools which formed our candidate leukemogenic genes for this study. As per the protocol we selected thirty training genes from PubMed. The prioritized and training genes were then used to construct STRING functional association network, which was further analyzed using cytoHubba hub analysis tool to investigate new genes which could form drug targets in leukemia. Analysis of the STRING protein network built from these prioritized and training genes led to identification of two hub genes, SMAD2 and CDK9, which were not implicated in leukemogenesis earlier. Filtering out from several hundred genes in the network we also found MEN1, HDAC1 and LCK genes, which re-emphasized the important role of these genes in leukemogenesis. This is the first report on these five additional signature genes in leukemogenesis. We propose these as new targets for developing novel therapeutics and also as biomarkers in leukemogenesis, which could be important for prognosis and diagnosis.Entities:
Keywords: Acute Lymphoblastic Leukemia (ALL); Gene prioritization; Microarray analysis; Protein interaction network; Therapeutic targets
Year: 2015 PMID: 26288567 PMCID: PMC4537869 DOI: 10.1016/j.sjbs.2015.01.012
Source DB: PubMed Journal: Saudi J Biol Sci ISSN: 1319-562X Impact factor: 4.219
Figure 1Scheme showing overview of the methodology followed in the study.
List of ALL specific genes used as training genes for prioritization.
| S.No. | Gene name | Reference | S.No. | Gene name | Reference |
|---|---|---|---|---|---|
| 1. | NOTCH1 | 16. | SCGF | ||
| 2. | CRLF2 | 17. | AML1 | ||
| 3. | NOTCH3 | 18. | CD49f | ||
| 4. | LEF1 | 19. | Aven | ||
| 5. | USP44 | 20. | BCL2 | ||
| 6. | MYC | 21. | ABCB1 | ||
| 7. | Survivin | 22. | Livin | ||
| 8. | WT1 | 23. | MK | ||
| 9. | hCLP46 | 24. | TNF-R1 | ||
| 10. | MDM2 | 25. | TRAIL-R2 | ||
| 11. | CDX2 | 26. | TRAIL-R4 | ||
| 12. | EPOR | 27. | BCL2L13 | ||
| 13. | MsrB2 | 28. | Ikaros 6 | ||
| 14. | ROR1 | 29. | XIAP | ||
| 15. | ABL1 | 30. | HOX11 |
List of training housekeeping genes with higher expression in bone marrow tissue (Chang et al., 2011).
| S.No. | Gene name | S.No. | Gene name |
|---|---|---|---|
| 1. | ACTB | 16. | RPS10 |
| 2. | B2 M | 17. | RPS11 |
| 3. | EEF1A1 | 18. | RPS12 |
| 4. | HBB | 19. | RPS14 |
| 5. | RPL13A | 20. | RPS15 |
| 6. | RPL23A | 21. | RPS17 |
| 7. | RPL27A | 22. | RPS18 |
| 8. | RPL3 | 23. | RPS23 |
| 9. | RPL30 | 24. | RPS27 |
| 10. | RPL41 | 25. | RPS29 |
| 11. | RPL7A | 26. | RPS3A |
| 12. | RPL9 | 27. | RPS6 |
| 13. | RPL32 | 28. | TPT1 |
| 14. | RPLP0 | 29. | UBB |
| 15. | RPLP1 | 30. | UBC |
Prioritized candidate genes common to ENDEAVOUR, DIR, TOPPGENE tools.
| T-ALL only | B-ALL only | Both B-ALL,T-ALL | |
|---|---|---|---|
| ABI2 | KHDRBS1 | BCR | CDK6 |
| ADA | LCK | BLNK | CSNK1E |
| AOF2 | MAP4K1 | CDK9 | DVL2 |
| BMI1 | MEN1 | CHD4 | GNPTAB |
| CD3D | MLL | ETS2 | MYB |
| CD3E | NPM1 | INSR | NONO |
| CD81 | PTMA | MEF2C | SET |
| CTCF | SMAD2 | NR3C1 | TCF3 |
| DNTT | SMO | NRIP1 | SPTBN1 |
| FGFR1 | TCEA2 | PARP1 | TP53BP1 |
| FUBP1 | TCF7 | PHB | YY1 |
| GATA3 | TFDP2 | PMAIP1 | Number of genes = 11 |
| HDAC1 | TRRAP | SOX4 | |
| HNRNPR | WHSC1 | Number of genes = 13 | |
| ILF3 | ZAP70 | ||
| Number of genes = 30 | |||
Figure 2Directed acyclic graph showing Gene Ontology of biological processes of the 54 prioritized genes (graph obtained from WebGestalt server).
Figure 3STRING database generated protein interaction network generated using prioritized and training protein names as query.
Figure 4STRING Protein–Protein Interaction network, separated into 12 k-Means clusters with clusters containing LCK, MEN1, SMAD2, HDAC1, CDK9 specifically highlighted.