| Literature DB >> 19401399 |
Dennis J Hazelett1, Daniel L Lakeland, Joseph B Weiss.
Abstract
METHODS: A new method was developed for identifying novel transcription factor regulatory targets based on calculating Local Affinity Density. Techniques from the signal-processing field were used, in particular the Hann digital filter, to calculate the relative binding affinity of different regions based on previously published in vitro binding data. To illustrate this approach, the complete genomes of Drosophila melanogaster and D.pseudoobscura were analyzed for binding sites of the homeodomain proteinc Tinman, an essential heart development gene in both Drosophila and Mouse. The significant binding regions were identified relative to genomic background and assigned to putative target genes. Valid candidates common to both species of Drosophila were selected as a test of conservation.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19401399 PMCID: PMC2732317 DOI: 10.1093/bioinformatics/btp282
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.A region of interest demonstrates the physical location of several predicted Tinman targets and other genes (arrows) relative to regions of high Tinman binding affinity. A spline was fit to the individual affinity data (‘cross’). Some of the displayed targets are not conserved between D.pseudoobscura and D.melanogaster and are therefore not among our predictions (Table 2).
Comparison of motif-mapping methods
| Tinman targets | Annotation | AD | TE | GS |
|---|---|---|---|---|
| CG7895 | 2 | 11 | 11 | |
| CG7902 | 4 | 291 | ||
| CG1322 | 23 | |||
| CG3978 | 36 | 14 | ||
| CG30040 | 67 | |||
| CG6634 | 147 | |||
| CG18647 | 192 | |||
| CG2328 | 266 | |||
| CG1429 | ||||
| CG18144 | ||||
| CG5772 | ||||
| CG3871 |
AD, Affinity Density; TE, TargetExplorer (Sosinsky et al., 2003); GS, GenomeSurveyor (Noyes et al., 2008); Lower rank (larger number) reflects decreasing score produced by each algorithm. Absence of a rank indicates that the algorithm did not predict the target.
Fig. 2.Target predictions were selected on the basis of conservation in two species of Drosophila. Each point (gray) represents the log of scores of a single gene. Dotted lines reveal the cutoff values used for selection. The points corresponding to known targets of Tinman from the literature are highlighted with circles.
Coexpression of prediction sets with Tinman
| Algorithm | Expression data | ovrlp tin expr pat (%) | |
|---|---|---|---|
| Affinity Density | 33 | 18 (54.5) | 0.0071 |
| TargetExplorer | 6 | 2 (33.8) | 0.3291 |
| GenomeSurveyor | 20 | 10 (50.0) | 0.0581 |
Top 100 predicted Tinman targets with equivalent coexpression terms in the BDGP in situ expression database.
Predicted Tinman targets with heart phenotypes
| RNAi target | Annotation | Score | Rank |
|---|---|---|---|
| CG1322 | 3.86 | 23 | |
| CG3978 | 3.49 | 36 | |
| CG4608 | 3.05 | 52 | |
| CG10961 | 2.94 | 65 | |
| CG5462 | 2.73 | 76 | |
| CG31349 | 2.75 | 79 | |
| CG17888 | 2.51 | 93 |
Out of 246, 105 predicted targets were also screened in the RNAi mutation project (Kim et al., 2004). Seven were defective in heart development (P<0.01).
Tinman regulatory target predictions
| I | II | III | IV |
|---|---|---|---|
| fas | CG31708 | CG14250 | scrib |
| CG12772 | bnl | tutl | |
| mXr | msi | LpR2 | CG31647 |
| fz | CG7196 | pyd | |
| Gbeta5 | eag | l(3)82Fd | CG5842 |
| CG12607 | klar | CG30268 | how |
| pk | CG32048 | CG30387 | CG6296 |
| hbn | nau | olf413 | CG6295 |
| Rgk1 | CG10301 | ptc | Spn |
| CG18262 | CG10300 | Btk29A | CG6271 |
| sm | knrl | UGP | |
| rols | Sox21b | CG13862 | CG32040 |
| ed | mspo | CG5391 | Irk2 |
| CG33100 | cnc | CG14559 | nuf |
| Lmpt | Sulf1 | Traf2 | CG3599 |
| beat-IIa | aPKC | dve | faf |
| dlp | Src64B | CG8475 | |
| eya | heph | pros | Pdp1 |
| CG8086 | fred | robo3 | Gr77a |
| Aats-asn | Fas2 | CG18769 | CG7918 |
| CG15336 | Dh31 | beat-VI | Ero1L |
| CG10959 | CG3502 | Mdr50 | wb |
| Doc3 | CG10882 | CG31221 | |
| baz | Argk | unc-5 | Ggamma30A |
| Sema-1a | CheB93a | lqf | CheA7a |
The top 100 predicted targets from Affinity Density algorithm. Superscripts indicate expression data from all available flybase sources; foregut/clypeolabrum primordium, mesoderm or somatic muscle, visceral mesoderm, heart (dorsal vessel). Known targets (Table 2) are highlighted in boldface.