| Literature DB >> 23981286 |
Sushil Tripathi1, Karen R Christie, Rama Balakrishnan, Rachael Huntley, David P Hill, Liv Thommesen, Judith A Blake, Martin Kuiper, Astrid Lægreid.
Abstract
Transcription factors control which information in a genome becomes transcribed to produce RNAs that function in the biological systems of cells and organisms. Reliable and comprehensive information about transcription factors is invaluable for large-scale network-based studies. However, existing transcription factor knowledge bases are still lacking in well-documented functional information. Here, we provide guidelines for a curation strategy, which constitutes a robust framework for using the controlled vocabularies defined by the Gene Ontology Consortium to annotate specific DNA binding transcription factors (DbTFs) based on experimental evidence reported in literature. Our standardized protocol and workflow for annotating specific DNA binding RNA polymerase II transcription factors is designed to document high-quality and decisive evidence from valid experimental methods. Within a collaborative biocuration effort involving the user community, we are now in the process of exhaustively annotating the full repertoire of human, mouse and rat proteins that qualify as DbTFs in as much as they are experimentally documented in the biomedical literature today. The completion of this task will significantly enrich Gene Ontology-based information resources for the research community. Database URL: www.tfcheckpoint.org.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23981286 PMCID: PMC3753819 DOI: 10.1093/database/bat062
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Primary GO terms/subgraphs used for DbTF annotation. (A) GO subgraph used for sequence-specific DbTF. In this graph, sequence-specific DNA binding MF terms (yellow), sequence-specific DNA binding TF activity MF terms (green) and transcription regulation BP (blue) are shown along with the relationships between terms in the graph structure. (B) GO subgraph used for transcription factor binding transcription factors. In this graph, the different color coding represents the following: TF binding MF terms (yellow), transcription regulation BP (blue) and TF binding TF activity MF terms (green). I, P and H on top of the lines stand for relationships ‘is_a’, ‘part_of’ and ‘has_part’.
Inference of transcription factor activity terms from DNA/TF binding and transcription regulation terms
Each transcription factor activity term (green) is determined by the composite annotation of the corresponding DNA binding or TF binding term (yellow) and a transcription regulation term (blue).
Evidence code table
| DNA binding/ TF binding | Transcription regulation | TF activity |
|---|---|---|
| IDA | IDA | IDA/ICa |
| IMP | IMP | IMP/ICa |
| IDA | IMP | IDA, IMP/ICa |
| IMP | IDA | IMP, IDA/ICa |
| IPIb | IDA | IPIb, IDA/ ICa |
ICa if evidence for ‘DNA binding / TF binding’ and ‘transcription regulation’ comes from two different papers.
IPIb applicable only for TF binding terms.
Assays documenting specific DNA binding
| Experimental assays | Variants | Evidence code | PSI-MI code |
|---|---|---|---|
| EMSA | Nuclear extract from native tissue or cells | No evidence | MI:0413 |
| Nuclear extracts from cells or tissue with ectopic expression of a TF | IDA | MI:0413 | |
| Purified TF ( | IDA | MI:0413 | |
| Nuclear extract from cells with ectopic expression of a mutated TF | IMP | MI:0413 | |
| Purified mutated TF ( | IMP | MI:0413 | |
| Electrophoretic mobility supershift assay (EMSA supershift) | Nuclear extract from native tissue or cells | IDA | MI:0412 |
| Nuclear extracts from cells or tissue with ectopic expression of a TF | IDA | MI:0412 | |
| Purified TF ( | IDA | MI:0412 | |
| Nuclear extract from cells with ectopic expression of a mutated TF | IMP | MI:0412 | |
| Purified mutated TF ( | IMP | MI:0412 | |
| Footprinting | IDA | MI:0417 | |
| DNase I footprinting (DNA footprint) | IDA | MI:0606 | |
| Methylation interference assay (MIC) | IDA | MI:1189 | |
| Ultraviolet (UV) footprinting (UV-footprint) | IDA | MI:1191 | |
| Dimethylsulphate footprinting (DMS-footprint) | IDA | MI:0603 | |
| Hydroxy radical footprinting (Hydroxy-footprint) | IDA | MI:1190 | |
| Potassium permanganate footprinting (KMnO4-footprint) | IDA | MI:0604 | |
| Affinity chromatography technology | IDA | MI:0004 | |
| Pull down | IDA | MI:0096 | |
| Southwestern blot assay (SW-blot) | IDA | ||
| IDA | MI:0657 | ||
| X-ray crystallography | IDA | MI:0114 |
The experimental assays are denoted with their standard nomenclature in PSI-MI; for the detailed description please see: http://www.ebi.ac.uk/ontologylookup/browse.do?ontName=MI. For Southwestern blot assay, see: http://www.nlm.nih.gov/mesh/.
Reporter gene-based assays variants documenting transcription regulation
This table is a decision matrix for selecting GO evidence codes based on the method used for TF identification (purple) and transcription regulation (green). wt = wild type, mut = mutated.
Assays documenting TF binding
| Assays | Evidence code | PSI-MI code |
|---|---|---|
| 2-Hybrid interactions | IPI | MI:0018 |
| Co-purification | IPI, IDA | MI:0004 |
| Co-immunoprecipitation | IPI, IDA | MI:0019 |
The experimental assays are denoted with their standard nomenclature in PSI-MI; for the detailed description please see: http://www.ebi.ac.uk/ontologylookup/browse.do?ontName=MI.
Figure 2.Sequence-specific DNA binding TF (DbTF) curation workflow. This workflow represents the step-by-step procedure for curating experimentally verified mammalian DbTFs from scientific publications. Selection of scientific publication from the literature corpus is the starting point of the curation procedure. From each relevant publication, DbTF-specific GO-terms are annotated and recorded.
Figure 3.UniProt-GOA screenshot of some of the DbTF annotations. The annotations generated using the DbTF curation guidelines discussed here can be accessed from the GO database using the QuickGO tool.