| Literature DB >> 19208128 |
Corinna Kolárik1, Roman Klinger, Martin Hofmann-Apitius.
Abstract
BACKGROUND: Posttranslational modifications of histones influence the structure of chromatine and in such a way take part in the regulation of gene expression. Certain histone modification patterns, distributed over the genome, are connected to cell as well as tissue differentiation and to the adaption of organisms to their environment. Abnormal changes instead influence the development of disease states like cancer. The regulation mechanisms for modifying histones and its functionalities are the subject of epigenomics investigation and are still not completely understood. Text provides a rich resource of knowledge on epigenomics and modifications of histones in particular. It contains information about experimental studies, the conditions used, and results. To our knowledge, no approach has been published so far for identifying histone modifications in text.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19208128 PMCID: PMC2648793 DOI: 10.1186/1471-2105-10-S1-S28
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Histone modifications. Histone modifying groups, molecules, and processes [1,41-43].
| Modification Types | Modification examples |
| Groups | acetyl, methyl, phosphate, ADP ribosyl, carbonyl, sumoyl |
| Molecules | biotin, ubiquitin |
| Process | proline isomerization, arginine deamination (citrulline generation) |
Figure 1Statistics on articles published about histone modifications in PubMed. Number of published articles about histone modifications in MEDLINE obtained by a coocurrence search of histone terms and modification terms with ProMiner.
Features applied as parameters of the CRF. Applied features which are used as parameters of the CRF are ordered by their classes, corresponding feature examples and explanations are given.
| Name | Explanation |
| All Caps | [A-Z]+ |
| Natural Number | [0–9]+ |
| Alpha-Num | [A-Za-z0–9]+ |
| Autom. Prefixes/Suffixes | Autom. generation of a feature for every token: match that prefix or suffix |
| WordsAsClass | Autom. generation of a feature for every token: match that token |
| Spaces | Is a token preceded or succeeded by white space |
| In Brackets | Is a token pre-ceded or succeeded by brackets |
Figure 2Annotated example text. Example title and abstract (PMID:18157086, Edmunds et.al 2008) with histone modifications annotated as entity type Hmod.
Figure 3Results of the feature analysis. Recall, precision, and F1 measure are given for every sinlge feature analysis experiment. The non-used features or combinations of them from the two classes Automatic generated morphological features (AM) and Context (C) are provided.
Results of the histone modification recognition approach. Results of the 10-fold cross-validation on the corpus EPI-TRAIN and testing of the model on the independent corpus EPI-TEST. Recall, precision, F1 measure, and the standard deviation for the cross-validation are given.
| E | E | |
| Recall | 0.81 (± 0.05) | 0.76 |
| Precision | 0.87 (± 0.05) | 0.87 |
| F1 measure | 0.84 (± 0.05) | 0.81 |
Results of the term standardization process. Given is the number of annotated histone modification terms (Ann. terms) and the fraction (in %) of correctly standardized terms (Std. terms).
| E | E | |
| Ann. terms | 414 | 123 |
| Std. terms | 397 (95.89%) | 121 (98.37%) |
Application of the developed approach: number of Medline articles for the most often occuring histone modifications. Number of obtained MEDLINE abstracts for the most often occurring histone modifications after term identification and standardization. (The histone modification example 'H3K9me3' introduced in Section Background is marked in bold.)
| Modification type | Number of articles |
| H3 K 9 me | 231 |
| H3 K 4 me | 173 |
| H3 K 4 me 3 | 104 |
| H3 K 9 me 2 | 80 |
| H3 S 10 ph | 79 |
| H3 K 27 me 3 | 71 |
| H3 K 9 ac | 62 |
| H3 K 27 me | 60 |
| H3 K 4 me 2 | 58 |