| Literature DB >> 26244889 |
E Abinaya1, Pankaj Narang2, Anshu Bhardwaj3.
Abstract
Genetic variations play a crucial role in differential phenotypic outcomes. Given the complexity in establishing this correlation and the enormous data available today, it is imperative to design machine-readable, efficient methods to store, label, search and analyze this data. A semantic approach, FROG: "FingeRprinting Ontology of Genomic variations" is implemented to label variation data, based on its location, function and interactions. FROG has six levels to describe the variation annotation, namely, chromosome, DNA, RNA, protein, variations and interactions. Each level is a conceptual aggregation of logically connected attributes each of which comprises of various properties for the variant. For example, in chromosome level, one of the attributes is location of variation and which has two properties, allosomes or autosomes. Another attribute is variation kind which has four properties, namely, indel, deletion, insertion, substitution. Likewise, there are 48 attributes and 278 properties to capture the variation annotation across six levels. Each property is then assigned a bit score which in turn leads to generation of a binary fingerprint based on the combination of these properties (mostly taken from existing variation ontologies). FROG is a novel and unique method designed for the purpose of labeling the entire variation data generated till date for efficient storage, search and analysis. A web-based platform is designed as a test case for users to navigate sample datasets and generate fingerprints. The platform is available at http://ab-openlab.csir.res.in/frog.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26244889 PMCID: PMC4526677 DOI: 10.1371/journal.pone.0134693
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Properties in FROG and their source.
| S.No. | Properties in FROG | VariO | PAGE-OM |
|---|---|---|---|
| 1. | Clinical Data Referenced | ✓ | |
| 2. | Repeats and penetrance | ||
| 3. | Outcome of substitution on function | ||
| 4. | Accessibility | ✓ | |
| 5 | Synonymous/non synonymous change | ||
| 6. | Allosomes / autosomes | ||
| 7. | Works on whole genome | ✓ | |
| 8. | Detailed description of Substitution | ||
| 9. | Copy number variation | ||
| 10. | Hap map data integration | ||
| 11. | Experimental details | ✓ | |
| 12. | Organism independent | ✓ | ✓ |
| 13. | Epigenetic changes | ✓ |
Fig 1(A) Scope of FROG (B) Number of attributes and terms in six levels.
Fig 2The attributes, ontology terms and their relationship within (A) Chromosome and (B) DNA level.
Fig 4The attributes, ontology terms and their relationship within (A) Variation and (B) Interaction level.
Fig 3The attributes, ontology terms and their relationship within (A) RNA and (B) Protein level.
Fig 5(A) Summary of the fingerprints designed to represent ontology terms in the six levels (B) As an example, the 15 attributes of the Variation level are listed along with their bit annotation (C) Example in Table 2 is explained with help of color coded bits.
Example variant with assigned terms and fingerprints.
| Column Name | Value | Description |
|---|---|---|
| Variant Id | 531 | Database (MitoLSDB) ID |
| Genomic position | m.10084T>C | mtDNA T>C change at 10084 |
| Disease | diabetes- Angiopathy | Associated disease |
| Protein Name | ND3 | Affected protein |
| Changed amino acid | p.I9T | Change of Isoleucine to Threonine at position 9 of protein. |
| DNA Level fingerprints | 01011********** | First 3 bits (010): Transition |
| 4th and 5th bits (11): Transition (T to C) | ||
| Protein Level fingerprints | 00 | 1st and 2nd bits (00): Substitution |
| 4th, 5th,6th and 7th (1101): Pathogenecity as benign and tolerated predicted by Sift and Polyphen, respectively | ||
| Variation Level fingerprints | 010111******010000 | 1st and 2nd bits (01): Maternal |
| 3rd bit (0): Single nucleotide | ||
| 4th bit (1): Synonymous | ||
| 5th bit (1): Tissue details available | ||
| 6th bit (1): Experimental data available | ||
| 13th bit (0): Conservation as variant | ||
| 14th bit (1): Mesh term available | ||
| 15-18th (000): Model organism not reported |
(*): Information not available in the database (MitoLSDB) to generate fingerprint for respective variant property.
Fig 6(A) The screenshot of FROG Interface displaying ontology terms and associated fingerprints (B) The search tool interface to query the DNA variation using ontology (C) Sample output of the search tool.