| Literature DB >> 35640593 |
Zhu Liu1, Tremayne Mushayahama1, Bryan Queme1, Dustin Ebert1, Anushya Muruganujan1, Caitlin Mills1, Paul D Thomas1, Huaiyu Mi1.
Abstract
The Annotation Query (AnnoQ) (http://annoq.org/) is designed to provide comprehensive and up-to-date functional annotations for human genetic variants. The system is supported by an annotation database with ∼39 million human variants from the Haplotype Reference Consortium (HRC) pre-annotated with sequence feature annotations by WGSA and functional annotations to Gene Ontology (GO) and pathways in PANTHER. The database operates on an optimized Elasticsearch framework to support real-time complex searches. This implementation enables users to annotate data with the most up-to-date functional annotations via simple queries instead of setting up individual tools. A web interface allows users to interactively browse the annotations, annotate variants and search variant data. Its easy-to-use interface and search capabilities are well-suited for scientists with fewer bioinformatics skills such as bench scientists and statisticians. AnnoQ also has an API for users to access and annotate the data programmatically. Packages for programming languages, such as the R package, are available for users to embed the annotation queries in their scripts. AnnoQ serves researchers with a wide range of backgrounds and research interests as an integrated annotation platform.Entities:
Year: 2022 PMID: 35640593 PMCID: PMC9252745 DOI: 10.1093/nar/gkac418
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Figure 1.Overview of AnnoQ annotation data building process.
Figure 2.Architecture of AnnoQ platform. The Annotation Data (green box) are gathered from WGSA, PANTHER and PEREGRINE, and are used to annotate the variant dataset (yellow box) to create the annotation database. Users can query the annotation data through an API. There are three ways to query the data, an interactive web interface, command-line scripts, and a programming package.
Annotation data types
| Annotation data | Original annotation source | Annotation tools |
|---|---|---|
| Gene-model based annotation (sequence features) | ANNOVAR, SnpEFF, VEP | WGSA |
| Epigenomics | ENCODE, FANTOMS, Esembl | |
| Gene-centric annotation | REVEL, SIFT, FATJMM, Plyphen, etc. | |
| Allele frequency | 1000 Genome, gnomAD, ExAC, etc. | |
| Disease-related variants | COSMIC, ClinVar, GRASP | |
| Non-coding SNVs | CADD, funseq2, RegulomeDB | |
| Functional annotations | Gene Ontology, Reactome Pathways, PANTHER Pathways | PANTHER Service |
| Enhancer to gene links | PEREGRINE | PEREGRINE |
Figure 3.Supported Annotations. The annotations are grouped according to the type of annotation and the source of data. Basic Info is gathered from the genetic variant dataset. PANTHER Functional Annotations are functional annotations from the PANTHER System. PEREGRINE Functional Annotations are annotations to enhancers using the PEREGRINE database. The remaining are from WGSA. PANTHER also provides functional annotations to genes under ANNOVAR, SnpEff and VEP.
Number of annotation types in each annotation tool or resource group
| Annotation tools and resources | Number of annotation types |
|---|---|
| ANNOVAR | 56 |
| SnpEff | 79 |
| VEP | 75 |
| PANTHER Functional Annotation | 48 |
| PEREGRINE Functional Annotation | 18 |
| allele frequencies | 175 |
| disease related databases | 21 |
| others | 130 |
Figure 4.Comparison of gene annotations among different tools. For most variants, all three tools (ANNOVAR, SnpEff and VEP) are in agreement about the type of genomic region the variant appears in; however, a large fraction of variants have some disagreement among tools, and about 10% have different calls from all three tools.
Figure 5.AnnoQ Query UI.
Comparison of features between AnnoQ and other annotation tools
| Tools | ||||||
|---|---|---|---|---|---|---|
| AnnoQ | ANNOVAR | SnpEff | VEP | dbNSFP | MyVariant.info | |
| Require download and installation | No | Yes | Yes | Yes | Yes | No |
| Pre-annotated | Yes | No | No | No | Yes | Yes |
| Annotations from Multiple Tool | Yes | No | No | No | Yes | No |
| Realtime response | Yes | No | No | No | No | Yes |
| Complex Query functions | Yes | No | No | No | No | Yes |
| Interactive User Interface | Yes | Yes* | No | No | No | No |
| Functional annotation | Genes, GO and pathways | Genes only | Genes only | Genes only | Genes only | Genes only |