| Literature DB >> 31775760 |
Michael D Linderman1, Leo McElroy2, Laura Chang2.
Abstract
BACKGROUND: The complexity of genome informatics is a recurring challenge for genome exploration and analysis by students and other non-experts. This complexity creates a barrier to wider implementation of experiential genomics education, even in settings with substantial computational resources and expertise. Reducing the need for specialized software tools will increase access to hands-on genomics pedagogy.Entities:
Keywords: Genomics education; Personal Genome analysis; Web application
Mesh:
Year: 2019 PMID: 31775760 PMCID: PMC6882182 DOI: 10.1186/s12920-019-0615-3
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Overview of dataflow in MySeq. The MySeq single-page web application performs personal genome analyses in the user’s web browser. (1) MySeq components query a locally stored or remotely available VCF file by genomic coordinates. (2) Internally MySeq uses the Tabix index to fetch and parse only the portion of the file containing variants in the query region. (3) MySeq further analyzes the VCF records entirely in the browser (e.g. displays the genotypes to the user, performs ancestry analysis, etc.). Optionally MySeq can utilize the publicly available MyVariant.info and MyGene.info APIs [37] to annotate variants or translate gene symbols or rsIDs to genomic coordinates for queries (e.g. query for all variants in BRCA1), but does not send any genotypes to a remote server
Description of current MySeq functionality
| Category | Current Capabilities |
|---|---|
| Query | Query for variants by genomic position, rsID and gene name. Obtain comprehensive annotations, e.g. amino acid translation, allele frequency, etc., for each variant from |
| Physical Traits | Predict phenotypes for physical traits, e.g. taste PTC as bitter, based on the genotypes of one or more associated variants. |
| Pharmacogenomics (PGx) | Report genotypes and associated phenotypes based on CPIC guidelines and/or the drug label for Simvastatin and Warfarin. |
| Polygenic Disease Risk | Predict polygenic disease risk for Type 2 Diabetes using a likelihood ratio-based approach [ |
| Ancestry | Scatter plot visualization of the top two principal components using loadings for 496 ancestry informative markers (AIM) [ |
Fig. 2Example of MySeq VCF loading, variant query and PGx interfaces. a The user can load data is several ways, including pre-configured publicly available genomes. b Having loaded NA12878’s genome, the user’s query of chr7:141672604 returned one overlapping variant 7:g.141672604 T > C for which NA12878 is heterozygous. The user clicked on the variant to obtain functional and other annotations from MyVariant.info [37]. (c) Via the “Analyses” dropdown in the header bar (shown fully expanded in the larger screenshot), the user can launch other analyses, e.g. extract variants associated with Warfarin dosing