| Literature DB >> 22682510 |
Sarah A Pendergrass1, Scott M Dudek, Dana C Crawford, Marylyn D Ritchie.
Abstract
BACKGROUND: Phenome-Wide Association Studies (PheWAS) can be used to investigate the association between single nucleotide polymorphisms (SNPs) and a wide spectrum of phenotypes. This is a complementary approach to Genome Wide Association studies (GWAS) that calculate the association between hundreds of thousands of SNPs and one or a limited range of phenotypes. The extensive exploration of the association between phenotypic structure and genotypic variation through PheWAS produces a set of complex and comprehensive results. Integral to fully inspecting, analysing, and interpreting PheWAS results is visualization of the data.Entities:
Year: 2012 PMID: 22682510 PMCID: PMC3476448 DOI: 10.1186/1756-0381-5-5
Source DB: PubMed Journal: BioData Min ISSN: 1756-0381 Impact factor: 2.522
Listed are all the PheWAS-View plotting options, parameters, and flags (format: ) for creating PheWAS-View plots using phewas_view.rb
| | |
| Show PheWAS-View version | |
| PheWAS-View formatted file for input | |
| Optional output name for the resultant plot | |
| Main title for the plot (enclose in quotes) | |
| Image format for output (png default). Other options depend on ImageMagick installation. | |
| Low resolution image (72 dpi) | |
| Rotate final image 90 degrees | |
| p-value threshold, values less significant will be plotted in grey | |
| Maximum p-value to plot. Values less significant than the specified cut off are not plotted | |
| Draw a red line at the designated p-value | |
| Include direction of effect on plot | |
| Include sample size plot | |
| Optional ancestry map file | |
| Only results matching this phenotype class name are plotted | |
| Display detailed information for best score at each phenotype | |
| PheWAS expected SNP/phenotype file | |
| List of race/ethnicities to include (AA, EA, MA) | |
| SNP ID to display from input file | |
| Optional phenotype list for inclusion | |
| No background lines drawn on plot | |
| Optional file with phenotype correlations | |
| | |
| Produce sun plot | |
| SNP ID to display in center of sun plot | |
| Phenotype to display in center of sun plot | |
| Gene to display in center of sun plot | |
| Include gene name along with SNP, when SNP is selected for sun plot | |
| Include ancestry as description of result for sun plot | |
| Choose a p-value threshold, p-values less significant will not be plotted | |
| For plotted results, any results more significant will be plotted in red | |
| To apply direction of effect for phenotypes in sun plot, - is negative direction, + is positive direction |
Figure 1Standard PheWAS-View Output Example Plot. A series of simulated PheWAS results plotted in PheWAS-View for a group of phenotypes. The y-axis presents –log10(p-value) of the tests of association for all SNPs for each phenotype, and the x-axis represents individual phenotypes. In this way all results for all SNPs are plotted for each phenotype.
Figure 2Selecting Phenotypes for the Plot. PheWAS-View allows the user to filter the data on various criteria, including limiting the plot to a specific group or list of phenotypes. This is an example of filtering on the phenotype class “Allergy” in PheWAS-View. The data plotted only include those phenotypes that were notated as being in the allergy phenotype class. PheWAS-View has an option for supplying two pieces of phenotypic information, a short and a long phenotype description, in this case the short phenotypic description for all results is “Phen”.
Figure 3Options for Highlighting Results Based on p-value Thresholds. Figure 3A shows the result of using the parameter -p p-value, where results more significant than a specified p-value threshold are plotted in blue (in this case p = 0.01), and the other results are plotted in grey. Figure 3B shows the result of using the parameter -R p-value to plot a red line at a p-value of interest (p = 0.01). Figure 3C shows the results of using the parameter –m p-value to plot only those values more significant than a chosen p-value threshold (p = 0.01).
Figure 4Vertical Format for PheWAS-View Plots. PheWAS-View allows results to be plotted in a vertical format. Figure 4 shows the same data used in Figure 2, plotted in a vertical format through using the parameter –a. Compared to previous figures in this manuscript, the phenotypes are now listed along the y-axis, and the x-axis represents –log10(p-value) of the tests of association. In addition, using the parameter –B will plot the SNP identifier as well as direction of genetic effect for the association (+ for positive direction of effect, - for negative) for the most significant association for each phenotype listed. Plotting PheWAS results this way facilitates the reading of phenotype descriptions, while still allowing the most significant SNP-phenotype results to be inspected visually.
Figure 5Additional Information on Sample Size and Direction of Effect. For this plot the –b parameter was used to plot the direction of effect for all associations for each phenotype in a separate track. The sample size for each association was also plotted in a separate track using the –A parameter.
Figure 6Comparisons Across Groups. If the PheWAS analyses are stratified across multiple genetic ancestries or groups, it can be useful to view the similarities and differences of the significance of an association and direction of effect across groups. The output plot can be filtered by a single SNP (using –s SNP rs ID) and one or more groups (−r group1, group2, …) by specifying an identifier for specific results in a column labeled “Groups” in the input file (example in Figure 6). Triangles point up for direction of effect that is positive, and point down for direction of effect that is negative. A different color represents each group.
PheWAS-View recognizes the population or genetic ancestry abbreviations listed below that can be used as “group” identifiers in the input file
| red | European American | |
| blue | African American | |
| green | Hawaiian | |
| purple | Asian Pacific Islander | |
| orange | American Indian |
Figure 7Phenotypic Correlation Heatmap. A heatmap of correlations can be plotted with PheWAS-View if a matrix of all pairwise correlation coefficients between phenotypes is plotted. Figure 7A is a PheWAS-View plot with the addition of a correlation heatmap. The cells of the correlation plot range from yellow to blue in the direction of decreasing absolute value of the correlations. Figure 7B shows the same plot in vertical rather than horizontal format after using the –a parameter.
Figure 8Distinguishing “Expected” and “Novel” Associations Using Color. In this plot, novel associations with p < 0.01 are plotted in a different color (purple) from results that are more expected (blue). Using color to distinguish expected and more novel results facilitates prioritizing results to investigate further
Figure 9Sun Plot of Association Results for a Single SNP. PheWAS results plotted for a single SNP with p < 0.05 (−m p-value). The length of the line corresponds to the significance of the p-value, with the most significant result at the top (“noon”) sweeping around clockwise. The p-value of the most significant result is listed to provide a sense of scale. The lines are red for p-values more significant than a threshold of p = 1x10-3 (−p p-value). To add a “-” or “+” for direction of genetic effect for each phenotype use the flag –b.