| Literature DB >> 35845648 |
Cong Feng1, Xingwei Wang1,2,3, Shishi Wu1,2,3, Weidong Ning1,4, Bo Song1, Jianbin Yan1, Shifeng Cheng1.
Abstract
Haplotype identification, characterization and visualization are important for large-scale analysis and use in population genomics. Many tools have been developed to visualize haplotypes, but it is challenging to display both the pattern of haplotypes and the genotypes for each single SNP in the context of a large amount of genomic data. Here, we describe the tool HAPPE, which uses the agglomerative hierarchical clustering algorithm to characterize and visualize the genotypes and haplotypes in a phylogenetic context. The tool displays the plots by coloring the cells and/or their borders in Excel tables for any given gene and genomic region of interest. HAPPE facilitates informative displays wherein data in plots are easy to read and access. It allows parallel display of several lines of values, such as phylogenetic trees, P values of GWAS, the entry of genes or SNPs, and the sequencing depth at each position. These features are informative for the detection of insertion/deletions or copy number variations. Overall, HAPPE provides editable plots consisting of cells in Excel tables, which are user-friendly to non-programmers. This pipeline is coded in Python and is available at https://github.com/fengcong3/HAPPE.Entities:
Keywords: Excel; SNPs; haplotype; phylogenetic clustering; visualization
Year: 2022 PMID: 35845648 PMCID: PMC9284118 DOI: 10.3389/fpls.2022.927407
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
FIGURE 1Workflow of HAPPE. The inputs and outputs colored in white are required, whereas the others are optional.
FIGURE 2Example of HAPPE on the display of S. viridis gene Sevir.5G085400. (A) Haplotypes of the S. viridis gene Sevir.5G085400, which shows additional information, including (1) a tree, (2) descriptions of samples, (3) annotations of SNPs and (4) a heatmap of read depth. (B) Magnified view of Panel 2 showing the descriptions of the samples. (C) A map generated using the geographical coordinates shown in the table. (D) Zoomed-in view of Panel 3 showing the annotations of SNPs. (E) A deletion event shown by the heatmap of read depth in Panel 3, which was further confirmed in a genome browser. More details can be found in Supplementary Tables 1–3.
FIGURE 3Example of HAPPE on the display of the S. viridis gene Sevir.5G394700. (A) Haplotypes of the S. viridis gene Sevir.5G394700 with additional information, including (1) a tree, (2) descriptions of samples, (3) annotations of SNPs and (4) a heatmap of read depth. (B) Zoomed-in view of Panel 2 showing the details of the samples. (C) Distribution of haplotypes of the gene in different subgroups. More details can be found in Supplementary Tables 4–7.