| Literature DB >> 34541463 |
Jennifer L Wilson1, Mike Wong2, Nicholas Stepanov3, Dragutin Petkovic2,3, Russ Altman4,5.
Abstract
OBJECTIVES: We sought to cluster biological phenotypes using semantic similarity and create an easy-to-install, stable, and reproducible tool.Entities:
Keywords: Docker containers; computational tools; high-throughput analysis; network analysis; phenotype analysis; systems biology
Year: 2021 PMID: 34541463 PMCID: PMC8442701 DOI: 10.1093/jamiaopen/ooab079
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Figure 1.How to download Phenotype Clustering (PhenClust) and PathFX Docker container via PathFXweb. Users are guided to install Docker desktop and reminded of the requirement to have a National Library of Medicine (NLM) account. The Docker container is available for download after user authentication via the Unified Medical Language System terminology service. The user navigates to the download page, selects “Login to NIH,” is redirected to the NLM page. On the NLM page the user selects the identity provider of their choice. After the license is verified, the user is redirected to PathFXweb and the Docker container is available for download.
Figure 2.Clustering dendrograms from analysis of metformin-associated Concept Unique Identifier (CUI) terms with and without descriptive labels. Phenotype Clustering selected up to 5 of the most frequent words from disease names in the cluster as a label for the dendrogram (left) or provided the number of terms grouped into a cluster or the CUI identifier for un-clustered CUI terms (right). The x-axis represented the between-cluster Euclidean distance as calculated by the linkage function from the fastcluster Python module. The first line of the figure title is an analysis name parameter specified by the user. The branch colors are assigned by default from the scipy.cluster.hierarchy.dendrogram function. The color threshold is set with 0.7*max(linkage_distance) to approximate clusters in the data.