| Literature DB >> 27131357 |
Bruno Ribeiro-Gonçalves1, Alexandre P Francisco2, Cátia Vaz3, Mário Ramirez4, João André Carriço5.
Abstract
High-throughput sequencing methods generated allele and single nucleotide polymorphism information for thousands of bacterial strains that are publicly available in online repositories and created the possibility of generating similar information for hundreds to thousands of strains more in a single study. Minimum spanning tree analysis of allelic data offers a scalable and reproducible methodological alternative to traditional phylogenetic inference approaches, useful in epidemiological investigations and population studies of bacterial pathogens. PHYLOViZ Online was developed to allow users to do these analyses without software installation and to enable easy accessing and sharing of data and analyses results from any Internet enabled computer. PHYLOViZ Online also offers a RESTful API for programmatic access to data and algorithms, allowing it to be seamlessly integrated into any third party web service or software. PHYLOViZ Online is freely available at https://online.phyloviz.net.Entities:
Mesh:
Year: 2016 PMID: 27131357 PMCID: PMC4987911 DOI: 10.1093/nar/gkw359
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Different features available at Tree Visualization. (A) MST of a Streptococcus pneumoniae cgMLST dataset. Colors were attributed to nodes according to the serotypes existing in the auxiliary data file. (B) N Locus Variant graph. All nodes with distances equal or above 12 were linked. It is possible to visualize two distinct clusters in the 19F serotypes at a level of 12 differences between profiles. At that level, 6A and 6B serotypes are also clustered together. (C) Tree cut-off algorithm. All links from the MST with a distance value above 60 were deleted. Clusters are defined with different serotypes.
Figure 2.Interactive Distance Matrix constructed from a node selection of a cgMLST Streptococcus pneumoniae dataset. Matrix cells were ordered according to serotype and colors represent distances computed through pairwise comparisons of profiles. Selected nodes on the distance matrix (red border color) have their information displayed in the table according to the selected auxiliary data fields.