| Literature DB >> 32160033 |
Ali Ebrahimpour Boroojeny1, Akash Shrestha1, Ali Sharifi-Zarchi2, Suzanne Renick Gallagher1, Süleyman Cenk Sahinalp3, Hamidreza Chitsaz1.
Abstract
Graph Traversal Edit Distance (GTED) is a measure of distance (or dissimilarity) between two graphs introduced. This measure is based on the minimum edit distance between two strings formed by the edge labels of respective Eulerian traversals of the two graphs. GTED was motivated by and provides the first mathematical formalism for sequence coassembly and de novo variation detection in bioinformatics. Many problems in applied machine learning deal with graphs (also called networks), including social networks, security, web data mining, protein function prediction, and genome informatics. The kernel paradigm beautifully decouples the learning algorithm from the underlying geometric space, which renders graph kernels important for the aforementioned applications. In this article, we introduce a tool, PyGTED to compute GTED. It implements the algorithm based on the polynomial time algorithm devised for it by the authors. Informally, the GTED is the minimum edit distance between two strings formed by the edge labels of respective Eulerian traversals of the two graphs.Keywords: clustering genera; coassembly; de novo variation detaction; graph comparison; graph kernel; linear programming
Mesh:
Year: 2020 PMID: 32160033 PMCID: PMC7207050 DOI: 10.1089/cmb.2019.0510
Source DB: PubMed Journal: J Comput Biol ISSN: 1066-5277 Impact factor: 1.479