Selena C Feng, Nathan C Sheffield, Jianglin Feng.
Abstract
Genomic interval sets produced by sequencing methods are searched widely and routinely; however, existing metrics for quantifying similarity among interval sets are inconsistent. Here we introduce Seqpare, a self-consistent and effective similarity metric and tool for comparing sequences based on their interval sets. With this metric, the similarity of two interval sets is quantified by a single index, the ratio of their effective overlap to their union: an index of zero indicates unrelated interval sets, and an index of one means that the interval sets are identical. Analysis and tests confirm the effectiveness and self-consistency of the Seqpare metric.
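To make the idea of an overlap-over-union index concrete, here is a minimal Python sketch. It is not the Seqpare algorithm: the abstract does not define "effective overlap", so the sketch substitutes a plain base-pair Jaccard index (covered bases in common divided by covered bases in either set). The function names, the half-open `(start, end)` convention, and the single-chromosome simplification are all assumptions made for illustration. It does reproduce the two boundary cases described above: 0 for disjoint sets and 1 for identical sets.

```python
from typing import List, Tuple

# Assumption: intervals are half-open [start, end) on a single chromosome.
Interval = Tuple[int, int]


def merge(intervals: List[Interval]) -> List[Interval]:
    """Collapse overlapping or touching intervals into disjoint ones."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Extend the previous interval instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_length(intervals: List[Interval]) -> int:
    """Number of positions covered by at least one interval."""
    return sum(end - start for start, end in merge(intervals))


def overlap_over_union(a: List[Interval], b: List[Interval]) -> float:
    """Base-pair Jaccard index: a simplified stand-in, not Seqpare itself."""
    union = covered_length(list(a) + list(b))
    if union == 0:
        return 0.0  # two empty sets: treated here as unrelated by convention
    # Inclusion-exclusion: |A ∩ B| = |A| + |B| - |A ∪ B|
    intersection = covered_length(a) + covered_length(b) - union
    return intersection / union
```

For example, `overlap_over_union([(0, 10)], [(5, 15)])` compares 5 shared positions against 15 covered positions in total. A metric like the one described in the abstract would replace the plain intersection with its own notion of effective overlap.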
Keywords: Genome analysis; algorithm; interval set; sequence comparison; similarity metric
Year: 2020 PMID: 33500773 PMCID: PMC7808057 DOI: 10.12688/f1000research.23390.2
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402