| Literature DB >> 32968428 |
John L Spouge1, Joseph M Ziegelbauer2, Mileidy Gonzalez3.
Abstract
BACKGROUND: Data about herpesvirus microRNA motifs on human circular RNAs suggested the following statistical question. Consider independent random counts, not necessarily identically distributed. Conditioned on the sum, decide whether one of the counts is unusually large. Exact computation of the p-value leads to a specific algorithmic problem. Given n elements g 0 , g 1 , … , g n - 1 in a set G with the closure and associative properties and a commutative product without inverses, compute the jackknife (leave-one-out) products g ¯ j = g 0 g 1 ⋯ g j - 1 g j + 1 ⋯ g n - 1 ( 0 ≤ j < n ).Entities:
Keywords: Commutative semigroup; Data structure; Jackknife products; Leave-one-out; Segment tree
Year: 2020 PMID: 32968428 PMCID: PMC7502207 DOI: 10.1186/s13015-020-00178-x
Source DB: PubMed Journal: Algorithms Mol Biol ISSN: 1748-7188 Impact factor: 1.405
Fig. 1A schematic diagram of herpesvirus miRNA motif occurring on a human circRNA. As indicated in the legend, each thin circle represents a circRNA; each thick line segment, the occurrence of a miRNA motif on the corresponding circRNA. Both circRNAs and the miRNA motif have nucleotide sequences represented by IUPAC codes (A, C, G, U). This figure illustrates occurrences of a single miRNA motif (e.g., UUACAGG) on the circRNAs. The biological question is: “does any circRNA have too many occurrences of the motif to be explained by chance alone?” In the actual application, the circRNAs ranged in length from 69 nt to 158565 nt
Fig. 2A (rootless) segment tree. This figure illustrates the rootless segment tree constructed in the upward phase of the Jackknife Product algorithm. The commutative semigroup illustrated is the set of nonnegative integers under addition. The bottom row of squares () contains (). In the next row up, as indicated by the arrow pairs leading into each circle, the array contains consecutive sums of consecutive disjoint pairs in , e.g., . The rest of the segment tree is constructed recursively upward to , just as was constructed from . Here, 2 copies of the additive identity pad out on the right. Padded on the right, the copies contribute literally nothing to the segment tree above them. Their non-contributions have dotted outlines