David Koslicki1. 1. Department of Mathematics, Pennsylvania State University, State College, PA 16801, USA. koslicki@math.psu.edu
Abstract
MOTIVATION: Topological entropy has been one of the most difficult to implement of all the entropy-theoretic notions. This is primarily due to finite sample effects and high-dimensionality problems. In particular, topological entropy has been implemented in previous literature to conclude that entropy of exons is higher than of introns, thus implying that exons are more 'random' than introns. RESULTS: We define a new approximation to topological entropy free from the aforementioned difficulties. We compute its expected value and apply this definition to the intron and exon regions of the human genome to observe that as expected, the entropy of introns are significantly higher than that of exons. We also find that introns are less random than expected: their entropy is lower than the computed expected value. We also observe the perplexing phenomena that introns on chromosome Y have atypically low and bimodal entropy, possibly corresponding to random sequences (high entropy) and sequences that posses hidden structure or function (low entropy). AVAILABILITY: A Mathematica implementation is available at http://www.math.psu.edu/koslicki/entropy.nb CONTACT: koslicki@math.psu.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Topological entropy has been one of the most difficult to implement of all the entropy-theoretic notions. This is primarily due to finite sample effects and high-dimensionality problems. In particular, topological entropy has been implemented in previous literature to conclude that entropy of exons is higher than of introns, thus implying that exons are more 'random' than introns. RESULTS: We define a new approximation to topological entropy free from the aforementioned difficulties. We compute its expected value and apply this definition to the intron and exon regions of the human genome to observe that as expected, the entropy of introns are significantly higher than that of exons. We also find that introns are less random than expected: their entropy is lower than the computed expected value. We also observe the perplexing phenomena that introns on chromosome Y have atypically low and bimodal entropy, possibly corresponding to random sequences (high entropy) and sequences that posses hidden structure or function (low entropy). AVAILABILITY: A Mathematica implementation is available at http://www.math.psu.edu/koslicki/entropy.nb CONTACT: koslicki@math.psu.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Runxia Liu; Yang-Hui Jimmy Yeh; Ales Varabyou; Jack A Collora; Scott Sherrill-Mix; C Conover Talbot; Sameet Mehta; Kristen Albrecht; Haiping Hao; Hao Zhang; Ross A Pollack; Subul A Beg; Rachela M Calvi; Jianfei Hu; Christine M Durand; Richard F Ambinder; Rebecca Hoh; Steven G Deeks; Jennifer Chiarella; Serena Spudich; Daniel C Douek; Frederic D Bushman; Mihaela Pertea; Ya-Chi Ho Journal: Sci Transl Med Date: 2020-05-13 Impact factor: 17.956
Authors: David Danko; Daniela Bezdan; Evan E Afshin; Sofia Ahsanuddin; Chandrima Bhattacharya; Daniel J Butler; Kern Rei Chng; Daisy Donnellan; Jochen Hecht; Katelyn Jackson; Katerina Kuchin; Mikhail Karasikov; Abigail Lyons; Lauren Mak; Dmitry Meleshko; Harun Mustafa; Beth Mutai; Russell Y Neches; Amanda Ng; Olga Nikolayeva; Tatyana Nikolayeva; Eileen Png; Krista A Ryon; Jorge L Sanchez; Heba Shaaban; Maria A Sierra; Dominique Thomas; Ben Young; Omar O Abudayyeh; Josue Alicea; Malay Bhattacharyya; Ran Blekhman; Eduardo Castro-Nallar; Ana M Cañas; Aspassia D Chatziefthimiou; Robert W Crawford; Francesca De Filippis; Youping Deng; Christelle Desnues; Emmanuel Dias-Neto; Marius Dybwad; Eran Elhaik; Danilo Ercolini; Alina Frolova; Dennis Gankin; Jonathan S Gootenberg; Alexandra B Graf; David C Green; Iman Hajirasouliha; Jaden J A Hastings; Mark Hernandez; Gregorio Iraola; Soojin Jang; Andre Kahles; Frank J Kelly; Kaymisha Knights; Nikos C Kyrpides; Paweł P Łabaj; Patrick K H Lee; Marcus H Y Leung; Per O Ljungdahl; Gabriella Mason-Buck; Ken McGrath; Cem Meydan; Emmanuel F Mongodin; Milton Ozorio Moraes; Niranjan Nagarajan; Marina Nieto-Caballero; Houtan Noushmehr; Manuela Oliveira; Stephan Ossowski; Olayinka O Osuolale; Orhan Özcan; David Paez-Espino; Nicolás Rascovan; Hugues Richard; Gunnar Rätsch; Lynn M Schriml; Torsten Semmler; Osman U Sezerman; Leming Shi; Tieliu Shi; Rania Siam; Le Huu Song; Haruo Suzuki; Denise Syndercombe Court; Scott W Tighe; Xinzhao Tong; Klas I Udekwu; Juan A Ugalde; Brandon Valentine; Dimitar I Vassilev; Elena M Vayndorf; Thirumalaisamy P Velavan; Jun Wu; María M Zambrano; Jifeng Zhu; Sibo Zhu; Christopher E Mason Journal: Cell Date: 2021-05-26 Impact factor: 41.582