Literature DB >> 18177672

The performances of the chi-square test and complexity measures for signal recognition in biological sequences.

Leila Pirhaji1, Mehdi Kargar, Armita Sheari, Hadi Poormohammadi, Mehdi Sadeghi, Hamid Pezeshk, Changiz Eslahchi.   

Abstract

With large amounts of experimental data, modern molecular biology needs appropriate methods to deal with biological sequences. In this work, we apply a statistical method (Pearson's chi-square test) to recognize the signals appear in the whole genome of the Escherichia coli. To show the effectiveness of the method, we compare the Pearson's chi-square test with linguistic complexity on the complete genome of E. coli. The results suggest that Pearson's chi-square test is an efficient method for distinguishing genes (coding regions) form pseudogenes (noncoding regions). On the other hand, the performance of the linguistic complexity is much lower than the chi-square test method. We also use the Pearson's chi-square test method to determine which parts of the Open Reading Frame (ORF) have significant effect on discriminating genes form pseudogenes. Moreover, different complexity measures and Pearson's chi-square test applied on the genes with high value of Pearson's chi-square statistic. We also compute the measures on homologous of these genes. The results illustrate that there is a region near the start codon with high value of chi-square statistic and low complexity that is conserve between homologous genes.

Entities:  

Mesh:

Year:  2007        PMID: 18177672     DOI: 10.1016/j.jtbi.2007.11.021

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  3 in total

Review 1.  Information theory applications for biological sequence analysis.

Authors:  Susana Vinga
Journal:  Brief Bioinform       Date:  2013-09-20       Impact factor: 11.622

2.  Using the Symptom Patient Similarity Network to Explore the Difference between the Chinese and Western Medicine Pathways of Ischemic Stroke and its Comorbidities.

Authors:  Lunzhong Zhang; Shu Han; Manli Zhao; Runshun Zhang; Xuebin Zhang; Jing Zhang; Xiaoqing Liu; Yuyao He; Zhao He; Yunfang Dong; Xiaoying Hou; Zijun Mou; Liyun He; Hong Zhou; Jie Yang; Xingyan Huang; Yanjie Hu; Yuefeng Zhang; Lili Zhang; Zhengguang Chen; Xiaozhen Li; Yan Tan; Kegang Cao; Wei Meng; Liqun Zhong
Journal:  Evid Based Complement Alternat Med       Date:  2021-12-01       Impact factor: 2.629

3.  DNA sequences at a glance.

Authors:  Armando J Pinho; Sara P Garcia; Diogo Pratas; Paulo J S G Ferreira
Journal:  PLoS One       Date:  2013-11-21       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.