MOTIVATION: RNA expression signals detected by high-density genomic tiling microarrays contain comprehensive transcriptomic information of the target organism. Current methods for determining the RNA transcription units are still computation intense and lack the discriminative power. This article describes an efficient and accurate methodology to reveal complicated transcriptional architecture, including small regulatory RNAs, in microbial transcriptome profiles. RESULTS: Normalized microarray data were first subject to support vector regression to estimate the profile tendency by reducing noise interruption. A hybrid supervised machine learning algorithm, hidden Markov support vector machines, was then used to classify the underlying state of each probe to 'expression' or 'silence' with the assumption that the consecutive state sequence was a heterogeneous Markov chain. For model construction, we introduced a profile geometry learning method to construct the feature vectors, which considered both intensity profiles and changes of intensities over the probe spacing. Also, a robust strategy was used to dynamically evaluate and select the training set based only on prior computer gene annotation. The algorithm performed better than other methods in accuracy on simulated data, especially for small expressed regions with lower (<1) SNR (signal-to-noise ratio), hence more sensitive for detecting small RNAs. AVAILABILITY AND IMPLEMENTATION: Detail implementation steps of the algorithm and the complete result of the transcriptome analysis for a microbial genome Porphyromonas gingivalis W83 can be viewed at http://bioinformatics.forsyth.org/mtd.
MOTIVATION: RNA expression signals detected by high-density genomic tiling microarrays contain comprehensive transcriptomic information of the target organism. Current methods for determining the RNA transcription units are still computation intense and lack the discriminative power. This article describes an efficient and accurate methodology to reveal complicated transcriptional architecture, including small regulatory RNAs, in microbial transcriptome profiles. RESULTS: Normalized microarray data were first subject to support vector regression to estimate the profile tendency by reducing noise interruption. A hybrid supervised machine learning algorithm, hidden Markov support vector machines, was then used to classify the underlying state of each probe to 'expression' or 'silence' with the assumption that the consecutive state sequence was a heterogeneous Markov chain. For model construction, we introduced a profile geometry learning method to construct the feature vectors, which considered both intensity profiles and changes of intensities over the probe spacing. Also, a robust strategy was used to dynamically evaluate and select the training set based only on prior computer gene annotation. The algorithm performed better than other methods in accuracy on simulated data, especially for small expressed regions with lower (<1) SNR (signal-to-noise ratio), hence more sensitive for detecting small RNAs. AVAILABILITY AND IMPLEMENTATION: Detail implementation steps of the algorithm and the complete result of the transcriptome analysis for a microbial genome Porphyromonas gingivalis W83 can be viewed at http://bioinformatics.forsyth.org/mtd.
Authors: Thomas E Royce; Joel S Rozowsky; Paul Bertone; Manoj Samanta; Viktor Stolc; Sherman Weissman; Michael Snyder; Mark Gerstein Journal: Trends Genet Date: 2005-08 Impact factor: 11.639
Authors: Jiang Du; Joel S Rozowsky; Jan O Korbel; Zhengdong D Zhang; Thomas E Royce; Martin H Schultz; Michael Snyder; Mark Gerstein Journal: Bioinformatics Date: 2006-10-12 Impact factor: 6.937
Authors: Lei Li; Xiangfeng Wang; Rajkumar Sasidharan; Viktor Stolc; Wei Deng; Hang He; Jan Korbel; Xuewei Chen; Waraporn Tongprasit; Pamela Ronald; Runsheng Chen; Mark Gerstein; Xing Wang Deng Journal: PLoS One Date: 2007-03-14 Impact factor: 3.240
Authors: Roger Junges; Gabriela Salvadori; Sudhanshu Shekhar; Heidi A Åmdal; Jimstan N Periselneris; Tsute Chen; Jeremy S Brown; Fernanda C Petersen Journal: mSphere Date: 2017-09-13 Impact factor: 4.389
Authors: Rabia Khan; Roger Junges; Heidi A Åmdal; Tsute Chen; Donald A Morrison; Fernanda C Petersen Journal: Sci Rep Date: 2017-07-20 Impact factor: 4.379