| Literature DB >> 21364827 |
Takashi Hagiwara, Seiji Saito, Yoshifumi Ujiie, Kensaku Imai, Masanori Kakuta, Koji Kadota, Tohru Terada, Kazuya Sumikoshi, Kentaro Shimizu, Tatsunari Nishi.
Abstract
Liquid Chromatography Time-of-Flight Mass Spectrometry (LC-TOF-MS) is widely used for profiling metabolite compounds. LC-TOF-MS is a chemical analysis technique that combines the physical separation capabilities of high-pressure liquid chromatography (HPLC) with the mass analysis capabilities of Time-of-Flight Mass Spectrometry (TOF-MS) which utilizes the difference in the flight time of ions due to difference in the mass-to-charge ratio. Since metabolite compounds have various chemical characteristics, their precise identification is a crucial problem of metabolomics research. Contemporaneously analyzed reference standards are commonly required for mass spectral matching and retention time matching, but there are far fewer reference standards than there are compounds in the organism. We therefore developed a retention time prediction method for HPLC to improve the accuracy of identification of metabolite compounds. This method uses a combination of Support Vector Regression and Multiple Linear Regression adaptively to the measured retention time. We achieved a strong correlation (correlation coefficient = 0.974) between measured and predicted retention times for our experimental data. We also demonstrated a successful identification of an E. coli metabolite compound that cannot be identified by precise mass alone.Entities:
Year: 2010 PMID: 21364827 PMCID: PMC3055703 DOI: 10.6026/97320630005255
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Metabolome analysis system. The system first searches for candidate compounds whose masses are equal to the mass measured by LC-TOFMS from the compound databases. Next, it predicts the retention times of these candidate. When the measured retention time is less than the threshold T, SVR is used (a combination of Complexity, TPSA, and HB are used for input features) and when the measured retention time is larger than or equal to T, MLR is used (ASA and XLogP are used as independent variables). The compounds whose measured retention times are most similar to the predicted retention times are selected for identificatio
Figure 2Prediction performance of SVR. The figure shows correlation between the measured retention time and predicted retention time when SVR is used for all range of the retention time. The blue line indicates the threshold T.
Figure 3Prediction performance of the combined method This figure shows correlation between the measured retention time and predicted retention time when combining SVR (for retention time of less than 1.5 minutes) and MLR (retention time of over 1.5 minutes)