Literature DB >> 24222034

Optimization and testing of mass spectral library search algorithms for compound identification.

S E Stein1, D R Scott.   

Abstract

Five algorithms proposed in the literature for library search identification of unknown compounds from their low resolution mass spectra were optimized and tested by matching test spectra against reference spectra in the NIST-EPA-NIH Mass Spectral Database. The algorithms were probability-based matching (PBM), dot-product, Hertz et al. similarity index, Euclidean distance, and absolute value distance. The test set consisted of 12,592 alternate spectra of about 8000 compounds represented in the database. Most algorithms were optimized by varying their mass weighting and intensity scaling factors. Rank in the list of candidatc compounds was used as the criterion for accuracy. The best performing algorithm (75% accuracy for rank 1) was the dot-product function that measures the cosine of the angle between spectra represented as vectors. Other methods in order of performance were the Euclidean distance (72%), absolute value distance (68%) PBM (65%), and Hertz et al. (64%). Intensity scaling and mass weighting were important in the optimized algorithms with the square root of the intensity scale nearly optimal and the square or cube the best mass weighting power. Several more complex schemes also were tested, but had little effect on the results. A modest improvement in the performance of the dot-product algorithm was made by adding a term that gave additional weight to relative peak intensities for spectra with many peaks in common.

Year:  1994        PMID: 24222034     DOI: 10.1016/1044-0305(94)87009-8

Source DB:  PubMed          Journal:  J Am Soc Mass Spectrom        ISSN: 1044-0305            Impact factor:   3.109


  3 in total

1.  Comparative evaluations of mass spectral data bases.

Authors:  F W McLafferty; D B Stauffer; S Y Loh
Journal:  J Am Soc Mass Spectrom       Date:  1991-09       Impact factor: 3.109

2.  An enlarged data base of electron-ionization mass spectra.

Authors:  F W McLafferty; D B Stauffer; A B Twiss-Brooks; S Y Loh
Journal:  J Am Soc Mass Spectrom       Date:  1991-09       Impact factor: 3.109

3.  Estimating probabilities of correct identification from results of mass spectral library searches.

Authors:  S E Stein
Journal:  J Am Soc Mass Spectrom       Date:  1994-04       Impact factor: 3.109

  3 in total
  165 in total

Review 1.  Unknown identification using reference mass spectra. Quality evaluation of databases.

Authors:  F W McLafferty; D A Stauffer; S Y Loh; C Wesdemiotis
Journal:  J Am Soc Mass Spectrom       Date:  1999-12       Impact factor: 3.109

2.  Comparing similar spectra: from similarity index to spectral contrast angle.

Authors:  Katty X Wan; Ilan Vidavsky; Michael L Gross
Journal:  J Am Soc Mass Spectrom       Date:  2002-01       Impact factor: 3.109

3.  A new matching algorithm for high resolution mass spectra.

Authors:  Michael Edberg Hansen; Jørn Smedsgaard
Journal:  J Am Soc Mass Spectrom       Date:  2004-08       Impact factor: 3.109

4.  On the normalization of a mass spectrum for comparison of two spectra.

Authors:  Zeev B Alfassi
Journal:  J Am Soc Mass Spectrom       Date:  2004-03       Impact factor: 3.109

5.  Protein turnover quantification in a multilabeling approach: from data calculation to evaluation.

Authors:  Christian Trötschel; Stefan P Albaum; Daniel Wolff; Simon Schröder; Alexander Goesmann; Tim W Nattkemper; Ansgar Poetsch
Journal:  Mol Cell Proteomics       Date:  2012-04-06       Impact factor: 5.911

6.  A method of finding optimal weight factors for compound identification in gas chromatography-mass spectrometry.

Authors:  Seongho Kim; Imhoi Koo; Xiaoli Wei; Xiang Zhang
Journal:  Bioinformatics       Date:  2012-02-13       Impact factor: 6.937

7.  A large scale test dataset to determine optimal retention index threshold based on three mass spectral similarity measures.

Authors:  Jun Zhang; Imhoi Koo; Bing Wang; Qing-Wei Gao; Chun-Hou Zheng; Xiang Zhang
Journal:  J Chromatogr A       Date:  2012-06-19       Impact factor: 4.759

8.  Systematic structural characterization of metabolites in Arabidopsis via candidate substrate-product pair networks.

Authors:  Kris Morreel; Yvan Saeys; Oana Dima; Fachuang Lu; Yves Van de Peer; Ruben Vanholme; John Ralph; Bartel Vanholme; Wout Boerjan
Journal:  Plant Cell       Date:  2014-03-31       Impact factor: 11.277

9.  A ranking-based scoring function for peptide-spectrum matches.

Authors:  Ari M Frank
Journal:  J Proteome Res       Date:  2009-05       Impact factor: 4.466

Review 10.  Identification of small molecules using accurate mass MS/MS search.

Authors:  Tobias Kind; Hiroshi Tsugawa; Tomas Cajka; Yan Ma; Zijuan Lai; Sajjan S Mehta; Gert Wohlgemuth; Dinesh Kumar Barupal; Megan R Showalter; Masanori Arita; Oliver Fiehn
Journal:  Mass Spectrom Rev       Date:  2017-04-24       Impact factor: 10.946

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.