MOTIVATION: Multidimensional scaling (MDS) is a well-known multivariate statistical analysis method used for dimensionality reduction and visualization of similarities and dissimilarities in multidimensional data. The advantage of MDS with respect to singular value decomposition (SVD) based methods such as principal component analysis is its superior fidelity in representing the distance between different instances specially for high-dimensional geometric objects. Here, we investigate the importance of the choice of initial conditions for MDS, and show that SVD is the best choice to initiate MDS. Furthermore, we demonstrate that the use of the first principal components of SVD to initiate the MDS algorithm is more efficient than an iteration through all the principal components. Adding stochasticity to the molecular dynamics simulations typically used for MDS of large datasets, contrary to previous suggestions, likewise does not increase accuracy. Finally, we introduce a k nearest neighbor method to analyze the local structure of the geometric objects and use it to control the quality of the dimensionality reduction. RESULTS: We demonstrate here the, to our knowledge, most efficient and accurate initialization strategy for MDS algorithms, reducing considerably computational load. SVD-based initialization renders MDS methodology much more useful in the analysis of high-dimensional data such as functional genomics datasets.
MOTIVATION: Multidimensional scaling (MDS) is a well-known multivariate statistical analysis method used for dimensionality reduction and visualization of similarities and dissimilarities in multidimensional data. The advantage of MDS with respect to singular value decomposition (SVD) based methods such as principal component analysis is its superior fidelity in representing the distance between different instances specially for high-dimensional geometric objects. Here, we investigate the importance of the choice of initial conditions for MDS, and show that SVD is the best choice to initiate MDS. Furthermore, we demonstrate that the use of the first principal components of SVD to initiate the MDS algorithm is more efficient than an iteration through all the principal components. Adding stochasticity to the molecular dynamics simulations typically used for MDS of large datasets, contrary to previous suggestions, likewise does not increase accuracy. Finally, we introduce a k nearest neighbor method to analyze the local structure of the geometric objects and use it to control the quality of the dimensionality reduction. RESULTS: We demonstrate here the, to our knowledge, most efficient and accurate initialization strategy for MDS algorithms, reducing considerably computational load. SVD-based initialization renders MDS methodology much more useful in the analysis of high-dimensional data such as functional genomics datasets.
Authors: Angela L Rasmussen; Nicolas Tchitchek; Nathan J Susnow; Alexei L Krasnoselsky; Deborah L Diamond; Matthew M Yeh; Sean C Proll; Marcus J Korth; Kathie-Anne Walters; Sharon Lederer; Anne M Larson; Robert L Carithers; Arndt Benecke; Michael G Katze Journal: Hepatology Date: 2012-06-05 Impact factor: 17.425
Authors: Angela L Rasmussen; Nicolas Tchitchek; David Safronetz; Victoria S Carter; Christopher M Williams; Elaine Haddock; Marcus J Korth; Heinz Feldmann; Michael G Katze Journal: J Virol Date: 2014-12-17 Impact factor: 5.103
Authors: Deborah L Diamond; Alexei L Krasnoselsky; Kristin E Burnum; Matthew E Monroe; Bobbie-Jo Webb-Robertson; Jason E McDermott; Matthew M Yeh; Jose Felipe Golib Dzib; Nathan Susnow; Susan Strom; Sean C Proll; Sarah E Belisle; David E Purdy; Angela L Rasmussen; Kathie-Anne Walters; Jon M Jacobs; Marina A Gritsenko; David G Camp; Renuka Bhattacharya; James D Perkins; Robert L Carithers; Iris W Liou; Anne M Larson; Arndt Benecke; Katrina M Waters; Richard D Smith; Michael G Katze Journal: Hepatology Date: 2012-04-24 Impact factor: 17.425
Authors: Pierre Roques; Karl Ljungberg; Beate M Kümmerer; Leslie Gosse; Nathalie Dereuddre-Bosquet; Nicolas Tchitchek; David Hallengärd; Juan García-Arriaza; Andreas Meinke; Mariano Esteban; Andres Merits; Roger Le Grand; Peter Liljeström Journal: JCI Insight Date: 2017-03-23
Authors: Vadim Sharov; Veronica V Rezelj; Vladimir V Galatenko; Avi Titievsky; Julia Panov; Konstantin Chumakov; Raul Andino; Marco Vignuzzi; Leonid Brodsky Journal: J Virol Date: 2021-09-01 Impact factor: 5.103
Authors: Tokiko Watanabe; Jennifer Tisoncik-Go; Nicolas Tchitchek; Shinji Watanabe; Arndt G Benecke; Michael G Katze; Yoshihiro Kawaoka Journal: J Virol Date: 2013-02-28 Impact factor: 5.103
Authors: Nicolas Tchitchek; Amie J Eisfeld; Jennifer Tisoncik-Go; Laurence Josset; Lisa E Gralinski; Christophe Bécavin; Susan C Tilton; Bobbie-Jo Webb-Robertson; Martin T Ferris; Allison L Totura; Chengjun Li; Gabriele Neumann; Thomas O Metz; Richard D Smith; Katrina M Waters; Ralph Baric; Yoshihiro Kawaoka; Michael G Katze Journal: BMC Syst Biol Date: 2013-07-29