Literature DB >> 33041421

Goodness-of-fit filtering in classical metric multidimensional scaling with large datasets.

Jan Graffelman1,2.   

Abstract

Metric multidimensional scaling (MDS) is a widely used multivariate method with applications in almost all scientific disciplines. Eigenvalues obtained in the analysis are usually reported in order to calculate the overall goodness-of-fit of the distance matrix. In this paper, we refine MDS goodness-of-fit calculations, proposing additional point and pairwise goodness-of-fit statistics that can be used to filter poorly represented observations in MDS maps. The proposed statistics are especially relevant for large data sets that contain outliers, with typically many poorly fitted observations, and are helpful for improving MDS output and emphasising the most important features of the dataset. Several goodness-of-fit statistics are considered, and both Euclidean and non-Euclidean distance matrices are considered. Some examples with data from demographic, genetic and geographic studies are shown.

Entities:  

Keywords:  Manhattan distance; Plot brushing; allele sharing distance; attractor point; eigenvalue; outlier

Year:  2019        PMID: 33041421      PMCID: PMC7539904          DOI: 10.1080/02664763.2019.1702929

Source DB:  PubMed          Journal:  J Appl Stat        ISSN: 0266-4763            Impact factor:   1.404


  9 in total

1.  Inference of unexpected genetic relatedness among individuals in HapMap Phase III.

Authors:  Trevor J Pemberton; Chaolong Wang; Jun Z Li; Noah A Rosenberg
Journal:  Am J Hum Genet       Date:  2010-10-08       Impact factor: 11.025

2.  Calibration of multivariate scatter plots for exploratory analysis of relations within and between sets of variables in genomic research.

Authors:  Jan Graffelman; Fred van Eeuwijk
Journal:  Biom J       Date:  2005-12       Impact factor: 2.207

3.  Genotype, haplotype and copy-number variation in worldwide human populations.

Authors:  Mattias Jakobsson; Sonja W Scholz; Paul Scheet; J Raphael Gibbs; Jenna M VanLiere; Hon-Chung Fung; Zachary A Szpiech; James H Degnan; Kai Wang; Rita Guerreiro; Jose M Bras; Jennifer C Schymick; Dena G Hernandez; Bryan J Traynor; Javier Simon-Sanchez; Mar Matarin; Angela Britton; Joyce van de Leemput; Ian Rafferty; Maja Bucan; Howard M Cann; John A Hardy; Noah A Rosenberg; Andrew B Singleton
Journal:  Nature       Date:  2008-02-21       Impact factor: 49.962

4.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

5.  Goodness-of-Fit Assessment in Multidimensional Scaling and Unfolding.

Authors:  Patrick Mair; Ingwer Borg; Thomas Rusch
Journal:  Multivariate Behav Res       Date:  2016-11-01       Impact factor: 5.923

6.  Comparing spatial maps of human population-genetic variation using Procrustes analysis.

Authors:  Chaolong Wang; Zachary A Szpiech; James H Degnan; Mattias Jakobsson; Trevor J Pemberton; John A Hardy; Andrew B Singleton; Noah A Rosenberg
Journal:  Stat Appl Genet Mol Biol       Date:  2010-01-27

7.  Genome-wide association analysis of metabolic traits in a birth cohort from a founder population.

Authors:  Chiara Sabatti; Susan K Service; Anna-Liisa Hartikainen; Anneli Pouta; Samuli Ripatti; Jae Brodsky; Chris G Jones; Noah A Zaitlen; Teppo Varilo; Marika Kaakinen; Ulla Sovio; Aimo Ruokonen; Jaana Laitinen; Eveliina Jakkula; Lachlan Coin; Clive Hoggart; Andrew Collins; Hannu Turunen; Stacey Gabriel; Paul Elliot; Mark I McCarthy; Mark J Daly; Marjo-Riitta Järvelin; Nelson B Freimer; Leena Peltonen
Journal:  Nat Genet       Date:  2008-12-07       Impact factor: 38.330

8.  Population structure in a comprehensive genomic data set on human microsatellite variation.

Authors:  Trevor J Pemberton; Michael DeGiorgio; Noah A Rosenberg
Journal:  G3 (Bethesda)       Date:  2013-05-20       Impact factor: 3.154

9.  Multidimensional scaling for large genomic data sets.

Authors:  Jengnan Tzeng; Henry Horng-Shing Lu; Wen-Hsiung Li
Journal:  BMC Bioinformatics       Date:  2008-04-04       Impact factor: 3.169

  9 in total
  1 in total

1.  Transcriptomic analysis of resistance and short-term induction response to pyrethroids, in Anopheles coluzzii legs.

Authors:  M Kefi; J Charamis; V Balabanidou; P Ioannidis; H Ranson; V A Ingham; J Vontas
Journal:  BMC Genomics       Date:  2021-12-13       Impact factor: 3.969

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.