Literature DB >> 35390435

Variances and covariances of linear summary statistics of segregating sites.

Yun-Xin Fu1.   

Abstract

Each mutation in a population sample of DNA sequences can be classified by the number of sequences that inherit the mutant nucleotide, the resulting frequencies are known as mutations of different sizes or site frequency spectrum. Many summary statistics can be defined as a linear function of these frequencies. A flexible class of such linear summary statistics is explored analytically in this paper which include several well-known quantities, such as the number of segregating sizes and the mean number of nucleotide differences between two sequences. Some asymptotic variances and covariances are obtained while the analytical formulas for the variances and covariances of nine such linear summary statistics are derived, most of which are unknown to date. This study not only provides some theoretical foundations for exploring linear summary statistics, but also provides some newlinear summary statistics that may be utilized for analyzing sample polymorphism. Furthermore it is showed that a newly developed linear summary statistics has a smaller variance almost uniformly than Watterson's estimator, and that a class of linear summary statistics given too heavy weights on mutations of smaller sizes result in asymptotically non-zero variance.
Copyright © 2022. Published by Elsevier Inc.

Entities:  

Keywords:  -statistics; Coalescent; Linear summary statistics; Mutation size; Segregating sites; Variance and covariance

Mesh:

Substances:

Year:  2022        PMID: 35390435      PMCID: PMC9584357          DOI: 10.1016/j.tpb.2022.03.005

Source DB:  PubMed          Journal:  Theor Popul Biol        ISSN: 0040-5809            Impact factor:   1.514


  20 in total

1.  No BLUE among phylogenetic estimators.

Authors:  P Joyce
Journal:  J Math Biol       Date:  1999-11       Impact factor: 2.259

2.  Hitchhiking under positive Darwinian selection.

Authors:  J C Fay; C I Wu
Journal:  Genetics       Date:  2000-07       Impact factor: 4.562

3.  On the number of segregating sites in genetical models without recombination.

Authors:  G A Watterson
Journal:  Theor Popul Biol       Date:  1975-04       Impact factor: 1.570

4.  Statistical tests for detecting positive selection by utilizing high-frequency variants.

Authors:  Kai Zeng; Yun-Xin Fu; Suhua Shi; Chung-I Wu
Journal:  Genetics       Date:  2006-09-01       Impact factor: 4.562

5.  Genealogical structure among alleles regulating self-incompatibility in natural populations of flowering plants.

Authors:  M K Uyenoyama
Journal:  Genetics       Date:  1997-11       Impact factor: 4.562

6.  Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection.

Authors:  Y X Fu
Journal:  Genetics       Date:  1997-10       Impact factor: 4.562

7.  Large numbers of vertebrates began rapid population decline in the late 19th century.

Authors:  Haipeng Li; Jinggong Xiang-Yu; Guangyi Dai; Zhili Gu; Chen Ming; Zongfeng Yang; Oliver A Ryder; Wen-Hsiung Li; Yun-Xin Fu; Ya-Ping Zhang
Journal:  Proc Natl Acad Sci U S A       Date:  2016-11-21       Impact factor: 11.205

8.  Statistical properties of segregating sites.

Authors:  Y X Fu
Journal:  Theor Popul Biol       Date:  1995-10       Impact factor: 1.570

9.  Maximum likelihood estimation of population parameters.

Authors:  Y X Fu; W H Li
Journal:  Genetics       Date:  1993-08       Impact factor: 4.562

10.  Evolutionary relationship of DNA sequences in finite populations.

Authors:  F Tajima
Journal:  Genetics       Date:  1983-10       Impact factor: 4.562

View more
  1 in total

1.  Approximations to the expectations and variances of ratios of tree properties under the coalescent.

Authors:  Egor Lappo; Noah A Rosenberg
Journal:  G3 (Bethesda)       Date:  2022-09-30       Impact factor: 3.542

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.