| Literature DB >> 31695583 |
T H Noel Ellis1,2, Julie M I Hofer2, Martin T Swain3, Peter J van Dijk4.
Abstract
A controversy arose over Mendel's pea crossing experiments after the statistician R.A. Fisher proposed how these may have been performed and criticised Mendel's interpretation of his data. Here we re-examine Mendel's experiments and investigate Fisher's statistical criticisms of bias. We describe pea varieties available in Mendel's time and show that these could readily provide all the material Mendel needed for his experiments; the characters he chose to follow were clearly described in catalogues at the time. The combination of character states available in these varieties, together with Eichling's report of crosses Mendel performed, suggest that two of his F3 progeny test experiments may have involved the same F2 population, and therefore that these data should not be treated as independent variables in statistical analysis of Mendel's data. A comprehensive re-examination of Mendel's segregation ratios does not support previous suggestions that they differ remarkably from expectation. The χ2 values for his segregation ratios sum to a value close to the expectation and there is no deficiency of extreme segregation ratios. Overall the χ values for Mendel's segregation ratios deviate slightly from the standard normal distribution; this is probably because of the variance associated with phenotypic rather than genotypic ratios and because Mendel excluded some data sets with small numbers of progeny, where he noted the ratios "deviate not insignificantly" from expectation.Entities:
Keywords: Gregor Mendel; Pea varieties; RA Fisher; Statistical controversy
Mesh:
Year: 2019 PMID: 31695583 PMCID: PMC6823958 DOI: 10.1186/s41065-019-0111-y
Source DB: PubMed Journal: Hereditas ISSN: 0018-0661 Impact factor: 3.271
Fig. 1Frequency distribution of segregation ratios. The number of dominant homozygotes (AA) in the F2, m (x axis), among 100 F2 individuals of the dominant class (AA and Aa) (Additional file 1: Table S1.3) is plotted against the frequency with which this number is expected to occur. This frequency corresponds to the likelihood, L, (y-axis) with which that number occurs as determined by the binomial distribution; these values were obtained in Excel using the function BINOM.DIST. The values that Mendel obtained are ringed and the gene involved is indicated, see Ellis et al. [13] for an explanation of the gene symbols. The upper panel is the frequency expected given a 1: 2 ratio, and the lower panel is the frequency distribution based on Fisher’s ratio
Fig. 2Experimental design. This figure represents Mendel’s F2:F3 experiment to determine the genotype of individuals of the dominant (green) class in the F2 by examining the segregation of green vs yellow pods (Gp vs gp) in their F3 selfed progeny. The gene symbols are as described in Ellis et al. [13]
Anthocyanin pigmentation segregation in Mendel’s trifactorial experiment
The likelihoods of getting exactly the ratio AA: Aa as reported by Mendel were calculated as described by eq. 1 except that the value 100 is replaced by the number of segregants, given in the column “total”, and using the Excel function BINOM.DIST. The likelihoods (L) are listed in the columns “Likelihood from binomial assuming:” in which the two columns correspond to different expectations; the 1: 2 ratio, and Fisher’s ratio. The columns under “In 9 trials” give the corresponding likelihoods in a series of nine trials, there being nine categories in these data, this is estimated as [1 - (1 - L)9]
The shaded values (i) are the case in the most abundant class (Rr Ii, 127 plants) where there are fewer Aa plants than Fisher’s ratio predicts (78 vs 84.67), and (ii) the lowest likelihood of the nine AA: Aa segregation ratios (1.6% for the RR Ii class of 60 plants)
Fig. 3The frequency distribution of χ2 values. The frequency distribution of the proportion of χ2 values in a given range corresponding to phenotypic ratios in Mendel’s experiments is shown. The ranges of χ2 values are on the x axis and the frequency with which χ2 values in each range occur is on the y axis. For comparison the expected frequency distribution of χ2 values is also shown as shaded background bars. These are centred at the expected frequency and the three shades of grey correspond to ±1 (dark), 2 (lighter) or 3 (lightest) standard errors. Standard errors for the expected frequencies (p) were estimated as √(p(1-p)/N) where N = 88. Three sets χ2 values corresponding to Mendel’s data are plotted: Those with an open circle correspond to 64 values where the data were as Edward’s grouping, except that the data for Mendel’s experiments 1 and 2 are for the combined values. The second, marked as filled circles, correspond to 88 values which disaggregate the values of Mendel’s experiments 1 and 2 into all of the examples he gave for individual plants. The third set, with marked +, correspond to 84 values where experiments 1 and 2 disaggregate only the first 10 plants of each experiment as in Edwards [8]. The data corresponding to these values are given in Additional file 1: Table S1.3. The expected frequencies were calculated in excel using the function CHISQ.DIST(× 1,ν,c)) - CHISQ.DIST(× 2, ν,c)) where × 1 and × 2 correspond to the range of χ2 values, ν = 1 (degree of freedom) and c = 1 (cumulative distribution)