| Literature DB >> 20156342 |
Runchun Jing1, Alexander Vershinin, Jacek Grzebyta, Paul Shaw, Petr Smýkal, David Marshall, Michael J Ambrose, T H Noel Ellis, Andrew J Flavell.
Abstract
BACKGROUND: The genetic diversity of crop species is the result of natural selection on the wild progenitor and human intervention by ancient and modern farmers and breeders. The genomes of modern cultivars, old cultivated landraces, ecotypes and wild relatives reflect the effects of these forces and provide insights into germplasm structural diversity, the geographical dimension to species diversity and the process of domestication of wild organisms. This issue is also of great practical importance for crop improvement because wild germplasm represents a rich potential source of useful under-exploited alleles or allele combinations. The aim of the present study was to analyse a major Pisum germplasm collection to gain a broad understanding of the diversity and evolution of Pisum and provide a new rational framework for designing germplasm core collections of the genus.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20156342 PMCID: PMC2834689 DOI: 10.1186/1471-2148-10-44
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Examples of TAM arrays for RBIP markers scored in the JIC . Each array scores a single retrotransposon insertion polymorphism in 3029 Pisum DNAs. Markers illustrated [19] are as follows: A, UniTpv; B, 281 × 44; C, 1794-2; D, 95 × 25; E, Birte-x34; F, 64 × 11. The arrays also contain 672 blank spots (no sample arrayed) 128 positive controls 9 DNA duplicates and 42 PCR negative controls (mock DNA preparation with no input plant sample).
Markers used in this study
| Marker | Insert | Array Result | Used in this study? |
|---|---|---|---|
| 45 × 8 | Many missing scores, Cy3/Cy5 false positive problem | No | |
| 45 × 15 | Good | Yes | |
| 45 × 20 | Very weak signals, Cy3/Cy5 false positive problem | No | |
| 45 × 29 | Very weak signals, Cy3/Cy5 false positive problem | No | |
| 45 × 31 | Slight false positive problem | Yes | |
| 45 × 33 | Slight Cy5 false positive problem | Yes | |
| 45 × 38 | Very low polymorphism, Cy5 false positive problem | No | |
| 64 × 11 | Many weak or mixed Cy3/Cy5 scores | No | |
| 64 × 14 | Rather low polymorphism | Yes | |
| 64 × 15 | Many weak or mixed Cy3/Cy5 scores, Cy5 false positive problem | No | |
| 64 × 29 | Very low polymorphism, Cy3 false positive problem | No | |
| 64 × 40 | Many weak or mixed Cy3/Cy5 scores, false positive problem | No | |
| 64 × 45 | Many mixed Cy3/Cy5 scores, Cy5 false positive problem | No | |
| 64 × 74 | Many mixed Cy3/Cy5 scores, false positive problem | No | |
| 64 × 76 | Rather low polymorphism | Yes | |
| 95 × 2 | Slight Cy5 false positive problem | Yes | |
| 95 × 19 | Good | Yes | |
| 95 × 25 | Slight Cy5 false positive problem | Yes | |
| 95 × 43 | Good | Yes | |
| 261 × 1 | No polymorphism | No | |
| 261 × 13 | Cy5 false positive problem, very low polymorphism | No | |
| 281 × 1 | Slight Cy3 false positive problem | Yes | |
| 281 × 5 | Rather low polymorphism | Yes | |
| 281 × 16 | Good | Yes | |
| 281 × 40 | Slight Cy5 false positive problem | Yes | |
| 281 × 44 | Good | Yes | |
| 399-3-6 | Cy3 false positive problem, very low polymorphism | No | |
| 399-14-9 | Good | Yes | |
| 399-80-46 | Slight Cy5 false positive problem | Yes | |
| 399-9x | Slight Cy5 false positive problem | Yes | |
| 399 × 131 | Slight false positive problem | Yes | |
| 399 × 149 | Slight false positive problem | Yes | |
| 1006nr2 | Low polymorphism; Cy3 false positive problem | No | |
| 1006nr9 | Very poor image | No | |
| 1006nr13 | Good | Yes | |
| 1006nr27 | Good | Yes | |
| 1006nr32 | Weak signal; multiple spot colours | No | |
| 1006 × 6 | Slight Cy5 false positive problem | Yes | |
| 1006 × 19 | Low polymorphism; Cy5 false positives problem | No | |
| 1006 × 21 | Slight Cy3 false positive problem | Yes | |
| 1006 × 36 | Many signals weak or absent; Cy5 false positive problem | No | |
| 1006 × 50 | Some Cy5 background signal (Yellow spots) | Yes | |
| 1006 × 58 | Some Cy5 background signal (Yellow spots) | Yes | |
| 1794-1 | Good | Yes | |
| 1794-2 | Good | Yes | |
| 1794 × 7 | Slight Cy3 false positive problem | Yes | |
| 1794 × 9 | Very low polymorphism | No | |
| 1794 × 35 | Many signals absent; Cy5 background signal | No | |
| 2055nr1 | Good | Yes | |
| 2055nr16 | Low polymorphism, good otherwise | Yes | |
| 2055nr23 | Rather low polymorphism, good otherwise | Yes | |
| 2055nr51 | False positive problem, weak signals | No | |
| 2055nr53 | Slight Cy3 background signal | Yes | |
| 2055 × 10 | Many signals weak or absent; Cy5 false positive problem | No | |
| 2055 × 19 | Low polymorphism, weak signals, many mixed Cy3/Cy5 signals | No | |
| 2055 × 28 | Low polymorphism, weak signals, Cy3 false positive problem | No | |
| 2055 × 29 | Many signals weak or absent; Cy5 false positive problem | No | |
| 2055 × 36 | Cy3 false positive problem | No | |
| 2201CycL6 | Rather low polymorphism | Yes | |
| 2385 × 16 | Very low polymorphism, mixed Cy3/Cy5 signals, Cy3 false positive problem | No | |
| 2385 × 23 | Rather low polymorphism | Yes | |
| 2385 × 46 | Very weak signals, Cy5 false positive problem | No | |
| 2385 × 56 | Very weak signals, Cy3/Cy5 false positive problem | No | |
| 2385 × 64 | Rather low polymorphism | Yes | |
| 2539 × 7 | Some Cy5 contaminating signal | Yes | |
| 3150 × 11 | Almost no polymorphism | No | |
| Birte-B1 | Good | Yes | |
| Birte-x5 | Very low polymorphism, many very weak signals | No | |
| Birte-x16 | Good | Yes | |
| Birte-x28 | Some weak or mixed Cy3/Cy5 scores | Yes | |
| Birte-x34 | Good | Yes | |
| MKRBIP2 | Slight Cy5 contamination problem | Yes | |
| MKRBIP3 | Many missing scores | No | |
| MKRBIP7 | Good | Yes | |
| Cycl1074-L12 | Some Cy5 contaminating signal | Yes | |
| Cycl1074-L29 | Good | Yes | |
| Cycl711-L12 | Many missing scores, many mixed Cy3/Cy5 scores | No | |
| UniTpv | Good | Yes | |
| DBAP-261 × 1 | Indel | Very Low polymorphism | No |
| DBAP-2055nr53 | Indel | Very Low polymorphism | No |
The origin accession for each RBIP marker is given typically by the first number in the RBIP designation (ie marker 45 × 8 derived from JI45). Exceptions to this rule are MKRBIPs (which derived from mapping populations between accessions JI15, JI281 and JI399), Birte markers which derived from JI1068 [cv Birte] and Uni-Tpv which derived from cultivar Therese (not in the JI Collection).
Figure 2Exploration of K value for Structure analysis of the JIC . A. Mean values of the log likelihood of the data given K is plotted against K for 10 simulations with a burn-in of 10,000 runs followed by 10,000 MCMC runs. The error bars show the standard deviations of the mean values. B. Estimates of the rate of change of the slope of the log likelihood curve (ΔK) calculated according to [29] are plotted against K. K values of 3, 7 and 11 giving robust ΔK maxima were investigated further.
Figure 3Structure analysis of the JIC . The results of three Structure runs for K values of 3, 7 and 11 are shown, together with the means for 10 K = 3 runs and the 3 most reproducible K = 7 runs. For each run the fractional inferred ancestry of the 3029 individuals is plotted as a histogram, with each ancestral population colour-coded. Accessions are assigned to Sub-Group N if their average representation (QN) for that Sub-Group ≥ 0.5. Accessions in Groups 1 and 2 are ordered by decreasing Q and for group 3 by increasing Q. Admixtured accessions, where Q1+Q2 ≥ 0.5, are placed between groups 2 and 3, ordered by increasing Q3. For K = 7 the accessions are ordered within Sub-Groups in a similar way. For K = 11 there was no consistent assignment between Structure runs of accessions to groups; therefore the accession order is the same as for K = 3 (see also Additional file 4). At the bottom of the K = 3 figure the fraction of missing data per accession is plotted as a black vertical bar.
Figure 4Sub-structuring of K = 3 Structure groups and relationship to taxonomy and domestication traits. A: Nested Structure analysis for the 3 groups identified by the average of 10 independent K = 3 Structure simulations for the JIC germplasm (Figure 3) with K values of 6, 2 and 6 assigned to Groups 1-3 respectively. Accessions have been placed in order according to their average representation within the subgroups. B: Species assignments according to the key below. C: Cultivar assignments (each black bar is a cultivar). D: Distributions of important seed traits, indicated by colour according to the key below. Wrinkled includes, but is not limited to, rugosus mutants. E: Seed weights, F: Blow-up of Group 3 sub structuring (average of the 7 best correlated runs). G: Species assignments according to the key below. H: Identification of cv Afghanistan types according to [33]. Black bars are 'resistant' types (sym2) while grey bars are accessions show 'partial' resistance to nodulation by European Rhizobium strains [33]. I: I: Seed weights.
Figure 5Relationships between RBIP-based Structure Sub-Grouping and sequence-based tree structure. Accessions assigned to Sub-groups of the three K = 3 Groups (see Figure 4) are shown on a previous neighbour joining tree based on intron sequence [adapted from [17]]. Subgroup assignments are indicated by box fill colours, Group assignments are shown by box outline colours and taxonomic (species/sub-species) assignments by colour coded text, using colours from Figure 4. 'Unassigned' Subgroups U, 1.U and 3.U (white boxes) are comprised of accessions with no founder genotypic contribution of 50% or more in the corresponding Structure run. Subgroups not represented in the tree (Subgroups 1.1, 1.6 and 3.3) are unboxed.
Figure 6Distance-based estimations for Structure Sub-Groups. A neighbor joining tree of the genetic distance between Sub-Groups was calculated from the combined allele frequencies per Sub-Group, using Nei's method [40]. Structure Sub-groups are colour coded as in Figure 4 and predominant species in each Sub-Group (Additional file 2) are indicated by coloured text using the same species-specific colours as Figure 4.
Figure 7Multifactorial analysis of . For the entire data set the fraction of shared alleles for all pair-wise combinations of samples is analysed by multidimensional scaling. The output for the first two dimensions (explaining 5.32% and 2.99% of the variation are shown (see text). A: All points are plotted with each sample is colour-coded according to its corresponding sub-group membership. B: The mean of all values for each sub group, together with standard deviations are plotted. Sub Groups are colour coded as shown below. The subgroups of Group 1 lie so close together that they are not individually identified but collectively represented in red. All points are plotted in grey and the axes are removed for clarity.
Figure 8Geographical Distribution of Structure Sub-Groups of the JIC . Data are shown for all donor countries contributing 10 or more accessions in the JIC Pisum Collection. Each piechart refers to a single country, the area of the piechart reflects the number of accessions derived from that country and the corresponding compositions by Structure Sub-Group (colour-coded as in Figures 4-6) are indicated. The map is derived from [41].