| Literature DB >> 20302660 |
Liam H Childs1, Hanna Witucka-Wall, Torsten Günther, Ronan Sulpice, Maria V Korff, Mark Stitt, Dirk Walther, Karl J Schmid, Thomas Altmann.
Abstract
BACKGROUND: Natural accessions of Arabidopsis thaliana are characterized by a high level of phenotypic variation that can be used to investigate the extent and mode of selection on the primary metabolic traits. A collection of 54 A. thaliana natural accession-derived lines were subjected to deep genotyping through Single Feature Polymorphism (SFP) detection via genomic DNA hybridization to Arabidopsis Tiling 1.0 Arrays for the detection of selective sweeps, and identification of associations between sweep regions and growth-related metabolic traits.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20302660 PMCID: PMC2850358 DOI: 10.1186/1471-2164-11-188
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1The density of SFPs and genes across each chromosome of . SFP density is calculated for all accessions per base pair (bp) using a sliding window of 50 kbp and the density at each position is plotted on a log scale (shown in black). The gene density, calculated as coding bp per every 50,000 genomic bp is also calculated (shown in light blue). The olive regions show the position of the centromeres. Deletions are shown in red and are defined as 10 probes in a row that were called SFPs (~350 bp). In the figure, the number of deleted bp calculated in a 50 kbp window. We have defined a duplication to be at least 10 probes in a row with a log2-fold change of greater than 1 (2-fold increase in signal). In this figure, the number of duplicated bp is calculated in 50 kb windows across the chromosome (green).
Figure 2The presence of SFPs in various genomic structural features is shown as the number of SFPs per base pair for each entity. The spread of observed values across all accessions is shown in black. The blue band shows the range between the 1st and 3rd quartile while the red line shows the median value. A chi-squared test is used to determine whether the observed amount of SFPs per class was close to the expected number due to random chance. All classes are significantly greater than or less that random (p < 0.05) except for the classes 'intron', 'snRNA' and 'snoRNA'.
Figure 3Number of SFPs in selected gene families. Of the available gene families in Additional File 1, Table S3.1, 13 are shown here that have been investigated in previous studies for comparison purposes. The observed number (black) of SFPs in a gene family is compared to the expected number (red) due to random chance. Gene families are sorted by the significance of the difference between observed and expected SFP numbers. All gene families show significantly different observed SFPs than expected except for RING genes (Additional File 1, Table S3.1). Deletions follow a similar pattern to SFPs by removing the most nucleotides from resistance, F-box and receptor-like kinase genes but differ by removing fewer nucleotides from cytoplasmic ribosomal genes. Duplications rarely occur in any of the chosen gene families. Although they do occur in resistance genes, the difference between observed and expected is not significant.
Overview of sweep-candidate regions used for the association analysis
| Name | Focal position | From | To | Length of avg. haplotype sharing | |
|---|---|---|---|---|---|
| 1 | 12,104,888 | 12,102,020 | 12,108,376 | 6,356 | |
| 1 | 23,826,967 | 23,824,815 | 23,829,023 | 4,208 | |
| 3 | 16,905,787 | 16,905,273 | 16,906,447 | 1,174 | |
| 3 | 17,859,222 | 17,858,441 | 17,860,021 | 1,580 | |
| 4 | 198,623 | 195,197 | 201,162 | 5,965 |
Candidates were called from proportion of high PHS in a sliding window of 2000 SFPs (threshold = 45%). The focal SFP is the highest scoring SFP in the candidate region.
Figure 4Map over all sweep candidates. The graph shows the density of significantly scoring SFPs measured by the proportion of significant SFPs in a sliding window (windows size of 1000 SFPs, offset between windows of 50 SFPs) across the genome. The dashed line represents the cut-off for the definition of sweep candidates. Sweep candidate regions (SR) are denoted in red.