| Literature DB >> 32155892 |
Carla V Filippi1,2,3, Gabriela A Merino4,5, Juan F Montecchia1, Natalia C Aguirre1, Máximo Rivarola1, Guy Naamati3, Mónica I Fass1, Daniel Álvarez6, Julio Di Rienzo7, Ruth A Heinz1, Bruno Contreras Moreira3, Verónica V Lia1, Norma B Paniego1.
Abstract
Sunflower germplasm collections are valuable resources for broadening the genetic base of commercial hybrids and ameliorate the risk of climate events. Nowadays, the most studied worldwide sunflower pre-breeding collections belong to INTA (Argentina), INRA (France), and USDA-UBC (United States of America-Canada). In this work, we assess the amount and distribution of genetic diversity (GD) available within and between these collections to estimate the distribution pattern of global diversity. A mixed genotyping strategy was implemented, by combining proprietary genotyping-by-sequencing data with public whole-genome-sequencing data, to generate an integrative 11,834-common single nucleotide polymorphism matrix including the three breeding collections. In general, the GD estimates obtained were moderate. An analysis of molecular variance provided evidence of population structure between breeding collections. However, the optimal number of subpopulations, studied via discriminant analysis of principal components (K = 12), the bayesian STRUCTURE algorithm (K = 6) and distance-based methods (K = 9) remains unclear, since no single unifying characteristic is apparent for any of the inferred groups. Different overall patterns of linkage disequilibrium (LD) were observed across chromosomes, with Chr10, Chr17, Chr5, and Chr2 showing the highest LD. This work represents the largest and most comprehensive inter-breeding collection analysis of genomic diversity for cultivated sunflower conducted to date.Entities:
Keywords: breeding; genetic diversity; linkage disequilibrium; population structure; sunflower
Mesh:
Year: 2020 PMID: 32155892 PMCID: PMC7140877 DOI: 10.3390/genes11030283
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Initial characterization of the INTA ddRADseq matrix before filtering. (A) Allele frequency. AF_ALT, alternative allele (i.e., less frequent allele); AF_REF, reference allele (i.e., most frequent allele); (B) Percentage of missing data (x-axis: percentage); (C) Number of SNPs per tag, or sequenced region (110 bp).
Figure 2Distribution of SNPs along the 17 sunflower chromosomes. The colors indicate the number of markers within a 1 Mbp window.
Figure 3Variant effect predictor output for the 11,834 SNPs used in subsequent analysis. (A) Variant impact. (B) Variant consequences, according to genome site location.
Basic genetic diversity estimates within breeding collections.
|
| He | Ho | % of SNPs with MAF <0.05 | |||
|---|---|---|---|---|---|---|
| Min | Mean | Max | ||||
|
| 135 | 0.022 | 0.454 | 0.500 | 0.007 | - |
|
| 58 | 0.000 | 0.452 | 0.500 | 0.013 | 0.072 |
|
| 289 | 0.003 | 0.454 | 0.500 | 0.030 | 0.005 |
|
| 482 | 0.019 | 0.454 | 0.500 | 0.022 | 0.053 |
Figure 4Scatter plot of DAPC showing the first two principal components for K = 12. Dots represent accessions while the ellipses represent the 12 groups. Eigenvalues of the analysis are also displayed.
Percentage of identity between public sunflower inbred lines present in more than one breeding collection.
| Accession Name | Code 1 (USDA-UBC/INRA) | Code 2 (INTA) | % of Identity |
|---|---|---|---|
|
| SAM002 (USDA) | PMA102 (INTA) | 0.92 |
|
| SAM169 (USDA) | PMA55 (INTA) | 0.82 |
|
| SAM172 (USDA) | PMA124 (INTA) | 0.93 |
|
| SAM173 (USDA) | PMA78 (INTA) | 0.97 |
|
| SAM176 (USDA) | PMA80 (INTA) | 0.92 |
|
| SAM227 (USDA) | PMA97 (INTA) | 0.78 |
|
| SF268 (INRA) | PMA133 (INTA) | 0.94 |
|
| SF302 (INRA) | PMA132 (INTA) | 0.92 |
|
| SF330 (INRA) | PMA119 (INTA) | 0.92 |
|
| SF332 (INRA) | PMA123 (INTA) | 0.93 |
Figure 5Linkage disequilibrium (r2) vs. physical distance (bp) for the full panel of accessions. A cut-off line was plotted at r2 = 0.2. The blue line represents the y ~log(x) function. (A–Q) Sunflower chromosomes 1 to 17. CHR = chromosome