Literature DB >> 25014030

A high-density SNP Map of sunflower derived from RAD-sequencing facilitating fine-mapping of the rust resistance gene R12.

Zahirul I Talukder1, Li Gong2, Brent S Hulke3, Venkatramana Pegadaraju4, Qijian Song5, Quentin Schultz4, Lili Qi3.   

Abstract

A high-resolution genetic map of sunflower was constructed by integrating SNP data from three F2 mapping populations (HA 89/RHA 464, B-line/RHA 464, and CR 29/RHA 468). The consensus map spanned a total length of 1443.84 cM, and consisted of 5,019 SNP markers derived from RAD tag sequencing and 118 publicly available SSR markers distributed in 17 linkage groups, corresponding to the haploid chromosome number of sunflower. The maximum interval between markers in the consensus map is 12.37 cM and the average distance is 0.28 cM between adjacent markers. Despite a few short-distance inversions in marker order, the consensus map showed high levels of collinearity among individual maps with an average Spearman's rank correlation coefficient of 0.972 across the genome. The order of the SSR markers on the consensus map was also in agreement with the order of the individual map and with previously published sunflower maps. Three individual and one consensus maps revealed the uneven distribution of markers across the genome. Additionally, we performed fine mapping and marker validation of the rust resistance gene R12, providing closely linked SNP markers for marker-assisted selection of this gene in sunflower breeding programs. This high resolution consensus map will serve as a valuable tool to the sunflower community for studying marker-trait association of important agronomic traits, marker assisted breeding, map-based gene cloning, and comparative mapping.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25014030      PMCID: PMC4094432          DOI: 10.1371/journal.pone.0098628

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Sunflower (Helianthus annuus L) is a member of the Asteraceae family, and is the fourth most economically important annual crop grown worldwide for edible oil [1]. Cultivated sunflower is a diploid species (2n = 2x = 34) with a large genome size of ∼3.5 Gb [2]. Molecular markers and high density genetic linkage maps are important tools for understanding genome organization, and can facilitate comparative genomics, marker-assisted selection (MAS), identification of marker-trait associations via linkage or association mapping analysis, and isolation of genes by map-based cloning [3], [4]. Existing marker resources in sunflower include random amplified polymorphic DNA (RAPD) [5], restriction fragment length polymorphism (RFLP) [6]–[8], amplified fragment length polymorphism (AFLP) [9], and simple sequence repeat (SSR) [10]–[12]. Many important agronomic traits, including vertical disease resistance genes [13]–[21], fertility restoration genes [22]–[25] and numerous quantitative trait loci (QTL) were mapped using these data [26]–[31]. While the reports of sunflower linkage maps are numerous (http://sunflower.uga.edu/cmap/), the limited number of markers makes it difficult to conduct fine-scale linkage mapping and map-based cloning. Association mapping and genomic selection are dependent on a large number of polymorphic markers. These analyses are only successful if thousands of markers are available, because of the low level of linkage disequilibrium (LD) present in germplasm resources of sunflower [32]–[35]. When large numbers of markers are employed in an analysis, especially for routine breeding purposes such as genomic selection, the marker must also be high-throughput and cost effective to provide timely and repeatable data. Single nucleotide polymorphisms (SNP) are the most common type of genetic variation [36]. Through advances in sequencing technologies and high-throughput genotyping facilities, SNP markers have gained much interest in the scientific and breeding community because of their efficiency, repeatability, and low cost [37]. SNPs are usually biallelic and characterized by low mutation rates; therefore, stable from generation to generation across the genome [38]. This stability coupled with the abundance of SNPs makes them very useful both for linkage and genetic diversity studies. SNPs make it possible to conduct genome wide association mapping in low LD species. While SNP studies have been common for some time in human genetics, the advances in sequencing technology have allowed large scale SNP discovery also in crop plant species, such as sunflower [39]–[40]. Recently a high density linkage map based on ∼10,000 SNP markers was reported [41]. The National Sunflower Association (NSA) SNP Consortium, a private-public partnership of commercial seed companies, U.S. Department of Agriculture-Agricultural Research Service (USDA-ARS), and the National Sunflower Association, has developed 10,000 SNP markers using restriction site associated DNA (RAD) protocols and Illumina/Solexa paired-end sequencing chemistry [40]. The development of these SNP markers benefits the sunflower research community as a molecular genetics and genomics resource that offers the promise of speedy, inexpensive genotyping for multiple purposes, but in particular will facilitate gene mapping studies. Construction of a consensus map from multiple linkage maps offers the opportunity to map a larger number of markers than would be possible in any individual bi-parental map and also tends to eliminate many large marker gaps. Statistical software has been developed to pool segregation data from individual populations and compute loci orders and genetic map distances based on mean recombination frequencies and combined LOD scores [42]. In the absence of the whole genome sequence and a physical map, the high resolution genetic map remains an essential resource for dissection of complex traits and an essential guide to genomics-assisted crop improvement [43]. Consensus maps have been developed using multi-population linkage maps in several crop species including sunflower [41], tomato [43], soybean [44], common bean [45], sorghum [46], red clover [47], and rye [48]. Here, we report the construction of three linkage maps using SNP markers developed from RAD tag sequences and SSR markers previously positioned in the sunflower SSR reference map [10], [11], and the development of a consensus map. We also report and validate a marker linked to a rust resistance gene, R, in the constructed consensus map.

Materials and Methods

Plant materials

Three mapping populations were used to develop SNP genetic maps in sunflower (Table 1). Five parental lines were chosen to construct these three mapping populations, all but one of which were used in initial RAD tag sequencing [40]. Crosses were made in pairs predicted to maximize total cumulative polymorphism. The first mapping population (Pop1) consisted of 139 F2 progeny derived from a cross between HA 89 and RHA 464. HA 89 (PI 599773) is an oilseed maintainer line and RHA 464 (PI 655015) is an oilseed restorer sunflower germplasm which is known to possess resistance genes for both downy mildew and rust diseases [49]. This population was previously used to map the rust resistance gene (R-gene) R 12 to linkage group (LG) 11 of sunflower using simple sequence repeat (SSR) markers [13]. The second mapping population (Pop2) consisted of 141 F2 progeny derived from a cross between a proprietary confection B line (Nuseed Americas, Woodland, CA, USA) and RHA 464. The third mapping population (Pop3) consisted of 142 F2 progeny derived from a cross between CR 29 (Nuseed Americas, Woodland, CA, USA) and RHA 468 (PI 667184). CR 29 is a proprietary confection restorer line and RHA 468 is an oilseed restorer line. To graphically explain relationships among the parent lines, Jaccard's genetic similarity coefficient [50] was calculated using SNP marker data and a dendogram was constructed using unweighted pair-group method of arithmetic averages (UPGMA) clustering analysis in NTSYS-pc version 2.2 [51].
Table 1

SNP and SSR marker distributions in the three component maps and the consensus map of sunflower.

Pop1 (HA 89×RHA 464) F2 Pop2 (B-line×RHA 464) F2 Pop3 (CR29×RHA 468) F2 Consensus map
Linkage groupsNo. of markersMap length cMDensity cM/markerNo. of SNP markersMap length cMDensity cM/markerNo. of SNP markersMap length cMDensity cM/markerNo. of markersMap length cMDensity cM/markerNo. of large gaps
SSRSNPTotalSNPSSRTotal5 to 10 cM>10 cM
LG1928529488.560.3033773.820.223354.971.67384939376.090.1930
LG29829180.220.8817273.920.433440.591.19214922381.990.3721
LG36122128105.890.8318588.990.4814588.720.61327633395.330.2900
LG4614214857.450.3994100.301.07156108.940.702736279102.450.3740
LG510179189100.960.5324191.770.3814682.090.563741038491.870.2400
LG62515348.030.9111758.800.506756.580.84168217062.990.3700
LG78627067.280.967266.090.926055.180.92140814868.310.4630
LG8821422267.660.3016662.970.3817281.400.47320832875.420.2310
LG910108118106.790.9117986.500.48228108.830.4835210362104.600.2930
LG101338639994.840.2443794.480.229576.610.815031351690.890.1811
LG111014215276.320.5010388.370.8611795.860.822461025699.820.3910
LG12814114919.840.1314262.960.449866.120.67255826367.000.2500
LG134444831.190.6524869.450.2813677.700.57296430072.930.2410
LG141434438.560.8816078.560.4919173.630.39285128676.470.2730
LG155545953.950.9115880.280.519584.780.89225523085.460.3720
LG1657782105.801.2922395.770.43144107.670.753335338101.280.3010
LG174364021.370.5320297.940.4820657.520.28324432890.940.2830
Total118216822861164.710.5132361370.970.4221231317.190.62501911851371443.840.28282
A total of 548 sunflower lines were used in the present study to validate the R specific markers. They include 238 inbred lines and 63 germplasm lines released by USDA, and 247 plant introduction (PI) lines originally collected from 32 countries, which together represent a diverse germplasm pool of cultivated sunflower (Tables S1 and S2).

DNA extraction and genotyping

Genomic DNA of Pop1 along with its parental lines, HA 89 and RHA 464, were obtained from a previous mapping project [13]. Genomic DNA of Pop2 and Pop3, along with their parents and 548 sunflower germplasm lines, were extracted using 40 mg of lyophilized young leaves with the DNeasy 96 Plant Kit (Qiagen, Valencia, CA, USA) and a modified protocol. Tissue was pulverized with 3-mm steel beads in a Harbil 5G-HD paint shaker (Fluid Management, IL, USA). Buffer AP1 with DX and RNaseA was added to the tissue, 500 µl per sample, and incubated at 55°C for 60 min. Buffer AP2 was added at 150 µl per well, and incubated at −20°C for 15 min. AP3/E was combined with supernatant, 600 µl and 400 µl respectively, and then added to the binding plates. The rest of the extraction was carried out according to kit instructions. DNA was eluted in a final volume of 50 µl, and was quantified using the PicoGreen kit (Molecular Probes) according to the kit instructions. A standard curve was made using quantified λ DNA from 100 to 0 ng/µl. A 1/200 dilution of Picogreen reagent in 1×TE (provided in kit) was mixed with 2 µl of isolated DNA, briefly vortexed, and incubated in the dark for 5 min. Assays were performed in black 96-well Fluotrac plates and fluorescence was measured with a Spectramax Gemini XPS (Molecular Devices) at 485 nm excitation and 538 nm emission wavelengths. SSR markers were used only in Pop1 in order to determine the linkage groups and map orientation corresponding to the published maps of Tang et al. [10] and Yu et al. [11]. A total of 870 published SSR markers [10]–[12] were screened for polymorphism between the two parents, HA 89 and RHA 464 [13]. Two-hundred fifteen polymorphic SSR markers covering 17 linkage groups were selected for genotyping the 139 F2 individuals of Pop1. A total of 8,723 SNP markers selected from the original 10,000 SNPs derived from RAD sequencing were used to genotype all the parents and F2 progenies of the three mapping populations, as well as 548 sunflower lines (Tables S1 and S2). The SNP sequences for the 10,000 targeted loci were presented in Table S3. SNP marker discovery using paired-end RAD sequencing and Illumina Infinium quality control parameters have been described by Pegadaraju et al. [40], SNP markers were named starting with NSA followed by a six digit number. Samples were genotyped with a custom assembled Illumina Infinium chip (Illumina Inc., San Diego, CA, USA) containing 8,723 SNP markers. The genotypic data were analyzed using the Genome Studio Genotyping Module v1.0 (Illumina Inc.) clustering algorithm. All data were visually inspected and manually rescored if any errors were evident in the calling of the homozygous or heterozygous clusters. To reduce data set complexity in line with software requirements, SNP data were filtered to remove uninformative markers, such as those with no polymorphism observed between parents, those where one/both of the parental genotypes failed to amplify in the assay, or those possessing a heterozygous genotype in at least one of the parental genotypes. The remaining SNPs were mapped using JoinMap 4.1 [42].

Construction of individual population linkage maps

Linkage maps were constructed independently for each mapping population using the same procedures and parameters in each case. All the SNP markers and the majority of the SSR markers used for linkage mapping were co-dominant. The Chi-square test (p>0.05) was used to assess goodness-of-fit to the expected segregation ratio for each marker using the ‘locus genotype frequencies’ feature of JoinMap 4.1. Markers that showed significant segregation distortion from the expected 1∶2∶1 (co-dominant) or 3∶1 (dominant) ratios were excluded from map construction. Markers were assigned to linkage groups applying the independence LOD (logarithm of the odds) parameter with LOD threshold values ranging from 2.0 to 12.0. We used the ‘similarity of loci’ command of JoinMap to identify perfectly identical markers (similarity value  = 1.000) which are supposed to be mapped at exactly the same position on the linkage group. In order to reduce the burden of calculation effort, only one marker was kept of the ‘similar loci’ for linkage mapping analysis. Linkage analysis and marker order were carried out using the regression mapping algorithm. Recombination fractions were converted to map distances in centimorgans (cM) using the Kosambi mapping function [52]. The excluded similar markers, however, were included in the final map. The linkage groups of individual maps were drawn using MapChart 2.2 [53]. All unique, two-way, and three-way sets of shared markers across three mapping populations were analyzed and visualized using the Venn diagram [54].

Construction of a consensus linkage map

The integration of the linkage groups derived from three mapping populations followed the principle described by Stam [55] using JoinMap 4.1 [42]. First, groupings and group nodes for each individual population were loaded into the navigation tree of the same JoinMap 4.1 project. The groups that correspond to the same linkage group with at least two common loci were combined into a single ‘combined group node’ in the navigation tree using the ‘Combine groups for map integration’ command. The integrated linkage map was constructed using a regression mapping algorithm with the same threshold parameters used for individual population linkage mapping. The graphical representation of the integrated linkage map was drawn using MapChart 2.2 [53].

Comparison of the consensus map with the individual linkage maps

The extent of collinearity in marker orders between consensus and component genetic maps was assessed by calculating the Spearman's rank correlation coefficients (ρ) from marker positions in consensus and individual genetic maps. Significance tests were conducted in R version 2.13.1 [56]. Comparative analyses of marker order and collinearity were illustrated by plotting marker positions on the consensus map against individual population maps.

Analysis of marker distribution

All linkage groups were divided into 1, 2, 5, and 10 cM intervals and the observed marker frequency distribution of each interval was calculated. The observed marker frequencies per centiMorgan (cM) unit interval were compared to that of expected frequencies generated from a Poisson distribution using a Chi-square test [57]. The probability density function of the expectation iswhere x is the actual marker count in each interval, λ is the average number of markers per interval and e is the base of the natural logarithm. Analyses were conducted with the R statistical package [56].

Assessment of the rust resistance gene R and linked SNP markers

Rust phenotypic data of Pop1 were obtained from Gong et al. [13], where Pop1 was first used to map the rust resistance gene R with SSR markers. Briefly, urediniospores of North America (NA) race 336 were used to inoculate F2 plants, along with the two parental lines HA 89 (susceptible parent) and RHA 464 (resistant parent). Twenty seedlings of each of the F2-derived F3 families were also phenotyped with the same pathogen race to distinguish between F2 plants that were homozygous or heterozygous for the resistance gene. Rust infection types and severity (pustule coverage) were scored 10–12 days after inoculation as described by Qi et al. [16]. The rust phenotypic and SNP marker data were combined for fine-mapping of the gene R.

Results

Genetic diversity of the parental lines

Three segregating populations derived from five parental lines were used in this study (Table 1). RHA 464 was a common parent between Pop1 and Pop2. The SNP marker data of the parental lines were used to assess the genetic diversity between the parental lines based on Jaccard's coefficient and a dendogram was constructed using UPGMA clustering analysis (Figure 1). The parental lines of the respective mapping populations varied widely in genetic relatedness, with the parents of Pop2 being the most diverse pair (similarity coefficient value is only 0.007).
Figure 1

Genetic relationship of mapping parents.

Dendogram of the 5 sunflower parents of the three mapping populations based on unweighted pair-group method with arithmetic averages clustering analysis (UPGMA).

Genetic relationship of mapping parents.

Dendogram of the 5 sunflower parents of the three mapping populations based on unweighted pair-group method with arithmetic averages clustering analysis (UPGMA).

Component maps of individual populations

The three mapping populations, Pop1, Pop2, and Pop3, were used to produce three separate high-density genetic maps containing 2,286, 3,236, and 2,123 markers in each map, respectively (Table 1).

Pop1 linkage map

The construction of the Pop1 linkage map started with 141 F2 individuals from a cross between HA 89 and RHA 464. Two individuals were discarded because of too much missing data, leaving a total of 139 F2 individuals for the final linkage analysis. Pop1 was first genotyped with a total of 220 polymorphic SSR markers. Of these, 118 informative and co-dominant SSRs were integrated with SNP markers to construct a high-density linkage map of Pop1. The number of SSR markers for each LG varied and ranged from one on LG14 to 13 on LG10 (Table 1). All but two SSRs detected a single locus. CRT136 was mapped to LGs 4 and 7, and ORS679 was mapped to LGs 12 and 15, consistent with previous data [10]–[11]. Additionally, a total of 2,413 polymorphic SNP markers were used for linkage mapping in Pop1. About 10.2% of the SNP markers (245/2,413) showed significant distortion (P<0.05) from the expected Mendelian ratio, and were discarded from mapping analysis, leaving a total of 2,286 segregating markers (118 SSR and 2,168 SNP) (Table 1). The markers were assembled into 17 LGs identified in the same manner as the genetic maps of Tang et al. [10] and Yu et al. [11]. Mapped SNP markers were distributed in all 17 LGs, although, like the SSRs, the distribution was not homogeneous in all regions. The genetic map covers a total length of 1,164.71 cM, with an average density of one marker in every 0.51 cM. The length of the linkage groups ranges from 19.84 cM in LG12 to 106.79 cM in LG9, and the number of markers per linkage group varies from 40 in LG17 to 399 in LG10 (Table 1). Ninety one percent of the gaps between two adjacent loci were smaller than 5 cM with the largest being 33.89 cM on LG9 (Table S4).

Pop2 linkage map

The Pop2 linkage map was constructed with 141 F2 individuals of a cross between a proprietary confection B-line and RHA 464. Filtration of the SNP genotype data yielded 3,464 good quality SNP marker data for linkage analysis in Pop2. A total of 228 markers (6.6%) showed significant (P<0.05) distortion from the expected 1∶2∶1, which were removed, yielding a final genetic map of 3,236 SNP markers assembled into 17 LGs. The 17 LGs were identified on the basis of the common SNP markers located on each chromosome relative to the linkage map of Pop1. The Pop2 linkage map covers a total length of 1,370.97 cM with an average density of one marker in every 0.42 cM (Table 1). The length of individual linkage groups varies from 58.80 cM in LG6 to 100.30 cM in LG4. The number of markers per linkage group also varies considerably from 72 in LG7 to 437 in LG10 (Table 1). Most of the gaps (94%) between two adjacent loci were smaller than 5 cM while the largest gap was only 23.10 cM on LG4 (Table S4).

Pop3 linkage map

Linkage mapping of Pop3 started with 142 F2 individuals from a cross between CR 29 and RHA 468, and 2,681 good quality SNP marker data were obtained after filtration of the genotype data. Segregation analysis revealed that 552 SNPs (20.6%) showed significant (P<0.05) distortion from the expected Mendelian ratio and were removed from the linkage analysis. The remaining SNP markers were placed onto 17 sunflower linkage groups except for 6 markers, which could not be suitably added in any linkage group, resulting in a final map consisting of 2,123 SNP markers (Table 1). The total length of the genetic map of Pop3 was 1,317.19 cM with an average density of one marker in every 0.62 cM, the lowest among all three maps. Individual linkage groups range from 40.59 cM in LG2 to 108.94 cM in LG4, and the number of markers per linkage group varies from 33 in LG1 to 228 in LG9 (Table 1). Gaps 5 cM or smaller between two adjacent loci accounted for about 93% of the total gaps observed, with the largest gap being 26.87 cM at the distal end of LG13 (Table S4).

Unique and common markers across component maps

A total of 608, 1,300, and 855 SNP markers were mapped exclusively in Pop1, Pop2, and Pop3, respectively (Table 2; Figure S1). However, there were 252 SNP markers that were common and mapped in all three mapping populations. In total, 988, 696, and 320 common SNP markers were identified between pairs of component maps Pop1–Pop2, Pop2–Pop3 and Pop1–Pop3, respectively. The large number of common markers found between the Pop1 and Pop2 maps was expected due to RHA 464 being a common parent. Common SNP markers mapped in all three component maps were distributed in all linkage groups ranging from 2 in LG15 to 52 in LG8 (Table 2).
Table 2

The unique and common SNP markers across linkage groups and genetic maps of sunflower.

Linkage groupUnique SNP markers inCommon SNP markers betweenCommon SNP markers across all populationsTotal SNPs
Pop1Pop2Pop3Pop1 and Pop2Pop2 and Pop3Pop1 and Pop3
LG0135761224011010384
LG022710314501514214
LG03431056534353411327
LG046929641938468273
LG05521133763454420374
LG06176528192469168
LG072738301510119140
LG0861463341276052320
LG09247310827634116352
LG1050771128129550503
LG1174384217242724246
LG12523055752968255
LG131012435269335296
LG1413701001172127285
LG15148051364022225
LG16271366926511410333
LG17139710189087324
Total6081,3008559886963202525,019

Consensus map

Consensus maps were constructed by merging corresponding linkage groups from the three individual maps, one linkage group at a time, using JoinMap 4.1. The common markers on homologous linkage groups of individual maps served as bridges to integrate maps into a single consensus map. A schematic illustration of the consensus map, which included the expected 17 linkage groups of sunflower, is presented in Figure 2. The integrated linkage map consisted of 5,019 SNP markers and 118 SSR markers with a total map length of 1,443.84 cM (Table 1). The length of the linkage groups ranges from 62.99 cM in LG6 to 104.60 cM in LG9, and the number of markers per linkage group varies from 148 in LG7 to 516 in LG10. The total map length of the consensus map is greater than the map length of each component map. Detailed information of the consensus map including the genetic distance, marker types, and unique and common markers among the populations is illustrated in Figure S2, S3, S4, S5, S6, S7, and Table S4.
Figure 2

Schematic representation of the consensus map.

Ruler on left indicates the cM distance and the horizontal lines across the chromosomes indicate locus positions on each chromosome.

Schematic representation of the consensus map.

Ruler on left indicates the cM distance and the horizontal lines across the chromosomes indicate locus positions on each chromosome.

Collinearity of markers between consensus and component maps

Inequality of the lengths of individual linkage groups between the component maps and the consensus map was clearly visible in collinearity plots (Figure 3). In general, marker order between the consensus map and the component maps was consistent across all the linkage groups, with only a few ambiguities identified in LGs 1, 2, 5, 15, and 16. Correlation analysis revealed that marker orders were strongly correlated in all 17 linkage groups between the consensus and component maps, with a mean correlation coefficient value of 0.972 (Table S5).
Figure 3

Scatter plots of the 17 sunflower linkage groups showing the collinearity of marker order among the consensus genetic map and component population maps.

Distribution of markers along linkage groups

Chi-square test of marker distribution at 1, 2, 5, and 10 cM intervals on linkage groups revealed highly significant deviations from the Poisson expectation (data not shown). Genome wide marker distribution in 1-cM intervals shows a clear clustering of markers in certain genomic regions, indicating that the markers were not randomly distributed along the entire length of the sunflower linkage groups (Figure 4). The average genetic distance between markers was 0.28 cM and ranged from 0.18 cM in LG10 to 0.46 cM in LG7 (Table 1). Large gaps (>5 cM) observed in most linkage groups of the component maps were reduced during the map integration process. In the consensus map, gaps between two adjacent loci became smaller, with 98.6% (2,141 of 2,171) of the gaps being less than 5 cM (Table 1, Figure 4). There were only two gaps >10 cM, one each on LG2 and LG10 with the largest being 12.37 cM on LG10.
Figure 4

Frequency distribution of markers on the sunflower consensus map.

A x-axis indicates genetic distance in each linkage group in 1-cM intervals and the y-axis indicates number of markers in each 1-cM bin.

Frequency distribution of markers on the sunflower consensus map.

A x-axis indicates genetic distance in each linkage group in 1-cM intervals and the y-axis indicates number of markers in each 1-cM bin.

Fine mapping of the rust resistance gene R and marker validation

The rust resistance gene R, present in the inbred line RHA 464, was previously mapped to LG11 with flanking SSR markers, CRT275 and ZVG53, in an interval of 10.6 cM [13]. The same F2 population (Pop1 in the present study) was used to saturate the R region with SNP markers. Rust phenotypic data of Pop1 were integrated with SNP marker data, and seven linked SNP markers were identified, five on one side (NSA_000064, NSA_003320, NSA_003426, NSA_004155, NSA_008884), and two on the other side (NSA_001570, and NSA_001392), defining an interval less than 2.3 cM surrounding the previously mapped R gene in LG11 (Table S4). Five of seven linked SNP markers were used to genotype each of the 548 lines in our validation set (Tables S1 and S2). Among 348 sunflower lines with SNP data, the RHA 464-specific SNP allele of NSA_004155 is only present in RHA 464 and did not exist in any other of the tested lines (Table 3). Another SNP, NSA_003426, which had genotypic data in 322 lines was homozygous for a unique allele in RHA 464 and was heterozygous for that same allele in PI 600809. However, two other SNP markers, NSA_000064 and NSA_008884, which co-segregated with NSA_003426 and NSA_004155, shared the RHA 464 alleles with more than 250 rust susceptible lines, and were not diagnostic markers for R. NSA_001570 was located on the other side of R and 23 susceptible lines shared the RHA 464 allele (Table 3). The diagnostic alleles for R at NSA_003426 and NSA_004155 are cytosine nucleotides while the alleles in HA 89 are adenine nucleotides. Comparison of SNP alleles and rust phenotypes in a subset of 32 lines which included four rust resistant lines carrying different R-genes and 28 susceptible lines confirmed the cytosine alleles of NSA_003426 and NSA_004155 were diagnostic for R (Table 4). These results indicated that the rust resistance gene R is probably not widely distributed in the sunflower germplasm pool and the two SNPs could serve as diagnostic markers for the gene R in most genetic backgrounds.
Table 3

Validation of SNP markers linked to the rust gene R in a diverse sunflower germplasm pool.

SNPMap location (cM)Summary
No. of lines genotypedNo. of lines with dataNo. of HA 89 alleleNo. of RHA 464 alleleNo. of heterozygous allele
NSA_00006444.5554852720928731
NSA_00888444.555483323726629
NSA_00342644.5554832232011
NSA_00415544.5554834834710
R12 45.38-----
NSA_00157046.78548315288234
Table 4

A subset of rust phenotypes and SNP profiles of NSA_003426 and NSA_004155.

LinesRust phenotypes** Rust R-genesSNP genotypesReferences
NSA_003426NSA_004155
HA 89* SAAAA [13], [58]
RHA 464* R R12 CCCC [13], [58]
HA-R3R R4 AAAA [16], [58]
HA-R6R R13a AAAA [14], [58]
HA-R8RunknownAAAA [13], [58]
Rf ANN-1742R R11 AAAA [18], [58]
HA 61SAAAA [58]
HA 285SAAAA [58]
HA 286SAAAA [58]
HA 304SAAAA [58]
HA 305SAAAA [58]
HA 315SAAAA [58]
HA 317SAAAA [58]
HA 318SAAAA [58]
HA 323SAAAA [58]
HA 460SAAAA [58]
HA-R1SAAAA [58]
HA-R5SAAAA [58]
HA-R7SAAAA [58]
RHA 265SAAAA [58]
RHA 270SAAAA [58]
RHA 271SAAAA [58]
RHA 272SAAAA [58]
RHA 273SAAAA [58]
RHA 279S R1 AAAA [21], [58]
RHA 282SAAAA [58]
RHA 298SAAAA [58]
RHA 340S Radv AAAA [19], [21], [58]
RHA 801SAAAA [58]
RHA 854SAAAA [58]
RHA 855SAAAA [58]
RHA 856SAAAA [58]
RHA 858SAAAA [58]
RHA 859SAAAA [58]

*HA 89 and RHA 464 were susceptible and resistant parents of Pop1 used in the present study.

**rust phenotypic data were taken from Qi et al. (58) using NA rust races 336 and 777.

*HA 89 and RHA 464 were susceptible and resistant parents of Pop1 used in the present study. **rust phenotypic data were taken from Qi et al. (58) using NA rust races 336 and 777.

Discussion

Construction of a linkage map is often the first step to characterizing the genome of an organism. We presented a high density integrated genetic linkage map of sunflower using ∼8,700 SNP markers derived from RAD-sequencing. Three F2 mapping populations were developed using five parental lines of cultivated sunflower, four of which were used in the initial RAD sequencing step. Genetic analysis revealed that a high degree of genetic diversity exists between the parents of all three mapping populations. This contributed to the high SNP density in each of the component maps. These maps can be more readily used for breeding purposes because they contain SNPs that are informative within the closely related gene pool of cultivated sunflower. In addition, the high density genetic map facilitated fine mapping of the rust resistance gene R, providing closely linked SNP markers for high throughput, marker-assisted selection of this gene in breeding programs. The individual F2 mapping populations in our study are almost identical in size with ∼140 individuals per population. The linkage maps of Pop2 and Pop3 were similar in length (1,371 and 1,317 cM, respectively), while the Pop1 map (1,165 cM in length) was somewhat shorter than the rest of the maps. Comparisons of linkage groups among individual maps revealed that the upper ends of LGs 4 (∼42 cM), 13 (∼29 cM), 14 (∼33 cM), 15 (∼28 cM) and 17 (∼50 cM) and the lower end of LG12 (∼36 cM) in the Pop1 map showed no marker coverage, though the same regions in the other two maps possessed many mapped loci (Table S4). The other larger gap that lacked mappable markers was∼34 cM in LG9 of Pop1 as well as a few other ∼20 cM gaps in various LGs of all three mapping populations. Bowers et al. [41] also reported several gaps of up to 26 cM in individual sunflower crosses. This pattern is not due to a lack of SNP markers on these chromosome segments but is likely due to the mapping parents sharing similar genomic regions identical by descent. Large gaps with low polymorphism were also observed in the linkage maps of other species like soybean [44], common bean [45], sorghum [46] and rye [48]. Segregation distortion is a ubiquitous phenomenon in crop species that skews the frequency of alleles from the expected Mendelian ratio within a segregating population and has strong impact in genetic map construction [9], [59]–[63]. In the present study, the proportion of distorted markers varied from 6.6% (Pop2) to 20.6% (Pop3) which is comparable to other species like sorghum [46], red clover [47], rye [48], maize [64], and pigeonpea [65]. The distribution of distorted markers among individual linkage groups in the component maps was distinctly different. In Pop1, the highest number of distorted markers was observed in LGs 1, 7, 10, and 12, whereas in Pop2, the marker distorted regions were present in LGs 4, 9, 11, and 12. In Pop3, where the highest percentage of skewed segregations was observed, the most distorted markers were in LGs 1, 2, 8, 9, and 10. In order to smooth integration of the consensus map, we discarded 254, 228 and 552 distorted markers, respectively, during individual map construction. However, 48.2%, 67.7% and 75.7% of these distorted markers from Pop1, Pop2, and Pop3, respectively, were eventually included in the consensus map through integration of information available from one or both of the other populations without segregation distortion. Our mapping strategy with multiple segregating populations thus offered an excellent basis for the development of a consensus map in sunflower. The presence of common markers among component maps is a prerequisite for building a consensus linkage map [66]. In this study, sufficient numbers of common SNP markers were segregating within each individual mapping population, which made the merging of a large number of markers into the final consensus map possible. High numbers of common markers, with stable recombination frequencies across component mapping populations, allow positioning of markers on a highly reliable reference map and also in regions that were poorly covered in the individual maps [67]. As a result, the final consensus map possesses 5,137 markers (118 SSR and 5,019 SNP) spanning 1,443.84 cM, divided amongst 17 linkage groups (the actual number of sunflower haploid chromosomes), with an average distance of 0.28 cM between adjacent markers. The map length is comparable to the combined sunflower map (1,310 cM) developed recently by Bowers et al. [41] with 10,080 marker loci. The total length of the consensus map was greater than the length of each of the individual maps (Table 1). The extended map length of the consensus map was mainly due to the addition of markers to the distal parts of some linkage groups. Additionally, the consensus map allowed us to fill most of the larger gaps on individual maps, reducing the number of gaps >10 cM from 10 to 18 on the individual maps to just two on the consensus map. Despite minor local inversions of neighboring markers in linkage groups 1, 2, 15, and 16, the collinearity of the marker order between the consensus map and the individual maps showed excellent congruence (Figure 3). The high value of Spearman's rank correlation of marker orders between the consensus and individual maps supports this finding (Table S5). Local inversion of closely spaced markers is a common feature during map integration [46]–[47], [68]–[71]. Short span marker order rearrangements could be the reflection of real genetic events or could be caused by statistical uncertainty due to many weak linkages [68], a small number of progeny studied [72], or heterogeneity of recombination frequencies among populations. In the present study, marginal shifts in marker order were found in highly dense marker regions. Resolving scrambled marker order at high-density regions of the genome would require extremely large mapping populations [68]. Alternatively, the issue of inversions would be resolved by comparing linkage maps with a physical map of the sunflower genome which is yet to be completed [73]. The distribution of markers along linkage groups was not random and there were marker-rich and marker-poor regions in the sunflower linkage maps (Figures 2 and 4). Highly dense marker regions are typically centromeric regions [74]–[75]. However, this is not conclusive in our study, as many of the linkage groups showed more than one region of high marker density (Figure 4), similar to the finding of Bowers et al. [41]. A total of 118 publicly available SSR markers are present in our consensus map, which allowed us to reference the homologous linkage groups to the published sunflower maps. Overall, the order of the SSR markers was well conserved in all linkage groups between our map and the published sunflower SSR maps [10]–[12] and also the other integrated sunflower SNP map [41].This would allow cross-referencing between different published sunflower maps and would offer the opportunity of exploring a much larger number of markers for a given genomic region. A total of three SNP markers, two common between Pop1 and Pop3 and one between Pop1 and Pop2, map to different linkage groups in alternative populations, which could only be explained by the existence of paralogous loci. Similar paralogous loci were observed in sorghum [46] and most recently in sunflower [41] when aligning genetic linkage maps derived from four mapping populations. The number of paralogous loci reported by Bowers et al. [41] was much higher (∼14%) than we observed in our study. The most likely explanation for this difference might lay in the SNP development strategies between the two studies. In the linkage mapping of Bowers et al. [41], the SNPs were designed based on deep EST sequencing of the parental lines. The paralogous loci are the result of gene duplication, and the ESTs of paralogs would have similar sequences in some cases, causing non-specific binding of the SNP primers. In the current study, SNPs were developed from sunflower genomic sequence using restriction site-associated DNA sequencing (RAD-Seq), which is based on identifying polymorphic variants adjacent to restriction enzyme recognition sites [40]. The use of SNP markers combined with publicly available SSR markers in multiple populations greatly increased marker saturation in our consensus map, a major improvement over the low resolution sunflower maps constructed with single populations and other marker types [5]–[12]. The present consensus map of 5,019 SNP and 118 SSR markers is the second most dense genetic linkage map in sunflower next to the one developed recently by Bowers et al. [41] with 10,080 markers. Our consensus map can serve as a valuable tool to sunflower breeders for marker-trait association in QTL or association mapping of important agronomic traits, marker assisted breeding, map-based gene cloning, and comparative mapping. List of three-hundred one USDA released sunflower germplasms used for marker validation. (XLSX) Click here for additional data file. List of two-hundred forty seven sunflower plant introduction (PI) lines used for marker validation. (XLSX) Click here for additional data file. SNP sequences for the 10,000 targeted loci. (XLSX) Click here for additional data file. SNP and SSR marker positions in Pop1, Pop2, Pop3, and consensus genetic linkage maps. (XLSX) Click here for additional data file. Spearman's rank correlation coefficients between marker positions in the consensus map and individual population maps in each linkage group of sunflower. (DOCX) Click here for additional data file. A three-way Venn diagram illustrating all unique, two-way and three-way sets of shared SNP markers mapped in three component populations. The mapping populations are abbreviated as in the text: Pop 1 = HA 89×RHA 464; Pop 2 = B-Line×RHA 464; Pop 3 = CR29×RHA 468. (TIF) Click here for additional data file. Integrated genetic linkage map of sunflower. The map shows the linkage groups 1, 2, and 3 developed from three F2 mapping populations. Markers in bold font are SSR markers. (TIF) Click here for additional data file. Integrated genetic linkage map of sunflower. The map shows the linkage groups 4, 5, and 6 developed from three F2 mapping populations. Markers in bold font are SSR markers. (TIF) Click here for additional data file. Integrated genetic linkage map of sunflower. The map shows the linkage groups 7, 8, and 9 developed from three F2 mapping populations. Markers in bold font are SSR markers. (TIF) Click here for additional data file. Integrated genetic linkage map of sunflower. The map shows the linkage groups 10, 11, and 12 developed from three F2 mapping populations. Markers in bold font are SSR markers. (TIF) Click here for additional data file. Integrated genetic linkage map of sunflower. The map shows the linkage groups 13, 14, and 15 developed from three F2 mapping populations. Markers in bold font are SSR markers. (TIF) Click here for additional data file. Integrated genetic linkage map of sunflower. The map shows the linkage groups 16 and 17 developed from three F2 mapping populations. Markers in bold font are SSR markers. (TIF) Click here for additional data file.
  55 in total

1.  Molecular mapping of the Pl(16) downy mildew resistance gene from HA-R4 to facilitate marker-assisted selection in sunflower.

Authors:  Zhao Liu; Thomas J Gulya; Gerald J Seiler; Brady A Vick; Chao-Chien Jan
Journal:  Theor Appl Genet       Date:  2012-02-21       Impact factor: 5.699

2.  Development of a high density integrated reference genetic linkage map for the multinational Brassica rapa Genome Sequencing Project.

Authors:  Xiaonan Li; Nirala Ramchiary; Su Ryun Choi; Dan Van Nguyen; Md Jamil Hossain; Hyeon Kook Yang; Yong Pyo Lim
Journal:  Genome       Date:  2010-11       Impact factor: 2.166

3.  Identification of resistance to new virulent races of rust in sunflowers and validation of DNA markers in the gene pool.

Authors:  Lili Qi; Tom Gulya; Gerald J Seiler; Brent S Hulke; Brady A Vick
Journal:  Phytopathology       Date:  2011-02       Impact factor: 4.025

4.  Molecular mapping of the rust resistance gene R ( 4 ) to a large NBS-LRR cluster on linkage group 13 of sunflower.

Authors:  L L Qi; B S Hulke; B A Vick; T J Gulya
Journal:  Theor Appl Genet       Date:  2011-04-11       Impact factor: 5.699

5.  Molecular mapping of segregation distortion loci in Aegilops tauschii.

Authors:  J D Faris; B Laddomada; B S Gill
Journal:  Genetics       Date:  1998-05       Impact factor: 4.562

6.  Construction of an AFLP genetic map with nearly complete genome coverage in Pinus taeda.

Authors:  D L Remington; R W Whetten; B H Liu; D M O'Malley
Journal:  Theor Appl Genet       Date:  1999-06       Impact factor: 5.699

7.  Integrated consensus genetic and physical maps of flax (Linum usitatissimum L.).

Authors:  Sylvie Cloutier; Raja Ragupathy; Evelyn Miranda; Natasa Radovanovic; Elsa Reimer; Andrzej Walichnowski; Kerry Ward; Gordon Rowland; Scott Duguid; Mitali Banik
Journal:  Theor Appl Genet       Date:  2012-08-14       Impact factor: 5.699

8.  SNP discovery and development of a high-density genotyping array for sunflower.

Authors:  Eleni Bachlava; Christopher A Taylor; Shunxue Tang; John E Bowers; Jennifer R Mandel; John M Burke; Steven J Knapp
Journal:  PLoS One       Date:  2012-01-04       Impact factor: 3.240

9.  A consensus genetic map of sorghum that integrates multiple component maps and high-throughput Diversity Array Technology (DArT) markers.

Authors:  Emma S Mace; Jean-Francois Rami; Sophie Bouchet; Patricia E Klein; Robert R Klein; Andrzej Kilian; Peter Wenzl; Ling Xia; Kirsten Halloran; David R Jordan
Journal:  BMC Plant Biol       Date:  2009-01-26       Impact factor: 4.215

10.  De novo sequencing of sunflower genome for SNP discovery using RAD (Restriction site Associated DNA) approach.

Authors:  Venkatramana Pegadaraju; Rick Nipper; Brent Hulke; Lili Qi; Quentin Schultz
Journal:  BMC Genomics       Date:  2013-08-15       Impact factor: 3.969

View more
  34 in total

1.  Development and dissection of diagnostic SNP markers for the downy mildew resistance genes Pl Arg and Pl 8 and maker-assisted gene pyramiding in sunflower (Helianthus annuus L.).

Authors:  L L Qi; Z I Talukder; B S Hulke; M E Foley
Journal:  Mol Genet Genomics       Date:  2017-02-03       Impact factor: 3.291

2.  Relocation of a rust resistance gene R 2 and its marker-assisted gene pyramiding in confection sunflower (Helianthus annuus L.).

Authors:  L L Qi; G J Ma; Y M Long; B S Hulke; L Gong; S G Markell
Journal:  Theor Appl Genet       Date:  2015-01-11       Impact factor: 5.699

3.  Pl(17) is a novel gene independent of known downy mildew resistance genes in the cultivated sunflower (Helianthus annuus L.).

Authors:  L L Qi; Y M Long; C C Jan; G J Ma; T J Gulya
Journal:  Theor Appl Genet       Date:  2015-02-12       Impact factor: 5.699

4.  Genetics and mapping of a novel downy mildew resistance gene, Pl(18), introgressed from wild Helianthus argophyllus into cultivated sunflower (Helianthus annuus L.).

Authors:  L L Qi; M E Foley; X W Cai; T J Gulya
Journal:  Theor Appl Genet       Date:  2016-01-08       Impact factor: 5.699

5.  SNP discovery in common bean by restriction-associated DNA (RAD) sequencing for genetic diversity and population structure analysis.

Authors:  Paula Arielle M R Valdisser; Georgios J Pappas; Ivandilson P P de Menezes; Bárbara S F Müller; Wendell J Pereira; Marcelo G Narciso; Claudio Brondani; Thiago L P O Souza; Tereza C O Borba; Rosana P Vianello
Journal:  Mol Genet Genomics       Date:  2016-03-01       Impact factor: 3.291

6.  Inheritance and molecular mapping of powdery mildew (Golovinomyces orontii) resistance gene(s) in sunflower (Helianthus annuus L.).

Authors:  Prathap Reddy Kallamadi; Sujatha Mulpuri
Journal:  3 Biotech       Date:  2020-05-06       Impact factor: 2.406

7.  Diversity analysis and genomic prediction of Sclerotinia resistance in sunflower using a new 25 K SNP genotyping array.

Authors:  Maren Livaja; Sandra Unterseer; Wiltrud Erath; Christina Lehermeier; Ralf Wieseke; Jörg Plieske; Andreas Polley; Hartmut Luerßen; Silke Wieckhorst; Martin Mascher; Volker Hahn; Milena Ouzunova; Chris-Carolin Schön; Martin W Ganal
Journal:  Theor Appl Genet       Date:  2015-11-04       Impact factor: 5.699

8.  Diversification of the downy mildew resistance gene pool by introgression of a new gene, Pl35, from wild Helianthus argophyllus into oilseed and confection sunflowers (Helianthus annuus L.).

Authors:  L L Qi; G J Ma; X H Li; G J Seiler
Journal:  Theor Appl Genet       Date:  2019-06-18       Impact factor: 5.699

9.  High-throughput genotyping-by-sequencing facilitates molecular tagging of a novel rust resistance gene, R 15 , in sunflower (Helianthus annuus L.).

Authors:  G J Ma; Q J Song; S G Markell; L L Qi
Journal:  Theor Appl Genet       Date:  2018-03-21       Impact factor: 5.699

10.  Genotyping-by-sequencing targeting of a novel downy mildew resistance gene Pl 20 from wild Helianthus argophyllus for sunflower (Helianthus annuus L.).

Authors:  G J Ma; S G Markell; Q J Song; L L Qi
Journal:  Theor Appl Genet       Date:  2017-04-21       Impact factor: 5.699

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.