Literature DB >> 29186379

Development of a high-density linkage map and chromosome segment substitution lines for Japanese soybean cultivar Enrei.

Satoshi Watanabe1, Takehiko Shimizu1, Kayo Machita1, Yasutaka Tsubokura1, Zhengjun Xia1, Tetsuya Yamada2, Makita Hajika2, Masao Ishimoto1, Yuichi Katayose1, Kyuya Harada1, Akito Kaga1.   

Abstract

Using progeny of a cross between Japanese soybean Enrei and Chinese soybean Peking, we developed a high-density linkage map and chromosomal segment substitution lines (CSSLs). The map consists of 2,177 markers with polymorphism information for 32 accessions and provides a detailed genetic framework for these markers. The marker order on the linkage map revealed close agreement with that on the chromosome-scale assembly, Wm82.a2.v1. The differences, especially on Chr. 5 and Chr. 11, in the present map provides information to identify regions in the genome assembly where additional information is required to resolve marker order and assign remaining scaffolds. To cover the entire soybean genome, we used 999 BC3F2 backcross plants and selected 103 CSSLs carrying chromosomal segments from Peking in the genetic background of Enrei. Using these low-genetic-complexity resources, we dissected variation in traits related to flowering, maturity and yield into approximately 50 reproducible quantitative trait loci (QTLs) and evaluated QTLs with small genetic effects as single genetic factors in a uniform genetic background. CSSLs developed in this study may be good starting material for removing the unfavourable characteristics of Peking during pre-breeding and for isolation of genes conferring disease and stress resistance that have not yet been characterized.

Entities:  

Mesh:

Year:  2018        PMID: 29186379      PMCID: PMC5909467          DOI: 10.1093/dnares/dsx043

Source DB:  PubMed          Journal:  DNA Res        ISSN: 1340-2838            Impact factor:   4.458


1. Introduction

Soybean, Glycine max (L.) Merr., is the most important legume and is the fourth crop next to rice, wheat and maize in terms of world crop production. The estimated size of the soybean genome is 1.1 Gb. The genome sequence of the US cultivar Williams 82, Glyma0, became available on the Phytozome in January 2008. The first chromosome-scale assembly, Glyma1.01, became available in December 2008, and a new assembly, Wm82.a2.v1, was released in January 2014 (https://phytozome.jgi.doe.gov/pz/portal.html (1 November 2017, date last accessed)). According to the Phytozome web site, annotations of some genes have been improved on the basis of RNA-seq data and the number of unmapped scaffolds has been reduced by using new assembly methods and constructing high-density linkage maps. Two other soybean chromosome-scale assemblies are available from the NCBI web site (Glycine_max_v1.1 and Glycine_max_v2.0). However, gene names, numbers and genomic positions are not comparable among different assemblies and are sometimes very confusing to users. In the present study, we discuss only the Glyma1.01 and Wm82.a2.v1 assemblies at the Phytozome web site. A reference genome sequence is a versatile tool with which to characterize the relationships between genes and agronomically important traits. However, additional genome information and experimental materials suitable for genetic characterization are also needed. New next-generation sequencing (NGS) technologies are expected to yield genomic sequences of a wide variety of soybean germplasms. In addition, the development of soybean mutant libraries and the identification of mutants related to agronomically important traits from their phenotypes and reverse-genetic approaches based on NGS are expected to provide new genetic resources. The efficient use of this information and experimental materials is necessary for further soybean breeding. Recent re-sequencing information has enabled the development of molecular markers for soybean gene discovery and breeding. Among molecular markers, simple sequence repeats (SSRs) and microsatellites are useful tools because of their abundance, multi-allelic features, co-dominant inheritance, high variability and ease of analysis. SSR markers developed by previous studies, have been widely used in soybean breeding programs worldwide and are available from SoyBase (https://www.soybase.org/ (1 November 2017, date last accessed)). A genetic linkage map is essential for soybean chromosome assembly. The reference sequence Glyma1.01 was assembled on the basis of a consensus linkage map, and Wm82.a2.v1 (Glyma2) on the basis of high-density linkage maps. However, chromosomal translocation and inversion in soybean have been reported with cytogenic analysis and karyotyping based on fluorescence in situ hybridization has also identified translocations and inversions in several soybean cultivars. Comparison of high-density linkage maps is expected to be deepen with our knowledge about chromosomal structure in soybean cultivars. Soybean is an important source of traditional staple foods such as tofu, natto, miso and soy sauce in Japan. The unique cuisine, geographical and historical isolation of Japan probably shaped distinct agro-morphological characteristics of Japanese soybeans from those of continental soybeans. Since Enrei is a major cultivar (9% of total soybean cultivation area in Japan in 2014) with seeds of high quality for food processing, previous study sequenced the whole genome of Enrei as a representative Japanese cultivar. The sequencing average read coverage at a locus was 22.2×, and 1,659,041 SNPs and 344,418 insertions/deletions between the Enrei assembly and the reference sequence of Williams 82 were identified. One 4-coumaroyl-CoA-ligase gene (out of 10 genes in the William 82 reference genome), seven chalcone synthase genes (out of 24 genes), three chalcone isomerase genes (out of 16 genes), one flavonol synthase gene (out of 4 genes) and six dihydroflavonol 4-reductase genes (out of 10 genes), those predicted as anthocyanin and flavonoid biosynthesis genes in Williams 82 genome, were not found in the Enrei genome. Hence, the accumulation of genome and marker information for a wide range of soybean elite cultivars and the development of experimental resources to facilitate the evaluation of useful genes that differ from Williams 82 orthologs are necessary to utilize a wide range of the genetic diversity in soybean breeding. Peking is a landrace that has been extensively used in a breeding programs because of its resistance to the soybean cyst nematode Heterodera glycines. In 1906, the accession was introduced from Beijing, China, into the USA by the US Department of Agriculture (USDA). Peking is also resistant to soybean mosaic virus, peanut mottle virus, bacterial blight, frogeye leaf spot (Cercospora sojina), soybean dwarf disease, soybean stem canker, reniform nematode Rotylenchulus reniformis,Phytophthora stem and root rot and germinates well under wet conditions., However, the genes responsible for these traits, except for resistance to cyst nematode and phytophthora stem and root rot, remain uncharacterized. Many quantitative trait loci (QTLs) related to important agronomic traits such as flowering time, plant height, maturity, seed weight, yield, seed nutrients, seed oil contents and seed protein are summarized in SoyBase. Isolation and characterization of genes for these traits are very important in facilitating MAS with DNA markers tightly linked to a locus or nucleotide polymorphism suitable for distinguishing functional alleles. However, the development of experimental material suitable for positional cloning is time consuming. Among such materials, ‘heterozygous inbred families’ and ‘residual heterozygous lines’ have been used to analyze QTLs as single Mendelian factors for fine mapping. In addition, series of near-isogenic lines (NILs), which have a common genetic background, are advantageous for QTL evaluation. First NIL library, which consisted of a series of NILs, currently referred to as chromosomal segment substitution lines (CSSLs), each having different chromosomal segments originating from a wild donor parent in the genetic background of cultivated tomato, was developed and used for identification of a yield-related QTL. CSSLs were developed in many plant species, including tomato, Arabidopsis, rice), barley, peanut rye, lettuce and wild soybean. In soybean, one genetic locus from wild soybean increasing yield was also reported. Genomic sequences indicate that the genetic diversity of cultivated soybean is narrow compared with that of landraces and wild soybean (G. soja). Although CSSLs carrying segments of the wild soybean genome have been developed, the availability of CSSLs from out of Japan is limited, and the development of CSSLs using various combinations of soybean germplasms would provide novel breeding materials and increase genetic diversity. In the present study, using progeny from a cross between the leading Japanese cultivar, Enrei, and the Chinese landrace Peking, we developed a high-density linkage map and CSSLs. Enrei is a representative Japanese cultivar with high quality for food processing, whereas Peking is an excellent Chinese germplasm resistant to various diseases and stresses. The genetic distance between these parents is much larger than that between Japanese landraces and Enrei but smaller than wild soybean and Enrei. In addition to re-sequencing information,, genes controlling flowering time and growth habit have been characterized for these parents. Therefore, genomic resources developed in the present study may help to characterize agronomically important genes.

2. Materials and methods

Plant materials

A total of 32 Soybean lines including 16 Japanese breeding varieties, two USA breeding varieties, six Japanese landraces, four exotic landraces and four wild soybean accessions were used to identify SSR polymorphisms (Supplementary Table S1). A cross between Enrei [G. max; accession number in National Agriculture and Food Research Organization (NARO) Genebank, Japan: GmJMC025] as the female parent and Peking (GmWMC084) as the male parent was performed in 2005, and an F2 mapping population was developed. The F2 population (189 plants) and 20 plants of each parent were grown with an inter-row spacing of 80 cm and a hill spacing of 30 cm in the field at NARO in Tsukuba, Ibaraki, Japan (36°01'25.6”N 140°06'59.1”E). All experimental populations were evaluated at the same location. Seeds were sown on 30 May 2007. Images of the parent plants are shown in Fig. 1. The breeding scheme is shown in Supplementary Fig. S1.
Figure 1

Seeds and mature plants of Enrei and Peking. The ruler is in centimeters.

Seeds and mature plants of Enrei and Peking. The ruler is in centimeters.

Development of a backcross population and CSSLs

Enrei was used as the recurrent parent and Peking as the donor parent. Pollen from F1 plants was used to pollinate Enrei flowers before blooming, and >1,000 BC1F1 seeds were obtained in 2006. All BC1F1 seeds were sown, and 999 BC1F1 plants were crossed with Enrei to produce BC2F1 in 2007 and BC3F1 in 2008. BC3F2 seeds were obtained by self-pollinating each BC3F1 plant; the BC3F2 population (999 plants) was sown on 16 June 2010. Next year, CSSLs (103 lines) were selected from the BC3F2 population (as described below) and grown under natural day length in the same field. CSSLs were sown on 24 June 2011 (10 plants per line) and on 27 June 2012 (4–20 plants per line). Selection of CSSLs was based on the data obtained from 320 SSR markers that covered all chromosomes evenly. The proportion and length of donor chromosomal segments were calculated from genetic distances between DNA markers, and the positions of recombination breakpoints in all chromosomes were calculated in individual lines. Candidate lines with low proportions of donor chromosomal segments per chromosome were first selected, and then 103 CSSLs were selected so that the donor segments of 4–6 lines covered each chromosome.

Phenotypic evaluation

Agronomic traits of each plant in the F2, BC3F2 and CSSL populations were evaluated (Table 1). Days to first flowering (DFF) corresponded to the R1 stage. Days to flowering of the top raceme (DFT) corresponded to the R2 stage. Days to harvest (DH) corresponded to the R8 stage. Flowering period (FP = DFT − DFF), reproductive period (RP = DH − DFF) were calculated. Plant height (PH), number of pods (NP), one-hundred-seed weight (SWH) and total seed weight (TSW) were measured in each BC3F2 plant and CSSL population. Genetic variance (heritability) was calculated from the phenotypic variance of each population, F2 and BC3F2, with that of the parents as follows.
Table 1

Description of investigated traits, broard sence heritability and number of QTLs detected in the present study

TraitTrait abbreviationQTL abbreviationTrait evaluationBroad-sense heritability
Number of QTLs detected in each population
Number detected QTLs common between populationsa
F2BC3F2CSSLs (2011)CSSLs (2012)F2BC3F2CSSLF2 vs BC3F2F2 vs CSSLBC3F2 vs CSSL
Days to first flowering (R1)DFFqDFFDays from sowing to first flowering97.6%92.0%77.9%89.0%767676
Days to flowering of top raceme (R2)DFTqDFTDays from sowing to flowering of the top racemeNA96.3%94.3%79.9%Not evaluated66Not evaluatedNot evaluated6
Flowering period (days from R1 to R2)FPqFPR2−R1NA89.0%55.8%51.0%Not evaluated22Not evaluatedNot evaluated2
Days to harvest (R8)DHqDHDays from sowing to harvest (maturity stage)89.7%35.9%94.5%61.8%597557
Reproductive period (days from R1 to harvest)RPqRPHarvesting day–first flowering day74.9%5.2%85.3%48.4%264224
Plant heightPHqPHDistance from the primary leaf node to the base of the top raceme89.8%87.7%93.7%87.1%344334
Number of podsNPqNPNumber of harvested total pods90.6%76.5%81.9%66.9%285225
Seed weight per plant (total seed weight)TSWqTSWWeight of harvested total seedNA72.3%64.5%51.8%288228
Seed weight (100 seeds)SWHqSWHWeight of 100 seeds60.3%63.3%63.6%71.2%Not evaluated77Not evaluatedNot evaluated7
Total215650202149

aPartially confirmed QTLs are included.

Description of investigated traits, broard sence heritability and number of QTLs detected in the present study aPartially confirmed QTLs are included.

Total genomic DNA extraction

DNA of parents and F2 plants was extracted from fresh leaves (3 g) by a Cetyltrimethylammonium Bromide (CTAB) protocol. DNA of cultivars and BC3F2 plants was isolated as described previously. DNA was quantified on a fluorescence microplate reader (ARVO; Perkin Elmer, Boston, MA, USA) according to the manufacturer‘s instructions and was adjusted to 50 and 20 ng/μl for SSR and SNP analyses, respectively.

SSR marker analysis

Marker design: SSR core motifs in Glyma1.01 and in large scaffolds (>100 kb) of Glyma0 were extracted in the read2Marker program with default parameters. Primer pairs to amplify the core motifs were designed in Primer3 software. Three types of primer pairs with different amplicon sizes (small, 80–210 bp; medium, 211–340 bp; large, 341–500 bp) were designed; the parameter settings were Opt_Tm = 60, Min_Tm = 53, Max_Tm = 70 and Max_Poly_X = 3. Sequences of the primer pairs were searched against Glyma1.01 to determine the number of binding sites, amplicon sizes and locations in Genome tester software with default parameters until a single amplicon was obtained (Supplementary Table S2). The SSR motifs that were consistent with BARC soybean potential SSR markers in SoyBase and EST-SSR markers are denoted in Supplementary Table S2. Detection with fluorescently labelled primers: Primers were labelled with 6-FAM, HEX or NED fluorescent dyes. Multiplex PCR mixture (6 µl) contained DNA (10 ng/µl), 0.1 unit/µl of Taq DNA polymerase (Finnzymes), 1× Optimized Taq buffer, primer mix (0.02–0.2 µM each), 200 µM dNTPs and 1.6 M betaine. PCR was performed on a GeneAmp 9700 amplifier (Applied Biosystems, Foster City, USA) as follows: 1 cycle of initial denaturation at 95 °C for 2 min; 40 cycles of denaturation at 94 °C for 30 s, annealing at 55 °C for 2 min and extension at 72 °C for 1 min; and final extension at 72 °C for 3 min. The PCR product was diluted with water (1 : 10), and 1 µl was added to a mixture of Hi-Di formamide (10 µl) and GeneScan 400HD ROX size standard (0.2 µl). The sample was separated on an ABI3730 capillary sequencer (Applied Biosystems). PCR fragments detected simultaneously with three dyes were resolved in ABI GeneMapper v. 4.0 software (Applied Biosystems). To measure allelic size, relative fluorescent units and sizes (bp) of the highest stutter peaks of the 32 accessions were sorted by peak size rounded to integer numbers (Supplementary Table S3). Detection with modified universal fluorescently labelled (UFL) primer: The UFL method was modified to attain multiplex amplification. Short polylinker sequences from the pBluescript vector were used as queries in blastn searches against Glyma1.01; the parameter settings were word_size = 7, evalue = 0.1 and perc_identity = 90. Three oligonucleotide tag sequences that did not show any hits in the soybean genome were used instead of the original M13 universal primer and were fluorescently labelled: 5′-CCACCGACGTGTCGCAC with 6-FAM, CCGTGCAGTCCGTCAGC with HEX and GGTGGCGACTCCTGGAG with NED (all dyes from Applied Biosystems). The concentrations of the oligonucleotide tag sequence (0.04 µM), 5′-tagged forward primer (0.04 µM) and unlabelled reverse primer (0.4 µM each) per marker were optimized to attain multiplex amplification. PCR conditions, detection of PCR fragments, genotyping and allele sizing were as described above. Genotyping of the mapping population: PCR conditions were optimized to attain multiplex amplification for 12–18 fluorescently labelled SSR markers and 9–12 UFL SSR markers. Multiplex PCR mixture (3–5 µl) contained total DNA (50 ng/µl), 1× Multiplex PCR Mix (Qiagen), 1× Q solution and primer mix (described above). The following conditions were used: initial denaturation at 95 °C for 15 min; 18 cycles total of 94 °C for 30 s and 3 cycles each of 68, 66, 64, 62, 60 and 58 °C for 3 min; then 40 cycles of 30 s at 94 °C, 3 min at 55 °C and 1 min at 72 °C; and final extension at 72 °C for 10 min. The primer concentration for fluorescently labelled SSR markers can be reduced to 1/10. Genotyping conditions were as for SSR marker detection.

SNP marker analysis

Sequence-tagged sites containing SNP information developed were used as queries in blastn searches against Glyma1.01 with default parameters. Multiplex assays for 1,000 randomly selected SNPs distributed throughout the genome (Supplementary Table S3) were designed to amplify low-copy sequences in Sequenom Assay Design 3.1 software (Sequenom). The Sequenom MassARRAY system was used for SNP genotyping. Multiplex PCR followed by template-directed single base extension at each SNP site was conducted with a MassARRAY iPLEX Gold kit (Sequenom) following the manufacturer‘s protocol. The genotypes were determined in MassARRAY Typer 4.0 software (Sequenom).

Linkage map construction and QTL detection

The linkage maps of F2 and BC3F2 populations were constructed by using JoinMap v. 4.0 software. The logarithm of odds (LOD) threshold for grouping of DNA markers was 4.0. The marker order was determined using the maximum likelihood mapping algorithm. The recombination frequency was converted into genetic distance (cM) using the Haldane mapping function. Marker genotypes were examined by eye to check whether the pattern of marker segregation changed gradually at each round of map construction in order to ensure the correct marker order. When a discrepancy between the marker order and chromosome assembly was found, additional markers were integrated into the position to confirm the discrepancies. QTLs were analyzed by composite interval mapping implemented in the R/qtl package with threshold values (P < 0.1) that were calculated by the 1,000-permutation test for each trait. The detected peak positions were used for the ‘refineqtl’ and ‘fitqtl’ functions to estimate the maximum-likelihood position for each QTL model, the effects of each QTL and genetic variance. The phenotypic values of CSSLs and Enrei were evaluated in 2011 and 2012 using the Tukey-Kramer test in R v. 3.3.1 software with significance level, P < 0.01.

Comparison of the linkage map with the chromosome assembly

The positions of primer sequences of mapped markers on Glyma1.01 and Wm82.a2.v1 were estimated by similarity searches using Genome tester and/or blastn described above. The first base position of either forward or reverse primer sequences on the coordinates of the chromosome assembly and the size and number of expected PCR products are listed in Supplementary Table S3. When multiple polymorphic fragments are amplified by a primer set, we added the 0.1, 0.2, 0.3 suffixes after name of the marker mapped to different linkage groups. In the comparison of the linkage map with the chromosome assembly, we omitted markers being far from the expected position based on information of surrounding markers and then used only information of the marker position of chromosome assembly which marker designed in Glyma1.01 and Wm82.a2.v1. Thirty-one public SSR markers for which only information on primer sequences but difficult to locate precise position on chromosome assemblies were not included. The Marey map approach was used to visualize the corresponding positions between the linkage map and the genome assemblies. Genetic positions of the markers were interpolated using the cubic spline method with default parameters settings in MareyMap version 1.3.3.

3. Results and discussion

DNA markers

SSR markers has been used in MAS because of multi allelic behavior, easy handling and low cost. Information on their polymorphism, genotyping quality and genetic positions is useful for marker selection. We identified 171,915 SSR loci (62,739 di-, 41, 696 tri-, 154 tetranucleotide and 67,326 compound loci) in the soybean reference sequence Glyma1.01. Because extended regions in the soybean genome are duplicated as a result of ancient polyploidization, novel primer pairs for 148,569 SSR loci were designed to amplify single PCR products to avoid analytical complexity (Supplementary Table S2). Polymorphisms of 2,235 SSR loci and that of previously developed 982 SNP markers were evaluated in 32 soybean germplasms (Supplementary Table S3). Sizes of amplified PCR product containing SSRs and nucleotides for SNPs, the number of PCR products, polymorphic information content and genetic positions estimated from the high-density linkage map are listed in Supplementary Tables S2 and S3.

High-density linkage map

A high-density linkage map was constructed by using a single F2 mapping population. Genomic SSR markers,, EST-SSR markers and the new SSR markers described above were incorporated into the map to cover the entire genome. The map spans 2885.7 cM and contains 1,667 SSR and 510 SNP markers (Fig. 2 and Supplementary Fig. S2, left white bars); the average marker distance is 1.3 cM and maximum distance is 6.8 cM. Severe segregation distortion (P ≤ 0.001) was observed for the markers on Gm06 (115.5–118.4 cM, corresponding to 46.7–47.3 Mbp) and Gm11 (116.3–119.3 cM, 36.1–36.2 Mbp) (Fig. 2, blue lines). In these regions, the frequency of Peking alleles was less than that of Enrei alleles.
Figure 2

A high-density genetic linkage map between Enrei and Peking (left) aligned with a physical map of Williams 82 (Glyma1.01) (right), with marker locations connected by black lines. The regions with suppressed recombination, shown on the same scale on the right, are connected with the linkage map by orange lines and may correspond to the pericentromeric regions. Red trapezoids indicate discrepancies between genetic and physical maps; blue bars indicate regions with severe segregation distortion. The discrepancies that have been resolved in the genome assembly Wm82.a2.v1 are indicated by red dot trapezoids.

A high-density genetic linkage map between Enrei and Peking (left) aligned with a physical map of Williams 82 (Glyma1.01) (right), with marker locations connected by black lines. The regions with suppressed recombination, shown on the same scale on the right, are connected with the linkage map by orange lines and may correspond to the pericentromeric regions. Red trapezoids indicate discrepancies between genetic and physical maps; blue bars indicate regions with severe segregation distortion. The discrepancies that have been resolved in the genome assembly Wm82.a2.v1 are indicated by red dot trapezoids. We compared marker locations on the linkage map (Fig. 2, left white bars) with those on Glyma1.01 (Fig. 2, right black bars) and Wm82.a2.v1 assemblies (Supplementary Table S3). Because important flowering- and yield-related QTLs in the region of Gm11 (11.0–14.8 Mb) of Glyma1.01 assembly are absent on Chr. 11 of Wm82.a2.v1, we discussed based on the Glyma1.01 assembly. The corresponding region was found to be included in two unassigned scaffolds of Wm82.a2.v1 assembly, scaffold_21 and scaffold_32 (Supplementary Table S3). In total, 2,155 of 2,177 markers (99%) were anchored on Glyma1.01. The short-range order of most markers on the linkage map closely agreed with that of their physical positions in the genome assembly, but a wide range of regions differed at the distal ends of chromosomes. For example, reverse marker order on Gm05, Gm11, Gm13, Gm14 and Gm19 was observed at the distal ends of the corresponding linkage groups (Fig. 2, red rectangles). In particular, the orientation of the top of Gm13 (14.8–15.6 Mb) containing nucleolar organizer region (NOR) was reversed in our map. Similarly, by using fluorescent CentGm probes, the reverse orientation of this region was previously detected in a cytogenetic study of Peking. These large discrepancies have been resolved in Wm82.a2.v1, although some differences, especially on Chr. 05 (8.6–22.0 Mb) and Chr. 11 (11.1–28.0 Mb), remain. As for Chr. 11, the two unassigned scaffolds described above are not integrated into the two high density linkage maps, because no SNP marker to anchor the scaffolds is not available. In contrast, order of SNP markers in the region of Chr. 05 are consistent among the two maps, therefore, the discrepancy with information of the present map might reflect the genomic differences in different accessions. Thus, the differences in the present map provides information to identify regions in the genome assembly where additional information is required to resolve marker order and assign remaining scaffolds.

Relationships between physical and genetic distances

Integration of markers that have been used for MAS and QTL mapping by breeders and researchers into the high-density linkage map allowed us to determine relative genetic and physical relationships through marker positions. Corresponding position between the linkage map and the genome assembly was visualized by the Marey map approach. The corresponding physical distance to genetic distance was ∼360 kb/cM, assuming a genome size of ∼1.1 Gb, although positional biases (50 kb/cM–7 Mb/cM) were found. The ratio of physical to genetic distance varied considerably depending on the chromosomal region, and looked like a sigmoid curve. For example, the ratio in the middle regions of Gm06 (100–110 cM) and Gm11 (94–103 cM) was >10 times that in the other regions (Fig. 3). Such regions with highly suppressed recombination flanking the centromeres are termed the pericentromeres. In each chromosome, highly suppressed recombination between markers was observed in the pericentromere, but the extent of suppression differed among chromosomes (Fig. 2). A pair of markers in such regions would provide insufficient information on recombination for genetic mapping even if they are located physically far from each other in the reference sequence. Therefore, information on recombination frequency across the genome is useful for genome-wide association study and for marker choice for MAS and QTL mapping. We estimated the genetic positions of all markers from the Marey map (Supplementary Tables S2 and S3). The choice of markers based on their genetic positions rather than their physical positions in the reference sequences would reduce the cost of MAS for breeding and QTL mapping.
Figure 3

Marey maps for chromosomes Gm06 and Gm11. Small dots indicate marker locations. Black boxes indicate the locations of the centromere repeat sequences. The recombination rate apparently decreases in the pericentromeric region. Arrow indicates the location of the E1 gene.

Marey maps for chromosomes Gm06 and Gm11. Small dots indicate marker locations. Black boxes indicate the locations of the centromere repeat sequences. The recombination rate apparently decreases in the pericentromeric region. Arrow indicates the location of the E1 gene. Surprisingly, low-recombination regions covered ∼555 Mb (∼60%) of the published soybean genome sequence and sometimes they formed patches in euchromatic regions (e.g. on Gm11 76.5–78.6 cM; Fig. 3). Interestingly, the distribution of low-recombination regions in all chromosomes coincided with the abundance of long terminal repeat retrotransposons reported in soybean. Gm07 and Gm16 had more than one apparent peak of suppressed recombination in addition to the pericentromeric regions.

QTLs in F2 population

High-density genetic linkage maps have made it possible to genetically dissect flowering time differences between parents, and thus to better understand the genetic basis for soybean flowering by comparing previously reported flowering- and/or maturity-related QTLs. Both parents belong to soybean maturity group IV; DFF was 48.2 for Enrei and 60.7 for Peking. The F2 population showed transgressive segregation (Supplementary Fig. S3), and the broad-sense heritability of DFF was 75.5%. Seven QTLs (qDFF, Table 2) for DFF were identified. Both parents had alleles that accelerated flowering; Enrei had such alleles on Gm10, Gm12, Gm16 and Gm19 and Peking on Gm06, Gm11 and Gm13. Despite the high genetic complexity of the F2 population, almost all phenotypic variation in DFF was explained (91.4%) by the four major QTLs (on Gm06, Gm10, Gm12 and Gm19) and three minor QTLs (on Gm11, Gm13, Gm16). Only qDFF_Gm06 was located close to the low-recombination region. In F2 population, 19 out of 21 QTLs (expect for qDFF_Gm13 and qPH_Gm10) showed higher LOD score than 1% significant level threshold value. Surprisingly, most QTLs for other traits (PH, NP, SWH, DH and RP) were clustered with QTLs for DFF (Table 2), suggesting two possibilities that the latter have pleiotropic effects on other traits or genes for these traits are clustered together.
Table 2

QTLs detected in F2 and BC3F2 populations

QTLs detected in F2 populationa
QTLs detected in BC3F2 populationa
TraitQTL nameQTL positionaFlanking marker 1Flanking marker 2Closest DNA markerLODPVE (%)Additive effect of Peking alleles.d.Dominance effects.d.QTL positionbFlanking marker 1Flanking marker 2Closest DNA markerLODPVE (%)Additive effect of Peking alleles.d.Dominance effects.d.Confirmation of QTL with CSSLcConfirmed QTLs between F2 and BC3 F2 populationCandidate genes or related QTLsID of related QTLs reported in SoybaseReferences
DFFqDFF_Gm19Gm19@118.9FT3-751C19-BARC- 018175-02533FT3-75151.2**23.94.90.21.60.3Gm19@106.0Sat_286FT3-751FT3-751174.8**29.53.30.10.50.2ConfirmedConfirmedDt1/E3Liu et al. 2009, Watanabe et al. 2009
qDFF_Gm06Gm06@102.0sF3HC06-BARC- 041345-07969C06-BARC- 041345-0796957.5**29.6–5.50.22.60.4Gm06@106.4sF3HFT1SSR9FT1SSR9145.1**22.7–3.20.11.20.2ConfirmedConfirmedE1Xia et al. 2012
qDFF_Gm12Gm12@33.0C12-BARC- 015603-02006s024200450s02420045036.3**13.63.70.20.30.4Gm12@49.0s024200450s023500372s02420045074.5**9.82.00.1–0.10.2ConfirmedConfirmedqDFF_Gm12First flower 25–2Kuroda et al. 2013
qDFF_Gm10Gm10@116.0C10-BARC- 015003-01948C10-BARC- 015925-02017C10-BARC- 015925-0201737.4**14.34.00.21.20.4Gm10@103.0Satt331FT2-300H01dFT2-300H01d70.7**9.21.90.10.20.2ConfirmedConfirmedE2Watanabe et al. 2011
qDFF_Gm11Gm11@71.0s008000014-2T001111280mT001111280m18.9**5.6–2.40.20.00.4Gm11@61.5GMES0766T001111280mT001111280m38.7**4.7–1.20.1–0.20.2Confirmed(partially)ConfirmedqDFF_Gm11First flower 8–4Yamanaka et al. 2001
qDFF_Gm16Gm16@33.0C16-BARC- 020505-04644GMES0775GMES077510.3**2.71.70.20.80.4Gm16@68.5GMES4751s026800043GMES475116.9**1.9–0.90.1–0.20.2ConfirmedConfirmedGmFT5aTakeshima et al. 2016
qDFF_Gm13Gm13@7.0s000702358-2s000701869-2s000701869-26.8*1.7–1.50.3–0.10.4––––––––––––––––––––Confirmed-qDFF_Gm13First flower 11–4Gai et al. 2007
DFTqDFT_Gm19_1dNANANANANANANANANANAGm19@102.0Sat_286FT3-751FT3-751285.3**55.38.80.20.00.4ConfirmedNADt1Liu et al. 2009
qDFT_Gm19_2dNANANANANANANANANANAGm19@106.0Sat_286FT3-751FT3-751193.8**37.94.80.22.80.3ConfirmedNAE3Watanabe et al. 2009
qDFT_Gm06NANANANANANANANANANAGm06@105.6Satt277sF3HsF3H78.8**11.5–3.50.21.10.3ConfirmedNAE1
qDFT_Gm10NANANANANANANANANANAGm10@103.0Satt331FT2-300H01dFT2-300H01d74.5**10.73.10.20.10.3ConfirmedNAE2
qDFT_Gm12NANANANANANANANANANAGm12@49.0s024200450s023500372s02420045068.5**9.73.00.2–0.20.3ConfirmedNAqDFF_Gm12
qDFT_Gm11NANANANANANANANANANAGm11@60.0GMES0766T001111280mT001111280m35.3**4.6–2.10.20.40.3ConfirmedNAqDFF_Gm11
FPqFP_Gm19_1dNANANANANANANANANANAGm19@100.0Sat_286FT3-751FT3-751242.4**67.45.40.1–0.20.3Confirmed(partially)NADt1
qFP_Gm19_2dNANANANANANANANANANAGm19@102.0Sat_286FT3-751FT3-751131.9**45.82.70.22.60.3Confirmed(partially)NAE3
DHqDH_Gm19Gm19@118.0Satt229FT3-751FT3-75122.4**14.28.10.72.51.0Gm19@105.0Sat_286FT3-751FT3-75139.7**12.16.10.5–0.60.9ConfirmedConfirmedDt1/E3
qDH_Gm06Gm06@102.0sF3HC06-BARC- 041345-07969C06-BARC- 041345-0796934.8**26.2–11.20.72.11.0Gm06@104.4Satt277sF3HsF3H30.6**9.1–5.40.50.80.9ConfirmedConfirmedE1
qDH_Gm11Gm11@71.0s008000014-2T001111280mT001111280m29.9**21.1–10.00.70.01.0Gm11@41.0GMES0045Satt197Satt19718.8**5.5–4.30.50.51.1ConfirmedConfirmedqDFF_Gm11Reproductive stage length 8–1Komatsu et al. 2012
qDH_Gm10Gm10@119.0FT2-300H01ds005205820-2s005205820-217.6**10.57.30.81.91.0Gm10@103.0Satt331FT2-300H01dFT2-300H01d13.3**3.83.40.5–0.40.9ConfirmedConfirmedE2
qDH_Gm12Gm12@33.5s024200450s024200389s02420045016.4**9.66.40.71.31.0Gm12@49.0s024200450s023500372s02420045011.8**3.43.00.50.00.9ConfirmedConfirmedqDFF_Gm12
qDH_Gm02––––––––––––––––––––Gm02@48.0Satt701Satt296Satt7018.2**2.32.30.51.61.1––––qRP_Gm02Pod maturity 22–4Reinprecht et al. 2006
qDH_Gm13––––––––––––––––––––Gm13@125.0Satt362s003004926-2Satt3625.2**1.5–2.30.52.71.0Confirmed––qRP_Gm13Novel
qDH_Gm17––––––––––––––––––––Gm17@46.0T001709449lT001710954mT001709449l5.0*1.4–2.50.61.31.0Confirmed––qRP_Gm17Reproductive period 1–7Wang et al. 2015 (position is slightly different)
qDH_Gm01––––––––––––––––––––Gm01@43.8Satt221Satt370Satt2214.6*1.32.20.5–2.20.9––––qRP_Gm01Pod maturity 1–2Keim et al. 1990B
RPqRP_Gm11Gm11@70.0s008000014-2T001111280ms008000014-216.4**27.4–7.00.70.11.1Gm11@38.0GMES0045Satt197GMES004512.3**4.6–3.40.50.21.1ConfirmedConfirmedqDFF_Gm11Reproductive stage length 8–1Komatsu et al. 2012
qRP_Gm02––––––––––––––––––––Gm02@49.0Satt701Satt296Satt7019.3**3.52.60.61.21.1––––qRP_Gm02
qRP_Gm13––––––––––––––––––––Gm13@124.0Satt362s003004926-2Satt3627.5**2.8–2.60.52.81.0Confirmed––qRP_Gm13Novel
qRP_Gm19––––––––––––––––––––Gm19@105.0Sat_286FT3-751FT3-7517.2**2.72.70.5–1.10.9Confirmed––Dt1/E3
qRP_Gm06Gm06@98.4Satt277C06-BARC- 014491-01562Satt27710.9**16.9–5.40.7–0.91.1Gm06@104.4Satt277sF3HsF3H6.3**2.3–2.20.5–0.10.9ConfirmedConfirmedE1
qRP_Gm16––––––––––––––––––––Gm16@54.5Satt596s014300091-2Satt5965.7**2.12.30.5–0.51.0––––GmFT5a
qRP_Gm17––––––––––––––––––––Gm17@46.0T001709449lT001710954mT001709449l4.5**1.6–2.30.51.10.9Confirmed––qRP_Gm17
qRP_Gm01––––––––––––––––––––Gm01@44.8Satt221Satt370Satt2214.4**1.62.10.5–2.00.9––––qRP_Gm01
PHqPH_Gm19_1dGm19@114.0Satt664C19-BARC- 030101-06809Satt66430.4**42.715.51.10.51.6Gm19@101.0Sat_286FT3-751FT3-751129.8**45.714.40.5–3.51.0ConfirmedConfirmedDt1
qPH_Gm19_2d––––––––––––––––––––Gm19@102.0Sat_286FT3-751FT3-75178.7**27.67.00.67.21.1Confirmed––E3
qPH_Gm10Gm10@117.2FT2-300H01ds005205820-2FT2-300H01d6.3*6.45.81.14.11.6Gm10@102.0Satt331FT2-300H01dFT2-300H01d17.4**5.34.00.5–0.40.9ConfirmedConfirmedE2
qPH_Gm06Gm06@103.0s005700017s007100484s00570001716.6**19.3–10.51.13.81.6Gm06@106.4sF3HFT1SSR9FT1SSR915.9**4.8–3.80.51.10.9ConfirmedConfirmedE1
NPqNP_Gm19Gm19@116.0C19-BARC- 030101-06809Satt229C19-BARC- 030101-0680915.0**26.484.49.717.0##Gm19@101.0Sat_286FT3-751FT3-751148.2**39.380.03.530.06.8ConfirmedConfirmedDt1/E3
qNP_Gm12––––––––––––––––––––Gm12@50.0s024200450s023500372s02420045033.9**6.831.63.06.76.0Confirmed––qDFF_Gm12
qNP_Gm10––––––––––––––––––––Gm10@109.0FT2-300H01dT001045537mFT2-300H01d27.7**5.428.33.15.35.6––––E2
qNP_Gm06Gm06@103.0s005700017s007100484s00570001711.5**19.3−73.09.59.2##Gm06@104.4Satt277sF3HsF3H23.5**4.6–29.73.25.15.8ConfirmedConfirmedE1
qNP_Gm11––––––––––––––––––––Gm11@61.5GMES0766T001111280mT001111280m15.3**2.9–21.22.81.75.4Confirmed––qDFF_Gm11
qNP_Gm16––––––––––––––––––––Gm16@69.5GMES4751s026800043s0268000437.4*1.4–18.53.216.36.1––––GmFT5a
TSWqTSW_Gm19NANANANANANANANANANAGm19@102.0Sat_286FT3-751FT3-751155.5**37.337.91.718.33.2––––Dt1/E3
qTSW_Gm12NANANANANANANANANANAGm12@50.0s024200450s023500372s02420045044.2**8.018.11.51.72.9Confirmed––qDFF_Gm12
qTSW_Gm11NANANANANANANANANANAGm11@61.0GMES0766T001111280mT001111280m36.2**6.5–16.31.41.42.6Confirmed––qDFF_Gm11
qTSW_Gm06NANANANANANANANANANAGm06@105.4Satt277sF3HsF3H35.7**6.4–17.41.53.22.6Confirmed––E1
qTSW_Gm10NANANANANANANANANANAGm10@110.0FT2-300H01dT001045537mFT2-300H01d20.9**3.610.71.55.22.8Confirmed––E2
qTSW_Gm16NANANANANANANANANANAGm16@68.5GMES4751s026800043GMES47518.7**1.5–9.71.59.42.9Confirmed––GmFT5a
qTSW_Gm20NANANANANANANANANANAGm20@99.0T002042675sSct_189T002042675s6.8**1.1–9.42.01.53.9Confirmed––qTSW_Gm20Seed yield 15–15Kabelka et al. 2004
SWHqSWH_Gm08––––––––––––––––––––Gm08@46.0GMES1620AY262686BAY262686B33.3**10.1–2.80.21.90.4Confirmed––qSWH_Gm08Seed weight 4–5Maughan et al. 1996 (close to I locus)
qSWH_Gm06Gm06@98.4Satt277C06-BARC- 014491-01562Satt2778.9**17.1–2.40.41.70.5Gm06@106.4sF3HFT1SSR9FT1SSR929.1**8.8–2.30.21.10.4Confirmed(partially)ConfirmedE1
qSWH_Gm11––––––––––––––––––––Gm11@59.0GMES0766T001111280mT001111280m22.3**6.6–1.80.20.00.4Confirmed––qDFF_Gm11
qSWH_Gm17––––––––––––––––––––Gm17@51.0T001709449lT001710954mT001710954m15.1**4.4–2.00.30.50.5Confirmed––qSWH_Gm17Seed yield 5-2Renyna and Sneller 2001
qSWH_Gm19Gm19@118.9FT3-751C19-BARC- 018175-02533FT3-7518.4**15.92.40.41.10.5Gm19@107.1Sat_286FT3-751FT3-75110.7**3.11.10.20.10.3ConfirmedConfirmedDt1/E3
qSWH_Gm20––––––––––––––––––––Gm20@98.0T002042675sSct_189T002042675s6.6**1.9–1.50.31.10.5Confirmed(partially)––qSWH_Gm20
qSWH_Gm13––––––––––––––––––––Gm13@16.0Satt146Satt252Satt2526.6**1.9–1.10.20.00.5Confirmed(partially)––qDFF_Gm13
qSWH_Gm02––––––––––––––––––––Gm02@22.0Sat_096Sat_351Sat_3515.9**1.7–0.90.2–0.40.5Confirmed––qSWH_Gm02

NA, the trait was not evaluated in the F2 population; LOD, logarithm of odds of the presence of a QTL; PVE, percentage of variance explained by the QTL.

Sowing date of F2 population and BC3F3 population were 30 May 2007 and 24 June2011, respectively.

Indicated as chromosome name and position in cM separated by @.

‘Confirmed partially’ indicates that a singificant difference was observed between Enrei and a CSSL habouring several donor choromosomal fragments.

Effects of the QTLs corresponding to E3 and Dt1 loci were calculated separately.

Significant at P < 0.05 after 1,000 permutation; **Significant at P < 0.01 after 1,000 permutation.

QTLs detected in F2 and BC3F2 populations NA, the trait was not evaluated in the F2 population; LOD, logarithm of odds of the presence of a QTL; PVE, percentage of variance explained by the QTL. Sowing date of F2 population and BC3F3 population were 30 May 2007 and 24 June2011, respectively. Indicated as chromosome name and position in cM separated by @. ‘Confirmed partially’ indicates that a singificant difference was observed between Enrei and a CSSL habouring several donor choromosomal fragments. Effects of the QTLs corresponding to E3 and Dt1 loci were calculated separately. Significant at P < 0.05 after 1,000 permutation; **Significant at P < 0.01 after 1,000 permutation. Our high-density linkage map offers an opportunity to explore candidate genes for flowering time and related QTLs. For instance, the physical location of two major QTLs, qDFF_Gm19 and qDFF_Gm10, estimated from the Marey map was 47.6 Mb and 44.9 Mb, respectively. Genes responsible for E3 and E2 are located at ∼47.5 Mb on Gm19 and ∼44.7 Mb on Gm10, respectively. The estimated physical locations of the two QTLs differ from those of E2 and E3 by only ∼200 kb. Thus, the positions of SSR markers and interpolated genetic distance provide great potential to narrow down the boundaries of genomic regions that include candidate genes.

QTLs in BC3F2 population

Nine agronomic traits (Table 1) of 999 BC3F2 individuals were evaluated and the average values of each trait was compared with those of the recurrent parent (Enrei); the average value of each trait of BC3F2 were almost the same as those of Enrei (Supplementary Table S4, Fig. S4). Whereas the effects of Peking alleles increase the phenotypic variance observed in BC3F2. Among the genetic variances (heritability) of all traits, some traits such as DFF, DFT and PH were highly heritable, whereas heritability of yield-related traits, such as TSW and SWH, were intermediate, and that of DH and RP was very low (Table 1). Enrei tends to suffer from green stem disorder, caused by biotic and abiotic stresses; this disorder, which delays harvesting, would broaden the range of the values of maturity-related traits of the recurrent parent and the BC3F2 population and decrease the heritability of these traits. The linkage map was constructed for the BC3F2 population to perform QTL mapping for these nine traits. The genotype of each BC3F2 line was determined by using 320 SSR markers selected from the high-density genetic linkage map described above, so that they covered the whole genome (Supplementary Fig. S5). The ratios of Peking-homozygous (3.1%), heterozygous (6.3%) and Enrei-homozygous (90.6%) loci in the population coincided well with the theoretical ratios (3.3%, 6.0% and 89.1%, respectively). The map consisted of 20 linkage groups covering 2475.3 cM with an average interval of 8.3 cM, and showed a genetic order of DNA markers similar to that expected from the high-density linkage map of the F2 population. In total, 56 QTLs scattered over 12 chromosomes were identified from the positions of LOD peaks for the nine traits. Six QTLs for DFF, six for DFT, two for FP, nine for DH, eight for RP, four for PH, six for NP, seven for TSW and eight for SWH showed significant LOD scores. Among them, 53 QTLs (expect for qDH_Gm01, qDH_Gm17 and qNP_Gm16) showed higher LOD score than 1% significant level of threshold value (Table 2). Six out of seven QTLs for DFF detected in the F2 population were reproducible in the BC3F2 population. As in the F2 population, many QTLs for other traits were clustered with QTLs for DFF, suggesting that QTLs for flowering time strongly affect other traits. However, additional QTLs for maturity and yield traits were separate from DFF QTLs in the BC3F2 population. We previously characterized the sequences of the known flowering genes, E1–E4, in many soybean cultivars; on this information, Enrei has E1, e2, e3 and E4 alleles, and Peking has E1, E2, E3 and E4 alleles. Therefore, we expected that only the E2 and E3 loci would segregate in the population; however, qDFF_Gm06 was identified in a region close to the E1 locus (Table 2). Allelic effects of qDFFs agreed well with the results of QTL analysis in the F2 population. In addition, two soybean orthologs of Arabidopsis Flowering locus T (FT), GmFT2a and GmFT5a delay flowering in cultivars adapted to northern Japan. The location of qDFF_Gm16 was very close to the GmFT5a locus, and the Peking allele delayed flowering. A major QTL for DFT, qDFT_Gm19, was identified close to the Dt1 locus. The Peking allele significantly delayed flowering of the top raceme. The dominant allele of Dt1, which is a homolog of Arabidopsis TERMINAL FLOWER 1, controls indeterminate growth habit in soybean. The growth habit of Peking is closer to that of a semi-determinate isoline (Dt1, Dt2) than to that of a determinate isoline (dt1, dt2) in top-leaf and stem traits, but the Peking allele is controlled by dt1-t, which is allelic to dt1. Together with the semi-determinate growth of Peking, delaying flowering of the top raceme will increase both the number of nodes on the main stem and plant height. The corresponding traits, NP and PH, were controlled by QTLs detected near the Dt1 locus (Fig. 4). The genetic variance of DFT and PH explained by the Dt1 locus accounted up to 55.3% and 45.7%, respectively, of total variance (Table 2), indicating that the Dt1 locus strongly affects these traits.
Figure 4

Whole-chromosome view of the graphical genotypes of the CSSLs and locations of QTLs identified in the BC3F2 population. Marker genotypes: black, Peking homozygous; white, Enrei homozygous; gray, heterozygous. QTL positions on the right side are in black.

Whole-chromosome view of the graphical genotypes of the CSSLs and locations of QTLs identified in the BC3F2 population. Marker genotypes: black, Peking homozygous; white, Enrei homozygous; gray, heterozygous. QTL positions on the right side are in black. The locations of QTLs for yield-related traits NP and TSW in the BC3F2 population were very similar to those of QTLs for DFF in the F2 population. The ratios of genetic variance explained by six flowering-time QTLs and the Dt1 locus were examined but the order of effects of qDFFs did not coincide with that of post-flowering including yield-related traits. For example, qRP_Gm11, detected at the same locus as qDFF_Gm11, had a larger additive effect on RP (−3.4 days) than on DFF (−1.2 days). Each QTL for DFF had a different effect on the phenotypic variance of other traits, likely reflecting functional differences between the underlying genes. Hence, dissection of these QTLs as single Mendelian factors would provide more precise information about their effects. Four QTLs for SWH were identified on Gm02, Gm08, Gm17 and Gm20, where no QTLs for DFF were identified. None of the QTLs from small-seeded Peking increased SWH. The Peking allele of qSWH_Gm08 had the strongest effect of decreasing SWH (10% of phenotypic variance in the BC3F2 population explained). Because this QTL is close to Rhg4, an important locus for cyst nematode resistance of Peking, pre-breeding of an Rhg4 NIL to remove such an unfavourable Peking allele is important for breeding of large-seeded cultivars. Previous study identified a QTL (qSW17-1OA) with stable effects on seed weight in diverse environments over several years. The genetic position of qSWH_Gm17 is likely to be the same as that of qSW17-1OA and some other QTLs in previous study (Table 2). The locations of qSWH_Gm20 and qTSW_Gm20 were close to that of Ln, which controls leaflet shape and seed size. The gene responsible for Ln is homologous to Arabidopsis JAGGED. Although most Peking alleles had negative effects on yield-related traits in the Enrei genetic background, two QTLs related to late flowering (qDFF_Gm10, qDFF_Gm19) and one QTL extending the reproductive period (qRP_Gm19) associated with increased TSW in this genetic background (Table 2).

Development of CSSLs and confirmation of QTLs

The graphical genotypes of the selected 103 CSSLs are shown in Fig. 4. The average length (± standard deviation) of the donor chromosomal segment was 85.4 ± 42.1 cM for homozygous and 130.8 cM ± 59.6 cM for heterozygous alleles. A few donor segments other than the target segment remained in the genetic background of the recurrent parent; therefore, further backcrossing with MAS would be necessary to eliminate these extra segments. However, CSSLs harboring different donor chromosomal segments from Peking were still useful to dissect QTLs as single genetic factors and to evaluate the genetic effects of individual QTLs. Two-year evaluations of nine agronomic traits using 103 CSSLs revealed that 50 of these lines had at least one trait that was significantly different from that of Enrei by the Tukey-Kramer test (P < 0.01). Most lines whose donor segments contained QTLs originating from Peking showed significantly different phenotypes. These differences agreed well with the effects of QTLs identified in the BC3F2 population (Supplementary Table S5). Of the 57 QTLs detected in the F2 and BC3F2 populations, 49 were also detected in CSSLs (Table 1). Therefore, we considered that majority of QTLs reported in this study are highly reliable. Interestingly, four CSSLs B0704, B0804, B0816 and B0879 had the same genotype as Enrei at all DFF QTLs but had a significantly (P <0.01) different DFF from Enrei (Supplementary Table S5). The results indicate that the effect of a novel QTL appeared in the simple genetic background of a CSSL but was undetectable in the BC3F2 population. In 2011 and 2012, 36 CSSLs showed significant differences in TSW from Enrei; 27 of them had QTLs for DFF, RP and SWH, but genetic factors in the remaining lines were not identified. Increasing the number of plants used for evaluation or crossing with Enrei would be necessary to uncover the cause of the increase in TSW and to use this QTL in soybean breeding programs. We further evaluated the progeny of each line heterozygous for each of six flowering-time QTLs (E1, E2, E3, qDFF_Gm11, qDFF_Gm12 and GmFT5a) to evaluate whether the chromosomal segments from Peking contained flowering time QTLs. Progeny classification according to the genotypes of SSR markers near each QTL showed significant association between phenotypic values and QTL genotypes (data not shown). Representative graphical genotypes of two CSSLs harboring donor chromosomal segments that included qDFF_Gm11 (B0015 and B0676) and the effect of qDFF_Gm11 on the phenotype are shown in Fig. 5. Both lines differed significantly (P <0.01) from Enrei in RP (Fig. 5B). qDFF_Gm11 had the fifth strongest effect among seven DFF QTLs (Table 2), whereas qDFF_Gm11 had the strongest effect on RP among all RP QTLs. The decreased RP resulted in a clear difference in the maturity phenotype (Fig. 5C), and this QTL also affected TSW and SWH (Supplementary Table S4). Comparison of the effects of qDFF(qRP)_Gm11 on different traits suggests that the gene responsible for this QTL probably extends the pod-filling period. The results described above indicate that CSSLs developed in this study may help to dissect the genes underlying the detected QTLs into single genetic factors and provide breeding materials with the genetic background of an elite Japanese cultivar.
Figure 5

Confirmation of QTL effects on the reproductive period using CSSL lines. A, Graphical genotypes of the CSSLs B0676 and B0015. These lines have donor (Peking) chromosomal segments containing qDFF_Gm11, which affects the length of the reproductive period (arrowheads). B, Reproductive periods of the two CSSLs and the recurrent parent Enrei (En). C, Representative images of CSSL B0676 (right) in comparison with the recurrent parent Enrei (left). Early maturity in B0676 is caused by a chromosome segment that includes qDFF_Gm11 from Peking.

Confirmation of QTL effects on the reproductive period using CSSL lines. A, Graphical genotypes of the CSSLs B0676 and B0015. These lines have donor (Peking) chromosomal segments containing qDFF_Gm11, which affects the length of the reproductive period (arrowheads). B, Reproductive periods of the two CSSLs and the recurrent parent Enrei (En). C, Representative images of CSSL B0676 (right) in comparison with the recurrent parent Enrei (left). Early maturity in B0676 is caused by a chromosome segment that includes qDFF_Gm11 from Peking.

Novel flowering-related genes

We identified seven QTLs related to DFF in F2, BC3F2 populations and selected CSSLs (Table 2 and Supplementary Table S5). Among soybean maturity genes, the classical E1 locus has the strongest effect on flowering time. In response to photoperiod, a putative transcription factor encoded by E1 controls flowering time by regulating the expression of FT genes. The detection of qDFF_Gm06 close to E1 in the F2 and BC3F2 populations suggests that the E1 allele differs between Peking and Enrei. We assume that Peking has a recessive allele because of an early flowering effect of this QTL (Table 2). However, the coding region of Peking E1 is reported to be identical to that of Enrei E1. Thus, Peking may have a recessive allele different from the known one, because several different types of recessive alleles have been identified for soybean maturity genes, such as the E4 locus, and the promoter region of Peking has a high similarity to those of other cultivars with recessive e1 alleles. Alternatively, a novel gene affecting soybean flowering time and maturity may be located in this region. The qDFF_Gm06 was located in the pericentromeric region (Fig. 3). The other maturity gene E7 locus for flowering is located on the same chromosome and is genetically tightly linked to the E1 locus. Fine mapping would be needed to resolve these possibilities. Genes responsible for qDFF_Gm11, qDFF_Gm12 and qDFF_Gm13 (and nearby QTLs for other traits) have not been characterized. QTLs corresponding to qDFF_Gm12 have been identified in two different populations derived from crosses between wild and cultivated soybeans. Satt442 (the closest DNA marker in the above study) was located close to the marker s024200450 in the present study. Previous study reported QTLs at positions similar to that of qDFF_Gm11. They identified a strong QTL, named qDfm1 (Duration from flowering to maturity), using RILs derived from a cross between Fukuyutaka and Ippon-sango in a low-latitude region of Japan (Kyushu, 32°52′N). Although the positions of qDFF_Gm11 and qDfm1 are close to each other on the linkage map, the effect of qDfm1 was not associated with DFF. The difference might be related to population size (192 F2 or 999 BC3F2 plants vs. 143 RILs) and a short photoperiod at low latitudes. In any case, both QTLs control RP under different photoperiod and temperature conditions (in the middle and southern parts of Japan). If the same gene is responsible for these QTLs, the Peking allele, which shortens RP, would be useful in wide latitudinal range. The evaluation of genetic interactions between E genes and qDFF_Gm11 by crossing CSSLs having different qDFF alleles would be important for controlling maturity.

Future characterization of useful traits in Peking

Peking was first identified as material resistant to the soybean cyst nematode H. glycines. Peking was introduced into the USA from Beijing, China, in 1906 (previous accession number: PI17852B; current USDA accession number: PI548402). However, other USDA accessions named Peking differ from PI548402 by RFLP analysis: PI297543 (introduced from Hungary in 1964), PI438496 and PI438497 (from the Russian Federation in 1979). The Peking accession used in the present study revealed an SSR genotype patterns very similar to those of PI548402, but quite different from the patterns of PI438496A, PI438496B, PI438496C, PI438497 and two pure lines, PI548205 and PI548359, derived from Peking in the USDA germplasm collection, and from the pattern of JP28432 in NARO Genebank (data not shown). DNA markers for the soybean cyst nematode resistance genes Rhg4 and Rhg1, based on functional SNPs in Peking, have been reported. Previous study isolated the Peking gene Rps1k, which promotes resistance to stem rot disease caused by Phytophthora sojae races 2 and 11. Peking has also been used in studies of interactions with nitrogen-fixing rhizobia (Rhizobium and Sinorhizobium species) owing to their host specificity and genetic control of symbiosis by a host gene. Peking carries rfg1, which controls effective nodulation of fast-growing rhizobia, and Rj4, which controls nodulation of slow-growing bradyrhizobia. The responsible genes were isolated for rfg1 and that for Rj4. Peking is also resistant to the reniform nematode Rotylenchulus reniformis. Peking has a Rsv4 gene for resistance to soybean mosaic virus and DNA markers have been developed for this gene. A recessive gene, rpv2, for resistance to peanut mottle virus has been reported in Peking. Peking is resistant to many isolates of the frogeye leaf spot pathogen, Cercospora sojina, which is controlled by a single dominant gene, Rcs, mapped near the SSR marker Satt244, and another resistance gene, Rcs3. Peking is tolerant to Pseudomonas syringae pv. glycinea race 6, which cause bacterial blight disease, and to soybean dwarf disease, soybean stem canker and corn earworm. Its seed-flooding tolerance at the germination stage has been explained by the structural characteristics of its seeds, and is controlled by four QTLs. The success rate of somatic embryogenesis from immature embryo cultures of Peking is reportedly high, and six QTLs associated with somatic embryogenesis have been identified. Susceptibility of Peking to tumor formation by Agrobacterium tumefaciens is controlled by several genes, but these genes have not been isolated. The evaluation of CSSLs for resistance to these diseases and to stress would help to identify and characterize the resistance genes.

4. Conclusion

We developed a high-density linkage map and CSSLs carrying chromosomal segments from the Chinese soybean Peking in the background of the Japanese soybean Enrei. The map provides a detailed genetic framework within which to use molecular markers for breeding and to obtain a precise assembly of the genome sequence in the Japanese soybean genetic background. The marker order on the linkage map agreed well with the new genome assembly Wm82.a2.v1, but large differences were identified on Chrs. 05 and 11. These CSSLs are a unique resource that would be useful for evaluating minor QTLs as single genetic factors in a uniform genetic background. We demonstrated that many QTLs related to basic agronomic traits detected in the F2 and BC3F2 populations were reproducible in the CSSLs. Peking has many useful genes for resistance to diseases, pests and stress, but most of these genes have not yet been characterized. CSSLs developed in this study would be a good resource for us in developing new cultivars harboring resistance genes and for gene isolation by positional cloning. To increase the genetic diversity of soybean for breeding, finding and confirmation of QTLs associated with agronomic traits using a mapping population derived from these lines would be necessary. CSSLs developed in this study would also be good starting materials for removing the unfavourable characteristics of Peking. Click here for additional data file.
  54 in total

1.  R gene-controlled host specificity in the legume-rhizobia symbiosis.

Authors:  Shengming Yang; Fang Tang; Muqiang Gao; Hari B Krishnan; Hongyan Zhu
Journal:  Proc Natl Acad Sci U S A       Date:  2010-10-11       Impact factor: 11.205

2.  A soybean transcript map: gene distribution, haplotype and single-nucleotide polymorphism analysis.

Authors:  Ik-Young Choi; David L Hyten; Lakshmi K Matukumalli; Qijian Song; Julian M Chaky; Charles V Quigley; Kevin Chase; K Gordon Lark; Robert S Reiter; Mun-Sup Yoon; Eun-Young Hwang; Seung-In Yi; Nevin D Young; Randy C Shoemaker; Curtis P van Tassell; James E Specht; Perry B Cregan
Journal:  Genetics       Date:  2007-03-04       Impact factor: 4.562

3.  A graphical representation of genetic and physical maps: the Marey map.

Authors:  A Chakravarti
Journal:  Genomics       Date:  1991-09       Impact factor: 5.736

4.  Resistance to the reniform nematode in selected soybean cultivars and germplasm lines.

Authors:  R T Robbins; L Rakes
Journal:  J Nematol       Date:  1996-12       Impact factor: 1.402

5.  STAIRS: a new genetic resource for functional genomic studies of Arabidopsis.

Authors:  Rachil Koumproglou; Tim M Wilkes; Paul Townson; Xiao Y Wang; Jim Beynon; Harpal S Pooni; H John Newbury; Mike J Kearsey
Journal:  Plant J       Date:  2002-08       Impact factor: 6.417

6.  Map-based cloning of the gene associated with the soybean maturity locus E3.

Authors:  Satoshi Watanabe; Rumiko Hideshima; Zhengjun Xia; Yasutaka Tsubokura; Shusei Sato; Yumi Nakamoto; Naoki Yamanaka; Ryoji Takahashi; Masao Ishimoto; Toyoaki Anai; Satoshi Tabata; Kyuya Harada
Journal:  Genetics       Date:  2009-05-27       Impact factor: 4.562

7.  Distinct Copy Number, Coding Sequence, and Locus Methylation Patterns Underlie Rhg1-Mediated Soybean Resistance to Soybean Cyst Nematode.

Authors:  David E Cook; Adam M Bayless; Kai Wang; Xiaoli Guo; Qijian Song; Jiming Jiang; Andrew F Bent
Journal:  Plant Physiol       Date:  2014-04-14       Impact factor: 8.340

8.  Genetic variation in four maturity genes affects photoperiod insensitivity and PHYA-regulated post-flowering responses of soybean.

Authors:  Meilan Xu; Zeheng Xu; Baohui Liu; Fanjiang Kong; Yasutaka Tsubokura; Satoshi Watanabe; Zhengjun Xia; Kyuya Harada; Akira Kanazawa; Testuya Yamada; Jun Abe
Journal:  BMC Plant Biol       Date:  2013-06-25       Impact factor: 4.215

9.  GENOMEMASKER package for designing unique genomic PCR primers.

Authors:  Reidar Andreson; Eric Reppo; Lauris Kaplinski; Maido Remm
Journal:  BMC Bioinformatics       Date:  2006-03-27       Impact factor: 3.169

10.  Construction of high resolution genetic linkage maps to improve the soybean genome sequence assembly Glyma1.01.

Authors:  Qijian Song; Jerry Jenkins; Gaofeng Jia; David L Hyten; Vince Pantalone; Scott A Jackson; Jeremy Schmutz; Perry B Cregan
Journal:  BMC Genomics       Date:  2016-01-06       Impact factor: 3.969

View more
  5 in total

1.  Genetic dissection of an allotetraploid interspecific CSSLs guides interspecific genetics and breeding in cotton.

Authors:  Ximei Li; Zhiwei Wang; Chunyuan You; Xinhui Nie; Jie Sun; Xianlong Zhang; Dawei Zhang; Zhongxu Lin
Journal:  BMC Genomics       Date:  2020-06-26       Impact factor: 3.969

2.  Genetic Diversity in Stomatal Density among Soybeans Elucidated Using High-throughput Technique Based on an Algorithm for Object Detection.

Authors:  Kazuma Sakoda; Tomoya Watanabe; Shun Sukemura; Shunzo Kobayashi; Yuichi Nagasaki; Yu Tanaka; Tatsuhiko Shiraiwa
Journal:  Sci Rep       Date:  2019-05-20       Impact factor: 4.379

3.  Development of Three Sets of High-Throughput Genotyped Rice Chromosome Segment Substitution Lines and QTL Mapping for Eleven Traits.

Authors:  Bin Zhang; Lianguang Shang; Banpu Ruan; Anpeng Zhang; Shenglong Yang; Hongzhen Jiang; Chaolei Liu; Kai Hong; Hai Lin; Zhenyu Gao; Jiang Hu; Dali Zeng; Longbiao Guo; Qian Qian
Journal:  Rice (N Y)       Date:  2019-05-10       Impact factor: 4.783

Review 4.  Impacts of genomic research on soybean improvement in East Asia.

Authors:  Man-Wah Li; Zhili Wang; Bingjun Jiang; Akito Kaga; Fuk-Ling Wong; Guohong Zhang; Tianfu Han; Gyuhwa Chung; Henry Nguyen; Hon-Ming Lam
Journal:  Theor Appl Genet       Date:  2019-10-23       Impact factor: 5.699

5.  QTL Mapping of Palmitic Acid Content Using Specific-Locus Amplified Fragment Sequencing (SLAF-Seq) Genotyping in Soybeans (Glycine max L.).

Authors:  Yongguo Xue; Huawei Gao; Xinlei Liu; Xiaofei Tang; Dan Cao; Xiaoyan Luan; Lin Zhao; Lijuan Qiu
Journal:  Int J Mol Sci       Date:  2022-09-24       Impact factor: 6.208

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.