
Development of co-dominant markers linked to a hemizygous region that is related to the self-compatibility locus (S) in buckwheat (Fagopyrum esculentum).

Katsuhiro Matsui, Nobuyuki Mizuno, Mariko Ueno, Ryoma Takeshima, Yasuo Yasui.

Abstract

Common buckwheat (Fagopyrum esculentum) is a heterostylous self-incompatible (SI) species with two different flower morphologies, pin and thrum. The SI trait is controlled by a single gene complex locus, S. Self-compatible (SC) lines were developed by crossing F. esculentum and F. homotropicum; these lines have an SC gene, Sh, which is dominant over the s allele and recessive to the S allele. S-ELF3 has been identified as a candidate gene in the S locus and is present in the S and Sh alleles but not in the s allele. A single-nucleotide deletion in the S-ELF3 gene of the Sh allele results in a frame shift. To develop co-dominant markers to distinguish between ShSh and Shs plants, we performed a next-generation sequencing analysis in combination with bulked-segregant analysis. We developed four co-dominant markers linked to the S locus. We investigated the polymorphism frequency between a self-compatible line and leading Japanese buckwheat cultivars. Linkage between a developed sequence-tagged-site marker and flower morphology was confirmed using more than 1000 segregating plants and showed no recombination. The developed markers would be useful for buckwheat breeding and also for producing lines for genetic analysis such as recombinant inbred lines.
Copyright © 2020 by JAPANESE SOCIETY OF BREEDING.

Keywords:  bulked segregant analysis; heterostyly; self-incompatibility; sequence-tagged sites (STS)

Year:  2020        PMID: 32351310      PMCID: PMC7180148          DOI: 10.1270/jsbbs.19129

Source DB:  PubMed          Journal:  Breed Sci        ISSN: 1344-7610            Impact factor:   2.086


Introduction

Common buckwheat (Fagopyrum esculentum) is a heteromorphic self-incompatible (SI) plant species; it needs insects for cross-pollination between flowers with different morphologies (pin and thrum flowers; Garber and Quisenberry 1927). Buckwheat yields are influenced by insect activity, which in turn is affected by weather. Production of new buckwheat cultivars takes a long time because it requires the loci for useful agricultural traits to be homozygous and loci that would cause inbreeding depression or undesirable traits to be heterozygous. The self-incompatibility of buckwheat is controlled by a single gene complex locus, S, which is also called the S supergene: the short-style morph is a heterozygous thrum form (Ss) and the long-style morph is the recessive homozygous pin form (ss) (Sharma and Boyes 1961). Self-compatible (SC) lines have been developed by an interspecific cross between F. esculentum and F. homotropicum with embryo rescue (Aii et al., Campbell 1995, Matsui et al., Woo et al.). Although the SC lines have a potential for high and stable yields, no such lines are practically available because of inbreeding depression and undesirable traits caused by the homozygous state of some loci. To overcome these problems, crosses between SC lines and leading cultivars would be needed. The flower morphology of the SC lines, long homostyle (LH), is controlled by a single gene (Campbell 1995, Woo et al.). The allele controlling the homomorphic flower type was designated Sh and the dominance relationship was found to be S > Sh > s (Woo et al.). Sharma and Boyes (1961) postulated that the buckwheat S locus contains several genes and named it the S supergene, similar to that proposed for Primula (Dowrick 1956).
The S supergene consists of at least five linked genes: I^s and I^p govern the incompatibility reaction in the style and pollen, respectively, G controls style length, P controls pollen size, and A controls anther height (Dowrick 1956, Lewis and Jones 1992, Sharma and Boyes 1961). Each of these genes is di-allelic, and the five alleles are all recessive in pin plants and all dominant in thrum plants. These genes have not been identified but at least two loci are thought to exist (Matsui et al.). S-LOCUS EARLY FLOWERING 3 (S-ELF3), which may be a transcription factor, has been identified as a candidate gene for the I^s or G gene; it is expressed only in the style of thrum plants and is linked to the S locus with no recombination in buckwheat (Yasui et al.). Using genome assembly analysis with next-generation sequencing (NGS), we have developed a buckwheat genome database (BGDB) and also found that the region that includes this locus is absent in the genome of pin plants (Yasui et al.). The Sh allele has a deletion of a nucleotide in the S-ELF3 locus, which causes a frame shift. To obtain self-fertilized F2 seeds from an F1 plant, the styles of pin plants (ss) need to be pollinated with pollen of SC plants (ShSh) because of the dominance relation, Sh > s, and to avoid production of seeds by self-pollination. In such a cross, the genotype of F1 plants would be Shs. In the F2 generation, flower morphology and SI/SC would segregate with a monogenic segregation ratio of 3 SC (long homostyle) to 1 SI (pin). Plants with the long homostyle would be either homozygous (ShSh) or heterozygous (Shs). A method to distinguish between the two genotypes is desirable to select SC homozygous plants in the F2 generation or to choose SC/SI segregating lines in each generation for recurrent selection. DNA markers to distinguish the ShSh and Shs genotypes would be powerful tools for this purpose.
Because the region of the S-ELF3 locus is missing in the genome of s plants (Yasui et al.), we needed to detect the flanking regions of the Sh allele that would also be present in the s genome. In this study, we used NGS analysis in combination with bulked segregant analysis of segregating progeny of a cross between SI and SC plants. We developed co-dominant markers that can distinguish between homozygous (ShSh) and heterozygous (Shs) plants for marker-assisted selection.
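The expected F2 segregation described above can be illustrated with a small enumeration. This is only a sketch, assuming simple Mendelian inheritance of a single locus with Sh dominant over s; the function names are hypothetical and are not part of the study.

```python
from collections import Counter
from itertools import product

def cross(parent1, parent2):
    """Cross two diploid genotypes (2-tuples of alleles) and count offspring genotypes."""
    counts = Counter()
    for a, b in product(parent1, parent2):
        # Sort the allele pair so ("s", "Sh") and ("Sh", "s") are the same genotype.
        counts[tuple(sorted((a, b)))] += 1
    return counts

def phenotype(genotype):
    # Sh is dominant over s: any genotype carrying Sh is long homostyle (SC).
    return "LH" if "Sh" in genotype else "pin"

f1 = ("Sh", "s")        # F1 from the cross ss (pin) x ShSh (SC)
f2 = cross(f1, f1)      # self-fertilization of the F1

pheno = Counter()
for genotype, n in f2.items():
    pheno[phenotype(genotype)] += n

print(dict(f2))     # genotype classes ShSh : Shs : ss
print(dict(pheno))  # phenotype classes LH : pin
```

The genotype classes come out in a 1:2:1 ratio and the phenotypes in a 3:1 ratio, which is why flower morphology alone cannot separate ShSh from Shs plants and a co-dominant marker is needed.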

Materials and Methods

Plant materials

To detect the s and Sh regions, the segregating line 16Aseg03 (Hara et al.) was used; it consists of 142 F2 plants derived from a cross between a pin plant of the breeding line ‘Kyukei 29’ (KY29, ss) and an SC long-homostyle plant, ‘Kyukei SC7’ (KSC7, ShSh). To develop co-dominant markers linked to the Sh allele, we used two different segregating lines, 16Aseg04 (Hara et al.) and 16AsegA. Because buckwheat is a cross-pollinating species, each plant, even within the same cultivar, contains many heterozygous loci and has some non-fixed traits. The 16Aseg04 line was produced by crossing a pin plant of KY29 (ss) (different from the plant used to produce 16Aseg03) and KSC7 (ShSh). The 16AsegA line was developed by crossing a pin plant of a green stem mutant line (GSML; Matsui et al., 2018) and an SC long-homostyle plant, 13AL130-4, which was developed using the SC line ‘Norin-PL1’ (Matsui et al.). The 16AsegA line was developed to confirm whether a marker was valid for lines derived from a different SC line with the Sh allele. These lines were grown in a glasshouse at the NARO, Tsukuba, Japan. To verify whether a candidate region could be used to develop co-dominant markers, six 16AsegA plants were used for PCR amplification. To identify markers tightly linked to the S locus, we developed an F2 segregating line, 17KySeg01 (1009 plants), derived from a cross between a pin plant of ‘Sachiizumi’ (Matsui et al.) and KSC7. The 17KySeg01 line was grown in a glasshouse at Kyoto University, Kyoto, Japan. To determine polymorphism rates between leading cultivars in Japan and KSC7, we used five cultivars, ‘Kitawasesoba’, ‘Hashikamiwase’, ‘Harunoibuki’ (Hara et al.), ‘NARO-Fe1’, and ‘Sachiizumi’, and two breeding lines, KY28 and KY29. These plants were grown in a field at the Institute of Crop Science NARO, Tsukuba, Ibaraki, Japan.

DNA isolation and preparation of bulked DNA for NGS analysis

Total DNA was isolated from leaves of each plant from all segregating populations with a DNeasy Plant Mini Kit (Qiagen, Hilden, Germany). Two sets of bulked DNA were made by mixing DNA of randomly selected plants of the 16ASeg03 line: 28 pin plants (PIN bulk) or 30 long-homostyle plants (LH bulk). The LH bulk was expected to contain two alleles (Sh and s) and the PIN bulk only the recessive allele (s), which would make it possible to detect the flanking regions of the Sh allele by NGS analysis. Genomic DNA was also extracted from the parental plants of the 16ASeg03 line and from the five Japanese cultivars and two breeding lines mentioned above.

Next-generation sequencing analysis

Paired-end reads of 100 bp from the two bulks and the KSC7 and KY29 plants were obtained on an Illumina HiSeq 2000 System at Macrogen Japan (Kyoto, Japan). The raw reads are available from the DDBJ/EMBL/NCBI under the accession numbers DRX178921-DRX178924. Low-quality reads and adaptors (CACGACGCTCTTCCGATCT and ACCGCTCTTCCGATCTGTAA) were trimmed using Trimmomatic-0.32 (Bolger et al.) with the following settings: HEADCROP, 2; SLIDINGWINDOW, 4:25; LEADING, 25; TRAILING, 25; MINLEN, 50. Trimmed reads were mapped to the reference sequences using BWA 0.7.15 (Li and Durbin 2009), running ‘bwa aln’ with the options -l 32 -k 2 -n 5 and ‘bwa sampe’ with default settings. Only genome sequences (Yasui et al.) of ≥1 kb were selected as reference sequences. Mapping results were processed with SAMtools 0.1.18 (Li et al.). SNPs were detected using UnifiedGenotyper in GATK 3.7 (DePristo et al.) with the -glm BOTH option. The S-linked SNPs were detected using the following criteria: 1) homozygous in KSC7, and the nucleotide differs from that in the reference sequence because the reference sequence in BGDB was developed using a thrum plant; 2) homozygous in KY29, and the nucleotide is the same as in the reference sequence; 3) heterozygous in the LH bulk; 4) homozygous in the PIN bulk, and the nucleotide is the same as in the reference sequence (Fig. 1). The reference sequences in BGDB were short (N50 = 25.1 kb; Yasui et al.), which makes it difficult to plot how the number of S-linked SNP sites changes along scaffolds. Hence, we counted the S-linked SNPs in all reference sequences and, for each scaffold, calculated the ratio of the number of S-linked SNPs to the number of all SNPs, which we named the S-linked SNP index.
Fig. 1.

Scheme of screening for Sh allele-linked SNPs. Hatched and solid bars indicate genome sequences harboring the Sh and s alleles, respectively. Sequences were screened for SNPs between KSC7 and the reference sequence (category 1; Yasui et al.) and between KSC7 and KY29 (category 2). Heterozygous sites in the LH bulk (category 3) and homozygous sites in the PIN bulk (category 4) were treated as S-linked SNPs.
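The four screening criteria and the per-scaffold S-linked SNP index can be sketched in a few lines. This is a minimal illustration, not the pipeline used in the study: the record layout (one dict per SNP, with each sample's call given as a tuple of alleles), the scaffold names, and the toy data are all assumptions made for the example.

```python
from collections import defaultdict

def is_homozygous(call):
    """A call is a tuple of observed alleles; homozygous means a single allele."""
    return len(set(call)) == 1

def is_s_linked(snp):
    """Apply the four screening criteria from the text to one SNP record."""
    ref = snp["ref"]
    ksc7, ky29, lh, pin = snp["KSC7"], snp["KY29"], snp["LH"], snp["PIN"]
    return (
        is_homozygous(ksc7) and ksc7[0] != ref       # 1) KSC7 homozygous, differs from reference
        and is_homozygous(ky29) and ky29[0] == ref   # 2) KY29 homozygous, same as reference
        and not is_homozygous(lh)                    # 3) LH bulk heterozygous
        and is_homozygous(pin) and pin[0] == ref     # 4) PIN bulk homozygous, same as reference
    )

def s_linked_snp_index(snps):
    """Return {scaffold: number of S-linked SNPs / number of all SNPs}."""
    total = defaultdict(int)
    linked = defaultdict(int)
    for snp in snps:
        total[snp["scaffold"]] += 1
        if is_s_linked(snp):
            linked[snp["scaffold"]] += 1
    return {scaffold: linked[scaffold] / total[scaffold] for scaffold in total}

# Hypothetical toy records (scaffold names and alleles are invented).
snps = [
    {"scaffold": "scaf_A", "ref": "A", "KSC7": ("G", "G"), "KY29": ("A", "A"), "LH": ("A", "G"), "PIN": ("A", "A")},
    {"scaffold": "scaf_A", "ref": "C", "KSC7": ("T", "T"), "KY29": ("C", "C"), "LH": ("C", "T"), "PIN": ("C", "C")},
    {"scaffold": "scaf_A", "ref": "G", "KSC7": ("G", "G"), "KY29": ("G", "T"), "LH": ("G", "T"), "PIN": ("G", "T")},
    {"scaffold": "scaf_B", "ref": "T", "KSC7": ("C", "C"), "KY29": ("C", "C"), "LH": ("C", "C"), "PIN": ("C", "C")},
    {"scaffold": "scaf_B", "ref": "A", "KSC7": ("A", "G"), "KY29": ("A", "A"), "LH": ("A", "G"), "PIN": ("A", "A")},
]

index = s_linked_snp_index(snps)
candidates = [scaffold for scaffold, value in index.items() if value > 0.5]
print(index, candidates)
```

In this toy data, two of the three SNPs on scaf_A satisfy all four criteria, so only scaf_A exceeds the 0.5 threshold applied to the index in the Results.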

Development of sequence-tagged-site markers from NGS data

Sequences of ca. 1000 bp were randomly selected in each of the 50 candidate regions (Supplemental Table 1) and primers were designed with Primer3 (http://bioinfo.ut.ee/primer3-0.4.0/) to satisfy the following conditions: product length, ca. 600–650 bp; primer length, 22–26 bp; Tm, 55–64°C. Amplification with genomic DNA as a template was performed with the designed specific primers as follows: 30 cycles at 94°C for 30 s, 60°C for 30 s, and 72°C for 60 s. Amplification was confirmed by agarose gel electrophoresis, and the DNA fragments were digested with three randomly chosen restriction enzymes with four-base recognition sequences (AluI, HaeIII, and MspI). To find highly versatile markers, co-dominant markers were developed using six plants from the 16ASegA segregating population.
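The idea behind screening amplicons with four-base cutters can be sketched as an in-silico check: a SNP that creates or destroys a recognition site changes the digestion pattern. The snippet below is only an illustration; the two allele sequences are invented, the helper names are hypothetical, and comparing site counts is a simplification of comparing fragment patterns on a gel. The recognition sequences are those of AluI (AGCT), HaeIII (GGCC), and MspI (CCGG).

```python
# Recognition sequences of the three four-base cutters named in the text.
# All three sites are palindromic, so scanning one strand is sufficient.
ENZYMES = {"AluI": "AGCT", "HaeIII": "GGCC", "MspI": "CCGG"}

def find_sites(seq, site):
    """Return the 0-based start positions of every occurrence of `site` in `seq`."""
    seq = seq.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions

def differing_enzymes(allele_a, allele_b):
    """List enzymes whose number of recognition sites differs between two alleles."""
    return [
        name
        for name, site in ENZYMES.items()
        if len(find_sites(allele_a, site)) != len(find_sites(allele_b, site))
    ]

# Hypothetical toy alleles: a single C/T SNP destroys the AluI site in the second one.
allele_sh = "ATGGAGCTTACCGGTA"
allele_s = "ATGGAGTTTACCGGTA"

print(differing_enzymes(allele_sh, allele_s))
```

With a marker of this kind, a homozygote for either allele shows one digestion pattern and a heterozygote shows both, which is what makes the marker co-dominant.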

Confirmation of linkage relations

To check whether the markers are linked to the S locus and are useful in different populations, we examined linkage in the 16ASeg03 and 16ASeg04 populations. Using 50 plants from each population, we amplified each marker with the corresponding primers and digested the PCR products with the corresponding restriction enzyme. A marker that appeared useful in this analysis was examined further in 17KySeg01, an F2 segregating line (1009 plants), with the corresponding specific primers and enzyme. The genotype of the S locus was determined based on the flower morphology. Electrophoresis was performed in a capillary electrophoresis system (LabChip GX, PerkinElmer). A DNA5K/RNA/CZE chip was used with a HT DNA5K Reagent Kit (PerkinElmer).

Results

Identification of flanking regions of the Sh allele and development of co-dominant markers

By comparing the sequences between the LH and PIN DNA bulks, we obtained 263 scaffolds with high values of the S-linked SNP index (>0.5; Supplemental Table 1). Because it takes time to design markers at the detected SNPs, and actual buckwheat breeding does not require many markers for the selection of a trait, we chose the top 50 scaffolds and made primers to amplify each region (Supplemental Table 1). Amplification in six 16ASegA plants showed that 16 primer sets resulted in a weak band, more than two bands, or no amplification in some plants (Supplemental Table 1). The 34 amplicons that showed good amplification were digested with AluI, HaeIII, and MspI. Four primer sets showed polymorphisms consistent with flower morphology (Fig. 2, Supplemental Table 1), and the primer sets with the restriction enzymes were named S-linked sequence-tagged-site (STS) markers (STS-0012458, STS-0007596, STS-0007304, and STS-0006886; Fig. 2, Supplemental Table 1). Polymorphism was detected by AluI digestion in STS-0012458 and STS-0007596, and by HaeIII digestion in STS-0006886. Sizes of the undigested fragments amplified in STS-0007304 differed between pin and LH plants, but the band intensity and stability were low (Supplemental Table 1).
Fig. 2.

PCR products from F2 pin and long-homostyle (LH) plants (16ASeg3). PCR products were digested with AluI or HaeIII. Undigested amplification products of STS-0007304 differed between pin and LH plants. The genotypes of the S locus, ShSh and Shs, are shown based on the genotype of each marker.

Confirmation of linkage using several segregating lines

To check whether these markers are linked to the S locus and are useful in different segregating populations, we performed linkage analysis with 50 randomly selected plants from each of the 16ASeg3 and 16ASeg4 populations. Segregation of the four markers matched flower morphology with no recombination (Table 1). STS-0012458 was polymorphic in all populations that we used and was further investigated using the segregating line 17KySeg01 (1009 plants). Flower morphology segregated as 745 LH and 264 pin, fitting a segregation ratio of 3:1 (Table 2; χ2 = 0.7298, 0.40 < P < 0.50); STS-0012458 segregated as AA:AB:BB = 240:505:264, fitting a segregation ratio of 1:2:1 (χ2 = 1.1427, 0.40 < P < 0.50).
Table 1.

Linkage relation between flower morphology and developed markers evaluated with 50 randomly selected plants from each segregating line

Segregating line    Long homostyle      Pin                 χ2 value, P
                    A(a)   H     B      A     H     B       Flower morphology (3:1)    Marker (1:2:1)
Seg3(b)             15     23    0      0     0     12      0.027, 0.80 < P < 0.90     0.680, 0.70 < P < 0.80
Seg4                14     26    0      0     0     10      0.667, 0.40 < P < 0.50     0.720, 0.60 < P < 0.70

(a) A, H and B are the genotypes of STS markers. All STS markers (0012458, 0007596, 0007304, and 0006886) showed the same segregation pattern in each segregating line.

(b) Seg3, 16ASeg3 line; Seg4, 16ASeg4 line.

Table 2.

Linkage relation between flower morphology and a developed marker, STS-0012458

Long homostyle        Pin                   χ2 value, P
A(a)   H      B       A     H     B         Flower morphology (3:1)     Marker (1:2:1)
240    505    0       0     0     264       0.7298, 0.40 < P < 0.50     1.1427, 0.40 < P < 0.50

(a) A, H and B are the genotypes of STS markers.
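The goodness-of-fit statistics for the 17KySeg01 line can be recomputed from the reported counts. The following is a minimal sketch of the Pearson χ2 calculation against a Mendelian ratio; the function name is hypothetical and only the test statistic is computed.

```python
def chi_square(observed, ratio):
    """Pearson chi-square statistic for observed counts against an expected ratio."""
    total = sum(observed)
    ratio_sum = sum(ratio)
    expected = [total * r / ratio_sum for r in ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Counts reported for the 17KySeg01 line (1009 plants).
flower = chi_square([745, 264], [3, 1])          # LH : pin against 3:1
marker = chi_square([240, 505, 264], [1, 2, 1])  # AA : AB : BB against 1:2:1

print(round(flower, 4), round(marker, 4))  # 0.7298 1.1427
```

The flower-morphology test has one degree of freedom (two classes) and the marker test has two (three classes).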

Polymorphism between leading cultivars in Japan and KSC7

For a marker to be useful in breeding, it must show a high rate of polymorphism between parental lines. Among the four primer sets, STS-0012458, STS-0007304, and STS-0006886 showed a high frequency of polymorphism between KSC7 and pin plants of leading Japanese cultivars and breeding lines (Table 3), indicating that these three markers would be useful for introducing the Sh allele into many varieties.
Table 3.

Polymorphism frequency between a self-compatible line, KSC7, and Japanese cultivars and breeding lines

Counts for each cultivar or line and for the total are given as +/H/–.

Marker         Enzyme    KSC7    KTW      HKW      HAR      NF1      SAC      K28      K29      Total by the band pattern    Total number of plants    Percentage of polymorphism relative to KSC7
STS-0012458    AluI      +       0/0/6    0/0/6    0/0/6    0/0/4    0/0/6    0/0/6    0/0/4    0/0/38                       38                        100.0
STS-0007596    MspI      –       0/0/6    0/0/6    6/0/0    3/0/1    5/0/0    0/0/6    0/0/4    14/0/23                      37                        37.8
STS-0007304                      0/0/6    0/0/4    0/0/3    0/0/4    0/0/4    0/0/5    0/0/4    0/0/30                       30                        100.0
STS-0006886    HaeIII    –       6/0/0    6/0/0    3/0/0    4/0/0    4/0/0    6/0/0    6/0/0    35/0/0                       35                        100.0

Only pin plants of each variety were used. KTW, Kitawasesoba; HKW, Hashikamiwase; HAR, Harunoibuki; NF1, NARO-Fe1; SAC, Sachiizumi; K28, Kyukei 28; K29, Kyukei 29. +, digested; –, not digested; H, heterozygous.

Discussion

Marker-assisted selection can improve the efficiency and accuracy of conventional plant breeding in many crops. Co-dominant markers are particularly powerful for selecting desirable traits without missing heterozygous plants because they can distinguish between homozygous and heterozygous plants. Co-dominant DNA markers can usually be developed on the basis of the sequence of a gene controlling the targeted trait or of tightly linked regions. Because the region of the S locus of buckwheat is hemizygous (S-ELF3 in the S locus is present only on a chromosome carrying the S or Sh allele; Yasui et al., 2016), the S-ELF3 sequence cannot be used for developing co-dominant markers. Furthermore, the exact length of the deletion in the s allele is still unknown because the scaffold that includes S-ELF3 is short. Bulked segregant analysis is a powerful tool to identify the flanking regions of a target gene. Several markers linked to genes controlling agricultural traits in buckwheat have been developed with this technique (Aii et al., Matsui et al.). Aii et al. developed a co-dominant marker linked to the Sh allele on the basis of a random amplified polymorphic DNA (RAPD) marker. However, because of the large distance between this marker and the S locus (ca. 6 cM), the genotype of some plants determined with the marker did not match their SC/SI morphology. The ability of RAPD or similar markers such as amplified fragment length polymorphism (AFLP) markers to reveal polymorphism depends on the primer sequences. In this study, we sequenced the whole genome using bulked DNA. By setting screening criteria (see Materials and Methods), we could determine the regions linked to the hemizygous region. We developed tightly linked markers, and the linkage distance of one of them was less than 0.1% because no recombination was detected in the segregating line 17KySeg01 (1009 plants). This marker would be very useful for buckwheat breeding. Among multiple candidate regions, we used the top 50 only.
If we knew the candidate regions tightly linked to the S locus, we could develop more co-dominant markers more efficiently. Recently, the NGS-based target re-sequencing AmpliSeq technology (Thermo Fisher Scientific, Waltham, MA, USA) has been used to sequence plant DNA (Ogiso-Tanaka et al., Stevanato et al.). Such methods would also increase the efficiency of co-dominant marker development. Identification of the region deleted in the s allele would allow us to accurately develop co-dominant markers by designing primers to cover the deleted region. Unfortunately, the tightly linked marker STS-0012458 that we developed is not on the same scaffold as S-ELF3. The BGDB consists of more than 300,000 scaffolds, so the missing region of the s allele remains unidentified. Upgrading and improving BGDB would help to solve the issue. All markers except STS-0007596 showed a high rate of polymorphism between the SC line KSC7 and cultivars and breeding lines in Japan, probably because KSC7 contains genomic regions derived from F. homotropicum. KSC7 was developed using ‘Norin-PL1’ (Matsui et al.), which was generated by a cross between F. esculentum and F. homotropicum (Matsui et al.). The low rate of polymorphism of STS-0007596 may be because it was developed on a genomic region derived from F. esculentum, or on a region conserved between F. esculentum and F. homotropicum. Both SC lines, ‘Norin-PL1’ and KSC7, have inferior traits that make them unsuitable for cultivation by farmers. Identification of the genomic regions of F. homotropicum in ‘Norin-PL1’ may reveal regions related to these inferior traits, such as preharvest sprouting (Hara et al.). SC lines are beneficial for buckwheat cultivation and breeding because they do not need pollinators, have stable yields, and allow useful agronomical traits to be fixed easily.
Because some traits, including beneficial ones, are hidden by heterozygosity in normal cultivated and indigenous natural populations, SC lines can be used to reveal these traits easily by making the underlying loci homozygous. Furthermore, SC lines can be easily used to produce segregating and analytical lines such as F2 populations, recombinant inbred lines, near-isogenic lines, and mutated populations. With the development of sequencing technology, it is now possible to obtain varieties with new desirable traits, such as absence of allergens and glutinous starch, from artificially mutated or unique natural populations based on DNA sequences. To breed a high-yielding variety adapted to a particular cultivation area, a line with desirable traits would need to be crossed with a leading local cultivar. Co-dominant markers such as those developed in this study would be useful for distinguishing between SI and SC lines to avoid the inbreeding depression often found in the latter.

Author Contribution Statement

KM and YY conceived of the study and designed the experiments. YY, MN and UM performed the NGS analysis. KM developed co-dominant markers. KM, YY, MN, UM and TR performed linkage analysis. KM and TR investigated the frequency of polymorphisms. KM and YY wrote the manuscript. MN, UM and TR edited and revised the manuscript.

1.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

2.  Identification of AFLP makers linked to non-seed shattering locus (sht1) in buckwheat and conversion to STS markers for marker-assisted selection.

Authors:  K Matsui; Y Kiryu; T Komatsuda; N Kurauchi; T Ohtani; T Tetsuka
Journal:  Genome       Date:  2004-06       Impact factor: 2.166

3.  Identification of a gene encoding glutathione S-transferase that is related to anthocyanin accumulation in buckwheat (Fagopyrum esculentum).

Authors:  Katsuhiro Matsui; Takuya Tomatsu; Saeka Kinouchi; Tatsuro Suzuki; Takeshi Sato
Journal:  J Plant Physiol       Date:  2018-10-06       Impact factor: 3.549

4.  A framework for variation discovery and genotyping using next-generation DNA sequencing data.

Authors:  Mark A DePristo; Eric Banks; Ryan Poplin; Kiran V Garimella; Jared R Maguire; Christopher Hartl; Anthony A Philippakis; Guillermo del Angel; Manuel A Rivas; Matt Hanna; Aaron McKenna; Tim J Fennell; Andrew M Kernytsky; Andrey Y Sivachenko; Kristian Cibulskis; Stacey B Gabriel; David Altshuler; Mark J Daly
Journal:  Nat Genet       Date:  2011-04-10       Impact factor: 38.330

5.  S-LOCUS EARLY FLOWERING 3 is exclusively present in the genomes of short-styled buckwheat plants that exhibit heteromorphic self-incompatibility.

Authors:  Yasuo Yasui; Masashi Mori; Jotaro Aii; Tomoko Abe; Daiki Matsumoto; Shingo Sato; Yoriko Hayashi; Ohmi Ohnishi; Tatsuya Ota
Journal:  PLoS One       Date:  2012-02-01       Impact factor: 3.240

6.  Highly multiplexed AmpliSeq technology identifies novel variation of flowering time-related genes in soybean (Glycine max).

Authors:  Eri Ogiso-Tanaka; Takehiko Shimizu; Makita Hajika; Akito Kaga; Masao Ishimoto
Journal:  DNA Res       Date:  2019-06-01       Impact factor: 4.458

7.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

8.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

9.  Assembly of the draft genome of buckwheat and its applications in identifying agronomically useful genes.

Authors:  Yasuo Yasui; Hideki Hirakawa; Mariko Ueno; Katsuhiro Matsui; Tomoyuki Katsube-Tanaka; Soo Jung Yang; Jotaro Aii; Shingo Sato; Masashi Mori
Journal:  DNA Res       Date:  2016-04-02       Impact factor: 4.458

10.  Targeted Next-Generation Sequencing Identification of Mutations in Disease Resistance Gene Analogs (RGAs) in Wild and Cultivated Beets.

Authors:  Piergiorgio Stevanato; Chiara Broccanello; Luca Pajola; Filippo Biscarini; Chris Richards; Lee Panella; Mahdi Hassani; Elide Formentin; Claudia Chiodi; Giuseppe Concheri; Bahram Heidari
Journal:  Genes (Basel)       Date:  2017-10-11       Impact factor: 4.096

