Literature DB >> 26565976

An Efficient Approach for the Development of Locus Specific Primers in Bread Wheat (Triticum aestivum L.) and Its Application to Re-Sequencing of Genes Involved in Frost Tolerance.

Steve Babben1, Dragan Perovic1, Michael Koch2, Frank Ordon1.   

Abstract

Recent declines in costs accelerated sequencing of many species with large genomes, including hexaploid wheat (Triticum aestivum L.). Although the draft sequence of bread wheat is known, it is still one of the major challenges to developlocus specific primers suitable to be used in marker assisted selection procedures, due to the high homology of the three genomes. In this study we describe an efficient approach for the development of locus specific primers comprising four steps, i.e. (i) identification of genomic and coding sequences (CDS) of candidate genes, (ii) intron- and exon-structure reconstruction, (iii) identification of wheat A, B and D sub-genome sequences and primer development based on sequence differences between the three sub-genomes, and (iv); testing of primers for functionality, correct size and localisation. This approach was applied to single, low and high copy genes involved in frost tolerance in wheat. In summary for 27 of these genes for which sequences were derived from Triticum aestivum, Triticum monococcum and Hordeum vulgare, a set of 119 primer pairs was developed and after testing on Nulli-tetrasomic (NT) lines, a set of 65 primer pairs (54.6%), corresponding to 19 candidate genes, turned out to be specific. Out of these a set of 35 fragments was selected for validation via Sanger's amplicon re-sequencing. All fragments, with the exception of one, could be assigned to the original reference sequence. The approach presented here showed a much higher specificity in primer development in comparison to techniques used so far in bread wheat and can be applied to other polyploid species with a known draft sequence.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26565976      PMCID: PMC4643983          DOI: 10.1371/journal.pone.0142746

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Genomic resources in wheat

Wheat (Triticum aestivum L.) is the cereal with the largest acreage worldwide [1]. It belongs to the family Poaceae and has a complex allohexaploid genome of about 17 Giga-base pairs (Gbp). The repeat content is approximately 80% which consists primarily of retroelements. The gene density is between 1 per 87 Kilo-base pairs (Kbp) and 1 per 184 Kbp [2, 3]. During evolution wheat became an alohexaploid organism (2n = 6x = 42) with the A, B and D genome. In brief, 300.000–500.000 years ago the first hybridisation between the wild diploid wheat (Triticum urartu, 2n = 2x = 14, genome AuAu) and an ancestor closest related to goat grass (Aegilops speltoides, 2n = 2x = 14, genome SS) took place [4, 5] leading to the generation of wild emmer wheat (Triticum dicoccoides, 2n = 4x = 28, genome AuAuBB) [6]. Tribal communities formerly making a living of gathering and hunting began to cultivate the wild emmer about 10,000 years ago. Human selection led to cultivated emmer (Triticum dicoccum). By a spontaneous hybridisation of cultivated emmer with another goat grass (Aegilops tauschii 2n = 2x = 14, genome DD) in combination with a natural mutation, bread wheat (Triticum aestivum, 2n = 6x = 42, genome AABBDD) was created [7]. Due to the hexaploid genome and a very high homology of the three sub-genomes in wheat, the genome sequence information has an inestimable value for molecular breeding, comparative genomics and association studies. Nowadays, the National Center for Biotechnology Information (NCBI, http://www.ncbi.nlm.nih.gov/) database is a key virtual library of genomic, transcriptional and protein sequence data for more than 33,000 organisms [8]. NCBI serves as a web-platform for the identification of target gene sequences in organisms of interest, e.g. Triticum aestivum, Triticum monococcum, Hordeum vulgare etc. An additional wheat database is the CerealsDB web page created by members of the Functional Genomics Group at the University of Bristol (http://www.cerealsdb.uk.net), which includes online resources of genomic information, i.e. varietal SNPs, DArT markers, and EST sequences all linked to a draft genome sequence of the cultivar Chinese Spring [9]. Another web based portal is URGI, which includes datasets such as chromosome survey sequences, reference sequences, physical maps, genetic maps, polymorphisms, genetic resources, many phenotypic data and various genomic arrays (http://wheat-urgi.versailles.inra.fr). The chromosomal sequence information is granted by the International Wheat Genome Sequencing Consortium (IWGSC). All mentioned databases are suitable for the identification of homologous chromosome sequences in bread wheat. In addition to these resources, an important tool for wheat is the upcoming Genome Zipper of wheat (http://wheat-urgi.versailles.inra.fr). In the past few years, a lot of sequence information of wheatsorted chromosome arms [10-12], T. urartu [13] and Ae. tauschii [14] became available and was integrated in the above mentioned databases.

Function and structure of frost tolerance genes

Low temperature is one of the most important limiting factors of wheat cultivation in North America and Eastern Europe. To ensure high yields in these areas, introduction of efficient frost tolerance alleles into elite cultivars is a prerequisite. Cold stress inhibits metabolic reactions and prevents wheat from fulfilling its genetic potential. To avoid yield losses, wheat needs acclimatisation to low temperatures, which prevents premature transition to the reproductive phase. This must happen before the threat of freezing stress during winter has passed [15]. Frost tolerance is a complex system involving many genes out of which six gene families/groups have been analysed in this study. According to their function, these genes belong to two separated metabolic pathways. The Ppd and Vrn genes are responsible for flowering, whereas the Cbfs, Ices, Tacr7, Dem, Cab and Dhn genes are involved directly in frost tolerance. Regarding copy number, the analysed genes could be assigned as follows: Dem and Tacr7 are single copy; Ppd, Vrn and Ice are low copy, while Cbfs, Cab and Dhn are high copy genes. A high number of low temperature-induced genes was identified and characterized in plants [16, 17]. These are referred to as LATE EMBRYOGENESIS-ABUNDANT (LEA), Dehydrin (Dhn), Responsive to Abscisic Acid (RAB), Low Temperature–Responsive (LT) and Cold-Responsive (COR) genes. Several of the COR genes are dehydrins, which are a distinct biochemical group of LEA proteins [18-20] for which 54 different unigenes are described, of which 23 are involved in frost tolerance [21]. Dehydrins have either one but mostly two exons [22]. Cab genes or CAM-like (CML) genes, encoding proteins composed mostly of EF-hand Ca2+-binding motifs, may contain one to six exons [23]. Cbf genes are very important in the induction of COR genes through binding of C-repeat/dehydration-responsive elements (CRT/DRE) [15]. The complex Cbf gene family consists of 27 paralogs with 1–3 homologous copies per sub-genome. In total, the family contains at least 65 Cbf gene family members [24]. Knox et al. [25] detected that approximately half of the eleven Cbf orthologues at the FR-H2 locus in barley are duplicated. In addition, they reported that the variation in Cbf genes, which do not carry any introns, is widespread in the Triticeae [26]. This gene family is regulated by two wheat specific Ice genes under cold conditions [27, 28]. Both Ice genes have four exons [29, 30]. Tacr7 belongs to the group of LT genes [31]. The Dem genes have an important role in the development of apical meristems and are thereby involved in the vegetative/reproductive transition of the shoot apex [27]. Flowering genes may be involved in frost the tolerance pathway because the flowering pathway contains vernalization and photoperiod response genes at crucial positions [32]. This pathway is regulated by five major Vrn genes (Vrn1, Vrn2, Vrn3, Vrn4 and Vrn5) and two Ppd genes (Ppd1 and Ppd2) [33]. The gene structure of the five vernalization genes varies from Vrn1 having eight exons [34], via Vrn3 with three exons [35] and Vrn2 with two exons [36] to Vrn4 and Vrn5 of which the structure is unknown. The Ppd1 gene shows eight exons [37], while the structure of Ppd2 is unknown. The interaction between the flowering and the frost tolerance pathway is based on Vrn1 and Cbf genes. The Vrn1 gene may reduce transcript levels of Cbfs and COR genes under long day conditions.

The draft wheat sequence and development of genomic markers

Nowadays, molecular markers, i.e. marker-assisted selection (MAS), are basic tools in plant breeding during germplasm characterization and cost efficient selection of important traits/genes. Furthermore, after gene isolation re-sequencing of specific fragments allows efficient allele mining [38]. However, the development of gene specific primers in wheat is hampered by the large genome size of 17 Gbp, the high repeat content of about 80% [2, 3], by the close homology of the three genomes (A, B and D) and by the high rate of similarity within genes and gene family members [10]. Comparative analysis of wheat sub-genomes shows high sequence homology and structural conservation and no significant differences in the rate of duplications between the sub-genomes are observed [11]. Recent efforts of the scientific community and the IWGS in sequencing of the 3 donor genomes as well as of the hexaploid wheat offer a solution in deciphering the intron-exon-structure of genes. By using differences of intron sequences among the homologous and paralogous copies of the various genes, it is possible to reconstruct the gene structure and identify differences between homologues. Continuous improvements of BLAST algorithms enhance the use of the above mentioned wheat genomic resources facilitating efficient primer development.Furthermore, specific primers are the basis for the development of molecular marker assays based on SNPsi.e. cleaved amplified polymorphic sequence (CAPS) [39], pyrosequencing [40] or competitive allele-specific polymerase chain reaction (KASP) [41], which are the base for marker assisted selection (MAS) procedures, anchoring physical and sequence contigs [12], germplasm characterization [42].

Material and Methods

Plant material and DNA extraction

In this study three cultivars (`Chinese Spring`, `Moskovskaya 39`and `VAKKA`) were used in initial testing of designed primer pairs, while a set of 24 genotypes, comprising two spring and 22 winter wheat cultivars, was used for re-sequencing of amplicons of frost tolerance genes (Table 1). For the physical assignment to chromosomes and chromosome segments 21 NT-lines [43] and 46 deletion-lines [44] were used (S1 Table) having the genetic background of ‘Chinese Spring’. The DNA was extracted at the three leaf stage according to Stein et al. [45].
Table 1

Plant material for PCR amplification and re-sequencing.

No.GenotypeCountryVariety
1Chinese Spring* spring
2ZentosGermanywinter
3SimilaCzech Republicwinter
4RoughriderUSAwinter
5NorstarUSAwinter
6Moskovskaya 39* Russiawinter
7Bezenchukskaja 380Russiawinter
8CheyenneUSAwinter
9ÄRING IISwedenwinter
10VAKKA* Finlandwinter
11Bezostaja 1Russiawinter
12Capelle DesprezFrancewinter
13CenturkUSAwinter
14Mironovska 808Ukrainewinter
15PobedaSerbiawinter
16RenesansaSerbiawinter
17SavaSerbiawinter
18Triple Dirk B (GK 775)Australiawinter
19Triple Dirk SAustraliaspring
20ISENGRAINFrancewinter
21APACHEFrancewinter
22SKAGENGermanywinter
23JULIUSGermanywinter
24BiryuzaRussiawinter
25Moskovskaya 40Russiawinter

Complete set of 24 genotypes (without `Chinese Spring`) were used for sequencing.

* Genotypes for primer testing

Complete set of 24 genotypes (without `Chinese Spring`) were used for sequencing. * Genotypes for primer testing

Sequence retrieval of genes involved in frost tolerance

As a starting point a set of 27 genes involved in frost tolerance was selected. 9 Triticum aestivum sequences together with 9 sequences from Triticum monococcum and 9 from Hordeum vulgare, known to be involved in frost tolerance from previous studies, served as a back bone for the identification of bread wheat frost tolerance candidate gene sequences (Table 2). If only the coding regions (mRNA-, EST- or protein-sequences) were available, the data bases of the International Wheat Genome Sequencing Consortium (IWGSC, http://www.wheatgenome.org/) and/or the Bristol Wheat Genomics (http://www.cerealsdb.uk.net/) were used for the identification of the full genomic sequence and subsequent reconstruction of the gene structure. The BLAST algorithm parameters were set as default.
Table 2

List of identified frost tolerance candidate gene sequences.

Candidate geneGeneSpeciesCultivarAccessionTypeCitation
Cbf1 TaCBF1 Triticum aestivumWinokaAF376136Gene/CDS[72]
Cbf4 TmCBF4 Triticum monococcumn.aAY951945Gene/CDS[73]
Cbf5 TmCBF5 Triticum monococcumn.aAY951947Gene/CDS[73]
Cbf7 TmCBF7 Triticum monococcumDV92AY785904Gene/CDS[26]
Cbf8 HvCBF8 Hordeum vulgareTremoisDQ445252Gene/CDS[25]
Cbf10 TmCBF10 Triticum monococcumn.aAY951950Gene/CDS[73]
Cbf13 TmCBF13 Triticum monococcumn.aAY951951Gene/CDS[73]
Cbf14 TmCBF14 Triticum monococcumn.aAY951948Gene/CDS[73]
Cbf15 TaCBF15 Triticum aestivumNorstarEF028765Gene/CDS[74]
Cbf16 TmCBF16 Triticum monococcumG3116EU076384Gene/CDS[75]
Cbf17 TmCBF17 Triticum monococcumn.aAY951945Gene/CDS[73]
Cbf18 TmCBF18 Triticum monococcumn.aAY951946Gene/CDS[73]
Dhn1 HvDhn1 Hordeum vulgareDicktooAF043087Gene/CDS[76]
Dhn3 HvDhn3 Hordeum vulgareDicktooAF043089Gene/CDS[76]
Dhn4 HvDhn4 Hordeum vulgareBarkeBQ466915EST[77]
Ice2 HvIce2 Hordeum vulgareMorexDQ113909Gene/CDS[29]
Vrn-A1 TaVRN-A1 Triticum aestivumTriple Dirk C LineAY747600Gene/CDS[34]
Vrn-B1 TaVRN-B1 Triticum aestivumTriple Dirk B LineAY747603Gene/CDS[34]
Vrn-D1 TaVRN-D1 Triticum aestivumTriple Dirk C LineAY747606Gene/CDS[34]
Vrn2 TaVRN2 Hordeum vulgareDairokkakuAY485977partial CDS[78]
Vrn3 TaVRN3 Triticum aestivumChinese SpringDQ890162Gene/CDS[35]
Cab HvCab Hordeum vulgareBarkeBQ465487EST[77]
Dem HvDem Hordeum vulgareBarkeAL504294EST[79]
Tacr7 HvTacr7 Hordeum vulgareGolden PromiseBQ659345EST[77]
Ppd-A1 TaPpd-A1 Triticum aestivumChinese SpringDQ885753Gene/CDS[37]
Ppd-B1 TaPpd-B1 Triticum aestivumChinese SpringDQ885757Gene/CDS[37]
Ppd-D1 TaPpd-D1 Triticum aestivumChinese SpringDQ885766Gene/CDS[37]

Reconstruction of intron-exon-structure and gene specific primer development

The reconstruction of the gene intron-exon-structure was performed using the internet platform ‘Spidey’ (http://www.ncbi.nlm.nih.gov/spidey/spideyweb.cgi) from NCBI, which allowsalignment of mRNA to genomic sequence. The intron/UTR regions sequences were used for primer development. The next step was the identification of the best hits to the three different wheat genomes on the IWGSC and/or the Bristol Wheat Genomics website via BLASTn. After collecting three homologue sequences of each targeted gene the gene structure was reconstructed for each one separately and then used for multiple alignments. Multiple alignments were constructed by using Sequencer 5.1 (Gene Codes Corporation, Ann Arbor, USA) and CLC Main Workbench 7.6 (CLC Bio, Aarhus, Denmark) software and visually inspected for unique stretches among three homologues. The polymorphisms between the three homologous genomes of each gene were detected and used for specific primer development. The primers were developed by using ‘Primer3’ (v. 0.4.0) [46, 47]. Parameters utilized for primer development were set to a maximal 3`stability of 50, primer size between 19 and 28 bp and primer melting temperature between 57° and 63°Celsius. The maximal fragment length was set up to 1200 bp, while optimal fragment length was 900 bp. Other parameters remained as default. Specificity of primers was based on two nucleotide differences within the primer binding site or one difference within the last seven nucleotides at the 3`end of the primer based on the analyses of the three homologue target sequences [48]. All primers were designed to bind locus specific sequences within the introns/UTR regions of selected genes. At least one primer of a primer pair had to be locus specific for single band amplification.

PCR amplification and fragment analysis

Newly designed PCR primers were amplified in two different reaction volumes i.e. firstly, in a volume of 10 μl for functionality testing and chromosomal assignment, and secondly in a 20 μl reaction volume for re-sequencing. The PCR reactions comprised two different polymerases, FIREPol® DNA polymerase (Solis BioDyne, Tartu, Estonia), in a first round of testing, and MyTaq™ DNA polymerase (BIOLINE, Luckenwalde, Germany), in a second round of testing in case the FIREPol product was very weak, with 50 ng of genomic DNA. The master mix for one PCR reaction comprised 0.4 U FIREPol® DNA Polymerase, 1 x Buffer B, 2.5 mM MgCl2 (Solis BioDyne, Tartu, Estonia), 0.2 mM dNTPs (Fermentas, St. Leon-Rot, Germany) and 0.25 pmol primers (Microsynth, Balgach, Switzerland) or 0.4 U MyTaq™ DNA Polymerase, 1 x My Taq Reaction Buffer B (that comprised 1 mM dNTPs and 3 mM MgCl2) (BIOLINE, Luckenwalde, Germany) and 0.25 pmol primers. The fragment amplification was conducted in a thermal cycler GeneAmp® PCR System 9700 (Applied Biosystems, Darmstadt, Germany) under various PCR profiles (S2 Table). PCR fragments were separated by using agarose gel electrophoreses and analysed using the imaging system Gel Doc™ XR and the Quantity One® 1-D analysis software (4.6.2) (Bio-Rad, Hercules, USA).

PCR fragment mapping by using NT- and deletion lines

All specific and single banded PCR fragments were assigned to chromosomes by using 21 nullisomic-tetrasomic (NT) lines [43] and by a set of 46 deletion-lines [44]. The information about chromosomal localisation of these gene specific amplicons was compared to published results. The map of specific PCR fragments was printed via LaTeX 4.4.1 software (freeware).

In silico analysis of primer sub-genome specificity

A set of98 primers used for amplification of 65 PCR fragments with correct chromosomal localisation were in silico validated for sub-genome specificity by aligning to the draft sequence of wheat. The primers were aligned via Multiple Alignment using Fast Fourier Transform (MAFFT, http://www.ebi.ac.uk/Tools/msa/mafft/), CLC and Sequencher. Parameters for the Sequencher based alignment were as follows: clean data with minimum overlap of 19 nucleotides and minimum match percentage of 90%, while CLC and MAFFT parameters were as default. The differences between the sub-genome sequences and designed primers were manually inspected. Primers with sub-genome specificity were those having two or more differences in binding site or at least one difference at the last seven nucleotide bases at 3`end of primer.

Re-sequencing of frost tolerance candidate genes and BLAST verification

Sequencing of PCR fragments was performed by Microsynth AG (Balgach, Switzerland) using the Sanger sequencing method [49]. First sequencing reactions were performed with primers used for amplification and if quality was lower than 70% an optimisation with redesigned oligos was conducted. Subsequently all fragment sequences were compared to reference sequences and/or candidate genes of related species by using NCBI MegaBlast function [50]. The results were limited to five hits, minimum expect threshold of e-100 and minimum identity of 85%. All other parameters remained as default. The haplotype diversity (Hd), the nucleotide diversity and the average number of nucleotide diversity in a set of 24 analysed wheat cultivars were calculated using the DnaSP 5.1 freeware software [51, 52].

Results

Alignment of candidate gene sequences with corresponding genomic sequences retrieved from the International Wheat Genome Sequencing Consortium, the Bristol Wheat Genomics and NCBI allowed the identification of exon-intron splicing positions, and the identification of coding and non coding regions. Therefore, reconstruction of the intron-exon structure by using newly available genomic sequences is the basic step towards the development of gene specific primers in polyploid plants such as hexaploid wheat.

Reconstruction of intron-exon-structure and development of gene specific primers

The workflow for the development of gene specific primers and validation regarding PCR specificity, chromosomal localisation and sequence homology contains four steps (Fig 1). In short, the procedure starts with collecting sequences of candidate genes, followed by the reconstruction of intron and exonstructure and sub-genome sequence identification, until primer development and PCR fragment testing. Functionality and correctness of PCR fragments were assessed by NT mapping, sequencing and BLASTing by using three databases, six tools (‘Spidey’, ‘Primer3’, BLASTn, BLASTx, CLC Main Workbench and Sequencer) and two cytological stocks of wheat.
Fig 1

Workflow of development gene specific primers and PCR fragments in wheat.

The method comprises four steps, i.e. (i) identification of genomic and coding sequences (CDS) of candidate genes, (ii) intron- and exon-structure reconstruction, (iii) identification of wheat A, B and D sub-genome sequences and primer development on sequence differences between the three sub-genomes, and (iv); primer and PCR fragment testing for functionality, correct size and localisation. The dashed lines show optional applications.

Workflow of development gene specific primers and PCR fragments in wheat.

The method comprises four steps, i.e. (i) identification of genomic and coding sequences (CDS) of candidate genes, (ii) intron- and exon-structure reconstruction, (iii) identification of wheat A, B and D sub-genome sequences and primer development on sequence differences between the three sub-genomes, and (iv); primer and PCR fragment testing for functionality, correct size and localisation. The dashed lines show optional applications. For all of the 27 candidate genes we were able to re-construct the gene structure or at least a part of it. A set of 119 PCR products was obtained from 157 primers pairs designed in this study. 13 of them have recently been published in Keilwagen et al. [53]. Additional 12 primers from literature were used for the amplification of targeted genes. By combining the primers from this study and the 12 primers from literature a total of 169 primers were analysed. As an example the reconstruction of the three copies of the Vrn1 gene structure, primer positions, intron length differences and exon SNPs are shown in Fig 2.
Fig 2

Example for an intron-exon structure, intron length differences, exon SNPs and primer position of the three copies of the Vrn1 gene.

(I) A, B and D on the left border stand for the three different wheat sub-genomes and I1 to I7 on the top for the seven introns of Vrn1. The arrows which bend to the right are forward primers and the arrows which bend to the left are reverse primers. The red vertical lines show SNPs between the three gene copies. A I1 has a length of 8518 bp, B I1 2821 bp, D I1 8625bp, A I2 1475 bp, B I2 1246 bp, D I2 1504 bp, A I3 90 bp, B I3 92 bp, D I3 90 bp, A I4 192 bp, B I4 196 bp, D I4 188 bp, A I5 152 bp, B I5 156 bp, D I5 156 bp, A I6 93 bp, B I6 91 bp, D I6 91 bp, A I7 166 bp, B I7 168 bp and D I7 168 bp. (II) This figure shows the SNPs between the three sub-genomes and their coding sequences (CDS) position.

Example for an intron-exon structure, intron length differences, exon SNPs and primer position of the three copies of the Vrn1 gene.

(I) A, B and D on the left border stand for the three different wheat sub-genomes and I1 to I7 on the top for the seven introns of Vrn1. The arrows which bend to the right are forward primers and the arrows which bend to the left are reverse primers. The red vertical lines show SNPs between the three gene copies. A I1 has a length of 8518 bp, B I1 2821 bp, D I1 8625bp, A I2 1475 bp, B I2 1246 bp, D I2 1504 bp, A I3 90 bp, B I3 92 bp, D I3 90 bp, A I4 192 bp, B I4 196 bp, D I4 188 bp, A I5 152 bp, B I5 156 bp, D I5 156 bp, A I6 93 bp, B I6 91 bp, D I6 91 bp, A I7 166 bp, B I7 168 bp and D I7 168 bp. (II) This figure shows the SNPs between the three sub-genomes and their coding sequences (CDS) position.

Testing primers for specificity and chromosomal assignment of PCR products

In total, a set of 169 primers representing 119 PCR products from 27 candidate genes was tested for functionality and specificity. A set of 86 primer combinations from 23 candidate genes showed single band amplification (72.27%). Chromosomal localisation via Nulli-tetrasomic (NT)-lines of Chinese Spring [43] of a set of 86 single band PCR amplicons revealed that 65 fragments were located on expected chromosomes according to the literature. Out of these 65 fragments, six were products of combination of already published and newly designed primers. The remaining 19 fragments showed an incorrect localisation (literature vs. NT-lines) or no localisation was possible as all NT-lines showed a fragment. Correctly assigned amplicons originated from 19 genes and were located on 11 wheat chromosomes (Table 3, Fig 3). A set of 10 out of 19 analysed genes were located on wheat chromosome group 5, out of 119 PCR fragments 65 single bands were correctly localised. That is equivalent to a success rate of 54.6%. These 65 amplicons represent 19 frost tolerance genes, are gene specific and were therefore selected for further studies (Table 4, S2 Table).
Table 3

Overview of chromosome localisation of candidate genes (PCR fragment) via NT-lines, deletion-lines and literature.

GeneNT-linesPCR signal present via deletion-linesDeletion-line localisation sectionLiterature locationReference
Cbf1 5D5DS-5; 5DL-5,-7proximal from 5DL-5 and distal from 5DL-1n.a
Cbf5 7A7AL-1distal on short arm from 7AS-17Am[73]
Cbf7 2B2BS-1,-3,-4distal on long arm from 2BL-6n.a
Cbf10 5A5AS-3,-10; 5AL-17,-23proximal from 5AL-12 and 5AL-175Am[73]
Cbf13 5A5AS-3,-10; 5AL-17,-23proximal from 5AL-12 and 5AL-175Am[73]
Cbf14 5A5AS-3,-10; 5AL-17,-23proximal from 5AL-12 and 5AL-175Am[73]
Cbf15 5A5AS-3,-10; 5AL-17,-23proximal from 5AL-12 and 5AL-175Am[73]
Cbf18 6A6AS-1distal on long arm from 6AL-86Am[73]
Dhn1 5D5DS-5; 5DL-5proximal from 5DL-5 and distal from 5DL-15H[80]
Vrn-A1 5A5AS-3,-10; 5AL-17,-23proximal from 5AL-12 and 5AL-175A[81]
Vrn-B1 5B5BS-4,-5,-6,-8; 5BL-16proxiaml from 5BL-16 and distal from 5BL-95B[81]
Vrn-D1 5D5DS-5; 5DL-7distal on long arm from 5DL-55D[81]
Vrn2 4Deverywhere (4DS-1,-2;-3; 4DL-9,-13)proximal from 4DS-1 and 4DL-95AmL[82]
Vrn3 7B7BL-7;-6;-10distal on short arm from 7BS-17BS[35]
Cab 5A5AS-3,-10; 5AL-17,-23proximal from 5AL-12 and 5AL-175HL[55]
Dem 6B/6D6BS-2; 6BL-3,-5,-6; 6DS-4,-6; 6-DL 10distal on short arm from 6BS-3 and proximal from 6DL-11 and distal from 6DL-126HL[55]
Tacr7 2Beverywhere (2BS-1,-3,-4; 2BL-6)proximal from 2BS-1 and 2BL-62HL[55]
Ppd-B1 2B2BS-3; 2BL-6proximal from 2BS-1 and distal from 2BS-42B[8385]
Ppd-D1 2D2DL-9distal on short arm from 2DS-52D[84, 85]

The table shows the analysed frost tolerance candidate gene, their chromosomal localisation and fine mapping via NT and deletion-lines. The column deletion-line localisation section shows the approximate chromosomal position of respective genes based on deletion break points.

Fig 3

Map of gene specific PCR fragments by using wheat NT- and deletion lines.

In this figure only wheat chromosomes are shown harbouring mapped PCR fragments. The white bar is the chromosome, the constriction symbolised the centromere, on the left side of chromosomes deletion break points are listed and the black bars are the regions of mapped PCR fragments with appended candidate gene.

Table 4

Primer sequences used for amplification of candidate genes.

FragmentForward primer nameForward primer sequence (5`- 3`)Reverse primer nameReverse primer sequence (5`- 3`)
Cbf1AF376136_s1TTTTTGACGCTGCAACTGATAF376136_as709TTTACCGAGGGAGTAGTTTCCA
Cbf5TmCBF5_F * 2 CGATGCAAAGTGTGCAATTCAY951947_as1691ACTAGCTCATGCGAATATGGTGT
Cbf7AY785904_s4TTCTAGTCCACCTAGCTACAGGCAY785904_as926CACTAGCAAAGCAATTCATGAGC
Cbf10AY951950_s1522ACATCTCACACACTCCACAGATGCbf4B_R * 3 GCAGAATCGGCTACAAGCTCCAG
Cbf13Cbf5_F * 3 CAGAGCAGAATCAGATGGGGAATCAY951951_as1964GCTAAGCTCACACTCCTCGATAA
Cbf14AY951948_s_565TAAACTCGCTGCTTAATTACCCCAY951948_as_1312ATATTTGGTGGAACAGAAGCAGA
AY951948_s_528CAGCATCCATCTCTCTCAAATCTAY951948_as_1299CAGAAGCAGAGAAACCGTCTAAA
AY951948_s_565TAAACTCGCTGCTTAATTACCCCAY951948_as_1299CAGAAGCAGAGAAACCGTCTAAA
AY951948_s_528CAGCATCCATCTCTCTCAAATCTAY951948_as_1312ATATTTGGTGGAACAGAAGCAGA
Cbf15EF028765_s_90ACCGACCACCTGCAGTACCEF028765_as_875TTGTTCCATGCATAGAGTCAAAG
Cbf18AY951946_s400CGTATAAATACGCACACGCACTAAY951946_as1445ACATGGTGGAGGGATCTTTTTAT
Dhn1ScDhn1_F * 1 CCACGTAGCACGCACGCTGTAF043087_as1808TCGGAACATAGAGAAGACACACA
Vrn-A1bVRN1-A_F * 4 GAAAGGAAAAATTCTGCTCGAY747600_as1083GATTACCGTCTTAACCCTTCCAC
Vrn-A1cAY747600_s9072CATGAAACAACGCATTACAGAAAAY747600_as10169CAGATAGAACTGGTTGGATCCCT
Vrn-A1dAY747600_s_10698TTTCTGTCATTGTTCCTTCCTGTAY747600_as_11318CAAGCTAAGGCTTCATGACAAGT
AY747600_s_10718TGTCCCACCCAAAGTTAGTAATGAY747600_as_11390AACGATGTAATGAGGTTACGTGC
AY747600_s_10698TTTCTGTCATTGTTCCTTCCTGTAY747600_as_11390AACGATGTAATGAGGTTACGTGC
AY747600_s_10718TGTCCCACCCAAAGTTAGTAATGAY747600_as_11318CAAGCTAAGGCTTCATGACAAGT
Vrn-A1eAY747600_s_11297CTTGTCATGAAGCCTTAGCTTGTAY747600_as_12066GCTGCAGCTTGCTACTTTACTCT
AY747600_s_11297CTTGTCATGAAGCCTTAGCTTGTAY747600_as_12099AAACTGAGGTGGACAAAGTGAAA
Vrn-B1bAY747603_s18AGGCCTAGGGTACAGTAGAATAGTAGAY747606_as820CAAACGGAATCAACCAAACAG
Vrn-B1cAY747603_s3097TCTGAGCAGAATTATACTTACCTTGCAY747606_as9488AGATCATCTGATATCGGCAAAAA
Vrn-B1dAY747603_s_4783CCTTCCTGTTCCACTCAAAGTTAAY747603_as_5249TTTTTAACTGTGAAGAGCATATGACTAA
Vrn-B1eAY747603_s5134AAACAAGAAAAACACTTGCAGAGAAY747603_as6211ATTACATGGTAAATTGAGCCCAG
Vrn-D1bAY747606_s6TTCCCTTCTACTAGGCATAGGGTAY747606_as820CAAACGGAATCAACCAAACAG
Vrn-D1cAY747606_s8129GTGTTGGTAGAAGGCTAGAAGCAAY747606_as9488AGATCATCTGATATCGGCAAAAA
Vrn-D1dAY747606_s10179GACCTCACGCCAATTTTTGTAY747606_as11608TACGAAACAATTTAGACCGGTTG
Vrn-D1eAY747606_s11586CAACCGGTCTAAATTGTTTCGTAAY747606_as12291TTAATTCACATAAACAACATCCCACTA
Vrn2aAY485977_s306AAACAAGCAAACGTTGGAGTTAGAY485977_as1282AATAAGCAATTTCCTGATGCAAA
Vrn2bAY485977_s_1542CAACACTGAATGAAAATGGATCAAY485977_as_1985GAACCATCCGAGGTGAAGTTTA
AY485977_s_1542CAACACTGAATGAAAATGGATCAAY485977_as_1972TGAAGTTTACTAGGATCATGGGG
AY485977_s_1439CCATAGAGCAATTGAGTTTGGACAY485977_as_1972TGAAGTTTACTAGGATCATGGGG
Vrn2a/bAY485977_s306AAACAAGCAAACGTTGGAGTTAGAY485977_as_1972TGAAGTTTACTAGGATCATGGGG
Vrn3aDQ890162_s_1430AAGGAGTACTAGAGCGGCGAGDQ890162_as_1915TGTGGTGAGCACTTTCAGAGATA
DQ890162_s_1552TTCCTCAATTCACAGCTTACTCCDQ890162_as_1915TGTGGTGAGCACTTTCAGAGATA
Vrn3bDQ890162_s2159TCTTAAATACTCTCTCCGTCCGADQ890162_as3153AAGCCATTGATCTAGGGTTCAC
DQ890162_s2396GAAGTACACTTATTCGTGGACGGDQ890162_as3153AAGCCATTGATCTAGGGTTCAC
Vrn3a/bDQ890162_s_1552TTCCTCAATTCACAGCTTACTCCDQ890162_as3153AAGCCATTGATCTAGGGTTCAC
Cab bcontig22616_s209TTTTGCGAAAGCACACTTATACAcontig22616_as938GAAGCATCGCCAGCTATAAATAC
Cab dcontig22616_s209TTTTGCGAAAGCACACTTATACAcontig22616_as828CAGTTGCAGCAGAGAGATTCTT
DemCD937801_s29ATACCATCGGCAACTCCTCTGcontig1013618_as520CCATTATGGATAGCGAAATTTGA
Tacr7 bcontig4120743_s26CAACCAAAACTCGCCTATAAAAGcontig2688312_as455AATCGGAGAGGAAGCTCTCTTTA
Tacr7 ccontig4120743_s271CGAGGAGAAGGTTTGGGGTTcontig2688312_as455AATCGGAGAGGAAGCTCTCTTTA
Tacr7 dBJ246882_s196GTCGGCGAGGAGAAGGTTTTcontig2688312_as455AATCGGAGAGGAAGCTCTCTTTA
Ppd-B1cDQ885757_s11028TCCTTCCAGCTTACTAGTGCATCDQ885757_as11954ATCACCTGGAAAACATATTGGAA
Ppd-B1dDQ885757_s11883AACTGAACCAAAAGCCTGCTACTDQ885757_as12453GTACCTTGCAAAGAATGAAAACG
Ppd-B1eDQ885757_s_12390CCTTTGTGAATCCTTAAATCATCCDQ885757_as_13162AACAGAGAACAAACGAAATCGG
Ppd-B1fDQ885757_s_13184GGGCTTATTCATGATAGCTGATGDQ885757_as_13562ATCGACTCCGCACTTCTACTATG
DQ885757_s_13148CGTTTGTTCTCTGTTCTTCGTTTDQ885757_as_13625ACCGTTACACAGGTTCAGACATT
DQ885757_s_13184GGGCTTATTCATGATAGCTGATGDQ885757_as_13625ACCGTTACACAGGTTCAGACATT
DQ885757_s_13148CGTTTGTTCTCTGTTCTTCGTTTDQ885757_as_13562ATCGACTCCGCACTTCTACTATG
Ppd-D1 PromDQ885766_s3601 6 CTTGTCCAACTCCCAATCTAGTGDQ885766_as4689 6 TCCTCCCCTGTTTCTTTTTACTC
DQ885766_s4578TCGTCCATCCAAAGATACTGATTDQ885766_as5712 6 AGTACGCTGCCGTGAGTAATAAT
DQ885766_s4450 6 CATACTCCCTCCGTTTCTTCTTTDQ885766_as5712 6 AGTACGCTGCCGTGAGTAATAAT
DQ885766_s4578TCGTCCATCCAAAGATACTGATTDQ885766_as5700TGAGTAATAATCGAACCTCGGTC
Ppd-D1aDQ885766_s5689ATTATTACTCACGGCAGCGTACTDQ885766_as6299 6 TACTGAAACATTTTAGGGCCAAG
DQ885766_s5677 6 GACCGAGGTTCGATTATTACTCADQ885766_as6299 6 TACTGAAACATTTTAGGGCCAAG
Ppd-D1a2DQ885766_s5766 6 CAACATGTTTCCTCTTGGAGCDQ885766_as6535 6 GAACAGAGTCAAACACCATCAGA
Ppd-D1bDQ885766_s6298TATCAGGTTCATTTGCTTCAGTGDQ885766_as7002 6 ATGGACAAATTGACCTCTAGTGC
DQ885766_s6277 6 CTTGGCCCTAAAATGTTTCAGTADQ885766_as7002 6 ATGGACAAATTGACCTCTAGTGC
DQ885766_s6277 6 CTTGGCCCTAAAATGTTTCAGTADQ885766_as6963GCCATTCAGTTTTATCTAGCTTCC
Ppd-D1cDQ885766_s7244TGACAAGTATCTGCATCTGAACCDQ885766_as8033 6 GATTCGCAAAGGACACTGATATT
Ppd-D1dDQ885766_s6939 6 GGAAGCTAGATAAAACTGAATGGCDQ885766_as8033 6 GATTCGCAAAGGACACTGATATT
Ppd-D1eDQ885766_s8011 6 AATATCAGTGTCCTTTGCGAATCPpd-D1exon8_R1 * 5 gtctaaatagtaggtactagg
Ppd-D1 3`UTRDQ885766_s8771CTGCTCTCTGTTCTTGGTTTCATDQ885766_as9720ACCTCCCTGACGAAAAGCTC

Primer names with † are developed in course of this work but published from Keilwagen et al. [53].

Primer names with * as already published were used in combination with primers with † and without labels.

1[86]

2[73]

3[87]

4[81]

5[37]

6 [53]

Map of gene specific PCR fragments by using wheat NT- and deletion lines.

In this figure only wheat chromosomes are shown harbouring mapped PCR fragments. The white bar is the chromosome, the constriction symbolised the centromere, on the left side of chromosomes deletion break points are listed and the black bars are the regions of mapped PCR fragments with appended candidate gene. The table shows the analysed frost tolerance candidate gene, their chromosomal localisation and fine mapping via NT and deletion-lines. The column deletion-line localisation section shows the approximate chromosomal position of respective genes based on deletion break points. Primer names with † are developed in course of this work but published from Keilwagen et al. [53]. Primer names with * as already published were used in combination with primers with † and without labels. 1[86] 2[73] 3[87] 4[81] 5[37] 6 [53] Furthermore, a set of 40 amplicons was physically assigned using a set of 46 available deletion-lines [44] (Fig 3, Table 3). All six genes, which are localised on chromosome 5A via NT-lines, are map to a large cluster between sector AL-12 and AL-17 on the long arm of chromosome 5. The draft sequence of wheat and related species allows detailed in silicio analysis of oligos used in this study by doing simple BLAST comparison. Out of 98 oligos that were used for the amplification of 65 PCR fragments, 54 turned out to be specific to one sub-genome, 21 specific to two sub-genomes, and 14 were unspecific. For 9 oligos the comparison could not be performed due to non availability of sub-genome sequences (S2 Table). 57 out of 65 amplicons comprise at least one sub-genome specific primer. For five PCR fragments (Cbf5, Dhn1, Cab b, Cab d and Dem) no wheat sub-genome sequences could be identified. Both primers of PCR fragments Cbf7, Ppd-B1f and Ppd-D1b showed no-specificity to one sub-genome in reference to Wu et al. [48]. Nevertheless, all three fragments showed single bands and correct chromosome localisation via NT-lines (S1 Fig). The primer sequences of Ppd-B1f and Ppd-D1b were derived from a specific sub-genome. At least one of the primers showed one or more differences to corresponding regions on the chromosomes in alignments with the other two sub-genomes. Special cases are the primers of fragment Cbf7. The forward primer has no sub-genome specificity and the reverse primer is specific to sub-genomes A and B (S3 Table).

Re-sequencing of genes involved in frost tolerance and homology validation via BLAST

Five out of 40 amplicons revealed a presence/absence polymorphisms (dominant) and were therefore not sequenced. These five dominant markers were directly used for genotyping of a Ppd-D1 deletion in the promoter and a transposable element (TE) in intron1 [54]. One PCR fragment (Cbf7) could not be sequenced due to very low quality. Finally, 34 amplicons, representing 18 frost tolerance genes, were selected for sequencing and all 34 obtained sequences were compared to retrieved gene models via MegaBlast. In case of Tacr7, Kocsy et al. [55] identified BQ659345 of Hordeum vulgare as reference, but the fragments of 24 sequenced genotypes of Tacr7 do not exceed an identity of 84% to the published wheat reference sequence L28093 for Tacr7[31]. The best BLAST hit of the 24 sequences is still the initial barley sequence BQ659345 with an identity of 92.3%, but the second best barley BLAST hit of 91.8% identity is to X97916, the barley low temperature gene 14.1 abbreviated as blt14.1. The rest of 33 BLAST results show very high identities from 88.8% to 100% to initial gene sequences (Table 5).
Table 5

BLASTn results of sequenced PCR fragments versus NCBI nucleotide collection (nr/nt) and NCBI candidate gene reference EST.

GenePCR fragmentDatabaseSubject Seq-id (ID of the database hit)Percentage of identical matchesExpectation value (E-value)Bit scoreSubject discribtion
Cab Cab bESTgi|21273269|gb|BQ465487.1|93,710643HU03M14r HU Hordeum vulgare subsp. vulgare cDNA clone HU03M14 5-PRIME, mRNA sequence.
Cbf1 Cbf1nucleotidegi|17226800|gb|AF376136.1|10001279Triticum aestivum putative CRT/DRE-binding factor (CBF1) mRNA, complete cds
Cbf5 Cbf5nucleotidegi|404415276|gb|JN987194.1|10001519Triticum aestivum AP2 domain CBF protein (CBFII) mRNA, CBFII-5.4 allele, complete cds
Cbf10 Cbf10nucleotidegi|404415286|gb|JN987199.1|99,5301548Triticum aestivum AP2 domain CBF protein (CBFIIIc) mRNA, CBFIIIc-10.1 allele, complete cds
Cbf13 Cbf13nucleotidegi|404415320|gb|JN987217.1|10001493Triticum aestivum AP2 domain CBF protein (CBFIIIc) pseudogene, CBFIIIc-13.1c allele, complete sequence
Cbf14 Cbf14nucleotidegi|158999375|gb|EU076382.1|99,010902Triticum monococcum CBF14 gene, complete cds
Cbf15 Cbf15nucleotidegi|404415321|gb|JN987218.1|10001325Triticum aestivum AP2 domain CBF protein (CBFIIId) gene, CBFIIId-15.2b allele, complete cds
Cbf18 Cbf18nucleotidegi|63098599|gb|AY951946.1|94,9201559Triticum monococcum CRT/DRE binding factor 18 (CBF18) gene, complete cds
Dem Dem bnucleotidegi|241986478|dbj|AK333739.1|97,370713Triticum aestivum cDNA, clone: WT008_O03, cultivar: Chinese Spring
ESTgi|12030509|emb|AL504294.1|90,377,00E-128466AL504294 Hordeum vulgare Barke roots Hordeum vulgare subsp. vulgare cDNA clone HW04N07 5', mRNA sequence.
Dhn1 Dhn1nucleotidegi|59894280|gb|AY895879.1|88,793,00E-176625Hordeum vulgare subsp. spontaneum voucher NPGS PI 559556 dehydrin 1 (Dhn1) gene, partial cds
Ppd-B1 Ppd-B1cnucleotidegi|456359289|dbj|AB745620.1|10001679Triticum turgidum subsp. pyramidale Ppd-B1 gene for pseudo-response regulator, complete cds, strain: KU-9882
Ppd-B1dnucleotidegi|456359289|dbj|AB745620.1|99,8201009Triticum turgidum subsp. pyramidale Ppd-B1 gene for pseudo-response regulator, complete cds, strain: KU-9882
Ppd-B1enucleotidegi|456359289|dbj|AB745620.1|99,301297Triticum turgidum subsp. pyramidale Ppd-B1 gene for pseudo-response regulator, complete cds, strain: KU-9882
Ppd-B1fnucleotidegi|383215299|gb|JF946486.1|99,740693Triticum aestivum transposon TREP 3040_Harbinger, complete sequence; pseudo-response regulator (Ppd-B1) gene, Ppd-B1a allele, complete cds; and retrotransposon Gypsy TREP 3457_Danae, complete sequence
Ppd-D1 Ppd-D1dnucleotidegi|395759126|dbj|AB646977.1|99,9101965Triticum aestivum PRR gene for pseudo-response regulator, complete cds, allele: Ppd-D1b.2
Ppd-D1enucleotidegi|395759124|dbj|AB646976.1|10001731Triticum aestivum PRR gene for pseudo-response regulator, complete cds, allele: Ppd-D1a.1
PpD1 3`UTRnucleotidegi|118638641|gb|DQ885766.1|10001629Triticum aestivum cultivar Chinese Spring chromosome 2D pseudo-response regulator (PRR) gene, complete cds
Tacr7 Tacr7 bnucleotidegi|1418967|emb|X97916.1|91,750778H.vulgare blt14.1 gene
ESTgi|21800478|gb|BQ659345.1|92,345,00E-180640HD01A06w HD Hordeum vulgare cDNA clone HD01A06 3-PRIME, mRNA sequence.
Vrn2 Vrn2anucleotidegi|211593611|gb|FJ173824.1|91,076,00E-149534Triticum turgidum retrotransposon Wilma, partial sequence; and ZCCT2-B2b (VRN2) gene, complete cds
Vrn2bnucleotidegi|45390737|gb|AY485977.1|92,5701120Hordeum vulgare cultivar Dairokkaku ZCCT-Ha (VRN2) gene, partial cds
Vrn3 Vrn3anucleotidegi|117168399|gb|DQ890162.1|10001825Triticum aestivum cultivar Chinese Spring VRN3 (vrn-B3) gene, complete cds
Vrn3bnucleotidegi|117168399|gb|DQ890162.1|10001801Triticum aestivum cultivar Chinese Spring VRN3 (vrn-B3) gene, complete cds
Vrn-A1 Vrn-A1bnucleotidegi|383215288|gb|JF965395.1|10001829Triticum aestivum cultivar Claire VRN-A1 (VRN-A1) gene, complete cds
Vrn-A1cnucleotidegi|383215290|gb|JF965396.1|10001995Triticum aestivum cultivar Malacca VRN-A1 (VRN-A1) gene, complete cds
Vrn-A1dnucleotidegi|383215290|gb|JF965396.1|10001109Triticum aestivum cultivar Malacca VRN-A1 (VRN-A1) gene, complete cds
Vrn-A1enucleotidegi|383215292|gb|JF965397.1|10001279Triticum aestivum cultivar Hereward VRN-A1 (VRN-A1) gene, complete cds
Vrn-B1 Vrn-B1bnucleotidegi|384371844|gb|HQ130483.2|10001459Triticum aestivum cultivar Diamant2 Vrn-B1 (Vrn-B1) gene, Vrn-B1-a allele, promoter region and complete cds
Vrn-B1cnucleotidegi|384371844|gb|HQ130483.2|99,8201016Triticum aestivum cultivar Diamant2 Vrn-B1 (Vrn-B1) gene, Vrn-B1-a allele, promoter region and complete cds
Vrn-B1dnucleotidegi|384371844|gb|HQ130483.2|1000867Triticum aestivum cultivar Diamant2 Vrn-B1 (Vrn-B1) gene, Vrn-B1-a allele, promoter region and complete cds
Vrn-B1enucleotidegi|58423007|gb|AY747604.1|10001969Triticum aestivum cultivar Triple Dirk C line VRN-B1 (VRN-B1) gene, complete cds
Vrn-D1 Vrn-D1bnucleotidegi|58423011|gb|AY747606.1|10001328Triticum aestivum cultivar Triple Dirk C line VRN-D1 (VRN-D1) gene, complete cds
Vrn-D1cnucleotidegi|58423011|gb|AY747606.1|10001701Triticum aestivum cultivar Triple Dirk C line VRN-D1 (VRN-D1) gene, complete cds
Vrn-D1dnucleotidegi|58423011|gb|AY747606.1|99,1601714Triticum aestivum cultivar Triple Dirk C line VRN-D1 (VRN-D1) gene, complete cds
Vrn-D1enucleotidegi|58423011|gb|AY747606.1|10001327Triticum aestivum cultivar Triple Dirk C line VRN-D1 (VRN-D1) gene, complete cds
In 12 genes out ofa set of 18 sequenced candidate genes represented by 16 unique PCR amplicons, differences between the 24 genotypes were determined, revealing a high level of polymorphimsof 66.67%. The number of polymorphic sites ranged from 1 to 37, the haplotypes (h) from two to three, the haplotype diversity (Hd) from 0.08 to 0.61 and the nucleotide diversity (π) from 0.00008 to 0.00757 (Table 6).
Table 6

Nucleotide polymorphisms of coding and noncoding candidate gene regions.

GeneNo. accessonsNo. of bpNo. of polymorphic sitesPercentage polymorphismhHdkπk (i)π (i)
CBF5 2382420,2430,490,950,001150,440,00054
CBF10 2377620,2620,470,950,0012n/an/a
CBF13 2377340,5220,471,420,001930,950,00123
CBF14 22118460,5130,481,910,001630,460,00038
CBF15A 2475570,9320,492,940,003950,490,00065
CBF18 22951373,8920,093,090,003280,270,00029
Vrn-A1 24295490,3020,080,420,000140,330,00011
Vrn-D1 23309310,0320,240,240,00008n/an/a
Vrn3 24156610,0620,520,520,00033n/an/a
Cab 24707131,8430,615,180,007571,260,00179
Ppd-B1 24397120,0530,360,370,00009n/an/a
Ppd-D1 24264210,0420,23n/an/a0,230,00009

h haplotypes

Hd haplotype diversity

k average number of nucleotide differences

π nucleotide diversity

(i) InDel

h haplotypes Hd haplotype diversity k average number of nucleotide differences π nucleotide diversity (i) InDel The results of the workflow for locus specific primer development presented in this paper are very promising. The main workflow step is the identification of sub-genome sequences and the design of primers on sub-genome sequence differences. This is the essential step of this workflow and is crucial for the success of this approach. The primer amplification test for single bands and the fragment mapping via NT-lines are a simple way to verify locus specificity. The sequencing of selected locus specific amplicons and the BLAST analysis of these fragment sequences versus initial data bases is the last step of safe-guarding the correct amplification. The results of this BLAST search showed no critical differences to the initially selected sequences.

Discussion

New bioinformatic platforms and data bases containing recent genomics data are a powerful resource for the development of tools for molecular plant breeding.

Gene specific primer development and chromosomal assignment of specific PCR fragments by using NT- and deletion lines

The rapid progress in sequencing of plant genomes leads to the accumulation of whole genome sequence data,allowing the fast development of locus/genome specific markers in complex plant genomes (e.g. wheat) with a high success rate. Up to now, high homology of the hexaploid wheat genome hampered the success in gene specific primer development. Gene structure is important for marker development, because wheat introns have more sequence differences between the homologous chromosomes than exons [56, 57]. Therefore, gene structure reconstruction and comparison of homologue sequences by using three genomes facilitate an improved development of molecular markers as well as re-sequencing of targeted genes/loci.

Specificity of developed primers

Specificity of primers is the non-recurring binding in the target genome. This is reflected in a single PCR and a correct or syntenically localised amplicon. Fig 4 shows an example of the Cbf1 amplicon localisation via NT- and deletion-lines.
Fig 4

Example of fragment localisation from Cbf1 via NT- and deletion-lines.

(A) The missing PCR fragment on NT-line N5D-T5B indicated the location on wheat chromosome 5D. (B) The missing PCR fragment on Csdel 5DL-1 indicated the location on wheat long arm of chromosome 5D between the deletion segments 1 and 5.

Example of fragment localisation from Cbf1 via NT- and deletion-lines.

(A) The missing PCR fragment on NT-line N5D-T5B indicated the location on wheat chromosome 5D. (B) The missing PCR fragment on Csdel 5DL-1 indicated the location on wheat long arm of chromosome 5D between the deletion segments 1 and 5. The inspection of primer functionality and single PCR product generation is a standard for the development of primers and therefore is the first necessary step of the presented approach. Via the first inspection step we have eliminated 27.73% of studied primer pair combinations. Most of these showed no PCR amplification probably due to non-binding of target sequences. The second important step of checking the amplicon specificity is the mapping of the PCR products via NT-lines to get information about the correct amplification on the correct target chromosome template and sub-genome. By using NT-mapping of PCR amplicons we have eliminated 18.49% of primer pair combinations. One part of the eliminated PCR products shows a chromosome localisation that differs from what has been reported in the literature. In this case, we assume a non-specific binding in the wheat genome. That can occur if primers are derived from related organisms and not from wheat itself. For seven of eight discarded candidate genes, sequences of related organisms (Triticum monococcum and Hordeum vulgare) were used for primer development.The other part of eliminated primer pair combinations showed a PCR product on all NT-lines which may be due to the fact that both primers (forward and reverse) bind at least to two sub-genomes. By using the draft wheat chromosome arm sorted sequences [10-12] and simple comparative methods we were able to develop gene specific primers in hexaploid wheat with a high success rate of 58.60%. Also a very high rate of 54.62% for specific fragment amplification confirmed the usefulness of wheat genomic sequence. To our knowledge such high rate is not yet described in literature for specific primer/marker development in polyploid plants. An overview of published success rates revealed a variation in microsatellite amplification in wheat between 22.88 and 45.0% [58-61]. In cotton this rate was 23.3% [62]. Contrary, Wang et al. [63] describe the development of effectively derived primers for sequence tagged sites (STS) with 24.56% and for STS primer combinations of only 3.7% in wheat. Chen et al. [64] achieved a rate of 27.5% for STS marker development in wheat. In Brassica oleracea (which is a paleohexaploid plant) a success rate of 29.1% is described in allele specific PCR primer development [65]. The highest success rate reported in literature is for potato [56]. In this study a rate of 51.79% developed intron targeting (IT) markers was achieved. With the ongoing genome sequencing projects and subsequent development of genome-wide physical maps in wheat and related plants an increase in the success of specific primer development may be expected.

Sequencing of frost tolerance candidate genes and BLAST based verification

In this study 18 out of 19 (94.74%) frost tolerance genes were sequenced using the same primers used for PCR amplification. For gene Cbf7, for which initial sequencing failed, a set of newly designed sequencing primers improved the sequencing, therefore optimisation for single band products could be recommended as a part of the verification procedure. Concerning the gene Tacr7, Kocsy et al. [55] claimed BQ659345 of Hordeum vulgare is identical to the Tacr7 gene in wheat. However, the analysis of the generated sequences presented in this paper showed an identity of 84% to the reference sequence L28093 for Tacr7 of wheat and 92% to BQ659345. In contrast, our sequences reveal an identity of 92% to X97916 of Hordeum vulgare which is annotated as the barley low temperature gene 14.1 (Blt14.1). BLT14.1 shows a considerable homology to WLT10, as described by Ohno et al. [66]. Matching BQ659345 against X97916 results in an identity of 99%. Furthermore Tacr7, Blt14.1 and Wlt10 are located on chromosome 2 of barley and wheat, respectively [55, 66, 67]. We also mapped PCR fragments derived from Tacr7 on chromosome 2B. Further BLAST results indicate that the sequence of our Tacr7 is with 92% the initial sequence BQ659345. Furthermore, it was shown recently that the newest sequence of Tacr7 [55] is very similar to the sequences of the genes Blt14.1 and Wlt10, in contrast to the L28093 sequence (described also as Tac7 [31]). The nucleotide identity of 99% between Blt14.1 (X97916) and the initial reference sequence (BQ659345), which is published as Tacr7 [55], backed this hypothesis. All other PCR fragment sequences have shown a very good sequence identity to the original gene of interest (97.5%). The sequencing of single bands and correct chromosome assigned PCR amplicons followed by BLAST based verification is the last check-up step in the workflow presented in this study. The results of the BLAST based verification demonstrate that the selection of PCR single products and the assignment to the correct chromosomes of the PCR amplicons is an efficient instrument of locus specific primer selection. The combination of sequencing and BLAST based verification using the presented approach leads to very robust results with an error rate tending to zero. The identified SNPs at 11 polymorphic candidate genes can be used for developing SNP based marker. Also the InDels in eight candidate genes are suited for marker development based on size polymorphisms. Based on these PCR amplicons can be employed for genetic mapping of correspondingcandidate genes in biparental mapping populations, thereby allowing for the first time their genetic localization. This paper describes an efficient approach for the development of locus specific primers in wheat. With the aid of this locus specific primers are necessary for locus specific sequencing and detection of genes specific polymorphisms (SNPs and InDels) between genotypes of interest. The detected polymorphisms can follow up the use for genetic mapping, but also for gene editing via sequence information for transcription activator-like effector nucleases (TALENs) [68-70] or clustered regularly interspaced short palindromic repeats (CRISPR/Cas) systems [71]. Therefore our approach of development of locus specific primers is a base for many downstream applications i.e. detection of new polymorphisms, development of new markers, genetic mapping and gene editing in wheat.

Conclusion

It is still difficult to develop molecular markers in Triticum aestivum due to the very complex genome. In this study we presented anefficient approach for gene and genome specific primer development by using sequence data of wheat. Altogether, we have developed specific primers for 19 out of 27 selected frost candidate genes. For 27 candidate genes 119 primer pairs were generated of which 65 were specific. Out of candidate gene specific primer fragments 36 fragments were selected, corresponding to 19 genes, for validation via sequencing. Finally, 35 amplicons could be successfully sequenced and only one specific sequence showed a low identity of approximately 83% to the original reference sequence. By using the presented approach for gene specific primer/PCR development, it is possible to sequence and analyse interesting candidate genes in wheat by using gene information of related sequenced plant species. The wheat genome sequences currently available, in combination with the wheat physical map, are well suited for the development of specific primers. The approach for primer design, developed within this study turned out to be very efficient by using available wheat genomic resources and it is expected to perform even better once new versions of wheat genomic sequences will be available.

Chromosome localisation and cycler programs of the PCR fragments Cbf7, Ppd-B1f and Ppd-D1b via NT-lines.

(TIF) Click here for additional data file.

Plant material of NT- and Deletion-lines.

(XLSX) Click here for additional data file.

Candidate gene specific primers, primer specificity, PCR fragments, used polymerases, cycler programs and primers for fragment re-sequencing.

Primer names with † are developed in course of this work but published from Keilwagen et al. [53]. Primer names with * as already published were used in combination with primers with † and without labels. 1[86]; 2[73]; 3[87]; 4[81]; 5[37]; 6 [53]. †Primers published in Keilwagen et al. [53]. *Already published primers (XLSX) Click here for additional data file.

Primer specificity and mismatches to compared the three sub-genomes of functional and correct localised PCR fragments via in silico alignments.

Primers assigned † are developed in course of this study and published in Keilwagen et al. [54]. Already published primers with * assigned were used in combination with primers in green and black. The column differences describe the numbers of SNPs/InDels between primers at sequence level of A, B and D genomes. The columns position of InDels and SNPs in 5' to 3`direction describes the position of the differences between primers at sub-genomes (InDels and SNP) from primer 5´ to 3´ end direction. 1[86]; 2[73]; 3[87]; 4[81]; 5[37]; 6 [53]. †Primers published in Keilwagen et al. [53]. *Already published primers. (XLSX) Click here for additional data file.
  62 in total

1.  EST analysis in barley defines a unigene set comprising 4,000 genes.

Authors:  W Michalek; W Weschke; K-P Pleissner; A Graner
Journal:  Theor Appl Genet       Date:  2002-01       Impact factor: 5.699

2.  A simple cipher governs DNA recognition by TAL effectors.

Authors:  Matthew J Moscou; Adam J Bogdanove
Journal:  Science       Date:  2009-12-11       Impact factor: 47.728

3.  Regulation of gene expression by chromosome 5A during cold hardening in wheat.

Authors:  Gábor Kocsy; Benedikt Athmer; Dragan Perovic; Axel Himmelbach; Attila Szucs; Ildikó Vashegyi; Patrick Schweizer; Gábor Galiba; Nils Stein
Journal:  Mol Genet Genomics       Date:  2010-02-24       Impact factor: 3.291

4.  ICE1: a regulator of cold-induced transcriptome and freezing tolerance in Arabidopsis.

Authors:  Viswanathan Chinnusamy; Masaru Ohta; Siddhartha Kanrar; Byeong-Ha Lee; Xuhui Hong; Manu Agarwal; Jian-Kang Zhu
Journal:  Genes Dev       Date:  2003-04-02       Impact factor: 11.361

5.  Development of intron targeting (IT) markers for potato and cross-species amplification in Solanum nigrum (Solanaceae).

Authors:  Péter Poczai; István Cernák; Ahmad Mosapour Gorji; Sándor Nagy; János Taller; Zsolt Polgár
Journal:  Am J Bot       Date:  2010-11-05       Impact factor: 3.844

6.  Megabase level sequencing reveals contrasted organization and evolution patterns of the wheat gene and transposable element spaces.

Authors:  Frédéric Choulet; Thomas Wicker; Camille Rustenholz; Etienne Paux; Jérome Salse; Philippe Leroy; Stéphane Schlub; Marie-Christine Le Paslier; Ghislaine Magdelenat; Catherine Gonthier; Arnaud Couloux; Hikmet Budak; James Breen; Michael Pumphrey; Sixin Liu; Xiuying Kong; Jizeng Jia; Marta Gut; Dominique Brunel; James A Anderson; Bikram S Gill; Rudi Appels; Beat Keller; Catherine Feuillet
Journal:  Plant Cell       Date:  2010-06-25       Impact factor: 11.277

7.  Draft genome of the wheat A-genome progenitor Triticum urartu.

Authors:  Hong-Qing Ling; Shancen Zhao; Dongcheng Liu; Junyi Wang; Hua Sun; Chi Zhang; Huajie Fan; Dong Li; Lingli Dong; Yong Tao; Chuan Gao; Huilan Wu; Yiwen Li; Yan Cui; Xiaosen Guo; Shusong Zheng; Biao Wang; Kang Yu; Qinsi Liang; Wenlong Yang; Xueyuan Lou; Jie Chen; Mingji Feng; Jianbo Jian; Xiaofei Zhang; Guangbin Luo; Ying Jiang; Junjie Liu; Zhaobao Wang; Yuhui Sha; Bairu Zhang; Huajun Wu; Dingzhong Tang; Qianhua Shen; Pengya Xue; Shenhao Zou; Xiujie Wang; Xin Liu; Famin Wang; Yanping Yang; Xueli An; Zhenying Dong; Kunpu Zhang; Xiangqi Zhang; Ming-Cheng Luo; Jan Dvorak; Yiping Tong; Jian Wang; Huanming Yang; Zhensheng Li; Daowen Wang; Aimin Zhang; Jun Wang
Journal:  Nature       Date:  2013-03-24       Impact factor: 49.962

8.  Transcript-specific, single-nucleotide polymorphism discovery and linkage analysis in hexaploid bread wheat (Triticum aestivum L.).

Authors:  Alexandra M Allen; Gary L A Barker; Simon T Berry; Jane A Coghill; Rhian Gwilliam; Susan Kirby; Phil Robinson; Rachel C Brenchley; Rosalinda D'Amore; Neil McKenzie; Darren Waite; Anthony Hall; Michael Bevan; Neil Hall; Keith J Edwards
Journal:  Plant Biotechnol J       Date:  2011-06-01       Impact factor: 9.803

9.  Mapping of barley homologs to genes that regulate low temperature tolerance in Arabidopsis.

Authors:  Jeffrey S Skinner; Péter Szucs; Jarislav von Zitzewitz; Luis Marquez-Cedillo; Tanya Filichkin; Eric J Stockinger; Michael F Thomashow; Tony H H Chen; Patrick M Hayes
Journal:  Theor Appl Genet       Date:  2005-12-20       Impact factor: 5.574

10.  The wheat VRN2 gene is a flowering repressor down-regulated by vernalization.

Authors:  Liuling Yan; Artem Loukoianov; Ann Blechl; Gabriela Tranquilli; Wusirika Ramakrishna; Phillip SanMiguel; Jeffrey L Bennetzen; Viviana Echenique; Jorge Dubcovsky
Journal:  Science       Date:  2004-03-12       Impact factor: 47.728

View more
  1 in total

1.  Association genetics studies on frost tolerance in wheat (Triticum aestivum L.) reveal new highly conserved amino acid substitutions in CBF-A3, CBF-A15, VRN3 and PPD1 genes.

Authors:  Steve Babben; Edgar Schliephake; Philipp Janitza; Thomas Berner; Jens Keilwagen; Michael Koch; Fernando Alberto Arana-Ceballos; Sven Eduard Templer; Yuriy Chesnokov; Tatyana Pshenichnikova; Jörg Schondelmaier; Andreas Börner; Klaus Pillen; Frank Ordon; Dragan Perovic
Journal:  BMC Genomics       Date:  2018-05-29       Impact factor: 3.969

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.