Literature DB >> 30086718

Consensus genetic linkage map construction and QTL mapping for plant height-related traits in linseed flax (Linum usitatissimum L.).

Jianping Zhang1, Yan Long2, Liming Wang1, Zhao Dang1, Tianbao Zhang2, Xiaxia Song2, Zhanhai Dang3, Xinwu Pei4.   

Abstract

BACKGROUND: Flax is an important field crop that can be used for either oilseed or fiber production. Plant height and technical length are important characters for flax. For linseed flax, plants usually have a short technical length and plant height than those for fiber flax. As an important agronomical character for fiber and linseed flax, plant height is usually a selection target for breeding. However, because of limited technologies and methods available, there has been little research focused on discovering the molecular mechanism controlling plant height.
RESULTS: In this study, two related recombinant inbred line (RIL) populations developed from crosses of linseed and fiber parents were developed and phenotyped for plant height and technical length in four environments. A consensus linkage map based on two RIL populations was constructed using SNP markers generated by genotyping by sequencing (GBS) technology. A total of 4497 single nucleotide polymorphism (SNP) markers were included on 15 linkage groups with an average marker density of one marker every 2.71 cM. Quantitative trait locus (QTL) mapping analysis was performed for plant height and technical length using the two populations. A total of 19 QTLs were identified for plant height and technical length. For the MH population, eight plant height QTLs and seven technical length QTLs were identified, five of which were common QTLs for both traits. For the PH population, six plant height and three technical length QTLs were identified. By comparing the QTLs and candidate gene information in the two population, two common QTLs and three candidate genes were discovered.
CONCLUSIONS: This study provides a foundation for map-based cloning of QTLs and marker-assisted selection for plant height-related traits in linseed and fiber flax.

Entities:  

Keywords:  Flax; Linkage map; Plant height; QTL; SNP; Technical length

Mesh:

Year:  2018        PMID: 30086718      PMCID: PMC6081803          DOI: 10.1186/s12870-018-1366-6

Source DB:  PubMed          Journal:  BMC Plant Biol        ISSN: 1471-2229            Impact factor:   4.215


Background

Flax (Linum usitatissimum L.) is an annual self-pollinated diploid crop (2n = 2× = 30) that can be used either for stem fiber or seed oil production about 7000 years ago [1]. Fiber flax and linseed flax are genetically the same but morphologically different [2]. One of the important differences between fiber and linseed flax is the character of plant height. Fiber flax is usually 80–120 cm high with fewer branches, while oil flax is usually about 70 cm high with many branches. Fiber flax is mainly grown in Northern Europe, Russia and China, and linseed flax is widely grown in Canada, India, America, Argentina and Germany, as well as Russia and China (Scientific Database of China Plant Species, http://db.kib.ac.cn). Flax oil contains a mixture of fatty acids, including saturated, monounsaturated and polyunsaturated fatty acids. Among the different types of fatty acids, the oil is rich in polyunsaturated fatty acids, particularly alpha-linolenic acid (ALA), the essential omega-3 fatty acid, and linoleic acid (LA), the essential omega-6 fatty acid [3]. In China, linseed flax is mainly distributed in the Northwest and North areas, and has a planting history of about 2000 years [4]. As one of the important agronomical characters for both fiber and linseed flax, plant height is usually a selection target in breeding. Plant height trait has been found to have a significant co-relationship with seed yield traits, such as seed weight. For example, Contreras-Soto et al. found that some SNP markers on Chr19 controlled both the plant height and seed weight traits through a genome-wide association study (GWAS) in soybean [5]. Technical length is another selective breeding target in flax. In flax, Soto-Cerda et al. used 464 simple sequence repeat (SSR) markers to genotype 390 accessions and investigated the phenotypes of nine agronomic traits. Through GWAS analysis they discovered that 12 markers were significantly associated with six traits [6]. Halbauer et al.,(2017) used gSSRs to analyze 27 flax (Linum usitatissimum L.) accessions originating in the Alpine region and found a varying extent of accession-specific gene diversity (expected heterozygosity, HE) was revealed ranging from 0.05 to 0.51 [7]. However, there has been no research on plant height and technical length gene discovery in flax to date. In conjunction with genomic tool development, a linkage map and QTL mapping are useful tools for discovering genes controlling important agronomic traits [8-10]. In flax, only a few linkage maps have been published. The earliest linkage map was constructed with 213 RAPD and RFLP markers in 18 linkage groups, and two Fusarium wilt QTLs were identified [11]. Later, Oh et al. also used RAPD and RFLP markers to construct a linkage map composed of 94 markers [12]. With the development of molecular markers, SSRs have become the major marker type for linkage map construction. Cloutier et al. constructed a linkage map that included 24 linkage groups with 113 EST-SSR markers for a DH population, and identified QTLs for seed color, linolenic acid content and linoleic acid content [13]. Then several maps were constructed based on SSR markers. For example, Cloutier et al. construct a consensus linkage map by combined three individual linkage maps incorporating 770 markers based on 371 shared markers including 114 that were shared by all three populations and 257 shared between any two populations [14]. Most of the QTL mapping analysis focused on fatty acid related traits, such as fatty acid composition, yield [15, 16], seed and flower color [17]. For example, Sudarshan et al.,(2017) identified a seed and flower color QTL “D” by QTL mapping analysis and identified a candidate gene [17].Today, SNP markers are the most efficient and abundant markers for mapping [18] and other applications, such as GWASs [19, 20], diversity analyses [21] and bulked segregate analysis [22]. Most SNP markers come from sequencing data generated by high-throughput sequencing technologies using genomic sequences as references. Whole genome sequences have been determined for a range of species, including soybean, Brassica rapa, cotton, flax and about other 60 plants (https://phytozome.jgi.doe.gov/pz/portal.html). The flax genome includes about 302 Mb of non-redundant sequence representing an estimated 81% genome coverage [23]. However, the sequence data have only been assembled into 88,384 scaffolds containing 43,384 genes, and these are not well anchored to a linkage map showing the relative positions of the genes. Recently, many new technologies have been developed for SNP marker discovery, such as genotyping-by-sequencing (GBS) [24, 25], restriction site-associated DNA (RAD) sequencing [26, 27], SLAF-seq (specific length amplified fragment sequencing) [28] and ddRAD sequencing [29]. In flax, GBS [30] and SLAF-seq [31] technologies have been used for SNP marker identification. Kumar et al.,(2012) constructed eight reduced representation libraries to do GBS and discovered 55,465 SNPs in the genome of flax [27] and then used 329 SNP and 362 SSR markers to construct a linkage map and identified a total of 20 QTLs corresponding to 14 traits. Yi et al.,(2017) used a F2 population to do SLAF-seq and construct a genetic map for flax and finally they discovered 4638 SNPs for genetic map construction. The final genetic map included 4145 SNP markers on 15 linkage groups and with 2632.94 cM in length [31]. Thus, linkage map construction based on SNP markers has become easier and more efficient. Accordingly, candidate gene analysis for important agronomic traits has also become easier. For example, Singh et al. identified a candidate gene for 100-seed weight and root/total plant dry weight ratio under rainfed conditions in chickpea using whole genome resequencing and variant SNP loci collection [32]. Using recent tools for SNP marker identification, the aims of this study were to: 1) develop SNP linkage maps for two related recombinant inbred line (RIL) populations; 2) construct a consensus linkage map based on the two linkage maps; 3) perform QTL mapping for the plant height and technical length traits in the two related populations.

Results

Phenotype variation in the two populations

For the plant height and technical length phenotypes, the common male parent Heiya No.14 of the two populations showed higher values than both of the female parents, Macbeth and P.I.249991 (Fig. 1). The plant height and technical length values of the individuals in the two populations showed normal distributions in all of the environments (Fig. 2). The correlation analysis showed that the two traits in the two populations were positively related, and the correlation coefficient ranged from 0.7–0.85 in different environments. The phenotypic variations of the two related populations were listed in Table 1. An interesting phenomenon was that, for plant height, the phenotypic values were lowest in Jingtai, followed by Lanzhou and Langfang, and highest in Yuanmou (Table 1, Fig. 2). The same tendency was observed for the technical length distribution in three of the environments.
Fig. 1

Plant height phenotypes for the parents in the two populations. a, b Plant height phenotypes for the two parents, Macbeth and Heiya No.14, in the MH population; c, d plant height phenotypes for the two parents, P.I.294441 and Heiya No.14, in the PH population

Fig. 2

Plant height and technical length distributions in different environments in the two populations. a PH distribution in the MH population. Solid arrows represent the female parent Macbeth, and arrows with oblique lines represent the male parent Heiya No.14. b PH distribution in the PH population Solid arrows represent the female parent Macbeth; arrows with oblique lines represent the male parent Heiya No.14. Solid arrows represent the female parent PI.29991, and arrows with oblique lines represent the male parent Heiya No.14

Table 1

Phenotypic variation of MH and PH population in four environments

TraitenvironmentsFemale (cm)Male (cm)Mina (cm)Max b (cm)Meanc (cm)SDdCV (%)eKurto sisfSkewnessg
PH population
 PHLZ38.2081.8048.9075.4062.155.769.270.13−0.14
JT29.6563.6835.5062.6549.985.4610.920.03−0.52
YM65.50110.6055.70105.8078.979.7712.370.06−0.06
LF47.67103.5650.7885.0069.876.519.32−0.24−0.16
 TLLZ27.5070.3031.7081.2056.455.9910.621.640.40
JT20.8648.3816.5041.4028.954.5215.630.901.31
YM51.3288.3038.60128.0083.3010.3912.471.630.30
MH population
 PHLZ51.0080.5051.9084.7068.756.739.780.07−0.44
JT49.4070.7543.3082.0058.535.9910.230.570.89
YM74.10110.6059.4094.7075.197.359.770.27−0.33
LF50.9093.5050.7894.7870.017.8911.26−0.110.02
 TLLZ41.8768.9035.4067.0051.095.8211.390.05−0.36
JT26.3053.5023.1550.7530.654.4314.450.841.98
YM55.1080.0046.4075.9060.046.7011.160.07−0.58

aMax(cm): The maximum value of phenotypic data in the two RIL populations

bMin(cm): The Minimum value of phenotypic data in the two RIL populations

cMean(cm): The average value of phenotypic data in the two RIL populations

dSD: Standard deviation of the phenotypic trait

eCV (%): Coefficient of variation of the phenotypic trait

fSkewness: Skewness of the phenotypic trait

gKurtosis: Kurtosis of the phenotypic trait

Plant height phenotypes for the parents in the two populations. a, b Plant height phenotypes for the two parents, Macbeth and Heiya No.14, in the MH population; c, d plant height phenotypes for the two parents, P.I.294441 and Heiya No.14, in the PH population Plant height and technical length distributions in different environments in the two populations. a PH distribution in the MH population. Solid arrows represent the female parent Macbeth, and arrows with oblique lines represent the male parent Heiya No.14. b PH distribution in the PH population Solid arrows represent the female parent Macbeth; arrows with oblique lines represent the male parent Heiya No.14. Solid arrows represent the female parent PI.29991, and arrows with oblique lines represent the male parent Heiya No.14 Phenotypic variation of MH and PH population in four environments aMax(cm): The maximum value of phenotypic data in the two RIL populations bMin(cm): The Minimum value of phenotypic data in the two RIL populations cMean(cm): The average value of phenotypic data in the two RIL populations dSD: Standard deviation of the phenotypic trait eCV (%): Coefficient of variation of the phenotypic trait fSkewness: Skewness of the phenotypic trait gKurtosis: Kurtosis of the phenotypic trait

ddRADseq statistics for the two RIL populations

The ddRAD seq protocol was used to construct sequence libraries for both the MH and PH RIL populations. The genomic DNA was double digested with the restriction enzymes SacI and MseI, and then fragments in a size range of 141–420 bp were recovered. Libraries from 12 different individuals tagged with 12 barcodes were pooled and sequenced on the Illumina HiSeq2000 platform. The three parents Macbeth, Heiya No.14 and P.I.249991 yielded 1.05(3.5× genome coverage), 2.59 (8.6× genome coverage) and 2.08 (6.9× genome coverage) million PE reads, respectively. For the MH population, the 110 individuals yielded a total of 135.95 million PE reads, ranging from 0.38(1.3× genome coverage) to 4.99 (16.6× genome coverage) million reads in different RILs with an average of 1.21(4.03× average genome coverage) million reads per RIL line (Fig. 3a). For the PH population, the 123 individuals yielded a total of 168.19 million PE reads, ranging from 0.31(1.03× genome coverage) to 5.9 (19.7× genome coverage) million reads in different RILs with an average of 1.37 (4.57× average genome coverage) million reads per RIL line (Fig. 3b).
Fig. 3

The number of sequence reads for the two populations. The characters in X-axis mean the number of lines and the Y-axis represents the reads number for each line. a MH population. The number 1–110 in X-axis represent the RIL number, and number 111 mean the female parent Macbeth, 112 mean the male parent Heiya No.14. b PH population. The number 1–123 in X-axis represent the RIL number, and number 124 mean the female parent P.I.249991, 125 mean the male parent Heiya No.14

The number of sequence reads for the two populations. The characters in X-axis mean the number of lines and the Y-axis represents the reads number for each line. a MH population. The number 1–110 in X-axis represent the RIL number, and number 111 mean the female parent Macbeth, 112 mean the male parent Heiya No.14. b PH population. The number 1–123 in X-axis represent the RIL number, and number 124 mean the female parent P.I.249991, 125 mean the male parent Heiya No.14

SNP genotyping and linkage map construction in the two RIL populations

After getting the raw sequencing data, three major steps were used to identify SNP markers. First, the 5 bp index at the end of 5′ end was removed and the last 5 bp error-enriched nucleotides at the 3′ end were trimmed for each read. Therefore, 80 bp PE reads were kept for the following analysis. Then, subsequent steps including recognizing the tags, constructing tag networks, removing erroneous tags and filtering out networks having only a single tag were done. Finally, 7399 and 5505 SNPs were identified in the PH and MH populations, respectively. The distribution of informative SNP numbers was 24.5 and 18.2 per megabase along the flax chromosome for each population. This mean that the polymorphic variation between Macbeth and Heiya No.14 was lower than the two parents in PH population. To construct the linkage maps, one SNP was selected from each allelic tag pair to represent the locus, which resulted in a set of 4348 and 3231 SNPs for the PH and MH population, respectively. After deleting the distorted SNP markers, finally a total of 2788 SNP markers distributed on 15 linkage groups were mapped for PH population, with a length of 1138 cM. The average marker density was 2.45 markers/cM. The LGs ranged from 15 to 137 cM and contained 36 to 329 markers. For the PH map, a total of 2120 segregating markers were mapped on 15 linkage groups, and the total length was 1272 cM. There were 34 to 267 markers distributed on different LGs. The average marker density was 1.67 markers/cM. All the information for the SNP linkage maps is listed in Table 2. By combining the two individual linkage maps, an integrated linkage map was constructed. In total, the integrated map included 4497 molecular markers and was 1658 cM long. For each LG, the marker number ranged from 154 to 576 (Table 2, Additional file 1, Additional file 2). The biggest linkage group was Lu9 with 451 integrated markers, while the smallest linkage group was 87 cM and contained 154 molecular markers. Then the flanking sequences (Additional file 2) of all of the 4497 SNP markers were used for BLAST with the published flax genome. Totally, 1996 scaffolds were identified, and among them 1493 scaffolds were non-redundant. The other 2501 SNP markers couldn’t be mapped to the reference genome, which mean that genetic differences existed in different flax cultivars.
Table 2

Mapping statistics for the three individual and the consensus genetic maps of flax

Linkage groupMH mapPH mapIntegrated
marker numberLength (cM)average distance(cM)marker numberLength(cM)average distance(cM)marker numberLength(cM)average distance(cM)
Lu1 88460.52175750.43257830.32
Lu2 36150.422041190.582381240.52
Lu3 128630.492101050.503331080.33
Lu4 143670.4737571.55177740.42
Lu5 187900.4895650.682671100.41
Lu6 213720.34148520.35334810.24
Lu7 122590.4934501.47154670.43
Lu8 3291040.322671410.535761670.29
Lu9 2581120.431991280.644511840.41
Lu10 184690.3890540.602611110.43
Lu11 100570.57100970.971921020.53
Lu12 159530.34148650.44298830.28
Lu13 3481370.392141220.575231710.33
Lu14 240910.3840581.452771070.39
Lu15 2531030.41159840.53159840.53
Total278811380.41212012720.60449716580.37
Mapping statistics for the three individual and the consensus genetic maps of flax

QTL mapping of plant height and technical length in the two RIL populations

QTL mapping analysis was done using the WinQTLcart2.5 software. After initial QTL mapping analysis, several plant height and technical length QTLs were identified in different environments in the two populations (Table 3, Fig. 4). For the MH population, 10 QTLs distributed on eight linkage groups were identified, the phenotypic variation ranged from 18 to 26%, and most of the additive effect was negative, which means that most of the alleles from the female parent Macbeth made the plants shorter. For the plant height trait, eight QTLs distributed in seven linkage groups were identified. Seven QTLs on six linkage groups were detected for the technical length trait. Comparing the QTLs of the two traits showed that five QTLs were shared by both traits, such as uq.C5–1 and uq.C6–1. Conversely, three unique QTLs were detected in multiple environments and the other seven QTLs were environment-specific. For the PH population, nine QTLs distributed on eight linkage groups were identified, and the phenotypic variation ranged from 18 to 68% (Table 2, Fig. 4). Six QTLs distributed in six linkage groups were identified for the plant height trait, and three QTLs on three linkage groups were detected for the technical length trait. Comparing the QTLs of the two traits showed that no QTLs controlled both traits. Additionally, only one unique QTL was detected in two environments; the other QTLs were environment-specific. When the QTL mapping results were compared between the two populations, only one QTL located in Lu11 was detected in both populations. After the QTL mapping analysis, the candidate genes of the QTLs were identified based on the sequence information of the SNP markers. In total, 28 potential flax candidate genes were located in the QTL confidence intervals, including 17 candidate genes for the MH population and 11 candidate genes for the PH population (Table 3). Gene annotation results showed that different gene functions were potentially involving in the plant developing in flax, such as LACCASE, MYB regulators. And the actual functional genes controlling the plant height and technical length QTLs would be discovered by further study.
Table 3

QTL information for plant height and technical length in two populations

QTL nameLGPositionCloset markerLODAdditiveR2LOD2_LLOD2_RTraitEnvironmentCandidate geneGene annotation
MH population
uq.C1–1128.51Lu1_6953893.711.5712.7627.530.5TLYuanmou Lus10012562,Lus10016188 Unknown function; COBRA-like protein
uq.C4–1424.41Lu4_3007013.072.2825.724.124.4PHLangfang Lus10041481 Multicopper oxidase
uq.C5–1552.51Lu5_85043.891.9525.3152.253.3PH,TLLangfang, Jingtai Lus10007137 large subunit ribosomal protein L13Ae
uq.C6–1619.81Lu6_6392363.631.79–2.0316.21–22.4717.824PH,TLYuanmou Lus10034082,Lus10017587, Lus10019435 Proline-rich receptor-like protein kinase perk; carboxylesterase 17-related;oxidoreductase
uq.C8–1810.11Lu8_6461843.621.258.169.610.4TLJingtai Lus10011957 nucleosome assembly protein 1-like 1
uq.C8–2881.21Lu8_1850095.09−2.1619.0880.682.1PH,TLJingtai
uq.C8–3893.41Lu8_1194883.14−1.9725.9892.296.5PHLangfang Lus10012249,Lus10012252 fanconi anemia group J protein;high mobility group protein 2-related
uq.C11–11146.91Lu11_5576173.3−1.8822.1946.647.7PHLangfang Lus10038920 monocopper oxidase-like protein sks1-related
uq.C12–11234.31Lu12_6965082.7–4.8−4.5318.71–23.542.836.7PH,TLLangfang, Lanzhou, Yuanmou Lus10036228,Lus10036227,Lus10036229, Lus10036247,Lus10023392 Ribosomal protein L7/L12-related; UDP-N-acetylglucosamine pyrophosphorylase; choline-phosphate cytidylyltransferase; F21O3.9 Protein-related;protein LSD1
uq.C14–11432.61Lu14_2318533.94–7.262.56–3.0820.62–26.943233.1PH, TLLangfang,Lanzhou, Yuanmou Lus10015608,Lus10015614 MYB domain protein 26; insoluble isoenzyme cwinv1-related
PH population
uq.C1–1117.61Lu1_3964283.79−1.8421.1116.217.9PHLangfang Lus10027782,Lus10016188, Lus10027887 Laccase-10-related;COBRA-related protein 6; Wax ester synthase-like Acyl-CoA acyltransferase domain; Wax ester synthase-like Acyl-CoA acyltransferase domain
uq.C2–2296.71Lu2_5970576.791.7710.2994.697.9TLJingtai Lus10014561,Lus10014535, Lus10030692 glycosylphosphatidylinositol transamidase; Formate dehydrogenase
uq.C3–1323.41Lu3_6934234.05−2.4220.5320.924.2PHLanzhou
uq.C7–1712.31Lu7_7813123.043.6773.415.815.7TLYuanmou
uq.C9–1941.41Lu9_5031286.432.6821.8340.946.4PHYuanmou, Langfang Lus10002031,Lus10015623 Unknown function; fatty acid omega-hydroxy dehydrogenase
uq.C9–2956.21Lu9_6181223.431.2210.5351.958.7TLJingtai
uq.C11–11137.41Lu11_4470484.54−2.3222.9136.938PHLangfang Lus10038920 monocopper oxidase-like protein sks1-related
uq.C12–1122.01Lu12_1635963.44−2.2321.8403.6PHLanzhou Lus10031416 Nicotianamine synthase
uq.C13–11354.91Lu13_3671833.343.2168.253.856.7PHYuanmou Lus10026439,Lus10026441 Threonine-specific protein kinase; peptidyl-prolyl cis-trans isomerase SDCCAG10

LOD2_L/R: Confidence interval in 95% significance level; TL: technical length; PH: Plant height

Fig. 4

Graph of QTL mapping results for the two populations. a MH population. b PH population. The innermost circle represents 15 linkage groups, the outer circles represent QTL mapping results for plant height and technical length. Additional data about of the genetic maps can be found in Additional files 1 and 2. The circle with a red bar shows the QTLs for plant height, the circle with a blue bar shows the QTLs for technical length, and the circle with an orange bar shows QTLs detected for both of the traits

QTL information for plant height and technical length in two populations LOD2_L/R: Confidence interval in 95% significance level; TL: technical length; PH: Plant height Graph of QTL mapping results for the two populations. a MH population. b PH population. The innermost circle represents 15 linkage groups, the outer circles represent QTL mapping results for plant height and technical length. Additional data about of the genetic maps can be found in Additional files 1 and 2. The circle with a red bar shows the QTLs for plant height, the circle with a blue bar shows the QTLs for technical length, and the circle with an orange bar shows QTLs detected for both of the traits

Discussion

In the current study, a high-density integrated linkage map was constructed based on individual linkage maps from the MH and PH populations. Differing from previously published linkage maps in flax, the molecular markers used in this study were SNP markers based on the high-throughput sequencing technology-ddRAD sequencing method. Compared with traditional RAD sequencing, the ddRAD method simplifies the experimental process and reduces the cost of reduced representation library construction, and can be used in species without a reference genome [33]. Poland et al. first used this technology to develop thousands of SNP markers and constructed two high-density linkage maps for barley and wheat [34]. Subsequently, ddRAD-derived SNP linkage maps were constructed in different species, such as peanut [35] and strawberry [36]. In flax, the first SNP maps were developed by the GBS technology. Kumar et al., (2012) first used GBS technology to discovering SNPs among 8 flax cultivars [30], meanwhile Cloutier et al., (2012) constructed consensus linkage map by using these SNP markers and SSR markers, and finally there were 770 markers distributing in 15 linkage groups [14]. Until now, the latest linkage map published by Kumer et al.,(2015) was with 329 SNPs and 362SSRs. Compared with the previous linkage maps, the current SNP map generated in this study is the most high-density map to date and will help flax researchers to discover flax genes more easily. Based on the two individual linkage maps, a consensus linkage map was constructed. Mapping of genetic markers using multiple populations provides increased genome coverage because it is unlikely that multiple parents would all be fixed (monomorphic) in the same genomic regions. In the MH linkage map, the Lu2 linkage group contained only 36 molecular markers and was 15 cM, whereas in the PH linkage map, there were 204 molecular markers in the Lu2 linkage group covering 119 cM. Thus, once these two linkage groups were integrated together, the coverage of this group was greatly increased. Additionally, we anchored the published scaffolds onto the consensus map based on the SNP sequencing information. In total, 1493 non-redundant scaffolds were mapped onto the linkage map, which means that these scaffolds were physically mapped onto the linkage map and will help to construct the whole physical map. While there were still 2501 SNP markers couldn’t be anchored with published reference genome sequences. One of the reasons was genetic differences existing between the reference genome and the current flax cultivar genomes. The other possible reason was that the coverages of the reference genome published in 2012 [23] were not enough to dissect the whole flax genome. So the consensus linkage map would be benefit for flax genome assembly. Additionally, the flanking sequencing information could help to discover candidate genes for target QTLs of specific traits. Plant height is an important agronomic trait affecting crop performance, particularly lodging and consequently yield and quality. Technical length is also an important trait for flax breeding. Previous studies have shown that plant height and technical length are positively correlated. In general, flax plants with a greater technical length also have a greater plant height. There are different demands for plant height and technical length in flax because of its different usages. Specifically, fiber-type flax needs to be taller, while linseed-type flax needs to be relatively short. Thus, different alleles that can enhance or reduce plant height and technical length phenotypes are needed according to different breeding purposes for flax. In this study, the phenotype of plant height showed obvious differences among the four environments. Additionally, the same phenotype distribution was found in three locations. The average value of plant height was smallest in Jingtai, followed by Lanzhou and Langfang, and largest in Yuanmou. This was because of the latitude, longitude and temperature differences in these locations. It was easy to understand why the plant height and technical length were highest in Yuanmou. We planted the two populations in October and the growth cycle was 5 months long, which was much longer than in the other three locations. In the other three locations, we planted the populations in April and harvested the seeds in early July, so the life cycle was only 3 months. Among these three locations, plant height was greatest in Langfang, followed by Lanzhou and lastly Jingtai, which means that plant height was positively correlated with longitude and negatively correlated with latitude. Several previous studies have shown that different environmental factors can influence plant height. For example, a photoperiod insensitive allele of the major photoperiod regulator Ppd-1, located on group 2 chromosomes, can have pleiotropic effects on plant height [37]. In flax, Soto-Cerda et al., (2014) used GWAS approach to identify genes relating to 9 agronomic traits including plant height. The results showed that one SSR marker was not only relate with flowering time, also relate with plant height [6]. We then compared the QTLs got from the GWAS analysis and from the current study, it was found that the QTLs were not same in the two studies. The reason was maybe the different genetic background of the research materials. Besides comparing the results from two different studies, we compared the QTL mapping results from the two populations. The results showed that most of the QTLs were unique, meaning that the QTLs were significantly influenced by the genetic background. One unique QTL uq.C11–1 located in Lu11 was detected in both populations, and the candidate gene Lus10038920 was identified. This showed that a common candidate gene contributed to the plant height trait in both of the populations. Thus, we could focus on this QTL and try to use the sequence information from the SNP markers for MAS in flax.

Conclusions

Flax is an important oilseed crop over the world, while until now there were few researches about gene and genomic information. In the current study, two RIL populations were used as materials to generate SNP markers and investigate the genetic control of plant height-related traits. As a result, a consensus high density linkage map was constructed based on the two individual linkage maps. Based on the linkage map and phenotypes, QTL mapping analysis was done for plant height and technical length. Totally there were 19 QTLs were identified. For MH population, eight plant height QTL and 7 technical length QTL were identified, and 5 are common QTLs. For PH population, there were 6 plant height QTL and 3 technical length QTL respectively. After comparing the QTL and candidate gene information of the two populations, two common QTLs and three candidate genes were discovered. This study provides the foundation for assisting in map-based cloning of the QTL and marker assisted selection of plant height related traits in linseed and potentially fibre flax.

Methods

Plant materials

Two populations were used for linkage map construction and QTL mapping. The first was a Macbeth/Heiya No.14 (MH) RIL population. The MH RIL population RILs was developed from. “Macbeth×Heiya No.14” by single-seed descent in seven generations. Macbeth is a Canadian ‘conventional’ oilseed-type cultivar with [38] 55–57% linolenic acid, and Heiya No.14 is a Chinese fiber cultivar [39]. In total, 110 RILs were randomly selected from the original 235 lines and were used for genotyping and phenotyping analysis. The second population was a P.I.249991/Heiya No.14 (PH) population, which consisted of 123 R7-derived RILs. The RILs were generated from a cross between the oilseed-type P.I.249991 with about 53% linolenic acid [40] and the Chinese fiber flax variety Heiya No.14 by single-seed descent. As with the MH population, the plant height phenotypes of the two parents were significantly different (Fig. 1). The two populations were developed in Crop Institute of Gansu Academy of Agricultural Sciences.

Field trial experiments and phenotype measurements

The two populations and their parents were grown in four different locations over two years (Additional file 3). In 2015, the two populations were planted in the fields of Crop Institute of Gansu Academy of Agricultural Sciences, Lanzhou and Jingtai in Gansu Province of China, and Yuanmou in Yunnan Province. Then, the two populations were planted in the fields of Biotechnology institute of Chinese Academy of Agricultural Sciences, Langfang in Hebei Province in 2016. Three replications were included in each of the field trial experiments. Three lines including about 30 plants were included in each plot. Each plot was with 1 m long and 0.6 m wide. Five plants were randomly selected for plant height (PH) and technical length (TL) measurements with unit cm. The plant height was measured from the bottom to the top of the whole plant, and the technical length was referred to the length of the main stem, that mean the length from the bottom until the first branch. PH values were measured in all four environments, and TL values were recorded in three environments. Then the average values, standard deviation, coefficient of variation, skewness and kurtosis of each trait were analyzed by SPSS17 software.

DNA extraction, library construction and sequencing

Genomic DNA was extracted from young leaves of a single plant for each inbred line of each RIL population. The extraction method was as described by Murray and Thompson [41]. The DNA quality was checked with a Nanodrop2000 and by running it on a 1% agarose gel. Then, 200 ng qualified genomic DNA was used for sequence library construction and SNP marker development. The double digested restriction site-associated DNA (ddRAD) sequencing method was used for sequence library construction and SNP marker development. ddRAD libraries for the parents and all RILs were constructed as described previously [29]. First, the 200 ng DNA was digested with two restriction endonucleases SacI and MseI, and then adaptors with unique barcodes were added to the restriction fragments for each individual. The final ligates from 12 individuals were pooled, and then the sizes between 220 to 500 bp were separated on 2% agarose gel. Finally, the purified fragments were amplified and the sizes ranged from 270 to 550 bp were purified with a Qiagen gel purification kit and submitted for sequencing. The sequencing was performed on Hiseq2000 platform with paired-end reads of 90 bp.

SNP discovery and genotyping

The sequencing procedure was performed when the sequence libraries were constructed.. After getting the raw data for each individual, the 5 bp barcode and the 5 bp on the 3′ end of sequences were trimmed and the remaining 80 nucleotides of each PE read were kept for further analysis. The genome-wide SNP discovery was performed using the RFAP tool pipeline, which included assembly of a pseudo-reference sequence, SNP discovery, genotyping and discrimination of allelic SNPs [29]. Next, the flanking sequences were used to perform BLAST searches against the known reference genome (https://phytozome.jgi.doe.gov/pz/portal.html) [23]. Additionally, previously published SNP sequences and SSR primer sequences from linkage maps [13, 42, 43] were collected for BLAST searches against the reference genome [23]. All of the SNP marker information was listed in Additional file 2.

Individual linkage map construction and consensus linkage map integration

When the BLAST results for SNPs collected in this study matched known SNP or SSR locus information, the SNPs could be anchored to specific LGs. After calculating the allelic SNPs, all SNP loci with less than 25% maximum missing data and without distorted segregation by Chi-square test with 1: 1 were used for linkage map construction. The MSTMap software [44] was used to construct a high-density genetic map. All the loci were first partitioned into LGs. The major parameters for loci partitioning were as follows: distance_function, Kosambi; p_value, 0.0000001; no_map_dist, 10; missing_threshold, 0.25; objective_function, COUNT. After the individual linkage maps were constructed, the Mergemap software (http://alumni.cs.ucr.edu/~yonghui/mgmap.html) was used to integrate the two individual maps.

QTL mapping for plant height (PH) and technical length (TL) traits in the two populations

QTL mapping analysis was first done for the two related populations. QTL mapping was performed using the composite interval method (CIM) with the WinQTL cartographer 2.5 software [45]. CIM was used to scan the genetic map and estimate the likelihood of a QTL and its corresponding effect every 2 cM. The forward and backward regression algorithm was used to get cofactors. After initial QTL mapping analysis, a permutation test was done to get the LOD threshold value for each QTL. After QTLs were extracted for each of the population, the QTLs were combined in the consensus linkage map. Then, the flanking sequences of all the SNP markers in the QTL confidence interval were used to blast with the flax genome sequence(https://phytozome.jgi.doe.gov/pz/portal.html), and then the genes located in the confidence interval regions were considered as the potential candidate genes of QTLs. The consensus linkage map from the two populations. The linkage map consisted of 15LGs. The markers in the LG with asterisk were the common markers which are used for marker integration. (PDF 2076 kb) SNP marker information for consensus linkage map from PH and MH populations (XLSX 712 kb) Detailed environmental information of the 4 locations (XLSX 9 kb)
  31 in total

1.  QTL for fatty acid composition and yield in linseed (Linum usitatissimum L.).

Authors:  Santosh Kumar; Frank M You; Scott Duguid; Helen Booker; Gordon Rowland; Sylvie Cloutier
Journal:  Theor Appl Genet       Date:  2015-03-08       Impact factor: 5.699

2.  Rapid isolation of high molecular weight plant DNA.

Authors:  M G Murray; W F Thompson
Journal:  Nucleic Acids Res       Date:  1980-10-10       Impact factor: 16.971

3.  Integrated consensus genetic and physical maps of flax (Linum usitatissimum L.).

Authors:  Sylvie Cloutier; Raja Ragupathy; Evelyn Miranda; Natasa Radovanovic; Elsa Reimer; Andrzej Walichnowski; Kerry Ward; Gordon Rowland; Scott Duguid; Mitali Banik
Journal:  Theor Appl Genet       Date:  2012-08-14       Impact factor: 5.699

4.  Construction of an SNP-based high-density linkage map for flax (Linum usitatissimum L.) using specific length amplified fragment sequencing (SLAF-seq) technology.

Authors:  Liuxi Yi; Fengyun Gao; Bateer Siqin; Yu Zhou; Qiang Li; Xiaoqing Zhao; Xiaoyun Jia; Hui Zhang
Journal:  PLoS One       Date:  2017-12-21       Impact factor: 3.240

5.  SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing.

Authors:  Xiaowen Sun; Dongyuan Liu; Xiaofeng Zhang; Wenbin Li; Hui Liu; Weiguo Hong; Chuanbei Jiang; Ning Guan; Chouxian Ma; Huaping Zeng; Chunhua Xu; Jun Song; Long Huang; Chunmei Wang; Junjie Shi; Rui Wang; Xianhu Zheng; Cuiyun Lu; Xiaowu Wang; Hongkun Zheng
Journal:  PLoS One       Date:  2013-03-19       Impact factor: 3.240

6.  Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries.

Authors:  Santosh Kumar; Frank M You; Sylvie Cloutier
Journal:  BMC Genomics       Date:  2012-12-06       Impact factor: 3.969

7.  Distribution of photoperiod-insensitive allele Ppd-A1a and its effect on heading time in Japanese wheat cultivars.

Authors:  Masako Seki; Makiko Chono; Tsutomu Nishimura; Mikako Sato; Yasuhiro Yoshimura; Hitoshi Matsunaka; Masaya Fujita; Shunsuke Oda; Katashi Kubo; Chikako Kiribuchi-Otobe; Hisayo Kojima; Hidetaka Nishida; Kenji Kato
Journal:  Breed Sci       Date:  2013-09-01       Impact factor: 2.086

8.  Rapid SNP discovery and genetic mapping using sequenced RAD markers.

Authors:  Nathan A Baird; Paul D Etter; Tressa S Atwood; Mark C Currey; Anthony L Shiver; Zachary A Lewis; Eric U Selker; William A Cresko; Eric A Johnson
Journal:  PLoS One       Date:  2008-10-13       Impact factor: 3.240

9.  Complexity reduction of polymorphic sequences (CRoPS): a novel approach for large-scale polymorphism discovery in complex genomes.

Authors:  Nathalie J van Orsouw; René C J Hogers; Antoine Janssen; Feyruz Yalcin; Sandor Snoeijers; Esther Verstege; Harrie Schneiders; Hein van der Poel; Jan van Oeveren; Harold Verstegen; Michiel J T van Eijk
Journal:  PLoS One       Date:  2007-11-14       Impact factor: 3.240

10.  Association mapping of seed quality traits using the Canadian flax (Linum usitatissimum L.) core collection.

Authors:  Braulio J Soto-Cerda; Scott Duguid; Helen Booker; Gordon Rowland; Axel Diederichsen; Sylvie Cloutier
Journal:  Theor Appl Genet       Date:  2014-01-26       Impact factor: 5.699

View more
  5 in total

1.  Identification of QTNs Associated With Flowering Time, Maturity, and Plant Height Traits in Linum usitatissimum L. Using Genome-Wide Association Study.

Authors:  Ankit Saroha; Deepa Pal; Sunil S Gomashe; Vikender Kaur; Shraddha Ujjainwal; S Rajkumar; J Aravind; J Radhamani; Rajesh Kumar; Dinesh Chand; Abhishek Sengupta; Dhammaprakash Pandhari Wankhede
Journal:  Front Genet       Date:  2022-06-14       Impact factor: 4.772

2.  Mapping QTLs for Alternaria blight in Linseed (Linum usitatissimum L.).

Authors:  Neha Singh; Rajendra Kumar; Sujit Kumar; P K Singh; Hemant Kumar Yadav
Journal:  3 Biotech       Date:  2021-01-23       Impact factor: 2.406

3.  Combining QTL Analysis and Genomic Predictions for Four Durum Wheat Populations Under Drought Conditions.

Authors:  Meryem Zaïm; Hafssa Kabbaj; Zakaria Kehel; Gregor Gorjanc; Abdelkarim Filali-Maltouf; Bouchra Belkadi; Miloudi M Nachit; Filippo M Bassi
Journal:  Front Genet       Date:  2020-05-06       Impact factor: 4.599

4.  Resequencing 200 Flax Cultivated Accessions Identifies Candidate Genes Related to Seed Size and Weight and Reveals Signatures of Artificial Selection.

Authors:  Dongliang Guo; Haixia Jiang; Wenliang Yan; Liangjie Yang; Jiali Ye; Yue Wang; Qingcheng Yan; Jiaxun Chen; Yanfang Gao; Lepeng Duan; Huiqing Liu; Liqiong Xie
Journal:  Front Plant Sci       Date:  2020-01-16       Impact factor: 5.753

5.  Genes Associated with the Flax Plant Type (Oil or Fiber) Identified Based on Genome and Transcriptome Sequencing Data.

Authors:  Liubov V Povkhova; Nataliya V Melnikova; Tatiana A Rozhmina; Roman O Novakovskiy; Elena N Pushkova; Ekaterina M Dvorianinova; Alexander A Zhuchenko; Anastasia M Kamionskaya; George S Krasnov; Alexey A Dmitriev
Journal:  Plants (Basel)       Date:  2021-11-28
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.