Literature DB >> 32985526

Meta-analysis of QTL reveals the genetic control of yield-related traits and seed protein content in pea.

Anthony Klein1, Hervé Houtin2, Céline Rond-Coissieux2, Myriam Naudet-Huart2, Michael Touratier2, Pascal Marget2,3, Judith Burstin2.   

Abstract

Pea is one of the most important grain legume crops in temperate regions worldwide. Improving pea yield is a critical breeding target. Nine inter-connected pea recombinant inbred line populations were evaluated in nine environments at INRAE Dijon, France and genotyped using the GenoPea 13.2 K SNP array. Each population has been evaluated in two to four environments. A multi-population Quantitative Trait Loci (QTL) analysis for seed weight per plant (SW), seed number per plant (SN), thousand seed weight (TSW) and seed protein content (SPC) was done. QTL were then projected on the multi-population consensus map and a meta-analysis of QTL was performed. This analysis identified 17 QTL for SW, 16 QTL for SN, 35 QTL for TSW and 21 QTL for SPC, shedding light on trait relationships. These QTL were resolved into 27 metaQTL. Some of them showed small confidence intervals of less than 2 cM encompassing less than one hundred underlying candidate genes. The precision of metaQTL and the potential candidate genes reported in this study enable their use for marker-assisted selection and provide a foundation towards map-based identification of causal polymorphisms.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32985526      PMCID: PMC7522997          DOI: 10.1038/s41598-020-72548-9

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Grain legumes play a central role in sustainable agriculture and food security. They produce protein-rich seeds and thanks to their special nitrogen nutrition largely based on a symbiosis with nitrogen-fixing soil bacteria, they allow for the reduction of nitrogen fertilizer use in cropping systems thus lowering agriculture energy costs and greenhouse gas production[1,2]. Pea (Pisum sativum L.) is one of the most cultivated grain legumes in temperate areas[3]. It has a high nutritional value and is used for human food and animal feed. With a demand for plant proteins rising worldwide, increasing pea seed yield and protein content are important breeding targets. Seed yield and seed protein content are complex traits and highly quantitative. Numerous loci controlling seed yield and seed quality have been identified in pea[4-14]. The r and rb loci encoding Starch-Branching Enzyme 1[15] and ADP glucose-pyrophosphorylase[16], respectively, and which control the wrinkled seed phenotype in pea have long been known to impact seed development, yield and seed protein content[17,18]. Burstin et al.[7] suggested that seed yield and protein content QTL corresponded either to (i) genes controlling the plant source capacity to produce and fill seeds, or (ii) genes controlling seed sink strength such as genes involved in seed formation and storage products’ synthesis. Genes controlling plant architecture such as Le which determines inter-node length or Afila (Af) which determines leaf type would more likely correspond to source capacity loci while rugosus genes or a gene encoding a subtilase shown to be associated with a seed size QTL[19] would correspond to sink strength loci. Bourgeois et al.[8] have further shown that most protein quality QTL were co-localized with genes encoding major storage proteins. Identifying causal polymorphisms for these traits has been challenging in pea. Most QTL-based research studies have considered a few bi-parental populations. As a result, the QTL detection was limited to phenotypic variability of both parents and the QTL confidence intervals were generally large, probably due to the limited population size or low-density linkage maps. To narrow QTL confidence intervals and provide a foundation to marker-assisted breeding for yield components and seed protein content in pea, we investigated these traits in nine inter-connected recombinant inbred line populations (RIL) derived from parents showing contrasted phenotypes[20] providing a wide phenotypic variability. We used a high-density single nucleotide polymorphism (SNP) based genotyping platform, namely GenoPea 13.2 K SNP array[20], to ensure high-quality dense genetic maps. We performed QTL mapping taking advantage of the multi-cross design[21] to define QTL locations and genotypic effects. We then integrated all QTL results through a meta-analysis approach[22]. The wide phenotypic variability and the high-density linkage map allowed the identification of metaQTL for seed weight per plant (SW), seed number per plant (SN), thousand seed weight (TSW) and seed protein content (SPC). This refined confidence intervals and pinpointed some candidate genes thanks to the recently published pea genome sequence[23].

Results

Phenotypic variation for SW, SN, TSW and SPC in multiple populations

SW, SN, TSW and SPC measured in the field and the glasshouse for 1213 RIL from nine bi-parental populations revealed highly-significant line, year, and population effects (Supplementary Table S1). The traits exhibited continuous distributions in the nine populations indicating a polygenic inheritance (Supplementary Fig. S1). Transgressive lines were observed for all traits but TSW in Pop11, indicating that favourable alleles are brought by both parents in most cases. Highly-significant genotype effects (P < 0.0001) and high heritabilities were detected for the four traits in almost all populations and environments, except for Pop3, Pop4 and Pop5 in the field experiment in 2004 due to the impact of diseases on plants and Pop11 in the glasshouse in 2008 related to the F3 state of the population (Supplementary Table S1). The range of the data of the multi-population lines phenotyped in field environment was extensive: in spring sowing, SW varied from 1.7 to 47.9 g per plant; SN from 9.9 to 300.0, TSW from 78.6 to 317.3 g per plant, and seed protein content from 18.3 to 32.2% of seed dry weight. In winter sowing (i.e. Pop9 in 2008, 2009 and 2010), SW varied from 5.1 to 119.1 g per plant; SN from 23.8 to 586.4, TSW from 101.4 to 258.7 g per plant, and seed protein content from 19.3 to 30.3% of seed dry weight. The environment showed a highly significant effect on all traits in all populations, except SPC in Pop9. Genotype-by-environment (GxE) interactions were significant for Pop3, Pop5, and Pop9 (Supplementary Table S2) but not for Pop4. These effects were partly associated with different responses to environmental conditions such as diseases impacting plants in 2004 or the sowing period for Pop9 (Supplementary Table S2 and Table S1). However, a major determinant of these interactions was that the population composition was changed in Pop3, Pop5, and Pop9 in 2011 as compared to other years since tall plants carrying the Le allele were discarded for this trial (see “Material and methods”; Supplementary Fig. S3). In the glasshouse, SN and SW were significantly lower than in the field, especially for Pop10, unlike TSW and SPC that were stable across environments (Supplementary Table S1). Several parents of RIL populations were chosen for their high seed protein content i.e. Caméor, VavD265 and China. Progenies of the crosses involving two of these parents (i.e. Pop3, Pop9) showed high range of SPC up to > 30%. Transgressive segregants were notable for SPC, even in progenies where parents were not chosen for their high SPC such as in the Pop8 ‘Kazar’ x ‘Melrose’ where some of the RILs show SPC values up to 29% (Supplementary Table S1 and supplementary Fig. S1).

Correlations among yield components and seed protein content

A principal component analysis of mean values for each line in each environment highlighted the wide phenotypic variability among the pea populations (Fig. 1). Axis 1 explained 48.3% of the phenotypic variance based mainly on SN and SW, and in a lesser extent, on TSW and SPC. Axis 2 explained 28% of the phenotypic variance and represented mainly TSW and SPC variability (Fig. 1a), and SW in a lesser extent. Axis 3 explained 21.7% of the phenotypic variance and opposed SPC and TSW (Fig. 1b). Accordingly, the overall correlation coefficients between SW, SN and TSW revealed that SW was highly correlated with SN (Pearson r = 0.89). Besides, TSW was significantly negatively correlated with SN (r = − 0.35) (Fig. 2). However, correlations between SW, SN and TSW slightly differed among populations and environments: SN was in all cases highly positively correlated with SW; TSW was positively correlated with SW in most cases except for Pop9 for which the correlation was negative in 2009 in autumn sowing, not significant in 2008 and 2010, and positive in 2011 in spring sowing, like the other populations. In Pop9, seed weight and seed number were lower in spring sowing than in autumn sowings (Supplementary Table S1). In most cases, SN and TSW were negatively correlated except in Pop8 in 2011 (Supplementary Fig. S2). The correlation between SPC and SW was slightly negative overall (Pearson r = − 0.11) (Fig. 2) but inconsistent across populations and environments. It ranged from significantly negative for Pop9 in 2008 (r = − 0.47) to significantly positive for Pop3 in 2011 and Pop10 in 2008 (r = 0.33). Likewise, SPC was significantly positively correlated with TSW (r = 0.16) except for Pop6, Pop8, and Pop9 in 2011 (Supplementary Fig. S2). TSW were lower in these populations (i.e. Pop6, Pop8 and Pop9) in 2011. In general, SPC was significantly negatively correlated with SN (r = − 0.16), except for Pop9 in 2011 and Pop10 in 2009.
Figure 1

(a, b) Principal component analysis of phenotypic traits of Pop3 to Pop10 observed on the field trials between 2004 and 2011 at INRAE Dijon (a axis 1&2, b axis 3&4). SN seed number per plant, SW seed weight per plant (g), TSW thousand seed weight (g) and SPC seed protein content (% of seed dry weight).

Figure 2

Pearson correlation coefficients between phenotypic traits recorded from 2004 to 2011 in Pop3 to Pop10 on the field trials at INRAE Dijon. SN seed number per plant, SW seed weight per plant (g), TSW thousand seed weight (g) and SPC seed protein content (% seed dry weight). *, ** and *** significant correlation at the P < 0.05, P < 0.01 and P < 0.001 probability level, respectively.

(a, b) Principal component analysis of phenotypic traits of Pop3 to Pop10 observed on the field trials between 2004 and 2011 at INRAE Dijon (a axis 1&2, b axis 3&4). SN seed number per plant, SW seed weight per plant (g), TSW thousand seed weight (g) and SPC seed protein content (% of seed dry weight). Pearson correlation coefficients between phenotypic traits recorded from 2004 to 2011 in Pop3 to Pop10 on the field trials at INRAE Dijon. SN seed number per plant, SW seed weight per plant (g), TSW thousand seed weight (g) and SPC seed protein content (% seed dry weight). *, ** and *** significant correlation at the P < 0.05, P < 0.01 and P < 0.001 probability level, respectively.

Multi-population QTL and metaQTL identification

A total of eighty-nine multi-population QTL were identified: 17 QTL for SW explaining from 4 to 30% of SW variance, 16 QTL for SN explaining from 4 to 39% of SN variance, 35 QTL for TSW explaining from 5 to 32% of TSW variance and 21 QTL of SPC explaining from 4 to 22% of SPC variance (Supplementary Table S3). For all traits, alleles having positive effects had different parent origin, no parent bringing all positive effect for a given trait. The meta-analysis of these QTL defined a number of metaQTL regions encompassing from one to fifteen multi-population QTL and reducing their confidence intervals: 27 metaQTL were identified for yield-related traits and seed protein content (Table 1, Fig. 3 and supplementary Table S4). 4 out of 7 metaQTL detected for SW were consistently detected in two to five environments (mQTL1.5, mQTL2.1, mQTL3.1, mQTL3.4), 10 out of 16 for TSW in two to five environments (mQTL1.1, mQTL1.3, mQTL1.4, mQTL2.2, mQTL3.1, mQTL3.4, mQTL4.4, mQTL5.1, mQTL5.3, mQTL6.3, mQTL7.2), 6 out of 12 for SPC in two to four environments (mQTL1.1, mQTL3.2, mQTL3.4, mQTL4.2, mQTL4.3, mQTL7.1) and only 2 out of 10 for SN in three to five environments (mQTL3.1, mQTL3.4). Two metaQTL (mQTL3.1 and mQTL3.4) controlling all the traits were consistently detected across environments. mQTL3.1 mapped at 24.73 cM on LG3-Chr5 with a confidence interval (CI) of 1.69 cM controlled the phenotypic variation of SN, SW, TSW and SPC in five environments. On the same chromosome, mQTL3.4 at 131.8 cM with a CI of 0.03 cM controlled SN, SW, TSW and SPC in five environments. MetaQTL mQTL1.4, mQTL1.5, mQTL2.1, mQTL2.3 and mQTL7.2 controlled yield and its components. Six metaQTL controlled a specific yield trait and the seed protein content: mQTL4.2, mQTL4.4, mQTL5.3, mQTL6.2 controlled TSW and SPC, mQTL3.2 controlled SW and SPC and mQTL7.1 controlled SN and SPC. Four metaQTL specific of TSW were detected in two to three environments: mQTL2.2 on the LG2-Chr6 at 52.70 cM (CI = 9.45 cM) in three environments, mQTL1.3 (LG1-Chr2 at 25.54 cM, CI = 5.66 cM), mQTL5.1 (LG5-Chr3 at 18.84 cM, CI = 10.29 cM) and mQTL6.3 (LG6-Chr1 at 86.17 cM, CI = 12.84 cM) in two environments. Conversely, mQTL4.3 (LG4-Chr4 at 50.63 cM, CI = 7.64 cM) and mQTL4.5 (LG4-Chr4 at 105.40 cM, CI = 10.50 cM) controlled exclusively SPC in two and one environments respectively.
Table 1

MetaQTL parameters detected for seed weight (SW), seed number (SN), thousand seed weight (TSW) and seed protein content (SPC) from Pop3 to Pop11 between 2004 and 2011 in the fields environments at INRAE Dijon. For each metaQTL (mQTL), position and confidence interval on the genetic map (Tayeh et al.)[20] and on the physical map (Kreplak et al.)[23] are indicated. The number, the potential candidates genes and the annotation are listed.

Linkage groupChromosomemetaQTL namePredicted metaQTL position (cM)Meta QTL 95% genetic confidence interval (cM)Flanking markers (Tayeh et al.—Table S10)[20]Position start–gene at start (Kreplak et al.)[23]Position end–gene at end(Kreplak et al.)[23]Peak position–peak gene(Kreplak et al.)[23]Annotation (Kreplak et al.)[23]
LG1Chr2mQTL1.12.662.19PsCam016921_10518_200 ; PsCam042632_26683_1732454,467Psat2g0006405,774,049Psat2g0066802,980,750Psat2g004160Protein kinase domain
LG1Chr2mQTL1.215.244.92PsCam044143_28060_1723 ; PsCam000692_604_68414,937,233Psat2g01608031,547,380Psat2g02628023,203,491Psat2g021920Unknown gene
LG1Chr2mQTL1.325.545.66PsCam001049_894_1330 ; PsCam040389_25143_34432,549,163Psat2g02716080,822,825Psat2g04644056,661,835Psat2g037640LIM domain
LG1Chr2mQTL1.474.036.08PsCam036267_21415_2674 ; PsCam046088_29624_710372,862,478Psat2g145000401,022,163Psat2g166320387,065,449Psat2g156040Pre-rRNA-processing protein TSR2
LG1Chr2mQTL1.585.141.47PsCam049764_32384_3938 ; PsCam047748_30645_563409,802,804Psat2g171720413,236,825Psat2g174040411,495,507Psat2g172600Amino-transferase class IV
LG2Chr6mQTL2.142.305.06PsCam045557_29214_1582 ; PsCam037747_22816_125271,913,567Psat6g064200157,742,624Psat6g097360114,863,224Psat6g082080Unknown gene
LG2Chr6mQTL2.252.709.45PsCam034889_20236_728 ; PsCam050914_33467_78688,985,653Psat6g070840290,738,342Psat6g148120189,681,633Psat6g111440Unknown gene
LG2Chr6mQTL2.388.974.98PsCam050254_32847_1148 ; PsCam053661_35501_2019369,699,084Psat6g183960404,311,257Psat6g203560388,681,088Psat6g193240Plant organelle RNA recognition domain
LG3Chr5mQTL3.124.731.69PsCam001506_1254_232 ; PsCam039566_24482_37856,764,094Psat5g03008066,228,363Psat5g03584061,462,435Psat5g032160MIR domain
LG3Chr5mQTL3.264.326.60PsCam036453_21593_1599 ; PsCam038194_23240_124191,691,654Psat5g107640239,064,211Psat5g133080215,141,118Psat5g121200Unknown gene
LG3Chr5mQTL3.3102.707.70PsCam006936_5161_789 ; PsCam038944_23937_299422,221,988Psat5g207040511,578,964Psat5g257360467,013,019Psat5g233240Unknown gene
LG3Chr5mQTL3.4131.800.03PsCam034779_20141_1638 ; PsCam053662_35502_3991566,380,665Psat5g299280568,296,029Psat5g299920567,365,719Psat5g2997202OG-Fe(II) oxygenase superfamily
LG4Chr4mQTL4.16.826.30PsCam056984_37672_88 ; PsCam008438_5952_4311,743,557Psat4g00196020,433,499Psat4g01452011,038,786Psat4g009760Intracellular non-membrane-bounded organelle
LG4Chr4mQTL4.222.394.80PsCam005173_3921_366 ; PsCam038342_23380_12230,049,059Psat4g02116054,024,236Psat4g03696042,061,945Psat4g028480Xanthine/uracil/vitamin C permease
LG4Chr4mQTL4.350.637.64PsCam043089_27118_711 ; PsCam059345_39539_779117,446,584Psat4g071040178,101,781Psat4g094840147,602,594Psat4g0846002OG-Fe(II) oxygenase superfamily
LG4Chr4mQTL4.480.948.36PsCam044314_28211_1843 ; PsCam035946_21100_283229,359,999Psat4g123400315,404,846Psat4g161720271,800,566Psat4g139360SRP54-type protein + GTPase domain
LG4Chr4mQTL4.5105.4010.50PsCam001458_1216_3409 ; PsCam046293_29776_939320,544,838Psat4g164800417,776,284Psat4g205800369,227,093Psat4g181120BURP domain
LG5Chr3mQTL5.118.8410.29PsCam037375_22461_2049 ; PsCam044994_28731_424713,496,546Psat3g00468067,064,585Psat3g03192040,364,479Psat3g017840Unknown gene
LG5Chr3mQTL5.255.909.59PsCam034798_20158_236 ; PsCam012541_8505_720144,455,569Psat3g068320208,140,583Psat3g105520176,232,116Psat3g085120MatE
LG5Chr3mQTL5.3112.910.37PsCam009698_6488_1702 ; PsCam004880_3686_1478434,277,272Psat3g205320436,339,177Psat3g207360435,352,873Psat3g206400Alpha/beta hydrolase fold
LG6Chr1mQTL6.113.9010.41PsCam042987_27022_1291 ; PsCam023473_13286_4283,521,432Psat1g00248063,895,910Psat1g04116033,786,851Psat1g024160Ammonium Transporter Family
LG6Chr1mQTL6.247.957.36PsCam035272_20467_4972 ; PsCam043936_27889_1794118,331,306Psat1g074560240,558,639Psat1g122920179,602,535Psat1g101120AP2 domain
LG6Chr1mQTL6.386.1712.84PsCam050644_33215_1066 ; PsCam037643_22718_391311,313,139Psat1g162520367,259,508Psat1g218360339,253,643Psat1g187800Glutaredoxin
LG7Chr7mQTL7.120.512.87PsCam020840_11620_1382 ; PsCam044103_28028_51951,210,245Psat7g03160069,196,947Psat7g04004060,542,282Psat7g036360SBP domain
LG7Chr7mQTL7.258.520.96PsCam051665_34135_1538 ; PsCam000487_426_1043202,181,919Psat7g122680211,451,196Psat7g127960206,761,590Psat7g125120PLAC8 family
LG7Chr7mQTL7.374.668.38PsCam056507_37317_287 ; PsCam027452_16087_1460319,631,535Psat7g166760371,178,765Psat7g195480345,409,046Psat7g184840Probable lipid transfer
LG7Chr7mQTL7.488.3810.93PsCam050926_33478_1100 ; PsCam042627_26678_74359,853,588Psat7g190640459,632,407Psat7g229640409,725,211Psat7g206160PPR repeat
Figure 3

Mapping of 27 metaQTL detected for seed weight (SW), seed number (SN), thousand seed weight (TSW) and seed protein content (SPC). Position (cM) and the maximum of phenotypic variance explained for each metaQTL (mQTL) are indicated to the right of the linkage groupe (LG). Confidances intervals are represented by the color on the linkage group. The lines on the left on the linkage group indicate the each QTL position on the consensus map. Color of the line indicates membership to the metaQTL.

MetaQTL parameters detected for seed weight (SW), seed number (SN), thousand seed weight (TSW) and seed protein content (SPC) from Pop3 to Pop11 between 2004 and 2011 in the fields environments at INRAE Dijon. For each metaQTL (mQTL), position and confidence interval on the genetic map (Tayeh et al.)[20] and on the physical map (Kreplak et al.)[23] are indicated. The number, the potential candidates genes and the annotation are listed. Mapping of 27 metaQTL detected for seed weight (SW), seed number (SN), thousand seed weight (TSW) and seed protein content (SPC). Position (cM) and the maximum of phenotypic variance explained for each metaQTL (mQTL) are indicated to the right of the linkage groupe (LG). Confidances intervals are represented by the color on the linkage group. The lines on the left on the linkage group indicate the each QTL position on the consensus map. Color of the line indicates membership to the metaQTL. MetaQTL were located on all linkage groups (Fig. 3). Taking advantage of the newly published genome sequence of pea[23] we could identify the position and the genes underlying metaQTL and highlight potential candidate genes. For example, 17 genes were identified in the confidence interval of mQTL3.4 (CI = 0.03 cM), 51 genes in the confidence interval of mQTL5.3 (CI = 0.37 cM) and 56 genes in the confidence interval of mQTL1.5 (CI = 1.47 cM) (Table 1). However, even for the shortest confidence intervals, the number of underlying genes is still high: 126 genes were identified in the confidence interval of mQTL7.2 (CI = 0.96 cM), 135 genes for mQTL3.1 (CI = 1.69 cM), 141 genes are underlying for mQTL1.1 (CI = 2.19 cM) and 199 genes for mQTL7.1 (CI = 2.87 cM).

Discussion

Diversity and correlation of seed traits in pea

Our study shows a large range of genetic variation for seed productivity and quality traits carried by cultivated pea varieties and their progenies, suggesting avenues for breeding (Supplementary Table S1). Our results confirm that the seed weight per plant is highly positively correlated with the seed number per plant, and generally positively correlated with the thousand seed weight (Fig. 2, Supplementary Fig. S2). The relationship between seed weight per plant and seed protein content is varied: for example, the correlation was slightly positive within Pop4 and was negative within Pop9. Our study reveals that in pea, by contrast to cereals[24,25], the relationship between seed yield per plant and seed protein content is not necessarily negative but depends on the environment and the genetic background.

Genetic control of traits

QTL detection is impacted by many factors, including the genetic background, the population size, the quality and density of the genetic map and the statistical method used for detection. Meta-analysis of QTL initially developed by Goffinet and Gerber[26] and implemented by Veyrieras et al.[22] and Sosnowski et al.[27] is an efficient approach to identify consensus QTL positions among experiments and reduce their confidence intervals. The method has been used in a wide range of species[28-35] and was applied in the present study for a large set of phenotypic data obtained from nine RIL populations and seven environments (Table 2). A total of 89 QTL explaining a part of phenotypic variation were detected across the seven pea chromosomes (Table 1).
Table 2

RIL populations for QTL meta-analysis. For each population, the population size (number of individuals (ind.)), the generation, the number of lines genotyped and phenotyped in each environment at INRAE Dijon are indicated.

PopulationCrossPopulation size and generationNumber of genotyped linesNumber of mapped SNP markers on the framework map (Tayeh et al.)[20]Number of phenotyped linesMain QTLReferences
2004 in field2006 in field2008 in field2008 in glasshouse2009 in field2009 in glasshouse2010 in field2011 in field2011 in glasshouse
Pop3VavD265 x Cameor210 ind.—F6:8176117617515284*Seed quality and productivity, other agronomic traitsBourgeois et al.[8], Tayeh et al.[20], Bordat et al.[47]
Pop4Ballet x Cameor210 ind.—F6:815948415913473Seed quality and productivity, other agronomic traits, root developmentBourgeois et al. [8], Tayeh et al.[20], Bordat et al.[47], Bourion et al.[50]
Pop5VavD265 x Ballet210 ind.—F6:8/F6:916893715571*Seed quality and productivity, other agronomic traitsBourgeois et al.[8], Tayeh et al.[20], Bordat et al.[47]
Pop6Cameor x Melrose283 ind.—F8120856182Seed quality, frost tolerance, M. pinodes resistanceTayeh et al.[20]
Pop7Kazar x Cameor280 ind.—F88441190Seed quality, frost tolerance, M. pinodes resistanceTayeh et al.[20]
Pop8Kazar x Melrose220 ind.—F8:10118496142Seed quality, frost tolerance, M. pinodes resistanceTayeh et al.[20]
Pop9China x Cameor129 ind.—F6:81249131149311461*Seed quality and productivity, frost toleranceKlein et al.[10], Tayeh et al.[20], Deulvot et al.[48], Duarte et al.[49]
Pop10Cameor x Sommette146 ind.—F614465314314285Nitrogen nutritionTayeh et al.[20]
Pop11Cameor x Cerise120 ind.—F3120231115112Seed size, seed qualityTayeh et al.[20]
TOTAL12131869 markers on the consensus map—794.9 cM48928611425893142114788112

*Only short lines carrying the le allele were selected from the population to be tested in this trial.

RIL populations for QTL meta-analysis. For each population, the population size (number of individuals (ind.)), the generation, the number of lines genotyped and phenotyped in each environment at INRAE Dijon are indicated. *Only short lines carrying the le allele were selected from the population to be tested in this trial. The meta-analysis of these QTL revealed 27 consensus QTL, or metaQTL; each metaQTL corresponding to one to 15 initial QTL (Table 1). Most metaQTL were consistently detected in different environments, in spite of significant environmental and GxE effects. Fifteen metaQTL controlled more than one trait. When different traits were controlled by the same metaQTL, allelic effects were of the same sign for SN and SW QTL (5 metaQTL out of 5) and of opposite sign for SN or SW and TSW (5 metaQTL out of 6), similarly to the correlation study of these traits. Allelic effects of SPC QTL were of the same sign with SN and SW QTL in one case out of 5. A survey of the literature allowed, when common markers where used or when markers’ genomic positions were known, to identify likely correspondence between the metaQTL from this study and QTL previously detected for seed yield, seed yield components and seed protein content in different pea populations and in different environments (supplementary Table S5). For example, mQTL1.1 was shown to control SPC and TSW in several environments in the present study, and was also detected by Moreau et al.[12] as a locus controlling SW and by Gali et al.[13] as a region controlling SPC; mQTL2.1 controlled SW in 3 environments in the present study as well as in Tar’an et al.[5]; mQTL3.1 controlled TSW in 5 environments in the present study as well as in Klein et al.[8], Burstin et al.[7] and Gali et al.[13]; mQTL3.2. controlled SPC in three environments in the present study as well as in Gali et al.[13]; mQTL3.4 controlled SW in 5 environments in the present study and also in Klein et al.[8], Burstin et al.[7] and Gali et al.[13]; Some metaQTL only controlled one trait in one environment in the present study but were also detected in other studies, such as mQTL5.2, a QTL of SPC also in Burstin et al.[7] and Gali et al.[13] (Table S4). Because the genetics of seed yield and seed protein content have been widely studied in soybean, we also searched for any related QTL to the pea metaQTL. Orthologous genes for the ones harbouring the peak markers of the metaQTL were identified and QTL within 50 kb around these genes were searched. This further reinforced the interest of the SW QTL mQTL2.1 and mQTL4.1, of the TSW QTL mQTL6.2 and mQTL2.2 and the SPC QTL mQTL4.3 and mQTL4.5 which corresponded to QTL of the same traits in soybean[34,36-40] (Supplementary Table S5).

Candidate genes

The meta-analysis method allowed to locate with more confidence QTL regions associated with seed productivity and quality traits. In some cases, the number of the genes underlying the metaQTL confidence intervals was narrow. The confidence intervals of metaQTL mQTL3.4, mQTL5.3, mQTL1.5, mQTL7.2, mQTL3.1 and mQTL1.1 include 17, 51, 56, 126, 135, 141 genes, respectively. mQTL3.4 and mQTL1.5 encompass, respectively, the Le and Afila regions previously described as controlling a number of traits in pea[7]. Le is Psat5g299720 and encodes a 2OG-Fe(II) oxygenase involved in Gibberrelin biosynthesis. Selecting short plants for Pop3, 5, and 9 in 2011 did not prevent to detect this QTL because this gene also segregates in Pop6 and Pop8. The region also includes Psat5g299400, a gene belonging to the AUX/IAA family putatively involved in early response to auxin. The mQTL1.5 region also contains several genes encoding transcription factors expressed in flowers (Psat2g173120, Psat2g173160) or seeds (Psat2g173360, Psat2g173520, Psat2g173880) and genes encoding an aminotransferase (Psat2g172800) and a malate transporter (Psat2g173480) that could be relevant candidates for this metaQTL. The mQTL5.3 region includes two genes encoding phosphatidylethanolamine-binding proteins which expression peaks in the upper leaves as revealed by the Pea Gene Atlas[41]. The phosphatidylethanolamine-binding proteins in soybean and Arabidopsis thaliana are involved in flowering time, plant architecture and seed germination[42]. The mQTL1.1 region includes the locus AGPS2, a gene encoding ADP-glucose pyrophosphorylase (Psat2g005160) previously reported to be associated with seed size QTL in pea[16], as well as a gene encoding a subtilase (Psat2g005680) expressed in flowers and pods[41] different to subtilase gene (Psat0s1712g0120) in D’Erfurth et al.[19]. The mQTL3.1 region contains a gene encoding a Phosphoenolpyruvate carboxylase (Psat5g031640) expressed in above ground vegetative and reproductive tissues[41] that could impact C assimilation and partitioning in the plant[43,44]. The mQTL3.1 region encompass two tandem genes (Psat7g127600, Psat7g127680) putatively encoding Kelch motif proteins. Interestingly, Kelch Motif-containing serine/threonine protein phosphatase was associated with a seed size QTL in rice[45]. The present study pinpointed several robust metaQTL of seed yield and seed protein content in pea and proposed some candidate genes. This useful knowledge for marker assisted breeding, highlights the position and function of underlying genes to discover the causal polymorphisms of QTL in pea.

Materials and methods

Plant material and field experiments

A total of 1213 recombinant inbred lines (RIL) derived from nine mapping populations (Pop3 to Pop11) were developed by single seed descent (SSD). Pop3 to Pop10 are eight advanced inter-connected biparental RIL populations with six of them having Cameor as a common parent. Pop11 is an F3 population obtained from a cross between Cameor and Cerise. Passport data and phenotypic information relative to the parental lines of mapping population are described in Table S6 and in Tayeh et al.[20]. The population size, the generation, the number of lines genotyped and phenotyped used for QTL meta-analysis are indicated in Table 2. Phenotypic variability of morphological traits of RIL populations are given in supplementary Fig. S3. Pop3 to Pop10 and parental lines were evaluated in six field environments at INRAE Dijon, Domaine d’Epoisses, Bretenière, France (47°14′N, 5°05′E, altitude 210 m) between 2004 and 2011 in spring sowing, except for Pop9 in 2008, 2009, 2010 evaluated in winter sowing (Table 2). In 2011, a subset of the populations was sown. In Pop3, Pop5, and Pop9 where the Le gene controlling internode length segregates, only short lines were sampled for the trial in order to limit the shading of tall plant plots on their neighbours (supplementary Fig. S3). In Pop6 and Pop8, the Le gene also segregates but the lines Le type information was lacking before sowing. Furthermore, Burstin et al.[7] have shown the major effect of this gene in the variation of seed traits and sampling only short plants intended to improve the detection of other QTL regions. Field experiments were carried out using a randomized complete block design. Each plot consisted of twenty-five seeds sown in a row of two meters long, with one meter spacing between two adjacent rows. Plants were grown against trellises. Weeds, insects and diseases were controlled chemically. At maturity, a sample of ten plants per line was harvested and Seed Number per plant (SN), Seed Weight per plant (SW, gram), Thousand Seed Weight (TSW, gram) were measured and Seed Protein Content (SPC, percentage of seed dry weight) was analysed by near-infrared spectrometry as described in Burstin et al.[7]. Pop10 in 2008–2009 and Pop11 in 2008, 2009 and 2011 were phenotyped in glasshouses at INRAE Dijon, France (47° 32′ N, 5°07′ E, altitude 245 m) (Table 2). Two replicates of two plants per RIL were grown in 4-L pots filled with a 1:1 (v/v) mixture of sterilized atapulgite and clay balls (2–6 mm diameter) with a nitrate content (10 mM). The temperature and minimal day length were controlled (22° C/16 °C, 16-h photoperiod). SN, SW and TSW were measured at maturity per plant.

Statistical analysis

Statistical analysis of each environment dataset was carried out using R software v3.3 and v3.6[46]. ANOVA were performed using the “aov” function in R, to determine the significance levels of the genotype and replication effects. The statistical model was: Yijk = µ + gi + rj + bk/j + eijk where Yijk is the value of the trait for genotype i in block k of the replicate j, µ the general mean, gi the genotypic effect, rj the replicate effect, bk/j the block k effect in the replicate j and eijk the residual. In the case of the populations phenotyped in several environments (Table 2), environment and genotype-by-environment interaction effects were added to the linear model of ANOVA. Broad sense heritability (h2) was estimated from ANOVA by h2 = σ2g / [σ2g + (σ2e/n)] with σ2g the genetic variance, σ2e the residual variance and n the number of replicates. Normality of residuals and homogeneity of variances were checked using Shapiro–Wilk and Bartlett’s test (P ≥ 0.05). RILs’ adjusted means calculated using the “lsmeans” library were used for QTL analysis. Pearson correlation coefficients between the traits for all environments were calculated from RILs adjusted means using “chart.correlation” function and “PerformanceAnalytics” library. Frequency distributions from RILs adjusted means using “car” library and the “hist” function. Principal component analysis from RILs adjusted means was performed using “fviz_pca” function and “factoextra” library.

Genotyping and QTL analyses

The 1213 RIL derived from nine mapping populations (Pop3 to Pop11) were previously genotyped using the GenoPea 13.2 K SNP Array and used for the construction of individual genetic maps and a consensus map as described in Tayeh et al.[20]. The framework consensus map included 1869 markers and had a total length of 794.9 cM Haldane. The references of these population[8,10,20,47-50] are listed in Table 2. QTL composite interval mapping was carried out using the iterative QTL mapping method (iQTLm) of the MCQTL software v5.2.4[21]. Cofactor selection and QTL detection Pvalue thresholds were determined after 1000 permutation tests on all traits, for a global genome-wide type I risk of 10% for cofactor selection, and 5% for QTL detection. Cofactors were searched by forward regression, using a threshold of P value = 3.50. QTL were searched by iQTLm, using a threshold of P value = 3.80. For each environment, MultiPop detection was performed using the genotyping from GenoPea 13.2 K SNP Array and the consensus genetic map (1869 markers—794.9 cM) developed by Tayeh et al.[20]. Model additive and interpop connected was used in Multipop function for the populations in the same environment. The Pvalue, global R2, individual R2, confidence interval and allelic effect at each QTL were estimated for each trait and used for metaQTL analyses.

MetaQTL analyses

Meta-analyses were performed using BioMercator version 4.2 software[27]. Meta-analysis was implemented on each chromosome to estimate the number, the position, the probability of individual QTL belonging to the metaQTL and 95% confidence interval (CI) of the each metaQTL. QTLProj command enabled the homothetic projection of the positions and the confidence intervals of the individual QTL onto the consensus map. QTLClust command performed the clustering of the projected QTL referring to the same trait on a given chromosome into all possible numbers of hypothetic clusters. This command determined the best clustering model based on the following criteria[22]: AIC (Akaike Information Criterion), AICc, AIC3, BIC (Bayesian information criterion) and AWE (Average Weight of Evidence). The best QTL model was selected when values of the model selection criteria were the lowest in at least three of the five models. It corresponds to the optimal number of clusters that best explain the observed QTL distribution along the consensus chromosome map. Finally, the QTLClustInfo command provided the number of metaQTL for each chromosome, the better position, the confidence interval and the contribution of each individual QTL[20]. The metaQTL map was drawn using BioMercator software[27].

Candidate genes, functional annotation and expression

QTL flanking markers were positioned on ‘Cameor’ genome sequence[23] through BLAST search and annotated genes in the QTL interval were retrieved and listed. QTL and orthologous genes in soybean were identified using USDA-ARS Soybean Genetics Database, SoyBase[36] (https://www.soybase.org). Pea gene expression was obtained from Pea RNA-seq Gene Atlas[41] (https://bios.dijon.inra.fr/FATAL/cgi/pscam.cgi). Supplementary Figure 1. Supplementary Figure 2. Supplementary Figure 3. Supplementary Table 1. Supplementary Table 2. Supplementary Table 3. Supplementary Table 4. Supplementary Table 5. Supplementary Table 6. Supplementary Legends.
  33 in total

1.  Full-length de novo assembly of RNA-seq data in pea (Pisum sativum L.) provides a gene expression atlas and gives insights into root nodulation in this species.

Authors:  Susete Alves-Carvalho; Grégoire Aubert; Sébastien Carrère; Corinne Cruaud; Anne-Lise Brochot; Françoise Jacquin; Anthony Klein; Chantal Martin; Karen Boucherot; Jonathan Kreplak; Corinne da Silva; Sandra Moreau; Pascal Gamas; Patrick Wincker; Jérôme Gouzy; Judith Burstin
Journal:  Plant J       Date:  2015-10       Impact factor: 6.417

2.  Most significant genome regions involved in the control of earliness traits in bread wheat, as revealed by QTL meta-analysis.

Authors:  E Hanocq; A Laperche; O Jaminon; A-L Lainé; J Le Gouis
Journal:  Theor Appl Genet       Date:  2006-12-15       Impact factor: 5.699

3.  Functional evolution of phosphatidylethanolamine binding proteins in soybean and Arabidopsis.

Authors:  Zheng Wang; Zhengkui Zhou; Yunfeng Liu; Tengfei Liu; Qing Li; Yuanyuan Ji; Congcong Li; Chao Fang; Min Wang; Mian Wu; Yanting Shen; Tian Tang; Jianxin Ma; Zhixi Tian
Journal:  Plant Cell       Date:  2015-02-06       Impact factor: 11.277

4.  Genetic dissection of nitrogen nutrition in pea through a QTL approach of root, nodule, and shoot variability.

Authors:  Virginie Bourion; Syed Masood Hasan Rizvi; Sarah Fournier; Henri de Larambergue; Fabien Galmiche; Pascal Marget; Gérard Duc; Judith Burstin
Journal:  Theor Appl Genet       Date:  2010-02-24       Impact factor: 5.699

5.  BioMercator V3: an upgrade of genetic map compilation and quantitative trait loci meta-analysis algorithms.

Authors:  Olivier Sosnowski; Alain Charcosset; Johann Joets
Journal:  Bioinformatics       Date:  2012-06-01       Impact factor: 6.937

6.  Highly-multiplexed SNP genotyping for genetic mapping and germplasm diversity studies in pea.

Authors:  Chrystel Deulvot; Hélène Charrel; Amandine Marty; Françoise Jacquin; Cécile Donnadieu; Isabelle Lejeune-Hénaut; Judith Burstin; Grégoire Aubert
Journal:  BMC Genomics       Date:  2010-08-11       Impact factor: 3.969

7.  The genetic architecture of water-soluble protein content and its genetic relationship to total protein content in soybean.

Authors:  Dan Zhang; Haiyan Lü; Shanshan Chu; Huairen Zhang; Hengyou Zhang; Yuming Yang; Hongyan Li; Deyue Yu
Journal:  Sci Rep       Date:  2017-07-11       Impact factor: 4.379

8.  Construction of high-density linkage maps for mapping quantitative trait loci for multiple traits in field pea (Pisum sativum L.).

Authors:  Krishna K Gali; Yong Liu; Anoop Sindhu; Marwan Diapari; Arun S K Shunmugam; Gene Arganosa; Ketema Daba; Carolyn Caron; Reddy V B Lachagari; Bunyamin Tar'an; Thomas D Warkentin
Journal:  BMC Plant Biol       Date:  2018-08-16       Impact factor: 4.215

9.  A Combined Linkage and GWAS Analysis Identifies QTLs Linked to Soybean Seed Protein and Oil Content.

Authors:  Tengfei Zhang; Tingting Wu; Liwei Wang; Bingjun Jiang; Caixin Zhen; Shan Yuan; Wensheng Hou; Cunxiang Wu; Tianfu Han; Shi Sun
Journal:  Int J Mol Sci       Date:  2019-11-25       Impact factor: 5.923

10.  MetaQTL: a package of new computational methods for the meta-analysis of QTL mapping experiments.

Authors:  Jean-Baptiste Veyrieras; Bruno Goffinet; Alain Charcosset
Journal:  BMC Bioinformatics       Date:  2007-02-08       Impact factor: 3.169

View more
  9 in total

Review 1.  Omics-Facilitated Crop Improvement for Climate Resilience and Superior Nutritive Value.

Authors:  Tinashe Zenda; Songtao Liu; Anyi Dong; Jiao Li; Yafei Wang; Xinyue Liu; Nan Wang; Huijun Duan
Journal:  Front Plant Sci       Date:  2021-12-01       Impact factor: 5.753

2.  Phenotypic and Genotypic Characterization and Correlation Analysis of Pea (Pisum sativum L.) Diversity Panel.

Authors:  Ana Uhlarik; Marina Ćeran; Dalibor Živanov; Radu Grumeza; Leif Skøt; Ellen Sizer-Coverdale; David Lloyd
Journal:  Plants (Basel)       Date:  2022-05-16

Review 3.  Genetic and Genomic Resources for Soybean Breeding Research.

Authors:  Jakob Petereit; Jacob I Marsh; Philipp E Bayer; Monica F Danilevicz; William J W Thomas; Jacqueline Batley; David Edwards
Journal:  Plants (Basel)       Date:  2022-04-27

4.  Unraveling the Genetic Basis of Key Agronomic Traits of Wrinkled Vining Pea (Pisum sativum L.) for Sustainable Production.

Authors:  Admas Alemu; Agnese Kolodinska Brantestam; Aakash Chawade
Journal:  Front Plant Sci       Date:  2022-03-14       Impact factor: 5.753

Review 5.  Ensuring Global Food Security by Improving Protein Content in Major Grain Legumes Using Breeding and 'Omics' Tools.

Authors:  Uday C Jha; Harsh Nayyar; Swarup K Parida; Rupesh Deshmukh; Eric J B von Wettberg; Kadambot H M Siddique
Journal:  Int J Mol Sci       Date:  2022-07-12       Impact factor: 6.208

6.  Delineating meta-quantitative trait loci for anthracnose resistance in common bean (Phaseolus vulgaris L.).

Authors:  Safoora Shafi; Dinesh Kumar Saini; Mohd Anwar Khan; Vanya Bawa; Neeraj Choudhary; Waseem Ali Dar; Arun K Pandey; Rajeev Kumar Varshney; Reyazul Rouf Mir
Journal:  Front Plant Sci       Date:  2022-08-25       Impact factor: 6.627

7.  Identification of Quantitative Trait Loci Associated with Seed Protein Concentration in a Pea Recombinant Inbred Line Population.

Authors:  Junsheng Zhou; Krishna Kishore Gali; Ambuj Bhushan Jha; Bunyamin Tar'an; Thomas D Warkentin
Journal:  Genes (Basel)       Date:  2022-08-26       Impact factor: 4.141

Review 8.  Integrated breeding approaches to enhance the nutritional quality of food legumes.

Authors:  Rintu Jha; Hemant Kumar Yadav; Rahul Raiya; Rajesh Kumar Singh; Uday Chand Jha; Lekshmy Sathee; Prashant Singh; Mahendar Thudi; Anshuman Singh; Sushil Kumar Chaturvedi; Shailesh Tripathi
Journal:  Front Plant Sci       Date:  2022-09-07       Impact factor: 6.627

9.  Pea Grain Protein Content Across Italian Environments: Genetic Relationship With Grain Yield, and Opportunities for Genome-Enabled Selection for Protein Yield.

Authors:  Margherita Crosta; Nelson Nazzicari; Barbara Ferrari; Luciano Pecetti; Luigi Russi; Massimo Romani; Giovanni Cabassi; Daniele Cavalli; Adriano Marocco; Paolo Annicchiarico
Journal:  Front Plant Sci       Date:  2022-01-03       Impact factor: 5.753

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.