Literature DB >> 29018458

Genome-Wide Association Study of Major Agronomic Traits Related to Domestication in Peanut.

Xingguo Zhang1, Jianhang Zhang1, Xiaoyan He1, Yun Wang1, Xingli Ma1, Dongmei Yin1.   

Abstract

Peanut (Arachis hypogaea) consists of two subspecies, hypogaea and fastigiata, and has been cultivated worldwide for hundreds of years. Here, 158 peanut accessions were selected to dissect the molecular footprint of agronomic traits related to domestication using specific-locus amplified fragment sequencing (SLAF-seq method). Then, a total of 17,338 high-quality single nucleotide polymorphisms (SNPs) in the whole peanut genome were revealed. Eleven agronomic traits in 158 peanut accessions were subsequently analyzed using genome-wide association studies (GWAS). Candidate genes responsible for corresponding traits were then analyzed in genomic regions surrounding the peak SNPs, and 1,429 genes were found within 200 kb windows centerd on GWAS-identified peak SNPs related to domestication. Highly differentiated genomic regions were observed between hypogaea and fastigiata accessions using FST values and sequence diversity (π) ratios. Among the 1,429 genes, 662 were located on chromosome A3, suggesting the presence of major selective sweeps caused by artificial selection during long domestication. These findings provide a promising insight into the complicated genetic architecture of domestication-related traits in peanut, and reveal whole-genome SNP markers of beneficial candidate genes for marker-assisted selection (MAS) in future breeding programs.

Entities:  

Keywords:  domestication; genome-wide association studies; peanut; selective sweeps; single-nucleotide polymorphisms (SNPs)

Year:  2017        PMID: 29018458      PMCID: PMC5623184          DOI: 10.3389/fpls.2017.01611

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   5.753


Introduction

Peanut, also known as groundnut (Arachis hypogaea L.), is one of the most important edible oil crops in the world. Cultivated peanut is an allotetraploid (AABB, 2n = 40), harboring homoeologous A and B genomes putatively derived from the natural hybridization of two wild diploid species, A. duranensis (AA, 2n = 20) and A. ipaensis (BB, 2n = 20) (Seijo et al., 2007; Moretzsohn et al., 2012). Based on the presence or absence of floral axes on the main stem, cultivated peanut is classified into two subspecies: hypogaea and fastigiata. Subspecies hypogaea is generally described as having a prostrate growth habit with no floral axes on the main stem, while in subspecies fastigiata flowers arise on leaf axils on branches as well as the main stem (Krapovickas and Gregory, 1994). As the only cultivated species of Arachis, peanut has been grown for hundreds of years in more than 100 countries worldwide (Huang et al., 2012). The evolution of Arachis is therefore closely related to the domestication of cultivated peanut, with a vast number of morphological forms having evolved under cultivation. For example, selection of a more upright growth habit and shorter branches, which allow easier harvesting and increased seed size, has also resulted in a decrease in resistance to a number of important pathogens (Stalker and Simpson, 1995; Stalker et al., 2013). Domestication-related quantitative trait loci (QTLs) associated with agronomic traits and resistance have already been mapped; however, the utilization of potential alleles has been relatively limited because of the lack of appropriate molecular tools for analysis of these traits in cultivated peanut (Burow et al., 2001; Chu et al., 2011; Ravi et al., 2011; Fonceka et al., 2012; Tseng et al., 2016; Zhou et al., 2016). With the development of high throughout sequencing technologies, whole-genome sequencing (WGS) has become much more straightforward, allowing analysis of the impact of domestication on genomic variation. Specific-locus amplified fragment sequencing (SLAF-seq) is an efficient method of large-scale single nucleotide polymorphism (SNP) identification and genotyping using high-throughput sequencing, with many advantages such as lower costs and reduced genome complexity (Mamanova et al., 2010; Sun et al., 2013). So far, this new method has been successfully used to address fundamental questions regarding soybean domestication (Han et al., 2016). Genome-wide association studies (GWAS) have also been used to determine the genetic basis of traits underlying domestication in a wide range of organisms (Lin et al., 2014; Han et al., 2016). However, information on peanut domestication remains limited, largely due to the relatively large size (~2.8 Gb) and complexity of the tetraploid peanut genome (Bertioli et al., 2014). However, in 2014, genomes of the two wild progenitors of cultivated peanut were released by the International Peanut Genome Initiative (IPGI), benefiting studies of agronomic traits related to domestication (Bertioli et al., 2016). In the present study, high quality SNPs distributed throughout the peanut genome were mined using SLAF-seq of 158 peanut accessions. GWAS was subsequently conducted to identify the genetic architecture of 11 major agronomic traits related to domestication. The results present the first comprehensive view of genome-wide sequence variation in a diverse group of peanut accessions. Moreover, the SNPs and candidate genes related to major agronomic traits will help accelerate peanut breeding programs.

Materials and methods

Plant materials and trait analyses

A total of 158 peanut (A. hypogeae L.) accessions were examined in the present study, including 36 hypogaea type (group I) with no floral axes on the main stem, and 122 fastigiata type (group II) with flowers growing on both branch and the main stem (Table S1). These accessions were elite cultivars collected from different provinces of China, and some of them were greatly produced in special areas. Seeds from a single plant of each of the 158 accessions were grown in a randomized complete block design with three replications. Four plants from each replicate were then selected to investigate the following 11 agronomic traits: height of the main stem, total number of branches, branch type, leaf color, pod length, pod width, seed length, seed width, 10-pod weight, 10-seed weight and seed coat color.

SLAF sample preparation and sequencing

Genomic DNA was isolated from fresh leaves of a single plant per accession, and analyzed using SLAF-seq (Sun et al., 2013). To obtain >200,000 SLAF tags per genome, evenly distributed in unique genomic regions, different restriction enzyme combinations were tested using in silico digestion prediction. Two restriction enzymes (RsaI and HaeIII) were selected based on uniqueness and uniformity of simulated fragment alignments to the reference genome sequence of two diploids, A. ipaensis (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA_000816755.1_Araip1.0, gene model is prefixed by Araip) and A. duranensis (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA_000817695.1_Aradu1.0, gene model is prefixed by Aradu). Different length fragments of genomic DNA after digestion were then simulated in silico. Oryza sativa indica (http://rapdb.dna.affrc.go.jp/) was selected as the control genome to test the accuracy of the restriction enzyme digestion protocol using SOAP software (Li R. et al., 2009). A total of 10-ug of genomic DNA from each accession was used for the restriction reaction and subsequent restriction-ligation reactions, including the addition of A to the 3′ end and ligation with the Dual-index adapter. PCR was performed with the restriction-ligation samples (diluted) then the PCR products were purified with a Quick Spin column (Qiagen, Hilden, Germany) and electrophoresed on 2% (w/v) agarose gel. Fragments with expected lengths were isolated using a Gel Extraction Kit (Qiagen) and diluted for sequencing. Fragments of 314–344 bp were isolated for use as SLAF tags.

SNP calling

All reads were processed for quality control and filtered using Seqtk (https://github.com/lh3/seqtk). High quality paired-end reads were mapped onto the reference genome (A. ipaensis and A. duranensis) using the Burrows-Wheeler Aligner (BWA) (Li and Durbin, 2009). Realigner Target Creator and InDel-Realigner in GATK (McKenna et al., 2010) were used to realign InDels, and Unified Genotyper was used to call genotypes across the 158 accessions using the default parameters. Sequencing depths of each sample were calculated using the “Depth of Coverage” module of GATK. Single SNP markers were confirmed using GATK (McKenna et al., 2010) and SAMtools (Li H. et al., 2009). Given the allotetraploid nature of the peanut genome, the genotyping errors caused by partial homologous alignment were resolved by comparing the sequencing depth first, then filtered those SNPs with integrity (genotyped rate) and minor allele frequencyand (MAF). The exceptionally high homologous regions were not under special analyses, because there were not too much SNPs were found in these regions.

Population structure analysis and phylogenetic tree construction

Population structure was calculated using ADMIXTURE software (Alexander et al., 2009). The number of genetic clusters (K) was predefined as 1–10 to explore the population structure of the tested accessions. This analysis provided maximum likelihood estimates of the proportion of each sample derived from each of the K populations. SNPs were then used to calculate genetic distances among the 158 accessions, and phylogenetic trees were constructed using the neighbor-joining method in MEGA5 (Tamura et al., 2011). Principal component analysis (PCA) was performed using GAPIT software (Lipka et al., 2012).

GWAS of agronomic traits

High-integrity SNPs from the tested peanut accessions were used in association analyses using the general linear model (GLM) and compressed mixed linear model (MLM) with TASSEL software (Bradbury et al., 2007). The following formula was used: where Q is the population structure derived from ADMIXTURE software (Alexander et al., 2009), K is the relationship between samples obtained from SPAGeDi (Hardy and Vekemans, 2002), using Q in GLM and Q + K in MLM. X represents the genotype and Y the phenotype, allowing associated values of each SNP to be calculated. A value of <0.01 was used as the threshold to determine the existence of a significant association. Gene predictions were annotated according to the method used in Zhang et al. (2015). Candidate genes associated with each trait located within a 100-kb region upstream or downstream of peak SNPs, because the size of the larger linkage disequilibrium (LD) Block is mostly distributed around 200 kb when the whole genome was analyzed for LD Block. The r2 value (Marker_Rsq, is the marginal R-squared for the marker) was used to explain the phenotypic variation of each marker. It was calculated as SS Marker (after fitting all other model terms) / SS Total, where SS stands for sum of squares (https://bitbucket.org/tasseladmin/tassel-5-source/wiki/UserManual/GLM/GLM).

Population differentiation (F) and putative selective sweeps

The divergence index, F-statistics (F), is a measure of population differentiation or genetic distance based on genetic polymorphism data (Hudson et al., 1992). To determine potentially differentiated regions, FST estimations and sequence diversity (π) ratios were evaluated using a 100-kb sliding window with 10-kb steps (Lam et al., 2010). Highly differentiated genomic regions with a significant FST value (p = 5%) and the top 5% of π ratios were defined as potential selective sweeps (Li et al., 2013).

Results

Phenotypic variation among peanut accessions

Phenotypic evaluation revealed a broad range of variation among the 158 peanut accessions (Figure 1, Table S1). The descriptive statistics of phenotypic variation of eight traits were listed in Table S2. Height of main stem ranged from 12 to 87.33 cm, with an average of 38.35 cm. Total number of branches also varied with an average of 12.8. All accessions showed continuous distribution of the three pod-related and three seed-related traits, with a coefficient of variation (CV) of 18.63% for seed length and 66.98% for the total number of branches, suggesting a quantitative inheritance pattern. The remaining three traits, branch type, leaf color and seed coat color, were evaluated as quality variation.
Figure 1

Phenotypic variation of eight traits among 158 peanut accessions.

Phenotypic variation of eight traits among 158 peanut accessions.

SLAF-Tags and SNP data

A total of 369,725 high quality SLAFs evenly distributed on 10 A and 10 B chromosomes were obtained from 397.19 M paired-end reads after sequence alignment with the reference genome (A. duranensis and A. ipaensis). Sequencing depths ranged from 32.42 to 4.94 X, with an average of 8.06 X. Polymorphic SLAFs defined by both GATK and SAMtools were recorded as reliable SNPs, resulting in a total of 268,889 SNP markers among the 158 accessions. After filtering SNPs located on scaffolds, 17,338 high quality SNPs with an MAF > 0.05 and integrity >0.5 were selected for further analyses (Table 1, Figure S1).
Table 1

Distribution of 17,338 SNPs in 20 chromosomes identified in 158 peanut accessions.

ChrChr length(Mb)Number of SNPsNumber of genesAverage number of SNPs per MbAverage number of genes per Mbr2 of chr LD
A01106.0272534937330.177
A0292.631059312911340.356
A03133.1497049297370.071
A04121.1874938336320.185
A05108.2879338687360.166
A06110.7386237128340.149
A0777.9548228506370.161
A0848.9431429406600.151
A09119.0088736947310.173
A10107.2669734886330.186
B01136.91107337918280.084
B02108.6471035807330.217
B03135.601427513311380.029
B04133.21104742768320.159
B05149.44101144337300.251
B06136.7287643726320.094
B07125.99109338139300.032
B08129.1596237107290.074
B09146.5065444094300.196
B10135.8194741647310.238
A1025.127538359367350.177
B1337.969800416817310.137
AB2363.0817338776177330.157

Chr, Chromosome; r.

Distribution of 17,338 SNPs in 20 chromosomes identified in 158 peanut accessions. Chr, Chromosome; r. The selected SNP markers were not evenly distributed across the whole genome, with 7,538 and 9,800 markers on the A and B subgenomes, respectively. Chromosome B03 harbored the highest proportion of SNPs (8.23%; 1,427 of 17,338), while chromosome A08, the shortest chromosome at 48.94 Mb, contained the least (1.81%; 314 of 17,338). The average number of SNPs/Mb was seven on both the A and B subgenomes, while average genes/Mb were 35 and 31, respectively. Chromosome B03 had the highest number of genes (5,188 of 77,617), while its counterpart, chromosome A03, contained the second highest (4,929 of 77,617). LD was estimated as the r2 value, revealing uneven distribution of SNPs on each chromosome. r2 values of the A subgenome ranged from 0.071 on chromosome A03 to 0.356 on chromosome A02, while those of the B subgenome ranged from 0.029 on chromosome B03 to 0.251 on chromosome B05. Average r2 values of the A and B subgenomes were 0.177 and 0.137, respectively, revealing differences in the level of LD between different chromosomes and subgenomes.

Population structure and genome-wide divergence in peanut

To examine divergence of the 158 accessions during evolution, analysis of population structure, phylogenetic relationships and PCA were carried out using the 17,338 selected SNPs. According to the K genetic clusters, the most likely number of inferred members was 2 with ΔK = 0.47 (Figure 2C). Nevertheless, accessions were classified into three major clusters in the NJ phylogenetic tree, with hypogaea accessions (group I) forming a separate cluster, and fastigiata accessions (group II) forming another large cluster (Figure 2B). PCA was conducted using the first two principal components (Figure 2A), PC1 (with variance explain 8.16%) and PC2 (with variance explain 5.23%), revealing that the accessions probably divided into two groups, with different degrees of introgression between the two subspecies during cultivation.
Figure 2

Principal component analysis (PCA), phylogenetic tree construction and population structure analysis of the 158 peanut accessions. (A) Scatter plots of the first two principal components. The horizontal and vertical coordinates represent PC1 (with variance explain 8.16%) and PC2 (with variance explain 5.23%). Each dot represents an accession. (B) Phylogenetic tree constructed with 17,338 high quality SNPs. (C) Population structure dividing the accessions into two groups (ΔK value was 0.47): subsp. hypogaea (group I) and subsp. fastigiata (group II).

Principal component analysis (PCA), phylogenetic tree construction and population structure analysis of the 158 peanut accessions. (A) Scatter plots of the first two principal components. The horizontal and vertical coordinates represent PC1 (with variance explain 8.16%) and PC2 (with variance explain 5.23%). Each dot represents an accession. (B) Phylogenetic tree constructed with 17,338 high quality SNPs. (C) Population structure dividing the accessions into two groups (ΔK value was 0.47): subsp. hypogaea (group I) and subsp. fastigiata (group II).

GWAS of loci underlying domestication traits in peanut

Eleven traits were selected for identification of underlying genetic loci and regions. In total, 51 SNP peaks associated with six traits reached the corrected P value according to the Bonferroni method (P < 5.76e−07 at a = 0.01 or −log10(P) = 6.238) (Figure 3, Table S3). A 100-kb genomic region on each side of the peak SNP associated with the corresponding traits was subsequently analyzed for identification of candidate genes (Table 2).
Figure 3

Genome-wide association studies (GWAS) of traits associated with peanut domestication. Manhattan plots with corresponding small QQ plots are shown in each figure of each trait. Associated significant SNPs are marked by arrows with reported candidate genes. The Bonferroni multiple test threshold is shown by a dotted blue line. MD: Malate dehydrogenase; P450: Cytochrome P450 superfamily protein; bHLH: bHLH transcription factor; ARF: Auxin response factor; MAP: Microtubule-associated protein.

Table 2

Six significant SNPs and predicted candidate genes associated with major agronomic traits in 158 peanut accessions.

TraitSNP location (bp)Gene modelDistance to SNP (kb)Functional annotation
HMSA03-26481539Aradu 52T5J32 (26,514,081–26,516,597)Malate dehydrogenase
TNBA10-90376017Aradu J85DC8 (90,384,547–90,391,660)Cytochrome P450 superfamily protein
Seed lengthA03-6992035Aradu T1PSR57 (7,049,674–7,051,562)bHLH transcription factor
Seed weight
Seed weightA03-119879303Aradu PZ2UH42 (1,19,921,885–1,19,926,789)Auxin response factor
Pod weightA05-32373760Aradu CVC5Q32 (32,341,000–32,343,951)Microtubule-associated protein
Seed coat colorB03-22076736Araip P4GTD12 (22,064,568–22,066,022)bHLH transcription factor

HMS, Height of main stem; TNB, Total number of branches.

Genome-wide association studies (GWAS) of traits associated with peanut domestication. Manhattan plots with corresponding small QQ plots are shown in each figure of each trait. Associated significant SNPs are marked by arrows with reported candidate genes. The Bonferroni multiple test threshold is shown by a dotted blue line. MD: Malate dehydrogenase; P450: Cytochrome P450 superfamily protein; bHLH: bHLH transcription factor; ARF: Auxin response factor; MAP: Microtubule-associated protein. Six significant SNPs and predicted candidate genes associated with major agronomic traits in 158 peanut accessions. HMS, Height of main stem; TNB, Total number of branches. As a result, a total of 13 significant SNPs were associated with height of the main stem, with the peak SNP A03-26481539 explaining 27.55% of the phenotypic variation. The Aradu 52T5J gene was located ~32 kb from this SNP, and is known to encode a malate dehydrogenase, which is thought to be related to biomass and plant height in maize (Carrari et al., 2005). Peak SNP A10-90376017 for the total number of branches explained 26.64% of the phenotypic variation, and was located only ~8 kb from the nearest gene, Aradu J85DC, which encodes a cytochrome P450 superfamily protein reportedly involved in the strigolactone synthetic pathway in rice (Gomez-Roldan et al., 2008) and soybean. The peak SNP A03-6992035 explained 15.52 and 19.99% of the phenotypic variation of seed length and 10-seed weight, respectively. A candidate gene encoding a bHLH transcription factor was located ~57 kb from this peak, and is thought to be a pleiotropic gene involved in seed development (Kondou et al., 2008). Aradu PZ2UH was located ~42 kb from A03-119879303, another peak SNP related to 10-seed weight (explaining 18.69% phenotypic variation) on chromosome A03, and encodes an auxin response factor (ARF) involved in plant growth and seed development (Okushima et al., 2005; Attia et al., 2009). A peak SNP explaining 26.32% of the phenotypic variation in pod width was located on chromosome A05. One candidate gene, Aradu CVC5Q, was located ~32 kb away, and encodes a microtubule-associated protein (MAP) that reportedly influences seed shape by regulating microtubule growth (Deng et al., 2012). The candidate gene for seed coat color, located ~12 kb from the peak SNP B03-22076736, which explained 21.94% of the phenotypic variation), also represented a bHLH transcription factor previously found to influence seed coat color in rice (Sweeney et al., 2006). These results suggest that GWAS was effective in clarifying candidate genes related to major domestication-related traits in peanut.

Genomic changes and target regions associated with selection

Genomic changes related to selective processes during domestication can be determined using genotypic data. In this study, a total of 1,429 genes were defined in 335 highly differentiated genomic regions in the two groups using an F threshold of 0.261 (determined by the 5% right tails of the F distribution) and a π I / π II ratio threshold of 2.03 (Figures 4B,C). The gene distribution of the resulting sweeps was subsequently determined (Table S4). A03 contained the most number of genes (662 genes, 45.94%), and presented stronger selective sweep signals in group I than group II. A total of 186 and 158 genes were found on chromosomes B03 and B08, respectively, corresponding to 12.91 and 11.06%, respectively, and ranking them second and third during peanut artificial selection. No selective sweep signals were detected on chromosomes B01 or B10.
Figure 4

Selective sweeps and population differentiation analyses of subsp. hypogaea (group I) and subsp. fastigiata (group II). (A) Regional plot of 15 SNPs in the 200-kb selective sweep region on chromosome A03. The bottom panel indicates the extent of LD in the region based on pairwise r2-values which are shown in the LD triangles. (B) Distributions of selective sweeps in subsp. hypogaea (group I) and subsp. fastigiata (group II). (C) Manhattan plots of FST values of peanut chromosome.

Selective sweeps and population differentiation analyses of subsp. hypogaea (group I) and subsp. fastigiata (group II). (A) Regional plot of 15 SNPs in the 200-kb selective sweep region on chromosome A03. The bottom panel indicates the extent of LD in the region based on pairwise r2-values which are shown in the LD triangles. (B) Distributions of selective sweeps in subsp. hypogaea (group I) and subsp. fastigiata (group II). (C) Manhattan plots of FST values of peanut chromosome. A total of 15 SNPs were found in the 200-kb selective sweep regions of major peak SNP A03-6992035 (Figure 4A), which is related to seed length and seed weight. Three SNPs were also found in the two gene models, Aradu D69CU and Aradu T1PSR, respectively, both of which encode a bHLH transcription factor. In total, 21 genes were found in this region, and only nine showed annotation. In addition to the two bHLH genes mentioned above, three major intrinsic protein (MIP) genes involved in carbohydrate transport and metabolism, and one F-box gene and one proline-rich protein (PRP) gene both involved in plant growth and stress response, were also identified in this region. Nucleotide-binding leucine-rich repeat (NB-LRR)-encoding genes are of particular interest because they confer resistance against pests and disease. Possible resistant genes in the highly differentiated genomic regions were therefore analyzed. Ten and 25 genes containing the NB-LRR domains were identified on the A and B subgenomes, respectively. Of the 10 NB-LRR-encoding genes on the A subgenome, eight were located on chromosome A03, and of the 25 on the B subgenome, 18 were located in two genomic regions of chromosome B07. One region was located within 282 kb of chromosome B07 (between 1714315 and 1996823) and harbored 15 NB-LRR-encoding genes (Table S5), suggested that this area contains a resistance gene family. The second region was located between 4775007 and 4820955 on chromosome B07, and included three NB-LRR-encoding genes. Interestingly, five NB-LRR-encoding genes were found in a major selective sweep on chromosome B03. This region covered 6.89 Mb and contained 107 selective genes, including one gene encoding a Gibberellin-related protein and three genes related to flavonoid biogenesis and regulation. These findings suggest that this region plays important roles in both resistance and plant-type-related traits.

Discussion

Species in the genus Arachis are widely distributed across tropical, subtropical and warm temperate zones, but only the cultivated peanut (A. hypogaea) is an important food crop. Peanut evolved morphologically during domestication, allowing it to adapt to various agroecological environments (Stalker and Simpson, 1995). Subsp. fastigiata has more advanced traits than subsp. hypogaea in terms of plant habit and pod morphology, and in this study, represented a larger portion (77.2%) of the selected accessions (Figure 1 and Table S1). In line with this, Krapovickas (1969) postulated that hypogaea (subsp. hypogaea) represents the most ancient variety due to its runner habit, lack of floral spikes and branching patterns, which are similar to the characteristics of wild Arachis species. In this study, analysis of phylogenic relationships, population structure and PCA among 158 peanut accessions revealed that accessions in subsp. fastigiata contain unique genomic regions that differ from those in hypogaea (Figure 2). Four wild diploid accessions were previously sequenced and removed due to ploidy difference (data not shown). Interestingly, some accessions were not clearly distinguishable, possibly because they underwent differing degrees of genetic introgression during manual selection. Cultivated peanut has a narrow genetic base, possibly resulting from a single polyploidization event (Kochert et al., 1996), and can therefore be improved using introgression genes for disease resistance and other important agronomic traits from wild species (Dwivedi et al., 2007; Holbrook et al., 2008; Isleib et al., 2011). Molecular markers for economically significant traits have been widely used to improve the speed and efficiency of MAS breeding in peanut (Selvaraj et al., 2009; Chu et al., 2011). SNP markers, the most abundant molecular marker, are a cost-effective method of high-throughput molecular genotyping, but have limited use in peanut because of the homeologous A and B subgenomes. In this study, the two diploid genome (A. ipaensis and A. duranensis) were used as the reference genomes. In order to accurately allocate marker tags to the correct genome, sequencing depth was combined with integrity and MAF to filter SNPs. There are not too much SNPs found in exceptionally high homologous regions, which would not influence the following GWAS results. Finally, a total of 17,338 high quality SNPs were identified across the whole genome by reduced representation sequencing technology (the SLAF-seq method) (Table 1 and Figure S1). The average sequencing depth was 8.06-fold, and the average number of SNPs / Mb was seven in both the A and B subgenomes, suggesting a reasonable density of markers given the relatively low cost of the genotyping method (He et al., 2011; Li et al., 2013; Morris et al., 2013). GWAS is considered an efficient method for genetic analysis of complex trait variation (Han et al., 2016; Fahrenkrog et al., 2017). Based on the markers developed in this study, 51 association SNP peaks were identified in this study, along with candidate genes or loci corresponding to domestication-related traits (Figure 3 and Table S3). A total of 13 association peaks for height of the main stem were found on eight chromosomes, while 18 peaks representing seed weight were found on six chromosomes, suggesting that there is a group of QTLs responsible for these traits. The basic transcription factor, bHLH family, is involved in plant growth and developmental processes as well as stress responses and secondary metabolism (Heim et al., 2003), and in this study, was linked with seed shape and seed coat color (Table 2). The Aradu PBR53 gene located ~37 kb from SNP A05-32373760 for pod weight, which was predicted to encode formin protein, a newly revealed regulatory factor of cell skeleton assembly (Guo and Ren, 2006). In Arabidopsis thaliana, it was found to play a role in cell division and cell polarity (Zhang et al., 2016), but studies in peanut are limited, making it a good focal point for future research. Using the SNPs identified in this study, further analysis of agronomic traits will be possible, allowing rapid identification of candidate genes for future peanut breeding programs. Similar to other important crop plants, peanut has undergone continuous selection through domestication and intensive breeding events. Domestication from a wild to a cultivated species is largely associated with genome-wide duplications, mutations, selection and genetic drift (Kim et al., 2010; Li et al., 2013). Many traits thought to be involved in peanut domestication were previously clustered in three genomic regions on chromosomes A07, B02, and B05 (Fonceka et al., 2012), suggesting that several linked genes are responsible for the phenotypic variation in 158 peanut accessions collected from all growing regions of China. In this study, selective sweeps in the two peanut groups were measured using F values and sequence diversity (π) ratios. The existence of major selective sweeps on chromosome A03 indicated that this chromosome was subjected to primary selection pressure (Figure 4). This finding was similar to the results of GWAS, whereby several important genes related to domestication traits were located on chromosome A03. SNPs in these regions are therefore likely to be valuable for MAS breeding. There are several major diseases, like early leaf spot(caused by Cercospora arachidicola), late leaf spot (caused by Cercosporidium personatum), and Tomato spotted wilt virus (TSWV, spread by thrips), may cause significant yield loss in peanut production (Nigam et al., 2012). Breeding of resistance cultivars is the most cost-effective method of reducing disease damage in peanuts. The role of NB-LRR proteins in plant defense against pathogens has been extensively studied (DeYoung and Innes, 2006; Nagy and Bennetzen, 2008). In this study, genes encoding NB-LRR protein were mainly distributed on chromosomes A03 and B07, especially in a 280 kb region on B07, which contained 15 promising candidate genes (Table S5). Wang et al. (2013) constructed two genetic maps to identify QTLs for thrips, TSWV and leaf spot (LS) in peanut. One QTL for TSWV, qF2TSWV3, was identified in the same marker interval (seq5D5-GM2744) on linkage group AhII with one QTL for LS, qF2LS1. Another QTL, qF5LS10 for LS, was identified between GM1254 and seq15C10 on linkage LGT17. The two linkage groups, AhII and LGT17, were colinearized with B07 of reference consensus genetic map (the three QTLs mentioned above covered 24 cM), suggested that there is a cluster of resistance QTLs on B07. Pandey et al. (2017) found that of the total 42 QTLs linked to diseases resistance in peanut, 34 were mapped on the A sub-genome and eight mapped on the B sub-genome, suggesting that the A sub-genome harbors more resistance genes than the B sub-genome, which is in agreement with Bertioli et al. (2016), who reported that there are more NB-LRR-encoding disease resistance-like genes in the “A” genome (397 genes in A. duranensis) than in the “B” genome (345 genes in A. ipaensis). Though function of these genes needs to be further validated, these findings suggest that SNPs located in these major selective sweeps will facilitate future breeding of resistant cultivars in peanut. The major selective sweep on chromosome B03 harbored a number of genes related to both resistance and plant-type-related traits, similar to a recent study (Zhou et al., 2016), suggesting that this genetic region is worthy of further investigation.

Conclusion

In summary, this study provides the first insight into the complex genetic relationship between agronomic traits and domestication processes in peanut. Chromosomes A03 and B03 harbored major genes related to peanut domestication, while chromosome B07 contained a cluster of NB-LRR-encoding genes. Further studies are now needed to understand the genetic mechanisms underlying yield- and seed-related traits and identify potential resistant genes for future peanut breeding programs.

Author contributions

XZ, JZ, XH, DY carried out phenotyping and genotyping. XZ, YW, DY managed the project. XZ, XM, DY analyzed the data. XZ wrote the paper. All of the authors read and approved the manuscript.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  41 in total

Review 1.  Target-enrichment strategies for next-generation sequencing.

Authors:  Lira Mamanova; Alison J Coffey; Carol E Scott; Iwanka Kozarewa; Emily H Turner; Akash Kumar; Eleanor Howard; Jay Shendure; Daniel J Turner
Journal:  Nat Methods       Date:  2010-02       Impact factor: 28.547

2.  SOAP2: an improved ultrafast tool for short read alignment.

Authors:  Ruiqiang Li; Chang Yu; Yingrui Li; Tak-Wah Lam; Siu-Ming Yiu; Karsten Kristiansen; Jun Wang
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

3.  RETARDED GROWTH OF EMBRYO1, a new basic helix-loop-helix protein, expresses in endosperm to control embryo growth.

Authors:  Youichi Kondou; Miki Nakazawa; Mika Kawashima; Takanari Ichikawa; Takeshi Yoshizumi; Kumiko Suzuki; Akie Ishikawa; Tomoko Koshi; Ryo Matsui; Shu Muto; Minami Matsui
Journal:  Plant Physiol       Date:  2008-06-20       Impact factor: 8.340

4.  Population genomic and genome-wide association studies of agroclimatic traits in sorghum.

Authors:  Geoffrey P Morris; Punna Ramu; Santosh P Deshpande; C Thomas Hash; Trushar Shah; Hari D Upadhyaya; Oscar Riera-Lizarazu; Patrick J Brown; Charlotte B Acharya; Sharon E Mitchell; James Harriman; Jeffrey C Glaubitz; Edward S Buckler; Stephen Kresovich
Journal:  Proc Natl Acad Sci U S A       Date:  2012-12-24       Impact factor: 11.205

5.  GAPIT: genome association and prediction integrated tool.

Authors:  Alexander E Lipka; Feng Tian; Qishan Wang; Jason Peiffer; Meng Li; Peter J Bradbury; Michael A Gore; Edward S Buckler; Zhiwu Zhang
Journal:  Bioinformatics       Date:  2012-07-13       Impact factor: 6.937

6.  Genome-wide association study reveals putative regulators of bioenergy traits in Populus deltoides.

Authors:  Annette M Fahrenkrog; Leandro G Neves; Márcio F R Resende; Ana I Vazquez; Gustavo de Los Campos; Christopher Dervinis; Robert Sykes; Mark Davis; Ruth Davenport; William B Barbazuk; Matias Kirst
Journal:  New Phytol       Date:  2016-09-06       Impact factor: 10.151

7.  Enhanced photosynthetic performance and growth as a consequence of decreasing mitochondrial malate dehydrogenase activity in transgenic tomato plants.

Authors:  Adriano Nunes-Nesi; Fernando Carrari; Anna Lytovchenko; Anna M O Smith; Marcelo Ehlers Loureiro; R George Ratcliffe; Lee J Sweetlove; Alisdair R Fernie
Journal:  Plant Physiol       Date:  2005-01-21       Impact factor: 8.340

8.  Genomic analyses identify distinct patterns of selection in domesticated pigs and Tibetan wild boars.

Authors:  Mingzhou Li; Shilin Tian; Long Jin; Guangyu Zhou; Ying Li; Yuan Zhang; Tao Wang; Carol K L Yeung; Lei Chen; Jideng Ma; Jinbo Zhang; Anan Jiang; Ji Li; Chaowei Zhou; Jie Zhang; Yingkai Liu; Xiaoqing Sun; Hongwei Zhao; Zexiong Niu; Pinger Lou; Lingjin Xian; Xiaoyong Shen; Shaoqing Liu; Shunhua Zhang; Mingwang Zhang; Li Zhu; Surong Shuai; Lin Bai; Guoqing Tang; Haifeng Liu; Yanzhi Jiang; Miaomiao Mai; Jian Xiao; Xun Wang; Qi Zhou; Zhiquan Wang; Paul Stothard; Ming Xue; Xiaolian Gao; Zonggang Luo; Yiren Gu; Hongmei Zhu; Xiaoxiang Hu; Yaofeng Zhao; Graham S Plastow; Jinyong Wang; Zhi Jiang; Kui Li; Ning Li; Xuewei Li; Ruiqiang Li
Journal:  Nat Genet       Date:  2013-10-27       Impact factor: 38.330

9.  Quantitative Trait Locus Analysis of Late Leaf Spot Resistance and Plant-Type-Related Traits in Cultivated Peanut (Arachis hypogaea L.) under Multi-Environments.

Authors:  Xiaojing Zhou; Youlin Xia; Junhua Liao; Kede Liu; Qiang Li; Yang Dong; Xiaoping Ren; Yuning Chen; Li Huang; Boshou Liao; Yong Lei; Liying Yan; Huifang Jiang
Journal:  PLoS One       Date:  2016-11-21       Impact factor: 3.240

10.  SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing.

Authors:  Xiaowen Sun; Dongyuan Liu; Xiaofeng Zhang; Wenbin Li; Hui Liu; Weiguo Hong; Chuanbei Jiang; Ning Guan; Chouxian Ma; Huaping Zeng; Chunhua Xu; Jun Song; Long Huang; Chunmei Wang; Junjie Shi; Rui Wang; Xianhu Zheng; Cuiyun Lu; Xiaowu Wang; Hongkun Zheng
Journal:  PLoS One       Date:  2013-03-19       Impact factor: 3.240

View more
  21 in total

1.  Genetic diversity, SNP-trait associations and genomic selection accuracy in a west African collection of Kersting's groundnut [Macrotyloma geocarpum(Harms) Maréchal & Baudet].

Authors:  Félicien Akohoue; Enoch Gbenato Achigan-Dako; Clay Sneller; Allen Van Deynze; Julia Sibiya
Journal:  PLoS One       Date:  2020-06-30       Impact factor: 3.240

2.  Genetic Diversity, Population Structure, and Botanical Variety of 320 Global Peanut Accessions Revealed Through Tunable Genotyping-by-Sequencing.

Authors:  Zheng Zheng; Ziqi Sun; Yuanjin Fang; Feiyan Qi; Hua Liu; Lijuan Miao; Pei Du; Lei Shi; Wei Gao; Suoyi Han; Wenzhao Dong; Fengshou Tang; Feng Cheng; Haiyan Hu; Bingyan Huang; Xinyou Zhang
Journal:  Sci Rep       Date:  2018-09-28       Impact factor: 4.379

3.  Single-Locus and Multi-Locus Genome-Wide Association Studies in the Genetic Dissection of Fiber Quality Traits in Upland Cotton (Gossypium hirsutum L.).

Authors:  Chengqi Li; Yuanzhi Fu; Runrun Sun; Yuanyuan Wang; Qinglian Wang
Journal:  Front Plant Sci       Date:  2018-08-17       Impact factor: 5.753

Review 4.  Genetics, genomics and breeding of groundnut (Arachis hypogaea L.).

Authors:  Haile Desmae; Pasupuleti Janila; Patrick Okori; Manish K Pandey; Babu N Motagi; Emmanuel Monyo; Omari Mponda; David Okello; Dramane Sako; Candidus Echeckwu; Richard Oteng-Frimpong; Amos Miningou; Chris Ojiewo; Rajeev K Varshney
Journal:  Plant Breed       Date:  2018-08-29       Impact factor: 1.832

5.  Identification of candidate genes for key fibre-related QTLs and derivation of favourable alleles in Gossypium hirsutum recombinant inbred lines with G. barbadense introgressions.

Authors:  Furong Wang; Jingxia Zhang; Yu Chen; Chuanyun Zhang; Juwu Gong; Zhangqiang Song; Juan Zhou; Jingjing Wang; Chengjie Zhao; Mengjia Jiao; Aiying Liu; Zhaohai Du; Youlu Yuan; Shoujin Fan; Jun Zhang
Journal:  Plant Biotechnol J       Date:  2019-09-20       Impact factor: 9.803

6.  Profiling of Nutraceuticals and Proximates in Peanut Genotypes Differing for Seed Coat Color and Seed Size.

Authors:  Spurthi N Nayak; Viresh Hebbal; Pushpa Bharati; Hajisab L Nadaf; Gopalkrishna K Naidu; Ramesh S Bhat
Journal:  Front Nutr       Date:  2020-04-15

7.  Dissection of the Genetic Basis of Yield-Related Traits in the Chinese Peanut Mini-Core Collection Through Genome-Wide Association Studies.

Authors:  Xiaojing Zhou; Jianbin Guo; Manish K Pandey; Rajeev K Varshney; Li Huang; Huaiyong Luo; Nian Liu; Weigang Chen; Yong Lei; Boshou Liao; Huifang Jiang
Journal:  Front Plant Sci       Date:  2021-05-20       Impact factor: 5.753

8.  The Genetic Base for Peanut Height-Related Traits Revealed by a Meta-Analysis.

Authors:  Juan Wang; Caixia Yan; Dachuan Shi; Xiaobo Zhao; Cuiling Yuan; Quanxi Sun; Yifei Mou; Haoning Chen; Yuan Li; Chunjuan Li; Shihua Shan
Journal:  Plants (Basel)       Date:  2021-05-25

9.  Genetic Divergence and Population Structure in Weedy and Cultivated Broomcorn Millets (Panicum miliaceum L.) Revealed by Specific-Locus Amplified Fragment Sequencing (SLAF-Seq).

Authors:  Chunxiang Li; Minxuan Liu; Fengjie Sun; Xinyu Zhao; Mingyue He; Tianshu Li; Ping Lu; Yue Xu
Journal:  Front Plant Sci       Date:  2021-06-24       Impact factor: 5.753

Review 10.  Adapting legume crops to climate change using genomic approaches.

Authors:  Mahsa Mousavi-Derazmahalleh; Philipp E Bayer; James K Hane; Babu Valliyodan; Henry T Nguyen; Matthew N Nelson; William Erskine; Rajeev K Varshney; Roberto Papa; David Edwards
Journal:  Plant Cell Environ       Date:  2018-06-13       Impact factor: 7.228

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.