Literature DB >> 27576450

Identification of favorable SNP alleles and candidate genes for traits related to early maturity via GWAS in upland cotton.

Junji Su1,2,3, Chaoyou Pang2, Hengling Wei2, Libei Li2, Bing Liang2, Caixiang Wang2, Meizhen Song2, Hantao Wang2, Shuqi Zhao2, Xiaoyun Jia1,2, Guangzhi Mao2, Long Huang4, Dandan Geng4, Chengshe Wang5, Shuli Fan6, Shuxun Yu7,8.   

Abstract

BACKGROUND: Early maturity is one of the most important and complex agronomic traits in upland cotton (Gossypium hirsutum L). To dissect the genetic architecture of this agronomically important trait, a population consisting of 355 upland cotton germplasm accessions was genotyped using the specific-locus amplified fragment sequencing (SLAF-seq) approach, of which a subset of 185 lines representative of the diversity among the accessions was phenotypically characterized for six early maturity traits in four environments. A genome-wide association study (GWAS) was conducted using the generalized linear model (GLM) and mixed linear model (MLM).
RESULTS: A total of 81,675 SNPs in 355 upland cotton accessions were discovered using SLAF-seq and were subsequently used in GWAS. Thirteen significant associations between eight SNP loci and five early maturity traits were successfully identified using the GLM and MLM; two of the 13 associations were common between the models. By computing phenotypic effect values for the associations detected at each locus, 11 highly favorable SNP alleles were identified for five early maturity traits. Moreover, dosage pyramiding effects of the highly favorable SNP alleles and significant linear correlations between the numbers of highly favorable alleles and the phenotypic values of the target traits were identified. Most importantly, a major locus (rs13562854) on chromosome Dt3 and a potential candidate gene (CotAD_01947) for early maturity were detected.
CONCLUSIONS: This study identified highly favorable SNP alleles and candidate genes associated with early maturity traits in upland cotton. The results demonstrate that GWAS is a powerful tool for dissecting complex traits and identifying candidate genes. The highly favorable SNP alleles and candidate genes for early maturity traits identified in this study should be show high potential for improvement of early maturity in future cotton breeding programs.

Entities:  

Keywords:  Candidate gene; Early maturity traits; GWAS; Gossypium hirsutum L; SLAF-seq; SNP alleles

Mesh:

Year:  2016        PMID: 27576450      PMCID: PMC5006539          DOI: 10.1186/s12864-016-2875-z

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Cotton is the most important natural textile fiber source worldwide. The tetraploid species Gossypium hirsutum L. (2n = 4x = 52, AD genome), also referred to as ‘upland cotton’, accounts for 95 % of the world’s cotton production. Early fiber production is one of the most important traits in cotton, and the selection and popularization of early-maturing cotton varieties are of significant value in reducing the dilemma of whether to plant farmlands with cotton or cereals during cropping system optimization in China [1, 2]. Early maturity is a complex quantitative trait that mainly includes components such as the growth period, growth stages (including the seedling period, squaring period, flowering and boll-setting period (FBP) and boll-opening period), yield percentage before frost (YPBF), node of the first fruiting branch (NFFB), and height of the node of the first fruiting branch (HNFFB) [1, 2]. These components of this quantitative trait are regulated by quantitative trait loci (QTLs) and the environment, as reflected in different genetic models in different cultivars [3]. Early maturity has been reported to be negatively correlated with yield and fiber quality [3]. It is difficult to simultaneously improve early maturity, yield and fiber quality using conventional breeding methods. Fortunately, the rapid development of applied genomics research has provided alternative tools to improve efficiency in plant breeding programs. For example, molecular markers linked to causal genes or QTLs can be used for marker-assisted selection (MAS) and genomic selection. Over the last two decades, many QTLs related to target traits have been identified using QTL-mapping methods by constructing intraspecific segregating populations of G. hirsutum with different target traits, such as fiber quality traits [4-6], yield and its components [7], resistance traits [8-10], early maturation traits [2, 11, 12] and drought-related traits [13]. In a study of traits associated with early maturity in cotton, more than 70 related QTLs were detected by linkage mapping [2, 11, 12]. These QTLs may be valuable for improving early maturity by MAS. Association mapping is another effective approach for connecting phenotypes and genotypes in plants when information on population structure and linkage disequilibrium (LD) is available [14]. This method is convenient because it helps to avoid the difficulty of screening large biparental mapping populations. Association mapping was introduced to maize genetics in 2001 [14] and has been subsequently applied in studies of many plant species [15]. Association mapping is widely used to identify molecular markers associated with target traits, and it has been employed in genetic studies of rice, maize, wheat and other important agricultural crops [16-19]. Genome-wide association studies (GWAS) represent a powerful approach for identifying the locations of genetic factors that underlie complex traits [20]. GWAS have been successfully implemented in Arabidopsis thaliana [21, 22], rice [20, 23], maize [24] and soybean [25] for the identification of single nucleotide polymorphism (SNP) loci and candidate genes for various ecological and agricultural traits. In recent years, association mapping has also been widely used in studies of cotton [10, 19, 26–30]. For example, Abdurakhmonov et al. [19] performed association mapping to examine QTLs related to fiber-quality traits in G. hirsutum accessions using microsatellite markers. Further, Kantartzi and Stewart [26] detected QTLs related to fiber quality in G. arboreum accessions using association mapping with simple sequence repeat (SSR) markers. Recently, Association mapping was performed to assess QTL alleles during three cotton breeding periods, revealing that some alleles could be detected in nearly all of the Chinese cotton cultivars studied [29]. Favorable QTL alleles for yield and its components have been identified via association mapping in Chinese upland cotton cultivars [28]. Some QTL alleles associated with verticillium wilt resistance in upland cotton have also been detected using this approach [10]. However, few QTLs for cotton early maturity traits have been identified via association mapping. To better understand the genetic architecture of early maturity traits in upland cotton, genome-wide SNP discovery based on the specific-locus amplified fragment sequencing (SLAF-seq) method and a GWAS strategy were used to identify the SNP loci associated with early maturity traits. We successfully identified several significant associations between SNP loci and early maturity traits using the generalized linear model (GLM) and mixed linear model (MLM). The highly favorable SNP alleles for early maturity traits were mined by computing the phenotypic effect of each SNP locus identified, and the pyramiding effects of the highly favorable SNP alleles for these traits were assessed. Moreover, major SNP loci and potential candidate genes for early maturity were detected. The results of this important study serve as a foundation for analyses of the genetic mechanisms underlying cotton earliness and for MAS for early maturity in cotton.

Results

Genome and chromosome characteristics of SLAF-based SNPs in upland cotton varieties

SLAF-seq was performed with an Illumina HiSeq 2500 (Illumina, Inc.; San Diego, CA, US) at Biomarker Technologies Corporation in Beijing to genotype 355 cotton varieties/accessions. The sequencing run generated 96.10 Gb of data, including 874.44 million paired-end reads with an length of ~80 bp. The Q30 ratio and guanine-cytosine (GC) content, which are indicators of sequencing quality, were 89.75 and 39.11 %, respectively, indicative of good quality. A total of 678,397 high-quality SLAF tags were obtained for each of the 355 genotypes, and 505,823 polymorphic SLAFs were identified from these reads by performing sequence alignments with the TM-1 reference genome [31]. The SLAFs, which had an average depth of 5.39-fold per sample among the 355 varieties/accessions, were used for calling SNPs. A total of 691,978 SNPs were initially called for the 355 genotypes (Fig. 1). SNP loci with a minor allele frequency (MAF) of <5 % cannot be used in association analyses; thus, most of the SNPs (88.20 %) were removed, and the remaining 81,675 SNPs with an MAF ≥0.05 were used in subsequent analyses.
Fig. 1

Single nucleotide polymorphism (SNP) distributions on the 26 chromosomes of upland cotton. At1 ~ At13 and Dt1 ~ Dt13 in vertical axis are the serial number of 26 chromosomes; The horizontal axis shows chromosome length (Mb); 0 ~ 50 depicts SNP density (the number of SNPs per window)

Single nucleotide polymorphism (SNP) distributions on the 26 chromosomes of upland cotton. At1 ~ At13 and Dt1 ~ Dt13 in vertical axis are the serial number of 26 chromosomes; The horizontal axis shows chromosome length (Mb); 0 ~ 50 depicts SNP density (the number of SNPs per window) The 81,675 SNP markers covered all 26 chromosomes. The largest number of markers was identified on chromosome Dt1 (5882 SNPs), and the smallest was identified on chromosome At7 (1006 SNPs). The average marker density was approximately one SNP per 24.85 kb. The highest marker density was detected on chromosome Dt8 (one SNP per 15.76 kb), and the smallest was identified on chromosome At3 (one SNP per 36.24 kb) (Fig. 1, Table 1).
Table 1

SNP distribution on each chromosome

ChromosomeSNP numberChromosome length (Mb)SNP densitya (kb)ChromosomeSNP numberChromosome length (Mb)SNP densitya (kb)
At14553106.9923.50Dt15882117.2419.93
At2289061.7521.37Dt2280150.1917.92
At32769100.3536.24Dt3146442.8029.23
At4420792.2921.94Dt4216637.3617.25
At53688102.5627.81Dt5390365.5716.80
At6272156.4820.76Dt6237844.8018.84
At7100623.2723.13Dt7148047.9832.42
At8254078.7531.00Dt8361156.8915.76
At9426891.1221.35Dt9377470.9118.79
At10487795.4819.58Dt10302657.8019.10
At114032102.0225.30Dt11256761.1223.81
At12256479.1630.87Dt12165839.3823.75
At13445691.5520.55Dt13239455.8723.34

aSNP density is presented as the average physical distance between two adjacent SNP loci

SNP distribution on each chromosome aSNP density is presented as the average physical distance between two adjacent SNP loci

Population structure and linkage disequilibrium

To estimate the number of subgroups in the population of 355 upland cotton accessions, structure analysis was performed using 81,675 SNPs from the 355 accessions. The results indicated that the minimum number of cross-validation errors was K = 9, which was thus determined to be the optimum K, and that the testing accessions could be separated into nine subpopulations (Fig. 2a, b). Subpopulations 1–9 included 60, 30, 25, 27, 45, 66, 20, 65, and 17 accessions, respectively. To represent the genetic diversity among the 355 accessions, a total of 185 upland cotton lines were screened, which included approximately 50 % of the accessions of each of the subpopulations, taking into consideration the diverse geographic origins and maturity traits. A total of 32, 16, 13, 15, 24, 35, 12, 30 and 8 lines were selected from each of the subpopulations 1–9, respectively. Most of these upland cotton accessions from each subpopulation had mixed ancestry, and the obvious geographic subpopulation was not found, indicating that these lines might have experienced introgression or gene flow during cotton breeding in China.
Fig. 2

Population structure and linkage disequilibrium (LD) decay of upland cotton accessions. a Population structure of upland cotton accessions; each line is represented by a single vertical line, and each color represents one cluster; b estimated ln(cross-validation errors in the data) calculated for K, ranging from 2 to 10; c the mean LD decay rate was estimated as the squared correlation coefficient (r2) using all pairs of SNPs located within 600 kb of physical distance in genomic regions in a population of 355 upland cotton germplasm accessions

Population structure and linkage disequilibrium (LD) decay of upland cotton accessions. a Population structure of upland cotton accessions; each line is represented by a single vertical line, and each color represents one cluster; b estimated ln(cross-validation errors in the data) calculated for K, ranging from 2 to 10; c the mean LD decay rate was estimated as the squared correlation coefficient (r2) using all pairs of SNPs located within 600 kb of physical distance in genomic regions in a population of 355 upland cotton germplasm accessions To determine the mapping resolution for GWAS, we quantified the average extent of LD decay. Using the whole set of SNPs, the LD decay rate of the population for the entire genome was estimated to be 100 kb, with r2 = 0.07 at half of the maximum value (Fig. 2c).

Phenotypic characteristics of traits related to early maturity

A core set of 185 upland cotton lines was selected for association analysis based on analysis of the population structure, and the traits of these lines related to early maturity were investigated across four field environments. The mean whole growth period (WGP) durations were 116.61, 117.92, 118.03 and 120.39 d in the four experiments, respectively. The minimum WGP was 96.67 d in SU-2013, and the maximum WGP was 147.00 d in SP-2014. Analogously, the FT and FBP exhibited wide ranges of 53.00–80.67 d and 38.00–73.67 d, with means of 66.59 and 51.64 d, respectively. The NFFB ranged from 3.00 to 12.00, with a mean of 6.50. The mean HNFFB values exhibited continuous variation, ranging from 15.45 to 34.03 cm. The YPBF exhibited the largest range of variation, ranging from 1.55 to 100 %. The mean coefficients of variance (CVs) for the WGP, FT, FBP, NFFB, HNFFB and YPBF were 6.88, 5.91, 8.79, 15.79 16.92 and 18.11 %, respectively. These data indicated a high degree of diversity in early maturity phenotypic traits in the natural population. Based on the WGP, the number of early-maturing accessions (106 d < WGP ≤112 d), early-middle-maturing accessions (114 d < WGP ≤120 d) and middle-late-maturing accessions (122 d < WGP ≤128 d) were 62 (33.51 %), 20 (10.81 %) and 59 (31.89 %), respectively. The early-middle-maturing accessions accounted for a very small percentage, thus these traits were typically bimodally distributed (Fig. 3, Additional file 1: Table S1).
Fig. 3

Frequency distributions of the mean values of six maturity traits of 185 cotton accessions in four environments. a whole growth period (WGP); b flowering time (FT); c flowering and boll-setting period (FBP); d node of the first fruiting branch (NFFB); e height of the node of the first fruiting branch (HNFFB); and f yield percentage before frost (YPBF)

Frequency distributions of the mean values of six maturity traits of 185 cotton accessions in four environments. a whole growth period (WGP); b flowering time (FT); c flowering and boll-setting period (FBP); d node of the first fruiting branch (NFFB); e height of the node of the first fruiting branch (HNFFB); and f yield percentage before frost (YPBF) Analysis of variance (ANOVA) indicated that the genotype (G) and interactions between the genotype and environmental factors (G × E) were both significant (P < 0.01) for all six traits (Additional file 1: Table S1). The correlation coefficients for the association of the WGP with the FT, FBP, NFFB, HNFFB and YPBF were 0.9541, 0.9659, 0.8775, 0.8513 and −0.9230, respectively. These results indicated that the WGP was significantly associated with the FT, FBP, NFFB, HNFFB and YPBF in all four environments (P < 0.01) (Additional file 1: Table S2).

GWAS for early maturity traits

To investigate the genotypic variation that underlies the traits related to early maturity in cotton, GWAS was performed to identify the associated SNP loci in upland cotton accessions. In the GLM, 13 associations were found to be significant between 8 SNP loci and five traits related to early maturity (all traits except for the HNFFB) according to the best linear unbiased predictions (BLUPs) and in at least two of the four environments (-lg(p) ≥6.21). Of these SNP loci, 50 % were distributed on chromosome Dt3, and 25 % were distributed on chromosome At3. Among these associations, five associations each with the WGP and FT were identified, as well as one association each with the FBP, NFFB and YPBF; the corresponding SNP loci were distributed on chromosome Dt3. The SNP loci for various early maturity traits identified through GWAS explained 5.36–15.56 % of the phenotypic variance (Additional file 1: Table S3, Fig. 4 and Additional file 2: Figure S1 and Additional file 3: Figure S2). Among these associated SNP loci, three were co-associated with two or more different traits. For example, rs13562854 (Dt3) was simultaneously associated with the WGP, FT, NFFB and YPBF (Additional file 1: Table S3, Fig. 4). The MLM results indicated that two associations were significant between one SNP locus and two traits (-lg(p) ≥ 6.21), i.e., one SNP locus (rs13562854) on chromosome Dt3 was found to be simultaneously associated with the WGP and FT according to BLUPs and in two of the four environments, explaining 9.23–16.46 % of the phenotypic variance (Additional file 1: Table S3, Fig. 5 and Additional file 4: Figure S3 and Additional file 5: Figure S4). It was very important and meaningful that the SNP locus rs13562854 was simultaneously associated with the WGP and FT and was detected via both the GLM and MLM (Additional file 1: Table S3, Figs. 4 and 5).
Fig. 4

Manhattan plots of genome-wide association studies (GWAS) for the WGP (a), FT (b), NFFB (c) and YPBF (d) measured with the generalized linear model (GLM) using the best linear unbiased prediction (BLUP) values for the four environments. The SNP locus rs13562854 is indicated by the black arrow. The general and highly significant trait-associated SNPs are distinguished by the red and blue threshold lines, respectively

Fig. 5

Manhattan plots of genome-wide association studies (GWAS) for the WGP (a) and FT (b) measured with the mixed linear model (MLM) using the best linear unbiased prediction (BLUP) values for the four environments. The SNP locus rs13562854 is indicated by the black arrow. The general and highly significant trait-associated SNPs are distinguished by the red and blue threshold lines, respectively

Manhattan plots of genome-wide association studies (GWAS) for the WGP (a), FT (b), NFFB (c) and YPBF (d) measured with the generalized linear model (GLM) using the best linear unbiased prediction (BLUP) values for the four environments. The SNP locus rs13562854 is indicated by the black arrow. The general and highly significant trait-associated SNPs are distinguished by the red and blue threshold lines, respectively Manhattan plots of genome-wide association studies (GWAS) for the WGP (a) and FT (b) measured with the mixed linear model (MLM) using the best linear unbiased prediction (BLUP) values for the four environments. The SNP locus rs13562854 is indicated by the black arrow. The general and highly significant trait-associated SNPs are distinguished by the red and blue threshold lines, respectively

Mining of highly favorable SNP alleles associated with early maturity traits

In our study, SNP alleles with positive effects that led to decreases in the WGP, FT, FBP, NFFB and HNFFB or an increase in the YPBF were defined as “favorable alleles”, and those that resulted in increases in the WGP, FT, FBP, NFFB and HNFFB or a decrease in the YPBF were defined as “unfavorable alleles”. Among the favorable SNP alleles, rs26538646 (tightly linked with rs26538688), rs13562854, rs8917898 and rs13153008 had the strongest positive phenotypic effects on the WGP, shortening it by 6.70 d, 7.53 d, 7.58 d and 7.76 d, respectively; in addition, rs22465987, rs48627288, rs13562854, rs8917898 and rs37255056 shortened the FT by 0.07 d, 0.55 d, 3.88 d, 3.69 d and 3.40 d, respectively; rs13153008 shortened the FBP by 4.09 d; and rs13562854 shortened the NFFB by 0.91, whereas it increased the YPBF by 10.45 %. These findings indicated that the phenotypic characteristics of the genotypes with favorable SNP alleles were significantly enhanced compared with those of the genotypes with unfavorable SNP alleles, with the exception of rs22465987 and rs48627288 (ANOVA; P < 0.01). The highly favorable SNP alleles exhibited significantly different traits compared with the unfavorable alleles (P < 0.01). Finally, the eleven highly favorable SNP alleles were mined by ANOVA. The numbers of highly favorable SNP alleles for the WGP, FT, FBP, NFFB and YPBF were 5, 3, 1, 1 and 1, respectively (Table 2).
Table 2

Favorable SNP alleles, their phenotypic effects (ai), and representative accessions

TraitsSNPPositionAllelesFavorable allelesai AccessionsRepresentative accessionsa
WGP rs26538646 At3:26538646A/GA−6.70** 59zhongmiansuo74, xia25, zhong416
rs26538688 At3:26538688G/TG−6.70** 59zhongmiansuo74, xia25, zhong416
rs13562854 Dt3:13562854A/GA−7.53** 66zhongmiansuo74, xia25, zhong416
rs8917898 Dt3:8917898A/GG−7.58** 49zhong6426, zhong51822, xia13-7
rs13153008 Dt3:13153008A/GA−7.76** 42xia25, zhong416, baimian17
FT rs22465987 At4:22465987A/GG−0.07941476, zhongmiansuo74, xiaomian3
rs48627288 At12:48627288A/GA−0.5586xia25, xiazao3, zhongmiansuo14
rs13562854 Dt3:13562854A/GA−3.88** 66xiazao2,zhongmiansuo74,xia25
rs8917898 Dt3:8917898A/GG−3.69** 49xia25, 1476, xiazao3
rs37255056 Dt3:37255056A/GA−3.40** 56xia25, 1476, xiazao3
FBP rs13153008 Dt3:13153008A/GA−4.09** 42zhong416, xia25, zhongmiansuo64
NFFB rs13562854 Dt3:13562854A/GA−0.91** 66xiaozao2, xiazao3, xia25
YPBF rs13562854 Dt3:13562854A/GA10.45** 66zhongmiansuo74, xia25, xiazao3

aRepresentative accessions consist of the top 3 entries for the target trait values of accessions with the corresponding favorable alleles; **highly favorable SNP alleles that exhibit significantly different traits compared with the unfavorable alleles (P < 0.01)

Favorable SNP alleles, their phenotypic effects (ai), and representative accessions aRepresentative accessions consist of the top 3 entries for the target trait values of accessions with the corresponding favorable alleles; **highly favorable SNP alleles that exhibit significantly different traits compared with the unfavorable alleles (P < 0.01)

Pyramiding effects of highly favorable SNP alleles associated with early maturity traits

To determine whether the highly favorable SNP alleles for traits related to early maturity had pyramiding effects, the mean WGP, FT, FBP, NFFB and YPBF values of the accessions that contained different numbers of highly favorable SNP alleles were analyzed by ANOVA. The results indicated that earlier maturation occurred in the cotton accessions with the highly favorable SNP alleles compared with those without these alleles, as well as those with fewer of these alleles (Table 3). For example, the average WGP of the genotypes without highly favorable alleles was 125.05 d, that of those with a single highly favorable allele was 117.39 d, that of those with two highly favorable alleles was 113.55 d, and that of those with four highly favorable alleles was 108.84 d.
Table 3

Pyramiding effects of the highly favorable alleles that contribute to early maturity

TraitsNo. of favorable allelesMean ± SDFrequency (%)
WGP (d)0125.05 ± 2.66 (A)41.46
1117.39 ± 5.83 (B)7.32
2113.55 ± 5.89 (B)12.20
3
4108.84 ± 2.63 (C)39.02
FT (d)069.75 ± 2.26 (A)57.48
164.26 ± 1.99 (B)11.81
264.42 ± 2.36 (B)1.57
362.3 ± 1.10 (C)29.13
FBP (d)054.71 ± 3.36 (A)39.62
147.55 ± 2.1 (B)60.38
NFFB07.23 ± 0.82 (A)59.51
15.59 ± 0.38 (B)40.49
YPBF (%)061.03 ± 9.59 (A)59.51
182.19 ± 4.48 (B)40.49

Values with different letters are significantly different (P < 0.05)

Pyramiding effects of the highly favorable alleles that contribute to early maturity Values with different letters are significantly different (P < 0.05) In addition, to further assess the pyramiding effects of the highly favorable SNP alleles on the early maturity response, linear regression was conducted with the number of highly favorable SNP alleles and the average WGP and FT values for the four environments. Two significant linear correlations were detected between the WGP and number of highly favorable SNP alleles (R2 = 0.8107) and between the FT and number of highly favorable SNP alleles (R2 = 0.6988), further confirming the pyramiding effects of the highly favorable alleles (Fig. 6). These findings demonstrate that the highly favorable SNP alleles had significant pyramiding effects on the WGP and FT.
Fig. 6

Linear regression analyses of the numbers of highly favorable SNP alleles and average WGP (a) and FT values (b) in four environments

Linear regression analyses of the numbers of highly favorable SNP alleles and average WGP (a) and FT values (b) in four environments

A major locus on chromosome Dt3 and candidate genes that potentially underlie early maturity

The most favorable SNP locus (rs13562854) associated with both the WGP and FT in the GLM and MLM was used to compare the differences between the accessions that carried favorable alleles and those that carried unfavorable alleles in six traits related to early maturity. The mean phenotypic value of 66 accessions that contained a favorable allele (A) was significantly better (lower for the WGP, FT, FBP, NFFB and HNFFB and higher for the YPBF) compared with the remaining accessions that contained unfavorable alleles (G) (Fig. 7). This finding demonstrates that rs13562854 on chromosome Dt3 is a major locus for early maturity in upland cotton.
Fig. 7

Favorable alleles (A) and unfavorable alleles (G) at the SNP locus rs13562854 for six traits related to early maturity in the four environments. a–f represent six traits related to early maturity WGP, FT, FBP, NFFB, HNFFB and YPBF, respectively; *, ** indicate significance at probability levels of 0.05 and 0.01, respectively; SP-2013, SU-2013, SP-2014 and SU-2014 are the four environments

Favorable alleles (A) and unfavorable alleles (G) at the SNP locus rs13562854 for six traits related to early maturity in the four environments. a–f represent six traits related to early maturity WGP, FT, FBP, NFFB, HNFFB and YPBF, respectively; *, ** indicate significance at probability levels of 0.05 and 0.01, respectively; SP-2013, SU-2013, SP-2014 and SU-2014 are the four environments A total of 32 genes were annotated in the 1 Mb regions within 500 kb on either side of the most favorable SNP allele (rs13562854) (Table 4). Among these genes, definite biological function annotations could not be determined for six, and ten were annotated as putative or hypothetical proteins; among the remaining genes, 16 possessed domains of known function, and four of these 16 genes (CotAD_01914, CotAD_01926, CotAD_01936 and CotAD_01947) had potential involvement in the early maturity response in plants. Two early-maturing cotton varieties and two late-maturing varieties were selected. The WGPs of the early-maturing varieties zhongmiansuo50 and zhongmiansuo74 were 107.92 d and 102.75 d, respectively, and those of the late-maturing varieties lumianyan28 and zhongmiansuo41 were 124.17 d and 126.67 d, respectively (Fig. 8a and b). Similarly, the FT of the early-maturing varieties was significantly shorter than that of the late-maturing varieties (P < 0.01) (Fig. 8c). The transcription levels of the 32 genes were assessed by qRT-PCR using samples from the roots, stems, leaves, flowers, ovules and fibers of upland cotton. Examples of these results are shown in Additional file 6: Figure S5A. In particular, high expression levels of CotAD_01947 and CotAD_01914 were detected in the leaves, whereas low expression levels were identified in the roots, stems, flowers, ovules and fibers (Fig. 8d and Additional file 6: Figure S5A). In addition, qRT-PCR was used to examine the expression patterns of 16 genes in two early-maturing varieties and two late-maturing cotton varieties at five different leaf growth stages (cotyledon and one-leaf to four-leaf stages). From the two-leaf stage to the four-leaf stage, the expression of CotAD_01947 in the early-maturing varieties zhongmiansuo50 and zhongmiansuo74 was significantly higher than that in the late-maturing varieties lumianyan28 and zhongmiansuo41 (P < 0.01) (Fig. 8e). However, the expression of the other genes investigated did not significantly differ between the early-maturing and late-maturing varieties (Additional file 6: Figure S5B and C). These data provide support for CotAD_01947 as a candidate gene for early maturity in upland cotton.
Table 4

Candidate genes most highly associated with early maturity within 500 kb of either side of the SNP locus rs13562854

#GeneIDStartStopDirectionDistance to SNP (kb)Annotation
CotAD_01929 1348273613483020Forward79.83
CotAD_01940 1383698313837348Reverse274.13Tetratricopeptide repeat-like superfamily protein, putative
CotAD_01935 1368553613686374Reverse122.68Zinc finger protein, putative isoform 1
CotAD_01920 1317386913177058Forward385.80Enolase 1, chloroplastic-like protein
CotAD_01932 1366962013671044Reverse106.77Zinc finger protein, putative isoform 1
CotAD_01921 1321546413215928Reverse346.93Proline and serine-rich 1
CotAD_01931 1364193113643458Reverse79.08Ribonuclease P subunit p30
CotAD_01934 1368007513680812Forward117.22Hypothetical protein F383_23360
CotAD_01930 1354990113550143Forward12.71
CotAD_01939 1383521113836617Forward272.36UDP-glycosyltransferase 89B1-like
CotAD_01941 1383742613838590Reverse274.57Tetratricopeptide repeat-like superfamily protein, putative
CotAD_01943 1389412213897285Reverse331.27Hypothetical protein F383_21541
CotAD_01949 1402701514029396Reverse464.16ADP, ATP carrier protein ER-ANT1-like
CotAD_01928 1342557813427698Reverse135.16DNA-directed RNA polymerases I and III subunit RPAC1
CotAD_01944 1392230613923205Forward359.45
CotAD_01942 1383950113840400Reverse276.65UDP-glucosyl transferase 89B1, putative
CotAD_01919 1316962713172234Forward390.62DnaJ, mitochondrial
CotAD_01937 1376342013764002Forward200.57
CotAD_01926 1331330113314846Forward248.01Zinc finger CONSTANS-LIKE 2-like protein
CotAD_01915 1309542813096529Forward466.33UDP-N-acetylmuramoyl-alanine-D-glutamate ligase
CotAD_01922 1323392113234454Forward328.40
CotAD_01914 1306657113067059Forward495.80Agamous-like MADS-box protein A
CotAD_01938 1377158813773328Forward208.73Crooked neck-like protein 1
CotAD_01924 1325291013256745Forward306.11Serine/threonine protein kinase 16
CotAD_01946 1399007513991007Forward427.22OBF-binding protein 4, putative
CotAD_01948 1402094714021135Reverse458.09Hypothetical protein CISIN_1g035470mg
CotAD_01947 1401568414017498Reverse452.83MADS-box protein
CotAD_01918 1313591913143770Forward419.08Putative acyl-activating enzyme 17, peroxisomal-like protein
CotAD_01945 1395697413957540Reverse394.12ARM repeat superfamily protein
CotAD_01923 1323633513238168Forward324.69Hypothetical protein F383_15236
CotAD_01916 1310375613106200Forward456.65
CotAD_01936 1371738413722424Reverse154.53WD repeat and HMG-box DNA-binding 1
Fig. 8

Increased expression of the MADS-box family gene CotAD_01947 in early-maturing cultivars of upland cotton. a Plants at the boll-opening stage of two early-maturing and two late-maturing cotton varieties. b and c Phenotypic effect values of the WGP and FT for two early-maturing and two late-maturing varieties. d Tissue-specific expression patterns of CotAD_01947. e Expression levels of CotAD_01947 during the five different leaf growth stages. **indicates significance at the 0.01 probability level

Candidate genes most highly associated with early maturity within 500 kb of either side of the SNP locus rs13562854 Increased expression of the MADS-box family gene CotAD_01947 in early-maturing cultivars of upland cotton. a Plants at the boll-opening stage of two early-maturing and two late-maturing cotton varieties. b and c Phenotypic effect values of the WGP and FT for two early-maturing and two late-maturing varieties. d Tissue-specific expression patterns of CotAD_01947. e Expression levels of CotAD_01947 during the five different leaf growth stages. **indicates significance at the 0.01 probability level

Discussion

Identification and verification of SNP loci associated with traits related to early maturity in upland cotton

Both linkage mapping and association analysis provide tools for interpreting the genes that underlie complex traits. To date, linkage mapping is a major method for the mining of QTLs for early maturity traits in cotton. Based on the findings of previous studies, it can be concluded that only preliminary progress has been achieved toward localization of QTLs for cotton early maturity traits with desirable effects in the segregation population (F2 populations and recombinant inbred lines (RILs)) [2, 11, 32], and these findings require further verification. Although several studies have identified QTLs for early maturity traits by association analysis in upland cotton [33, 34], these studies were limited by the sizes of the SSR markers and germplasm populations. As the availability of whole-genome sequences has increased and they have become more cost-effective to generate, the practicality of GWAS has increased. In our study, to improve the efficiency and accuracy of association analysis, a wider selection of germplasm resources for upland cotton was collected that was selected based on maturity traits. Further, a substantial number of SNP markers were developed by genome sequencing. Thirteen associations were identified between 8 SNP loci and five early maturity traits (-lg(p) ≥6.21) (Additional file 1: Table S3). Thus, this study has addressed gaps in the study of cotton early maturity traits using GWAS. Most importantly, a main SNP locus for the WGP and FT was identified on chromosome Dt3. In a previous study, one significant QTL for the GP, BP and YPBF was found to be located close to the bridge markers DPL0041 and CIR347 on Chr17 (D3) in two biparental populations, explaining 20.00 % of the phenotypic variation [2]. The physical locations of these SSR markers were mapped to the genome sequence by electronic PCR (e-PCR) (Fig. 9), and a main SNP locus (rs13562854) for the WGP and FT was positioned between DPL0200 and CIR347. This finding validates the GWAS results and increases confidence in the identity of the main SNP locus (rs13562854).
Fig. 9

Physical maps and linkage relationships among quantitative trait loci (QTLs) in previous and present studies. Physical maps of the reference Gossypium hirsutum genome D03 [56] and Dt3 [31] from the present study, respectively. Linkage map of C17 (D3) from a previous study [2]

Physical maps and linkage relationships among quantitative trait loci (QTLs) in previous and present studies. Physical maps of the reference Gossypium hirsutum genome D03 [56] and Dt3 [31] from the present study, respectively. Linkage map of C17 (D3) from a previous study [2]

Mining of favorable SNP alleles and candidate genes to improve early maturity in cotton

Obtaining satisfactory yield and quality during a short growing season is complicated due to conflict between early maturity and yield, as well as between early maturity and fiber quality; thus, it is increasingly difficult to simultaneously improve upon these agriculturally desirable traits in early-maturing cotton using traditional breeding methods. Therefore, the mining of favorable SNP (or QTL) alleles is necessary for improving important agronomic traits in upland cotton cultivars via MAS. Association mapping is one of the most effective approaches for the mining of favorable alleles. Elite alleles for fiber-quality traits [30] and yield and its components [28] in upland cotton cultivars/accessions were explored via association analysis. In our study, by comparing the average phenotypic effect value of each allele for the target traits in the thirteen stable associations detected, we identified eleven highly favorable alleles for five early maturity traits (Table 1). Moreover, the examination of favorable SNP alleles and germplasm resources for early maturity traits, such as zhongmiansuo74, xia25, and xiazao3, could be useful for plant breeders; however, the effects of these alleles must be verified. Therefore, the positive effects of highly favorable alleles were selected and assessed. To date, many studies have demonstrated that marker-based gene pyramiding strategies are very effective [35-37]. Dosage pyramiding effects of the highly favorable SNP alleles were also demonstrated (Table 2, Fig. 5); thus, the highly favorable alleles identified in this study have substantial potential for the development of early-maturing upland cotton cultivars in future breeding programs. Of particular interest, the detailed annotations revealed that the major locus rs13562854 was located on chromosome Dt3 and that the 32 candidate genes in the nearby region were the most highly associated with the WGP and FT. Specifically, four candidates (CotAD_01914, CotAD_01926, CotAD_01936 and CotAD_01947) related to plant floral development were annotated. CotAD_01947 and CotAD_01914 were located -452.83 kb (backward) and 495.80 kb (forward), respectively, from the peak SNP (rs13562854), with MADS-box genes that encode transcription factors involved in plant developmental control and signal transduction [38]. Notably, a WD repeat (WDR) gene (CotAD_01936) was identified 154.53 kb from the rs13562854 locus. Plant WDR proteins are intimately involved in various cellular and organismal processes, including cell division and cytokinesis, apoptosis, light signaling and vision, cell motility, flowering, floral development and meristem organization [39]. CotAD_01947 expression in the early-maturing varieties zhongmiansuo50 and zhongmiansuo74 was significantly higher than that in the late-maturing varieties lumianyan28 and zhongmiansuo41. However, expression of the other genes did not significantly differ between the early-maturing and late-maturing varieties (Additional file 6: Figure S5 B and C). MADS-box family genes play significant roles in plant growth and development, and they also control flowering time and flower initiation [40, 41]. AGAMOUS-LIKE8 (AGL8, AT5G60910) in Arabidopsis is another MADS-box family member that regulates the transcription of genes required for cellular differentiation and floral determination [42-44]. The BLAST alignment results indicated that the coding sequence (CDS) identity of CotAD_01947 with the Arabidopsis AGL8 gene was as high as 47.50 % (Additional file 7: Figure S6A) and that CotAD_01947 encoded a protein that shared 50.90 % sequence identity with the Arabidopsis AGL8 protein (Additional file 7: Figure S6B). In addition, although fifty-three MADS-box genes have been identified in upland cotton to date [45], few molecular studies of MADS-box genes in G. hirsutum have been conducted. For example, GhMADS11 affects cell elongation in fibers [46], GhMADS7 regulates anther development [47], and GhMADS3 participates in flower development [48]. GhMADS42 in Arabidopsis accelerates flowering, and GhMADS42 transgenic plants exhibit abnormal floral organ phenotypes [49]. In addition, we found that CotAD_01947 shared 50.90 % amino acid sequence identity with Arabidopsis AGL8 (Additional file 7: Figure S6B), that most MADS-box family genes in upland cotton regulated flower development, and that CotAD_01947 expression in early-maturing cotton was higher than that in late-maturing cotton (Fig. 8e). Thus, it is reasonable to postulate that CotAD_01947 may be a candidate gene for improving early maturity traits via the regulation and control of early flowering time in upland cotton. However, clear and definite identification of CotAD_01947 as an annotated MADS-box family gene requires further validation.

Conclusions

A substantial number of SNP markers in upland cotton were developed through SLAF-seq technology and were used in a GWAS. Thirteen significant associations were identified among eight SNP loci and five traits related to early maturity using the GLM and MLM, and two of the 13 associations were observed in both models. Eleven highly favorable SNP alleles for the WGP, FT, FBP, NFFB and YPBF were identified. Moreover, dosage pyramiding effects of the highly favorable SNP alleles and significant linear correlations between the number of highly favorable alleles and the phenotypic values of target traits were detected. Most importantly, a major locus (rs13562854) on chromosome Dt3 and a potential candidate gene (CotAD_01947) for early maturity were detected. The beneficial alleles and candidate gene should be useful for improving early maturity in upland cotton breeding via a molecular design approach.

Methods

SLAF-seq, sequencing data analysis and SNP calling

Three hundred fifty-five upland cotton accessions (260 varieties, 71 accessions collected from China, and ten additional varieties, ten accessions introduced from the United States, including the genetic standard line TM-1 and four varieties from central Asia) were used for genome sequencing. Seeds from the 355 upland cotton accessions were obtained from the cotton germplasm collection in our laboratory and from the low-temperature germplasm genebank of the Cotton Research Institute, Chinese Academy of Agricultural Sciences (CRI-CAAS). All accessions had been self-pollinated for more than three generations. Young leaves of ten plants from each of the 355 varieties/accessions were collected, mixed, frozen in liquid nitrogen, and used for DNA extraction. Genomic DNA was isolated from samples from each cotton variety/accession using the cetyltrimethylammonium bromide (CTAB) method, as described by Paterson et al. [50]; RNase A and proteinase K treatments were used to prevent RNA and protein contamination, and then the DNA extracts were subjected to Illumina sequencing and SSR-PCR amplification. The SLAF library was constructed as described by Sun et al. [51] with several modifications. A SLAF pilot experiment was performed, and the SLAF library was generated in accordance with the predesigned scheme. For this population, two enzymes (RsaI and HaeIII, New England Biolabs, NEB, USA) were used to digest the genomic DNA. A single nucleotide (A) overhang was subsequently added to the digested fragments using Klenow Fragment (3′ → 5′ exo−) (NEB) and dATP at 37 °C. Duplex tag-labeled sequencing adapters (PAGE-purified, Life Technologies, USA) were then ligated to the A-tailed fragments using T4 DNA ligase. PCR was performed using diluted restriction-ligation DNA samples, dNTP, Q5® High-Fidelity DNA Polymerase and PCR primers (forward primer: 5′-AATGATACGGCGACCACCGA-3′; and reverse primer: 5′-CAAGCAGAAGACGGCATACG-3′) (PAGE-purified, Life Technologies). Next, the PCR products were purified using Agencourt AMPure XP beads (Beckman Coulter, High Wycombe, UK) and pooled. The pooled samples were separated by 2 % agarose gel electrophoresis. Fragments that ranged in size from 314 to 364 bp (with indexes and adaptors) were excised and purified using a QIAquick gel extraction kit (Qiagen, Hilden, Germany). The gel-purified products were subsequently diluted. Paired-end sequencing (125 bp at each end) was performed using an Illumina HiSeq 2500 system (Illumina, Inc.; San Diego, CA, USA) according to the manufacturer’s recommendations. The raw reads (100 bp in length) were filtered and trimmed as follows: reads with ≥10 % unknown nucleotides were removed; reads with ≥30 % low-quality bases (base quality ≤10) were removed; reads with clear index information were trimmed; and low-quality bases at the 3′ ends of reads were removed. Read quality was considered acceptable if the Q30 ratio was ≥80 % after trimming and a paired sequence length of 80 bp was retained at each end. To evaluate sequence quality, real-time monitoring was performed in each cycle during sequencing, and the ratio of the number of high-quality reads with quality scores > Q30 (a quality score of 30 indicates a 0.10 % chance of an error and thus 99.90 % confidence) to the total number of raw reads and the GC content were calculated. BWA software was used to map the raw paired-end reads to the reference genome (Gossypium hirsutum v 1.0) [31]. SLAF groups were generated by grouping reads that were mapped to the same position. If an accession was only partly digested by the restriction enzymes, some reads that mapped to the reference genome overlapped by two SLAF tags. These reads were assigned to both SLAF tags in the accession. The GATK and SAMtools packages were used for SNP calling.

Population structure and linkage disequilibrium estimation

The ADMIXTURE [52] program was used to assess the population structure based on the maximum-likelihood method with 10,000 iterations, and the number of clusters (K) was set from 2 to 10. The SNPs were used after filtering for an MAF >0.05 and an identity of greater than 80 %. Pairwise LD between markers was calculated as the squared correlation coefficient (r2) of alleles using GAPIT software [53].

Field experiments and collection and analysis of phenotypic data

A subset of 185 lines was selected from the 355 upland cotton accessions from the cotton germplasm collection in our laboratory and from the low-temperature germplasm genebank of the CRI-CAAS. Selection was based on analyses of population structure and maturity, with the genotypes from the nine subpopulations characterized into two main groups according to maturity traits. The first group (103 genotypes) contained the early-maturing genotypes, including 76 varieties/accessions that originated from the Yellow River region, 15 varieties/accessions that originated from the northern specific early-maturing region, ten varieties/accessions that originated from the northwestern inland early-maturing region and two varieties introduced from the United States. The second group (82 genotypes) contained the late-maturing genotypes, including 69 varieties/accessions that originated from the Yellow River region, five varieties/accessions that originated from the Yangtze River region and 8 varieties introduced from the United States (Additional file 1: Table S4). The population was planted at the experimental station of the CRI-CAAS in Anyang, Henan (36°05 N; 114°21E). All cotton lines were sown at two time points, including late April and late May (referred to as SP-sowing and SU-sowing, respectively), in 2013 and 2014. The different cotton varieties/accessions were each grown in a single-row plot (5.0 m long and 0.8 m row wide), with three replicates and a random complete block design. The field management conformed to local practices. The following six traits related to early maturity were investigated in this study: WGP (the period from sowing to the first boll opening), FT (the period from sowing to the first flower blooming), FBP (the period from the first flower blooming to the first boll opening), NFFB (the number of nodes from the cotyledon node to the first fruiting branch node), and HNFFB (the distance between the cotyledon node and the NFFB) and YPBF (the seed yield percentage before October 25th). Ten consecutive plants in the middle of each row were tagged for trait measurements. These plants were observed, and the average value of three replicates was recorded. The phenotypic data were analyzed using SAS 9.3 statistical software (SAS, Chicago, IL, USA). To reduce environmental error, BLUPs for six early maturity traits per genotype were obtained using the PROC MIXED procedure of SAS9.3. ANOVA was performed using PROC ANOVA. Linear regression analysis was conducted using the GLM procedure in SAS.

GWAS and favorable allele identification

For all SNP loci and phenotypic data, we applied the GLM and MLM. In addition, to minimize the effects of environmental variation, BLUPs were computed for GWAS [24]. The BLUP values for the four environments and the phenotypic values of six early maturity traits for each environment were used in GWAS. The high-quality SNPs were filtered according to the MAF (MAF >0.05) and the integrity of each SNP (>50 %). These SNPs from 185 cotton accessions were used in association analysis conducted using the GLM and MLM with GAPIT software [53]. Bonferroni-adjusted P-values of ≤0.01 and 0.05 (-lg(p) ≥ 6.91 and -lg(p) ≥ 6.21, respectively) were used as thresholds to determine whether significant associations existed [54]. SNP loci significantly associated with the target traits based on the GWAS results were analyzed. According to the computational method described by Zhang et al. [29], the phenotypic effect of each SNP locus (ai) was estimated through comparison of the average phenotypic value for each accession for the specific locus with that of all accessions. The favorable alleles were subsequently identified according to the breeding objective of each target trait. For the WGP, FT, FBP, NFFB and HNFFB, ai < 0 indicates a favorable allele, and for the YPBF, ai > 0 indicates a favorable allele.

Quantitative real-time PCR

Total RNA was isolated from the samples using a Plant RNA Purification Kit (Tiangen, Beijing, China). Reverse transcription was conducted using a SuperScript III First-Stand Synthesis System to obtain cDNA for qRT-PCR (Invitrogen, Carlsbad, CA, USA). Transcript levels were subsequently determined by qRT-PCR using a 7500 Real-Time PCR System (Applied Biosystems, Foster City, CA, USA) and SYBR PremixEx Taq (2×) (TaKaRa). The gene-specific primer pairs used for PCR amplification are listed in Additional file 1: Table S5 and were designed to avoid conserved regions. To normalize the variance among samples, actin was used as an endogenous control, and the gene expression levels were calculated using the 2−ΔΔCT method [55].

Abbreviations

ANOVA, analysis of variance; BLUP, best linear unbiased prediction; CV, coefficients of variance; FBP, flowering and boll-setting period; FT, flowering time; GLM, generalized linear model; GWAS, genome-wide association study; HNFFB, height of the node of the first fruiting branch; LD, linkage disequilibrium; MAF, minor allele frequency; MAS, marker-assisted selection; MLM, mixed linear model; NFFB, node of the first fruiting branch; SLAF-seq, specific-locus amplified fragment sequencing; SNP, single nucleotide polymorphism; SSR, simple sequence repeat; WGP, whole growth period; YPBF, yield percentage before frost
  38 in total

1.  Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method.

Authors:  K J Livak; T D Schmittgen
Journal:  Methods       Date:  2001-12       Impact factor: 3.608

2.  Genome-wide association studies of 14 agronomic traits in rice landraces.

Authors:  Xuehui Huang; Xinghua Wei; Tao Sang; Qiang Zhao; Qi Feng; Yan Zhao; Canyang Li; Chuanrang Zhu; Tingting Lu; Zhiwu Zhang; Meng Li; Danlin Fan; Yunli Guo; Ahong Wang; Lu Wang; Liuwei Deng; Wenjun Li; Yiqi Lu; Qijun Weng; Kunyan Liu; Tao Huang; Taoying Zhou; Yufeng Jing; Wei Li; Zhang Lin; Edward S Buckler; Qian Qian; Qi-Fa Zhang; Jiayang Li; Bin Han
Journal:  Nat Genet       Date:  2010-10-24       Impact factor: 38.330

Review 3.  Development of floral organ identity: stories from the MADS house.

Authors:  G Theissen
Journal:  Curr Opin Plant Biol       Date:  2001-02       Impact factor: 7.834

Review 4.  MADS domain proteins in plant development.

Authors:  J L Riechmann; E M Meyerowitz
Journal:  Biol Chem       Date:  1997-10       Impact factor: 3.915

5.  GAPIT: genome association and prediction integrated tool.

Authors:  Alexander E Lipka; Feng Tian; Qishan Wang; Jason Peiffer; Meng Li; Peter J Bradbury; Michael A Gore; Edward S Buckler; Zhiwu Zhang
Journal:  Bioinformatics       Date:  2012-07-13       Impact factor: 6.937

6.  Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution.

Authors:  Fuguang Li; Guangyi Fan; Cairui Lu; Guanghui Xiao; Changsong Zou; Russell J Kohel; Zhiying Ma; Haihong Shang; Xiongfeng Ma; Jianyong Wu; Xinming Liang; Gai Huang; Richard G Percy; Kun Liu; Weihua Yang; Wenbin Chen; Xiongming Du; Chengcheng Shi; Youlu Yuan; Wuwei Ye; Xin Liu; Xueyan Zhang; Weiqing Liu; Hengling Wei; Shoujun Wei; Guodong Huang; Xianlong Zhang; Shuijin Zhu; He Zhang; Fengming Sun; Xingfen Wang; Jie Liang; Jiahao Wang; Qiang He; Leihuan Huang; Jun Wang; Jinjie Cui; Guoli Song; Kunbo Wang; Xun Xu; John Z Yu; Yuxian Zhu; Shuxun Yu
Journal:  Nat Biotechnol       Date:  2015-04-20       Impact factor: 54.908

7.  QTL analysis for early-maturing traits in cotton using two upland cotton (Gossypium hirsutum L.) crosses.

Authors:  Chengqi Li; Xiaoyun Wang; Na Dong; Haihong Zhao; Zhe Xia; Rui Wang; Richard L Converse; Qinglian Wang
Journal:  Breed Sci       Date:  2013-06-01       Impact factor: 2.086

8.  Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines.

Authors:  Susanna Atwell; Yu S Huang; Bjarni J Vilhjálmsson; Glenda Willems; Matthew Horton; Yan Li; Dazhe Meng; Alexander Platt; Aaron M Tarone; Tina T Hu; Rong Jiang; N Wayan Muliyati; Xu Zhang; Muhammad Ali Amer; Ivan Baxter; Benjamin Brachi; Joanne Chory; Caroline Dean; Marilyne Debieu; Juliette de Meaux; Joseph R Ecker; Nathalie Faure; Joel M Kniskern; Jonathan D G Jones; Todd Michael; Adnane Nemri; Fabrice Roux; David E Salt; Chunlao Tang; Marco Todesco; M Brian Traw; Detlef Weigel; Paul Marjoram; Justin O Borevitz; Joy Bergelson; Magnus Nordborg
Journal:  Nature       Date:  2010-03-24       Impact factor: 49.962

9.  SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing.

Authors:  Xiaowen Sun; Dongyuan Liu; Xiaofeng Zhang; Wenbin Li; Hui Liu; Weiguo Hong; Chuanbei Jiang; Ning Guan; Chouxian Ma; Huaping Zeng; Chunhua Xu; Jun Song; Long Huang; Chunmei Wang; Junjie Shi; Rui Wang; Xianhu Zheng; Cuiyun Lu; Xiaowu Wang; Hongkun Zheng
Journal:  PLoS One       Date:  2013-03-19       Impact factor: 3.240

10.  Quantitative trait loci pyramiding for fruit quality traits in tomato.

Authors:  Adriana Sacco; Antonio Di Matteo; Nadia Lombardi; Nikita Trotta; Biancavaleria Punzo; Angela Mari; Amalia Barone
Journal:  Mol Breed       Date:  2012-06-28       Impact factor: 2.589

View more
  40 in total

1.  Detection of Stable Elite Haplotypes and Potential Candidate Genes of Boll Weight Across Multiple Environments via GWAS in Upland Cotton.

Authors:  Zhen Feng; Libei Li; Minqiang Tang; Qibao Liu; Zihan Ji; Dongli Sun; Guodong Liu; Shuqi Zhao; Chenjue Huang; Yanan Zhang; Guizhi Zhang; Shuxun Yu
Journal:  Front Plant Sci       Date:  2022-06-13       Impact factor: 6.627

2.  Genome-wide association study identified genetic variations and candidate genes for plant architecture component traits in Chinese upland cotton.

Authors:  Junji Su; Libei Li; Chi Zhang; Caixiang Wang; Lijiao Gu; Hantao Wang; Hengling Wei; Qibao Liu; Long Huang; Shuxun Yu
Journal:  Theor Appl Genet       Date:  2018-03-01       Impact factor: 5.699

3.  Detection of Favorable QTL Alleles and Candidate Genes for Lint Percentage by GWAS in Chinese Upland Cotton.

Authors:  Junji Su; Shuli Fan; Libei Li; Hengling Wei; Caixiang Wang; Hantao Wang; Meizhen Song; Chi Zhang; Lijiao Gu; Shuqi Zhao; Guangzhi Mao; Chengshe Wang; Chaoyou Pang; Shuxun Yu
Journal:  Front Plant Sci       Date:  2016-10-21       Impact factor: 5.753

4.  Two genomic regions associated with fiber quality traits in Chinese upland cotton under apparent breeding selection.

Authors:  Junji Su; Libei Li; Chaoyou Pang; Hengling Wei; Caixiang Wang; Meizhen Song; Hantao Wang; Shuqi Zhao; Chi Zhang; Guangzhi Mao; Long Huang; Chengshe Wang; Shuli Fan; Shuxun Yu
Journal:  Sci Rep       Date:  2016-12-07       Impact factor: 4.379

5.  Genome-Wide Single-Nucleotide Polymorphisms in CMS and Restorer Lines Discovered by Genotyping Using Sequencing and Association with Marker-Combining Ability for 12 Yield-Related Traits in Oryza sativa L. subsp. Japonica.

Authors:  Imdad U Zaid; Weijie Tang; Erbao Liu; Sana U Khan; Hui Wang; Edzesi W Mawuli; Delin Hong
Journal:  Front Plant Sci       Date:  2017-02-08       Impact factor: 5.753

6.  Genome-wide association study reveals candidate genes influencing lipids and diterpenes contents in Coffea arabica L.

Authors:  Gustavo C Sant'Ana; Luiz F P Pereira; David Pot; Suzana T Ivamoto; Douglas S Domingues; Rafaelle V Ferreira; Natalia F Pagiatto; Bruna S R da Silva; Lívia M Nogueira; Cintia S G Kitzberger; Maria B S Scholz; Fernanda F de Oliveira; Gustavo H Sera; Lilian Padilha; Jean-Pierre Labouisse; Romain Guyot; Pierre Charmetant; Thierry Leroy
Journal:  Sci Rep       Date:  2018-01-11       Impact factor: 4.379

7.  High-density 80 K SNP array is a powerful tool for genotyping G. hirsutum accessions and genome analysis.

Authors:  Caiping Cai; Guozhong Zhu; Tianzhen Zhang; Wangzhen Guo
Journal:  BMC Genomics       Date:  2017-08-23       Impact factor: 3.969

8.  A genome-wide association study uncovers novel genomic regions and candidate genes of yield-related traits in upland cotton.

Authors:  Zhengwen Sun; Xingfen Wang; Zhengwen Liu; Qishen Gu; Yan Zhang; Zhikun Li; Huifeng Ke; Jun Yang; Jinhua Wu; Liqiang Wu; Guiyin Zhang; Caiying Zhang; Zhiying Ma
Journal:  Theor Appl Genet       Date:  2018-08-21       Impact factor: 5.699

9.  QTL Mapping for Fiber Quality and Yield Traits Based on Introgression Lines Derived from Gossypium hirsutum × G. tomentosum.

Authors:  Ayaz Ali Keerio; Chao Shen; Yichun Nie; Muhammad Mahmood Ahmed; Xianlong Zhang; Zhongxu Lin
Journal:  Int J Mol Sci       Date:  2018-01-14       Impact factor: 5.923

10.  Genome-Wide Association Study Identifying Candidate Genes Influencing Important Agronomic Traits of Flax (Linum usitatissimum L.) Using SLAF-seq.

Authors:  Dongwei Xie; Zhigang Dai; Zemao Yang; Jian Sun; Debao Zhao; Xue Yang; Liguo Zhang; Qing Tang; Jianguang Su
Journal:  Front Plant Sci       Date:  2018-01-09       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.