Literature DB >> 31199554

Genome-wide quantitative trait loci reveal the genetic basis of cotton fibre quality and yield-related traits in a Gossypium hirsutum recombinant inbred line population.

Zhen Zhang1,2, Junwen Li1,3, Muhammad Jamshed1,4, Yuzhen Shi1, Aiying Liu1, Juwu Gong1, Shufang Wang1, Jianhong Zhang1, Fuding Sun1, Fei Jia1, Qun Ge1, Liqiang Fan1, Zhibin Zhang1, Jingtao Pan1, Senmiao Fan1, Yanling Wang1, Quanwei Lu5, Ruixian Liu1, Xiaoying Deng1, Xianyan Zou1, Xiao Jiang1, Ping Liu1, Pengtao Li5, Muhammad Sajid Iqbal1, Chuanyun Zhang6, Juan Zou6, Hong Chen7, Qin Tian7, Xinhe Jia8, Baoqin Wang8, Nijiang Ai9, Guoli Feng9, Yumei Wang10, Mei Hong11, Shilin Li11, Wenming Lian12, Bo Wu12, Jinping Hua13, Chaojun Zhang1, Jinyong Huang3, Aixia Xu2, Haihong Shang1,3, Wankui Gong1,5, Youlu Yuan1,5.   

Abstract

Cotton is widely cultivated globally because it provides natural fibre for the textile industry and human use. To identify quantitative trait loci (QTLs)/genes associated with fibre quality and yield, a recombinant inbred line (RIL) population was developed in upland cotton. A consensus map covering the whole genome was constructed with three types of markers (8295 markers, 5197.17 centimorgans (cM)). Six fibre yield and quality traits were evaluated in 17 environments, and 983 QTLs were identified, 198 of which were stable and mainly distributed on chromosomes 4, 6, 7, 13, 21 and 25. Thirty-seven QTL clusters were identified, in which 92.8% of paired traits with significant medium or high positive correlations had the same QTL additive effect directions, and all of the paired traits with significant medium or high negative correlations had opposite additive effect directions. In total, 1297 genes were discovered in the QTL clusters, 414 of which were expressed in two RNA-Seq data sets. Many genes were discovered, 23 of which were promising candidates. Six important QTL clusters that included both fibre quality and yield traits were identified with opposite additive effect directions, and those on chromosome 13 (qClu-chr13-2) could increase fibre quality but reduce yield; this result was validated in a natural population using three markers. These data could provide information about the genetic basis of cotton fibre quality and yield and help cotton breeders to improve fibre quality and yield simultaneously.
© 2019 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

Entities:  

Keywords:  QTL clusters; consensus genetic map; fibre quality; fibre yield; gene expression level; genetic correlation; upland cotton

Mesh:

Substances:

Year:  2019        PMID: 31199554      PMCID: PMC6920336          DOI: 10.1111/pbi.13191

Source DB:  PubMed          Journal:  Plant Biotechnol J        ISSN: 1467-7644            Impact factor:   9.803


Introduction

Among the four cultivated Gossypium species, G. hirsutum (upland cotton) accounts for 90%–95% of the total cotton fibre production because of its wide adaptability and attractive fibre quality (Lacape et al., 2009; Park et al., 2005; Percival et al., 1999; Rong et al., 2004). In conventional breeding procedures, which use sexual hybridization, parents are selected based on their desired phenotypic performance for the target traits; for complex traits such as cotton fibre quality and yield, however, it becomes challenging to further improve one without compromising the other (Cao et al., 2015; Li et al., 2016a; Ning et al., 2014; Qin et al., 2008; Shen et al., 2005; Yu et al., 2013a,b). Therefore, uncovering the genetic basis of the fibre quality and yield traits in upland cotton is becoming increasingly necessary for the simultaneous improvement of fibre quality and yield in the future (Ning et al., 2014; Sun et al., 2012). Sequencing of the genomes of G. raimondii (Paterson et al., 2012; Wang et al., 2012), G. arboreum (Du et al., 2018; Li et al., 2014), G. barbadense (Liu et al., 2015; Yuan et al., 2015) and G. hirsutum (Li et al., 2015a; Wang et al., 2019; Zhang et al., 2015b) has provided new platforms for linkage map‐based QTL identification. To explain the genetic basis of cotton fibre quality and yield‐related traits, many genetic linkage maps have been constructed, including interspecific maps mainly between G. barbadense and G. hirsutum (Chandnani et al., 2018; Chee et al., 2005a,b; He et al., 2007; Lacape et al., 2010; Song et al., 2017; Wang et al., 2016; Yang et al., 2015; Zhai et al., 2016) and intra‐hirsutum maps (Liu et al., 2017; Ning et al., 2014; Shang et al., 2015, Sun et al., 2012; Tan et al., 2014; Tang et al., 2015; Zhang et al., 2012, 2017b; Zhang et al., 2016); QTLs have been identified based on these maps. Restriction fragment length polymorphism (RFLP) markers were used to construct the initial genetic map in cotton (Paterson et al., 2003; Rong et al., 2004; Ulloa et al., 2002, 2005), followed by simple sequence repeat (SSR) markers (Liu et al., 2017; Sun et al., 2012; Tan et al., 2014; Tang et al., 2015; Yang et al., 2015; Zhang et al., 2012, 2015c) and single nucleotide polymorphism (SNP) markers (Diouf et al., 2018; Hulse‐Kemp et al., 2015; Li et al., 2016a; Qi et al., 2017; Wang et al., 2015a,b; Zhang et al., 2016, 2017b). A QTL is a chromosome region that might cumulatively contribute to phenotypic variation; each QTL has a varying degree of efficacy for a specific trait (Paterson et al., 2003; Rong et al., 2004; Shen et al., 2005, 2007; Zhang et al., 2012; Tan et al., 2014; Wang et al., 2015a and Wang et al., 2015b). QTLs for fibre quality and yield traits have contributed to cotton breeding through marker‐assisted selection, fine mapping, functional characterization of genes and gene cloning. However, the maps used did not cover the whole genome, and the environments used to assess the phenotypes were insufficient (Diouf et al., 2018; Hulse‐Kemp et al., 2015; Jia et al., 2018; Li et al., 2016a; Liu et al., 2017; Qi et al., 2017; Tang et al., 2015; Wang et al., 2015a,b). Thus, many QTLs for fibre quality and yield traits remain to be identified and characterized across the upland cotton genome. It remains necessary to dissect the genetic basis of traits related to fibre quality and yield by identifying QTLs using a high‐density genetic linkage map with coverage of the entire upland cotton genome. In this study, we developed a population of intraspecific RILs in upland cotton consisting of 196 lines with the parents 0‐153 and sGK9708. The fibre quality and yield traits were evaluated across 22 environments. A whole‐genome‐based high‐density genetic linkage map was constructed by integrating five previously reported basic linkage maps generated using SSR (Jamshed et al., 2016; Sun et al., 2012), SSR and SNP‐BSA (bulked segregation analysis) (Zhang et al., 2015c), SNP‐specific‐locus amplified fragment sequencing (SLAF) (Zhang et al., 2016) and SNP‐chip (Zhang et al., 2017b). QTLs for fibre quality and yield traits were identified in 17 of the 22 environments. QTL clusters were also identified to determine the genetic basis of the correlation between fibre quality and fibre yield. This study demonstrates the integration of various linkage maps with different marker types to construct a high‐density genetic map covering the whole genome. This technique could be a powerful complementary strategy to dissect the genetic basis of complex traits and their genetic correlations in cotton. In addition to providing new insights into the genetic basis of the traits related to fibre quality and yield and their relationships, this study also provided information about improving cotton fibre quality and yield simultaneously.

Results

Phenotypic variation and correlation of traits relating to fibre quality and yield

A RIL population developed between the two parents 0‐153 and sGK9708 was planted from 2007 to 2015 across 22 environments (Table S1). The phenotypic variation ranges of fibre strength (FS), fibre length (FL), micronaire (FM), boll weight (BW), lint percentage (LP) and seed index (SI) for the parents and the RIL population are summarized in Table S2 and Figure 1. All the traits showed approximately normal distributions with an absolute skewness value of less than one and were characterized by transgressive segregation with respect to parental performance.
Figure 1

The phenotype of the parents and a transgressive segregation line in 22 environments and the correlation analysis. (a) The phenotype of the parents and transgressive segregation line in 22 environments. (b) The correlation analysis between the different traits in the same environments.

The phenotype of the parents and a transgressive segregation line in 22 environments and the correlation analysis. (a) The phenotype of the parents and transgressive segregation line in 22 environments. (b) The correlation analysis between the different traits in the same environments. The heritability for FS, FL, FM, BW and LP was calculated across 21 environments, excluding 2013Anyang; the heritability for SI was calculated across 20 environments, excluding 2012Zhengzhou and 2013Anyang. The heritability for all traits was more than 50%, except in 2015Shihezi (49.80%) and 2014Alaer (31.20%), which indicates that these six traits had high heritability. The variance analysis and heritability analysis results are shown in Table S3. To establish correlations between different traits in the same environment, three categories of correlations were observed among the six traits. In the first category, the trait pairs (FS and FL, FS and SI, FL and SI, FM and BW, and BW and SI) showed significant medium or high positive correlations in most of the environments; in the second category, the trait pairs (FS and FM, FS and LP, FL and FM, FL and LP, and LP and SI) showed significant medium or high negative correlations in most of the environments; and in the third category, the trait pairs (FM and LP, FS and BW, FL and BW, FM and SI, BW and LP) showed no or weak positive or negative correlations in most of the environments (Table S4, Figure 1). In 2012Zhengzhou, 2014Kuerle and 2015Kuerle, the correlations between the traits differed significantly from those in the other environments. In 2011Anyang, 2011Zhengzhou and 2012Zhengzhou, some traits did not show any significant positive correlations with those in the other environments. These results indicated that these five environments were distorted; thus, the other 17 environments were used for QTL identification (Table S5).

Genotyping for constructing the genetic linkage map

To ensure the quality of the genetic consensus map, all three types of markers (SSR, SLAF‐SNP and chip‐SNP) were first filtered. For the SLAF‐SNP markers, the processing and the results of filtering the markers are shown in Zhang's reports (Zhang et al., 2016) (Table S6). For the chip‐SNP markers, 6989 SNPs were shown to be polymorphic between the parents, 4863 were located on the genome of upland cotton, 2734 were genotyped as AA × BB with a missing rate of <40%, and 2315 showed nonsignificant segregation distortion with P values of more than 0.001 and could therefore be used to construct the consensus genetic map (Table S6) (Zhang et al., 2017b). For the SSR markers, 398 markers with 458 loci from Sun's and Jamshed's research could be located in the upland cotton genome and used to construct the consensus genetic map; these markers were developed by other organizations (Table S6) (Jamshed et al., 2016; Sun et al., 2012).

Construction of the consensus genetic linkage map

After combining the filtered markers, the consensus genetic linkage map consisted of 8295 markers (458 SSRs, 5521 SLAF‐SNPs and 2316 chip‐SNPs) and spanned 5197.17 cM, with 2384.94 cM for the At subgenome and 2812.23 cM for the Dt subgenome. The average interval between adjacent markers was 0.88 cM. The gaps (more than 10 cM) in the integrated genetic linkage map were mainly found on Dt subgenome chromosomes, with five on chromosome 15 and four on chromosome 26. The largest map gap was found on chromosome 25 (24.36 cM). The segregation distortion of the markers (SDM) (0.001 < P value <0.05) in the integrated linkage map remained at almost the same level as in the component maps. Chromosome 14 (447) and chromosome 25 (249) were the two chromosomes with the highest levels of SDM. The marker density in the genetic and physical maps along the whole genome indicated high coverage, although the markers were unevenly distributed throughout the upland cotton genome, and there were large variations across chromosome regions. For the collinearity analysis, most of the SNP loci on the linkage map were in the same order as those on the corresponding chromosomes of the physical map of the upland cotton genome. These results demonstrated that the current linkage map could provide an effective and reliable tool for QTL mapping (Table 1, Figure 2).
Table 1

Detailed information on the consensus genetic map for the RIL population

No. ChrMarker NumTotal Genetic Distance (cM)Average Genetic Distance (cM)Largest GapNo. SLAF‐SNP MarkersNo. chip‐SNP MarkersNo. SSR MarkersNumber of Gap (>5 cM)Number of SDMs (0.001 < = P < 0.05)Percentage of SDMs (%)Chi‐square P‐valueSDR
chr01414181.810.446.772971089111928.742.840.213
chr02238151.990.6411.181805446218.821.320.452
chr03329191.940.5916.292181001156820.672.350.483
chr04646208.850.3215.125746012781.240.940.471
chr05571221.340.398.1343411918519634.332.870.295
chr06181207.631.1516.98101423837641.996.800.354
chr07422168.990.44.763189770296.871.130.452
chr087885.31.114.795621102025.647.010.311
chr09376193.160.5214.15274911159023.942.600.407
chr10218199.120.929.901336817117433.949.360.242
chr11120154.861.35.3988284154.171.550.331
chr12368212.730.584.7627389608924.182.610.245
chr13766207.220.275.07604151111648.361.670.366
chr14727234.630.322.9840830514047765.617.320.152
chr1585226.612.713.202927291889.411.490.381
chr16620245.760.416.1739917546820232.585.370.285
chr17184199.411.0916.59102602210115.981.010.511
chr18286236.880.8323.661729618155218.182.100.314
chr19180219.651.2313.951095615145027.782.670.355
chr20150226.791.5213.51606228124328.676.030.344
chr21271221.930.825.44174898282.950.900.532
chr22149152.861.034.76756410021.340.540.650
chr23302244.880.8113.2914213129710936.097.950.226
chr24109132.171.223.906046301614.681.820.431
chr25353230.040.6524.3616612166724970.5412.160.108
chr26152240.621.5916.07755621122214.471.860.423
Figure 2

Detailed information about the consensus genetic map. (a) The density of markers in the genetic map in a 5 cM window. (b) The density of markers in the physical map in a 500 kb window. (c) The distribution of the SSR markers, SLAF‐SNP, markers and chip makers in the consensus genetic map. (d) Collinearity analysis of markers between the physical map and genetic map in the A subgenome. (e) Collinearity analysis of markers between the physical map and genetic map in D subgenome.

Detailed information on the consensus genetic map for the RIL population Detailed information about the consensus genetic map. (a) The density of markers in the genetic map in a 5 cM window. (b) The density of markers in the physical map in a 500 kb window. (c) The distribution of the SSR markers, SLAF‐SNP, markers and chip makers in the consensus genetic map. (d) Collinearity analysis of markers between the physical map and genetic map in the A subgenome. (e) Collinearity analysis of markers between the physical map and genetic map in D subgenome.

QTL identification for fibre quality and yield

Based on the consensus genetic map and the phenotype data across 17 environments, 983 QTLs (548 on At and 435 on Dt) for fibre quality and yield were identified. Among them, 198 were stable (118 on At and 80 on Dt) in at least three environments, and 53.53% of stable QTLs were mainly distributed on chromosomes 4, 6, 7, 13, 21 and 25, especially the three‐environment‐stable QTLs (Tables 2, S7 and S8, Figures S1 and 3).
Table 2

The number of QTLs and stable QTLs for the six fibre quality and yield traits in 17 environments

No. ChrFSFLFMBWLPSITotal
TotalStableTotalStableTotalStableTotalStableTotalStableTotalStableTotalStable
Chr0110124718421403410
Chr02003032604151214
Chr03404060518360334
Chr0412513814380831346823
Chr057071821125051436
Chr06103428218362515113
Chr0714316382134922278221
Chr08422161205020214
Chr094010111035050264
Chr1010501211627071484
Chr117110252413052348
Chr121020501212010231
Chr1383113113164113706416
Chr1412182120809380576
Chr15308042512010233
Chr16834181837180439
Chr17101033315051185
Chr18515071805010312
Chr19616160505092374
Chr20314130406230234
Chr218264635161813912
Chr22623060613081324
Chr23302041511030182
Chr240010205111251244
Chr25651021311681421237121
Chr26401020414241194
Total138331493517032211431532816227983198
Figure 3

Detailed information about the QTLs and QTL clusters. (a) The position of the QTLs on the consensus genetic map. (b) The distribution of the stable QTLs for the six traits on the 26 chromosomes. (c) The position of QTL clusters on the consensus genetic map of At. (d) The position of QTL clusters on the consensus genetic map of Dt. (e) The number of two‐pair‐trait QTL clusters with the same and different direction for additive effect.

The number of QTLs and stable QTLs for the six fibre quality and yield traits in 17 environments Detailed information about the QTLs and QTL clusters. (a) The position of the QTLs on the consensus genetic map. (b) The distribution of the stable QTLs for the six traits on the 26 chromosomes. (c) The position of QTL clusters on the consensus genetic map of At. (d) The position of QTL clusters on the consensus genetic map of Dt. (e) The number of two‐pair‐trait QTL clusters with the same and different direction for additive effect. There were 138 QTLs for FS, 149 for FL, 170 for FM, 211 for BW, 153 for LP and 162 for SI. Among them, 33 (At was 17 and Dt was 16), 35 (At was 24 and Dt was 11), 32 (At was 20 and Dt was 12), 43 (At was 25 and Dt was 18), 28 (At was 15 and Dt was 13) and 27 (At was 17 and Dt was 10) QTLs were stable, respectively. These stable QTLs were distributed on all 26 chromosomes of cotton, mainly on chromosomes 4, 6, 7, 11, 13, 16, 20, 21 and 25. Detailed information about the confidence intervals, observed phenotypic variance (PV) values and additive effects of each QTL is shown in Tables 2, S7, S8, Figures S1 and 3.

QTL cluster identification

Among the QTLs identified for fibre quality and yield, there were QTLs for different traits that shared the same confidence intervals and were considered to constitute QTL clusters. In this research, 37 QTL clusters were identified (23 on At and 14 on Dt). All these QTL clusters were distributed on 17 chromosomes (chromosomes 2, 4, 5, 6, 7, 8, 11, 13, 15, 16, 17, 19, 20, 21, 22, 25 and 26). Six chromosomes (chromosomes 4, 6, 7, 13, 21 and 25) harboured 25 QTL clusters, accounting for 67.6% of the total clusters identified, and could be considered important chromosomes in this study. Among these QTL clusters, there were six QTL clusters that harboured QTLs for FS or FL and BW or LP. The 0‐153 allele increased FS and FL but decreased LP or BW in the qClu‐chr16‐1, qClu‐chr20‐1, qClu‐chr6‐2, qClu‐chr25‐2 and qClu‐chr13‐2 cluster; it also increased LP but decreased FS in qClu‐chr21‐3. Detailed information about the QTL clusters and the QTLs in these clusters is shown in Table S9, Figures S2 and 3.

Candidate gene identification and annotation

The genes in the confidence intervals related to the six traits were annotated with the Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and euKaryotic Orthologous Groups (KOG) databases. The GO term ‘nucleus’, the KEGG pathway ‘Metabolic pathways’ and the KOG basket ‘Signal transduction mechanisms’ were associated with the highest numbers of genes for each trait (Tables S10, S11, S12). In the confidence intervals of the QTL clusters, there were a total of 1297 candidate genes, 946 of which belonged to At and 351 to Dt of the upland cotton genome (Table S13). The qClu‐chr11‐1 cluster harboured the most candidate genes at 194, while qClu‐chr7‐3 harboured only one candidate gene. Six QTL clusters (qClu‐chr4‐1, qClu‐chr4‐2, qClu‐chr4‐5, qClu‐chr5‐2, qClu‐chr7‐3 and qClu‐chr13‐2) harboured fewer than ten candidate genes, and nine QTL clusters (qClu‐chr7‐6, qClu‐chr13‐3, qClu‐chr13‐4, qClu‐chr15‐1, qClu‐chr16‐1, qClu‐chr17‐1 qClu‐chr20‐1, qClu‐chr21‐3 and qClu‐chr25‐2) did not harbour any candidate genes because of their small confidence intervals. To predict the functions of the candidate genes, each was annotated with the Arabidopsis thaliana database and using GO, KEGG and KOG. In the Arabidopsis annotation database, 1165 candidate genes had annotation information (Table S14). For GO annotation, the 1122 candidate genes were annotated with 3326 GO terms, and some candidate genes were annotated with more than one GO term. The biological process category included 1997 GO terms; the cellular component category contained 429 GO terms, and the molecular function category held 900 GO terms (Table S15). For KEGG annotation, all 1297 candidate genes were annotated with 91 KEGG pathways, with some candidate genes having no KEGG annotation information (Table S15). For KOG annotation, 861 of the 1122 candidate genes had KOG annotation information. Detailed information on the annotation of the genes is shown in Table S15 and Figure S3.

Candidate gene expression patterns

The total number of genes located in the confidence intervals of the 37 QTL clusters was 1297. Among the QTLs in the 37 QTL clusters, there were 20 QTLs for FS, harbouring 763 genes; 18 QTLs for FL, harbouring 580 genes; 11 QTLs for FM, harbouring 433 genes; 15 QTLs for BW, harbouring 307 genes; nine QTLs for LP, harbouring 70 genes; and 13 QTLs for SI, harbouring 255 genes. To select the candidate genes, the RNA‐Seq data of all the tissues in TM‐1 were analysed (Zhang et al., 2015b). Overall, 777 of the 1297 candidate genes were expressed in at least one tissue with fragments per kilobase of transcript per million fragments mapped (FPKM) values of no less than 10, and 552 of the candidate genes were expressed in cotton fibres or ovules in Zhang's research (Zhang et al., 2015b). Examining our RNA‐Seq data for cotton fibre development indicated that 495 of the 1297 candidate genes were expressed in at least one developmental period. Combining the RNA‐Seq results of our and Zhang's research, we determined that 414 genes were expressed in both data sets and were distributed on 25 QTL clusters on chromosomes 2, 4, 5, 6, 7, 8, 11, 13, 19, 21, 22, 25 and 26. Overall, 246 of the 414 genes (59.42%) were located on six chromosomes (chromosomes 4, 6, 7, 13, 21 and 25). Among the 414 genes, there were 237 for FS, 192 for FL, 150 for FM, 89 for BW, 16 for LP and 167 for SI (Table S16, Figures S4, S5 and 4).
Figure 4

Heatmap of the expression level of candidate genes in six QTL clusters during cotton fibre development. (a–f) Heatmap of the expression level of candidate genes in during cotton fibre development. (g–i) The qRT‐PCR result of the genes Gh_A13G0393, Gh_A13G0394 and Gh_A13G0395.

Heatmap of the expression level of candidate genes in six QTL clusters during cotton fibre development. (a–f) Heatmap of the expression level of candidate genes in during cotton fibre development. (g–i) The qRT‐PCR result of the genes Gh_A13G0393, Gh_A13G0394 and Gh_A13G0395.

qRT‐PCR results

qRT‐PCR was performed to determine the expression levels of the four genes located on chromosome 13. The results indicated that there was no expression of the Gh_A13G0392 gene; Gh_A13G0393 was highly expressed at 20 days post‐anthesis (DPA), 25 DPA and 30 DPA, and the expression levels in the materials with high fibre quality were slightly higher than those with low fibre quality. The expression levels of Gh_A13G0394 and Gh_A13G0395 were higher in the later stages (20 DPA, 25 DPA and 30 DPA) than in the early stages (5, 10 and 15 DPA) of cotton fibre development (Table S17, Figure 4).

The descriptive statistics, quality of sequencing and population structure of the nature population

The descriptive statistics results showed that FS was from 19.10 to 49.9 cN/tex; FL from 23.9 to 37.70 mm; FM from 2.43 to 7.78; BW from 2.22 to 8.73 g; LP from 27.32% to 48.84%; and SI from 7.90 to 18.80 g in the nature population (Table S18, Figure S6). The SNP number was from 93 4736 to 1 628 878; Q30 value from 91.08% to 93.79%; GC percentage from 33.06% to 38.70%; average depth from 7.19 to 29.54; integrity from 36.98% to 47.02%; and coverage from 0.03 to 0.05 in the nature population (Figure S7). For the population structure analysis, the delta K value reached a sharp peak at K = 3. Therefore, this association population was clustered into three subpopulations (Figure S8).

Discussion

Coverage and saturation of the consensus genetic map

Based on the reference genome of upland cotton, the total genetic distance of the genetic map should be between 4500 and 5500 cM without visible gaps (Du et al., 2018; Li et al., 2016a; Paterson et al., 2012; Wang et al., 2012; Zhang et al., 2015b). In previous studies, several high‐density genetic maps were constructed using different markers with different populations; these maps, however, included genetic gaps of more than 20 cM (Diouf et al., 2018; Hulse‐Kemp et al., 2015; Jia et al., 2018; Li et al., 2016a; Liu et al., 2015, 2017; Qi et al., 2017; Tan et al., 2014; Tang et al., 2015; Wang et al., 2015a,b; Zhang et al., 2016, 2017a,b). In this study, we used three types of markers to construct a consensus genetic map that could fill some of the gaps in the previously generated maps. This consensus genetic map covered the whole genome of upland cotton with high saturation and is therefore a valuable tool for QTL and candidate gene identification, functional characterization and pyramiding breeding across the whole genome.

QTL numbers compared with previous reports

Because quantitative traits are influenced by the environment, some QTLs could be identified in several environments, while others could be identified only in specific environments; QTLs that could be identified in multiple environments or generations are considered stable QTLs (Sun et al., 2012) and may be of use for marker‐assisted breeding and gene cloning. As cotton fibre quality and yield traits are quantitative traits, they are controlled by multiple loci/genes. In previous studies, no more than 16 stable QTLs were identified within a population for each of the fibre quality or yield traits, that is the maximum of 16 was found for FS, seven for FL, five for FM, 16 for BW, seven for LP and four for SI (Diouf et al., 2018; Hulse‐Kemp et al., 2015; Li et al., 2016a; Liu et al., 2017; Qi et al., 2017; Tan et al., 2014, 2018; Tang et al., 2015; Wang et al., 2015a,b; Yang et al., 2015). In this study, we used a consensus genetic map with high coverage and saturation and identified more than 25 stable QTLs across 17 environments. Therefore, this study identified the greatest number of stable QTLs for fibre quality and yield. These results could provide more information about the genetic mechanisms underlying cotton fibre development and provide important loci related to the improvement of both cotton fibre quality and yield.

Congruence with previously reported QTLs

To determine whether the QTLs in our study were novel or had been previously identified, we compared our results with those from the cotton QTL database based on their physical confidence intervals and the reports of previous genome‐wide association studies (GWASs). A total of 43 QTLs shared the same or overlapping confidence intervals with QTLs identified in previous studies (Fang et al., 2014; Lacape et al., 2010; Rong et al., 2007; Said et al., 2013, 2015). Of these, 22 were in the cotton QTL database, and 25 were identified in previous GWAS (Table S19) (Fang et al., 2017b,2017c; Huang et al., 2017; Ma et al., 2018; Sun et al., 2017; Wang et al., 2017a). Overall, of the 198 stable QTLs identified in our study, 43 have been identified in previous studies (seven for FS, seven for FL, 11 for FM, four for BW, ten for LP and four for SI.) The others (78.3% of 198) were newly identified (26 for FS, 28 for FL, 21 for FM, 39 for BW, 18 for LP and 23 for SI) and could provide more information about the mechanisms of cotton fibre development and yield formation.

Congruence with previously reported QTL clusters

In previous studies, many QTL clusters were reported and are available in the cotton QTL database (Fang et al., 2014; Lacape et al., 2010; Rong et al., 2007; Said et al., 2013, 2015). In this database, all the chromosomes harboured QTL clusters for no fewer than two traits among FS, FL, FM, BW, LP and SI. Among the 37 QTL clusters identified in our study, there were six QTL clusters (qClu‐chr6‐2, qClu‐chr7‐1, qClu‐chr13‐1, qClu‐chr13‐2, qClu‐chr17‐1 and qClu‐chr19‐1) that shared the same or overlapping confidence intervals with those in the database; the other 31 clusters were newly identified in our research. Three markers, Marker8331, Marker8332 and Marker8333 related to qClu‐chr13‐2, were selected to genotype a natural population with 278 breeding parents (Table S18 and Figure S6). For all three markers, the phenotypes of the materials with markers and without markers showed significant differences, with P values <0.05. The FS increased from 9.77 to 10.61 cN/tex, and the FL increased from 6.40 to 4.21 mm with the markers. However, the BW decreased from 2.19 to 2.70 g, and the LP decreased from 3.17% to 4.20% with the markers (Table S20, Figure 5). These results could provide information about the mechanism underlying the negative correlation between yield traits and fibre quality traits and act as a reference for improving the yield and fibre quality of cotton.
Figure 5

The detail information of the QTL cluster . (a) The QTLs located in the QTL cluster . (b) The genotype of the three markers in the QTL cluster based on the RIL population. (c) The genotype of the three markers in the QTL cluster based on the breeding parents. (d) The t‐test of the three markers in the QTL cluster based on the breeding parents in 2017. (e) The t‐test of the three markers in the QTL cluster based on the breeding parents in 2016.

The detail information of the QTL cluster . (a) The QTLs located in the QTL cluster . (b) The genotype of the three markers in the QTL cluster based on the RIL population. (c) The genotype of the three markers in the QTL cluster based on the breeding parents. (d) The t‐test of the three markers in the QTL cluster based on the breeding parents in 2017. (e) The t‐test of the three markers in the QTL cluster based on the breeding parents in 2016.

QTL clusters and genetic correlation between fibre quality and fibre yield

Some previous studies in cotton have addressed the correlation between traits and QTL clusters but did not combine these results for further analysis (Diouf et al., 2018; Li et al., 2016a; Liu et al., 2017; Tan et al., 2018). In these studies, the number of QTL clusters ranged from 2 to 18, and the paired‐trait QTL clusters ranged from 2 to 27 (Table S21). Through detailed analysis of these results, we found that when paired traits had a medium or high significant positive correlation, the QTL additive effect directions for these two traits were the same (positive or negative) in most QTL clusters (Diouf et al., 2018; Liu et al., 2017; Tan et al., 2018) (Table S21); when paired traits had a medium or high significant negative correlation, the QTL additive effect directions for these two traits were opposite (positive and negative) in most of the QTL clusters (Liu et al., 2017); and when paired traits had no or only a weakly significant correlation, the QTL additive effect directions for these two traits were sometimes the same and sometimes opposite (Liu et al., 2017; Tan et al., 2018). In our research, a total of 59 paired‐trait QTL clusters were identified for the fibre yield and quality traits; 28 (47.5%) showed the same QTL additive effect direction (positive or negative), and 31 (52.5%) showed the opposite QTL additive effect directions (positive and negative). Five paired traits (FS and FL, FS and SI, FL and SI, FM and BW, and BW and SI) showed significant medium or high positive correlations in most environments; 27 paired‐trait QTL clusters were identified for these traits (Tables S3, S8, Figures S2 and 3). Except for two of the paired‐trait QTL clusters between FM and BW and SI and BW, the other 25 paired‐trait QTL clusters (92.8%) showed the same QTL additive effect directions (positive or negative), especially the 14 QTL clusters identified between FS and FL. Five paired traits (FS and FM, FS and LP, FL and FM, FL and LP, and LP and SI) showed significant medium or high negative correlations in most environments, and 16 paired‐trait QTL clusters were identified for these traits (Tables S3, S8, Figures S2 and 3), all of which (100%) showed opposite QTL additive effect directions (positive and negative). Five paired traits (FM and LP, FS and BW, FL and BW, FM and SI, and BW and LP) showed no or weak positive or negative correlations in most environments, and 16 paired‐trait QTL clusters were identified for these traits (Tables S3, S8, Figures S2, 3), 13 of which (81.2%) showed opposite QTL additive effect directions (positive and negative). Seventeen paired‐trait QTL clusters were identified for BW with FS, FM, SI and LP, nine of which (53%) showed the same QTL additive effect direction, and eight of which (47%) showed opposite additive effect directions. Overall, the QTL clusters for cotton fibre quality and yield throughout the genome showed that paired traits with significant medium or high positive correlations always had the same QTL additive effect direction, indicating that it could be easy to simultaneously improve these traits. However, the paired traits with significant medium or high negative correlations always had opposite QTL additive effect directions, indicating that it could be difficult to simultaneously improve these traits. The paired traits with no or weak positive or negative correlations had either the same or opposite QTL additive effect directions, indicating that the positive and negative additive effects of QTLs in different clusters might have a mutually neutralizing effect with respect to these traits. These conclusions indicated that genes and QTLs in clusters might be closely linked (Cai and Morishima, 2002; Durand et al., 2012; Gu et al., 2015; Ku et al., 2015; Li et al., 2015b; Reddy et al., 2013; Vikram et al., 2015; Zhao et al., 2016) or have pleiotropic effects (Li et al., 2016c; Xie et al., 2018; You et al., 2019; Yuan et al., 2018; Zhao et al., 2018) in the same genomic region, which provides an explanation for the significant phenotypic correlations between related traits and linkage drag. However, as in previous reports, especially in breeding populations, the negative genetic correlations between traits could be broken to a certain degree (Cai and Morishima, 2002; Durand et al., 2012; Gu et al., 2015; Ku et al., 2015; Li et al., 2015b; Reddy et al., 2013; Vikram et al., 2015; Zhao et al., 2016). This result suggests that genetic linkage may be the main genetic basis of trait correlation. To break the negative correlations between fibre quality traits and yield traits, the six QTL clusters (qClu‐chr16‐1, qClu‐chr20‐1, qClu‐chr6‐2, qClu‐chr25‐2, qClu‐chr13‐2 and qClu‐chr21‐3) should be considered for fine mapping of QTLs for FS, FL and LP. Techniques such as further separation of the confidence intervals of QTL clusters via secondary separation populations (Cao et al., 2015; Fang et al., 2017a,b,c; Liu et al., 2016; Xu et al., 2017), mutation with ethyl methanesulfonate (EMS) and radiation to change genes related to traits (Naoumkina et al., 2017; Patel et al., 2014) and the use of clustered regularly interspaced short palindromic repeats (CRISPR) technology to edit genes related to traits (Gao et al., 2017; Janga et al., 2017; Wang et al., 2017b, 2018) could be used to improve fibre quality and yield.

Important candidate genes in QTL clusters

To explore the mechanism of the negative correlation between fibre quality and yield from the standpoint of candidate genes, all the genes located in the six QTL clusters (qClu‐chr16‐1, qClu‐chr20‐1, qClu‐chr6‐2, qClu‐chr25‐2, qClu‐chr13‐2 and qClu‐chr21‐3) were analysed. Among the six QTL clusters, only two (qClu‐chr6‐2 and qClu‐chr13‐2) harboured genes. In qClu‐chr6‐2, 14 of the 49 genes were expressed in both our and Zhang's transcript data. In the qClu‐chr6‐2 GO analysis, Gh_A06G0506 was related to ‘plant‐type cell wall’; Gh_A06G0543 and Gh_A06G0515 were related to ‘Golgi’; Gh_A06G0545 was related to ‘cytoskeleton’; and ‘Gh_A06G0551’ was related to ‘auxin’. Gh_A06G0543, Gh_A06G0545 and Gh_A06G0551 were expressed in both ovules and cotton fibre and may be related to both fibre quality and yield traits; Gh_A06G0506 was expressed only at 5 DPA during cotton fibre development and may be related to FL; Gh_A06G0515 was expressed only in ovules and may be related to yield traits. In qClu‐chr13‐2, three of the four genes were expressed in both our and Zhang's transcript data. The qRT‐PCR analysis results also showed that these three genes were highly expressed at 20, 25 and 30 DPA during cotton fibre development. Based on the numbers and annotation information of the candidate genes in the QTL clusters, the genes located in QTL clusters qClu‐chr7‐2, qClu‐chr25‐4, qClu‐chr11‐1 and qClu‐chr5‐1 were also examined. Of these, five genes have been previously reported in cotton research; Gh_A07G1524 belongs to the WRKY family and is related to the development of fibres, anthers and other tissues and is also reportedly involved in stress response (Cai et al., 2014; Dou et al., 2014); Gh_D06G0864 is a plant LKR/SDH locus that can encode three related but distinct enzymes associated with Lys catabolism and could finely regulate Lys catabolism during plant development (Tang et al., 2002); Gh_D06G0865 is related to pectin degradation and cotton fibre development (Li et al., 2016b; Liu et al., 2013); Gh_A11G0147 and Gh_A05G2095 are related to the NAC family and are associated with plant development and especially cotton fibre development (Meng et al., 2007; Meng et al., 2007, Shang et al., 2013, 2016, Zhang et al., 2017a). Based on the annotation information, Gh_A07G1480, Gh_A07G1602 and Gh_A05G2107 were related to ‘Starch and sucrose metabolism’ in KEGG; Gh_A07G1590, Gh_A07G1535, Gh_A11G0125, Gh_A11G0232, Gh_A11G0223, Gh_A11G0165, Gh_A05G2107 and Gh_A05G2088 were directly related to the cell wall, such as ‘cell wall’ and ‘plant‐type cell wall’ in GO and ‘Cell wall/membrane/envelope biogenesis’ in KOG. Based on the expression information, Gh_A07G1602, Gh_A07G1590, Gh_A07G1535 and Gh_A11G0223 were expressed at 5 DPA and/or 10 DPA during cotton fibre development and may be related to FL; Gh_A05G2107 was expressed at 15 DPA during cotton fibre development and may be related to FS; Gh_A07G1480, Gh_A11G0125 and Gh_A05G2088 were expressed in both the ovule and fibre and may be related to both fibre quality traits and yield traits; and Gh_A11G0232 and Gh_A11G0165 were expressed only in ovules and may be related to yield traits. In total, there were 23 promising genes (six from qClu‐chr6‐2, five from qClu‐chr7‐2, five from qClu‐chr25‐4, three from qClu‐chr11‐1 and two each from qClu‐chr5‐1 and qClu‐chr13‐2). To determine whether these genes may contribute to the improvement of cotton fibre quality and yield, further studies will need to be completed in the future (Figures S4, S5 and 4).

Experimental procedures

Development and phenotypic evaluation of the mapping population

A RIL population derived from two upland cotton cultivars, 0‐153 and sGK9708, was evaluated and used throughout this study. Briefly, the cross was made in 2001, and a segregation population consisting of 196F6:8 RILs was developed as detailed by Sun et al. (2012). From 2007 to 2015, multi‐environmental evaluations were conducted in 22 different ecological locations across two principal cotton planting areas, Xinjiang Uygur Autonomous Region (XUAR) and Yellow River Valley (YRV). In the years 2007–2010 and 2013, this experiment was described in Zhang's report (Zhang et al., 2015b). In 2011, the population was planted in Anyang and Zhengzhou; in 2012, it was planted in Zhengzhou; in 2014 and 2015, it was planted in Anyang, Alaer, Shihezi and Kuerle. The phenotypic evaluations of the population at each location were conducted in a completely randomized block design of one‐row plots with two replicates in the YRV locations. Each row was 5 m long and 0.8 m apart, with 20 plants planted in each row except for 2013Anyang, which had only one replicate. In the XUAR locations, two‐narrow‐row plots were applied; each row was 3 m long and planted in wide/narrow alternating row spacing of 0.66/0.10 m, with 25 plants in each row. Field management was performed according to the local farming practices. Three fibre quality traits, FL, FS and FM, and three yield traits, BW, LP and SI, were evaluated for their phenotypic performances throughout the population. Thirty normally opened bolls were sampled from each plot during the harvesting season in September of each testing year to evaluate the phenotypic performances of the aforementioned traits. Briefly, after the seed cotton samples were weighed and ginned, BW, LP and SI were evaluated. The methods of testing the fibre quality traits are described in Sun's and Zhang's reports (Sun et al., 2012; Zhang et al., 2016). One‐way ANOVA was used to test the significance of the differences in the traits between the two parents. The descriptive statistics were calculated with SPSS 20.0 software (SPSS, Chicago, IL), and the variance and heritability were analysed with IciMapping software (Li et al., 2007, 2015c; Meng et al., 2015).

Development and phenotype data analysis of the nature population

The nature population included 278 breeding materials was planted in Anyang of Henan province in 2016 and 2017. The method of planting, phenotype data collection and descriptive statistics were the same as that in the RIL population.

Correlation analysis of the six phenotype traits

The correlations between the different traits in the same environments and between the same traits in different environments were analysed with SPSS 20.0 software if the P value was more than 0.05, and the two traits had no correlation. When the P value was less than 0.05, if the correlation coefficient was from 0 to 0.3 or from −0.3 to 0, the two traits had a weak positive or negative correlation; if the correlation coefficient was from 0.3 to 0.5 or from −0.5 to −0.3, the two traits had a medium positive or negative correlation; and if the correlation coefficient was from 0.5 to 1 or from −1 to −0.5, the two traits had a high positive or negative correlation.

Marker filtration

The SSR markers (Jamshed et al., 2016; Sun et al., 2012; Zhang et al., 2015c), the markers developed by SLAF‐Seq (Zhang et al., 2016) and markers discovered using the IntlCottonSNPConsortium_70k chip (Zhang et al., 2017b) were used to generate the consensus genetic map. Before the construction of the consensus genetic map, the markers were filtered. Markers with no position or more than one position on the genome of upland cotton were removed (Li et al., 2015a; Zhang et al., 2015b), as were markers with unclear genotyping, those with no polymorphism between the parents, those that were heterozygous in at least one parent, those with a missing rate of more than 40% and those that showed segregation distortion with a P value of the chi‐square test less than 0.001.

Consensus genetic map construction

All the markers remaining after filtration were initially partitioned into linkage groups and located on the chromosomes following the procedures detailed in Zhang's research (Zhang et al., 2015a) and according to the reference genome database developed from TM‐1 (Li et al., 2015a; Zhang et al., 2015b). HighMap software was applied to order the alleles and correct genotyping errors within all chromosomes (w = 3, l = 200, r = 7) (Jansen et al., 2001; Liu et al., 2014; Van Ooijen, 2011). The method of processing and constructing the consensus map was the same as described in Zhang's research (Zhang et al., 2016). The density of markers in the physical map and genetic map was calculated with Perl script with windows of 500 kb and 5 cM, and the picture was drawn by CIRCOS 0.66 (Krzywinski et al., 2009). The segregation distortion of each marker was analysed by the chi‐square test using JoinMap 4.0 software (Van Ooijen, 2006). The SDM loci showing significance with P values between 0.001 and 0.05 were maintained to construct the consensus genetic map, and any region on the map with more than three SDM loci was defined as a segregation distortion region (SDR) (Zhang et al., 2013).

QTL and QTL cluster identification for fibre quality and yield

Windows QTL Cartographer 2.5 (Wang et al., 2001) was applied to map the QTLs using the composite interval mapping method. QTLs identified consistently in at least three environments were considered to be stable (Sun et al., 2012). A positive additive effect indicates that the favourable allele comes from the parent 0‐153, while a negative effect indicates that the favourable allele comes from the parent sGK9708. The parameter of the software and the rules of QTL naming were described in Sun's and Zhang's reports (Sun et al., 2012; Zhang et al., 2016). The stable QTLs identified were compared with the CottonQTLdb database (http://www.cottonqtldb.org) to determine whether they were novel or had been published in previous studies (Fang et al., 2014; Lacape et al., 2010; Rong et al., 2007; Said et al., 2013, 2015) or previous GWAS research (Fang et al., 2017a,b,c; Huang et al., 2017; Ma et al., 2018; Sun et al., 2017; Wang et al., 2017a). The stable QTLs in the database that shared the same or overlapping confidence intervals as the QTLs found in this study were considered to have been identified in previous studies. Stable QTLs for different traits that shared the same or overlapping confidence intervals were considered to reside in QTL clusters, and the overlapping regions were considered to be the confidence intervals of the QTL clusters. The genes located in the confidence intervals of the QTLs and QTL clusters were considered candidate genes. All candidate genes and their corresponding protein sequences were compared with TAIR 10, the Arabidopsis annotation database (https://www.arabidopsis.org/), using BLASTP software. The candidate genes were annotated with GO, KEGG and KOG. The GO annotation information was obtained from the GO databases at http://archive.geneontology.org/latest-lite/ and ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/. The BLASTX software was used to compare the sequences of the candidate genes and the sequences in the database. The results with e values of less than e‐10 were considered to be significant. The KEGG annotation information was obtained from the KOBAS 3.0 software (Wu et al., 2006; Xie et al., 2011). The KOG annotation information for the candidate genes was obtained from the KOG database (ftp://ftp.ncbi.nih.gov/pub/COG/KOG). The rules of comparison were the same as those used for the GO analysis.

RNA sample collection and extraction

Four lines were selected for RNA‐Seq, including the two parents, 0‐153 and sGK9708, and two offspring, 69 307 and 69 362. The 69 307 and 69 362 lines belonged to the RIL population and showed transgressive segregation. The 69 307 had positive transgressive segregation with longer and stronger fibre, while the 69 362 line had negative transgressive segregation with short and weak fibre. All the four lines were selected for quantitative real‐time PCR (qRT‐PCR) analysis. Sample collection and RNA extraction were performed as described in Zou's research (Zou et al., 2018).

RNA library preparation, sequencing and quality control

The RNA samples were used to generate sequencing libraries with Illumina TruSeq™ RNA Sample Preparation Kits (Illumina, San Diego, CA) according to the manufacturer's instructions. Transcriptome sequencing was carried out on an Illumina HiSeq 2500 platform that produced 125 base pair (bp) paired‐end (PE) raw reads (Berry Genomics Co., Ltd). Raw data (raw reads) in FASTQ format were processed using Trimmomatic software (Bolger et al., 2014). In this step, clean data were obtained as follows: first, reads with ≥10% unidentified nucleotides (N) were removed; second, reads with >50% bases having Phred quality <5 were removed; and third, reads with >10 nt aligned to the adapter were removed. Reads with 0 nt aligned to the adapter were also removed to eliminate putative PCR duplicates generated by PCR amplification in the library construction process (read 1 and read 2 of two PE reads that were completely identical). All downstream analyses were based on clean data.

Read mapping and expression level analysis

First, the RNA‐Seq data from Zhang's research were downloaded from the National Center for Biotechnology Information (NCBI) database in SRA format (PRJNA248163) (Zhang et al., 2015b). The processes of transforming the SRA format into FASTQ format, mapping of reads to the upland cotton genome, calculating FPKM values and generating heatmaps are detailed in Zou's research (Zou et al., 2018).

qRT‐PCR verification

The RNA samples of ten materials from 5, 10, 15, 20, 25 and 30 DPA during cotton fibre development were used to conduct qRT‐PCR. The primers of the four genes were designed on the qPrimerDB website (https://biodb.swu.edu.cn/qprimerdb/browse_plants). The procedure was according to Zou and Zhang (Zhang et al., 2019; Zou et al., 2018).

Genotyping and population structure analysis for the nature population

SLAF sequence was used to develop SNP markers for the nature population with the two enzymes of HaeIII and SspI. The sequencing was done with the Illumina HiSeq Xten sequencing platform (Illumina, San Diego, CA), paired‐end 150 bp. The steps to trim the row data were detailed in Su's research (Su et al., 2016). The strategies of analysing the SLAF‐Seq data and population structure were the same as that it in Fang's research (Fang et al., 2017b). The original data have been uploaded to NCBI SRA database (PRJNA542946).

Conflict of interest

The authors declare no conflicts of interest.

Author contribution

Zhang Z analysed and summed all the data, drew the figures and wrote the manuscript; Li JW collected and analysed the phenotype data; Jamshed M developed the SSR markers; Shi YZ, Wang SF and Zhang JH managed the RIL population; Liu AY, Gong JW, Sun FD and Jia F collected and analysed the phenotype data in Huang He basin; Ge Q, Fan LQ and Zhang ZB collected and analysed the phenotype data in Xinjiang; Pan JT, Fan SM and Wang YL developed the chip‐SNP markers; Lu QW, Liu RX Deng XY and Liu P involved in the development of the SLAF‐SNP markers, Zou XY, Jiang X and Zhang CJ analysed the RNA‐Seq data; Li PT, Iqbal MS and Wang YM collected the RNA sample and performed the qRT‐PCR; Zhang CY and Zou J managed and collected the phenotype data in Linqing; Chen H and Tian Q managed and collected the phenotype data in Akesu; Jia XH, Wang BQ and Huang JY managed and collected the phenotype data in Zhengzhou; Ai NJ and Feng GL managed and collected the phenotype data in Shihezi; Hong M and Li SL managed and collected the phenotype data in Kuerle; Lian WM and Wu B managed and collected the phenotype data in Alaer; Hua JP managed and collected the phenotype data in Quzhou; Xu Aixia collected and analysis all the data of nature population; and Shang HH, Gong WK and Yuan YL provided the resources, designed the experiment and revised the manuscript. Figure S1 The position of all the QTLs for fiber quality and yield traits across 17 environments. Click here for additional data file. Figure S2 The number of two‐pair‐trait QTL clusters with the same and different directions for additive effect. Click here for additional data file. Figure S3 The annotation information of the candidate genes in the QTL clusters. Click here for additional data file. Figure S4 Heatmap of the expression level of 1297 candidate genes during cotton fiber development. Click here for additional data file. Figure S5 Heatmap of the expression level of 1297 candidate genes in different cotton tissues. Click here for additional data file. Figure S6 The distribution of the FS, FL, BW and LP in breeding parents. Click here for additional data file. Figure S7 The information of sequencing data in each line in the whole nature population. Click here for additional data file. Figure S8 The K value and the population structure of the nature population. Click here for additional data file. Table S1 Detailed information about the field experiments for the RIL population from 2007 to 2015. Click here for additional data file. Table S2 Descriptive statistics of the fiber quality and yield traits for the parents and RIL population in 22 environments. Click here for additional data file. Table S3 Variance and heritability analysis of the fiber quality and yield traits for the RIL population in 22 environments. Click here for additional data file. Table S4 Correlation analysis between different traits in the same environments. Click here for additional data file. Table S5 Correlation analysis between the same traits in different environments. Click here for additional data file. Table S6 Detailed information on the filtering of SLAF‐SNP, chip‐SNP and SSR markers. Click here for additional data file. Table S7 Detailed information about all the QTLs associated with the six fiber quality and yield traits in 17 environments. Click here for additional data file. Table S8 The marker intervals and the physical confidence intervals of the stable QTLs. Click here for additional data file. Table S9 Detailed information about the QTL clusters. Click here for additional data file. Table S10 The GO annotation information of the candidate genes in the QTLs for each trait. Click here for additional data file. Table S11 The KEEG annotation information of the candidate genes in the QTLs for each trait. Click here for additional data file. Table S12 The KOG annotation information of the candidate genes in the QTLs for each trait. Click here for additional data file. Table S13 A list of candidate genes in the QTL clusters. Click here for additional data file. Table S14 A list of candidate genes and their annotations in Arabidopsis thaliana. Click here for additional data file. Table S15 The GO, KEGG and KOG annotation information of the candidate genes in the QTL clusters. Click here for additional data file. Table S16 The expression of 1297 candidate genes during cotton fiber development and in different tissues. Click here for additional data file. Table S17 The primer sequence of the qRT‐PCR. Click here for additional data file. Table S18 The descriptive statistics result of the breeding parents. Click here for additional data file. Table S19 The congruence analysis results with the previously QTL and GWAS reports. Click here for additional data file. Table S20 The T‐text results of the breeding parents. Click here for additional data file. Table S21 The relationship between trait correlation and QTL clusters in previous studies. Click here for additional data file.
  87 in total

1.  Reference genome sequences of two cultivated allotetraploid cottons, Gossypium hirsutum and Gossypium barbadense.

Authors:  Maojun Wang; Lili Tu; Daojun Yuan; Chao Shen; Jianying Li; Fuyan Liu; Liuling Pei; Pengcheng Wang; Guannan Zhao; Zhengxiu Ye; Hui Huang; Feilin Yan; Yizan Ma; Lin Zhang; Min Liu; Jiaqi You; Yicheng Yang; Zhenping Liu; Fan Huang; Baoqi Li; Ping Qiu; Qinghua Zhang; Longfu Zhu; Shuangxia Jin; Xiyan Yang; Ling Min; Guoliang Li; Ling-Ling Chen; Hongkun Zheng; Keith Lindsey; Zhongxu Lin; Joshua A Udall; Xianlong Zhang
Journal:  Nat Genet       Date:  2018-12-03       Impact factor: 38.330

2.  Meta-analysis of polyploid cotton QTL shows unequal contributions of subgenomes to a complex network of genes and gene clusters implicated in lint fiber development.

Authors:  Junkang Rong; F Alex Feltus; Vijay N Waghmare; Gary J Pierce; Peng W Chee; Xavier Draye; Yehoshua Saranga; Robert J Wright; Thea A Wilkins; O Lloyd May; C Wayne Smith; John R Gannaway; Jonathan F Wendel; Andrew H Paterson
Journal:  Genetics       Date:  2007-06-11       Impact factor: 4.562

3.  QTL delineation for five fiber quality traits based on an intra-specific Gossypium hirsutum L. recombinant inbred line population.

Authors:  Xiaoyun Jia; Hantao Wang; Chaoyou Pang; Qifeng Ma; Junji Su; Hengling Wei; Meizhen Song; Shuli Fan; Shuxun Yu
Journal:  Mol Genet Genomics       Date:  2018-02-08       Impact factor: 3.291

4.  A new interspecific, Gossypium hirsutum x G. barbadense, RIL population: towards a unified consensus linkage map of tetraploid cotton.

Authors:  Jean-Marc Lacape; J Jacobs; T Arioli; R Derijcker; N Forestier-Chiron; D Llewellyn; J Jean; E Thomas; C Viot
Journal:  Theor Appl Genet       Date:  2009-04-23       Impact factor: 5.699

5.  Comprehensive phenotypic analysis and quantitative trait locus identification for grain mineral concentration, content, and yield in maize (Zea mays L.).

Authors:  Riliang Gu; Fanjun Chen; Bingran Liu; Xin Wang; Jianchao Liu; Pengcheng Li; Qingchun Pan; Jordon Pace; Ayaz-Ali Soomro; Thomas Lübberstedt; Guohua Mi; Lixing Yuan
Journal:  Theor Appl Genet       Date:  2015-06-10       Impact factor: 5.699

6.  Meta-analysis of cotton fiber quality QTLs across diverse environments in a Gossypium hirsutum x G. barbadense RIL population.

Authors:  Jean-Marc Lacape; Danny Llewellyn; John Jacobs; Tony Arioli; David Becker; Steve Calhoun; Yves Al-Ghazi; Shiming Liu; Oumarou Palaï; Sophie Georges; Marc Giband; Henrique de Assunção; Paulo Augusto Vianna Barroso; Michel Claverie; Gérard Gawryziak; Janine Jean; Michèle Vialle; Christopher Viot
Journal:  BMC Plant Biol       Date:  2010-06-28       Impact factor: 4.215

7.  Construction and analysis of high-density linkage map using high-throughput sequencing data.

Authors:  Dongyuan Liu; Chouxian Ma; Weiguo Hong; Long Huang; Min Liu; Hui Liu; Huaping Zeng; Dejing Deng; Huaigen Xin; Jun Song; Chunhua Xu; Xiaowen Sun; Xilin Hou; Xiaowu Wang; Hongkun Zheng
Journal:  PLoS One       Date:  2014-06-06       Impact factor: 3.240

8.  Population structure and genetic basis of the agronomic traits of upland cotton in China revealed by a genome-wide association study using high-density SNPs.

Authors:  Cong Huang; Xinhui Nie; Chao Shen; Chunyuan You; Wu Li; Wenxia Zhao; Xianlong Zhang; Zhongxu Lin
Journal:  Plant Biotechnol J       Date:  2017-04-12       Impact factor: 9.803

9.  A comprehensive meta QTL analysis for fiber quality, yield, yield related and morphological traits, drought tolerance, and disease resistance in tetraploid cotton.

Authors:  Joseph I Said; Zhongxu Lin; Xianlong Zhang; Mingzhou Song; Jinfa Zhang
Journal:  BMC Genomics       Date:  2013-11-11       Impact factor: 3.969

10.  Overexpression of OsHox32 Results in Pleiotropic Effects on Plant Type Architecture and Leaf Development in Rice.

Authors:  Ying-Ying Li; Ao Shen; Wei Xiong; Qiong-Lin Sun; Qian Luo; Ting Song; Zheng-Long Li; Wei-Jiang Luan
Journal:  Rice (N Y)       Date:  2016-09-13       Impact factor: 4.783

View more
  17 in total

1.  UDP-Glucose Dehydrogenases: Identification, Expression, and Function Analyses in Upland Cotton (Gossypium hirsutum).

Authors:  Tingting Jia; Qun Ge; Shuya Zhang; Zhen Zhang; Aiying Liu; Senmiao Fan; Xiao Jiang; Yulong Feng; Lipeng Zhang; Doudou Niu; Shen Huang; Wankui Gong; Youlu Yuan; Haihong Shang
Journal:  Front Genet       Date:  2021-01-11       Impact factor: 4.599

2.  QTL mapping and candidate gene prediction for fiber yield and quality traits in a high-generation cotton chromosome substitution line with Gossypium barbadense segments.

Authors:  Quanwei Lu; Pengtao Li; Rui Yang; Xianghui Xiao; Ziyin Li; Qiuyu Wu; Juwu Gong; Qun Ge; Aiying Liu; Shengli Du; Jundi Wang; Yuzhen Shi; Youlu Yuan
Journal:  Mol Genet Genomics       Date:  2022-02-20       Impact factor: 3.291

3.  Identification and analysis of oil candidate genes reveals the molecular basis of cottonseed oil accumulation in Gossypium hirsutum L.

Authors:  Zhibin Zhang; Juwu Gong; Zhen Zhang; Wankui Gong; Junwen Li; Yuzhen Shi; Aiying Liu; Qun Ge; Jingtao Pan; Senmiao Fan; Xiaoying Deng; Shaoqi Li; Quanjia Chen; Youlu Yuan; Haihong Shang
Journal:  Theor Appl Genet       Date:  2021-10-29       Impact factor: 5.699

4.  The transcriptome landscapes of citrus leaf in different developmental stages.

Authors:  Camila Ribeiro; Jin Xu; Doron Teper; Donghwan Lee; Nian Wang
Journal:  Plant Mol Biol       Date:  2021-04-19       Impact factor: 4.076

5.  Multi-responses of O-methyltransferase genes to salt stress and fiber development of Gossypium species.

Authors:  Abdul Hafeez; Qún Gě; Qí Zhāng; Jùnwén Lǐ; Jǔwǔ Gōng; Ruìxián Liú; Yùzhēn Shí; Hǎihóng Shāng; Àiyīng Liú; Muhammad S Iqbal; Xiǎoyīng Dèng; Abdul Razzaq; Muharam Ali; Yǒulù Yuán; Wànkuí Gǒng
Journal:  BMC Plant Biol       Date:  2021-01-11       Impact factor: 4.215

6.  A statistical framework for QTL hotspot detection.

Authors:  Po-Ya Wu; Man-Hsia Yang; Chen-Hung Kao
Journal:  G3 (Bethesda)       Date:  2021-04-15       Impact factor: 3.154

7.  Quantitative Trait Loci and Transcriptome Analysis Reveal Genetic Basis of Fiber Quality Traits in CCRI70 RIL Population of Gossypium hirsutum.

Authors:  Xiao Jiang; Juwu Gong; Jianhong Zhang; Zhen Zhang; Yuzhen Shi; Junwen Li; Aiying Liu; Wankui Gong; Qun Ge; Xiaoying Deng; Senmiao Fan; Haodong Chen; Zhengcheng Kuang; Jingtao Pan; Jincan Che; Shuya Zhang; Tingting Jia; Renhui Wei; Quanjia Chen; Shoujun Wei; Haihong Shang; Youlu Yuan
Journal:  Front Plant Sci       Date:  2021-12-16       Impact factor: 5.753

8.  Whole-genome resequencing of 240 Gossypium barbadense accessions reveals genetic variation and genes associated with fiber strength and lint percentage.

Authors:  Jingwen Yu; Yixuan Hui; Jinhong Chen; Hurong Yu; Xinpeng Gao; Zhaohui Zhang; Qin Li; Shuijin Zhu; Tianlun Zhao
Journal:  Theor Appl Genet       Date:  2021-07-08       Impact factor: 5.699

9.  Disequilibrium evolution of the Fructose-1,6-bisphosphatase gene family leads to their functional biodiversity in Gossypium species.

Authors:  Qún Gě; Yànli Cūi; Jùnwén Lǐ; Jǔwǔ Gōng; Quánwěi Lú; Péngtāo Lǐ; Yùzhēn Shí; Hǎihóng Shāng; Àiyīng Liú; Xiǎoyīng Dèng; Jìngtāo Pān; Qúanjiā Chén; Yǒulù Yuán; Wànkuí Gǒng
Journal:  BMC Genomics       Date:  2020-06-01       Impact factor: 3.969

10.  Co-expression network and comparative transcriptome analysis for fiber initiation and elongation reveal genetic differences in two lines from upland cotton CCRI70 RIL population.

Authors:  Xiao Jiang; Liqiang Fan; Pengtao Li; Xianyan Zou; Zhen Zhang; Senmiao Fan; Juwu Gong; Youlu Yuan; Haihong Shang
Journal:  PeerJ       Date:  2021-07-21       Impact factor: 2.984

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.