Literature DB >> 35428176

Construction of a high-density genetic map and identification of QTLs related to agronomic and physiological traits in an interspecific (Gossypium hirsutum × Gossypium barbadense) F2 population.

Zhanfeng Si1,2, Shangkun Jin1, Jiedan Chen1, Sen Wang3, Lei Fang1, Xiefei Zhu3, Tianzhen Zhang1, Yan Hu4.   

Abstract

BACKGROUND: Advances in genome sequencing technology, particularly restriction-site associated DNA sequence (RAD-seq) and whole-genome resequencing, have greatly aided the construction of cotton interspecific genetic maps based on single nucleotide polymorphism (SNPs), Indels, and other types of markers. High-density genetic maps can improve accuracy of quantitative trait locus (QTL) mapping, narrow down location intervals, and facilitate identification of the candidate genes. RESULT: In this study, 249 individuals from an interspecific F2 population (TM-1 and Hai7124) were re-sequenced, yielding 6303 high-confidence bin markers spanning 5057.13 cM across 26 cotton chromosomes. A total of 3380 recombination hot regions RHRs were identified which unevenly distributed on the 26 chromosomes. Based on this map, 112 QTLs relating to agronomic and physiological traits from seedling to boll opening stage were identified, including 15 loci associated with 14 traits that contained genes harboring nonsynonymous SNPs. We analyzed the sequence and expression of these ten candidate genes and discovered that GhRHD3 (GH_D10G0500) may affect fiber yield while GhGPAT6 (GH_D04G1426) may affect photosynthesis efficiency.
CONCLUSION: Our research illustrates the efficiency of constructing a genetic map using binmap and QTL mapping on the basis of a certain size of the early-generation population. High-density genetic map features high recombination exchanges in number and distribution. The QTLs and the candidate genes identified based on this high-density genetic map may provide important gene resources for the genetic improvement of cotton.
© 2022. The Author(s).

Entities:  

Keywords:  Agronomic and physiological traits; Cotton; Genetic map; QTL mapping; Re-sequencing

Mesh:

Year:  2022        PMID: 35428176      PMCID: PMC9013169          DOI: 10.1186/s12864-022-08528-2

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   4.547


Background

Gossypium hirsutum and G. barbadense are the most two important cultivated species of allotetraploid cotton in the world. Among the domesticated Gossypium species, G. hirsutum is the most widely cultivated, dominating modern cotton production due to its high lint yield and broad adaptability [1, 2]; meanwhile, G. barbadense provides excellent fiber that is finer, longer and stronger than fiber of G. hirsutum [3, 4]. Efficient and extensive transmission of valuable genes between G. barbadense and G. hirsutum is of extremely important practical significance for improving fiber quality while maintaining fiber yield, which is mainly limited by linkage drag. Quantitative traits exhibit continuous variation and are generally controlled by multiple genes, hence having a complex genetic basis; moreover, they are readily affected by the environment. Genetic research on quantitative traits is therefore difficult, and investigating the inheritance and QTL mapping of cotton quantitative traits is of great significance to the advancement of cotton genetics and breeding. Since Shappley et al. [5] constructed the first genetic map of cotton, many studies have conducted QTL mapping for important cotton traits. A high-density molecular genetic map is the foundation of plant genome research. Interspecific maps have been constructed for cotton, mainly between G. barbadense and G. hirsutum, and used to explore species differences such as in yield and quality traits [6-27]. These studies have provided very useful information for cotton molecular design and breeding. There are many QTL-enriched regions in the cotton genome, and there may be large numbers of related genes that play important roles in the plant’s growth and development [28]. Notably, QTLs for important traits are unevenly distributed among 26 different chromosomes of cotton. In interspecific populations, fiber quality QTLs are more typically located in the A subgenome, while in intraspecific populations, fiber yield and quality QTLs are more frequent in the D subgenome [29, 30]. Although DD diploid species do not have spinnable fibers, many studies have shown that the D subgenome of allotetraploid cotton contains many QTLs that control fiber quality [31, 32]. However, while previous studies have revealed these and other useful findings, the different groups and markers employed combined with the impact of environmental factors on QTL effects mean that the comparability of extant data is relatively poor. Therefore, QTL research on cotton is still advancing. Recent advances in genome sequencing technology allow the construction of ultra-high-density genetic maps based on SNP loci. Consequently, more comprehensive and accurate map information can be used to analyze QTLs associated with important traits. Since bin genetic linkage maps based on SNP loci were first constructed in rice [33], it has been widely applied in other plants such as cotton [20], maize [34], soybean [35], Cucumis melo [36], radish [37] and so on. These genetic linkage maps have yielded many fine-mapped QTLs for which corresponding target genes were identified and cloned. Recent high-quality assemblies of G. barbadense and G. hirsutum [2, 38–42] has provided good references for linkage map-based QTL identification. In light of these resources, we constructed an interspecific F2 population between G. hirsutum and G. barbadense and performed whole-genome sequencing of all 249 F2 individuals, achieving resequencing data on average over 5 × genome coverage of each material and generating a high-density genetic map containing 6303 bin markers. Based on the map, we subsequently identified 112 QTLs associated with an array of traits including plant type traits and physiological traits at the seedling stage, leaf chlorophyll content, plant type traits at flower and boll stage, yield traits, and fiber quality traits. Combining the SNPs located within the predicted genes in the target region and their expression pattern of the predicted genes, possible causable genes that are responsible for the mapping traits were identified. These QTLs and the related candidate genes are valuable in cotton breeding to improve plant biomass, physiological characteristics, and yield quality.

Methods

Plant materials and DNA extraction

The plant materials used consisted of G. hirsutum acc. TM-1 supplied by Dr. Kohel of Southern Plain Agricultural Research Center, USDA [43] and G. barbadense cv. Hai7124 which was selected by Cotton Research Institute of Nanjing Agricultural University for genetic research [17]. TM-1 is a genetic standard line of G. hirsutum developed through single plant selection. Hai7124, grown extensively in China, was also the offspring of a single plant selection before being used as a parent in the construction of the linkage map. Two highly homozygous parents, as well as 249 F2 individuals derived from a cross between TM-1 as the recipient and Hai7124 as the donor were planted in Pailou greenhouse of Nanjing Agricultural University, Jiangsu, China. Genomic DNA was extracted from young leaf tissues following the method cetyltriethylammnonium bromide (CTAB) described by Paterson [44] with increased RNase A and proteinase K treatment to prevent RNA and protein contamination. The isolated DNA was then subjected to Illumina sequencing technology. To obtain the phenotypic data of two parents, F1, and all 249 F2 individual plants at different environments. All of them were cut off the trunk, transferred in the large nutrient bowls, and moved into the greenhouse in autumn. In the next spring, these materials were planted in the field for investigation of yield and fiber traits. The same operation was repeated twice in 2011 and 2012.

Phenotype data collection and evaluation

Plant type traits at seedling stage

The following plant type traits of the parents, F1, and F2 individual plants respectively were investigated at the cotton seedling stage: plant height (PH1, cm); cotyledonary node height (CNH, cm); first true leaf height (FTLH, cm); second true leaf height (STLH, cm); distance between the cotyledonary node and first true leaf (D1, cm); and distance between first true leaf and second true leaf (D2, cm). Each measurement was repeated three times and the average value was used in the analysis.

Physiological traits at seedling stage

Physiological characteristics such as leaf area and photosynthetic rate were measured in the parents, F1, and F2 individual plants at the cotton seedling stage. A portable leaf area meter (CI-202, Portable Laser Leaf Area Meter, USA) was used to measure the second true leaf area (SLA, cm2). At the same time, from 8:00 to 11:00 in the morning on a sunny day, a Li-6400 portable photosynthesis instrument was used to determine the photosynthesis ratio (Pn, μmol CO2·m−2·s−1) of the second true leaf. Also measured were intercellular CO2 concentration (Ci, μmol·mol−1), stomatal conductance (Cond, mmol·m−2·s−1), and transpiration rate (Tr, g·m−2·h−1). The intensity of the built-in light source was set to 1200 μmol·m−2·s−1, each leaf was measured three times, and the average value was used in the analysis. For instrument principle, sampling technique, and detailed settings, refer to "Using the LI-6400 Portable Photosynthesis System."

Determination of chlorophyll content in leaves

The leaf chlorophyll content of the parents, F1, and F2 individual plants was determined by UV/visible spectrophotometer. The main stem functional leaves were collected from each individual plant, and ten pieces were cut out with a 9-mm punch and weighed. About 0.1–0.2 g leaves were then placed in a 10-ml test tube, the fresh weight recorded, 10 ml of 95% ethanol added, and the tube sealed and stored for 48 h in the dark. Tubes were shaken in the middle of the incubation and mixed until the leaves were completely white. After the incubation, the extracted chlorophyll of each sample was placed in a spectrophotometer and the optical density was measured at 665 nm, 649 nm, and 470 nm to respectively determine chlorophyll a (Chl a), chlorophyll b (Chl b), and carotenoid (Car) content. Subsequently, the chlorophyll a/b ratio (Chl a/b) and total chlorophyll (Total Chl) were calculated. Each sample was repeated three times, and the average was taken as the result. Pigment concentrations were calculated according to the following formulas: in which Ca, Cb, and Cx•c represent the concentration in mg/L of chlorophyll a, chlorophyll b and carotenoids, respectively. The pigment content of the leaves was then calculated as follows: where C represents the pigment concentration (mg/L), V represents the total amount of extract (ml), W represents the fresh weight of the sample (g), and the subscript x represents the pigment: chlorophyll a or b, or carotenoids.

Plant type traits at flowering and boll stage

Plant height (PH2) and fruit branch number (FBN) were investigated at the first flowering and boll stage in the parents, F1, and F2 individual plants.

Yield traits

Yield constituent factors were assayed during the boll opening stage. The traits investigated consisted of boll number per plant (bolls/plant, BN), seed cotton yield (SCY), lint yield (LY), boll weight (BW), lint percentage (LP), lint index (LI), and seed index (SI).

Fiber quality traits

Middle and upper fibers were collected from the parents, F1, and F2 individual plants and sent for testing at the Cotton Quality Supervision, Inspection and Testing Center of the Ministry of Agriculture (HVI SPECTRUM 4.05.01 version, HVICC calibration level). Tested fiber quality properties included: fiber length (FL), fiber strength (FS), micronaire value (MIC), fiber length uniformity (FU), fiber elongation (FE). Due to high temperatures and too much rain in the summer of 2011, which caused abortion of pollen and super-separation of the sea-land hybrid population, some families failed to receive enough mature fiber, resulting in a lack of yield and quality trait data in some lines.

Population DNA preparation, resequencing, and genotyping

Sequencing libraries were constructed with an insert size of 150 bp and sequenced on the Illumina HiSeq 2000 platform (Illumina, San Diego, CA, USA). To construct paired-end libraries, DNA was fragmented by sonication, and DNA ends were blunted before adding an A base to each 3′ end. DNA adaptors with a single T-base 3′ end overhang were ligated to the above products. Ligation products were purified on 2% agarose gels that each targeted a specific range of insert sizes. Quantification and quality assessment were carried out by running 1 μL of the library on an Agilent DNA 1000 LabChip analyzer (Agilent Technology 2100 Bioanalyzer). All raw reads were processed for quality control and filtered using fastp (https://github.com/OpenGene/fastp) with default parameters. The clean reads were mapped to the TM-1 reference genome [38] using Burrows–Wheeler Aligner (BWA) with the parameters of ‘mem -t 20 -M -R’. The mapping results were sorted and duplicates marked using functions implemented in SAMtools and Picard (http://broadinstitute.github.io/picard/). Only reads that mapped uniquely to the reference genome sequence were used to call SNPs. Identification of SNPs between the parental lines and F2 individuals was performed with Genome Analysis Toolkit 4 (GATK4). High-quality SNPs were filtered following the best practices workflow developed by the GATK team. SNPs with minor allele frequency (MAF) < 5% and represented in less than 30% of the F2 population were excluded using VCFtools. Polymorphic markers between the two parental lines were retained if they had the aa × bb segregation pattern in F2 individuals.

Bin map construction

Recombinant breakpoints were identified using a slightly modified sliding window approach based on the ratio of SNP alleles derived from TM-1 and Hai7124 [38]. Consecutive 100-Kb intervals having the same genotype in the whole F2 population were merged as a recombination bin. Bins with significantly distorted segregation (P-value < 0.001) were filtered using the Chi-square test, and those remaining served as genetic markers for the construction of a genetic linkage map using Icimapping [45]. Collinearity between the genetic map and physical positions was visualized using ALLMAPS (https://github.com/tanghaibao/jcvi/wiki/ALLMAPS). A region containing three or more closely linked bins that exhibited significant segregation distortion (P < 0.001) was defined as an SDR.

Statistics of phenotypic traits

For all traits, ANOVA was used to test for significant differences between parents, F1, and F2 individuals, and correlation coefficients and phenotypic variation were also calculated using SPSS v18.0 (SPSS, Chicago, IL, USA). The heterosis (H) of each trait is expressed by two values, mid-parent heterosis and over-parent heterosis: MH = (F1-MP)/MP × 100%, where MP is the average value of the parents.

QTL mapping

IciMapping 3.0 (http://www.isbreeding.net) was used to detect the effects of QTLs in the F2 population. An LOD threshold of 2.5 was used to define significant additive QTLs; that is, when LOD ≥ 2.5 for a marker interval, it was considered to contain a significant QTL. At the same time, the additive effect (A), dominant effect (D), and contribution rate (R2) of each QTL on corresponding traits were calculated. The QTL genetic action mode uses the absolute value of D/A to judge the action effect of each QTL; a value greater than 1.20 indicates an over dominant effect, 0.81–1.20 a dominant effect, and 0.21–0.80 a partially dominant effect. Less than 0.20 indicates an additive effect. The method of naming QTLs follows that used for rice: QTL + traits + chromosome + QTL number.

Candidate gene identification and expression

The putative candidate genes for the QTLs were predicted as follows. First, we analyzed the SNP types located in QTLs based on our assembled genome sequence for TM-1. We focused on significantly associated nonsynonymous SNPs located in exons or SNPs in the upstream of the candidate genes. Second, based on expression profiling data for sixteen vegetative and reproductive tissues from TM-1 (cotton.zju.edu.cn). We checked whether these selected candidate genes were dominantly and/or specific expressed in a development stage that is critical for the target trait. We further narrowed down the candidate genes according to their expression levels between TM-1 and Hai7124 (cotton.zju.edu.cn).

Results

High-density genetic map construction and characteristics of the bin marker loci

We developed an interspecific F2 population from a cross between G. hirsutum acc. TM-1 and G. barbadense cv. Hai7124, which contained 249 individuals in total. Whole-genome sequencing of all individuals was performed on an Illumina Hiseq2000. In total, 3.01 Tb clean reads were generated, with an average of 5.3 × depth genome coverage for each individual. For the parents ‘TM-1’ and ‘Hai7124’, we utilized clean data from our previous research totaling 185 Gb and 111.8 Gb respectively [20], with an average depth of over 50 × . All clean reads were mapped to the TM-1 as the reference genome. After filtering SNPs by established criteria, a total of 4,257,943 high-quality SNPs (Fig. 1) were retained and used to generate bin markers (a group of consecutive SNPs in the same block for genotyping) with a modified sliding window approach [33]. After filtering 1428 bins that exhibited significant segregation distortion (P < 0.001), a total of 6303 bin markers were generated, with an average length of 363.1 Kb (Table 1, Fig. 1). Finally, the high-density genetic map was constructed, covering 5057 cM with an average inter-bin genetic distance of 0.8 cM (Fig. 1, Table 1). The 26 linkage groups of the map was corresponding to 26 cotton chromosomes. Each of the linkage group contained 242.4 bins on average, ranging from 157 (D04) to 405 (A11), overall comprising 3,455 in the A subgenome and 2,848 in the D subgenome. The total length of the A subgenome was 2663.24 cM, and that for the D subgenome was 2393.89 cM. The longest linkage group was A11 of 284.58 cM, and the shortest one was A08 of 126.68 cM. The largest average distance between markers was 1.1 cM in the D07 linkage group, while the smallest average distance between markers in A10 was 0.64 cM (Table 1).
Fig. 1

High-density genetic map construction of the (TM-1×Hai7124)F2 population. A Bin maps for the 241 scored F2 individual lines. Colored tracks represent the 241 individual lines of the THF2 population that were used for linkage map construction: red, alleles inherited from maternal parent (TM-1); green, alleles inherited from paternal parent (Hai7124); blue, alleles inherited from heterozygous genotype (TM-1 × Hai7124)F1. The horizontal scale indicates physical distance. B Distribution of markers across 26 chromosomes; ordinate is genetic distance, cM. C Genetic map quality as indicated by recombination fractions of all markers

Table 1

Characteristics of the 26 linkage groups in allotetraploid cotton

ChrNo. binsDistance (cM)Average distanceAverage length of bin (Kb)Gap number (> 10 cM)Recombination rate (cM/Mb)RHRs
A01234168.320.72505.011.42104
A02197159.410.81549.601.4776
A03254180.20.71439.311.61113
A04239234.940.98366.912.68110
A05256272.681.07433.022.46147
A06206167.240.81614.001.3299
A07255191.840.75378.801.99138
A08189126.690.67661.701.0178
A09262189.010.72317.602.27150
A10326208.360.64353.101.81151
A11405284.580.7299.702.34219
A12284210.330.74378.801.95166
A13348269.640.77317.122.44166
At-Total34552663.240.78431.8971.781717
D01232154.670.67278.902.39119
D02257199.830.78271.502.86115
D03173152.180.88311.502.8291
D04157134.930.86362.602.3795
D05282233.470.83226.703.65207
D06221180.130.82296.212.75103
D07173189.571.10337.713.25107
D08217187.010.86318.312.71127
D09185185.671.00281.123.57121
D10228199.290.87293.322.98126
D11317218.190.69225.103.06200
D12195195.581.00316.403.17143
D13211163.370.77305.402.53109
Dt-Total28482393.890.86294.2172.851663
Total63035057.130.80363.1142.213380
High-density genetic map construction of the (TM-1×Hai7124)F2 population. A Bin maps for the 241 scored F2 individual lines. Colored tracks represent the 241 individual lines of the THF2 population that were used for linkage map construction: red, alleles inherited from maternal parent (TM-1); green, alleles inherited from paternal parent (Hai7124); blue, alleles inherited from heterozygous genotype (TM-1 × Hai7124)F1. The horizontal scale indicates physical distance. B Distribution of markers across 26 chromosomes; ordinate is genetic distance, cM. C Genetic map quality as indicated by recombination fractions of all markers Characteristics of the 26 linkage groups in allotetraploid cotton A total of fourteen gaps that larger than 10 cM were distributed across the all 26 chromosomes, seven at the A subgenome and seven at the D subgenome. The average ratio of bin marker interval (< 5 cM) for all linkage groups was more than 99%. A region containing three or more closely linked bins that exhibited significant segregation distortion (P < 0.001) was considered a segregation distortion region (SDR). There were 88 and 32 SDRs in the A and D subgenome, respectively (Table 1). The quality of the genetic map was further examined by comparing genetic and physical distances, which showed good collinearity (Supplementary Fig. 1). Chi-square tests of the 6303 co-dominance bins identified 724 that do not conform to the 1:2:1 genetic law ratio of Mendelian theory. Among these 724 partial segregation bins, 86 were biased toward the parent TM-1, 638 toward the parent Hai7124, and none toward the heterozygote. In addition, significantly more of the partial segregation bins were located on the A subgenome (450) than on the D subgenome (274), and these bins comprised a higher proportion of the A subgenome (13.02%) than of the D subgenome (9.62%). Moreover, the partial segregation bins were unevenly distributed across the 26 chromosomes; the ratio of partial segregation bins to total bins in a given chromosome was more than 30% on chromosomes A05, A11, and D07 and more than 20% on A08, D09, and D10, but less than 1% on A01, A03, and D01. At the same time, some partial segregation bins exhibited an aggregation phenomenon; namely, bins distributed on four chromosomes (A05, A11, D07, and D08) account for 45% of all partial segregation bins (Supplementary Table 1). To provide a comprehensive overview of recombination in cotton, the recombination rate along each chromosome was estimated by comparing genetic and physical distances. Across the entire genome, the average recombination rate was 2.2 cM/Mb. High rates of recombination were observed in the telomere regions of all nine chromosomes, whereas recombination was suppressed in centromere regions (Fig. 2). Chromosomal regions with recombination rates greater than 1.0 cM/Mb [37] were defined as recombination hot regions (RHRs). A total of 3380 RHRs were identified, and were unevenly distributed on the 26 chromosomes (Table 1, Fig. 2).
Fig. 2

Chromosomal features of (TM-1 × Hai7124)F2 population with genetic data. A The length of 6303 bins along each chromosome; B The bin marker placements in the genetic maps on the chromosome; C SNP (aa × bb) density of each chromosome; D Recombination rates of each chromosome; E Genetic positions of the RHRs in each chromosome; F Structural variations density of each chromosome

Chromosomal features of (TM-1 × Hai7124)F2 population with genetic data. A The length of 6303 bins along each chromosome; B The bin marker placements in the genetic maps on the chromosome; C SNP (aa × bb) density of each chromosome; D Recombination rates of each chromosome; E Genetic positions of the RHRs in each chromosome; F Structural variations density of each chromosome

Analysis of 35 traits in parents and F1 and F2 generations

We surveyed 35 traits in the parents and F1 and F2 generations, including six plant type traits, ten leaf morphology and physiological traits at the seedling stage, five leaf chlorophyll content traits, two plant type traits at flower and boll stage, seven yield traits, and five fiber quality traits (Supplementary Table 2). TM-1, Hai7124, and their F1 progeny differed to varying degrees in plant type, leaf morphology, physiology, yield, and fiber quality. Concerning plant type traits, TM-1 and F1 had extremely significant differences; TM-1 and Hai7124 likewise had extremely significant differences, except in CNH; but Hai7124 and F1 had no extremely significant differences in traits except for D1. Regarding leaf morphology and physiological traits, TM-1 exhibited extremely significant difference from Hai7124 and from F1 only in SLA and SPn; other traits were not significantly different among the three. In terms of chlorophyll content, TM-1 and F1 exhibited extremely significant differences; TM-1 and Hai7124 likewise had extremely significant differences in traits other than Chla; but Hai7124 and F1 did not differ significantly except in Chla/b. With regard to the 12 yield and fiber traits, G. hirsutum and G. barbadense are characterized by extremely significant differences; most of these characteristic differences were observed in comparisons of TM-1 and Hai7124 and of TM-1 and F1 individuals. When comparing Hai7124 and F1, only the five traits D1, SCY, LY, SI, and LI differed significantly, indicating that the F1 progeny of G. barbadense and G. hirsutum are more biased towards the G. barbadense phenotype. Taken together, these genetic differences provide a good basis for the screening of important trait QTLs (Supplementary Table 3). In the F2 population, the average value and variance of each trait exhibited large changes relative to their parents, and the coefficients of variation differed between traits. Overall, physiology and yield traits featured the largest coefficients; the values for each yield component ranked as follows: BN > SCY > LY > LI > BW > SI > LP. This ranking indicates that in the offspring, different degrees of genetic variation are present for different traits, indicating that these traits are controlled by multiple genes (Table 2, Supplementary Fig. 2).
Table 2

Phenotypic variation of 35 traits

TraitsNumber of individualsMeanMean squared errorMinMaxStandard deviationvarianceSkewnesskurtosisCV (%)mid-parent heterosis
PH124920.40.239.3032.303.6913.61-0.410.1418.08-4.23
CNH2497.590.083.5011.401.321.750.110.2817.419.21
FTLH24915.420.178.2023.002.737.470.11-0.217.736.53
STLH24718.620.28.9030.503.149.84-0.20.4816.85-2.10
D12497.830.141.9014.102.154.630.360.0927.494.05
D22473.190.110.209.801.793.210.640.2556.12-29.89
SLA24930.60.547.9361.668.5472.870.430.6827.904.42
TLA23325.070.644.9954.899.8096.080.34-0.1939.10-3.00
SPn2499.240.250.7418.643.9515.640.05-0.7142.796.39
TPn24911.620.253.2420.893.9815.85-0.18-0.7934.271.18
SCi249235.773.1484.24325.2449.532452.76-0.640.1021.011.34
TCi249237.193.1330.12369.8049.372437.74-0.590.7820.823.63
SCond2490.130.010.010.370.080.010.56-0.5163.7823.81
TCond2490.160.010.020.440.090.010.42-0.554.3714.29
STr2494.640.190.4912.902.928.540.63-0.5663.0216.88
TTr2495.530.180.7112.832.898.380.59-0.4452.3410.82
Chl a2400.780.010.371.270.130.020.301.4916.424.00
Chl b2400.280.000.150.440.040.000.340.9615.6912.00
Car2400.160.000.070.270.030.000.251.1917.30-5.88
Chl a/b2402.820.012.043.210.170.03-0.421.376.06-6.16
Total Chl2401.060.010.531.710.170.030.321.3916.026.00
PH2238144.012.2648.00220.0034.941220.57-0.660.4124.2648.21
FBN23813.080.263.0024.004.0616.50-0.37-0.0431.05-12.80
BN22413.620.520.0042.007.8461.401.141.1357.55-34.09
SCY21548.811.874.08142.8327.45753.451.000.6756.24-36.10
LY21514.720.551.1140.328.0865.360.890.3454.93-41.27
BW2203.560.061.577.470.920.850.751.1825.93-13.59
SI22010.450.135.8520.001.993.960.672.0219.04-3.86
LP22030.650.3620.7952.385.3828.980.971.6317.56-7.49
LI2204.640.082.2711.141.261.581.394.8727.09-13.91
FL7729.690.2425.4135.432.064.260.21-0.096.95-3.49
FS7730.260.3425.0040.903.029.130.891.359.98-5.19
MIC773.140.072.004.790.630.400.23-0.3120.03-28.15
FU7783.90.2178.888.201.813.29-0.320.192.16-1.53
FE776.260.055.207.980.470.221.312.847.43-5.22
Phenotypic variation of 35 traits

QTL mapping of important agronomic traits in cotton

A total of 112 QTLs, 41 in the A subgenome and 71 in the D subgenome, distributing across almost all 26 chromosomes except A03, A08, and D08, were assessed for association with 35 traits using ICIM analysis. The position, LOD score, additive effects, dominance effect, and percentage of phenotypic variance explained (PVE) of the QTLs are given in Table 3. Among them, 16 QTLs were located overlapped with the QTL regions in the previous studies (Table 4). PVE values ranged from 2.95 to 24.89%. The regions occupied by identified QTLs ranged in size from 0.20 to 8.45 Mb, with an average length of 0.78 Mb. With respect to traits, the number of QTLs per trait ranged from 0 to 10 with the most QTLs (up to 10) being detected for STr.
Table 3

Analysis of QTLs for 35 traits

TraitQTLChromsomeLeft MarkerRight MarkerLeftCI (cM)RightCI (cM)StartEndLODPVE(%)AddDom
PH1qPH1-A13A13bin395bin396248.5252.5107,859,745108,097,1373.166.89-1.09-0.8
PH1qPH1-D05D05bin223bin229155.5167.539,478,28341,627,41236.47-1.210.7
CNHqCNH-A07A07bin3bin406.5330,907915,6213.376.65-0.320.51
CNHqCNH-A12A12bin454bin455181.5186.5104,326,658104,861,6143.958.350.5-0.1
FTLHqFTLH-A04A04bin2bin32.56.5295,380912,8942.986.3-0.49-1.1
STLHqSTLH-A13A13bin393bin394245.5248.5107,403,460107,859,7453.024.86-0.98-0.25
STLHqSTLH-D01D01bin57bin5851.554.510,754,69811,063,1723.044.931.09-0.06
STLHqSTLH-D05-1D05bin221bin227151.5154.552,507,47852,707,4782.984.55-0.551.2
STLHqSTLH-D05-2D05bin250bin251173.5174.538,370,76240,442,8553.225.01-0.631.21
D1qD1-A01A01bin58bin5953.555.512,558,16313,053,1042.755.620.680.13
D1qD1-A04A04bin2bin31.55.5295,380912,8943.447.18-0.25-1.02
D2qD2-A11A11bin7bin5529.548.5807,4289,258,4603.684.46-1.3-1.28
SLAqSLA-A09A09bin176bin177103.5105.563,066,21163,735,0592.627.160.383.73
SPnqSPn-D05D05bin59bin6043.544.58,567,7858,789,6524.357.561.60.01
SPnqSPn-D06D06bin6bin701.5617,114818,1402.874.890.28-1.78
SPnqSPn-D12D12bin138bin139107.5110.547,167,18047,756,2493.315.69-0.011.87
SPnqSPn-D13D13bin209bin211135.5139.561,357,80961,953,2853.245.37-1.20.71
TPnqTPn-A01A01bin48bin4943.551.59,757,02310,168,6322.565-0.621.4
TPnqTPn-A06A06bin40bin4143.544.58,721,8879,167,4583.336.91-0.12-1.88
TPnqTPn-D12D12bin187bin188148.5149.555,359,86755,595,8463.337.030.071.92
SCiqSCi-A01A01bin43bin4442.545.58,036,0848,969,6462.634.436.7818.33
SCiqSCi-A04A04bin182bin183145.5146.557,244,81057,548,8502.785.031.72-22.14
SCiqSCi-D04-1D04bin32bin3328.532.54,450,0784,794,1453.195.67-7.0321.69
SCiqSCi-D04-2D04bin131bin13286.588.546,227,65747,544,3472.944.94-4.56-20.82
SCiqSCi-D13D13bin208bin209135.5139.561,187,33761,554,7102.54.3-14.51.89
TCiqTCi-A04A04bin103bin104101.5103.533,378,34633,609,6083.233.33-3.67-23.45
TCiqTCi-A11A11bin7bin5514.553.5807,4289,258,4602.7512.52-21.4936.36
TCiqTCi-D04-1D04bin32bin3328.532.54,450,0784,794,1452.862.95-2.4122.13
TCiqTCi-D04-2D04bin131bin13286.588.546,227,65747,544,3473.323.21-5.82-21.48
SCondqSCond-A10A10bin352bin353196.5202.5109,106,699109,336,0222.53.910.020.02
SCondqSCond-A12A12bin423bin425158.5159.598,555,40099,115,0523.916.420.03-0.01
SCondqSCond-D04D04bin142bin14388.594.550,189,43450,421,5872.594.03-0.01-0.03
SCondqSCond-D05D05bin51bin5239.540.57,536,4777,742,5955.198.670.03-0.01
SCondqSCond-D06D06bin6bin701.5617,114818,1402.94.740.01-0.04
SCondqSCond-D13D13bin196bin197121.5123.559,385,04859,708,1593.575.9-0.030.01
TCondqTCond-A12A12bin427bin430157.5164.599,337,542100,288,6352.534.620.03-0.02
TCondqTCond-D04-1D04bin128bin12981.585.545,564,06345,808,1863.426.18-0.01-0.04
TCondqTCond-D04-2D04bin131bin13286.587.546,227,65747,544,3473.626.52-0.02-0.04
STrqSTr-A05A05bin119bin12095.5103.520,625,36021,148,7832.823.30.581.03
STrqSTr-A07A07bin285bin286141.5145.588,546,00488,983,6612.843.30.76-0.67
STrqSTr-A10A10bin228bin229120.5124.577,749,54078,563,7872.83.290.950.07
STrqSTr-A12A12bin7bin88.519.51,418,4151,932,4282.532.960.51-0.89
STrqSTr-D02D02bin66bin6355.559.511,571,33111,240,3513.84.38-0.55-1.17
STrqSTr-D04D04bin23bin2416.518.52,781,5713,052,4122.773.34-0.491.02
STrqSTr-D05D05bin51bin5239.540.57,536,4777,742,5954.896.121.23-0.11
STrqSTr-D06D06bin6bin701.5617,114818,1403.744.630.15-1.49
STrqSTr-D10D10bin7bin89.515.51,651,9641,902,9012.863.32-0.91-0.05
STrqSTr-D13D13bin196bin197121.5123.559,385,04859,708,1592.793.45-0.870.11
TTrqTTr-A12A12bin8bin912.521.51,656,7022,140,8123.566.60.69-1.07
Chl aqChl a-A13-1A13bin270bin278145.5159.585,560,44986,656,9392.734.64-0.04-0.03
Chl aqChl a-A13-2A13bin286bin287173.5184.580,629,13283,806,9162.695.21-0.04-0.01
Chl aqChl a-D01D01bin239bin240136.5146.562,926,80463,199,9013.055.09-0.020.05
Chl aqChl a-D03D03bin153bin154114.5119.548,191,31848,417,9334.27.9700.07
Chl aqChl a-D09D09bin213bin214158.5160.547,673,82948,167,0592.534.43-0.040
Chl bqChl b-A13A13bin289bin290173.5184.587,115,05387,473,1652.75.01-0.01-0.01
Chl bqChl b-D01D01bin239bin240136.5146.562,926,80463,199,9012.795.22-0.010.02
Chl bqChl b-D02D02bin252bin253145.5153.562,524,67262,831,3432.534.78-0.010.02
Chl bqChl b-D03D03bin153bin154114.5119.548,191,31848,417,9333.978.1-0.010.02
CarqCar-A12A12bin314bin31589.593.576,853,35777,247,2712.845.5700.01
CarqCar-D03D03bin153bin154114.5119.548,191,31848,417,9332.926.4600.01
Chl a/bqChl a/b-D05D05bin276bin277184.5194.557,297,69957,671,5732.96.81-0.05-0.05
Chl a/bqChl a/b-D06D06bin228bin229123.5125.559,098,82059,707,3622.826.74-0.050.04
Chl a/bqChl a/b-D12D12bin172bin173125.5138.552,342,56753,232,6132.566.27-0.050.05
Total ChlqTotal Chl-A13A13bin48bin4947.549.55,904,9876,263,0872.514.46-0.05-0.01
Total ChlqTotal Chl-D01D01bin238bin239136.5147.562,826,80463,033,9402.674.49-0.030.06
Total ChlqTotal Chl-D03D03bin153bin154114.5118.548,191,31848,417,9334.999.49-0.020.1
PH2qPH2-A11A11bin414bin439258.5264.592,237,50597,642,8142.664.320.32-15.33
PH2qPH2-D01D01bin209bin210111.5117.559,207,02159,470,3542.734.047.0510.07
PH2qPH2-D03D03bin153bin154114.5119.548,191,31848,417,9334.066.47-13.14-0.61
PH2qPH2-D06D06bin59bin6060.566.57,355,7577,642,3652.663.949.934.05
PH2qPH2-D09D09bin215bin216158.5165.548,167,05948,412,5672.543.774.1511.91
PH2qPH2-D12D12bin221bin222189.5192.560,587,23160,897,0334.867.8714.21-1.36
FBNqFBN-A04A04bin39bin4058.571.57,802,3658,134,3662.643.821.13-0.96
FBNqFBN-A05A05bin44bin4534.535.57,019,1767,291,9763.585.290.47-1.89
FBNqFBN-A06A06bin22bin2326.535.53,658,6244,677,8443.976.640.23-2.18
FBNqFBN-D04D04bin51bin5245.548.57,626,8947,826,8942.83.880.72-1.34
FBNqFBN-D11D11bin89bin9073.575.514,642,79015,006,0023.445.2-0.37-1.83
FBNqFBN-D12D12bin106bin10777.578.541,087,55341,476,8163.45.05-1.330.37
BNqBN-D05D05bin104bin10570.576.515,695,73816,137,5832.554.66-2.13-2.04
BNqBN-D10D10bin11bin2219.543.52,290,2404,417,0872.855.342.581.96
SCYqSCY-D07D07bin231bin232188.518957,783,34158,417,1782.535.541.9212.2
SCYqSCY-D10D10bin22bin2121.545.54,200,6664,417,0872.675.858.647.69
LYqLY-D07D07bin231bin232188.518957,783,34158,417,1782.646.040.743.6
BWqBW-A10A10bin57bin5848.552.510,608,26710,844,3473.557.6-0.32-0.22
SIqSI-D01D01bin101bin10373.574.519,943,73021,931,2082.686.140.050.91
SIqSI-D05D05bin10bin118.510.52,123,4512,323,4513.78.560.80.13
LPqLP-A07A07bin107bin10883.587.519,417,63919,813,4912.664.84-0.56-2.13
LPqLP-D09D09bin221bin222164.5165.549,209,12649,409,1273.486.77-1.94-0.37
LPqLP-D10D10bin22bin2136.545.54,200,6664,417,0874.438.69-2.480.53
LIqLI-D09D09bin225bin226168.5169.549,640,72549,892,9673.416.69-0.44-0.05
LIqLI-D10D10bin21bin5436.552.54,200,66610,690,8593.768.08-0.540.14
LIqLI-D12D12bin191bin192149.5155.555,816,59456,150,6592.835.170.32-0.3
FLqFL-A06A06bin26bin2735.539.55,979,7016,426,2752.9213.58-0.940.85
FLqFL-A10A10bin53bin5049.551.59,611,1449,322,1252.8214.170.99-0.94
FLqFL-D12D12bin109bin11075.584.541,741,94642,083,7752.5311.91.17-0.25
FSqFS-A02A02bin279bin280152.5156.5104,826,173105,601,2375.3717.110.212.46
FSqFS-A11A11bin129bin130108.5110.521,948,34322,148,3432.968.550.28-1.65
FSqFS-D02D02bin94bin9568.575.519,742,79520,003,2893.7411.130.3-1.95
FSqFS-D12D12bin154bin155124.5127.549,828,94250,092,2277.5124.892.34-0.74
MICqMIC-A13A13bin343bin344206.5208.596,488,82297,053,4062.9314.40.310.05
MICqMIC-D01D01bin58bin5951.554.510,890,90511,226,4344.4421.98-0.370.2
MICqMIC-D02D02bin3bin45.57.5620,0591,091,7992.6311.750.03-0.39
MICqMIC-D03D03bin10bin1115.518.51,178,4221,526,8553.0512.350.22-0.3
FUqFU-A13A13bin233bin234134.5136.569,500,82470,270,3686.8620.510.26-1.6
FUqFU-D03D03bin13bin1422.529.51,728,3642,715,3883.178.540.680.32
FUqFU-D06D06bin37bin3840.547.54,733,9574,933,9573.8210.620.08-1.24
FUqFU-D12-1D12bin10bin1116.522.559,195,55459,480,9274.5412.670.22-1.3
FUqFU-D12-2D12bin212bin213170.5175.51,425,5341,957,3835.1515.290.121.38
FEqFE-A10A10bin217bin218115.5117.575,292,93075,555,4282.557.690.19-0.19
FEqFE-D05-1D05bin94bin9569.574.560,252,76760,452,7674.0811.71-0.26-0.37
FEqFE-D05-2D05bin295bin296205.5207.514,282,82314,559,5012.67.86-0.2-0.13
FEqFE-D09D09bin131bin15186.599.533,095,13037,366,0303.7620.30.39-0.44
Table 4

Sixteen QTLs detected in this study overlapped with that identified in previous studies

This StudyPrevious Report
NameChrStartEndReportedStartEndQTLLODR2ParentsRef
FBND047,626,8947,826,894FBN4,447,1158,199,618qFBN-22–23.8417.3TM-1*CSB22sh[46]
BND0515,695,73816,137,583BN15,468,70317,887,617qBN-D5-11.663.5(STV2B*Foster6)*(DPL15*CRI7)[47]
LPD104,200,6664,417,087LP4,294,7695,493,007Cs9_PF_20_(5.27 +)、Cs8_PF_20_(6.57 +)5.2719.10Guazuncho2(Gh)*VH8-4602(Gb)[48]
FLA065,979,7016,426,275FL4,440,3916,580,618qFLchr64.547.80(Emian22*3–79)*Emian22[49]
FLA065,979,7016,426,275FL4,439,4926,580,618qChr06FLNANA(Emian22*3–79)*Emian22[50]
FLA109,611,1449,322,125FL7,964,14810,069,511qFL-A10-1*3.336.33Zhongmiansuo12*8891[51]
FLA109,611,1449,322,125FL7,964,14810,069,511qFL-A10-1_4.3210.46Zhongmiansuo12*8891[52]
FSA02104,826,173105,601,237FS104,227,599105,227,815qFS-C2-13.955.69Lumianyan22*Luyuan343[53]
FSD0219,742,79520,003,289FS19,710,78120,711,127qFS-D2-18.1214.4(Simian3*Sumian12)*(Zhong4133*8891)[54]
FMD02620,0591,091,799FM456,5941,457,031qFM-D2-13.596.80(Simian3*Sumian12)*(Zhong4133*8891)[54]
FMD02620,0591,091,799FM991,6913,224,078qFM-D2-212.8221.92CRI12*J8891[55]
FMD02620,0591,091,799FM456,5943,223,772qFMIC-D2-1_3.329.1Zhongmiansuo12*8891[52]
FMD02620,0591,091,799FM456,5943,224,164qFMIC-D2-1*8.0112.49Zhongmiansuo12*8891[51]
FUD031,728,3642,715,388FU2,695,2543,695,267TC-qFU-c17-23.115.7CRI36*Hai7124[56]
FEA1075,292,93075,555,428FE72,032,34279,020,626qFE-10-1b30.9311.8HS427-10*TM-1[57]
FED0514,282,82314,559,501FE13,336,54114,337,210qFE-D5-12.917.49Zhongmiansuo12*8891[52]
Analysis of QTLs for 35 traits Sixteen QTLs detected in this study overlapped with that identified in previous studies Twelve QTLs were associated with plant type traits at seedling stage, most of which (75%) had positive effects and originated from TM-1, suggesting that G. hirsutum has a growth advantage in the seedling stage. Among these QTLs, the PVE varied from 4.46 to 8.35%; the QTL qCNH-A12 with the highest PVE (8.35%) had positive effects and came from Hai7124. Thirty-seven QTLs were detected for leaf morphology and physiological traits at seedling stage, featuring positive effects and coming from both TM-1 and Hai7127 (19 and 18 QTLs, respectively). We found that all nine QTLs associated with intracellular CO2 concentration had positive effects and originated with TM-1, and 7/9 demonstrated positive effects, which is the main component of heterosis. A total of 17 QTLs were identified for leaf chlorophyll, with PVE values ranging from 4.43 to 8.1%; both the additive and dominant effects of these QTLs were close to 0. Twenty-six QTLs were identified for yield or yield-related traits. Most QTLs associated with qPH2 and qFBN, and all those with qSCY, qLY, and qSI, exhibited positive effects and came from Hai7124. Meanwhile, QTLs having positive effects associated with qBW, qLP, and qLI came from TM-1, suggesting that G. barbadense has a larger biomass but G. hirsutum has higher fiber yield. Of QTLs associated with fiber quality traits, 80% of those having positive effects came from Hai7124; only four QTLs (qFL-A06, qMIC-D01, qFE-D05-1, and qFE-D05-2) with positive effects originated from TM-1. This result indicated that the genetics governing excellent fiber quality come from G. barbadense. All QTLs and the corresponding location information, LOD, PVE, additive effect, and dominant effect values were presented in Table 3 and Fig. 3.
Fig. 3

Chromosomal distribution of QTLs associated with 35 traits. Black lines indicate QTL positions on the chromosomes

Chromosomal distribution of QTLs associated with 35 traits. Black lines indicate QTL positions on the chromosomes

Candidate gene identification and expression analysis

We identified ten genes that has nonsynonymous SNPs in exons or SNP in their upstream regions was located within the 15 loci of interest for 14 traits (D1, CHN, FLTH, PH1, Sci, TCi, Tcond, BW, SCY, LP, Li, SPn, TPn, TTr). We analyzed their expression in sixteen vegetative and reproductive tissues of TM-1 and compared values with those in Hai7124 (Supplementary Fig. 2, Supplementary Table 4, 5). Some SNP variants corresponding to QTLs associated with different traits were mapped to the same position or related to the same gene, such as GH_A04G0054/GB_A04G0055, GH_D04G1426/GB_D04G1512, and GH_D10G0500/GB_D06G1730 (Supplementary Table 4, 5). A representative QTL that related to multiple traits BN, SCY, LP, and LI was located on chromosome D10 (Fig. 4A). This locus encompassed fourteen genes harboring nonsynonymous SNPs. Considering the expression of these genes during fiber development, one was identified as a putative causal gene: root hair defective 3 GTP-binding protein (GhRHD3, GH_D10G0500), which was dominantly expressed during secondary cell-wall bio-synthesis (20 DPA) (Fig. 4B-C). Interestingly, its Hai7124 homolog showed high expression during fiber initiation (0, 1, and 3 DPA) (Fig. 4D). Three nonsynonymous SNPs in GH_D10G0500, D10Gh: 4,228,677/4228733/4229273 (TTG versus GCA 33:58), demonstrated significant associations with BN, SCY, LP, and LI (Fig. 4E-H). The orthologous gene in Arabidopsis thaliana was identified as involved in the regulation of cell expansion [58-60]
Fig. 4

Functional haplotypes in associated loci from the TM-1 × Hai7124 F2 population on D10. A Genetic mapping of a QTL on the D10 chromosome identified as related to BN, SCY, LP, and LI. B Genes with nonsynonymous SNPs in the QTL region. C Transcriptomic expression of QTL-region genes with nonsynonymous SNPs in TM-1 tissues, based on FPKM values. D Transcriptomic expression of GH_D10G0500 in TM-1 and Hai7124 tissues, based on FPKM values. E–H Boxplot of GH_D10G0500 haplotypes. Center line, median; box limits, upper and lower quartiles; whiskers, 1.5 × the interquartile range; dots, outliers (* P < 0.01, ** P < 0.001, two-tailed t-test)

Functional haplotypes in associated loci from the TM-1 × Hai7124 F2 population on D10. A Genetic mapping of a QTL on the D10 chromosome identified as related to BN, SCY, LP, and LI. B Genes with nonsynonymous SNPs in the QTL region. C Transcriptomic expression of QTL-region genes with nonsynonymous SNPs in TM-1 tissues, based on FPKM values. D Transcriptomic expression of GH_D10G0500 in TM-1 and Hai7124 tissues, based on FPKM values. E–H Boxplot of GH_D10G0500 haplotypes. Center line, median; box limits, upper and lower quartiles; whiskers, 1.5 × the interquartile range; dots, outliers (* P < 0.01, ** P < 0.001, two-tailed t-test) Supplementary Fig. 3 illustrates a QTL, located on chromosome D04, which related to SCi, TCi, and Tcond. There were thirteen genes harboring nonsynonymous SNPs in this region. Combining their expression level during fiber development, one was identified as a putative causal gene: glycerol-3-phosphate acyltransferase 6 (GhGPAT6, GH_D04G1426), which was highly expressed in leaves. Within this gene, the nonsynonymous SNP D04Gh:47,064,565 (CC versus AA 65:50) was significant associated with SCi, TCi, and Tcond. As reported, its orthologous gene in tomato was involved in regulating the outer wall diameter of leaf epidermal cells [61].

Discussion

Bin markers are effective for constructing high-density genetic maps and QTL fine mapping in G. hirsutum and G. barbadense

In recent years, scientists have used specific-locus amplified fragment sequencing (SLAF-seq), genotyping by sequencing (GBS), and other sequencing methods to genotype the complex genome of cotton, and the resulting genetic map is based on SNP phasing. This method can identify markers with high throughput; in addition, the chromosome coverage is more uniform and the marker density greatly improved compared with traditional PCR-based markers. With the help of newly developed bioinformatics software, it is possible to complete genotyping and construct genetic maps in a very short time. Compared to the GBS-based enzyme digestion method, a binmap based on resequencing offers the following improvements: scanning of and mutation identification at all sites in the whole genome, without any prior marker information, yielding complete allelic variant information with higher accuracy than previous experimental methods. In this study, we obtained a total of 6303 high-confidence bin markers that not only extends the length of the cotton genetic map but also improve its resolution. In our previous studies, we constructed an SSR-based genetic map using a (TM-1 × Hai7124)F2 population [27] that spanned 3414 loci in 26 linkage groups, covering 3667.62 cM with an average inter-locus distance of 1.08 cM. The present 6303 bin markers expand that map to cover 5057.13 cM while also narrowing the average distance between adjacent markers on to an interval of 0.8 cM. The bin marker length varies from 0.64 to 1.10 Mb, which indicates that the final location of a QTL can be reduced to dozens of candidate genes. Thanks to the binmap algorithm, SNPs within a haplotype can be corrected, decreasing the false positive possibility of a single SNP. The binmap also allows obtaining the fragments of the tested samples on the whole chromosome to exchange recombination information.

Most traits show heterosis in F1 and F2

In most F2 populations, the traits exhibited by individual plants fall between those of their parents, while a few exceed their parents; thus, most traits demonstrate different degrees of over-parental segregation. Such phenotypic trait data exhibits an approximately normal distribution. Each pair of the 35 traits was evaluated for significant negative or positive correlations (Supplementary Fig. 2), and we also evaluated the heterosis of each trait in the F1 and F2 populations. In the F1 population, BN exhibited the highest mid-parent heterosis of all yield traits, at 59.7%; other yield traits ranked LY > SCY > LI > SI > LP, while the mid-parent heterosis of BW was negative. Accordingly, the mid-parent heterosis of BN contributes most to the heterosis of yield. With respect to plant type, morphology, and physiological traits, the mid-parent heterosis of SLA was the highest at 40.49%, followed by TLA at 36.16%, and then other traits in the range of 2.1%-23.81 except for D1, which had a negative value. Regarding fiber quality traits, these ranked in terms of mid-parent heterosis as FS > FL > FE, with the values for MIC and FU being negative (Supplementary Table 3). In the F2 population, both yield and quality traits exhibited negative mid-parent heterosis values. Among plant type, morphology, and physiological traits, most mid-parent heterosis values were positive, ranging from 1.18% to 23.81% except for the values associated with PH1, STLH, D2, TLA, Car, and Chl a/b. (Supplementary Table 3; Table 2).

Non-uniform distribution of QTLs in the A and D subgenomes

In this study, a total of 112 QTLs were detected, of which 71 were in the D subgenome, much more than the 41 in the A subgenome (Table S6). Of QTLs associated with the six plant type traits at seedling stage, more were sited in the A subgenome than in the D subgenome; in contrast, QTLs associated with the other ten leaf morphology and physiological traits at seedling stage, five traits reflecting leaf chlorophyll content, two plant type traits at flower and boll stage, seven yield traits, and five fiber quality traits were all less commonly located in the A subgenome than in the D subgenome. In particular, the D subgenome showed a strong advantage with regard to leaf chlorophyll content, yield traits, and fiber quality traits. This is consistent with previous reports that the D subgenome contributes more to the genetic control of fiber [62-64], and suggests that molecular marker selection in the D subgenome may be more efficient for breeding to improve yield and fiber quality.

QTLs and candidate genes may contribute to the improvement of cotton through breeding

Studies involving cotton QTL mapping and candidate gene identification generally focus on traits related to yield and fiber quality; considerably less research has been conducted concerning seedling traits, leaf physiology, and chlorophyll content. Nonetheless, these traits are also important for cotton growth: plant height and leaf area at the seedling stage determine growth vigor, which in turn affects adversity resistance; meanwhile, leaf physiological and chlorophyll content can enhance photosynthesis efficiency and solar energy utilization, eventually helping adaptation to dense planting and increasing production. Here, the candidate gene GH_D04G1426 demonstrated significant associations with SCi, TCi, and Tcond. Its orthologous gene in tomato has been reported to affect the outer wall diameter of leaf epidermal cells; such functionality may indirectly affect photosynthesis in cotton [61]. In looking beyond direct effects on yield and fiber quality, other QTL and candidate genes in our data may provide additional solutions for cotton molecular breeding.

Conclusions

In conclusion, we constructed a high-density genetic map based on the resequencing data of 249 individuals from an interspecific F2 population (TM-1 and Hai7124). This genetic map consists of 6303 high-confidence bin markers spanning 5057.13 cM across 26 chromosomes. Based on this map, 112 QTLs relating to agronomic and physiological traits from seedling to boll opening stage were identified. Through the analysis of sequence and expression of the candidate genes within the QTLs mapping regions, ten causal putative genes might responsible for the target traits. Of them, GhRHD3 (GH_D10G0500) was associated with fiber yield and GhGPAT6 (GH_D04G1426) might play important role in photosynthesis efficiency. Additional file 1: Supplementary Figure 1. Comparisons of the TM-1 genome with (TM-1 × Hai7124) F2 genetic map. Supplementary Figure 2. Frequency distribution of phenotypic variation of 30 traits and correlation coefficients among the traits in the F2 population. Supplementary Figure 3. Functional haplotypes in associated loci from the TM-1×Hai7124 F2 population on D10. Supplementary Figure 4. Functional haplotypes in associated loci from the TM-1×Hai7124 F2 population on D04. Supplementary Table 1. The distribution characteristics of partial segregation markers. Supplementary Table 2. Shortened form and Unit of measurement of the 35 traits. Supplementary Table 3. Statistical analysis of 35 traits phenotypic differences in TM-1 Hai7124 and F1. Supplementary Table 4. Haplotype and gene ID of the 10 candidate genes in TM-1 and Hai7124. Supplementary Table 5. Expression of the 10 candidate genes in different tissues of TM-1 and Hai7124. Supplementary Table 6. The distribution of QTLs in the At and Dt subgenomes. Additional file 2: Table S7. All markers in geneticmap.
  45 in total

1.  High-throughput genotyping by whole-genome resequencing.

Authors:  Xuehui Huang; Qi Feng; Qian Qian; Qiang Zhao; Lu Wang; Ahong Wang; Jianping Guan; Danlin Fan; Qijun Weng; Tao Huang; Guojun Dong; Tao Sang; Bin Han
Journal:  Genome Res       Date:  2009-05-06       Impact factor: 9.043

2.  LUNAPARK Is an E3 Ligase That Mediates Degradation of ROOT HAIR DEFECTIVE3 to Maintain a Tubular ER Network in Arabidopsis.

Authors:  Jiaqi Sun; Nooshin Movahed; Huanquan Zheng
Journal:  Plant Cell       Date:  2020-07-02       Impact factor: 11.277

3.  Gossypium barbadense and Gossypium hirsutum genomes provide insights into the origin and evolution of allotetraploid cotton.

Authors:  Yan Hu; Jiedan Chen; Lei Fang; Zhiyuan Zhang; Wei Ma; Yongchao Niu; Longzhen Ju; Jieqiong Deng; Ting Zhao; Jinmin Lian; Kobi Baruch; David Fang; Xia Liu; Yong-Ling Ruan; Mehboob-Ur Rahman; Jinlei Han; Kai Wang; Qiong Wang; Huaitong Wu; Gaofu Mei; Yihao Zang; Zegang Han; Chenyu Xu; Weijuan Shen; Duofeng Yang; Zhanfeng Si; Fan Dai; Liangfeng Zou; Fei Huang; Yulin Bai; Yugao Zhang; Avital Brodt; Hilla Ben-Hamo; Xiefei Zhu; Baoliang Zhou; Xueying Guan; Shuijin Zhu; Xiaoya Chen; Tianzhen Zhang
Journal:  Nat Genet       Date:  2019-03-18       Impact factor: 38.330

4.  Mapping quantitative trait loci for lint yield and fiber quality across environments in a Gossypium hirsutum × Gossypium barbadense backcross inbred line population.

Authors:  Jiwen Yu; Ke Zhang; Shuaiyang Li; Shuxun Yu; Honghong Zhai; Man Wu; Xingli Li; Shuli Fan; Meizhen Song; Daigang Yang; Yunhai Li; Jinfa Zhang
Journal:  Theor Appl Genet       Date:  2012-10-12       Impact factor: 5.699

5.  A detailed RFLP map of cotton, Gossypium hirsutum x Gossypium barbadense: chromosome organization and evolution in a disomic polyploid genome.

Authors:  A J Reinisch; J M Dong; C L Brubaker; D M Stelly; J F Wendel; A H Paterson
Journal:  Genetics       Date:  1994-11       Impact factor: 4.562

6.  Quantitative trait loci analysis of Verticillium wilt resistance in interspecific backcross populations of Gossypium hirsutum × Gossypium barbadense.

Authors:  Yuzhen Shi; Baocai Zhang; Aiying Liu; Wentan Li; Junwen Li; Quanwei Lu; Zhen Zhang; Shaoqi Li; Wankui Gong; Haihong Shang; Juwu Gong; Tingting Chen; Qun Ge; Tao Wang; Heqin Zhu; Zhi Liu; Youlu Yuan
Journal:  BMC Genomics       Date:  2016-11-05       Impact factor: 3.969

7.  Genome-wide quantitative trait loci reveal the genetic basis of cotton fibre quality and yield-related traits in a Gossypium hirsutum recombinant inbred line population.

Authors:  Zhen Zhang; Junwen Li; Muhammad Jamshed; Yuzhen Shi; Aiying Liu; Juwu Gong; Shufang Wang; Jianhong Zhang; Fuding Sun; Fei Jia; Qun Ge; Liqiang Fan; Zhibin Zhang; Jingtao Pan; Senmiao Fan; Yanling Wang; Quanwei Lu; Ruixian Liu; Xiaoying Deng; Xianyan Zou; Xiao Jiang; Ping Liu; Pengtao Li; Muhammad Sajid Iqbal; Chuanyun Zhang; Juan Zou; Hong Chen; Qin Tian; Xinhe Jia; Baoqin Wang; Nijiang Ai; Guoli Feng; Yumei Wang; Mei Hong; Shilin Li; Wenming Lian; Bo Wu; Jinping Hua; Chaojun Zhang; Jinyong Huang; Aixia Xu; Haihong Shang; Wankui Gong; Youlu Yuan
Journal:  Plant Biotechnol J       Date:  2019-07-08       Impact factor: 9.803

8.  A high-density genetic map and multiple environmental tests reveal novel quantitative trait loci and candidate genes for fibre quality and yield in cotton.

Authors:  Qishen Gu; Huifeng Ke; Zhengwen Liu; Xing Lv; Zhengwen Sun; Man Zhang; Liting Chen; Jun Yang; Yan Zhang; Liqiang Wu; Zhikun Li; Jinhua Wu; Guoning Wang; Chengsheng Meng; Guiyin Zhang; Xingfen Wang; Zhiying Ma
Journal:  Theor Appl Genet       Date:  2020-09-07       Impact factor: 5.699

9.  Genetic dissection of maize plant architecture with an ultra-high density bin map based on recombinant inbred lines.

Authors:  Zhiqiang Zhou; Chaoshu Zhang; Yu Zhou; Zhuanfang Hao; Zhenhua Wang; Xing Zeng; Hong Di; Mingshun Li; Degui Zhang; Hongjun Yong; Shihuang Zhang; Jianfeng Weng; Xinhai Li
Journal:  BMC Genomics       Date:  2016-03-03       Impact factor: 3.969

10.  The genome sequence of Sea-Island cotton (Gossypium barbadense) provides insights into the allopolyploidization and development of superior spinnable fibres.

Authors:  Daojun Yuan; Zhonghui Tang; Maojun Wang; Wenhui Gao; Lili Tu; Xin Jin; Lingling Chen; Yonghui He; Lin Zhang; Longfu Zhu; Yang Li; Qiqi Liang; Zhongxu Lin; Xiyan Yang; Nian Liu; Shuangxia Jin; Yang Lei; Yuanhao Ding; Guoliang Li; Xiaoan Ruan; Yijun Ruan; Xianlong Zhang
Journal:  Sci Rep       Date:  2015-12-04       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.