A high-density simple sequence repeat and single nucleotide polymorphism genetic map of the tetraploid cotton genome.

John Z Yu, Russell J Kohel, David D Fang, Jaemin Cho, Allen Van Deynze, Mauricio Ulloa, Steven M Hoffman, Alan E Pepper, David M Stelly, Johnie N Jenkins, Sukumar Saha, Siva P Kumpatla, Manali R Shah, William V Hugie, Richard G Percy.   

Abstract

Genetic linkage maps play fundamental roles in understanding genome structure, explaining genome formation events during evolution, and discovering the genetic bases of important traits. A high-density cotton (Gossypium spp.) genetic map was developed using representative sets of simple sequence repeat (SSR) and the first public set of single nucleotide polymorphism (SNP) markers to genotype 186 recombinant inbred lines (RILs) derived from an interspecific cross between Gossypium hirsutum L. (TM-1) and G. barbadense L. (3-79). The genetic map comprised 2072 loci (1825 SSRs and 247 SNPs) and covered 3380 centiMorgan (cM) of the cotton genome (AD) with an average marker interval of 1.63 cM. The allotetraploid cotton genome produced equivalent recombination frequencies in its two subgenomes (At and Dt). Of the 2072 loci, 1138 (54.9%) were mapped to 13 At-subgenome chromosomes, covering 1726.8 cM (51.1%), and 934 (45.1%) mapped to 13 Dt-subgenome chromosomes, covering 1653.1 cM (48.9%). The genetically smallest homeologous chromosome pair was Chr. 04 (A04) and 22 (D04), and the largest was Chr. 05 (A05) and 19 (D05). Duplicate loci between and within homeologous chromosomes were identified that facilitate investigations of chromosome translocations. The map augments evidence of reciprocal rearrangement between ancestral forms of Chr. 02 and 03 versus segmental homeologs 14 and 17, as centromeric regions show homeology between Chr. 02 (A02) and 17 (D02), as well as between Chr. 03 (A03) and 14 (D03). This research represents an important foundation for studies on polyploid cottons, including germplasm characterization, gene discovery, and genome sequence assembly.

Keywords:  cotton (Gossypium spp.) genomes; genetic linkage map; recombinant inbred line (RIL) population; simple sequence repeat (SSR); single nucleotide polymorphism (SNP)

Year:  2012        PMID: 22384381      PMCID: PMC3276184          DOI: 10.1534/g3.111.001552

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Cotton belongs to the Gossypium genus, which consists of approximately 45 diploid and 5 allotetraploid species of global distribution (Beasley 1942; Endrizzi ; Kohel ; Stewart 1994; Wendel and Cronn 2003). The gametic chromosome number of all diploid species is 13, but significant differences among the genomes in meiotic affinity and relative size led to the recognition of eight genome groups: A through G and K (Beasley 1942; Endrizzi et al. 1985; Stewart 1994). Of the approximately 50 Gossypium species, four have been domesticated independently: two diploid species, G. arboreum L. and G. herbaceum L. (n = x = 13) with A1 and A2 genomes, and two allotetraploid species, G. hirsutum L. and G. barbadense L. (n = 2x = 26) with (AD)1 and (AD)2 genomes (Bowers ; Lee 1984; Percival and Kohel 1990). The allotetraploid cotton species are the products of a presumed single polyploidization event between ancient A-genome and D-genome diploids that occurred approximately 1-2 million years ago (Stelly ; Wendel and Cronn 2003). Chromosome numbers assigned in allotetraploid cottons are based on pairing relationships in diploid x tetraploid crosses, with chromosomes 1−13 corresponding to the At subgenome and chromosomes 14−26 to the Dt subgenome (Brown 1980). Cotton species serve as a model system for polyploid plants and plant cell elongation, cell wall and cellulose biosynthesis because they are the only known plants that produce single-celled fibers (Jiang ; Kim and Triplett 2001). The genes that make cotton valuable function in unique ways, requiring long-term research into the development of molecular tools such as DNA markers and genome maps to translate genomic information into agronomic benefits and to other biological systems. 
Cotton researchers have explored genetic mapping with multiple types of DNA markers, including restriction fragment-length polymorphism (RFLP) (Reinisch ; Rong ; Shappley ), amplified fragment-length polymorphism (Lacape ), random-amplified polymorphic DNA (Kohel ), and simple sequence repeats (SSRs) (Guo ; Lacape ; Park ; Xiao ; Yu ). Although early genetic mapping with hybridization-based markers such as RFLP opened the door to important genomic studies (Jiang ; Shappley ), recent genetic mapping with polymerase chain reaction (PCR)-based markers such as SSRs has facilitated portable applications among different mapping populations and research programs (Abdurakhmonov ; Zhang ). As such, the cotton research community has made efforts to develop many portable markers to overcome the problem of low DNA polymorphism rates among various cultivated cotton breeding programs (http://www.cottonmarker.org/; Blenda ). To date, approximately 17,000 pairs of SSR primers have been developed from four cotton species (G. arboreum, G. barbadense, G. hirsutum, and G. raimondii Ulbrich), and a portion of these has been surveyed for polymorphism against a 12-genotype panel of six Gossypium species (Blenda ; Yu 2004). As single nucleotide polymorphism (SNP) markers are explored in other plant species (Ganal ), new research has been initiated to examine nucleotide sequence diversity in Gossypium genomes (An ; Van Deynze ). These findings are laying the groundwork for developing large numbers of SNP markers in cotton. The growing collection of portable markers in cotton provides a cost-effective tool for genome mapping and gene discovery to understand and improve the cotton plant. High-resolution mapping in cotton has been conducted with segregating populations that were derived from interspecific crosses between Gossypium species because of limited DNA polymorphism within a cotton species. 
The resulting segregating populations used in major mapping projects often were either F2 or BC1 progeny (Guo ; Lacape ; Rong ; Yu ). In addition, these maps relied heavily on a single marker type such as RFLP or SSR markers derived from limited sources. Rong reported the first high-density map in cotton using 57 F2 plants derived from an interspecific cross between G. hirsutum race “palmeri” and G. barbadense acc. “K101.” The majority of markers used in this map were RFLP markers. This map provided one of the first insights into the allotetraploid cotton genome structure and evolution, although the RFLP markers have proven to have limited portability and utility for marker-assisted breeding (Ulloa ). Guo reported the first comprehensive SSR map by using 138 BC1 plants derived from an interspecific cross of G. hirsutum TM-1/G. barbadense Hai 7124//G. hirsutum TM-1. The majority of SSR markers in this map were derived from cotton expressed sequence tag (EST) sequences. Lacape reported a genetic linkage map that consisted of a total of approximately 800 (amplified fragment-length polymorphism, RFLP, and SSR) marker loci by using 140 recombinant inbred lines (RILs) derived from an interspecific cross between G. hirsutum Guazuncho 2 and G. barbadense VH8-4602. Recently, Yu constructed a map by using 141 BC1 plants derived from an interspecific cross of G. hirsutum Emian 22/G. barbadense 3-79//G. hirsutum Emian 22. As with Guo , this map also contained SSR markers, the majority of which were derived from ESTs. In addition, a whole-genome radiation hybrid population of 93 plants derived from an interspecific cross of G. barbadense 3-79/G. hirsutum TM-1 was also explored for mapping the cotton genome (Gao ). Here we report the development of a high-density cotton (Gossypium spp.) genetic map by using representative sets of SSR markers and the first public set of SNP markers to genotype 186 RILs derived from an interspecific cross between G. hirsutum TM-1 and G. barbadense 3-79. 
Both TM-1 and 3-79 are considered genetic standards for their respective species because of their breeding and their history of genetic/genomic research conducted by the cotton community. These two lines are highly homozygous, and extensive genetic and cytogenetic materials have been developed using them as reference parents, including mutants and hypoaneuploids (Kohel ; Stelly 1993; Stelly ). RILs possess several advantages over F2 or BC1 populations for mapping genes and quantitative trait loci (QTL), and high levels of homozygosity and recombination in the RILs enable replicate studies across different environments by different research groups. This immortal TM-1 × 3-79 RIL population is maintained at USDA-ARS, College Station, Texas, USA, and it is used by the cotton research community for genetic investigations, including QTL mapping studies. In addition, we selected SSR markers derived from different sequence sources (EST, genomic, and BAC clones). These markers were developed by 16 research groups, and all 16 sources are available to the public at the Cotton Marker Database (http://www.cottonmarker.org/). This combined high-density genetic map will facilitate the advancement of many basic and applied genomic studies in cotton.

Materials and Methods

Plant materials and DNA extraction

The mapping population was an immortalized set of 186 RILs. At the time of genomic DNA extraction for this study, the average generation was F7. These lines were derived by selfing, via single-seed descent, of original individual F2 plants from a cross between G. hirsutum TM-1 and G. barbadense 3-79, two highly homozygous parents (Kohel ; Niles and Feaster 1984). Factors in selecting TM-1 and 3-79 as parents in creation of a segregating population for genetic mapping are the unique high-quality fiber characteristics of extra long staple cotton 3-79 and the high productivity and modest environmental sensitivity of Upland cotton TM-1 (Kohel ). The parents (TM-1 and 3-79) and their 186 RIL progeny are maintained as living specimens to produce seed, fiber, and leaf tissue for this mapping effort and other genetic studies. Interspecific F1 hypoaneuploid hybrids for specific chromosomes were used for deficiency mapping by means of loss of heterozygosity. All but one were derived previously by pollinating monosomic and monotelodisomic aneuploids quasi-isogenic to TM-1 with pollen from euploid 3-79, and recovering the respective deficiency among F1 progeny. The F1 aneuploid monosomic for chromosome 26 was unusual in that the deficiency arose de novo in 3-79 pollen, i.e. not via transmission from the maternal TM-1−like stock. The general procedures for mapping with cotton monosomic (2n = 51) and monotelodisomic stocks have been described previously (Beasley 1942; Stelly 1993; Stelly ). Genomic DNA was extracted from fresh young leaf tissue of individual cotton plants grown in the greenhouse in accordance with the modified CTAB DNA extraction procedure as described by Kohel .

PCR primers and assays

The primer pairs used for PCR were developed by collaborators of the cotton research community (Table 1). Approximately 10,000 pairs of SSR primers from 16 different research projects (http://www.cottonmarker.org/) were first analyzed to identify polymorphic markers between TM-1 and 3-79. Nine genomic DNA sources for SSR primer pairs included BNL, CIR, CM, DOW, DPL, GH, JESPR, MUSB, and TMB, and seven EST sources of SSR primer pairs included HAU, MGHES, MUCS, MUSS, NAU, STV, and UCD. While EST SSR primer pairs were developed from Gossypium cDNA clones that contain SSR, genomic primer pairs were developed from Gossypium random enriched small insert libraries except MUSB and TMB. MUSB was developed from the end sequences of the bacterial artificial chromosome (BAC) clones of G. hirsutum acc. Acala Maxxa (Frelichowski Jr. et al. 2006). TMB was developed from the BAC clones and/or physical contigs of TM-1 (Guo ). MUSB and TMB markers facilitate an integration of genetic and physical maps of the allotetraploid cotton genome (Xu ). The first public SNP set (UC) also was included in this mapping project (Van Deynze ). SNP primer pairs were largely derived from G. arboreum EST unigenes. The actual sequence of the individual primer pairs and source clone for each SSR or SNP marker set can be found at http://www.cottonmarker.org/.
Table 1 

Primer sources of cotton molecular markers (http://www.cottonmarker.org/)

Marker set      No. Mapped Marker Loci   No. Mapped Primer Pairs
Genomic SSRs
 BNL            304                      239
 CIR            123                      104
 CM             32                       24
 DOW            60                       60
 DPL            213                      200
 GH             149                      144
 JESPR          122                      89
 MUSB           155                      123
 TMB            310                      266
Subtotal        1468                     1249
EST SSRs
 HAU            12                       8
 MGHES          20                       14
 MUCS           63                       54
 MUSS           112                      93
 NAU            113                      90
 STV            9                        7
 UCD            28                       17
Subtotal        357                      283
SNPs
 UC             247                      247
Total           2072                     1779

EST, expressed sequence tag; SNP, single nucleotide polymorphism; SSR, simple sequence repeat.

PCR assays for amplifying SSR markers were performed in a cocktail of 10 μL containing 20 ng of DNA, 0.25 μM forward primer, 0.25 μM reverse primer, 0.25 mM dNTPs, 2.5 mM MgCl2, and 0.65 unit Taq DNA polymerase. Thirty-five PCR cycles were used to amplify SSR products, using a primer annealing temperature of 55° or 60°. For nonlabeled SSR primers, amplified DNA products were electrophoresed in a 20-cm-long horizontal agarose gel system (Owl Separation Systems, Portsmouth, NH) with 1× TBE (45 mM tris-borate, 1 mM EDTA, pH 8) running buffer and 3.5% Hi-Resolution agarose (e.g. Metaphor agarose, Cambrex, East Rutherford, NJ; or SFR agarose, Amresco, Solon, OH). PCR product sizes were estimated by comparison with DNA size standard ladders (E and K Scientific, Santa Clara, CA). For fluorescently labeled primers (forward primer only with 6-FAM, HEX, or NED), amplified DNA products were separated using 36-cm or 50-cm capillary electrophoresis on an automated ABI PRISM 3130xl or ABI PRISM 3730 Genetic Analyzer (Applied Biosystems/Life Technologies, Foster City, CA). In a separate project, an array for the Illumina (San Diego, CA) GoldenGate assay was designed to analyze 384 SNP markers between TM-1 and 3-79 (Van Deynze ). Polymorphic SNP markers based on the parental survey were used to genotype the 186 RILs.

Marker data acquisition and linkage map construction

SSR data collection was performed either manually for gel-based assays or with GeneMapper 3.7 software. Among nearly 10,000 pairs of primers that were surveyed, more than 2000 primer pairs that detected the best resolution of polymorphisms between TM-1 and 3-79 were selected to genotype the 186 RILs. These primer pairs included subsets (54 MUCS, 123 MUSB, and 93 MUSS) that were previously used to genotype the same population (Park ; Frelichowski Jr. et al. 2006), and the genotyping data were incorporated into this mapping project. Genotyping of the RIL population for SSRs and SNPs was performed as previously described (Park ; Frelichowski Jr. et al. 2006; Van Deynze ). SSR markers were generally codominant, but the calling or scoring of the tetraploid cotton alleles at a specific locus required careful examination of gel images or electropherograms. Allotetraploid cottons likely had multiple copies of DNA fragments or alleles amplified with a single primer pair. To distinguish dominant markers from codominant markers, the parental polymorphic fragments were paired as candidate alleles; any RIL missing both fragments of a pair indicated that the fragments were nonallelic, and after all pairing attempts had failed they were scored as two dominant marker loci. A data point for a RIL was scored as missing when no signal was detected, which was attributed to failed PCR amplification. Duplicate marker loci were designated by adding a lower-case letter in alphabetical order after the primer name. The raw scores were first inspected for any coding error and segregation distortion before using the data as input for the JoinMap 4.0 program (Van Ooijen 2006) for mapping analysis. Using JoinMap’s “identify identical loci” function, we identified 47 identical or cosegregating loci (supporting information, Table S2) and removed them in subsequent mapping. 
The Kosambi mapping function (Kosambi 1944) was selected to convert a recombination frequency to a genetic distance (centiMorgan, or cM), and 40 cM was the threshold to determine linkage between two markers. Linkage groups and marker orders were determined on the basis of likelihood ratio statistic (or LOD) 10 or greater (up to LOD 15). Chromosome assignment was determined by the common markers that were located by authors in previous publications (Frelichowski Jr. et al. 2006; Guo , 2008; Lacape ; Liu ; Park ; Yu ) and by use of the subsets of new SSR markers (GH, Table 3) with the cotton hypoaneuploid stocks described previously. SSR loci localized to one of the chromosomes (Chr.) 1 to 13 were assigned to the A-subgenome (At), whereas loci localized to Chr. 14 to 26 were assigned to the D-subgenome (Dt).
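The Kosambi conversion mentioned above can be illustrated with a minimal Python sketch. It uses the standard closed form of the Kosambi function, d = 25 ln[(1 + 2r)/(1 − 2r)] cM, and its inverse; the function names are hypothetical and are not part of JoinMap, which performs this conversion internally.

```python
import math


def kosambi_cm(r: float) -> float:
    """Kosambi map distance in cM for a recombination fraction r (0 <= r < 0.5)."""
    if not 0.0 <= r < 0.5:
        raise ValueError("recombination fraction must be in [0, 0.5)")
    return 25.0 * math.log((1.0 + 2.0 * r) / (1.0 - 2.0 * r))


def kosambi_r(d_cm: float) -> float:
    """Inverse Kosambi function: recombination fraction for a distance in cM."""
    if d_cm < 0.0:
        raise ValueError("map distance must be non-negative")
    # d is converted from cM to Morgans before applying r = 0.5 * tanh(2d).
    return 0.5 * math.tanh(2.0 * d_cm / 100.0)
```

Under this function, the 40 cM linkage threshold corresponds to a recombination fraction of about 0.33, which can be checked with `kosambi_r(40.0)`.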
Table 3 

Assignment of 37 GH SSR markers to specific allotetraploid cotton chromosomes

Marker Name   TM-1 Allele, bp   3-79 Allele, bp   Hypoaneuploid   Mapped Chromosome
GH002         75                65                H16             Chr.16(D07)
GH027         70                80                H09             Chr.09(A09)
GH034         130               120               H07             Chr.13(A13)
GH039         125               120               H06             Chr.06(A06)
GH048         90                98                H20             Chr.20(D10)
GH055         175               170               Te18sh          Chr.18(D13)
GH082         175               155               H06             Chr.06(A06)
GH098         130               145               H09             Chr.09(A09)
GH110         105               80                H20             Chr.20(D10)
GH119         150               165               H20             Chr.20(D10)
GH295         95                75                H16             Chr.16(D07)
GH312         110               102               H12             Chr.12(A12)
GH330         105               115               Te22Lo          Chr.22(D04)
GH336         98                86                H01             Chr.01(A01)
GH345         115               103               H16             Chr.16(D07)
GH422         116               126               Te5Lo           Chr.05(A05)
GH428         195               170               H20             Chr.20(D10)
GH433         168               150               H06             Chr.06(A06)
GH441         175               150               H06             Chr.06(A06)
GH443         150               120               H18             Chr.18(D13)
GH462         170               152               Te14Lo          Chr.14(D03)
GH463         150               165               H12             Chr.12(A12)
GH478         90                100               H25             Chr.25(D06)
GH484         140               145               H09             Chr.09(A09)
GH486         155               130               H09             Chr.09(A09)
GH495         80                72                H09             Chr.09(A09)
GH499         148               144               H09             Chr.09(A09)
GH501         200               202               H18             Chr.18(D13)
GH506         134               160               H07             Chr.07(A07)
GH511         135               130               H20             Chr.20(D10)
GH526         100               200               Te22Lo          Chr.19(D05)
GH537         175               170               H25             Chr.25(D06)
GH548         120               140               H07             Chr.07(A07)
GH584         140               120               H09             Chr.09(A09)
GH603         154               158               H26             Chr.26(D12)
GH629         128               132               H26             Chr.26(D12)
GH684         102               90                H16             Chr.16(D07)

SSR, simple sequence repeat.

Results

Parental polymorphisms and genotype frequencies of the mapping population

Approximately 25% of the genomic SSR markers and approximately 15% of the cDNA SSR markers were polymorphic between TM-1 and 3-79. A total of 1601 pairs of polymorphic SSR primers were selected and analyzed for genotyping 186 RILs. Of the 1601 SSR primer pairs that revealed 1895 marker loci, 1344 primer pairs revealed one locus, 234 revealed two loci, and the remaining 23 revealed more than two loci. Among the 1895 SSR marker loci, 1785 were codominant; 43 were dominant loci that received alleles from TM-1, and 67 were dominant loci that received alleles from 3-79. Of these 1895 marker loci, 1825 were mapped (Table 1). The remaining 70 loci were not mapped because of highly skewed segregation (χ2 > 8.5) and high levels of missing data. Fifty-five of the unmapped loci were dominant loci. In addition, 247 of the 384 SNP primer pairs were polymorphic between parents and used to genotype the 186 RILs. All 247 SNP markers were codominant and revealed 247 loci (Table S1). Of these, 207 SNP loci were mapped in unique positions, and the remaining 40 SNP loci were identical to other mapped loci (Table S2). In summary, a total of 1848 pairs of SSR and SNP primers were used to genotype the 186 RILs, and 2142 marker loci were scored, of which 2072 marker loci revealed by 1532 pairs of SSR and 247 pairs of SNP primers were mapped (Table 1). Approximately 98% of 2072 total marker loci were mapped in unique positions, with only 47 identical or cosegregating markers including 40 SNP markers (Table S2). This RIL population displayed a greater-than-expected level of residual heterozygosity, i.e. 4.2% instead of the expected 1.6% for an F7 population derived by single-seed descent. Residual heterozygosity in individual lines ranged from 0.8% to 19.9%. 
Among the 2032 codominant SSR and SNP loci, the average residual heterozygosity for individual markers was 4.2%, ranging from 0% to 66.7%, with SSR marker STV129 demonstrating the greatest heterozygosity. Markers that detected more than 20% residual heterozygosity in the RIL population were usually difficult to map because they showed conflicting linkage with more than one marker. Analysis of the genotyping data revealed a statistically significant preference for TM-1 alleles over 3-79 alleles (χ2 = 768; Figure 1). Overall, the allele frequencies of TM-1 and 3-79 were 52.3% and 47.7%, respectively.
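The expected 1.6% residual heterozygosity quoted for an F7 population follows from heterozygosity halving with each generation of selfing. A minimal sketch of that arithmetic is given below; it assumes strict selfing with no selection, starting from a fully heterozygous F1, and the function name is hypothetical.

```python
def expected_heterozygosity(generation: int) -> float:
    """Expected heterozygous fraction per locus in generation F_n under strict selfing.

    Assumes the F1 is heterozygous at the locus and that each round of selfing
    halves heterozygosity, so F1 = 1.0, F2 = 0.5, F3 = 0.25, and so on.
    """
    if generation < 1:
        raise ValueError("generation must be 1 (F1) or greater")
    return 0.5 ** (generation - 1)


# Expected value for the F7 generation described in the text, as a percentage.
f7_percent = 100.0 * expected_heterozygosity(7)
```

Here `f7_percent` evaluates to 1.5625, which rounds to the 1.6% stated above; the observed 4.2% is therefore roughly 2.7 times the expectation.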
Figure 1 

Distribution of the TM-1 and 3-79 allele frequencies in the RIL mapping population (χ2 = 768 and P < 0.0001).

Genetic linkage maps of the allotetraploid cotton

The genetic linkage map comprises 2072 SSR and SNP loci mapped to the 26 linkage groups, corresponding to 26 chromosomes of allotetraploid cotton, for a total map distance of 3380 cM (Table 2 and Figure 2). The average marker interval in this map is 1.63 cM. Forty-seven pairs of marker loci were found to be either identical or cosegregated (Table S2), and therefore only one locus from each pair is shown on the map. For example, BNL3545b is identical to or cosegregated with BNL3545a, so only BNL3545a is shown on Chr. 14 (D03).
Table 2 

Distribution of 2072 SSR and SNP marker loci among the 26 allotetraploid cotton chromosomes

Chromosome     No. Marker Loci   Recombinational Size, cM   Average Marker Interval, cM   No. Gaps >10 cM (Largest)
A-subgenome
Chr.01(A01)    66                144.4                      2.19                          2 (14.46)
Chr.02(A02)    60                118.4                      1.97                          1 (13.46)
Chr.03(A03)    87                116.4                      1.34                          2 (13.09)
Chr.04(A04)    56                101.6                      1.81                          2 (15.88)
Chr.05(A05)    139               199.2                      1.43                          1 (10.03)
Chr.06(A06)    89                131.0                      1.47                          0 (8.35)
Chr.07(A07)    87                128.9                      1.48                          1 (10.23)
Chr.08(A08)    92                140.0                      1.52                          2 (16.52)
Chr.09(A09)    99                139.4                      1.41                          0 (9.00)
Chr.10(A10)    75                109.0                      1.45                          0 (6.55)
Chr.11(A11)    140               166.5                      1.19                          0 (9.98)
Chr.12(A12)    84                122.8                      1.46                          0 (8.79)
Chr.13(A13)    64                109.3                      1.71                          0 (7.17)
Subtotal-At    1138              1726.8                     1.52                          11 (16.52)
D-subgenome
Chr.15(D01)    93                118.0                      1.27                          1 (10.05)
Chr.17(D02)    42                114.6                      2.73                          2 (22.01)
Chr.14(D03)    79                126.4                      1.60                          1 (15.92)
Chr.22(D04)    45                77.9                       1.73                          1 (14.09)
Chr.19(D05)    132               227.2                      1.72                          1 (15.78)
Chr.25(D06)    70                126.9                      1.81                          0 (9.249)
Chr.16(D07)    58                124.4                      2.15                          2 (17.02)
Chr.24(D08)    62                118.8                      1.92                          1 (10.47)
Chr.23(D09)    83                146.0                      1.76                          1 (11.07)
Chr.20(D10)    76                119.0                      1.57                          0 (5.03)
Chr.21(D11)    80                136.8                      1.71                          0 (9.27)
Chr.26(D12)    53                112.3                      2.12                          0 (9.23)
Chr.18(D13)    61                104.8                      1.72                          0 (7.23)
Subtotal-Dt    934               1653.1                     1.77                          10 (22.01)
Total          2072              3380                       1.63                          21 (22.01)

SNP, single nucleotide polymorphism; SSR, simple sequence repeat.

Figure 2 

Genetic linkage maps of 26 allotetraploid cotton chromosomes that are presented in 13 At and Dt subgenome homeologous pairs (in parentheses). The names of DNA markers are shown on the right, and the positions of the markers are shown in Kosambi centiMorgan (cM) on the left. A line bar connects duplicate marker loci between a pair of homeologous chromosomes. Marker loci in bold are assigned to cotton chromosomes by previously published studies (Frelichowski Jr. et al. 2006; Guo , 2008; Lacape ; Liu ; Park ; Yu ) and marker loci in italic bold are assigned to cotton chromosomes in this study (Table 3). Homeologous marker linkage relationships are indicative of reciprocal rearrangement between ancestral forms of Chr. 02 and 03 and/or 14 and 17 relative to each other; they also indicate that centromeric regions are homeologous between Chr. 02 (A02) and Chr. 17 (D02), as well as between Chr. 03 (A03) and Chr. 14 (D03). Intrachromosomal duplications were noted in Chr. 5, 11, and 21, the latter two in homeologous segments.

The At subgenome consisted of 1138 marker loci (927 SSR and 211 SNP), and the total genetic distance was 1726.8 cM with an average marker interval of 1.52 cM. The largest chromosome in terms of recombination frequency was Chr. 05 (A05), which spans 199.2 cM with 139 marker loci. The second largest was Chr. 11 (A11), which spans 166.5 cM with 140 loci. The shortest was Chr. 04 (A04), which spans 101.6 cM with 56 loci (Table 2 and Figure 2). In the At subgenome, there were 11 gaps greater than 10 cM, and the largest gap between two loci was 16.52 cM on Chr. 08 (A08). The Dt subgenome consisted of 934 marker loci (898 SSR and 36 SNP), and the total genetic distance was 1653.1 cM, with an average marker interval of 1.77 cM. The largest chromosome with respect to recombination frequency was Chr. 19 (D05), which spans 227.2 cM with 132 loci, and the shortest chromosome was Chr. 22 (D04), which spans 77.9 cM with 45 loci (Table 2 and Figure 2). There were 10 gaps greater than 10 cM, and the largest gap between two loci was 22.01 cM on Chr. 17 (D02). Although SNP marker loci were largely mapped in the At subgenome because of the A-genome origin of SNP primers (Van Deynze ), the At subgenome and Dt subgenome had very similar numbers of SSR marker loci and total genetic distances. Furthermore, there were similar amounts of recombination between each of 13 pairs of cotton homeologous chromosomes.
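The subgenome summary figures can be recomputed from the Table 2 subtotals. The sketch below is only an arithmetic check; it assumes the average marker interval is the map length divided by the number of loci, which reproduces the reported values, and the names used are hypothetical.

```python
# Subtotals taken from Table 2: (number of marker loci, map length in cM).
SUBGENOMES = {"At": (1138, 1726.8), "Dt": (934, 1653.1)}


def summarize(subgenomes):
    """Return interval and percentage summaries per subgenome and for the total."""
    total_loci = sum(loci for loci, _ in subgenomes.values())
    total_cm = sum(cm for _, cm in subgenomes.values())
    summary = {}
    for name, (loci, cm) in subgenomes.items():
        summary[name] = {
            "interval_cm": cm / loci,                 # average marker interval
            "loci_pct": 100.0 * loci / total_loci,    # share of mapped loci
            "length_pct": 100.0 * cm / total_cm,      # share of map length
        }
    summary["Total"] = {
        "interval_cm": total_cm / total_loci,
        "loci_pct": 100.0,
        "length_pct": 100.0,
    }
    return summary


summary = summarize(SUBGENOMES)
```

Rounded to two decimals, the intervals come out at 1.52 cM (At), 1.77 cM (Dt), and 1.63 cM overall, and the At share is 54.9% of loci and 51.1% of map length, matching the values reported in the text.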

Complete assignment of linkage groups to cotton chromosomes

A complete set of 26 cotton chromosomes (13 At subgenome and 13 Dt subgenome) were identified that correspond to 26 respective linkage groups (Figure 2). Assignment of SSR markers and linkage groups to the cotton chromosomes was achieved in part by comparison of the common markers (bold font in Figure 2) with the previous SSR mapping reports (Frelichowski Jr. et al. 2006; Guo ; Lacape ; Park ; Yu ) and with the three aneuploid studies for TMB markers (Guo ) and BNL markers (Gutiérrez ; Liu ), respectively. In addition, hypoaneuploid cottons were also analyzed to identify TM-1 deficiency with 37 newly developed GH markers (bold italic in Figure 2) from G. hirsutum and other SSR markers of interest in the mapping study (Table 3 and Figure 3) (Hoffman ). Although most SSR markers generally agreed with published reports, a few incongruities, such as GH034 and GH526, between various data types were encountered when cotton hypoaneuploid stocks were used along with individual mapping populations. Additional mapping analyses in the present research confirmed or reassigned such SSR markers to the corresponding cotton chromosomes (Table 3).
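The logic of deficiency mapping can be sketched in a few lines of Python. An F1 hypoaneuploid lacks a TM-1 chromosome or chromosome arm, so a marker located there shows only the 3-79 allele. The function name, the input format, and the rule of reading the chromosome number from the stock name (e.g. H09, Te18sh) are assumptions made for illustration; they are not the authors' scoring procedure.

```python
import re
from typing import Dict, List


def candidate_chromosomes(tm1_allele_present: Dict[str, bool]) -> List[int]:
    """Return chromosome numbers of the stocks in which the TM-1 allele is absent.

    Keys are hypoaneuploid stock names such as "H09" or "Te18sh"; values record
    whether the TM-1 fragment was amplified from that F1 template.
    """
    hits = []
    for stock, present in tm1_allele_present.items():
        if not present:
            match = re.search(r"\d+", stock)
            if match is None:
                raise ValueError(f"no chromosome number in stock name {stock!r}")
            hits.append(int(match.group()))
    return sorted(hits)


# Illustrative panel for one marker; only the H09 result mirrors Figure 3 (GH584),
# the other entries are hypothetical.
panel = {"H06": True, "H09": False, "H16": True, "Te18sh": True}
candidates = candidate_chromosomes(panel)
```

A result from such a screen is only a candidate location. As noted above for GH034 and GH526, the deficiency result and the linkage map can disagree, in which case the linkage analysis was used to confirm or reassign the marker.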
Figure 3 

Deletion analysis of cotton SSR markers. GH584 amplified (from L to R) cotton hemizygous F1 hypoaneuploids as well as homozygous TM-1 and 3-79. TM-1 allele (140 bp) was missing in both lanes (see arrow) with the H09 template, suggesting the location of GH584 locus on chromosome 09 (A09).

Genomic duplication and chromosomal translocation of allotetraploid cottons

Among 1601 SSR primer pairs that amplified 1895 loci in TM-1 and 3-79, 257 SSR primer pairs amplified two or more loci, resulting in a total of 551 duplicate loci. Excluding dominant loci amplified by these SSRs, there were 494 codominant loci that were duplicated, resulting in 247 pairs (Table S3). Most of the duplicate loci were mapped on the homeologous chromosome pairs (Table 4 and Figure 2). The relative orders of most duplicate loci on the homeologous chromosomes were similar (Figure 2). The duplicate loci identified by these SSR markers demonstrated the complex but linear features of the allotetraploid cotton genomes. A few duplicate loci also were present between nonhomeologous chromosomes and/or within the same subgenome, which indicated likely genome rearrangements (Table 4 and Table S3). For example, an intrasubgenome duplication was revealed by the marker BNL1044 between Chr. 04 (A04) and Chr. 05 (A05). Distinct intrachromosome duplications were indicated by one SSR duplication in Chr. 11 (A11) and three SSRs in Chr. 21 (D11) (Figure 2). In chromosome 11 (A11), TMB0426 revealed two loci that were mapped 8.1 cM apart. In chromosome 21 (D11), three markers (i.e. CM0160, JESPR211, and JESPR244) each revealed two loci. In the latter, the recombination rates remained similar (~8-9 cM) but the relative orders among duplicated loci were altered.
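The counts above follow from the primer-pair tallies in the Results: 1601 − 1344 = 257 primer pairs amplified two or more loci, and 1895 − 1344 = 551 loci came from those pairs. A small sketch of how duplicate loci could be grouped by primer pair is shown below. It relies on the naming convention described in the Methods (a lower-case letter appended to the primer name) and assumes that no primer name itself ends in a lower-case letter; the function names are hypothetical.

```python
import re
from collections import defaultdict
from typing import Dict, Iterable, List


def group_loci_by_primer(locus_names: Iterable[str]) -> Dict[str, List[str]]:
    """Group locus names by primer pair, stripping one trailing lower-case letter."""
    groups = defaultdict(list)
    for name in locus_names:
        primer = re.sub(r"[a-z]$", "", name)
        groups[primer].append(name)
    return dict(groups)


def duplicated_primers(locus_names: Iterable[str]) -> Dict[str, List[str]]:
    """Keep only the primer pairs that revealed two or more loci."""
    grouped = group_loci_by_primer(locus_names)
    return {primer: loci for primer, loci in grouped.items() if len(loci) >= 2}


# Locus names taken from the text; the list is a small illustrative subset.
loci = ["BNL3545a", "BNL3545b", "BNL1044a", "BNL1044c", "GH252a", "GH252b", "GH584"]
duplicates = duplicated_primers(loci)
```

In this subset, `duplicates` contains BNL3545, BNL1044, and GH252, each with two loci, while the single-locus marker GH584 is dropped.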
Table 4 

Pairs of duplicate marker loci between homeologous and nonhomeologous chromosomes in cotton

Homeologous Chromosomes    No. Pairs of Duplicate Loci   Nonhomeologous Chromosomes   No. Pairs of Duplicate Loci
Chr.01(A01)-Chr.15(D01)    19                            Chr.02(A02)-Chr.14(D03)      5
Chr.02(A02)-Chr.17(D02)    6                             Chr.03(A03)-Chr.17(D02)      5
Chr.03(A03)-Chr.14(D03)    13                            Chr.02(A02)-Chr.03(A03)      1
Chr.04(A04)-Chr.22(D04)    15                            Chr.04(A04)-Chr.05(A05)      1
Chr.05(A05)-Chr.19(D05)    27                            Chr.05(A05)-Chr.22(D04)      1
Chr.06(A06)-Chr.25(D06)    12
Chr.07(A07)-Chr.16(D07)    18
Chr.08(A08)-Chr.24(D08)    9
Chr.09(A09)-Chr.23(D09)    27
Chr.10(A10)-Chr.20(D10)    10
Chr.11(A11)-Chr.21(D11)    24
Chr.12(A12)-Chr.26(D12)    13
Chr.13(A13)-Chr.18(D13)    11
Totals                     204                                                        13
A postpolyploidization reciprocal translocation of chromosomes 02 (A02) and 03 (A03) was suggested by 10 pairs of duplicate loci (Figure 2 and Table 4). Five pairs of duplicate loci were identified between chromosomes 02 (A02) and 14 (D03) and 5 pairs between chromosomes 03 (A03) and 17 (D02). The marker TMB1025 revealed duplicate loci between chromosomes 02 (A02) and 03 (A03), which suggested a possible breakpoint for the reciprocal translocation in these two At subgenome chromosomes. Additional mapping data in the vicinity of TMB1025 will be necessary to confirm this conclusion. Another translocation between At subgenome chromosomes 04 (A04) and 05 (A05), as previously suggested by Guo , was supported by the BNL1044 loci (BNL1044a at 33.6 cM on A05 and BNL1044c at 48.1 cM on A04) (Figure 2 and Table S3). Furthermore, the GH252 loci indicated a translocation between nonhomeologous chromosomes 05 (A05), with GH252a at 136.4 cM, and 22 (D04), with GH252b at 18.3 cM.

Discussion

The high-density genetic linkage map created in this research is composed of 2072 SSR and SNP loci representing many individual groups of the cotton research community, and it provides a transferable platform that is essential for a broad spectrum of basic and applied studies aimed at understanding and manipulating complex cotton genomes. Among the 17 sets of SSR and SNP marker loci, BAC-derived SSRs (310 TMB and 155 MUSB; Table S1) facilitate an integration of genetic and physical maps of the cotton chromosomes (Frelichowski Jr. et al. 2006; Xu ). Markers linked to novel genes can be used to screen cotton BAC clones or physical contigs from which the SSR markers were developed (Yin ). The 357 EST-derived SSR markers mapped herein offer an opportunity to study functional genes and gene islands for fiber development and other important traits of interest. In addition, the genetic mapping of the 247 SNP markers is the first major public effort to use nucleotide sequence diversity in cotton species by mapping SNP loci (Table S1). Localization of these SNP markers to the 26 individual cotton chromosomes and their integration with large numbers of SSR markers will facilitate other studies in cotton genomics. We believe that the high-density genetic map reported herein is a saturated one for the allotetraploid cotton, as evidenced by a separate mapping analysis (data not shown). Further increase in the map density may not significantly change the total genetic length of this map but will facilitate whole-genome physical alignment, sequencing, and mapping of genes for cotton improvement.
Deviation from a Mendelian segregation ratio is common in intra- and interspecific crosses (Causse ; Lacape ; Rong ; Ulloa ; Yu ). An extremely severe distortion (99%) toward G. hirsutum was observed by Lacape when 140 RILs were used to produce a low-density map of approximately 800 loci. Only 15 of the 140 RILs exhibited 50% or more G. barbadense parental alleles. 
In this research, TM-1 was less environmentally sensitive than 3-79, as reflected by the allele transmission preference in the advancement of generations of the RIL population (Figure 1). Of the 2072 mapped marker loci, 1391 (67.1%) fit an expected 1:1 segregation ratio, and 681 (32.9%) deviated significantly (χ² > 3.8) from expectations among 186 RILs. The 681 segregation-distorted loci (SDL) were mapped in all 26 linkage groups, with 349 in At subgenome and 332 in Dt subgenome chromosomes. Four chromosomes, i.e., Chr. 15 (D01), Chr. 05 (A05), Chr. 07 (A07), and Chr. 08 (A08), had the most SDL, with 68, 60, 57, and 42 loci, respectively. However, Chr. 26 (D12) had the greatest percentage of SDL, 75%, followed by Chr. 15 (D01) with 73.9%, Chr. 07 (A07) with 68.7%, and Chr. 05 (A05) with 45.1%. In most cases, the SDL were mapped at centromeric regions.
Our mapping studies indicate that the two subgenomes of allotetraploid cottons are equivalent in recombination frequencies despite the extra repetitive DNA in the At subgenome (Zhao ). This result is consistent with independent mapping studies of other allotetraploid cotton populations (F2 or BC1), in which the variation between At and Dt map sizes supports the ratio of genetic distances that we observed between the two subgenomes. Rong mapped a total of 2584 STS loci that span 4447 cM, with the A subgenome being 9.5% larger genetically than the D subgenome. In contrast, Guo mapped a total of 1790 SSR loci that span 3426 cM, with the D subgenome being 4.5% larger genetically than the A subgenome. Yu mapped a total of 2316 SSR loci that span 4419 cM, with the A subgenome being 3.9% larger genetically than the D subgenome. In this study, the tetraploid cotton genome was mapped with 1106 loci (54.5%) on 13 At chromosomes at 1726.8 cM (51.1%) and 922 loci (45.5%) on 13 Dt chromosomes at 1653.1 cM (48.9%). 
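
The segregation test described above (an expected 1:1 ratio of parental alleles, flagged when χ² exceeds about 3.8) can be sketched in Python as follows. This is a minimal illustration, not the authors' pipeline: the function names are hypothetical, the allele counts in the usage lines are invented to show where the threshold falls for 186 fully scored RILs, and missing or heterozygous genotypes are assumed to have been excluded beforehand. The critical value 3.84 is the conventional 5% threshold for 1 degree of freedom, which the text rounds to 3.8.

```python
CRITICAL_1DF_P05 = 3.84  # conventional 5% critical value for chi-square with 1 df


def chi_square_1to1(n_a, n_b):
    """Chi-square statistic (1 df) for two allele classes expected in a 1:1 ratio."""
    expected = (n_a + n_b) / 2
    return ((n_a - expected) ** 2 + (n_b - expected) ** 2) / expected


def is_distorted(n_a, n_b, critical=CRITICAL_1DF_P05):
    """True when a locus deviates significantly from 1:1 at the chosen threshold."""
    return chi_square_1to1(n_a, n_b) > critical


def percent(part, whole):
    """Percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Hypothetical counts of TM-1 vs. 3-79 alleles at one locus among 186 RILs.
print(chi_square_1to1(106, 80), is_distorted(106, 80))  # just below the threshold
print(chi_square_1to1(107, 79), is_distorted(107, 79))  # just above the threshold

# Proportions reported in the text: 1391 fitting and 681 distorted loci of 2072.
print(percent(1391, 2072), percent(681, 2072))
```

With all 186 lines scored, a locus is flagged once the two allele classes differ by 28 or more lines (107:79), so even a modest transmission preference registers as distortion.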
Variation in the ratio of subgenome map distances is likely the result of differences in mapping population sizes, as well as in the numbers and sources of DNA markers. Our mapping data support the inference that two reciprocal translocations (between Chr. 02 and 03 and between Chr. 04 and 05) occurred during or after the polyploidization of the two ancestral diploid genomes (A and D). The translocation breakpoint between Chr. 02 and Chr. 03 may be at or near the homeologous SSR marker TMB1025. Further investigation is needed to identify additional markers in the vicinity of TMB1025. On the basis of homeologous markers of the two chromosome pairs (A02-D02 and A03-D03), the majority of duplicate loci were mapped to the individual pairs Chr. 02 vs. Chr. 17 and Chr. 03 vs. Chr. 14. The centromeric cores of these chromosomes seem to retain the homeologous relationship, which is consistent with either reciprocal insertional translocations or two temporally separate conventional reciprocal translocations. Thus, we propose to name Chr. 17 as D02 and Chr. 14 as D03, whereas Chr. 02 and Chr. 03 remain as A02 and A03, respectively, which is a revision to Wang and Guo . Duplication of marker loci revealed genome rearrangements within the same individual chromosomes and/or between nonhomeologous chromosomes (Table S3). We recognize that a nomenclature revision of cotton chromosomes and linkage groups will be needed in the future; this could be accomplished by an international committee of experts in the subject matter. Genetic mapping coupled with physical alignment of genomic regions into chromosomal maps will expedite the discovery of resistance (R) or pathogen-induced R genes underlying QTL involved in resistance to nematodes and Fusarium wilt (Ulloa ). 
Chromosomes 11 (A11) and 21 (D11) are homeologs that harbor important genes for cotton improvement because these chromosomes contain genes for resistance to reniform (Dighe ) and root-knot nematodes (Wang ), Fusarium wilt race 1 (Ulloa ), and other traits affecting fiber yield and quality. The high-density genetic map will facilitate and expedite the analysis of plant defense genes against nematodes and other biotrophic pathogens. This high-density cotton map was constructed with an immortal RIL mapping population. A high level of homozygosity in this RIL population (currently in F8-F9) was achieved, with less than 5% genome-wide residual heterozygosity. The RILs are maintained as living stocks to produce seed sources for multilocation research on fiber, among other traits, and to extract fresh DNA samples for a broad spectrum of genomic studies. Our mapping population of 186 RILs is the largest population ever used in high-density cotton genetic mapping. Mapping accuracy can be improved substantially because the proportion of recombinants between two linked markers in an inbred population is about twice that in a single-meiosis F2 or BC1 population when linkage distances are small (<12.5 cM), and it increases nonlinearly to 50% for unlinked markers (Burr ; Haldane and Waddington 1931). This population provides the greatest mapping power currently known in cotton to detect additional loci between closely linked markers by members of the cotton research community who are interested in SSR and SNP augmentation. The advantages of this immortal RIL population and its parental lines make it practical for high-resolution consensus mapping with additional sequence-based portable markers, enabling better understanding and exploitation of complex Gossypium genomes (Mace ). 
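
The doubling of observed recombinants in an inbred population follows from the Haldane and Waddington (1931) relationship for recombinant inbred lines produced by selfing, R = 2r / (1 + 2r), where r is the recombination fraction per meiosis and R is the fraction of recombinant lines. The Python sketch below assumes that the RILs were advanced by selfing and treats small map distances as approximately equal to recombination fractions; the function names are illustrative.

```python
def ril_recombinant_fraction(r):
    """Fraction of recombinant RILs (selfing) for a per-meiosis recombination fraction r.

    Haldane and Waddington (1931): R = 2r / (1 + 2r).
    """
    if not 0.0 <= r <= 0.5:
        raise ValueError("r must lie between 0 and 0.5")
    return 2 * r / (1 + 2 * r)


def meiotic_recombination_fraction(big_r):
    """Inverse relationship: r = R / (2 * (1 - R))."""
    if not 0.0 <= big_r <= 0.5:
        raise ValueError("R must lie between 0 and 0.5")
    return big_r / (2 * (1 - big_r))


# Ratio R / r shrinks from about 2 for tight linkage to 1 for unlinked markers.
for r in (0.01, 0.05, 0.125, 0.25, 0.5):
    big_r = ril_recombinant_fraction(r)
    print(f"r = {r:.3f}  R = {big_r:.4f}  R / r = {big_r / r:.2f}")
```

For tightly linked markers R is close to 2r, so the same number of lines yields roughly twice as many informative recombinants as an F2 or BC1 population; at r = 0.5 the relationship gives R = 0.5, the value expected for unlinked markers.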
This information will complement other work because of the use of the same parents in developing genetic resources, such as hypoaneuploid cytogenetic stocks, chromosome substitution lines, chromosome-specific RILs, and QTL mapping populations in other research programs (Jenkins , 2007; Saha , 2010, 2011; Stelly ). The International Cotton Genome Initiative (http://icgi.tamu.edu/) has proposed to map and sequence the Gossypium genomes (Brubaker ; Chen ; Paterson 2008; Wilkins 2008; Yu ), but large amounts of dispersed repetitive elements and duplicate loci between and within the allotetraploid cotton chromosomes present great challenges to the proper assembly of a complex Gossypium genome. Development of additional SSR and SNP markers from the fingerprinted and sequenced BAC clones or physical contigs, such as the 310 TMB and 155 MUSB markers on the present map, would provide a unique opportunity to facilitate mapping of the gaps (5−15 cM) in genomic regions (Lin ; Xu ; Yin ). A high-density genetic map is essential in the reconciliation with a whole-genome physical map to facilitate genome sequencing, sequence assembly, gene mapping, and the design of targeted genetic markers for better understanding and improvement of the cotton plant.

References (38 in total)

1.  Cotton fiber growth in planta and in vitro. Models for plant cell elongation and cell wall biogenesis. (Review)

Authors:  H J Kim; B A Triplett
Journal:  Plant Physiol       Date:  2001-12       Impact factor: 8.340

2.  Meiotic Chromosome Behavior in Species, Species Hybrids, Haploids, and Induced Polyploids of Gossypium.

Authors:  J O Beasley
Journal:  Genetics       Date:  1942-01       Impact factor: 4.562

3.  A microsatellite-based, gene-rich linkage map reveals genome structure, function and evolution in Gossypium.

Authors:  Wangzhen Guo; Caiping Cai; Changbiao Wang; Zhiguo Han; Xianliang Song; Kai Wang; Xiaowei Niu; Cheng Wang; Keyu Lu; Ben Shi; Tianzhen Zhang
Journal:  Genetics       Date:  2007-04-03       Impact factor: 4.562

4.  Identification and mapping of microsatellite markers linked to a root-knot nematode resistance gene (rkn1) in Acala NemX cotton (Gossypium hirsutum L.).

Authors:  C Wang; M Ulloa; P A Roberts
Journal:  Theor Appl Genet       Date:  2005-12-14       Impact factor: 5.699

5.  Dispersed repetitive DNA has spread to new genomes since polyploid formation in cotton.

Authors:  X P Zhao; Y Si; R E Hanson; C F Crane; H J Price; D M Stelly; J F Wendel; A H Paterson
Journal:  Genome Res       Date:  1998-05       Impact factor: 9.043

6.  Polyploid formation created unique avenues for response to selection in Gossypium (cotton).

Authors:  C Jiang; R J Wright; K M El-Zik; A H Paterson
Journal:  Proc Natl Acad Sci U S A       Date:  1998-04-14       Impact factor: 11.205

7.  Mapping Fusarium wilt race 1 resistance genes in cotton by inheritance, QTL and sequencing composition.

Authors:  Mauricio Ulloa; Congli Wang; Robert B Hutmacher; Steven D Wright; R Michael Davis; Christopher A Saski; Philip A Roberts
Journal:  Mol Genet Genomics       Date:  2011-05-01       Impact factor: 3.291

8.  A new interspecific, Gossypium hirsutum x G. barbadense, RIL population: towards a unified consensus linkage map of tetraploid cotton.

Authors:  Jean-Marc Lacape; J Jacobs; T Arioli; R Derijcker; N Forestier-Chiron; D Llewellyn; J Jean; E Thomas; C Viot
Journal:  Theor Appl Genet       Date:  2009-04-23       Impact factor: 5.699

9.  A draft physical map of a D-genome cotton species (Gossypium raimondii).

Authors:  Lifeng Lin; Gary J Pierce; John E Bowers; James C Estill; Rosana O Compton; Lisa K Rainville; Changsoo Kim; Cornelia Lemke; Junkang Rong; Haibao Tang; Xiyin Wang; Michele Braidotti; Amy H Chen; Kristen Chicola; Kristi Collura; Ethan Epps; Wolfgang Golser; Corrinne Grover; Jennifer Ingles; Santhosh Karunakaran; Dave Kudrna; Jaime Olive; Nabila Tabassum; Eareana Um; Marina Wissotski; Yeisoo Yu; Andrea Zuccolo; Mehboob ur Rahman; Daniel G Peterson; Rod A Wing; Jonathan F Wendel; Andrew H Paterson
Journal:  BMC Genomics       Date:  2010-06-22       Impact factor: 3.969

10.  CMD: a Cotton Microsatellite Database resource for Gossypium genomics.

Authors:  Anna Blenda; Jodi Scheffler; Brian Scheffler; Michael Palmer; Jean-Marc Lacape; John Z Yu; Christopher Jesudurai; Sook Jung; Sriram Muthukumar; Preetham Yellambalase; Stephen Ficklin; Margaret Staton; Robert Eshelman; Mauricio Ulloa; Sukumar Saha; Ben Burr; Shaolin Liu; Tianzhen Zhang; Deqiu Fang; Alan Pepper; Siva Kumpatla; John Jacobs; Jeff Tomkins; Roy Cantrell; Dorrie Main
Journal:  BMC Genomics       Date:  2006-05-31       Impact factor: 3.969

