| Literature DB >> 25656006 |
Hantao Wang1, Xin Jin1, Beibei Zhang1, Chao Shen1, Zhongxu Lin2.
Abstract
RAD sequencing was performed using DH962 and Jimian5 as upland cotton mapping parents. Sequencing data for DH962 and Jimian5 were assembled into the genome sequences of ≈55.27 and ≈57.06 Mb, respectively. Analysing genome sequences of the two parents, 1,323 SSR, 3,838 insertion/deletion (InDel), and 9,366 single-nucleotide polymorphism (SNP) primer pairs were developed. All of the SSRs, 121 InDels, 441 SNPs, and other 6,747 primer pairs were screened in the two parents, and a total of 535 new polymorphic loci were identified. A genetic map including 1,013 loci was constructed using these results and 506 loci previously published for this population. Twenty-seven new QTLs for yield and fibre quality were identified, indicating that the efficiency of QTL detection was greatly improved by the increase in map density. Comparative genomics showed there to be considerable homology and collinearity between the AT and A2 genomes and between the DT and D5 genomes, although there were a few exchanges and introgressions among the chromosomes of the A2 genome. Here, the development of markers using parental RAD sequencing was effective, and a high-density intraspecific genetic map was constructed. This map can be used for molecular marker-assisted selection in cotton.Entities:
Keywords: Gossypium hirsutum, RAD sequencing; QTL mapping; comparative genomics; genetic map
Mesh:
Substances:
Year: 2015 PMID: 25656006 PMCID: PMC4401325 DOI: 10.1093/dnares/dsu047
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1.Overall workflow of marker discovery and analysis.
Figure 2.Comparative analysis of the flanking sequences of markers of DH962 and Jimian5.
Figure 3.Sequence length distribution of the Gh-D-J data set.
Summary statistics of the RAD tag sequencing via illumina (DH962, Jimian5)
| Feature | DH962 | Jimian5 |
|---|---|---|
| Illumina reads (million) after sequence editing | 62.46 | 61.27 |
| Sequences bases (Gb) after sequence editing | 5.15 | 5.18 |
| Number of contigs | 178,157 (55.27 Mb) | 181,422 (57.06 Mb) |
| Average contig length (bp) | 310 | 314 |
| Common contigs | 105,264 (38.14 Mb) | |
| Contig number of | 251,816 (85.76 Mb) | |
| Number of SSR primer pairs | 1,323 | |
| Number of InDel primer pairs | 3,838 | |
| Number of SNP primer pairs | 9,366 | |
| Number of sequences with markers | 14,433 | |
Figure 4.The representation of known repetitive elements in Gossypium hirsutum RAD sequences.
Figure 5.Gene ontology classification of the Gh-D-J data set. The three GO categories are cellular component, molecular function, and biological process.
Summary of SSRs identified from two libraries
| Research items | DH962 | Jimian5 |
|---|---|---|
| Total number of identified SSRs | 5,158 | 5,461 |
| Number of SSR-containing sequences | 4,911 | 5,214 |
| Number of SSRs present in compound formation | 244 | 246 |
| Frequency of SSRs | 1/10.72 kb | 1/10.44 kb |
| Frequency of motif size | Dinucleotide (18.96%) | Dinucleotide (18.57%) |
| Trinucleotide (17.82%) | Trinucleotide (18.49%) | |
| Tetranucleotide (12.39%) | Tetranucleotide (12.54%) | |
| Pentanucleotide (33.29%) | Pentanucleotide (32.72%) | |
| Hexanucleotide (13.90%) | Hexanucleotide (13.94%) | |
| Heptanucleotide (3.16%) | Heptanucleotide (3.19%) | |
| Octanucleotide (0.48%) | Octanucleotide (0.55%) | |
| Frequency of major motif type | AT/AT (12.62%) | AT/AT (12.43%) |
| AAAAT/ATTTT (12.04%) | AAAAT/ATTTT (11.79%) | |
| AAT/ATT (7.06%) | AAT/ATT (7.34%) | |
| AAAT/ATTT (6.07%) | AAAT/ATTT (6.08%) | |
| AAG/CTT (5.16%) | AAAAG/CTTTT (5.44%) | |
| AAAAG/CTTTT (4.94%) | AAG/CTT (5.13%) | |
| AG/CT (3.78%) | AG/CT (3.77%) | |
| AAATT/AATTT (3.70%) | AAATT/AATTT (3.70%) | |
| AAAAAT/ATTTTT (2.77%) | AAAAAT/ATTTTT (3.19%) | |
| AC/GT (2.56%) | ATC/ATG (2.44%) | |
| ATC/ATG (2.27%) | AC/GT (2.36%) | |
| AATAT/ATATT (1.78%) | AAAAAG/CTTTTT (1.50%) | |
| AATT/AATT (1.43%) | AATAT/ATATT (1.46%) | |
| ACAT/ATGT (1.40%) | AATT/AATT (1.39%) | |
| AAAAAG/CTTTTT (1.40%) | ACAT/ATGT (1.37%) | |
| AAAAC/GTTTT (1.28%) | AAAG/CTTT (1.28%) | |
| AAAG/CTTT (1.22%) | AAAAC/GTTTT (1.26%) | |
| AAC/GTT (1.07%) | AAC/GTT (0.97%) |
Figure 6.Characteristics and distribution of InDels and SNPs in two RAD libraries. (a) Distribution of insertions (+) and deletions (−). (b) Transitions and transversions occurring within the mined SNPs.
Figure 7.The distribution of the 14,103 contigs on the A2 and D5 genomes. (a) The distribution of contigs on the A2 genome; (b) the distribution of contigs on the D5 genome.
Figure 8.The distribution of 562 markers on the A2 and D5 genomes. (a) Analysis of homology between the AT and A2 genomes; (b) analysis of homology between the DT and D5 genomes.