| Literature DB >> 25359292 |
Amanda M Hulse-Kemp, Hamid Ashrafi, Xiuting Zheng, Fei Wang, Kevin A Hoegenauer, Andrea B V Maeda, S Samuel Yang, Kevin Stoffel, Marta Matvienko, Kimberly Clemons, Joshua A Udall, Allen Van Deynze, Don C Jones, David M Stelly1.
Abstract
BACKGROUND: Cotton (Gossypium spp.) is the largest producer of natural fibers for textile and is an important crop worldwide. Crop production is comprised primarily of G. hirsutum L., an allotetraploid. However, elite cultivars express very small amounts of variation due to the species monophyletic origin, domestication and further bottlenecks due to selection. Conversely, wild cotton species harbor extensive genetic diversity of prospective utility to improve many beneficial agronomic traits, fiber characteristics, and resistance to disease and drought. Introgression of traits from wild species can provide a natural way to incorporate advantageous traits through breeding to generate higher-producing cotton cultivars and more sustainable production systems. Interspecific introgression efforts by conventional methods are very time-consuming and costly, but can be expedited using marker-assisted selection.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25359292 PMCID: PMC4298081 DOI: 10.1186/1471-2164-15-945
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Transcriptome sequence information
| Species | Sample | Raw reads (#) | Trimmed reads (#) | Mapped reads (#) | Depth | Reference coverage (%) |
|---|---|---|---|---|---|---|
|
| 3-79 | 101,276,621 | 101,276,621 | 43,519,596 | 48.45 | 84% |
|
| 19909036.05 | 63,119,599 | 63,118,203 | 38,439,408 | 31.77 | 87% |
|
| 200508123.02 | 65,940,564 | 65,940,111 | 40,767,777 | 33.07 | 86% |
|
| D2-1-6 | 53,279,426 | 53,279,087 | 20,266,019 | 20.90 | 68% |
|
| 200908137.04 | 52,050,537 | 52,050,305 | 23,393,277 | 24.50 | 65% |
Raw and processed read information of Illumina GA-II (Solexa) sequence generated from RNA-Seq libraries for G. barbadense, G. tomentosum, G. mustelinum, G. armourianum, and G. longicalyx.
List of unfiltered SNPs determined for all species
| Class I | Class II | Class III | Total | |
|---|---|---|---|---|
|
| 3,257 | 6,385 | 1,246 | 10,888 |
|
| 1,520 | 6,526 | 1,474 | 9,520 |
|
| 1,678 | 7,584 | 1,726 | 10,988 |
|
| 7,331 | 14,523 | 5,120 | 26,974 |
|
| 14,546 | 18,960 | 4,711 | 38,217 |
Number of SNPs derived in silico relative to G. hirsutum inbred line TM-1 for species G. barbadense, G. tomentosum, G. mustelinum, G. armourianum, and G. longicalyx. SNPs are classified into three categories, Class I are SNPs from contigs with no other SNP residing within the contig. Class II are SNPs from contigs that contain one or more additional SNP outside of the 50-bp flanking sequences (none within). Class III are SNPs from contigs that contain one or more additional SNPs within the 50-bp flanking sequences.
Figure 1Overlap of SNPs among species. The overlap and specificity of the Class I and Class II SNPs for G. barbadense cv. 3–79, G. tomentosum, G. mustelinum, G. armourianum, and G. longicalyx.
Figure 2Distributions of SNPs relative to (D ) draft genome. All SNPs for each species were plotted according to BWA alignment positions (X-axis) across the G. raimondii (D5) draft genome over a sliding window in R. Density (Y-axis) is the proportion of the number of SNPs within a species-specific data set calculated over a sliding window. A figure was produced for each of the 13 scaffolds of the draft genome sequence. G. barbadense cv. 3–79 is shown in blue, G. tomentosum is shown in red, G. mustelinum is shown in brown, G. armourianum is shown in yellow, G. longicalyx is shown in purple.
Figure 3KASP marker screening panels. Screening panels containing control and mapping samples used for determining successful and unsuccessful markers via KASP assay genotyping. (A.) Panel used for screening markers derived from G. barbadense, “G. barbadense screening panel”. (B.) Panel used for screening markers derived from G. tomentosum and G. mustelinum. (C.) Panel used for screening markers derived from G. longicalyx. (D.) Panel used for screening markers derived from G. armourianum.
Figure 4Wide-cross whole-genome radiation hybrid bin map. Wide-cross whole-genome radiation hybrid map generated from genotypes of 131 irradiated F1 (G. hirsutum line TM-1 × G. barbadense cv. 3–79) individuals in Carthagene using LOD score of 3. Bins consist of all markers which fall in a single syntenic group as determined by Carthagene. Bins are aligned to the G. raimondii (D5) draft genome sequence by BWA mapping of individual SNP markers. Bold markers indicate markers from the Van Deynze et al. [20] data set that were mapped in the Yu et al. [14] paper. Underlined markers indicate markers for which the sequences were overlapped.
Figure 5KASP genotyping of marker UCcg10563_649. A1 quadrant of a 384-well plate KlusterCaller image of genotyping KASP assay for marker UCcg10563_649 after 38 cycles. A.) G. barbadense L. (AD)2 cv. 3–79, B.) G. hirsutum L. (AD)1 line TM-1, C.) F1 euploid hybrid G. barbadense L. (AD)2 cv. 3–79 and G. hirsutum L. (AD)1 line TM-1, D.) F1 hypo-aneuploid lines for AD-Chromosome 17, E.) Wide-cross whole-genome radiation hybrid samples, X.) Non-template (negative control), in B2 quadrant (not shown). Both samples in D and 6 of the samples in E show deletions due to the shift in genotype from F1-green to homozygous for the G. barbadense L. (AD)2 allele – blue. [The two samples shown in white and yellow were considered questionable genotypes as they did not fall directly in a cluster].