| Literature DB >> 27576450 |
Junji Su1,2,3, Chaoyou Pang2, Hengling Wei2, Libei Li2, Bing Liang2, Caixiang Wang2, Meizhen Song2, Hantao Wang2, Shuqi Zhao2, Xiaoyun Jia1,2, Guangzhi Mao2, Long Huang4, Dandan Geng4, Chengshe Wang5, Shuli Fan6, Shuxun Yu7,8.
Abstract
BACKGROUND: Early maturity is one of the most important and complex agronomic traits in upland cotton (Gossypium hirsutum L). To dissect the genetic architecture of this agronomically important trait, a population consisting of 355 upland cotton germplasm accessions was genotyped using the specific-locus amplified fragment sequencing (SLAF-seq) approach, of which a subset of 185 lines representative of the diversity among the accessions was phenotypically characterized for six early maturity traits in four environments. A genome-wide association study (GWAS) was conducted using the generalized linear model (GLM) and mixed linear model (MLM).Entities:
Keywords: Candidate gene; Early maturity traits; GWAS; Gossypium hirsutum L; SLAF-seq; SNP alleles
Mesh:
Year: 2016 PMID: 27576450 PMCID: PMC5006539 DOI: 10.1186/s12864-016-2875-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Single nucleotide polymorphism (SNP) distributions on the 26 chromosomes of upland cotton. At1 ~ At13 and Dt1 ~ Dt13 in vertical axis are the serial number of 26 chromosomes; The horizontal axis shows chromosome length (Mb); 0 ~ 50 depicts SNP density (the number of SNPs per window)
SNP distribution on each chromosome
| Chromosome | SNP number | Chromosome length (Mb) | SNP densitya (kb) | Chromosome | SNP number | Chromosome length (Mb) | SNP densitya (kb) |
|---|---|---|---|---|---|---|---|
| At1 | 4553 | 106.99 | 23.50 | Dt1 | 5882 | 117.24 | 19.93 |
| At2 | 2890 | 61.75 | 21.37 | Dt2 | 2801 | 50.19 | 17.92 |
| At3 | 2769 | 100.35 | 36.24 | Dt3 | 1464 | 42.80 | 29.23 |
| At4 | 4207 | 92.29 | 21.94 | Dt4 | 2166 | 37.36 | 17.25 |
| At5 | 3688 | 102.56 | 27.81 | Dt5 | 3903 | 65.57 | 16.80 |
| At6 | 2721 | 56.48 | 20.76 | Dt6 | 2378 | 44.80 | 18.84 |
| At7 | 1006 | 23.27 | 23.13 | Dt7 | 1480 | 47.98 | 32.42 |
| At8 | 2540 | 78.75 | 31.00 | Dt8 | 3611 | 56.89 | 15.76 |
| At9 | 4268 | 91.12 | 21.35 | Dt9 | 3774 | 70.91 | 18.79 |
| At10 | 4877 | 95.48 | 19.58 | Dt10 | 3026 | 57.80 | 19.10 |
| At11 | 4032 | 102.02 | 25.30 | Dt11 | 2567 | 61.12 | 23.81 |
| At12 | 2564 | 79.16 | 30.87 | Dt12 | 1658 | 39.38 | 23.75 |
| At13 | 4456 | 91.55 | 20.55 | Dt13 | 2394 | 55.87 | 23.34 |
aSNP density is presented as the average physical distance between two adjacent SNP loci
Fig. 2Population structure and linkage disequilibrium (LD) decay of upland cotton accessions. a Population structure of upland cotton accessions; each line is represented by a single vertical line, and each color represents one cluster; b estimated ln(cross-validation errors in the data) calculated for K, ranging from 2 to 10; c the mean LD decay rate was estimated as the squared correlation coefficient (r2) using all pairs of SNPs located within 600 kb of physical distance in genomic regions in a population of 355 upland cotton germplasm accessions
Fig. 3Frequency distributions of the mean values of six maturity traits of 185 cotton accessions in four environments. a whole growth period (WGP); b flowering time (FT); c flowering and boll-setting period (FBP); d node of the first fruiting branch (NFFB); e height of the node of the first fruiting branch (HNFFB); and f yield percentage before frost (YPBF)
Fig. 4Manhattan plots of genome-wide association studies (GWAS) for the WGP (a), FT (b), NFFB (c) and YPBF (d) measured with the generalized linear model (GLM) using the best linear unbiased prediction (BLUP) values for the four environments. The SNP locus rs13562854 is indicated by the black arrow. The general and highly significant trait-associated SNPs are distinguished by the red and blue threshold lines, respectively
Fig. 5Manhattan plots of genome-wide association studies (GWAS) for the WGP (a) and FT (b) measured with the mixed linear model (MLM) using the best linear unbiased prediction (BLUP) values for the four environments. The SNP locus rs13562854 is indicated by the black arrow. The general and highly significant trait-associated SNPs are distinguished by the red and blue threshold lines, respectively
Favorable SNP alleles, their phenotypic effects (ai), and representative accessions
| Traits | SNP | Position | Alleles | Favorable alleles | ai | Accessions | Representative accessionsa |
|---|---|---|---|---|---|---|---|
| WGP |
| At3:26538646 | A/G | A | −6.70** | 59 | zhongmiansuo74, xia25, zhong416 |
|
| At3:26538688 | G/T | G | −6.70** | 59 | zhongmiansuo74, xia25, zhong416 | |
|
| Dt3:13562854 | A/G | A | −7.53** | 66 | zhongmiansuo74, xia25, zhong416 | |
|
| Dt3:8917898 | A/G | G | −7.58** | 49 | zhong6426, zhong51822, xia13-7 | |
|
| Dt3:13153008 | A/G | A | −7.76** | 42 | xia25, zhong416, baimian17 | |
| FT |
| At4:22465987 | A/G | G | −0.07 | 94 | 1476, zhongmiansuo74, xiaomian3 |
|
| At12:48627288 | A/G | A | −0.55 | 86 | xia25, xiazao3, zhongmiansuo14 | |
|
| Dt3:13562854 | A/G | A | −3.88** | 66 | xiazao2,zhongmiansuo74,xia25 | |
|
| Dt3:8917898 | A/G | G | −3.69** | 49 | xia25, 1476, xiazao3 | |
|
| Dt3:37255056 | A/G | A | −3.40** | 56 | xia25, 1476, xiazao3 | |
| FBP |
| Dt3:13153008 | A/G | A | −4.09** | 42 | zhong416, xia25, zhongmiansuo64 |
| NFFB |
| Dt3:13562854 | A/G | A | −0.91** | 66 | xiaozao2, xiazao3, xia25 |
| YPBF |
| Dt3:13562854 | A/G | A | 10.45** | 66 | zhongmiansuo74, xia25, xiazao3 |
aRepresentative accessions consist of the top 3 entries for the target trait values of accessions with the corresponding favorable alleles; **highly favorable SNP alleles that exhibit significantly different traits compared with the unfavorable alleles (P < 0.01)
Pyramiding effects of the highly favorable alleles that contribute to early maturity
| Traits | No. of favorable alleles | Mean ± SD | Frequency (%) |
|---|---|---|---|
| WGP (d) | 0 | 125.05 ± 2.66 (A) | 41.46 |
| 1 | 117.39 ± 5.83 (B) | 7.32 | |
| 2 | 113.55 ± 5.89 (B) | 12.20 | |
| 3 | |||
| 4 | 108.84 ± 2.63 (C) | 39.02 | |
| FT (d) | 0 | 69.75 ± 2.26 (A) | 57.48 |
| 1 | 64.26 ± 1.99 (B) | 11.81 | |
| 2 | 64.42 ± 2.36 (B) | 1.57 | |
| 3 | 62.3 ± 1.10 (C) | 29.13 | |
| FBP (d) | 0 | 54.71 ± 3.36 (A) | 39.62 |
| 1 | 47.55 ± 2.1 (B) | 60.38 | |
| NFFB | 0 | 7.23 ± 0.82 (A) | 59.51 |
| 1 | 5.59 ± 0.38 (B) | 40.49 | |
| YPBF (%) | 0 | 61.03 ± 9.59 (A) | 59.51 |
| 1 | 82.19 ± 4.48 (B) | 40.49 |
Values with different letters are significantly different (P < 0.05)
Fig. 6Linear regression analyses of the numbers of highly favorable SNP alleles and average WGP (a) and FT values (b) in four environments
Fig. 7Favorable alleles (A) and unfavorable alleles (G) at the SNP locus rs13562854 for six traits related to early maturity in the four environments. a–f represent six traits related to early maturity WGP, FT, FBP, NFFB, HNFFB and YPBF, respectively; *, ** indicate significance at probability levels of 0.05 and 0.01, respectively; SP-2013, SU-2013, SP-2014 and SU-2014 are the four environments
Candidate genes most highly associated with early maturity within 500 kb of either side of the SNP locus rs13562854
| #GeneID | Start | Stop | Direction | Distance to SNP (kb) | Annotation |
|---|---|---|---|---|---|
|
| 13482736 | 13483020 | Forward | 79.83 | |
|
| 13836983 | 13837348 | Reverse | 274.13 | Tetratricopeptide repeat-like superfamily protein, putative |
|
| 13685536 | 13686374 | Reverse | 122.68 | Zinc finger protein, putative isoform 1 |
|
| 13173869 | 13177058 | Forward | 385.80 | Enolase 1, chloroplastic-like protein |
|
| 13669620 | 13671044 | Reverse | 106.77 | Zinc finger protein, putative isoform 1 |
|
| 13215464 | 13215928 | Reverse | 346.93 | Proline and serine-rich 1 |
|
| 13641931 | 13643458 | Reverse | 79.08 | Ribonuclease P subunit p30 |
|
| 13680075 | 13680812 | Forward | 117.22 | Hypothetical protein F383_23360 |
|
| 13549901 | 13550143 | Forward | 12.71 | |
|
| 13835211 | 13836617 | Forward | 272.36 | UDP-glycosyltransferase 89B1-like |
|
| 13837426 | 13838590 | Reverse | 274.57 | Tetratricopeptide repeat-like superfamily protein, putative |
|
| 13894122 | 13897285 | Reverse | 331.27 | Hypothetical protein F383_21541 |
|
| 14027015 | 14029396 | Reverse | 464.16 | ADP, ATP carrier protein ER-ANT1-like |
|
| 13425578 | 13427698 | Reverse | 135.16 | DNA-directed RNA polymerases I and III subunit RPAC1 |
|
| 13922306 | 13923205 | Forward | 359.45 | |
|
| 13839501 | 13840400 | Reverse | 276.65 | UDP-glucosyl transferase 89B1, putative |
|
| 13169627 | 13172234 | Forward | 390.62 | DnaJ, mitochondrial |
|
| 13763420 | 13764002 | Forward | 200.57 | |
|
| 13313301 | 13314846 | Forward | 248.01 | Zinc finger CONSTANS-LIKE 2-like protein |
|
| 13095428 | 13096529 | Forward | 466.33 | UDP-N-acetylmuramoyl-alanine-D-glutamate ligase |
|
| 13233921 | 13234454 | Forward | 328.40 | |
|
| 13066571 | 13067059 | Forward | 495.80 | Agamous-like MADS-box protein A |
|
| 13771588 | 13773328 | Forward | 208.73 | Crooked neck-like protein 1 |
|
| 13252910 | 13256745 | Forward | 306.11 | Serine/threonine protein kinase 16 |
|
| 13990075 | 13991007 | Forward | 427.22 | OBF-binding protein 4, putative |
|
| 14020947 | 14021135 | Reverse | 458.09 | Hypothetical protein CISIN_1g035470mg |
|
| 14015684 | 14017498 | Reverse | 452.83 | MADS-box protein |
|
| 13135919 | 13143770 | Forward | 419.08 | Putative acyl-activating enzyme 17, peroxisomal-like protein |
|
| 13956974 | 13957540 | Reverse | 394.12 | ARM repeat superfamily protein |
|
| 13236335 | 13238168 | Forward | 324.69 | Hypothetical protein F383_15236 |
|
| 13103756 | 13106200 | Forward | 456.65 | |
|
| 13717384 | 13722424 | Reverse | 154.53 | WD repeat and HMG-box DNA-binding 1 |
Fig. 8Increased expression of the MADS-box family gene CotAD_01947 in early-maturing cultivars of upland cotton. a Plants at the boll-opening stage of two early-maturing and two late-maturing cotton varieties. b and c Phenotypic effect values of the WGP and FT for two early-maturing and two late-maturing varieties. d Tissue-specific expression patterns of CotAD_01947. e Expression levels of CotAD_01947 during the five different leaf growth stages. **indicates significance at the 0.01 probability level
Fig. 9Physical maps and linkage relationships among quantitative trait loci (QTLs) in previous and present studies. Physical maps of the reference Gossypium hirsutum genome D03 [56] and Dt3 [31] from the present study, respectively. Linkage map of C17 (D3) from a previous study [2]