| Literature DB >> 34093598 |
Zareen Sarfraz1, Muhammad Shahid Iqbal1,2, Xiaoli Geng1, Muhammad Sajid Iqbal1,2, Mian Faisal Nazir1, Haris Ahmed1, Shoupu He1, Yinhua Jia1, Zhaoe Pan1, Gaofei Sun3, Saghir Ahmad2, Qinglian Wang4, Hongde Qin5, Jinhai Liu6, Hui Liu7, Jun Yang8, Zhiying Ma9, Dongyong Xu10, Jinlong Yang6, Jinbiao Zhang11, Zhikun Li9, Zhongmin Cai6, Xuelin Zhang12, Xin Zhang4, Aifen Huang13, Xianda Yi5, Guanyin Zhou6, Lin Li11, Haiyong Zhu1, Baoyin Pang1, Liru Wang1, Junling Sun1, Xiongming Du1.
Abstract
For about a century, plant breeding has widely exploited the heterosis phenomenon-often considered as hybrid vigor-to increase agricultural productivity. The ensuing F1 hybrids can substantially outperform their progenitors due to heterozygous combinations that mitigate deleterious mutations occurring in each genome. However, only fragmented knowledge is available concerning the underlying genes and processes that foster heterosis. Although cotton is among the highly valued crops, its improvement programs that involve the exploitation of heterosis are still limited in terms of significant accomplishments to make it broadly applicable in different agro-ecological zones. Here, F1 hybrids were derived from mating a diverse Upland Cotton germplasm with commercially valuable cultivars in the Line × Tester fashion and evaluated across multiple environments for 10 measurable traits. These traits were dissected into five different heterosis types and specific combining ability (SCA). Subsequent genome-wide predictions along-with association analyses uncovered a set of 298 highly significant key single nucleotide polymorphisms (SNPs)/Quantitative Trait Nucleotides (QTNs) and 271 heterotic Quantitative Trait Nucleotides (hQTNs) related to agronomic and fiber quality traits. The integration of a genome wide association study with RNA-sequence analysis yielded 275 candidate genes in the vicinity of key SNPs/QTNs. Fiber micronaire (MIC) and lint percentage (LP) had the maximum number of associated genes, i.e., each with 45 related to QTNs/hQTNs. A total of 54 putative candidate genes were identified in association with HETEROSIS of quoted traits. The novel players in the heterosis mechanism highlighted in this study may prove to be scientifically and biologically important for cotton biologists, and for those breeders engaged in cotton fiber and yield improvement programs.Entities:
Keywords: F1 hybrid; GWAS; hQTNs; heterosis; multiple environments; upland cotton
Year: 2021 PMID: 34093598 PMCID: PMC8173050 DOI: 10.3389/fpls.2021.565552
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
FIGURE 1Violin plots based on phenotypic variation of ten agronomic and fiber quality traits of F1 hybrids (Yaxis) from four sets (SetA, SetC, SetD, and SetE) across multiple environments for two years; 2012 and 2013 (Xaxis). Legends on top right in different colors are representing ten evaluated phenotypic traits.
FIGURE 2Distribution of SCA and five heterosis types (HB, MP, HI, K3, and K4) of agronomic and fiber quality traits among four F1 hybrid sets (SetA, SetC, SetD, and SetE) across multiple environments for years 2012 and 2013. Legends on the top right in different colors are depicting ten evaluated phenotypic traits.
FIGURE 3Population structure of 284 female parents in the association panel (A) Principal component analysis (PCA) of female lines (B) Population structure of 284 female parents (K = 3).
FIGURE 4Single nucleotide polymorphism (SNP) distributions on 26 chromosomes of (A) parents, (B) F1_A, (C) F1_C, (D) F1_D, and (E) F1_E. A1∼A13 and D1∼D13 in vertical axis are the serial number of 26 chromosomes; the horizontal axis shows chromosome length (Mb); = 0 ∼>702 depicts SNP density (the number of SNPs per window).
FIGURE 5Phenogram displaying the 2847 significant (–log (p) ≥ 4) associations among phenotypic traits and 1348 significant SNPs residing on 26 chromosomes of upland cotton.
FIGURE 6Summary of significant association signals and significant SNPs. (A) Representation of significant associations among 10 phenotypic traits, four F1 sets, five heterosis types, SCA and significant SNPs (B) details of significant SNPs commonly associated across four different sets of F1 hybrids (C) number of significant SNPs/hQTNs associated with heterosis types, SCA and F1 sets.
FIGURE 7Depicted here are results from the multivariate analysis of pleiotropy. For each associated SNP, the method returns the best-fitting solution of which phenotypes were associated with that SNP. All SNPs with one or more associated phenotypes are shown here. For example, every SNP associated with FE was found to be pleiotropic for other phenotypes. The total number of pleiotropic as well as unique associated SNPs for each trait from these analyses were 181 (FE), 176 (BW), 147 (LP), 146 (MIC), 141 (FUI), 134 (FS), 113 (FL), 111 (BN), 107 (PH), and 92 (FU).
FIGURE 8Detailed view of significant SNPs related to five types of heterosis, F1, SCA and studied traits on 26 chromosomes with their physical positions (bp).
FIGURE 9Heat map for expression patterns of the 275 genes nearby significant keys SNPs/QTNs associated with studied agronomic and fiber quality traits. Shaded portion is representing expression >1 while white portion is representing <1.
List of 64 candidate genes related to fiber quality and yield related traits with their details about biological function (GO items) annotations.
| Trait | QTN/hQTN | Gene ID | Name | Start | Stop | Length (bp) | Direction | Function Description | Function annotation |
| BN | hqBND02_ 1523160_D | Gh_D02G0165 | ARA, ARA-1, ATRAB11D, ATRABA5E, RABA5E | 1520207 | 1522172 | 1965 | − | RAA5E_ARATH Ras-related protein RABA5e OS = | RAB GTPase homolog A5E |
| BN | hqBND06_ 182732_E | Gh_D06G0024 | GSO1 | 183401 | 184660 | 1259 | − | RLP12_ARATH Receptor-like protein 12 OS = | Leucine-rich repeat transmembrane protein kinase |
| BN | hqBND13_ 8663474_C | Gh_D13G0621 | EIF4A-III | 8654254 | 8657278 | 3024 | − | RH2_ORYSJ DEAD-box ATP-dependent RNA helicase 2 OS = | Eukaryotic initiation factor 4A-III |
| BW | qBWA08_ 931754_E | Gh_A08G0110 | – | 925654 | 939192 | 13538 | − | KC1_TOXGO Casein kinase I OS = | Protein kinase family protein |
| BW | hqBWD08_ 17717836_C | Gh_D08G0894 | AIL5, CHO1, EMK | 17697130 | 17699976 | 2846 | − | AIL5_ARATH AP2-like ethylene-responsive transcription factor AIL5 OS = | AINTEGUMENTA-like 5 |
| FE | hqFEA10_ 64287322_D | Gh_A10G1233 | – | 64261579 | 64264117 | 2538 | − | FRS3_ARATH Protein FAR1-RELATED SEQUENCE 3 OS = | Far-red impaired responsive (FAR1) family protein |
| FE | hqFEA11_ 18660123_E | Gh_A11G1405 | – | 18663333 | 18667100 | 3767 | − | 0 | Protein of unknown function (DUF1664) |
| FE | hqFED02_ 37337297_E | Gh_D02G1201 | FHY3 | 37310027 | 37311060 | 1033 | − | FHY3_ARATH Protein FAR-RED ELONGATED HYPOCOTYL 3 OS = | Far-red elongated hypocotyls 3 |
| FE | hqFED09_ 42469217_C | Gh_D09G1491 | ATRAB8, RAB8 | 42459487 | 42472061 | 12574 | + | RAE1A_ARATH Ras-related protein RABE1a OS = | RAB GTPase homolog 8 |
| FE | hqFED10_ 23461795_E | Gh_D10G1283 | ATX2 | 23432267 | 23444107 | 11840 | − | ATX2_ARATH Histone-lysine | Trithorax-like protein 2 |
| FE | hqFED13_ 49321521_D | Gh_D13G1618 | ATGSTU19, GST8, GSTU19 | 49327339 | 49328744 | 1405 | + | GSTX4_TOBAC Probable glutathione | Glutathione S-transferase TAU 19 |
| FL | hqFLA11_ 45333662_A | Gh_A11G1858 | OST1, SNRK2-6, SRK2E, SNRK2.6, P44, ATOST1 | 45306099 | 45308154 | 2055 | + | SAPK1_ORYSJ Serine/threonine-protein kinase SAPK1 OS = | Protein kinase superfamily protein |
| FS | hqFSA01_ 13635803_E | Gh_A01G0714 | FLA1 | 13649630 | 13651601 | 1971 | + | FLA1_ARATH Fasciclin-like arabinogalactan protein 1 OS = | FASCICLIN-like arabinogalactan 1 |
| FS | hqFSA01_ 83601604_A | Gh_A01G1348 | – | 83638071 | 83646061 | 7990 | − | YAB4_ARATH Axial regulator YABBY 4 OS = | Plant-specific transcription factor YABBY family protein |
| FS | hqFSA04_ 48361719_A | Gh_A04G0705 | ATPHAN, AS1, ATMYB91, MYB91 | 48314343 | 48315413 | 1070 | − | AS1_ARATH Transcription factor AS1 OS = | Myb-like HTH transcriptional regulator family protein |
| FS | hqFSA05_ 30254202_A | Gh_A05G2423 | AtPP2-A12, PP2-A12 | 30397232 | 30398615 | 1383 | − | P2A12_ARATH F-box protein PP2-A12 OS = | Phloem protein 2-A12 |
| FS | hqFSD05_ 61238929_A | Gh_D05G3669 | TIM50, emb1860 | 61253153 | 61256280 | 3127 | − | TIM50_ARATH Mitochondrial import inner membrane translocase subunit TIM50 OS = | Haloacid dehalogenase-like hydrolase (HAD) superfamily protein |
| FS | hqFSD12_ 51753728_C | Gh_D12G1895 | – | 51753693 | 51760260 | 6567 | + | MORC4_MOUSE MORC family CW-type zinc finger protein 4 OS = | Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family protein |
| FU | hqFUA01_ 7658880_E | Gh_A01G0481 | TT7, CYP75B1, D501 | 7653246 | 7654296 | 1050 | − | C71A3_SOLME Cytochrome P450 71A3 (Fragment) OS = | Cytochrome P450 superfamily protein |
| FU | hqFUA07_ 42751600_D | Gh_A07G1461 | PKT3, PED1, KAT2 | 42762126 | 42764782 | 2656 | + | THIK2_ARATH 3-ketoacyl-CoA thiolase 2, peroxisomal OS = | Peroxisomal 3-ketoacyl-CoA thiolase 3 |
| FU | hqFUA11_ 4557423_C | Gh_A11G0474 | OST1, SNRK2-6, SRK2E, SNRK2.6, P44, ATOST1 | 4550580 | 4552694 | 2114 | + | SAPK2_ORYSI Serine/threonine-protein kinase SAPK2 OS = | Protein kinase superfamily protein |
| FU | hqFUA11_ 4557423_C | Gh_A11G0475 | AtRABA4a, RABA4a | 4556537 | 4558130 | 1593 | − | RB11A_LOTJA Ras-related protein Rab11A OS = | RAB GTPase homolog A4A |
| FU | hqFUA12_ 3005239_D | Gh_A12G0200 | ATPEN2, PEN2 | 2997737 | 3013352 | 15615 | − | RD21A_ARATH Cysteine proteinase RD21a OS = | PTEN 2 |
| FU | hqFUA13_ 35371010_E | Gh_A13G0792 | – | 35286057 | 35287535 | 1478 | − | FDL1_ARATH F-box/FBD/LRR-repeat protein At1g13570 OS = | F-box/RNI-like superfamily protein |
| FU | hqFUD07_ 30035840_E | Gh_D07G1581 | – | 30009941 | 30010888 | 947 | + | 0 | 0 |
| FU | hqFUD11_ 47125178_D | Gh_D11G2391 | ATO | 47096352 | 47101439 | 5087 | − | SF3A3_HUMAN Splicing factor 3A subunit 3 OS = | Splicing factor-related |
| FUI | hqFUIA01_ 38777600_E | Gh_A01G1066 | – | 38765672 | 38767468 | 1796 | − | 0 | Protein of unknown function (DUF668) |
| FUI | hqFUIA03_ 6721843_D | Gh_A03G0366 | SCL14, ATGRAS2, GRAS2 | 6718533 | 6720452 | 1919 | + | SCL33_ARATH Scarecrow-like protein 33 OS = | SCARECROW-like 14 |
| FUI | qFUIA04_ 2436958_A | Gh_A04G0150 | – | 2429276 | 2432309 | 3033 | − | Y5162_ARATH Uncharacterized protein At5g41620 OS = | 0 |
| FUI | hqFUIA07_ 72361363_C | Gh_A07G1773 | – | 72335561 | 72338459 | 2898 | − | ACOT9_MOUSE Acyl-coenzyme A thioesterase 9, mitochondrial OS = | Thioesterase/thiol ester dehydrase-isomerase superfamily protein |
| FUI | hqFUIA09_ 4569432_E | Gh_A09G0172 | – | 4552893 | 4556944 | 4051 | − | PUR6_VIGAC Phosphoribosylaminoimidazole carboxylase, chloroplastic (Fragment) OS = | Phosphoribosylaminoimidazole carboxylase, putative/AIR carboxylase, putative |
| FUI | qFUIA12_ 86958433_A | Gh_A12G2444 | – | 86955300 | 86955536 | 236 | + | 0 | 0 |
| FUI | qFUIA12_ 86958433_A | Gh_A12G2445 | KT2, ATKT2, SHY3, KUP2, ATKUP2, TRK2 | 86956856 | 86960376 | 3520 | − | POT2_ARATH Potassium transporter 2 OS = | Potassium transporter 2 |
| FUI | qFUID01_ 3569307_D | Gh_D01G0333 | – | 3561218 | 3590579 | 29361 | + | DRL28_ARATH Probable disease resistance protein At4g27220 OS = | NB-ARC domain-containing disease resistance protein |
| FUI | hqFUID06_ 20424401_A | Gh_D06G0982 | TIP4;1 | 20390073 | 20393472 | 3399 | − | TIP41_ARATH Aquaporin TIP4-1 OS = | Tonoplast intrinsic protein 4;1 |
| LP | qLPA02_ 75700481_D | Gh_A02G0517 | AAE7, ACN1 | 7668368 | 7749288 | 80920 | − | AEE7_ARATH Acetate/butyrate–CoA ligase AAE7, peroxisomal OS = | Acyl-activating enzyme 7 |
| LP | qLPA02_ 75700481_D | Gh_A02G1302 | – | 75725680 | 75725946 | 266 | − | 5GT_VERHY Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase OS = | UDP-glucosyltransferase 75B1 |
| LP | hqLPA05_ 3272852_E | Gh_A05G0285 | GPA1, GP ALPHA 1, ATGPA1 | 3267161 | 3270331 | 3170 | + | GPA1_LUPLU Guanine nucleotide-binding protein alpha-1 subunit OS = | G protein alpha subunit 1 |
| LP | hqLPA05_ 3272852_E | Gh_A05G0286 | – | 3273106 | 3273747 | 641 | + | 0 | 0 |
| LP | hqLPA13_ 13637771_E | Gh_A13G0580 | – | 13628944 | 13630380 | 1436 | − | FBK8_ARATH F-box/kelch-repeat protein At1g22040 OS = | Galactose oxidase/kelch repeat superfamily protein |
| LP | hqLPA13_ 35727868_C | Gh_A13G0793 | – | 35719907 | 35724367 | 4460 | − | UCKC_DICDI Uridine-cytidine kinase C OS = | Phosphoribulokinase/Uridine kinase family |
| LP | hqLPD01_ 5332116_D | Gh_D01G0448 | CUC2, ANAC098, ATCUC2 | 5343860 | 5345867 | 2007 | + | NAC98_ARATH Protein CUP-SHAPED COTYLEDON 2 OS = | NAC (No Apical Meristem) domain transcriptional regulator superfamily protein |
| LP | hqLPD03_ 35830462_A | Gh_D03G1066 | – | 35823773 | 35825722 | 1949 | − | 0 | 0 |
| LP | hqLPD03_ 35830462_A | Gh_D03G1067 | – | 35834718 | 35835578 | 860 | + | BICC1_MOUSE Protein bicaudal C homolog 1 OS = | Sterile alpha motif (SAM) domain-containing protein |
| LP | hqLPD07_ 26600866_E | Gh_D07G1500 | – | 26616541 | 26618053 | 1512 | + | SETH3_ARATH Probable arabinose 5-phosphate isomerase OS = | Sugar isomerase (SIS) family protein |
| LP | hqLPD07_ 31568560_AD | Gh_D07G1617 | – | 31577220 | 31577585 | 365 | − | 0 | 0 |
| LP | qLPD10_ 10866370_ E | Gh_D10G0861 | PIP3 | 10850015 | 10851179 | 1164 | − | PIP27_ARATH Aquaporin PIP2-7 OS = | Plasma membrane intrinsic protein 3 |
| LP | hqLPD12_ 43066746_CDE | Gh_D12G1396 | – | 43057670 | 43058647 | 977 | + | 0 | Protein of unknown function (DUF506) |
| LP | hqLPD12_ 44140224_E | Gh_D12G1432 | – | 44129272 | 44129901 | 629 | − | 0 | 0 |
| MIC | hqMICA02_ 7300875_C | Gh_A02G0495 | – | 7292521 | 7294588 | 2067 | + | P2C34_ARATH Probable protein phosphatase 2C 34 OS = | Protein phosphatase 2C family protein |
| MIC | hqMICA03_ 5930639_D | Gh_A03G0332 | HTB4 | 5931201 | 5931644 | 443 | + | H2B_GOSHI Histone H2B OS = | Histone superfamily protein |
| MIC | hqMICA03_ 95492151_D | Gh_A03G1505 | CPK6, ATCDPK3, ATCPK6 | 95507304 | 95510335 | 3031 | − | CDPK4_SOLTU Calcium-dependent protein kinase 4 OS = | Calcium-dependent protein kinase family protein |
| MIC | hqMICA07_ 16344257_D | Gh_A07G0911 | TOC75-III, MAR1 | 16336770 | 16340890 | 4120 | − | TC753_ARATH Protein TOC75-3, chloroplastic OS = | Translocon at the outer envelope membrane of chloroplasts 75-III |
| MIC | hqMICD02_ 57281334_A | Gh_D02G1668 | – | 57265701 | 57267586 | 1885 | − | RZP23_ORYSJ Serine/arginine-rich splicing factor RSZ23 OS = | Serine/arginine-rich 22 |
| MIC | hqMICD05_ 8772003_D | Gh_D05G1039 | MMT | 8767120 | 8777841 | 10721 | + | MMT1_ARATH Methionine | Methionine |
| MIC | hqMICD06_ 37879191_D | Gh_D06G1287 | – | 37856300 | 37859354 | 3054 | − | CX5B2_ARATH Cytochrome c oxidase subunit 5b-2, mitochondrial OS = | Rubredoxin-like superfamily protein |
| MIC | hqMICD09_ 43629201_C | Gh_D09G1604 | PDE318 | 43620944 | 43628324 | 7380 | − | NOG1_MOUSE Nucleolar GTP-binding protein 1 OS = | P-loop containing nucleoside triphosphate hydrolases superfamily protein |
| MIC | hqMICD09_ 43679359_C | Gh_D09G1610 | – | 43674575 | 43681355 | 6780 | − | 0 | BAH domain;TFIIS helical bundle-like domain |
| MIC | qMICD11_ 59408001_D | Gh_D11G2909 | – | 59407946 | 59409678 | 1732 | − | 0 | Uncharacterised conserved protein (UCP012943) |
| MIC | hqMICD12_ 45767769_E | Gh_D12G1518 | HMGA | 45767190 | 45767829 | 639 | + | HMGYA_SOYBN HMG-Y-related protein A OS = | High mobility group A |
| MIC | hqMICD13_ 39150324_D | Gh_D13G1274 | – | 39197782 | 39198346 | 564 | + | GOT1A_BOVIN Vesicle transport protein GOT1A OS = | Got1/Sft2-like vescicle transport protein family |
| PH | hqPHA11_ 88756735_C | Gh_A11G2657 | – | 88760230 | 88762677 | 2447 | − | PP348_ARATH Pentatricopeptide repeat-containing protein At4g33990 OS = | Pentatricopeptide repeat (PPR) superfamily protein |
| PH | qPHD10_8389255_D | Gh_D10G0727 | – | 8392851 | 8396066 | 3215 | − | 0 | SNARE-like superfamily protein |
| PH | hqPHD13_ 39108563_A | Gh_D13G1273 | SUC2, SUT1, ATSUC2 | 39078454 | 39081595 | 3141 | − | SUC2_ARATH Sucrose transport protein SUC2 OS = | Sucrose-proton symporter 2 |
FIGURE 10(A) Summary of GWAS results for Fiber micronaire (MIC) including Manhattan plots, QQ plots, violin plots displaying differences for MIC among two haplotypes of SNP/hQTN D09_43629201 in trait phenotype and five heterosis types. (B) Regional Manhattan plots showing presence of hQTN D09_43629201 in trait phenotype and five types of heterosis further narrowing down to genomic location of gene hqMICD09_43629201_C on chromosome D09, sun plot displaying the p-values of variables harboring hqMICD09_43629201_C and gene Gh_D09G1604, Expression levels of representative gene associated with MIC during different growth stages and Haplotype region (55 kb) surrounding the peak on chromosome D09 associated with MIC.