| Literature DB >> 27582745 |
Peng Zhang1, Kaizhen Zhong1, Muhammad Qasim Shahid2, Hanhua Tong1.
Abstract
Association analysis based on linkage disequilibrium (LD) is an efficient way to dissect complex traits and to identify gene functions in rice. Although association analysis is an effective way to construct fine maps for quantitative traits, there are a few issues which need to be addressed. In this review, we will first summarize type, structure, and LD level of populations used for association analysis of rice, and then discuss the genotyping methods and statistical approaches used for association analysis in rice. Moreover, we will review current shortcomings and benefits of association analysis as well as specific types of future research to overcome these shortcomings. Furthermore, we will analyze the reasons for the underutilization of the results within association analysis in rice breeding.Entities:
Keywords: Oryza sativa; association analysis; genotyping; linkage disequilibrium; marker density; phenotyping; population structure
Year: 2016 PMID: 27582745 PMCID: PMC4987372 DOI: 10.3389/fpls.2016.01202
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary of association analysis in rice.
| CGAS | 105 | Landraces | – | – | Five sequencing markers based on | Nucleotide diversity measure | Glutinous traits | Olsen and Purugganan, |
| GWAS | 218 | Lines from US and Asia | – | <10−3 | 66 SSRs and 114 RFLPs | Discriminant analysis | 12 agronomic traits | Zhang N. et al., |
| GWAS | 103 | USDA germplasm | 7 | <5 × 10−2 | 123 SSRs | MLM | 5 yield traits | Agrama et al., |
| GWAS | 90 | USDA mini-core collection | 3 | <10−3 | 108 SSRs and 1 InDel | MLM | Stigma and spikelet characteristics | Yan et al., |
| GWAS | 170 | Diverse landraces and cultivars | 2 | <10−4 | 126 SSRs and 6 InDels | MLM | 6 morphological traits of Cheng's index and 3 agronomic traits | Wen et al., |
| GWAS | 84 | Landraces | 4 | <5 × 10−2 | 24 SSRs | MLM | Amino acid contents | Zhao et al., |
| CGAS | 70 | Diverse varieties | 2 | <5 × 10−2 | Starch synthesis related genes in rice | MLM | Amylose content, gel consistency, and gelatinization temperature | Tian et al., |
| GWAS | 293 | Landraces and cultivars | – | <10−3 | 179 RFLPs | Elliptic Fourier analysis | Grain shape | Iwata et al., |
| GWAS | 416 | Landraces, cultivars, and breeding lines | 7 | <10−3 | 100 SSRs | GLM+Q | Starch quality traits | Jin et al., |
| GWAS | 192 | Elite lines and varieties | – | <5 × 10−2 | 97 SSRs | MLM | Apparent amylose content, heading date, and head rice | Ordonez et al., |
| GWAS | 517 | Landraces | 2 | <10−7 | 3,600,000 SNPs | LR–Q and MLM | 14 agronomic traits | Huang et al., |
| CGAS | 303 | Cultivars | 2 | <5 × 10−2 | 24 SSRs | LR+Q | Awn | Hu et al., |
| CGAS | 118 | Glutinous accessions | 2 | <5 × 10−2 | 43 gene-specific molecular markers based on 17 starch synthesis-related genes | MLM | Rapid visco analyzer profile parameters | Yan et al., |
| GWAS | 416 | Landraces, cultivars, and breeding lines | 7 | <5 × 10−2 | 100 SSRs | GLM+Q and MLM | Grain color, phenolic content, flavonoid content, and antioxidant capacity | Shao et al., |
| GWAS | 217 | USDA mini-core collection | 5 | <6.45 × 10−4 | 154 SSRs and 1 InDel | MLM | 14 agronomic traits | Li X. et al., |
| GWAS | 174 | USDA mini-core collection | 5 | <10−2 | 156 SSRs, 2 InDels and 6 SNPs | GLM+Q and MLM | Silica concentration in hull | Bryant et al., |
| GWAS | 180 | European Rice Core collection | 3 | <10−2 | 124 SNPs and 52 SSRs | GLM+Q, MLM–Q and MLM | Salt tolerant | Ahmadi et al., |
| CGAS | 346 | Cultivated and wild rice | 6 | <5 × 10−2 | 2 sequencing markers based on | GLM+Q and nested clade analysis | Starch quality | Yu et al., |
| GWAS | 383 | Diverse landraces and cultivars | 5 | <10−4 | 44,000 SNPs | LR+Q, principle component analysis, and GLM+Q | Relative root growth in aluminum toxic | Famoso et al., |
| GWAS | 413 | Diverse landraces and cultivars | 5 | <10−4 | 44,100 SNPs | SLM−Q, LR−Q and MLM | 34 traits including agronomic, quality, and biotic stress | Zhao et al., |
| GWAS | 203 | USDA mini-core collection | 5 | <6.45 × 10−4 | 154 SSRs and 1 InDel | MLM | 14 agronomic traits | Li et al., |
| GWAS | 217 | USDA mini-core collection | 5 | <5 × 10−2 | 154 SSRs and 1 InDel | MLM | Sheath blight resistance | Jia et al., |
| CGAS | 104 | Diverse Landraces and cultivars | 3 | <5 × 10−2 | 8 sequencing markers based on | GLM+Q | Plant height, heading date, and spikelets per panicle | Lu et al., |
| GWAS | 950 | Worldwide varieties | 5 | <10−7 | 4,109,366 SNPs | LR–Q and MLM | Flowering time and with 10 grain-related traits | Huang et al., |
| GWAS | 167 | 6 | <5 × 10−4 | 9727 DArT markers and 6717 SNPs | GLM+Q and MLM | Root traits | Courtois et al., | |
| GWAS | 50 | Waxy rice accessions | 2 | 455 AFLPs and ISSR | MLM | Starch physicochemical properties | Xu et al., | |
| GWAS | 529 | Landraces and elite varieties | 2 | <5 × 10−2 | 6,400,000 SNPs | LR and LMM | 840 metabolic traits | Chen et al., |
| GWAS | 150 | Landraces | 2 | 274 SSRs | GLM and MLM | 12 agronomic traits | Zhang et al., | |
| GWAS | 366 | – | <5 × 10−2 | 800,000 SNPs | EMMAX | Blast resistance for 16 strains | Wang C. et al., | |
| GWAS | 529 | Landraces and elite varieties | – | <10−8 | 4,358,600 SNPs | FaST-LMM | 13 traditional agronomic traits and 2 newly defined traits during the rice growth period | Yang W. et al., |
| GWAS | 126 | High-yielding or primary ancestral cultivars | 4 | <10−4 | 1152 SNPs | MLM | 6 yield traits | Yonemaru et al., |
| GWAS | 270 | Landraces | 2 | <5 × 10−4 | 241 DArT markers and 25,971 SNPs 262 SSRs | MLM | Flowering time | Phung et al., |
| GWAS | 540 | Landraces | 7 | <10−2 | 262 SSRs | GLM | Seed vigor (root length, shoot length, and shoot dry weight) | Dang et al., |
| GWAS | 100 | Landraces and cultivars | 3 | <10−2 | 81 molecular markers | MLM | 15 morphological traits | Jahani et al., |
| GWAS | 300 | Cultivars | 4 | <10−4 | 369,000 SNPs | MLM | Grain concentrations of arsenic, copper, molybdenum, and zinc | Norton et al., |
| GWAS | 220 | Landraces and cultivars | 3 | <10−5 | 4929 SNPs | CMLM | Salinity tolerance | Kumar et al., |
| GWAS | 328 | Cultivars | 5 | <10−4 | 30,000 SNPs | MLM | Ozone tolerance | Ueda et al., |
| GWAS | 363 | Elite breeding lines | 4 | <5 × 10−6 | 71,170 SNPs | LMM | 19 agronomic traits | Begum et al., |
| GWAS | 95 | Landraces and cultivars | 7 | <5 × 10−2 | 263 SSRs | GLM | Grain-filling rate | Liu E. et al., |
| GWAS | 175 | Japanese rice collection | – | <10−5 | 3168 SNPs | EMMA | Metabolites | Matsuda et al., |
| GWAS | 1495 | Elite hybrid rice varieties | 2 | <10−6 | 1,654,030 SNPs | EMMAX | 38 agronomic traits | Huang et al., |
| CGAS | 529 | Chinese core collection and world core collection | 5 | <10−3 | 41 CCT genes based on | GLM+Q | Heading date | Zhang et al., |
| GWAS | 176 | Japanese | 0 | <10−5 | 43,323 SNPs | GLM+K | Agronomic traits | Yano et al., |
GWAS, Genome-wide association study; CGAS, Candidate-gene association study.
+, considering population structure; −, not considering population structure; GLM, generalized linear model; LR, logistic regression; MLM, mixed linear model (Q+K model); SLM, simple linear model; LMM, linear mixed model. Q, population structure; EMMAX, efficient mixed-model association eXpedited; FaST-LMM, factored spectrally transformed linear mixed model; CMLM, compressed mixed linear model; EMMA, efficient mixed-model association.
Figure 1Schematic representation of various steps involved in association analysis (AA). NAM, nested association mapping. MAGIC, multi-parent advanced generation intercross. RFLP, restriction fragment length polymorphism. SSR, simple sequence repeat. InDel, insertion-deletion. SNP, single nucleotide polymorphisms. DArT, diversity array technology. CNVs, copy number variations. PAVs, presence and absence variations. ISBPs, insertion-site-based polymorphisms. PCA, principal component analysis. PCoA, principal coordinate analysis. LAP, laplacian eigenfunctions. LR, logistic regression. GLM, generalized linear model. MLM, mixed linear model. CMLM, compressed mixed linear model. FaST-LMM, factored spectrally transformed linear mixed model. EMMA, efficient mixed model association. EMMAX, EMMA eXpedited. MLMM, multi-locus mixed model. MTMM, multi-trait mixed model. A-D, Anderson-Darling test. SUPER, settlement of MLM under progressively exclusive relationship. G, genotype. E, environment. JLAM, joint linkage association mapping. GWAS, genome wide association study. GS, genomic selection. Plus sign (Red) represents the performance which should be improved in future rice association analysis.