| Literature DB >> 32194631 |
Kai Che1, Xi Chen1, Maozu Guo1,2,3, Chunyu Wang1, Xiaoyan Liu1.
Abstract
Identification of genetic variants associated with complex traits is a critical step for improving plant resistance and breeding. Although the majority of existing methods for variants detection have good predictive performance in the average case, they can not precisely identify the variants present in a small number of target genes. In this paper, we propose a weighted sparse group lasso (WSGL) method to select both common and low-frequency variants in groups. Under the biologically realistic assumption that complex traits are influenced by a few single loci in a small number of genes, our method involves a sparse group lasso approach to simultaneously select associated groups along with the loci within each group. To increase the probability of selecting out low-frequency variants, biological prior information is introduced in the model by re-weighting lasso regularization based on weights calculated from input data. Experimental results from both simulation and real data of single nucleotide polymorphisms (SNPs) associated with Arabidopsis flowering traits demonstrate the superiority of WSGL over other competitive approaches for genetic variants detection.Entities:
Keywords: genetic variants; genome-wide association studies; minimum allele frequency; single nucleotide polymorphisms; sparse group lasso
Year: 2020 PMID: 32194631 PMCID: PMC7063084 DOI: 10.3389/fgene.2020.00155
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Parameter estimation for weighted sparse group lasso.
| Input: Genotype |
|---|
|
|
| 1: calculate |
| 2: |
| 3: |
| 4: |
| 5: update |
| 6: return |
Figure 1Precision-recall (PR) curves of WSGL and the other methods.
Figure 2Precision-recall (PR) curves of WSGL and the other methods for varying λ.
Description of the 10 flowering related phenotypes in A.thaliana in real data application.
| Phenotype | Accessions | Phenotype description | Growths conditions | Phenotype scoring |
|---|---|---|---|---|
| LD | 167 | Days to flowering time (FT) under Long Day (LD) and Short Days (SD) +/− vernalization | 18°C 16-h daylight | Number of days following stratification to opening of the first flower. The experiment was stopped at 200d, and accessions that had not flowered at the point were assigned a value of 200. |
| LDV | 168 | 18°C 16-h daylight, vernalized (5wks 4) | ||
| SD | 162 | 18°C 16-h daylight | ||
| SDV | 159 | 18°C 16-h daylight, vernalized (5wks 4) | ||
| FT10 | 194 | 10°C 16-h daylight | ||
| FT16 | 193 | 16°C 17-h daylight | Plants were checked bi-weekly for presence of first buds, and the average flowering time and average leaf number of four plants of the same accession at each temperature were collected. | |
| FT22 | 193 | Flowering time (FT) and leaf number at flowering time (LN) | 22 °C 18-h daylight | |
| LN10 | 177 | 10 °C 19-h daylight | ||
| LN16 | 176 | 16°C 20-h daylight | ||
| LN22 | 176 | 22°C 21-h daylight |
Summary of four methods associations found in real data.
| Phenotype | Method | Number of genes covered by top 100 SNPs | Number of genes in the 19 genes | Ratio of candidate genes |
|---|---|---|---|---|
| FT10 | Lasso | 86 | 4 | 4.65% |
| GL | 66 | 6 | 9.09% | |
| SGL | 78 | 4 | 5.13% | |
| WSGL | 26 | 6 | 23.08% | |
| FT16 | Lasso | 76 | 8 | 10.53% |
| GL | 62 | 8 | 12.9% | |
| SGL | 64 | 7 | 10.94% | |
| WSGL | 67 | 10 | 14.93% | |
| FT22 | Lasso | 78 | 7 | 8.79% |
| GL | 72 | 7 | 9.72% | |
| SGL | 77 | 6 | 7.79% | |
| WSGL | 71 | 9 | 12.68% | |
| LD | Lasso | 81 | 9 | 11.11% |
| GL | 67 | 9 | 13.43% | |
| SGL | 73 | 11 | 15.07% | |
| WSGL | 74 | 12 | 16.22% | |
| LDV | Lasso | 6 | 6 | – |
| GL | 6 | 6 | – | |
| SGL | 6 | 6 | – | |
| WSGL | 6 | 6 | – | |
| SD | Lasso | 78 | 5 | 6.41% |
| GL | 70 | 5 | 7.14% | |
| SGL | 79 | 6 | 7.59% | |
| WSGL | 77 | 6 | 7.79% | |
| SDV | Lasso | 84 | 1 | 1.19% |
| GL | 66 | 1 | 1.52% | |
| SGL | 78 | 2 | 2.56% | |
| WSGL | 72 | 2 | 2.78% | |
| LN10 | Lasso | 6 | 6 | – |
| GL | 6 | 6 | – | |
| SGL | 6 | 6 | – | |
| WSGL | 6 | 6 | – | |
| LN16 | Lasso | 6 | 6 | – |
| GL | 6 | 6 | – | |
| SGL | 6 | 6 | – | |
| WSGL | 6 | 6 | – | |
| LN22 | Lasso | 6 | 6 | – |
| GL | 6 | 6 | – | |
| SGL | 6 | 6 | – | |
| WSGL | 6 | 6 | – |