| Literature DB >> 28511641 |
Jie Ren1, Tao He2, Ye Li3, Sai Liu4, Yinhao Du1, Yu Jiang5, Cen Wu6.
Abstract
BACKGROUND: Over the past decades, the prevalence of type 2 diabetes mellitus (T2D) has been steadily increasing around the world. Despite large efforts devoted to better understand the genetic basis of the disease, the identified susceptibility loci can only account for a small portion of the T2D heritability. Some of the existing approaches proposed for the high dimensional genetic data from the T2D case-control study are limited by analyzing a few number of SNPs at a time from a large pool of SNPs, by ignoring the correlations among SNPs and by adopting inefficient selection techniques.Entities:
Keywords: Case–control association study; Network-based regularization; Regularized logistic regression; Type 2 diabetes; Variable selection
Mesh:
Year: 2017 PMID: 28511641 PMCID: PMC5434559 DOI: 10.1186/s12863-017-0495-5
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Simulation for SNP data: mean(sd) of true positives (TP) and false positives (FP) based on 100 replicates. Upper panel: (n, p) = (500, 750); Lower panel: (n, p) = (1000, 1500)
| A1 | A2 | A3 | A4 | ||
|---|---|---|---|---|---|
| AR | TP | 41.04(7.07) | 39.28(7.40) | 61.14(2.86) | 53.30(4.23) |
| AR | TP | 64.86(6.89) | 41.46(8.62) | 64.66(2.89) | 57.22(3.96) |
| AR | TP | 74.98(0.14) | 31.22(4.57) | 64.18(2.68) | 51.98(3.11) |
| Block | TP | 47.70(8.62) | 45.82(8.92) | 62.70(2.75) | 55.70(4.20) |
| Block | TP | 67.92(6.35) | 39.62(7.05) | 65.30(3.23) | 57.00(3.68) |
| Block | TP | 72.06(4.16) | 30.28(6.08) | 64.08(2.29) | 50.94(3.44) |
| AR | TP | 94.20(12.15) | 94.16(12.28) | 126.22(4.55) | 113.78(6.27) |
| AR | TP | 142.68(3.61) | 84.28(10.71) | 130.12(3.79) | 116.66(4.48) |
| AR | TP | 149.96(0.20) | 64.40(9.54) | 124.06(3.05) | 98.96(4.46) |
| Block | TP | 101.22(12.37) | 98.96(13.55) | 132.46(3.82) | 121.84(4.57) |
| Block | TP | 145.68(5.93) | 75.84(9.18) | 129.94(3.13) | 114.92(4.25) |
| Block | TP | 147.40(8.42) | 56.32(9.93) | 120.78(4.10) | 92.62(6.34) |
Fig. 1ROC curves for SNP data. ROC curves corresponding to Table 1
Simulation for Gene expression data: mean(sd) of true positives (TP) and false positives (FP) based on 100 replicates. Upper panel: (n, p) = (500, 750); Lower panel: (n, p) = (1000,1500)
| A1 | A2 | A3 | A4 | ||
|---|---|---|---|---|---|
| AR | TP | 43.50(8.64) | 40.48(8.48) | 61.46(2.92) | 53.50(4.59) |
| AR | TP | 68.74(9.23) | 38.56(7.04) | 64.46(2.56) | 55.54(3.69) |
| AR | TP | 74.34(2.00) | 27.68(5.58) | 65.30(1.62) | 45.50(3.11) |
| Block | TP | 44.92(8.75) | 42.92(7.96) | 64.20(2.91) | 56.82(3.70) |
| Block | TP | 72.72(4.01) | 38.94(6.86) | 65.36(2.84) | 56.88(3.54) |
| Block | TP | 70.12(4.29) | 25.24(4.38) | 64.62(2.58) | 43.50(3.22) |
| AR | TP | 88.86(15.09) | 86.72(15.32) | 126.16(4.51) | 113.38(5.89) |
| AR | TP | 146.14(2.65) | 81.68(12.44) | 129.88(3.27) | 115.14(4.93) |
| AR | TP | 149.42(2.26) | 52.64(7.78) | 122.50(4.03) | 82.42(5.24) |
| Block | TP | 91.70(12.04) | 88.92(12.70) | 131.92(3.26) | 120.76(4.32) |
| Block | TP | 148.38(4.95) | 74.02(10.59) | 127.70(3.70) | 110.94(4.60) |
| Block | TP | 145.10(4.85) | 45.60(7.37) | 117.90(3.71) | 73.56(4.73) |
Fig. 2ROC curves for Gene Expression data. ROC curves corresponding to Table 2
Simulation for (n, p) = (500, 1500): mean(sd) of true positives (TP) and false positives (FP) based on 100 replicates. Upper panel: SNP data; Lower panel: gene expression data
| A1 | A2 | A3 | A4 | ||
|---|---|---|---|---|---|
| AR | TP | 48.52(12.31) | 37.57(12.59) | 57.55(14.03) | 46.60(14.58) |
| AR | TP | 126.91(10.98) | 59.37(10.48) | 92.57(6.94) | 78.83(6.70) |
| AR | TP | 148.43(9.83) | 47.82(12.45) | 105.29(5.34) | 80.05(4.81) |
| Block | TP | 67.16(12.67) | 52.23(11.89) | 81.51(7.92) | 70.51(7.25) |
| Block | TP | 146.63(7.27) | 57.24(13.70) | 105.11(4.96) | 89.64(4.46) |
| Block | TP | 143.11(5.33) | 43.92(10.42) | 105.69(5.16) | 82.12(5.11) |
| AR | TP | 47.41(10.56) | 45.27(11.08) | 61.12(12.18) | 49.37(13.06) |
| AR | TP | 137.85(9.22) | 50.61(9.79) | 96.19(6.07) | 81.23(5.95) |
| AR | TP | 148.80(3.65) | 38.57(9.09) | 107.95(4.73) | 70.91(4.23) |
| Block | TP | 60.37(11.61) | 53.43(12.30) | 86.67(6.85) | 74.79(5.96) |
| Block | TP | 140.67(14.14) | 53.89(13.26) | 104.83(4.67) | 86.65(4.25) |
| Block | TP | 145.12(5.00) | 33.07(7.61) | 106.67(5.81) | 64.76(4.71) |
Fig. 3ROC curves for (n, p) = (500,1500). ROC curves corresponding to Table 3
Fig. 4Subnetworks of DAAM1 (upper panel) and CASP9 (lower panel). SNPs connected in the network are joined with edges
Fig. 5Subnetworks of DAAM1 (upper panel) and CASP9 (lower panel). Gene names are corresponding to the SNP id in Fig. 4