Saptarshi Bej, Jit Sarkar, Saikat Biswas, Pabitra Mitra, Partha Chakrabarti, Olaf Wolkenhauer.
Abstract
BACKGROUND: Studies on Type-2 Diabetes Mellitus (T2DM) have revealed sub-populations that are heterogeneous in their underlying pathologies. However, the identification of such sub-populations in epidemiological datasets remains unexplored. Here we focus on detecting T2DM clusters in epidemiological data, specifically analysing the National Family Health Survey-4 (NFHS-4) dataset from India, which contains a wide spectrum of features, including medical history, dietary and addiction habits, and socio-economic and lifestyle patterns, for 10,125 T2DM patients.
Year: 2022 PMID: 35624098 PMCID: PMC9142500 DOI: 10.1038/s41387-022-00206-2
Source DB: PubMed Journal: Nutr Diabetes ISSN: 2044-4052 Impact factor: 4.725
Fig. 1 Workflow describing the analysis of the T2DM-NFHS-4 dataset.
A visualisation of the novel feature type-wise clustering paradigm used in the study, which accounts for the bias of UMAP towards over-representation of continuous variables during dimension reduction.
Fig. 2 Low-dimensional UMAP visualisations of the data for several data types.
a UMAP clusters for all the features with the Euclidean metric. b UMAP clusters for continuous features with the Euclidean metric. c UMAP clusters for ordinal features with the Canberra metric. d UMAP clusters for nominal features with the Hamming metric.
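The three metrics named in the panels above can be illustrated with a minimal sketch. The function names, the 0 to 3 coding of the ordinal intake frequencies, and the example records are assumptions made for illustration only; they are not taken from the study. The Hamming distance is written in its normalised form (the fraction of positions that differ).

```python
import math


def euclidean(u, v):
    """Straight-line distance, suited to continuous features."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def canberra(u, v):
    """Sum of |a - b| / (|a| + |b|); terms with a zero denominator count as 0."""
    total = 0.0
    for a, b in zip(u, v):
        denom = abs(a) + abs(b)
        if denom:
            total += abs(a - b) / denom
    return total


def hamming(u, v):
    """Fraction of positions where two records disagree, suited to nominal features."""
    return sum(a != b for a, b in zip(u, v)) / len(u)


# Hypothetical ordinal coding: 0 = never, 1 = occasionally, 2 = weekly, 3 = daily.
ordinal_a = [0, 3, 2]
ordinal_b = [1, 3, 0]
print(canberra(ordinal_a, ordinal_b))  # 2.0

# Hypothetical nominal records: (religion, residence, cooking fuel).
nominal_a = ["Hindu", "Urban", "Gas/oil"]
nominal_b = ["Hindu", "Rural", "Gas/oil"]
print(hamming(nominal_a, nominal_b))   # one of three fields differs
```

Because the Canberra distance scales each difference by the magnitude of the two values, a step between low ordinal levels weighs more than the same step between high levels, whereas the Hamming distance only registers whether two categories match.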
Fig. 3 Information on the clusters detected in the data.
a Distribution of clusters detected by DBSCAN on the five-dimensional reduced representation of the data. b UMAP clusters for five-dimensional reduced representation of the data annotated by the DBSCAN generated clusters.
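To make the density-based step concrete, the following is a minimal pure-Python sketch of DBSCAN applied to five-dimensional points. It is a stand-in for a library implementation, not the authors' code; the toy points and the `eps` and `min_samples` values are assumptions chosen for illustration and are not the settings used in the study.

```python
import math
from collections import Counter

NOISE = -1


def region_query(points, i, eps):
    """Indices of all points within eps of point i (including i itself)."""
    return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]


def dbscan(points, eps, min_samples):
    """Label each point with a cluster index, or NOISE if it fits no cluster."""
    labels = [None] * len(points)
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        neighbours = region_query(points, i, eps)
        if len(neighbours) < min_samples:
            labels[i] = NOISE  # may later be claimed as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in neighbours if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: joins but does not expand
            if labels[j] is not None:
                continue
            labels[j] = cluster
            j_neighbours = region_query(points, j, eps)
            if len(j_neighbours) >= min_samples:
                queue.extend(j_neighbours)  # core point: keep expanding
    return labels


# Toy five-dimensional data: two dense groups and one isolated point.
points = [
    (0.0, 0.0, 0.0, 0.0, 0.0), (0.1, 0.0, 0.0, 0.0, 0.0), (0.0, 0.1, 0.0, 0.0, 0.0),
    (5.0, 5.0, 5.0, 5.0, 5.0), (5.1, 5.0, 5.0, 5.0, 5.0), (5.0, 5.1, 5.0, 5.0, 5.0),
    (20.0, 20.0, 20.0, 20.0, 20.0),
]
labels = dbscan(points, eps=0.5, min_samples=3)
print(labels)           # [0, 0, 0, 1, 1, 1, -1]
print(Counter(labels))  # cluster-size distribution, with -1 marking noise
```

Points labelled -1 belong to no cluster, so the sizes of the detected clusters need not add up to the number of input points.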
Detailed cluster-specific analysis for all numerical and categorical variables.
| Cluster size (N) | 2898 | 2301 | 2226 | 1315 |
| Continuous variables (mean ± SE) | ||||
| Age (years) | 41.3 ± 0.14 | 38.3 ± 0.19 | 39.9 ± 0.18 | 37.9 ± 0.26 |
| Body mass index (kg/m²) | 26.7 ± 0.09 | 23.9 ± 0.1 | 26 ± 0.11 | 23.6 ± 0.13 |
| Haemoglobin (g/dL) | 12.5 ± 0.04 | 12.3 ± 0.04 | 12.1 ± 0.04 | 12.3 ± 0.06 |
| Time to the water source (min) | 0.1 ± 0.01 | 0.02 ± 0.01 | 0.09 ± 0.01 | 18.6 ± 0.39 |
| Categorical variables, n (%) | ||||
| Sex | ||||
| Male | 558 (19.25) | 457 (19.86) | 270 (12.13) | 323 (24.56) |
| Female | 2340 (80.75) | 1844 (80.14) | 1956 (87.87) | 992 (75.44) |
| History of asthma | ||||
| No | 2737 (94.44) | 2064 (89.7) | 1999 (89.8) | 1121 (85.25) |
| Yes | 161 (5.56) | 237 (10.3) | 227 (10.2) | 194 (14.75) |
| History of thyroid disorder | ||||
| No | 2636 (90.96) | 2135 (92.79) | 1992 (89.49) | 1196 (90.95) |
| Yes | 262 (9.04) | 166 (7.21) | 234 (10.51) | 119 (9.05) |
| History of heart disease | ||||
| No | 2729 (94.17) | 2107 (91.57) | 1996 (89.67) | 1174 (89.28) |
| Yes | 169 (5.83) | 194 (8.43) | 230 (10.33) | 141 (10.72) |
| History of cancer | ||||
| No | 2876 (99.24) | 2272 (98.74) | 2161 (97.08) | 1246 (94.75) |
| Yes | 22 (0.76) | 29 (1.26) | 65 (2.92) | 69 (5.25) |
| Ever suffered from TB | ||||
| No | 2890 (99.72) | 2287 (99.39) | 2218 (99.64) | 1305 (99.24) |
| Yes | 8 (0.28) | 14 (0.61) | 8 (0.36) | 10 (0.76) |
| Milk/curd intake freq. | ||||
| Never | 201 (6.94) | 183 (7.95) | 110 (4.94) | 123 (9.35) |
| Weekly | 461 (15.91) | 551 (23.95) | 293 (13.16) | 405 (30.8) |
| Occasionally | 611 (21.08) | 669 (29.07) | 447 (20.08) | 291 (22.13) |
| Daily | 1625 (56.07) | 898 (39.03) | 1376 (61.81) | 496 (37.72) |
| Pulses/beans intake freq. | ||||
| Never | 13 (0.45) | 17 (0.74) | 18 (0.81) | 9 (0.68) |
| Weekly | 255 (8.8) | 248 (10.78) | 152 (6.83) | 198 (15.06) |
| Occasionally | 1263 (43.58) | 937 (40.72) | 936 (42.05) | 574 (43.65) |
| Daily | 1367 (47.17) | 1099 (47.76) | 1120 (50.31) | 534 (40.61) |
| Green vegetables intake freq. | ||||
| Never | 7 (0.24) | 12 (0.52) | 10 (0.45) | 9 (0.68) |
| Weekly | 324 (11.18) | 259 (11.26) | 279 (12.53) | 142 (10.8) |
| Occasionally | 1000 (34.51) | 796 (34.59) | 792 (35.58) | 483 (36.73) |
| Daily | 1567 (54.07) | 1234 (53.63) | 1145 (51.44) | 681 (51.79) |
| Fruit intake freq. | ||||
| Never | 50 (1.73) | 65 (2.82) | 74 (3.32) | 41 (3.12) |
| Weekly | 897 (30.95) | 1148 (49.89) | 872 (39.17) | 750 (57.03) |
| Occasionally | 1203 (41.51) | 818 (35.55) | 810 (36.39) | 386 (29.35) |
| Daily | 748 (25.81) | 270 (11.73) | 470 (21.11) | 138 (10.49) |
| Egg intake freq. | ||||
| Never | 97 (3.35) | 85 (3.69) | 1983 (89.08) | 41 (3.12) |
| Weekly | 1005 (34.68) | 963 (41.85) | 153 (6.87) | 520 (39.54) |
| Occasionally | 1537 (53.04) | 1100 (47.81) | 80 (3.59) | 678 (51.56) |
| Daily | 259 (8.94) | 153 (6.65) | 10 (0.45) | 76 (5.78) |
| Fish intake freq. | ||||
| Never | 222 (7.66) | 106 (4.61) | 2162 (97.12) | 83 (6.31) |
| Weekly | 994 (34.3) | 1006 (43.72) | 35 (1.57) | 593 (45.1) |
| Occasionally | 1210 (41.75) | 987 (42.89) | 20 (0.9) | 563 (42.81) |
| Daily | 472 (16.29) | 202 (8.78) | 9 (0.4) | 76 (5.78) |
| Chicken/meat intake freq. | ||||
| Never | 53 (1.83) | 58 (2.52) | 2175 (97.71) | 33 (2.51) |
| Weekly | 1274 (43.96) | 1150 (49.98) | 32 (1.44) | 640 (48.67) |
| Occasionally | 1475 (50.9) | 1032 (44.85) | 18 (0.81) | 612 (46.54) |
| Daily | 96 (3.31) | 61 (2.65) | 1 (0.04) | 30 (2.28) |
| Fried food intake freq. | ||||
| Never | 179 (6.18) | 161 (7) | 276 (12.4) | 95 (7.22) |
| Weekly | 1275 (44) | 988 (42.94) | 1114 (50.04) | 631 (47.98) |
| Occasionally | 1071 (36.96) | 849 (36.9) | 715 (32.12) | 408 (31.03) |
| Daily | 373 (12.87) | 303 (13.17) | 121 (5.44) | 181 (13.76) |
| Aerated drink intake freq. | ||||
| Never | 512 (17.67) | 475 (20.64) | 409 (18.37) | 262 (19.92) |
| Weekly | 1579 (54.49) | 1258 (54.67) | 1200 (53.91) | 744 (56.58) |
| Occasionally | 597 (20.6) | 449 (19.51) | 497 (22.33) | 236 (17.95) |
| Daily | 210 (7.25) | 119 (5.17) | 120 (5.39) | 73 (5.55) |
| Alcoholic | ||||
| No | 2627 (90.65) | 2027 (88.09) | 2171 (97.53) | 1127 (85.7) |
| Yes | 271 (9.35) | 274 (11.91) | 55 (2.47) | 188 (14.3) |
| Smoker | ||||
| No | 2770 (95.58) | 2192 (95.26) | 2197 (98.7) | 1234 (93.84) |
| Yes | 128 (4.42) | 109 (4.74) | 29 (1.3) | 81 (6.16) |
| Indoor smoking freq. | ||||
| Never | 1849 (63.8) | 1138 (49.46) | 1429 (64.2) | 690 (52.47) |
| Weekly | 222 (7.66) | 264 (11.47) | 176 (7.91) | 129 (9.81) |
| Less than monthly | 72 (2.48) | 72 (3.13) | 71 (3.19) | 33 (2.51) |
| Monthly | 78 (2.69) | 72 (3.13) | 68 (3.05) | 36 (2.74) |
| Daily | 677 (23.36) | 755 (32.81) | 482 (21.65) | 427 (32.47) |
| Residence | ||||
| Urban | 1991 (68.7) | 704 (30.6) | 1131 (50.81) | 368 (27.98) |
| Rural | 907 (31.3) | 1597 (69.4) | 1095 (49.19) | 947 (72.02) |
| Wealth index | ||||
| Poorest | 1 (0.03) | 287 (12.47) | 82 (3.68) | 301 (22.89) |
| Poorer | 8 (0.28) | 519 (22.56) | 154 (6.92) | 285 (21.67) |
| Middle | 151 (5.21) | 698 (30.33) | 245 (11.01) | 339 (25.78) |
| Richer | 882 (30.43) | 698 (30.33) | 523 (23.5) | 280 (21.29) |
| Richest | 1856 (64.04) | 99 (4.3) | 1222 (54.9) | 110 (8.37) |
| Highest education level | ||||
| No education | 388 (13.39) | 758 (32.94) | 416 (18.69) | 472 (35.89) |
| Primary level | 347 (11.97) | 373 (16.21) | 303 (13.61) | 240 (18.25) |
| Secondary level | 1641 (56.63) | 1006 (43.72) | 1106 (49.69) | 530 (40.3) |
| Higher level | 522 (18.01) | 164 (7.13) | 401 (18.01) | 73 (5.55) |
| Religion | ||||
| Hindu | 1822 (62.87) | 1544 (67.1) | 1947 (87.47) | 975 (74.14) |
| Muslim | 627 (21.64) | 472 (20.51) | 46 (2.07) | 210 (15.97) |
| Christian | 313 (10.8) | 210 (9.13) | 13 (0.58) | 97 (7.38) |
| Others | 136 (4.69) | 75 (3.26) | 220 (9.88) | 33 (2.51) |
| Caste/tribe | ||||
| OBC | 1331 (45.93) | 871 (37.85) | 805 (36.16) | 472 (35.89) |
| SC | 384 (13.25) | 517 (22.47) | 328 (14.73) | 343 (26.08) |
| ST | 303 (10.46) | 385 (16.73) | 86 (3.86) | 258 (19.62) |
| General | 880 (30.37) | 528 (22.95) | 1007 (45.24) | 242 (18.4) |
| Hypertensive | ||||
| No | 1594 (55) | 1443 (62.71) | 1281 (57.55) | 849 (64.56) |
| Yes | 1304 (45) | 858 (37.29) | 945 (42.45) | 466 (35.44) |
| Possess refrigerator | ||||
| No | 131 (4.52) | 2296 (99.78) | 762 (34.23) | 989 (75.21) |
| Yes | 2767 (95.48) | 5 (0.22) | 1464 (65.77) | 326 (24.79) |
| Possess bicycle | ||||
| No | 1503 (51.86) | 1055 (45.85) | 1013 (45.51) | 617 (46.92) |
| Yes | 1395 (48.14) | 1246 (54.15) | 1213 (54.49) | 698 (53.08) |
| Possess motorbike | ||||
| No | 825 (28.47) | 1590 (69.1) | 734 (32.97) | 884 (67.22) |
| Yes | 2073 (71.53) | 711 (30.9) | 1492 (67.03) | 431 (32.78) |
| Possess car/truck | ||||
| No | 2217 (76.5) | 2226 (96.74) | 1840 (82.66) | 1273 (96.81) |
| Yes | 681 (23.5) | 75 (3.26) | 386 (17.34) | 42 (3.19) |
| Cooking fuel used | ||||
| Other | 1 (0.03) | 4 (0.17) | 0 (0) | 1 (0.08) |
| Plant based | 354 (12.22) | 1018 (44.24) | 437 (19.63) | 723 (54.98) |
| Livestock based | 47 (1.62) | 297 (12.91) | 211 (9.48) | 104 (7.91) |
| Gas/oil | 2460 (84.89) | 965 (41.94) | 1562 (70.17) | 476 (36.2) |
| Electricity | 36 (1.24) | 17 (0.74) | 16 (0.72) | 11 (0.84) |
| Household structure | ||||
| Non-nuclear | 1310 (45.2) | 1016 (44.15) | 1120 (50.31) | 564 (42.89) |
| Nuclear | 1588 (54.8) | 1285 (55.85) | 1106 (49.69) | 751 (57.11) |
| Possess livestock | ||||
| No | 2226 (76.81) | 1155 (50.2) | 1474 (66.22) | 646 (49.13) |
| Yes | 672 (23.19) | 1146 (49.8) | 752 (33.78) | 669 (50.87) |
| Drinking water source | ||||
| Unprotected sources | 76 (2.62) | 146 (6.35) | 44 (1.98) | 204 (15.51) |
| Protected sources | 739 (25.5) | 998 (43.37) | 686 (30.82) | 522 (39.7) |
| Community service | 1991 (68.7) | 1112 (48.33) | 1448 (65.05) | 508 (38.63) |
| Bottled water | 86 (2.97) | 43 (1.87) | 46 (2.07) | 77 (5.86) |
| Other | 6 (0.21) | 2 (0.09) | 2 (0.09) | 4 (0.3) |
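The categorical cells in the table appear to be counts followed by the within-cluster percentage in parentheses. A short sketch, assuming that reading, reproduces the "Male" row from the cluster sizes given in the first row of the table; the helper name `format_cell` is hypothetical.

```python
cluster_sizes = [2898, 2301, 2226, 1315]  # "Cluster size (N)" row
male_counts = [558, 457, 270, 323]        # "Male" row, counts only


def format_cell(count, size):
    """Render a count with its within-cluster percentage, e.g. '558 (19.25)'."""
    percent = round(100 * count / size, 2)
    return f"{count} ({percent:g})"


cells = [format_cell(c, n) for c, n in zip(male_counts, cluster_sizes)]
print(" | ".join(cells))  # 558 (19.25) | 457 (19.86) | 270 (12.13) | 323 (24.56)
```

The `g` format drops trailing zeros, which matches how the table prints values such as 89.7 and 7.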