| Literature DB >> 35433869 |
Chaochao Ma1, Lei Li1, Xinlu Wang2, Li'an Hou1, Liangyu Xia1, Yicong Yin1, Xinqi Cheng1, Ling Qiu1,3.
Abstract
Objective: The level of Homocysteine (Hcy) in males is generally higher than that of females, but the same reference interval (RI) is often used in clinical practice. This study aims to establish a sex-specific RI of Hcy using five data mining algorithms and compare these results. Furthermore, age-related continuous RI was established in order to show the relationship between Hcy concentration distribution and age.Entities:
Keywords: GAMLSS algorithm; aging model; big data mining; homocysteine; reference interval
Year: 2022 PMID: 35433869 PMCID: PMC9005842 DOI: 10.3389/fcvm.2022.846685
Source DB: PubMed Journal: Front Cardiovasc Med ISSN: 2297-055X
The basic information of data set 1 and data set 2.
| Units | Data set 1 | Data set 2 | |
|
| 2,261 | 11,074 | |
| Sex | Female: male | 1,173:1,124 | 7,685:3,385 |
| Age | Year | 50 (36, 63) | 47 (38, 55) |
| Vitamin B12 | pg/mL | 356 (281, 475) | 368 (288, 479) |
| Folate | ng/mL | 10.4 (7.8, 14.8) | 10.9 (8.2, 14.9) |
| ALT | U/L | 18 (14, 25) | 16 (12, 23) |
| AST | U/L | 20 (17, 23) | 19 (16, 22) |
| Cr | μmol/L | 70 (61, 80) | 65 (58, 75) |
| Urea | mmol/L | 4.94 (4.28, 5.66) | 4.77 (4.09, 5.53) |
| TSH | μIU/L | 1.903 (1.381, 2.602) | 1.888 (1.366, 2.589) |
| FT3 | pg/mL | 3.32 (3.11, 3.55) | 3.27 (3.06, 3.48) |
| FT4 | ng/dL | 1.23 (1.14, 1.34) | 1.22 (1.12, 1.32) |
Results were description as Median (P25, P75).
Results of multiple linear regression.
| Sex | Age | |||||||||||||
| A1 | A2 | A3 | A4 | A5 | A6 | |||||||||
|
|
|
|
|
|
|
| ||||||||
| β |
| β |
| β |
| β |
| β |
| β |
| β |
| |
|
| 0.448 | < 0.001 | −0.033 | 0.160 | 0.004 | 0.874 | 0.038 | 0.110 | 0.101 | < 0.001 | 0.179 | < 0.001 | 0.146 | < 0.001 |
FIGURE 1Age 1, Age 2, Age 3, Age 4, Age 5, Age 6, and Age 7 are, respectively, standing for 18–29 years, 30–39 years, 40–49 years, 50–59 years, 60–69 years, 70–79 years, and > 80 years. Sex0 stands for females and sex1 stands for males.
SDR of sex and age.
| SDresi | Sex | Age | |||
|
| SDR |
| SDR | ||
|
| 1.876 | 1.378 | 0.735 | 0.586 | 0.312 |
SD, standard deviation; SDR, standard deviation ratio.
Reference interval of Hcy established by using five algorithms.
| Hoffmann | Bhattacharya | Expectation maximization | kosmic | Refine R | |||
| Total | P2.5 | 7.8 | 7.6 | 9.5 | 8.5 | 8.4 | 7.92–8.79 |
| P5 | 8.5 | 8.3 | 9.9 | 9.0 | 8.9 | 8.53–9.21 | |
| P25 | 10.6 | 10.3 | 11.3 | 10.5 | 10.5 | 10.30–10.76 | |
| Median | 12.1 | 11.7 | 12.5 | 11.6 | 11.7 | 11.42–12.10 | |
| P75 | 13.6 | 13.1 | 13.9 | 12.8 | 13.0 | 12.50–13.60 | |
|
|
|
|
|
|
|
| |
| P97.5 | 16.4 | 15.8 | 17.6 | 15.1 | 15.7 | 14.48–17.06 | |
| Female | P2.5 | 7.8 | 8.4 | 9.0 | 8.1 | 7.9 | 7.38–9.82 |
| P5 | 8.3 | 8.8 | 9.3 | 8.5 | 8.3 | 7.95–10.08 | |
| P25 | 10.0 | 10.2 | 10.3 | 9.9 | 9.8 | 9.62–10.93 | |
| Median | 11.1 | 11.1 | 11.1 | 11.0 | 10.9 | 10.32–11.53 | |
| P75 | 12.3 | 12.1 | 12.1 | 12.1 | 11.9 | 10.85–12.29 | |
|
|
|
|
|
|
|
| |
| P97.5 | 14.4 | 14.0 | 14.6 | 14.7 | 14.1 | 11.79–14.95 | |
| Male | P2.5 | 9.4 | 9.8 | 11.4 | 9.3 | 9.7 | 8.97–11.24 |
| P5 | 10.0 | 10.3 | 11.8 | 9.9 | 10.1 | 9.63–11.59 | |
| P25 | 11.8 | 11.7 | 12.8 | 11.6 | 11.7 | 11.29–12.73 | |
| Median | 13.1 | 12.7 | 13.7 | 12.9 | 12.9 | 12.17–13.58 | |
| P75 | 14.4 | 13.8 | 14.7 | 14.1 | 14.1 | 12.91–14.51 | |
|
|
|
|
|
|
|
| |
| P97.5 | 16.8 | 15.7 | 17.1 | 16.5 | 16.4 | 14.31–17.05 |
The results of EM algorithm are estimated using mean and standard deviation; P2. 5, P5, P25, P75, P95, P97.5 represents 2.5 quantile, 5 quantile, 25 quantile, 75 quantile, 95 quantile, and 97.5 quantile, respectively; The P95 quantile is the upper limit of the reference interval established in our study and the lower limit is 0. Hoffmann, Bhattacharya, Expectation maximization, kosmic, Refine R represents five algorithms which were used in our study. The bold values are the upper limits of RI for Hcy.
FIGURE 2(A–E) Respectively, stand for Hoffman, Bhattacharya, EM, kosmic and refineR algorithms. The first column represents the total reference interval, the second column represents the reference interval for females, and the third column represents the reference interval for males. (A,B) In the plot of Hoffmann and Bhattacharya, linear region represents the distribution of healthy individuals and RI were determined by extending the linear region of the healthy subgroup. (C) The green curve represents the distribution of healthy individuals and RI were determined by distribution of green curve. (D) The algorithm minimizes the difference between an estimated parametrical distribution and a truncated part (the interval between two vertical dashed lines) of the observed distribution, and RI were determined by estimated parametrical distribution (the distribution curve in the graph). (E) The green curve in plot represents the distribution of healthy individuals and the green vertical dotted line indicates the upper limit of the estimated RI.
FIGURE 3The solid lines from top to bottom represent the 95th, 75th, 50th, and 25th, 5th quantiles of concentration of Hcy, respectively; The gray shadows represent 95% confidence intervals for each percentile curve.