| Literature DB >> 31384712 |
Cecilie R Buskbjerg1, Claus H Gravholt2,3, Helene R Dalby4, Ali Amidi1, Robert Zachariae1.
Abstract
Testosterone supplementation (TS) is assumed important for cognitive functioning in men, but conflicting results have prevented firm conclusions. The current study systematically reviewed available randomized controlled trials (RCTs) on effects of TS on cognitive functioning in men, subjected the findings to meta-analysis, and explored between-study differences as possible moderators of the effects. Following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines, two authors independently searched for eligible records in the electronic databases of PubMed, PsycINFO, Web of Science, the Cochrane Library, Cumulative Index of Nursing and Allied Health, and Embase and determined eligibility using the following (population, intervention, comparison, outcome) criteria: population, male adults (>18 years); intervention, TS; comparison, placebo; and outcome, results of standardized neuropsychological tests. Following duplicate removal, 3873 records were screened with 92 remaining for full-text screening. Twenty-one papers reporting results of 23 independent RCTs were included, of which none treated samples of clinically hypogonadal men. The small improvement found in overall cognitive functioning (Hedges g = 0.09; CI 95%: -0.02 to 0.19) failed to reach statistical significance (P = 0.108) and approached zero when adjusting for possible publication bias (g = 0.04). The effects for the 11 individual cognitive domains did not reach statistical significance (g: -0.04 to 0.19, P: 0.061 to 0.989). Small statistically significant (P < 0.05) effects were found for five study subsets but failed to meet the fail-safe criterion. The available evidence indicates that effects of TS on cognitive functioning in men with testosterone levels within normal ranges are less robust and of insufficient magnitude to be of clinical relevance. The effects in clinically hypogonadal men remain to be investigated.Entities:
Keywords: androgens; cognitive functions; meta-analysis; systematic review; testosterone supplementation
Year: 2019 PMID: 31384712 PMCID: PMC6676074 DOI: 10.1210/js.2019-00119
Source DB: PubMed Journal: J Endocr Soc ISSN: 2472-1972
Figure 1.Flowchart of study selection.
Description of Included Studies
| Study | Cognitive Outcome | n | n | Duration | Type | Doses | Mean Age (Range/SD) | Cognitive Status | Gonadal Status Pre | Gonadal Status Post | TS Effect | T Medium | Assay | Time |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Janowsky | Executive function; verbal memory; visuomotor function | 56 | 56 | 12.00 | Patch | 15 mg/d | 67.40 (60–75) | Normal | Normal 555.2 (132.6) pM | High 850.2 (397.3) pM | 1.11 | Serum | Direct RIA | Morning/NR |
| Janowsky | Attention/working memory | 19 | 19 | 4.00 | Injection | 150 mg/wk | 67.45 (61–75) | Normal | Normal 423.3 (128.7) pM | High 1568.4 (1086.1) pM | 0.12 | Serum | Direct RIA | Morning/NR |
| Cherrier | Executive function; language; verbal memory; visuospatial memory; attention/working memory | 28 | 25 | 6.00 | Injection | 100 mg/wk | 67.40 (50–80) | Normal | Normal 576.4 (66.9) ng/dL | High 1239.2 (53.0) ng/dL | 7.68 | Serum | Direct RIA | Random/not fasting |
| O’Connor | Executive function; language; verbal memory; visuomotor function; attention/working memory | 30 | 29 | 8.00 | Injection | 200 mg/wk | 28.30 (19–45) | Normal | Normal 625.4 (66.9) ng/dL | High 1107.6 (111.2) ng/dL | 2.43 | Serum | Direct CIA | Random/not fasting |
| Kenny | Executive function; language; attention/working memory | 67 | 44 | 52.00 | Patch | 2–2.5 mg/d | 76.00 (65–87) | Normal | Normal 93.0 (34.0) ng/dL | Normal 162.0 (100) ng/dL | 1.00 | Serum | Direct RIA | Morning/fasting |
| Kenny | Cognitive status; executive function; visuomotor function; attention/working memory | 11 | 11 | 12.00 | Injection | 200 mg/2 wk | 80.00 (73–87) | Impaired (MCI) | Normal 410.0 (112.0) ng/dL | High 1211.0 (360.0) ng/dL | 2.59 | Serum | Direct RIA | NR/fasting |
| Cherrier | Executive function; language; visuospatial memory; visuomotor function; attention/working memory | 41 | 38 | 6.00 | Injection | 100 mg/wk | 65.00 (50–85) | Normal | Normal 403.5 (230.6) ng/dL | High 1476.5 (734.9) ng/dL | 1.73 | Serum | Direct CIA | Random/not fasting |
| Cherrier | Executive function; language; verbal memory; visuospatial memory; verbal learning; visuomotor function | 32 | 28 | 6.00 | Injection | 100 mg/wk | 76.00 (63–85) | Impaired (AD + MCI) | Normal 403.5 (172.9) ng/dL | High 1441.9 (230.6) ng/dL | 5.11 | Serum | Direct CIA | Random/not fasting |
| Haren | Cognitive status; visuomotor function; attention/working memory | 76 | 58 | 52.00 | Pellets | 80 mg/d | 68.50 (60–86) | Normal | Normal 482.4 (130.6) ng/dL | Normal 449.3 (236.6) ng/dL | 0.02 | Serum | Direct CIA | Morning/fasting |
| Lu | Cognitive status; verbal memory; visuomotor function; visuospatial function | 20 | 14 | 24.00 | Gel (1%) | 75 mg/d | 69.80 (8.7) | Impaired (AD) | Normal 387.7 (76.6) ng/dL | Normal 597.1 (554.3) ng/dL | 0.64 | Serum | Direct RIA | NR/NR |
| Lu | Cognitive status; verbal memory; visuomotor function; visuospatial function | 29 | 20 | 24.00 | Gel (1%) | 75 mg/d | 62.36 (6.7) | Normal | Normal 385.8 (170.1) ng/dL | Normal 737.5 (241.9) ng/dL | 1.77 | Serum | Direct RIA | NR/NR |
| Cherrier | Verbal memory; visuospatial memory | 57 | 50 | 6.00 | Injection | 50, 100, or 300 mg/6 wk | 67.00 (56–78) | Normal | Normal 396.3 (160.5) ng/dL | High 1915.5 (1230.8) ng/dL | 2.50 | Serum | Direct CIA | Random/not fasting |
| Maki | Executive function; verbal memory; verbal learning; visuospatial learning; attention/working memory | 15 | 15 | 12.90 | Injection | 200 mg/2 wk | 73.90 (66–86) | Normal | Normal 10.2 ± 3.2 pg/mL | High 970.21 (359.1) ng/dL | 1.82 | Serum | Direct RIA and CIA | NR/NR |
| Vaughan | Executive function; verbal memory; verbal learning; visuospatial learning; visuospatial function; attention/working memory | 47 | 32 | 156.00 | Injection | 200 mg/2 wk | 70.80 (65–83) | Normal | Low–normal 285.3 (46.1) ng/dL | Normal 587.9 (279.5) ng/dL | 1.10 | Serum | NR | Morning/NR |
| Emmelot-Vonk | Executive function; verbal memory; verbal learning; visuospatial function | 237 | 223 | 26.00 | Pellets | 160 mg/d | 67.25 (60–80) | Normal | Low–normal 317.0 (54.8) ng/dL | Low–normal “unchanged” | −0.56 | Serum | Direct CIA | Morning/fasting |
| Young | Executive function; language; verbal memory; verbal learning; visuospatial learning; visuospatial function; attention/working memory | 13 | 13 | 6.00 | Gel | 100 mg/d | 29.31 (3.3) | Normal | Normal 411 (125.8) pM | Normal 541.9 (310.2) pM | 0.23 | Serum | Direct RIA | NR/NR |
| Young | Executive function; language; verbal memory; verbal learning; visuospatial learning; visuospatial function; attention/working memory | 15 | 15 | 6.00 | Gel | 75 mg/d | 67.40 (5.5) | Normal | Normal 241 (65.5) pM | Normal 347.6 (155.2) pM | 1.06 | Serum | Direct RIA | NR/NR |
| Borst | Executive function; visuospatial memory; visuospatial learning; visuospatial function; attention/working memory | 30 | 19 | 52.00 | Injection | 125 mg/wk | 70.00 (8.9) | Normal | Low–normal 245.0 (73.0) ng/dL | Normal 474.0 (193.5) ng/dL | 1.46 | Serum | Direct CIA | NR/NR |
| Cherrier | Executive function; language; verbal memory; visuospatial memory; verbal learning; visuospatial learning; visuospatial function; reaction time | 22 | 19 | 24.00 | Derma gel | 50–100 mg/d | 70.50 (60–88) | Impaired (MCI) | Low–normal 308.2 (92.1) ng/dL | Normal 600.7 (19.7) ng/dL | 1.91 | Serum | LC-MS/MS | Random/not fasting |
| Huang | Executive function; language; verbal memory; visuospatial memory; verbal learning; visuospatial learning | 308 | 240 | 156.00 | Gel (1%) | 7.5 g 1% T gel/d | 67.55 (5.10) | Normal | Low–normal 305.5 (63.4) | Normal 567.7 (265.1) ng/dL | 1.40 | Serum | Direct IA | Morning/fasting |
| Melehan | Executive function; reaction time | 67 | 54 | 18.00 | Injection | 1000 mg/6 wk | 49.00 (1.6) | Normal | Normal 352.7 (161.4) ng/dL | Normal 539.04 (115.16) ng/dL | 1.40 | Serum | LC-MS/MS | Morning/NR |
| Wahjoepramono | Cognitive status; verbal memory; verbal learning | 50 | 44 | 24.00 | Cream (5%) | 50 mg/d | 61.05 (7.7) | Normal | Normal 474.4 (126.8) ng/dL | Normal 769.5 (348.7) ng/dL | 1.40 | Serum | LC-MS/MS | NR/NR |
| Resnick | Cognitive status; executive function; verbal memory; verbal learning; visuospatial learning; reaction time | 493 | 438 | 52.00 | Gel (1%) | 5.00 g 1% T gel/d | 72.20 (6.0) | Impaired (AAMI) | Low–normal 234.4 (65.2) ng/dL | Normal 490 (86.2) ng/dL | 1.80 | Serum | LC-MS/MS | NR/NR |
Abbreviations: AAMI, age-associated memory impairment; AD, Alzheimer’s disease patients; CIA, chemiluminescent immunoassay; IA, immunoassay; LC, liquid chromatography; MCI, mild cognitively impaired patients; MS/MS, tandem mass spectrometry; NR, not reported; Post, post-treatment assessment; Pre, pretreatment assessment; RIA, radioimmunoassay; T, testosterone.
Number of participants randomized to the study.
Number of participants completing the study.
Duration of treatment is presented in weeks.
Type refers to the type of treatment used to administer TS.
Total sample mean age and associated range or SD depending on the available data.
The treatment groups’ gonadal status before and after treatment was categorized depending on mean total testosterone (TT) levels as follows: (i) low, TT ≤ 231 ng/dL; (ii) low–normal, TT between 232 and 320 ng/dL; (iii) normal, TT between 321 and 865 ng/dL; and (iv) supraphysiological (high), TT > 865 ng/dL. When only mean free testosterone (FT) levels were available, treatment groups’ gonadal status were categorized as the following: (i) low, FT ≤ 174 pM; (ii) low–normal, FT between 175 and 220 pM; (iii) normal, FT between 221 and 846 pM; and (iv) supraphysiological (high), FT > 846 pM. In one case [49], only mean bioavailable testosterone (BT) levels were provided, and these were categorized as normal (BT between 72 and 235 ng/dL). Mean TT, FT, or BT and associated SDs are shown below the gonadal category.
The effect of TS on testosterone levels was calculated as an effect size (Hedges g) for the increase in serum testosterone in the active vs control group from pre- to post-treatment. Positive values indicate an effect in the expected direction, i.e., largest increase in the active groups’ testosterone levels compared with the control group.
Lu et al. [43] presented separate data on Alzheimer’s disease patients and healthy men, and the reference is thus represented with two different studies in the table.
Young et al. [44] presented separate data on young men and older men, and the reference is thus represented with two different studies in the table.
Treatment doses were adjusted to keep testosterone concentrations within specified ranges.
Resnick et al. [63] classified patients as having AAMI when they had both subjective memory complaints and relative impairment on objective tests of memory performance.
Pooled Effect Sizes Across Outcomes
| Sample Size | Heterogeneity | Effect Size | Fail-Safe n | Criterion | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Outcome | K | n |
|
|
|
|
| 95% CI |
| ||
| Overall combined effect | 23 | 1638 | 20.34 | 22 | 0.562 | 0.00 | 0.09 | −0.02 to 0.19 | 0.108 | — | — |
| Cognitive domain | |||||||||||
| Attention/working memory | 11 | 488 | 7.028 | 10 | 0.723 | 0.00 | 0.16 | −0.07 to 0.33 | 0.061 | — | — |
| Cognitive status | 6 | 657 | 1.24 | 5 | 0.624 | 0.00 | −0.04 | −0.19 to 0.11 | 0.624 | — | — |
| Executive function | 16 | 1447 | 16.15 | 15 | 0.372 | 7.11 | 0.02 | −0.07 to 0.12 | 0.611 | — | — |
| Language | 7 | 423 | 9.64 | 6 | 0.141 | 37.77 | 0.07 | −0.11 to 0.24 | 0.442 | — | — |
| Verbal learning | 10 | 1201 | 5.51 | 9 | 0.788 | 0.00 | 0.03 | −0.62 to 0.11 | 0.572 | — | — |
| Verbal memory | 16 | 1378 | 23.11 | 15 | 0.082 | 35.10 | 0.08 | −0.06 to 0.22 | 0.256 | — | — |
| Visuomotor function | 10 | 563 | 22.12 | 9 |
| 59.31 | 0.15 | −0.14 to 0.43 | 0.311 | — | — |
| Visuospatial function | 10 | 878 | 3.99 | 9 | 0.912 | 0.00 | 0.00 | −0.11 to 0.12 | 0.989 | — | — |
| Visuospatial learning | 7 | 902 | 10.82 | 6 | 0.094 | 44.56 | 0.19 | −0.12 to 0.49 | 0.234 | — | — |
| Visuospatial memory | 7 | 454 | 6.81 | 6 | 0.339 | 11.92 | 0.05 | −0.11 to 0.22 | 0.525 | — | — |
| Visuospatial skills (combined) | 16 | 1322 | 12.25 | 15 | 0.660 | 0.00 | 0.04 | −0.06 to 0.15 | 0.420 | — | — |
| Participants’ cognitive status | |||||||||||
| Cognitively normal | 18 | 1072 | 15.21 | 17 | 0.580 | 0.00 | 0.14 | −0.01 to 0.28 |
| — | — |
| Cognitively impaired | 5 | 566 | 4.09 | 4 | 0.394 | 2.26 | 0.03 | −0.15 to 0.21 | 0.739 | — | — |
| Participants gonadal status | |||||||||||
| Normal | 17 | 571 | 18.06 | 16 | 0.320 | 11.42 | 0.17 | 0.00 to 0.33 |
| 7 | 95 |
| Low/normal | 6 | 1067 | 0.69 | 5 | 0.983 | 0.00 | 0.02 | −0.13 to 0.17 | 0.785 | — | — |
| Age dichotomized | |||||||||||
| Young age (<68 y) | 12 | 591 | 10.50 | 11 | 0.486 | 0.00 | 0.20 | 0.02 to 0.36 |
| 7 | 70 |
| Old age (≥68 y) | 11 | 1048 | 7.65 | 10 | 0.663 | 0.00 | 0.03 | −0.10 to 0.16 | 0.676 | — | — |
| Administration type | |||||||||||
| Gel/cream | 8 | 914 | 2.40 | 7 | 0.934 | 0.00 | 0.02 | −0.13 to 0.16 | 0.839 | — | — |
| Pellets | 2 | 281 | 1.26 | 1 | 0.263 | 20.34 | −0.11 | −0.50 to 0.28 | 0.593 | — | — |
| Injection | 11 | 325 | 11.18 | 10 | 0.343 | 10.59 | 0.25 | 0.04 to 0.57 |
| 10 | 65 |
| Time of measure | |||||||||||
| Morning | 9 | 1277 | 9.86 | 8 | 0.275 | 18.86 | 0.07 | −0.09 to 0.23 | 0.378 | — | — |
| Not stated | 8 | 181 | 3.00 | 7 | 0.941 | 0.00 | 0.07 | −0.20 to 0.34 | 0.608 | — | — |
| Random | 6 | 180 | 5.37 | 5 | 0.372 | 6.92 | 0.32 | 0.02 to 0.61 |
| 3 | 40 |
| T measurement assay | |||||||||||
| LC-MS/MS | 4 | 611 | 2.18 | 3 | 0.536 | 0.00 | 0.03 | −0.12 to 0.18 | 0.691 | — | — |
| Direct CIA/RIA | 19 | 1027 | 17.15 | 18 | 0.513 | 0.00 | 0.14 | −0.01 to 0.29 |
| — | — |
| Publication year | |||||||||||
| Early studies | 11 | 360 | 15.63 | 10 | 0.111 | 36.01 | 0.24 | −0.02 to 0.49 |
| — | — |
| Late studies | 12 | 1278 | 3.02 | 11 | 0.990 | 0.00 | 0.04 | −0.09 to 0.17 | 0.524 | — | — |
| Industry sponsoring | |||||||||||
| Sponsored | 12 | 1254 | 3.91 | 11 | 0.972 | 0.00 | 0.05 | −0.08 to 0.18 | 0.470 | — | — |
| Not sponsored | 10 | 308 | 10.37 | 9 | 0.322 | 13.18 | 0.29 | 0.06 to 0.52 |
| 12 | 60 |
Abbreviations: df, degrees of freedom; K, number of studies in the analysis; n, number of subjects in the analysis.
Q statistic, P < 0.1 taken to suggest heterogeneity (bold) [33]; I statistic, 0% (no heterogeneity), 25% (low heterogeneity), 50% (moderate heterogeneity), 75% (high heterogeneity) [31].
Effect size = Hedges g. A positive value indicates an effect size in the hypothesized direction, i.e., improvement in the active groups’ cognitive functions compared with the control group. Conventions: small (0.2), medium (0.5), and large (0.8) [28]. Statistically significant P (<0.05) is in bold. Statistically trending P (>0.05 <0.10) is in italics.
In case of statistically significant effects, the robustness of findings was examined by calculation of the fail-safe n (number of nonsignificant studies that would bring P to >0.05) [36].
Fail-safe n exceeding the criterion (5K + 10) indicates a robust result [36].
Cognitive domains using visuospatial skills (visuospatial function, visuospatial learning, and visuospatial memory) were pooled.
Figure 2.Individual effect sizes and forest plots. Individual studies’ overall effect sizes (Hedges g) are shown in the second column and are the unit of analysis. The summary pooled effect size (Hedges g) computed with a random effect model is shown in the bottom row of the second column. Notation: Forest plot. Each square represents the effect size of one individual study. Size of the square indicates the relative weight assigned to the study, with more weight assigned to more precise studies, as indicated by larger squares. Lines: Each line represents a 95% CI of the effect size of the study. The diamond represents the summary pooled g of all of the individual gs. Precision of the summary effect is indicated by the width of the diamond.
Risk of Bias of Included Studies
|
|
Studies were rated as having low risk, high risk, or unclear risk of bias using the Cochrane Collaboration tool for assessing risk of bias [29].
Figure 3.Funnel plot of the overall pooled effect with missing studies imputed. Funnel plot of precision [1/SE (Std Err)] by effect sizes (Hedges g) with imputed studies. Each open circle represents precision as a measure of the sample size on the y-axis as a function of the effect size of each independent study on the x-axis. Closed circles represent imputed studies. The open diamond in the bottom of the plot indicates the summary effect size of the analysis. The closed diamond indicates the summary effect size with imputed studies included. A 95% CI is indicated by the lines. Within these lines, 95% of the circles are expected to be located. The overall pooled effect size is indicated by a vertical line in the middle of the plot.
Results From Meta-Regression Analyses
| Unadjusted Models | Adjusted Models | |||||||
|---|---|---|---|---|---|---|---|---|
| Variable | K | B | 95% CI |
| B | 95% CI |
| VIF |
| Treatment characteristics | ||||||||
| TS Type | 23 | |||||||
| Gel/cream (vs injection) | −0.27 | −0.47 to 0.02 |
| |||||
| Patches (vs injection) | −0.01 | −0.41 to 0.38 | 0.949 | |||||
| Pellets (vs injection) | −0.36 | −0.76 to 0.04 |
| |||||
| Treatment duration, wk | 23 | −0.00 | −0.00 to 0.00 | 0.336 | ||||
| TS effect (Hedges | 23 | 0.09 | 0.01 to 0.19 |
| 0.08 | −0.03 to 0.19 | 0.140 | 1.47 |
|
| ||||||||
| 0.07 | −0.33 to 0.18 | 0.180 | 1.37 | |||||
|
| ||||||||
| TS effect, young age (<68 y) | 12 | 0.08 | −0.04 to 0.19 | 0.183 | ||||
| TS effect, old age (≥68 y) | 11 | 0.14 | −0.02 to 0.30 |
| ||||
| T measurement characteristics | ||||||||
| Time of measurement | 23 | |||||||
| Random times (vs morning) | 0.27 | −0.05 to 0.58 |
| |||||
| Not stated (vs morning) | 0.02 | −0.27 to 0.32 | 0.877 | |||||
| Measurement assay | 23 | |||||||
| Direct CIA/RIA (vs LC-MS/MS) | 0.11 | −0.10 to 0.32 | 0.317 | |||||
| Participant characteristics | ||||||||
| Mean age | 23 | 0.00 | −0.01 to 0.01 | 0.758 | ||||
| Cognitive status | 23 | |||||||
| Impaired (vs normal) | −0.11 | −0.33 to 0.10 | 0.309 | |||||
| Gonadal status | 23 | |||||||
| Low–normal (vs normal) | −0.14 | −0.35 to 0.08 | 0.208 | |||||
| Study characteristics | ||||||||
| Early publication (vs late) | 23 | −0.15 | −0.39 to 0.08 | 0.194 | −0.19 | −0.49 to 0.11 | 0.205 | 1.68 |
|
| ||||||||
| Sponsored (vs not sponsored) | 22 | −0.23 | −0.48 to 0.02 |
| −0.13 | −0.42 to 0.18 | 0.407 | 1.66 |
|
| ||||||||
Statistically significant P (<0.05) is in bold. Statistically trending P (>0.05 <0.10) is in italic.
Abbreviations: Adj., adjusted; VIF, variance inflation factor.
Variables were explored individually in unadjusted models.
When theoretically sound intercorrelations between variables reached statistical significance (P < 0.05), they were explored together in adjusted models.
The association between moderators and the magnitude of the effect is expressed in unstandardized regression coefficients (B).
The variance inflation factor (VIF), a measure of multicollinearity, was calculated when variables were explored together in adjusted models. Conventions: VIF > 10 indicates a serious problem with bias [39, 40]. If VIF is substantially greater than one, then the regression may be biased [40].
Sensitivity Analyses of Statistically Significant Effects With Original and Revised Pooled Effects
| Sample Size | Heterogeneity | Effect Size | Fail-Safe n | Criterion | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Outcome | Analysis | K | n |
|
|
|
|
| 95% CI |
| ||
| Overall cognitive functioning | Original | 23 | 1638 | 20.34 | 22 | 0.562 | 0.00 | 0.09 | −0.02 to 0.19 | 0.108 | — | — |
| Imputed | 23 | 1638 | 14.71 | 22 | 0.874 | 0.00 | 0.07 | −0.04 to 0.18 | 0.189 | — | — | |
| Winsorized | 23 | 1638 | 18.24 | 22 | 0.692 | 0.00 | 0.07 | −0.03 to 0.16 | 0.169 | — | — | |
| Normal gonadal status | Original | 17 | 571 | 18.06 | 16 | 0.320 | 11.42 | 0.17 | 0.00 to 0.33 |
| 7 | 95 |
| Imputed | 17 | 571 | 16.17 | 16 | 0.441 | 1.079 | 0.14 | −0.01 to 0.30 |
| — | — | |
| Winsorized | 17 | 571 | 15.33 | 16 | 0.501 | 0.00 | 0.16 | −0.00 to 0.31 |
| — | — | |
| Young age | Original | 12 | 591 | 10.50 | 11 | 0.486 | 0.00 | 0.20 | 0.02 to 0.36 |
| 7 | 70 |
| Imputed | 12 | 591 | 7.71 | 11 | 0.739 | 0.00 | 0.17 | −0.01 to 0.35 |
| — | — | |
| Winsorized | 12 | 591 | 9.71 | 11 | 0.557 | 0.00 | 0.18 | 0.02 to 0.35 |
| 6 | 70 | |
| Using injection | Original | 11 | 325 | 11.18 | 10 | 0.343 | 10.59 | 0.25 | 0.04 to 0.57 |
| 11 | 70 |
| Imputed | 11 | 355 | 6.97 | 10 | 0.728 | 0.00 | 0.19 | −0.01 to 0.38 |
| — | — | |
| Winsorized | 11 | 355 | 11.19 | 10 | 0.342 | 10.69 | 0.24 | 0.04 to 0.45 |
| 11 | 70 | |
| Random times | Original | 6 | 180 | 5.37 | 5 | 0.372 | 6.92 | 0.32 | 0.02 to 0.61 |
| 3 | 40 |
| Imputed | 6 | 180 | 1.60 | 5 | 0.902 | 0.00 | 0.21 | −0.09 to 0.50 | 0.169 | — | — | |
| Not sponsored | Original | 10 | 308 | 10.37 | 9 | 0.322 | 13.18 | 0.29 | 0.06 to 0.52 |
| 12 | 60 |
| Imputed | 10 | 308 | 6.49 | 9 | 0.690 | 0.00 | 0.22 | 0.01 to 0.43 |
| 3 | 60 | |
Sensitivity analyses of the primary outcome, i.e., the effect of TS on overall cognitive functioning, defined as the individual cognitive domain scores aggregated into one combined estimate, and of five study subsets showing statistically significant improvements: (i) studies of men with mean testosterone levels within normal range (total testosterone between 321 and 865 ng/dL), i.e., normal gonadal status; (ii) studies of younger men (age <68 y); (iii) studies administering testosterone with injection; (iv) studies assessing testosterone at random times; and (v) trials not sponsored by the industry.
Q statistic, P < 0.1 taken to suggest heterogeneity (bold) [33]; I statistic, 0% (no heterogeneity), 25% (low heterogeneity), 50% (moderate heterogeneity), 75% (high heterogeneity) [31].
Effect size = Hedges g. A positive value indicates an effect size in the hypothesized direction, i.e., improvement in the active groups’ cognitive functions compared with the control group. Conventions: small (0.2); medium (0.5); large (0.8) [28]. Statistically significant P (<0.05) is highlighted in bold. Statistically trending P (>0.05 <0.10) is highlighted in italic.
In case of statistically significant effects, the robustness of findings was examined by calculation of the fail-safe n (number of nonsignificant studies that would bring P to >0.05) [36].
Fail-safe n exceeding the criterion (5K + 10) indicates a robust result [36].
Imputed = imputed effect size of zero in two cases [47, 52], when included publications stated they had obtained nonsignificant results on one or more neuropsychological tests without presenting the data.
A range of 2 SDs below and above the global effect size was used to truncate outliers. In two study subsets, i.e., studies assessing testosterone at random times and studies not sponsored by the industry, no studies were below or above this range; thus, no sensitivity analyses with truncated outliers were conducted.