Literature DB >> 27291501

Psychometric testing of two new patient-reported outcome instruments for the evaluation of treatment for hypogonadism.

R P Hayes1, X Ni1, D E Heiselman1, K Kinchen1.   

Abstract

AIM: The aim of this study was to perform psychometric testing and estimate minimal important change (MIC) of two new patient-reported outcome (PRO) instruments - Sexual Arousal, Interest and Drive Scale (SAID) and Hypogonadism Energy Diary (HED).
METHODS: New PRO instruments were administered immediately after screening (Time 1, test-retest subset only) and immediately prior to both randomisation (Time 2) and end-point (Time 3) to men participating in a randomised clinical trial comparing the effect of testosterone solution 2% (TS) and placebo on serum total testosterone. Psychometric analyses included reliability, validity and responsiveness. Total scores for both PRO instruments were transformed to a 0-100 scale.
RESULTS: Study participants (n = 694) were 80% age ≤ 65 years, 79% White, with mean baseline testosterone = 202 ng/dl. Clinicians identified 86% subjects as having low sex drive, 86% with low energy and 76% with both symptoms. Reliability analyses for SAID and HED yielded reliability coefficients > 0.70. SAID scores discriminated between men having low sex drive (n = 553) and those who did not (n = 80) (34.5 vs. 42.8, p < 0.001). HED scores discriminated between men having low energy (n = 541) and those who did not (n = 64) (48.9 vs. 60.2, p < 0.001). In the men randomised to TS (vs. placebo), SAID and HED detected effect sizes of 0.61 (vs. 0.39) and 0.68 (vs. 0.48), respectively. MIC estimates for SAID and HED were approximately 10 and 8, respectively.
CONCLUSIONS: This study provided evidence of the reliability, validity and responsiveness of SAID and HED as measures of sex drive and energy, respectively, making them potentially useful for evaluation of hypogonadal treatment.
© 2016 Eli Lilly & Company. International Journal of Clinical Practice Published by John Wiley & Sons Ltd.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27291501      PMCID: PMC5089588          DOI: 10.1111/ijcp.12828

Source DB:  PubMed          Journal:  Int J Clin Pract        ISSN: 1368-5031            Impact factor:   2.503


What's known

Loss of sex drive and loss of energy are symptoms frequently reported by men with hypogonadism. To adequately evaluate the benefit of hypogonadal treatment, these symptoms should be assessed using patient‐reported outcome (PRO) instruments. The content validity of two new PRO instruments assessing sex drive (SAID) and energy (HED) has been established in men with both early and late onset hypogonadism.

What's new

This article describes psychometric testing of SAID and HED in men with hypogonadism participating in a clinical trial. Psychometric testing suggests that these two instruments are reliable, valid, and responsive to treatment and that score improvement of 10 points for SAID and 8 for HED may represent an important change to patients. These instruments add value to the evaluation of treatment of hypogonadism by providing an assessment of patient‐reported benefit.

Introduction

Many men with hypogonadism report decreased sex drive, decreased energy or both as symptoms of their hypogonadism 1, 2, 3, 4, 5, 6, 7. There are multidimensional instruments that assess either or both symptoms but there are few, if any, instruments focused on accurately and reliably assessing sex drive and energy as unidimensional concepts. More importantly, there are few, if any, existing instruments that assess these symptoms and have been developed according to regulatory guidance that stresses the importance of patient input in the item development process 8. In a previous article, Hayes et al. described the development of the Sexual Arousal, Interest, and Drive Scale (SAID) and the Hypogonadism Energy Diary (HED) 9. Results of a large qualitative study involving men with both late and early onset hypogonadism led to the identification of concepts that were deemed important and relevant to men with hypogonadism 9. These concepts served as the basis for the SAID and HED item generation. Cognitive interviewing provided evidence that items were comprehensive and potential respondents’ interpretations of items were consistent with intended meanings. Cognitive interviewing also established the equivalency of the paper‐based SAID and HED, with an electronic (ePRO) version appearing on an electronic handheld device. The purpose of these two new patient‐reported outcome PRO instruments was to serve as key secondary end‐points in clinical trials of hypogonadism treatment. However, before the value of the SAID and HED can be determined, evidence of their psychometric properties is needed. The aim of this study was to address the following psychometric research questions for each of the PRO instruments in a sample of men with hypogonadism: What are the factor structure and reliability (internal consistency and test–retest or reproducibility) of the SAID and HED? How well do the SAID and HED correlate with similar existing instruments (i.e. have convergent validity)? How well do the SAID and HED discriminate between those men clinically identified as having symptoms of loss of sex drive or loss of energy, respectively, and those who are not clinically identified as having these symptoms (known group validity)? How responsive (i.e. ability to detect change) are the HED and SAID to symptom improvement as a result of treatment for hypogonadism? If the SAID and HED are responsive to change in self‐reported loss of sex drive or loss of energy, what is the change in SAID and HED scores that represents patient‐perceived benefit [i.e. minimal important change (MIC)] 10, 11.

Methods

This study was conducted in the context of a multicenter, randomised, double‐blind, parallel, placebo‐controlled trial designed to assess the effect of testosterone solution on total testosterone (primary end‐point), sex drive, and energy in men with hypogonadism (ClinicalTrials.gov, number NCT01816295) 12. This study included a protocol addendum of which the primary objective was to evaluate test–retest reliability and perform an exploratory factor analysis (EFA) of the SAID and HED with a subset of study participants. To obtain data for the addendum, an interim blinded data snapshot was created.

Study participants

Eligibility criteria for this study were the same as those for the primary clinical trial (TSAT) and included the following: (i) being male and at least 18 years of age; (ii) having two total testosterone levels < 300 ng/dl (10.4 nmol/l) taken at least 1 week apart and (iii) having been identified by a clinician as having at least one symptom of testosterone deficiency, including decreased energy or decreased sex drive 12.

Procedure

Under the protocol addendum, ePRO versions of the SAID and HED were administered twice within the 3‐week period between Visit 2 (screening) and Visit 3 (randomisation) to a subset of study participants. That is, the first 7‐day administration (Time 1) occurred during the 7‐day period immediately after Visit 2, and the second 7‐day administration (Time 2) occurred during approximately the 7‐day period prior to Visit 3 or randomisation. Men completed the HED three times per day for seven consecutive days. On the seventh day, the SAID was administered only once and prior to the administration of the HED for that day. In addition, the ePRO version of the Psychosexual Daily Questionnaire (PDQ) was administered for all 7 days after the last administration of the HED for the day. The SAID, HED and PDQ were administered on the same handheld ePRO device. The decision to administer the three PROs as ePROs was based on the findings that electronic administration is the optimal approach for ensuring data integrity of patient diaries (i.e. HED and PDQ) 13. For the remaining psychometric analyses (e.g. responsiveness), the same procedure as was used in the protocol addendum was used with all study participants at Time 2 and during approximately the 7‐day period prior to Visit 12 or study end‐point (Time 3, approximately 12 weeks after Time 2). The Sexual Desire domain of the International Index of Erectile Function Questionnaire (IIEF), also included in the psychometric testing, was administered at Visit 3. This study was conducted in accordance with the ethical principles originating in the Declaration of Helsinki, good clinical practices and all applicable laws and regulations. The institutional review board at each site approved the primary study and the protocol addendum, and all men provided written informed consent before participating in the study or addendum.

Patient‐reported outcome instruments

Sexual Arousal, Interest and Drive Scale (SAID)

The SAID is a five‐item self‐administered instrument intended to assess the following in men with hypogonadism: the level of thinking about sex (2 items), arousal (1 item), and rating the level of interest in sex and sex drive (2 items). Men with hypogonadism are asked to recall the past 7 days and respond to all five items on 5‐point Likert‐type scales scored from 1 to 5, with 5 corresponding to greater levels of sexual arousal, interest or drive 9. A total score is obtained by summing item scores, and dividing by the number of items (n = 5) with higher scores corresponding to greater sex drive. When used with other PROs, the SAID total score is linearly transformed to a scale of 0–100 to facilitate comparisons.

Hypogonadism Energy Diary (HED)

The HED is a self‐administered instrument intended to assess real‐time energy levels, including the extent to which a respondent feels energetic or has feelings of tiredness/exhaustion. There are two unique items administered three times during the day: approximately 2‐h after waking (proposed time for administration begins at 8 a.m., with three reminder alarms over a 2‐h period), late afternoon (proposed time for administration begins at 3 p.m., with three reminders over a 2‐h period), and late evening (proposed time of administration begins at 8 p.m., with three reminders over a 2‐h period). Men with hypogonadism respond to both energy items using an 11‐point numerical rating scale, with 10 corresponding to full of energy or extreme tiredness. The tiredness item is then reverse scored so that higher total scores correspond to greater levels of energy 9. For each of the six HED daily items (two questions for three times per day), a weekly score is derived as the 7‐day average for that item. The total HED scale score is the sum of each of the six item weekly scores, with higher scores corresponding to greater energy level. When used with other PRO instruments, the HED total score is linearly transformed to a scale of 0–100 to facilitate comparisons.

Psychosexual Daily Questionnaire (PDQ)

The PDQ was developed to assess sexual function and mood on a daily basis. The PDQ has been used in previous studies of men with hypogonadism receiving testosterone replacement therapy 14. Although the entire PDQ was administered at Times 1–3, only one item pertaining to the overall level of sexual desire [rated on a 7‐point numerical rating scale from 1 (none) to 7 (very high)] and two items pertaining to the extent to which full of energy or tired describes the respondent [rated on a 7‐point numerical rating scale from 1 (not at all true) to 7 (very true)] administered at Time 2 were used in the psychometric analyses.

International Index of Erectile Function Questionnaire (IIEF)

The IIEF was developed to assess levels of erectile dysfunction (ED) in men who are sexually active with the same female partner. The IIEF is commonly used to assess therapeutic efficacy of ED therapy 15. Although the entire IIEF was administered at Visit 3, only the Sexual Desire domain (Questions 11 and 12 pertaining to the feeling and rating of sexual desire in the past 4 weeks as rated on a 5‐point Likert‐type scale) was used for the psychometric analyses.

Patient Global Impression and Improvement (PGI‐I) Scales

The PGI‐I, adapted from trials of other conditions 16, 17, was used to develop a measure of a patient's perception of changes in both their sexual drive (PGI‐I‐S) and energy level (PGI‐I‐E) at end of the double‐blind treatment phase (Visit 8). Each questionnaire asks the patient to rate, on a 7‐point numerical rating scale from 1 (very much better) to 7 (very much worse), how his PGI‐I‐S or PGI‐I‐E is now, compared with how it was before he began taking medication.

Statistical analysis

With the exception of the analysis for known group validity, the SAID was psychometrically tested only in men who were identified by their clinician as having a reported history of decreased sex drive (Low Sex Drive analysis set). Correspondingly, with the exception of the analysis for known group validity, the HED was psychometrically tested only in men who were identified by their clinician as having a reported history of decreased energy (Low Energy Analysis set). It was anticipated that data from some men would be included in both the Low Sex Drive and the Low Energy analysis sets. All psychometric analyses were performed using the final data from all TSAT participants with the exception of test–retest reliability and exploratory factor analysis (EFA), which was based on a subset of study participants through a protocol addendum (and an interim blinded data snapshot). For the protocol addendum, sample size considerations included: (i) a minimum of 75 men for the test–retest reliability analysis based on Bonett 18, assuming an intraclass correlation coefficient (ICC) of 0.75 with a 95% confidence interval (CI) width of 0.2 for each of the analysis sets (Low Sex Drive and Low Energy) and (ii) a minimum of 100 men for the EFAs performed for SAID and HED items. Unless otherwise specified, all psychometric analyses were based on usable questionnaires defined as follows at each visit: for the SAID, subjects had to provide at least four complete items of five (i.e. ≥ 80%, or no more than one item was missing) and for each item of the HED, subjects had to provide data for at least 5 of 7 days (no more than two missing responses per item). Missing responses in usable questionnaires were imputed using the average of the non‐missing responses. Note that for the HED, imputations were performed as necessary for each item to obtain the item weekly scores, which were then summed to derive the total score as previously described.

Descriptive statistics

Frequency distributions, inter‐item correlations, and item‐total correlations were evaluated for individual items of the SAID and HED using data from Time 2 for all TSAT study participants.

Factor analysis

Both the SAID and HED were designed to measure a single (i.e. unidimensional) concept: sex drive and energy, respectively. To evaluate the factor structure of each PRO instrument so as to make decisions regarding the use of a SAID and HED total score as sum of item scores, an EFA was performed for the SAID and the HED. For the HED, the item weekly scores (7‐day average) were used for the factor analysis. Factors were extracted using the principal component analysis with varimax rotation. Factors associated with Eigenvalue ≥ 1 were retained. Data for these analyses were obtained at Time 2 from the subset of men participating in the protocol addendum.

Reliability

Test–retest reliability for the 2 PRO instruments was assessed by calculating the ICC and 95% CI between the Time 1 (7‐day period immediately after Visit 2) and Time 2 (7‐day period prior to Visit 3 or randomisation) administrations based on a one‐way random analysis of variance (ANOVA) model, according to Shrout and Fleiss 19. Data for these analyses were obtained at Time 2 from the subset of men participating in the protocol addendum. Internal consistency of the SAID and HED was assessed by calculating Cronbach's alpha coefficient using data from Time 2 for all study participants. ICC coefficients > 0.60 and Cronbach's alpha > 0.70 were considered acceptable.

Construct validity (convergent and known group validity)

For convergent validity, it was hypothesised that the Pearson correlation coefficients calculated between SAID total scores (administered at Time 2) and both the IIEF Sexual Desire domain scores and the weekly average of the PDQ overall level of sexual desire item (both administered at Visit 3) would be significant and > 0.60. Similarly, it was hypothesised that the Pearson correlation coefficients calculated between the HED scores and the weekly average of the PDQ ‘full of pep/energetic?’ and ‘tired?’ items would be significant and > 0.60. The ‘tired’ item was reverse scored to be consistent with the scoring of HED in which a higher score corresponds to more positive outcome. For known group validity, it was hypothesised that significant differences would be observed in SAID and HED scores between men clinician‐identified with decreased sex drive or decreased energy, respectively, and those who had not been clinician‐identified as having the symptom. Student's t‐tests were used to detect the hypothesised differences.

Responsiveness

Responsiveness of the SAID and HED to changes in sex drive and energy, respectively, were tested by performing a paired t‐test using data from all study participants from Time 2 (baseline) and Time 3 (end‐point). Responsiveness indices, including effect size (mean change divided by the standard deviation of the baseline score) and standardised response mean [(SRM), mean change divided by the standard deviation of the change score], were calculated.

MIC

To estimate MIC, Crosby et al. 20 recommend an integrated approach that includes both distribution‐ and anchor‐based methods. Distribution‐based methods use a statistical property [e.g. standard deviation (SD)] of the sample or PRO measure to estimate MIC, while anchor‐based methods compare changes in the PRO measure with, for example, patient impression of change or other clinically relevant variables. For this study, distribution‐based methods were 0.5 SD and 1 standard error (SE) of measurement [i.e. 1 SEM (square root of 1 minus reliability multiplied by baseline SD)]. These two statistics have been shown to be good approximations of MIC determined by other methods 10, 11, 21, 22. For an anchor‐based method, the SAID and HED scores were compared with the corresponding PGI‐I‐S and PGI‐I‐E, respectively. Respondents to the PGIs were categorised as either reporting ‘no change to very much worse’ or ‘a little better to very much better’ regardless of treatment group. The average SAID and HED scores were then compared between the groups. All statistical tests were based on a two‐side alpha level of 0.05. All analyses were performed using SAS version 9.2 software (SAS Institute, Inc., Cary, NC).

Results

The majority of men for whom data were included in one or more of the psychometric analyses (n = 694) were < 65 years of age (80%) and white (79%) with a mean total testosterone level (average of testosterone levels taken at Visits 1 and 2 at least 7 days apart) of 202 ng/dl. Clinicians identified 86% subjects as having low sex drive, 86% as having low energy, and 76% as having both (Table 1). Of the 595 men in the Low Sex Drive analysis set (usable SAID questionnaire), 97 participated in and met criteria for (complete data for Time 1 and Time 2) the test–retest analysis. Of the 599 men in the Low Energy analysis set (usable HED questionnaire), 127 met criteria for the test–retest analysis.
Table 1

Baseline characteristics for subjects in all psychometric analysis dataset with usable ePRO

Patient characteristicsLow sex drive (N = 595)Low energy (N = 599)Low sex drive or low energy or both (N = 694)
Mean (SD)Min–maxMean (SD)Min–maxMean (SD)Min–max
Age (years)55.7 (10.7)19–8555.1 (11.0)19–9255.4 (11.0)19–92
n % n % n %
Age groups
< 65 years473804848155480
≥ 65 years122211151914020
Race (% White)468794727954579
Region (% North America)382644036745265
Mean (SD)Min–maxMean (SD)Min–maxMean (SD)Min–max
Body mass index30.6 (4.2)11–3930.7 (4.2)11–3930.6 (4.1)11–39
Mean (SD)Min, MaxMean (SD)Min, MaxMean (SD)Min, Max
Total testosterone (average of Visit 1 and Visit 2) (ng/dL)202.5 (67.4)4.3, 296.8200.5 (68.0)4.3, 296.8201.5 (67.1)4.3, 296.8
n % n % n %
Clinician‐identified symptom
Decreased sexual drive7312007411
Low energy0089159113
Both522885108552976

ePRO, electronic patient‐reported outcome.

Baseline characteristics for subjects in all psychometric analysis dataset with usable ePRO ePRO, electronic patient‐reported outcome.

Descriptive analysis

For the five SAID items, frequency distributions showed mean and median item scores ranging from 2.0 to 2.5 (on a scale of 1–5, with higher scores corresponding to greater sex drive) (Table 2). Inter‐item correlations for SAID items ranged from 0.48 to 0.70 (all p < 0.001). For the six HED items, frequency distributions showed mean and median item scores for HED ranging from 4.4 to 5.2 (on a 0–10 numerical rating scale, with higher scores corresponding to more energy) (Table 3). Inter‐item correlations for all six HED items ranged from 0.38 to 0.88 (all p < 0.001).
Table 2

Item and Total Scale Statistics for SAID at Time 2 (prior to Visit 3 or randomisation)

Abbreviated itemsMean (SD) (n = 553)Median (n = 553)Min–max (n = 553)Inter‐item correlations (n = 553)Factor loadings (n = 175)Item‐total correlation (n = 553)
…THINK about sexual activity?2.5 (0.9)2.01–50.56–0.700.880.76
…FANTASIZE about sexual activity?2.4 (0.9)2.01–50.48–0.700.850.72
…PHYSICALLY FEEL a sense of sexual arousal, or a PHYSICAL stirring or tingling of arousal?2.3 (1.0)2.01–50.49–0.590.770.65
…rate your level of interest in sex?2.5 (0.8)2.01–50.56–0.670.860.76
…rate your sex drive?2.2 (0.7)2.01–50.48–0.670.740.65
SAID Scale Transformed Total Score (0–100)34.5 (17.8)30.00–90

Scoring: Higher item and total scores correspond to greater sex drive, scores range from 1 to 5 and 0–100, respectively. Factor analysis (performed with data from Protocol Addenda participants only): eigenvalue = 3.4, Variance explained = 67% (n = 175). Internal consistency: Cronbach's alpha = 0.87 (n = 553). Test–retest Reliability: Intraclass correlation coefficient = 0.75 (n = 97, 95% CI = 0.65–0.83). SAID, Sexual Arousal, Interest and Drive; SD, standard deviation.

Table 3

Item and Total Scale Statistics for HED

Abbreviated Items (7‐day average)Mean (SD) (n = 541)Median (n = 541)Min–max (n = 541)Inter‐item correlations (n = 541)Factor loadings (n = 204)Item‐total correlation (n = 541)
Full of energy (morning)4.8 (1.9)4.80.0–10.00.38–0.760.790.69
Not at all tired (morning)5.2 (2.0)5.10.0–10.00.44–0.730.770.71
Full of energy (afternoon)5.0 (1.8)5.00.0–10.00.53–0.880.910.82
Not at all tired (afternoon)5.2 (1.8)5.00.0–10.00.48–0.860.880.81
Full of energy (evening)4.6 (1.8)4.60.1–10.00.44–0.880.830.78
Not at all tired (evening)4.6 (1.9)4.40.0–10.00.38–0.860.790.74
HED Transformed Total Score (0–100)48.9 (15.6)48.82.9–95.0

Scoring: Higher item and total scores correspond to greater energy, scores range from 1 to 10 and 0–100, respectively. Factor analysis (performed with data from Protocol Addenda participants only): Eigenvalue = 4.1, Variance explained = 69% (n = 204). Internal consistency: Cronbach's alpha = 0.91 (n = 541). Test–retest reliability: Intraclass correlation coefficient = 0.88 (n = 127, 95% CI = 0.83–0.91). HED, Hypogonadism Energy Diary; SD, standard deviation.

Item and Total Scale Statistics for SAID at Time 2 (prior to Visit 3 or randomisation) Scoring: Higher item and total scores correspond to greater sex drive, scores range from 1 to 5 and 0–100, respectively. Factor analysis (performed with data from Protocol Addenda participants only): eigenvalue = 3.4, Variance explained = 67% (n = 175). Internal consistency: Cronbach's alpha = 0.87 (n = 553). Test–retest Reliability: Intraclass correlation coefficient = 0.75 (n = 97, 95% CI = 0.65–0.83). SAID, Sexual Arousal, Interest and Drive; SD, standard deviation. Item and Total Scale Statistics for HED Scoring: Higher item and total scores correspond to greater energy, scores range from 1 to 10 and 0–100, respectively. Factor analysis (performed with data from Protocol Addenda participants only): Eigenvalue = 4.1, Variance explained = 69% (n = 204). Internal consistency: Cronbach's alpha = 0.91 (n = 541). Test–retest reliability: Intraclass correlation coefficient = 0.88 (n = 127, 95% CI = 0.83–0.91). HED, Hypogonadism Energy Diary; SD, standard deviation.

Factor analysis

Both EFAs performed yielded only one factor with an eigenvalue > 1.0. For the five SAID items, this factor had an eigenvalue of 3.4 (67% variance explained) and factor loadings ≥ 0.74 (Table 2). For the six HED items, this factor had an eigenvalue of 4.1 (69% variance explained) and factor loadings ≥ 0.77 (Table 3). Both factor analyses suggested that the SAID and HED assess unidimensional concepts; therefore, SAID and HED item scores were summed for SAID and HED total scores, respectively. The mean and median SAID transformed total score (0–100 scale) was 34.5 and 30.0, respectively (Table 2). The mean and median HED transformed total score (0–100 scale) was 48.9 and 48.8 (Table 3).

Evaluation of reliability

Cronbach's alpha (internal consistency) calculated for the SAID items was 0.87, with item‐total correlations ranging from 0.65 to 0.76 (Table 2). Cronbach's alpha calculated for the HED items was 0.91, with item‐total correlations ranging from 0.69 to 0.82 (Table 3). The ICC coefficient (test–retest reliability) was 0.75 (95% CI = 0.65–0.83) for the SAID (Table 2) and 0.88 (CI = 0.83–0.91) for the HED (Table 3).

Construct validity

SAID total scores were significantly (p < 0.05) positively correlated with both the IIEF Sexual Desire domain scores (r = 0.64, n = 515) and the PDQ sexual desire question (r = 0.68, n = 519). HED total scores were significantly (p < 0.05) positively correlated with both the PDQ ‘full of pep/energetic?’ (r = 0.76, n = 535) and reversed ‘tired?’ items (r = 0.66, n = 534) (Table 4).
Table 4

Convergent validity for SAID and HED

Patient‐reported outcome instrumentIIEF Sexual Desire domain (n = 515) r (95% CI)PDQ Weekly average of ‘overall level of sexual desire’ item (n = 519) r (95% CI)PDQ Weekly average of ‘full of pep/energetic?’ item (n = 535) r (95% CI)PDQ Weekly average of ‘tired?’ itema (n = 534) r (95% CI)
SAID Scale0.64 (0.58–0.69)0.68 (0.64–0.73)
HED0.76 (0.72–0.79)0.66 (0.71–0.61)

PDQ ‘Tired’ item was reverse scored to be consistent with the scoring of the HED in which a higher score corresponds to more positive outcome. SAID, Sexual Arousal, Interest and Drive; HED, Hypogonadism Energy Diary; IIEF, International Index of Erectile Function; PDQ, Psychosexual Daily Questionnaire; r, Pearson correlation coefficient; CI, confidence interval.

Convergent validity for SAID and HED PDQ ‘Tired’ item was reverse scored to be consistent with the scoring of the HED in which a higher score corresponds to more positive outcome. SAID, Sexual Arousal, Interest and Drive; HED, Hypogonadism Energy Diary; IIEF, International Index of Erectile Function; PDQ, Psychosexual Daily Questionnaire; r, Pearson correlation coefficient; CI, confidence interval. The mean SAID total scores of men who were identified by their clinicians as having low or decreased sex drive (n = 553) were significantly different from the mean scores of those men not identified (n = 80) (34.5 vs. 42.8, respectively; p < 0.001). The mean HED total scores of men who were identified by their clinicians as having low or decreased energy (n = 541) were significantly different from the mean scores of those men who were not identified (n = 64) (48.9 vs. 60.2, respectively; p < 0.001; Table 5).
Table 5

Known‐group validity of SAID and HED

Patient‐reported outcome instrumentDecreased or low sexual drive Mean (SD)No decreased or low sexual drive Mean (SD)p
SAID Scale n = 553 34.5 (17.8) n = 80 42.8 (17.9) < 0.001
HED n = 541 48.9 (15.6) n = 64 60.2 (18.3) < 0.001

SAID, Sexual Arousal, Interest and Drive; HED, Hypogonadism Energy Diary; SD, standard deviation.

Known‐group validity of SAID and HED SAID, Sexual Arousal, Interest and Drive; HED, Hypogonadism Energy Diary; SD, standard deviation.

Responsiveness

From baseline to end‐point, SAID scores significantly (p < 0.001) improved in the men with low sex drive and randomised to testosterone solution (n = 244), with effect size and SRM of 0.61 and 0.63, respectively (Table 6). From baseline to end‐point, HED scores significantly (p < 0.001) improved in men with low energy and randomised to testosterone solution (n = 230), with effect size and SRM of 0.68 and 0.74, respectively.
Table 6

Evaluation of responsiveness of SAID and HED from baseline to end‐point for treatment arm and placebo

Instrument n Mean changep‐valueEffect sizeSRM
SAID
Testosterone solution24411.0< 0.0010.610.63
Placebo2576.7< 0.0010.390.40
HED
Testosterone solution23010.6< 0.0010.680.74
Placebo2437.4< 0.0010.480.46

SAID, Sexual Arousal, Interest and Drive; HED, Hypogonadism Energy Diary; SRM, standardised response mean.

Evaluation of responsiveness of SAID and HED from baseline to end‐point for treatment arm and placebo SAID, Sexual Arousal, Interest and Drive; HED, Hypogonadism Energy Diary; SRM, standardised response mean.

MIC

The two distribution‐based methods used to estimate MIC in SAID and HED scores were 0.5 SD and 1 SEM. For SAID in men randomised to testosterone solution (n = 244), these were calculated as 9.1 and 5.1, respectively. Corresponding values for HED in men randomised to testosterone solution (n = 230), were calculated as 7.9 and 3.7, respectively. MIC was also estimated by ‘anchoring’ SAID and HED scores to PGI‐I‐S and PGI‐I‐E, respectively. For SAID, patients who improved according to the PGI‐I‐SD scored 9.6 points higher on average than those patients who reported no change or worsening (p < 0.001). For HED, patients who improved according to the PGI‐I‐E scored 8.3 points higher on average than those patients who reported no change or worsening (p < 0.001).

Discussion

The SAID and HED are two new PRO instruments that have been developed to assess two common symptoms of hypogonadism – loss of (or decreased) sex drive and loss of (or decreased) energy. These two instruments were developed according to regulatory guidance that stresses the need for qualitative study to ensure patient input into the item development process 8. The overall goal of this study was to provide the initial step in the extensive and continued psychometric testing required to establish the reliability, validity and responsiveness of all new evaluation instruments 8. Through the addendum to a clinical trial, we explored the factor structure and evaluated the test–retest reliability. In general, the SAID and HED appear to have acceptable psychometric properties when administered to men with hypogonadism with decreased sex drive and decreased energy, respectively. For a PRO instrument to be useful as a treatment evaluation measure, total score ceiling effects (endorsement of the highest possible positive outcome score) should be small in untreated populations, thereby indicating room for positive movement in scores with an intervention or treatment 23. SAID and HED item score distributions showed little to no ceiling effects. In addition, the mean item and total scores for both instruments were in the middle of their respective score ranges. These results suggest that the SAID and HED scores have the potential for positive movement (i.e. improved score) with treatment. Results of the factor analyses performed through an addendum protocol suggest that total scores derived from summing of SAID or HED item scores (i.e. one domain) are productive for measurement. The internal consistency reliability analyses also suggested one domain for both the SAID and HED with item‐total correlations for all items for both ≥ 0.65. Construct validity was demonstrated for the SAID and HED both in terms of the correlations between their total scores and items or domains measuring similar constructs (i.e. convergent validity) and their ability to discriminate between men who were identified by their clinicians as having low or decreased sex drive or energy as a symptom of their hypogonadism from those who were not identified by their clinicians (i.e. known group validity). The intent of the SAID, as well as the HED, is to serve as an instrument to evaluate treatment benefit. The most important type of reliability for an evaluative instrument is to demonstrate test–retest reliability or stability between two time points in which no known change in the construct being measured has occurred. Both the SAID and HED demonstrated good test–retest reliability. Following test–retest reliability, the true test of an evaluative instrument is the extent to which the change in scores from baseline to end‐point detects an improvement in the construct being measured. Both SAID and HED detected significant improvements in sex drive and energy, respectively, for men randomised to the testosterone treatment group. Moderate to large effect sizes (0.60–0.70) were observed. In the primary study 12, efficacy analyses comparing testosterone solution to placebo for the SAID was significant at alpha level of 0.05 and reached the prespecified more stringent alpha level of 0.01. For the subset with low energy at baseline, participants assigned testosterone solution also showed a statistically significant baseline to end‐point improvement in HED scores compared with those assigned placebo (p = 0.02); however, the difference did not reach the prespecified significance level of p < 0.01 12. Further research may be needed to clarify whether administration of the HED as a diary three times a day for a 7‐day period at baseline and end‐point is the optimal timing to capture improvements in the fluctuations in energy reported by men with hypogonadism 9. The distribution‐based methods for determining MIC indicated an approximately 5–9 point change in the SAID and an approximately 4–8 point change in the HED. An anchor‐based approach, which was based on patient input, suggested a MIC of approximately 10 points for the SAID and eight points for the HED. Thus, for sample size estimation aimed at demonstrating a change in which patients actually perceive a difference in their sexual drive or their energy level, the MIC may be closer to nine points for the SAID and eight points for the HED on a 0–100 scale. It should be noted, however, that, as with other psychometric properties of instruments, the MIC is sample‐specific. Additional research is needed to provide support for the MIC estimates reported in this study. The results of the previous manuscript by Hayes et al. suggest that the SAID and HED meet current regulatory standards 8 for content validity 9. The research reported here provides preliminary psychometric evidence to support the use of these instruments in the evaluation of treatment for hypogonadism. However, the suitability of PRO instruments as end‐points for inclusion in product labelling is determined on a case‐by‐case basis by regulatory agencies and takes into account study population and desired indications. Moreover, all psychometric analyses are sample‐specific and will need to be repeated for each unique patient population of interest. Future studies will increase our understanding of the reliability and validity of the SAID and HED and their potential for aiding clinicians in the assessment of treatment benefit.

Author contributions

RPH contributed to the development of the study concept and design, interpreted the results and drafted the manuscript. XN contributed to the study concept and design, supervised the statistical analysis, interpreted the results and critically revised the manuscript for important intellectual content. DEH and KK contributed to study concept and design, supervised the study, and critically revised the manuscript for important intellectual content. All authors had full access to the data and take full responsibility for the integrity of the data and the accuracy of the data analysis.
  22 in total

1.  Andropause and quality of life: findings from patient focus groups and clinical experts.

Authors:  Annoesjka Novák; Meryl Brod; Jolanda Elbers
Journal:  Maturitas       Date:  2002-12-10       Impact factor: 4.342

2.  Sample size requirements for estimating intraclass correlations with desired precision.

Authors:  Douglas G Bonett
Journal:  Stat Med       Date:  2002-05-15       Impact factor: 2.373

3.  A methodological framework for assessing health indices.

Authors:  B Kirshner; G Guyatt
Journal:  J Chronic Dis       Date:  1985

4.  Establishing the content validity of the Sexual Arousal, Interest, and Drive Scale and the Hypogonadism Energy Diary.

Authors:  R P Hayes; J Henne; K S Kinchen
Journal:  Int J Clin Pract       Date:  2014-11-07       Impact factor: 2.503

5.  An integrated method to determine meaningful changes in health-related quality of life.

Authors:  Ross D Crosby; Ronette L Kolotkin; G Rhys Williams
Journal:  J Clin Epidemiol       Date:  2004-11       Impact factor: 6.437

6.  Linking measurement error to minimal important change of patient-reported outcomes.

Authors:  Caroline B Terwee; Leo D Roorda; Dirk L Knol; Michiel R De Boer; Henrica C W De Vet
Journal:  J Clin Epidemiol       Date:  2009-02-20       Impact factor: 6.437

7.  Validation of two global impression questionnaires for incontinence.

Authors:  Ilker Yalcin; Richard C Bump
Journal:  Am J Obstet Gynecol       Date:  2003-07       Impact factor: 8.661

8.  American Association of Clinical Endocrinologists Medical Guidelines for clinical practice for the evaluation and treatment of hypogonadism in adult male patients--2002 update.

Authors:  Steven M Petak; Howard R Nankin; Richard F Spark; Ronald S Swerdloff; Luis J Rodriguez-Rigau
Journal:  Endocr Pract       Date:  2002 Nov-Dec       Impact factor: 3.443

9.  Standard error of measurement as a valid alternative to minimally important difference for evaluating the magnitude of changes in patient-reported outcomes measures.

Authors:  Javier Rejas; Antonio Pardo; Miguel Angel Ruiz
Journal:  J Clin Epidemiol       Date:  2007-10-15       Impact factor: 6.437

10.  Effect of Testosterone Solution 2% on Testosterone Concentration, Sex Drive and Energy in Hypogonadal Men: Results of a Placebo Controlled Study.

Authors:  Gerald Brock; Darell Heiselman; Mario Maggi; Sae Woong Kim; José M Rodríguez Vallejo; Hermann M Behre; John McGettigan; Sherie A Dowsett; Risa P Hayes; Jack Knorr; Xiao Ni; Kraig Kinchen
Journal:  J Urol       Date:  2015-10-20       Impact factor: 7.450

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.