Literature DB >> 29769073

Measuring bothersome menopausal symptoms: development and validation of the MenoScores questionnaire.

Kamma Sundgaard Lund1, Volkert Dirk Siersma2, Karl Bang Christensen3, Frans Boch Waldorff4, John Brodersen5,6.   

Abstract

BACKGROUND: The experience of menopausal symptoms is common and an adequate patient-reported outcome measure is crucial in studies where women are treated for these symptoms. The aims of this study were to identify a patient-reported outcome measure for bothersome menopausal symptoms and, in the absence of an adequate tool, to develop a new measure with high content validity, and to validate it using modern psychometric methods.
METHODS: The literature was reviewed for existing questionnaires and checklists for bothersome menopausal symptoms. Relevant items were extracted and subsequently tested in group interviews, single interviews, and pilot tests. A patient-reported outcome measure was drafted and completed by 1504 women. Data was collected and psychometrically validated using item-response theory Rasch Models.
RESULTS: All questionnaires identified in the literature lacked content validity regarding bothersome menopausal symptoms and none were validated using item-response theory. Our content validation resulted in a draft measurement encompassing 122 items across eight domains. Following psychometrical validation, the final version of our patient-reported outcome measure, named the MenoScores Questionnaire, encompassed 51 items, including one single item, covering 11 scales.
CONCLUSION: Menopausal symptoms are multidimensional with some symptoms unquestionably related to the menopausal transition. We identified four constructs of importance: hot flushes, day-and-night sweats, general sweating, and menopausal-specific sleeping problems. The MenoScores Questionnaire is condition-specific with high content validity and adequate psychometrical properties. It is designed to measure bothersome menopausal symptoms and all scales are developed and psychometrically validated using item-response theory Rasch Models. TRIAL REGISTRATION: Approved by the Danish Data Agency (J.nr. 2015-41-4057). Ethics Committee approval was not required.

Entities:  

Keywords:  Item response theory; Menopausal symptoms; Menopause; Patient-reported outcome measures; Psychometrics; Questionnaire; Rasch analysis; Scale validation

Mesh:

Year:  2018        PMID: 29769073      PMCID: PMC5956969          DOI: 10.1186/s12955-018-0927-6

Source DB:  PubMed          Journal:  Health Qual Life Outcomes        ISSN: 1477-7525            Impact factor:   3.186


Background

Menopause is the cessation of women’s menstruation and can be determined retrospectively 12 months after the final menstrual period (FMP) [1, 2]. On average, women experience the menopausal transition in their mid-to-late forties [1] and the FMP in their early fifties, with large variations [1, 3, 4]. Around 75% of menopausal women experience hot flushes [5-7] and 10–20% of postmenopausal women find these symptoms very bothersome [5]. Some women also experience night sweats, emotional vulnerability, sleeping difficulties, fatigue, headache, joint and muscle pain, cognitive changes, vaginal dryness, and loss of sexual desire [1, 5, 8–10]. Menopausal symptoms are commonly experienced for 4–5 years in the years before and after the FMP; however, for some women the duration is longer [1, 6, 11]. Menopausal symptoms differ between cultures and ethnic groups, and also between individuals within a homogenous population [12, 13]. Therefore, measuring self-reported menopausal symptoms presents a challenge, and so does the distinction between menopausal symptoms and the symptoms of aging. Several questionnaires regarding menopausal symptoms exist. However, to help women who are bothered by menopausal symptoms it requires a PROM that focuses solely on the bothersome symptoms. Such a PROM must also possess high content validity as well as adequate psychometric properties. Item response theory Rasch models is preferred when establishing ideal measurement psychometric properties such as unidimensionality, invariance (specific objectivity or no differential item functioning), statistical sufficiency and additivity [14-16]. The aims of this study are threefold: 1) To review existing questionnaires and symptoms checklists (which we also refer to as questionnaires) measuring bothersome menopausal symptoms, and, if we cannot identify an adequate existing questionnaire from the literature search then: 2) To develop a patient-reported outcome measure (PROM) for bothersome menopausal symptoms with high content validity, and: 3) To validate this new PROM for dimensionality, invariance, known-groups validity, and reliability using modern psychometric methods.

Methods

The study took place in Denmark and was divided into three phases: 1) a literature review; 2) qualitative interviews securing high face and content validity; 3) a validation survey where the draft PROM was distributed cross-sectionally and the data analyzed using classical test theory (CTT) and item response theory (IRT) models, securing high construct validity of the final PROM.

Phase 1:Literature review

A literature search in PubMed, Embase, and the Cochrane Library was conducted at the end of 2014 and early 2015 to identify existing questionnaires encompassing menopausal symptoms. We also consulted gynaecologists and general practice specialists to locate relevant questionnaires. We included questionnaires that contained at least one item referring to a bothersome menopausal symptom. Questionnaires on the quality of life (i.e. no items referring to specific menopausal symptoms) or concerning interference with or reaction to menopausal symptoms were not included. Questionnaires had to be freely available and written in English, Swedish, Norwegian or Danish. To be interpreted as adequate, the identified questionnaires should have high content validity encompassing items that were up-to-date, not double-barrelled, or ambiguous. Moreover, the psychometric properties of the questionnaires should be assessed using IRT. None of the identified questionnaires fulfilled all the above criteria. Therefore, we extracted an item-pool encompassing unique items about solely bothersome menopausal symptoms from the identified questionnaires. The meaningful content of relevant items was identified and assessed for redundancy, double-barrelled items were divided into separate items, and ambiguous items were rephrased. The items’ response options were not transferred [17]. The subject matter of these items was translated into Danish ad-hoc by KSL and JB. The unique items were grouped into domains by KSL based on clinical experience and the literature, and these were subsequently reviewed by JB. Any discrepancies were discussed until we reached consensus.

Phase 2: Qualitative interviews

To test the content validity (content relevance and content coverage) and the understandability of the unique items, two group interviews were conducted with women bothered by menopausal symptoms. The group interviews were audio-recorded, they lasted for two hours, and were moderated by KSL and JB. The first part of the interview was an open-ended discussion about bothersome menopausal symptoms. If new themes (suggested domains) were revealed in the discussion, we generated new items covering these themes using the women’s verbatim expressions from the audio recordings (see below). These new items were tested in the following group or in single interviews (see below). In the second part of the group interviews, the women were asked to assess if they found the subject matter of the unique items relevant. Items found irrelevant were deleted from the unique item-pool and, in case of lack of content coverage, new items were generated. We subsequently asked the women to which of the stated themes (suggested domain), their symptoms belonged. A draft PROM was created after the first group interview. At the end of the second group interview, the women were asked to complete the draft PROM. The themes (suggested domains), a recall period, and suggestions for response options were discussed. Instructions were tested for understandability. Some symptoms postulated to be caused by menopause could also be caused by aging, therefore a global item was developed: “Have you, within the past three months, been bothered by menopausal symptoms?”, with four response options: “no, not at all”, “yes, a bit”, “yes, quite a bit”, or “yes, a lot”. Later, this global item was used to evaluate the association between women with and without bothersome menopausal symptoms and the scales’ ability to discriminate between the four groups in the global item: none, mild, moderate, or severe bothersome menopausal symptoms. The draft PROM was further tested for functionality, understandability, and content validity in four single interviews conducted by KSL. The women included in these interviews were all bothered by menopausal symptoms. A paper version of the draft PROM was tested in the first two interviews and an online draft version was tested in the two final interviews. If any problems were revealed, they were corrected between interviews. Finally, the online draft PROM was tested for functionality (including the response option) and understandability in four individual pilot tests, followed by a short interview, among women aged 50–64 where two of the women were bothered by menopausal symptoms. The group and single interviews were audio-recorded and we measured the time taken to complete the PROM. Notes and important citations were listed during the interviews. After each interview the recording was audited by KSL and used when the key issues and results from the interviews were analyzed.

Phase 3: Validation survey and analysis

The final draft PROM was distributed by a link (SurveyXact) in emails, social media (Facebook groups for women), project research homepage [18], general practices, and the women’s lifestyle magazine “Liv” [19] (through their online newsletter and Facebook page). Women aged 45–65 years, with and without bothersome menopausal symptoms, were asked to complete the PROM.

Reliability and validity

To secure adequate psychometric properties of the final PROM we conducted Rasch analysis on the data collected verifying if items in each suggested domain fitted a partial credit Rasch model for polytomous items [20]. We tested differential item functioning (DIF) [21, 22], i.e. if items performed differently depending on the variables: occupation, education, living (living alone), smoking, BMI, age, hormonal intrauterine device, bilateral ovariectomized, hysterectomized, having menstruation within the past year. Local dependence (LD) was also evaluated [15, 23], i.e. whether items were correlated beyond what could be expected by measuring the same underlying construct using item screening and log-linear Rasch model tests [24, 25]. Where evidence of DIF and/or LD was disclosed, a log-linear Rasch model was considered indicating a scaling solution with desirable measurement properties [14]. Andersen’s conditional likelihood ratio test (CLR-χ2) was used to evaluate the overall model fit [26] and individual item fit was assessed by comparing observed and expected rank correlation between the item and rest-score (sum of other items in scale) [27]. Items that demonstrated the most problematic properties and/or poor fit were deleted stepwise from the scales, until fit of the Rasch model was achieved. Items with misfit but high face and content validity it were kept as a single item. Cronbach’s alpha was used as a measure of reliability [28, 29]. The Benjamini-Hochberg procedure was used to account for multiple testing [30]. The sum-scores of the resulting Rasch-fitting scales (see below) was tested by comparison to the global item. For each of the four categories of the global item the means and standard deviations (SD) of the sum-scores were calculated and compared using ANOVA; also, the order of the means in a sum-score should reflect the order of the categories of the global item. We calculated the number of subjects needed in a hypothetical randomized trial to find, with 80% power, the difference between the means corresponding to the two last categories of the global item in a t-test with a significance level of 5%; low numbers indicate a high discriminating ability. We used SAS v9.4 and DIGRAM v3.05.3 software. The study was approved by the Danish Data Agency (J.nr. 2015–41-4057). Ethics Committee approval was not required.

Results

Phase 1

We identified 15 questionnaires written in English or Danish in the literature search, many of which referred to each other: Kupperman index [31, 32], Modified Blatt-Kupperman index [33, 34], Greene (1976) [35], Greene climacteric scale (GCS) [36], WHQ [37, 38], MENQOL [39], MENQOL-intervention [40], Menopause symptom list (MSL) [41], Menopause rating scale (MRS) [42], 10-items Cervantes scale (CS-10) [43], Menopause health state classification [44], Menopause health questionnaire [4], Neugarten and Kraines [45], Hvas et al. [46], MQOL [47]. None of the identified questionnaires were adequate in relation to all our adequacy criteria: some were not up-to-date [31, 35, 45], some not sufficiently validated [31, 32, 46] or with missing information about validation [4, 44]. Some had ambiguous or double-barrelled items [35, 36, 42], and some were primarily designed to measure quality of life in menopausal women [39, 40, 47] or economic evaluations of the impact of menopause [44], and not just the level of bothersome menopausal symptoms. None were assessed using IRT. These questionnaires had in total 356 items, of which 126 were unique items divided into five domains (Additional file 1: Appendix 1).

Phase 2

The first group interview included five women (aged 50–63 years), and the second included four women (aged 49–59 years). In the two group interviews 95 (75.4%) of the 126 items were endorsed and 27 new items (five of these due to double-barrelled items) and three new domains were generated (Additional file 1: Appendix 1). In the first group interview it was revealed that hot flushes and day-and-night sweats were experienced as two different things (constructs). Some women were bothered by hot flushes but did not experience day-and-night sweats. Others were bothered by both hot flushes and day-and-night sweats, but described it as different experiences. This was confirmed in the second group interview. The women agreed on a three-month recall period and preferred the four response options; “no, not at all”, “yes, a bit”, “yes, quite a bit”, or “yes, a lot” (Table 1. Item layout). In the sexual domain it was decided to create an additional response option “I do not know” for respondents not sexually active, with or without a partner. These preferences were later confirmed in the single interviews. By the end of the second group interview no new items or domains were generated.
Table 1

Example of item layout and response options

Have you – within the past three months – experienced the following symptoms?
No, not at allYes, a bitYes, quite a bitYes, a lot
I have had hot flushes during the day
I have had hot flushes during the night
Example of item layout and response options Women interviewed individually were aged 50–52 and the women who participated in pilot testing were aged 50–64. In these interviews, almost all comments were about linguistic issues or layout suggestions and only one extra item was desired and another item perceived as redundant and deleted. At this point we achieved data saturation. Finally, one woman requested a comment box at the end of the PROM. Table 2 presents the number and age of participants in the interviews. The final version of the draft PROM encompassed 122 items covering 8 domains (Additional file 2: Appendix 2) and took, on average, 10 min to complete.
Table 2

Number and age of participants (BMS = bothersome menopausal symptoms)

NumberNumber with/without BMSAge (women with BMS)Mean (range)Age (women without BMS)Mean (range)
Group interviews99/052.89 (49–63)
Single interviews44/050.75 (50–52)
Pilot test42/252 (50–54)63 (62–64)
Survey (cross sectional)15041073/43151.97 (45–65)50.69 (45–65)
Number and age of participants (BMS = bothersome menopausal symptoms)

Phase 3

Survey

Within 48 h 1511 women had completed the draft PROM. Seven completed questionnaires were excluded; six respondents were under the age of 45 years and one respondent had ambiguous and inconsistent responses. The characteristics of the remaining 1504 respondents are listed in Tables 2 and 3.
Table 3

Characteristics of respondents (survey)

Characteristics/exogenous variablesTotal no. (% of total)Bothersome menopausal symptomsNo. (%)
Total1504 (100)1073 (71.34)
Age (years):
 45–48394 (26.20)209 (53.05)
 49–52543 (36.10)417 (76.80)
 53–55304 (20.21)264 (86.84)
 56–60205 (13.63)154 (75.12)
 61–6558 (3.86)29 (50.00)
BMI
 0–1810 (0.67)8 (80.00)
 19–24682 (45.77)469 (68.77)
 25–29518 (34.77)379 (73.17)
 ≥ 30280 (18.79)206 (73.57)
 Missing14
Occupationa
 Yes1250 (83.11)893 (71.44)
 No75 (4.99)50 (66.67)
 Sick leave51 (3.39)34 (66.67)
 Retired or similar128 (8.51)96 (75.00)
Education
 No education56 (3.73)41 (73.21)
 Skilled worker, apprentice, assistant nurse or likewise273 (18.16)206 (75.46)
 Short higher educationb < 3 years213 (14.17)167 (78.40)
 Medium higher education 3–4 years633 (42.12)449 (70.93)
 Long higher education > 4 years233 (15.50)141 (60.52)
 Other90 (5.99)63 (70.00)
 Do not know5 (0.33)5 (100)
 Missing1
Living alone
 Yes310 (20.78)214 (69.03)
 No1182 (79.22)851 (72.00)
 Missing12
Smoking
 Yes238 (15.82)179 (75.21)
 No1266 (84.18)894 (70.62)
Gynecological history
Hormonal intrauterine device
 Yes247 (16.42)144 (58.30)
 No1257 (83.58)929 (73.91)
Bilateral ovariectomized
 Yes40 (2.66)32 (80.00)
 No1464 (97.34)1041 (71.11)
Hysterectomized
 Yes121 (8.05)95 (78.51)
 No1383 (91.95)978 (70.72)
Having menstruation within the past year
 Yes717 (47.67)464 (64.71)
 No787 (52.33)609 (77.38)

aOccupation Employed, working or studying. bhigher education = education after high school or likewise

Characteristics of respondents (survey) aOccupation Employed, working or studying. bhigher education = education after high school or likewise

Psychometric analysis

The analyses revealed eleven uni-dimensional scales fitting a Rasch model. One single item was retained due to high face validity. The final PROM was named the MenoScores Questionnaire (MSQ) and the eleven scales cover the constructs: hot flushes (HF), 2 items; day-and-night sweats (DNS), 2 items; general sweating (GS), 2 items; menopausal-specific sleeping problems (MSSP), 2 items; emotional (EM), 12 items; memory (MEM), 2 items; skin-hair (SH), 8 items; physical (PHY), 8 items; abdominal (ABD), 4 items; urinal-vaginal (URIN), 4 items, and sexual (SEX), 4 items. Including the retained single item (more tired than usual) the MSQ encompasses 51 items in total. Item numbers are listed in Table 4.
Table 4

Individual item fit

Item no.Item wordingA priori domainFinal scaleFit to Rasch modelFit to log linear Rasch model
ObservedExpectedPObservedExpectedP
4I have had hot flushes during the dayD2 - VasomotorHF0.8240.8230.91070.8210.8200.9157
5I have had hot flushes during the nightD2 - VasomotorHF0.8240.8230.91070.8210.8200.9157
6I have had bouts of sweating during the dayD2 - VasomotorDNS0.6940.6910.87680.6940.6910.8757
7I have had bouts of night sweatsD2 - VasomotorDNS0.6940.6910.87680.6940.6910.8757
8I have been sweating more than usual.D2 - VasomotorGS0.6320.6330.97200.6320.6330.9628
9I have had cold sweatsD2 - VasomotorGS0.6320.6330.97200.6320.6330.9628
10I have not been able to sleep because of night sweatsD3 - SleepMSSP0.8720.8700.8853
11I have not been able to sleep because of hot flushesD3 - SleepMSSP0.8720.8700.8853
22I have been depressedD4 - EmotionalEM0.7170.6800.0360
27I have had mood swingsD4 - EmotionalEM0.6460.6770.0545
30I have felt anxietyD4 - EmotionalEM0.6900.6930.9022
31I have felt nervousD4 - EmotionalEM0.6960.6800.3557
33I have been needlessly worriedD4 - EmotionalEM0.6670.6780.5050
34I have been worried about having a nervous breakdownD4 - EmotionalEM0.7270.7000.1399
40I have had less confidenceD4 - EmotionalEM0.6770.6870.5561
43I have not had the energy to socializeD4 - EmotionalEM0.6710.6770.7186
45I have felt isolatedD4 - EmotionalEM0.6890.6930.7989
47I have done less than I would likeD4 - EmotionalEM0.6940.6810.3996
48I can accomplish less than I used toD4 - EmotionalEM0.6800.6820.9221
53I have had difficulty concentratingD4 - EmotionalEM0.6820.6830.9548
54My memory has been worse than usualD4 - EmotionalMEM0.9230.9230.9841
55I have had problems with remembering everyday thingsD4 - EmotionalMEM0.9230.9230.9841
58I have had dry skinD5 - Skin, hair and mucosaSH0.4620.4110.02730.4620.351< 0.0001
62I have had a crawling feeling over the skinD5 - Skin, hair and mucosaSH0.4110.4180.82150.4090.4450.2133
63I have had itching of the scalpD5 - Skin, hair and mucosaSH0.4360.4050.21110.4360.4160.4045
64I have had vaginal drynessD5 - Skin, hair and mucosaSH0.4180.4310.55440.4170.4630.0378
65I have had vaginal itchingD5 - Skin, hair and mucosaSH0.4750.4050.01310.4740.4780.8649
66I have shed more hair than usualD5 - Skin, hair and mucosaSH0.4170.4210.85410.4170.4180.9889
67My nails split more than usualD5 - Skin, hair and mucosaSH0.3640.4250.01280.3710.4230.0337
69I have more body hair growthD5 - Skin, hair and mucosaSH0.3330.4120.00620.3330.3460.6786
71I have had heart palpitationsD6 - PhysicalPHY0.4680.4910.30390.4680.4910.3039
73I have had headacheD6 - PhysicalPHY0.4610.4960.09340.4610.4960.0934
75I have had a blind spot in front of my eyeD6 - PhysicalPHY0.5100.5100.98960.5100.5100.9896
76I have been dizzyD6 - PhysicalPHY0.5330.4850.03960.5330.4850.0396
80One or more of my joints has been soreD6 - PhysicalPHY0.5280.5130.43670.5280.5130.4367
84I have had neck painD6 - PhysicalPHY0.5300.5180.53320.5300.5180.5332
86I have had pins and needles in my feetD6 - PhysicalPHY0.5460.5270.45020.5460.5270.4502
95I have been more clumsy than usualD6 - PhysicalPHY0.5050.5060.96060.5050.5060.9606
77I have had nauseaD6 - PhysicalABD0.4210.4640.13310.4210.4640.1331
96My stomach has tended to be bloatedD6 - PhysicalABD0.5000.4760.28370.5000.4760.2837
98I have had uncontrollable loss of gasD6 - PhysicalABD0.5070.4720.14740.5070.4720.1474
102My stool has been looseD6 - PhysicalABD0.4530.4660.65070.4530.4660.6507
106I need to pass urine more frequently than usualD6 - PhysicalURIN0.5320.4810.032410.5320.4920.0975
107I sometimes leak urineD6 - PhysicalURIN0.4590.4740.553200.4590.4890.2257
108My urine has smelled differentD6 - PhysicalURIN0.5110.4820.285800.5110.4620.0770
110My vaginal discharge has been differentD6 - PhysicalURIN0.4190.4830.035390.4190.4770.0569
115I have had pain during intercourseD7 - SexualSEX0.7010.6290.00100.7010.6760.2232
116I have had bleeding after intercourseD7 - SexualSEX0.7140.6990.56650.7140.7640.0222
117I have been too tired for sexD7 - SexualSEX0.5790.5800.97030.5790.5500.1721
118I have had difficulty achieving an orgasmD7 - SexualSEX0.5350.5760.05540.5350.5440.6740
91I have felt more tired than usualD6 - PhysicalSingle item

HF = hot flushes (2 items), DNS = day-and-night sweats (2 items), GS = general sweating (2 items), MSSP = menopausal-specific sleeping problems (2 items), EM = emotional (12 items), MEM = memory (2 items), SH = skin-hair (8 items), PHY = physical (8 items), ABD = abdominal (4 items), URIN = urin-vaginal (4 items), SEX = sexual (4 items)

Individual item fit HF = hot flushes (2 items), DNS = day-and-night sweats (2 items), GS = general sweating (2 items), MSSP = menopausal-specific sleeping problems (2 items), EM = emotional (12 items), MEM = memory (2 items), SH = skin-hair (8 items), PHY = physical (8 items), ABD = abdominal (4 items), URIN = urin-vaginal (4 items), SEX = sexual (4 items)

Vasomotor symptoms

This suggested six-item domain showed misfit. Based on evidence of LD and results from the qualitative interviews, where hot flushes and day-and-night sweats were described as two different constructs, three two-item scales were formed. These scales all fitted a Rasch model and had no evidence of LD and were named: hot flushes (HF), day-and-night sweats (DNS) and general sweating (GS). In the HF scale, item 4 (hot flushes during the day) showed DIF with respect to (wrt.) having menstruation within the past year (p = 0.0013), and item 5 (hot flushes during the night) showed DIF wrt. BMI (p = 0.0008). In the DNS scale, item 6 (sweats during the day) showed DIF wrt. BMI (p < 0.0001), and item 7 (night-sweats) showed DIF wrt. Having menstruation within the past year (p = 0.0045). In the GS scale there was no evidence of DIF.

Sleep

The suggested 10-item domain did not fit a Rasch model. A two-item menopausal-specific sleeping problems (MSSP) scale was found to fit a Rasch model with no evidence of DIF or LD.

Emotional

The suggested 36-item domain did not fit a Rasch model. We omitted poor fitting items and found a 12-item EM scale (items 22, 27, 30, 31, 33, 34, 40, 43, 45, 47, 48, 53) with adequate fit to the partial credit Rasch model, but with substantial evidence of LD. The LD suggests four clusters of items: depression (three items: 22 [been depressed], 27 [mood swings], 34 [worried about nervous breakdown]); anxiety (three items: 30 [felt anxiety], 31 [felt nervous], 33 [needlessly worried]); social (two items: 40 [less confidence], 45 [felt isolated]), and energy (four items: 43 [no energy to socialize], 47 [do less], 48 [can accomplish less] 53 [difficulty concentrating]). No satisfactory log-linear Rasch model could be identified. We analyzed items 54 and 55 separately because of high content validity and because the content seemed different from the remaining items. They formed the Memory (MEM) scale where no DIF or LD was revealed.

Skin, hair and mucosa

This suggested 15-item domain did not fit a Rasch model, but an eight-item scale (58, 62, 63, 64, 65, 66, 67, 69), the skin-hair (SH) scale, was found to fit the log-linear Rasch model. Evidence of LD was disclosed for three item pairs: 62 (crawling feeling over the skin) and 63 (itching of the scalp); 64 (vaginal dryness) and 65 (vaginal itching); 66 (shed more hair than usual) and 67 (nails split more than usual). Item 62 showed DIF wrt. Smoking; item 64 showed DIF wrt. Age and wrt. Having menstruation within the past year, and item 65 showed DIF wrt. Having menstruation within the past year.

Physical

This suggested physical 41-item domain was divided into 3 hypothesized scales due to the content of the items: physical (PHY), 25 items; abdominal (ABD), 10 items, and urinary-vaginal (URIN), 6 items. Physical (PHY) scale. This 25-item scale was rejected, but a scale with eight items (71, 73, 75, 76, 80, 84, 86, 95) was found to fit the log-linear Rasch model where evidence of LD was found for the three item pairs 71 (heart palpitation) and 76 (been dizzy); 73 (headache) and 84 (neck pain); 80 (sore joints) and 86 (pins and needles in feet). Furthermore, item 73 showed DIF wrt. Age and item 80 showed DIF wrt. BMI.

Abdominal (ABD) scale

This 10-item scale was rejected, but a 4-item scale comprising the items 77, 96, 98, and 102 was found to fit a log-linear Rasch model. In this scale, LD was found between item 77 (nausea) and item 98 (uncontrollable loss of gas). Item 96 (bloated stomach) showed DIF wrt. Age and item 98 showed DIF wrt. Education.

Urinary-vaginal (URIN) scale

The 6-item scale was rejected, but a 4-item scale comprising the four items 106, 107, 108, and 110 was obtained. Item 108 (urine smells different) showed DIF wrt. Smoking and LD was found between item 106 (need to pass urine more frequently) and 107 (sometimes leak urine), and between 108 (urine smells different) and 110 (vaginal discharge has been different). Item 91 (more tired than usual) did not fit any of the scales. The item was also tested with the MSSP scale but without a fit to a Rasch model. Finally, the item was tested with the three related items 92, 93, and 94 but they did not fit a Rasch model. Nevertheless item 91 was retained as a single item because of its high face validity.

Sexual

Four items (115, 116, 117, 118) from this domain fitted a Rasch model and were named the sexual (SEX) scale. LD was found between the items 115 (pain during intercourse) and 116 (bleeding after intercourse). Item 115 showed DIF wrt. Age and being bilaterally ovariectomized and item 116 showed DIF wrt. Having a hormonal intrauterine device and having menstruation within the past year; while item 117 (too tired for sex) showed DIF wrt. Living alone. The SH, ABD scales showed signs of dichotomization in the category probability curves. The SH, ABD and SEX (with the additional response option “I do not know”) scales were re-tested in three single interviews (with women age 50 to 65) and all women preferred the three-response option instead of four (“no, not at all”, “yes, a bit”, or “yes, a lot”, plus the additional option in the SEX scale). In order to optimize model fit, the response options in these scales were reduced to the three options above (including the addition option in the SEX scale).

Work and spare time

Two-thirds of respondents were asked to complete this domain (i.e. women who claimed to be bothered by menopausal symptoms by answering “yes” to the global item). The 3-item domain fitted a Rasch model (p = 0.117) but items 1 and 3 with extremely poor item fit (p = 0.0001) and (p = < 0.0001). Thus, we decided to exclude this domain from the final PROM.

Menstruation

Only women who had menstruated within the past year were asked to complete this domain (approximately half of the respondents) (Table 3). This suggested 3-item domain did not fit a Rasch model (p = 0.000) and the items were not included in the final PROM.

Association (discrimination)

The HF, DNS, GS, and MSSP scales showed best performance in discriminating between the response options of the global item (Fig. 1. HF, DNS, GS, MSSP scales). The discriminating ability is presented in Table 5.
Fig. 1

HF, DNS, GS, MSSP scales

Table 5

Fit statistics, the Cronbach’s alpha and discriminating ability of the scales included in the MSQ

ScaleNumberofitemsFit to Rasch modelFit to log linear Rasch modelCronbach’salphadiscriminating abilitya
χ2dfPχ2dfP
Vaso-motor691.917< 0.00010.90
1. HF26.150.294119.7260.80570.8530
2. DNS29.950.077821.6250.65880.7634
3. GS213.650.018313.2170.72250.6062
MSSP10308.229< 0.00010.90
4. MSSP26.050.30770.8742
Emotional36511.4107< 0.00010.97
5. EM1245.1350.11890.93158
6. MEM22.950.71760.90258
SH15137.344< 0.00010.81
7. SH827.7230.228961.5710.78280.71300
Physical25768.674< 0.00010.93
8. PHY825.1230.343387.2730.12350.79218
Abdominal10219.029< 0.00010.81
9. ABD413.2110.282233.4470.92600.63522
Urinary665.817< 0.00010.74
10. URIN418.7110.06747.0321.00000.671040
Sexual8230.431< 0.00010.95
11. SEX415.8150.398137.6580.98250.91320

aTotal number of responders needed to show a clinically relevant difference in symptoms (moderate vs. severe bothersome menopause-related symptoms) with an 80% power and 5% level of significance. Low numbers indicate a high discriminating ability.

df = Degrees of freedom, HF = hot flushes, DNS = day-and-night sweats, GS = general sweating, MSSP = menopausal-specific sleeping problems, EM = emotional, MEM = memory, SH = skin-hair, PHY = physical, ABD = abdominal, URIN = urin-vaginal, SEX = sexual

HF, DNS, GS, MSSP scales Fit statistics, the Cronbach’s alpha and discriminating ability of the scales included in the MSQ aTotal number of responders needed to show a clinically relevant difference in symptoms (moderate vs. severe bothersome menopause-related symptoms) with an 80% power and 5% level of significance. Low numbers indicate a high discriminating ability. df = Degrees of freedom, HF = hot flushes, DNS = day-and-night sweats, GS = general sweating, MSSP = menopausal-specific sleeping problems, EM = emotional, MEM = memory, SH = skin-hair, PHY = physical, ABD = abdominal, URIN = urin-vaginal, SEX = sexual

Reliability

The reliability of the scales was moderate to high with Cronbach’s alpha values between 0.60 and 0.91 (Table 5). Table 4 presents individual item fit and Table 5 presents fit statistics, Cronbach’s alpha, and discriminating ability.

Discussion

We found that all existing questionnaires lacked content validity regarding bothersome menopausal symptoms and none were validated using IRT. Moreover, they all regarded hot flushes and day-and-night sweats as a single construct, which this study could not confirm. We found that the suggested vasomotor domain was three-dimensional concluding that hot flushes and day-and-night sweats are two different constructs. This was revealed in the qualitative interviews and confirmed by the Rasch analysis. Furthermore, these findings were confirmed when screening potential participants for a current randomized controlled trial (RCT) [48]. This study also revealed that only some symptoms are unquestionably related to the menopausal transition and four constructs are of importance when measuring bothersome menopausal symptoms: hot flushes, day-and-night sweats, general sweating and menopausal-specific sleeping problems. A strength of this study is the combination of rigorous qualitative and quantitative processes. Through the qualitative interviews we secured high content validity. Subsequently we used Rasch models to assess if the suggested domains behaved psychometrically as we expected. Another strength is the assessment of discriminating ability. Using the responses to the global item, in relation to the responses to the remaining items, we assessed how well the individual scales within the MSQ discriminated between the response options of the global item. We found the HF, DNS, GS and MSSP scales performed best in discriminating. Our interpretation of this is that only these constructs (HF, DNS, GS, MSSP) are unquestionably related to the menopausal transition. Many other symptoms may be, more or less, caused by aging. A limitation could be that as the data was collected cross-sectionally, test-retest analysis is not reported. Women with bothersome menopausal symptoms report fluctuations in their symptoms from day-to-day. Therefore, a test-retest with a 2-week interval would not be meaningful. Instead we assessed the internal consistency of the scales using Cronbach’s alpha. A further limitation is the broad sampling procedure which makes it difficult to know exactly what population the sample is representative of, due to the element of self-selection inherent in survey data using web-based enrolment. The fact that Rasch validation is performed without distributional assumptions mitigates this challenge. We identified DIF and LD in some of the final scales which may limit MSQ’s applicability in some situations. Items 4 and 5 from the HF scale and items 6 and 7 from the DNS scale all possessed DIF. Nevertheless, these items were maintained because of their high face validity. If the developed scales are used in a RCT, DIF is far less problematic because any exogenous variables will presumably be equally distributed among the randomized groups. However, if the scales are used in non-randomized studies, and any exogenous variables that can cause DIF appear in the studied cohort, one should adjust for the magnitude of the identified DIF [22]. Another approach would be to refrain from items possessing DIF or refrain from using the scales encompassing items possessing DIF [22]. Scales with many items may be preferred, since many items in a scale could increase the sensitivity, specificity, reliability, and ability to discriminate between the groups being tested. In the present study, our interest was to assess if the women were “not at all”, “a bit”, “quite a bit”, or “a lot” bothered by menopausal symptoms. We found the best discriminating scales among four 2-item scales: the vasomotor and sleeping scales (HF, DNS, GS and MSSP) and not among scales encompassing more items. There could be two reasons for this lack of discrimination: 1) LD, but even after deleting items with LD, these scales still did not discriminate as well as the scales from the vasomotor and sleeping domains; 2) that the subject matter of the other scales is related more to aging than to menopause. Due to the large item-pool we identified, we could discharge problematic and poor fitting items using a stepwise procedure. However, we ensured that no important items were lost just because of a psychometric misfit. Therefore, items with high content validity but psychometric misfit were kept as a single item, e.g. item 91 (more tired than usual). Even though the “work and spare time” domain fitted a Rasch model the items showed poor item fit. Since these items were not symptoms in themselves, but referred to how menopausal symptoms affected women’s work and spare time, we decided to disband these items and omit this domain from the final PROM. Moreover, the 3-item menstruation domain did not fit a Rasch model, and as these items were not of high relevance to this study, they were excluded from the final PROM. Since the timing of menopause and the experience of menopausal symptoms vary so widely [1, 4], the MSQ is designed to measure self-reported bothersome menopausal symptoms both in peri- and post-menopausal women. The intention is for the MSQ to be used as an outcome measure in studies where women are treated for bothersome menopausal symptoms. The time needed to complete the MSQ is estimated at 5 min, as the MSQ contains fewer than half the items in the draft PROM. The MSQ only addresses bothersome menopausal symptoms since these would be the target for treatment. It is important to note that some women also have positive experiences in relation to the menopause [49]; however, this is beyond the scope of the present study. The MSQ was developed in Danish and any new language or modified version may need an additional validation study to secure adequate measurement properties.

Conclusion

Menopausal symptoms are multidimensional with only some symptoms unquestionably related to the menopausal transition. The MenoScores Questionnaire (MSQ) is a new, condition-specific PROM with high content validity and adequate psychometrical properties measuring bothersome menopausal symptoms. To the best of our knowledge this is the first PROM measuring only bothersome menopausal symptoms, wherein all scales are developed via interviews with women having bothersome menopausal symptoms and thereafter psychometrically validated using IRT Rasch Models. The focus on bothersome symptoms will assist with identifying and evaluating treatments for women bothered by menopausal symptoms. Appendix 1. Unique item-pool (separated into the suggested domains and the new items and domains generated in the interviews). (DOCX 28 kb) Appendix 2. Draft PROM. (DOCX 26 kb)
  32 in total

1.  The efficacy of acupuncture on menopausal symptoms (ACOM study): protocol for a randomised study.

Authors:  Kamma Sundgaard Lund; John Brodersen; Volkert Siersma; Frans Boch Waldorff
Journal:  Dan Med J       Date:  2017-03       Impact factor: 1.240

2.  "MENOPAUSAL SYMPTOMS" IN WOMEN OF VARIOUS AGES.

Authors:  B L NEUGARTEN; R J KRAINES
Journal:  Psychosom Med       Date:  1965 May-Jun       Impact factor: 4.312

3.  Development of the menopause symptom list: a factor analytic study of menopause associated symptoms.

Authors:  J M Perz
Journal:  Women Health       Date:  1997

4.  The Blatt-Kupperman menopausal index: a critique.

Authors:  E Alder
Journal:  Maturitas       Date:  1998-05-20       Impact factor: 4.342

5.  Menopausal symptoms within a Hispanic cohort: SWAN, the Study of Women's Health Across the Nation.

Authors:  R Green; A J Polotsky; R P Wildman; A P McGinn; J Lin; C Derby; J Johnston; K T Ram; C J Crandall; R Thurston; E Gold; G Weiss; N Santoro
Journal:  Climacteric       Date:  2010-08       Impact factor: 3.005

6.  Management of menopause-associated vasomotor symptoms: Current treatment options, challenges and future directions.

Authors:  Deirdre R Pachman; Jason M Jones; Charles L Loprinzi
Journal:  Int J Womens Health       Date:  2010-08-09

7.  Assessment of menopause-related symptoms in mid-aged women with the 10-item Cervantes Scale.

Authors:  Faustino R Pérez-López; Ana M Fernández-Alonso; Gonzalo Pérez-Roncero; Peter Chedraui; Alvaro Monterrosa-Castro; Plácido Llaneza
Journal:  Maturitas       Date:  2013-08-01       Impact factor: 4.342

Review 8.  Menopause and sexuality: prevalence of symptoms and impact on quality of life.

Authors:  Rossella E Nappi; Michèle Lachowsky
Journal:  Maturitas       Date:  2009-05-21       Impact factor: 4.342

9.  Estimating a preference-based index for a menopause specific health quality of life questionnaire.

Authors:  John E Brazier; Jennifer Roberts; Maria Platts; York F Zoellner
Journal:  Health Qual Life Outcomes       Date:  2005-03-15       Impact factor: 3.186

10.  The Women's Health Questionnaire (WHQ): Frequently Asked Questions (FAQ).

Authors:  Myra S Hunter
Journal:  Health Qual Life Outcomes       Date:  2003-09-10       Impact factor: 3.186

View more
  3 in total

1.  Efficacy of a standardised acupuncture approach for women with bothersome menopausal symptoms: a pragmatic randomised study in primary care (the ACOM study).

Authors:  Kamma Sundgaard Lund; Volkert Siersma; John Brodersen; Frans Boch Waldorff
Journal:  BMJ Open       Date:  2019-02-19       Impact factor: 2.692

2.  Factors associated with a clinically relevant reduction in menopausal symptoms of a standardized acupuncture approach for women with bothersome menopausal symptoms.

Authors:  Frans Boch Waldorff; Christine Winther Bang; Volkert Siersma; John Brodersen; Kamma Sundgaard Lund
Journal:  BMC Complement Med Ther       Date:  2021-01-13

3.  Haematologists' experiences implementing patient reported outcome measures (PROMs) in an outpatient clinic: a qualitative study for applied practice.

Authors:  Stine Thestrup Hansen; Mette Kjerholt; Sarah Friis Christensen; Bibi Hølge-Hazelton; John Brodersen
Journal:  J Patient Rep Outcomes       Date:  2019-12-28
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.