Literature DB >> 33274372

Development of a common scale for measuring healthy ageing across the world: results from the ATHLOS consortium.

Albert Sanchez-Niubo1,2, Carlos G Forero3, Yu-Tzu Wu4, Iago Giné-Vázquez1,2, Matthew Prina4,5, Javier De La Fuente6,7, Christina Daskalopoulou4, Elena Critselis8, Alejandro De La Torre-Luque2,6,7, Demosthenes Panagiotakos8, Holger Arndt9, José Luis Ayuso-Mateos2,6,7, Ivet Bayes-Marin1,2, Jerome Bickenbach10,11, Martin Bobak12, Francisco Félix Caballero13,14, Somnath Chatterji15, Laia Egea-Cortés1, Esther García-Esquinas13,14, Matilde Leonardi16, Seppo Koskinen17, Ilona Koupil18,19, Blanca Mellor-Marsá1,20, Beatriz Olaya1,2, Andrzej Pająk21, Martin Prince5,22, Alberto Raggi16, Fernando Rodríguez-Artalejo13,14, Warren Sanderson23,24, Sergei Scherbov23,25,26, Abdonas Tamosiunas27, Beata Tobias-Adamczyk28,29, Stefanos Tyrovolas1,2, Josep Maria Haro1,2.   

Abstract

BACKGROUND: Research efforts to measure the concept of healthy ageing have been diverse and limited to specific populations. This diversity limits the potential to compare healthy ageing across countries and/or populations. In this study, we developed a novel measurement scale of healthy ageing using worldwide cohorts.
METHODS: In the Ageing Trajectories of Health-Longitudinal Opportunities and Synergies (ATHLOS) project, data from 16 international cohorts were harmonized. Using ATHLOS data, an item response theory (IRT) model was used to develop a scale with 41 items related to health and functioning. Measurement heterogeneity due to intra-dataset specificities was detected, applying differential item functioning via a logistic regression framework. The model accounted for specificities in model parameters by introducing cohort-specific parameters that rescaled scores to the main scale, using an equating procedure. Final scores were estimated for all individuals and converted to T-scores with a mean of 50 and a standard deviation of 10.
RESULTS: A common scale was created for 343 915 individuals above 18 years of age from 16 studies. The scale showed solid evidence of concurrent validity regarding various sociodemographic, life and health factors, and convergent validity with healthy life expectancy (r = 0.81) and gross domestic product (r = 0.58). Survival curves showed that the scale could also be predictive of mortality.
CONCLUSIONS: The ATHLOS scale, due to its reliability and global representativeness, has the potential to contribute to worldwide research on healthy ageing.
© The Author(s) 2020. Published by Oxford University Press on behalf of the International Epidemiological Association.

Entities:  

Keywords:  Healthy ageing; data integration; functional ability; intrinsic capacity; item response theory; scale

Mesh:

Year:  2021        PMID: 33274372      PMCID: PMC8271194          DOI: 10.1093/ije/dyaa236

Source DB:  PubMed          Journal:  Int J Epidemiol        ISSN: 0300-5771            Impact factor:   7.196


This study used an item response theory approach to develop a common scale for measuring healthy ageing, based on data harmonized and integrated from 16 international cohorts. The scale can measure the biopsychosocial aspects of health and functioning, since it covers domains of vitality, sensory functions, locomotion, cognition and activities of daily living that imply interaction with the individual’s environment. The scale is intended to be universally applicable for evaluating healthy ageing, as it arises from the use of international cohorts covering 38 countries from all populated continents. Notwithstanding efforts at integration, as far as we know, no other study has yet produced a common measurement approach, based concomitantly on the combination of intrinsic capacity and functional ability, for assessing internationally healthy ageing. The development of this scale may help researchers and policy makers to have a better understanding of healthy ageing and will move forward in epidemiological research of healthy ageing.

Introduction

Population ageing poses enormous challenges to the social welfare state as a result of greater needs for the elderly’s health and social care., To address this issue, research initiatives have analysed the concept of healthy, active or successful ageing internationally at several levels., However, few of these research efforts have adequately tackled the complex challenge of addressing the concept of healthy ageing, since international consensus regarding how healthy ageing should be measured, to account for the diversity of populations globally, has not been achieved to date. Existing indices or scores of healthy ageing address different concepts and are limited to specific populations. Hence, efforts are needed to obtain a validated universally applicable measurement tool. The World Health Organization (WHO) proposed in 2015 to define healthy ageing as an ‘ongoing process of developing and maintaining the functional ability that enables well-being in older age’., This framework has moved away from focusing on the presence of disease experienced at a single time point to considering healthy ageing as a function of an individual’s functional ability over time. Functional ability is determined by the interaction of individuals’ intrinsic capacity and their environment. In turn, an individual’s intrinsic capacity is comprehensively considered by addressing all physical and mental capacities; and the environment should at least include access to medications, personal support, assistive devices and physical barriers that may be either facilitate or hinder functional ability. To this effect, a measure which combines an individual’s intrinsic capacity and functional ability may be able to more broadly capture a person’s healthy ageing level. In addition, such a measure may have the ability to stimulate more effective prevention strategies by fostering either intrinsic capacity or the resulting functional ability through environmental interventions. Epidemiological studies on ageing tend to collect heterogeneous information on biopsychosocial aspects of health and functioning. Integrating data from multiple cohort studies can be a viable way to combine knowledge gained with a sustainable methodology and provide a nuanced understanding of ageing in different populations. It increases sample size and improves statistical power to accurately estimate health outcomes and their determinants. Additionally, it facilitates comparisons within and across study populations due to variety in geography, composition, socioeconomic status and other factors of interest. This provides significant opportunities for researchers to pool data from multiple studies and conduct data analyses simultaneously. Some key harmonization and integration activities have been conducted as the Health and Retirement Studies family, the Integrative Analysis of Longitudinal Studies on Aging, or the WHO Study on Global Ageing and Adult Health. Yet there is a need to develop a common approach that will facilitate temporal and regional assessments of healthy ageing across diverse populations. This study is based on the Ageing Trajectories of Health-Longitudinal Opportunities and Synergies (ATHLOS) project, which has produced a large harmonized dataset from 38 countries from all populated continents. This study aims to develop a novel scale measuring healthy ageing using items about intrinsic capacity and functional ability and to provide evidence of validity. The scale is intended to be universal, since individual data from any one study can be used to estimate healthy ageing scores comparable to all of them.

Methods

The study protocol was approved by the Committee on the Ethics of Clinical Research, CEIC Fundació Sant Joan de Déu (protocol number: PIC-22–15). All data were anonymized and electronic health record confidentiality was respected in accordance with national and international law.

Data sources

The ATHLOS cohort is composed of harmonized datasets of international cohorts related to health and ageing. To this effect, data from the following 16 studies were considered: the 10/66 Dementia Research Group Population-Based Cohort Study (10/66) with waves 1 and 2; the Australian Longitudinal Study of Aging (ALSA) from wave 1 to 13; the China Health and Retirement Longitudinal Study (CHARLS) with waves 1 and 2; Collaborative Research on Ageing in Europe (COURAGE) with waves 1 and 2; the English Longitudinal Study of Ageing (ELSA) from wave 1 to 7; Study on Cardiovascular Health, Nutrition and Frailty in Older Adults in Spain (ENRICA) from wave 1 to 3; the Health, Alcohol and Psychosocial factors in Eastern Europe Study (HAPIEE) with waves 1 and 2; the Health 2000/2011 Survey (H2000/11) with waves 1 and 2; the Health and Retirement Study (HRS) from wave 1 to 11; the Japanese Study of Aging and Retirement (JSTAR) from wave 1 to 3, the Korean Longitudinal Study of Ageing (KLOSA) from wave 1 to 4; the pilot-study Longitudinal Aging Study in India (LASI); the Mexican Health and Aging Study (MHAS) from wave 1 to 3; the Study on Global Ageing and Adult Health (SAGE) with only wave 1; the Survey of Health, Ageing and Retirement in Europe (SHARE) from wave 1 to 5; and the Irish Longitudinal Study of Ageing (TILDA) with waves 1 and 2. The above studies include populations from 38 countries across five continents. The data harmonization aimed to transform study-specific variables in a homogeneous definition and format across studies. More detailed information about the harmonization process in the ATHLOS project can be found elsewhere.

Intrinsic capacity and functional ability items

To develop the healthy ageing scale, the ATHLOS consortium agreed on a comprehensive list of 41 items related to intrinsic capacity and functional ability, covering the biopsychosocial aspects of health and functioning usually found in general population surveys. All items were assessed across studies and successfully harmonized in at least three studies. Study-specific variables were harmonized into dichotomous items expressing the presence or absence of difficulties. Continuous variables were dichotomized in the first quartile, indicating the presence of difficulties. The harmonization process of each item can be found at Supplementary Table S1, available as Supplementary data at IJE online.

Data selection

The sample size used to construct the scale included all individuals above 18 years of age. We selected all individuals with available data on at least one of the 41 items and used their earliest observed assessment.

Statistical analysis

We developed the healthy ageing scale using item response theory (IRT) models. We chose the two-parameter logistic IRT model, where the probability of endorsing a response category is modelled as a function of two item parameters, item discrimination and item difficulty, and a person parameter. To test the adequacy of the model as a measurement scale, its fit was assessed using the Root Mean Square Error of Approximation (RMSEA; good fit <0.06), the Comparative Fit Index (CFI; good fit >0.95) and the Tucker-Lewis index (TLI; good fit >0.95). The initial estimation of item and person parameters (calibration) was obtained by applying full information maximum likelihood estimation on the sample matrix of response patterns. The score of each individual was calculated using the expected a posteriori estimation method. Maximum score reliability and model marginal reliability were assessed. As each study had a different subset of available items, differences between the scores on the scale and scores on study-specific scales were assessed using intraclass correlation coefficients (ICC). High ICC values indicate that the scoring system is stable to obtain same person scores despite different item subsets. To establish the homogeneity of the scale across studies, a logistic regression framework was used to detect differential item functioning (DIF), which indicates whether items are measured in the same way for all studies. If a study had any items exhibiting DIF, specific parameters were estimated applying the overall model in the study-specific sample. To rescale specific parameters to the full sample, scaling parameters were linearly moved using the Stocking-Lord equating approach. Thus, the procedure took into account study-specific IRT parameters for items with DIF by equating study-specific parameters to the main scale using a test characteristic curve equating procedure. The resulting scores were converted to T-scores with a mean of 50 and a standard deviation of 10. These constituted the final healthy ageing scores of the sample based on the main scale. To check concurrent validity of the scale, weighted linear trends of the T-score means by age groups were assessed for each study. Second, a multiple linear regression model was carried out to investigate whether the T-scores were associated with known risk factors for poor health in adulthood. The following variables were included, which were all self-reported by the participant: socidemographic variables such as country, sex, age, year of birth, education and wealth; lifestyles such as smoking and obesity; most frequent physician-diagnosed diseases such as high blood pressure, cardiovascular disease, respiratory disease, diabetes, joint disorders; one frequent disorder such as depression that was based on the psychometric instrument of each study; and one important social environment such as loneliness which was based in some studies on a symptom of the depression instrument and in others on questions about the feeling of loneliness. Variables with too many missing structural values were avoided because some studies were not found or were not valid to be harmonized. Predictive validity was supported using the harmonized age at death variable in those studies with follow-up and provided by each study. Kaplan-Meier survival estimators of T-scores categorized into four groups (less than or equal to 40, 41–50, 51–60 and 61 or higher) and time from participant assessment to death were plotted. Finally, to check convergent validity, each country’s T-score means were compared with two established indicators for health and wealth: the Healthy Life Expectancy (HALE) at birth and the Gross Domestic Product (GPD) per capita, both from the year 2016 in each country. Potential sex, age and birth cohort effects were previously removed by calculating the difference between T-scores and predicted T-scores from a linear regression adjusted by sex, age, year of birth and their interactions. All analyses were conducted with the statistical software R and appended as Supplementary materials, available as Supplementary data at IJE online.

Results

The sample size was 343 915 individuals, 55% were female and the median age was 60 years (see Table 1). This sample size was 98.9% of the total eligible participants who had information on at least one item. Of the remaining 1.1% (3978 individuals), 60% were from the SAGE study. Studies that provided the largest sample sizes were SHARE (30.8%), SAGE (12.3%) and HRS (10.6%). Years of interview were mostly between 2000 and 2015 except in ALSA and HRS, which started in the early 90s. In HRS, new cohorts were refreshed over the years. Medians of years of birth were between 1940 and 1950 except in the studies 10/66, ALSA and HRS, which included individuals who were born earlier.
Table 1

Descriptive statistics of sex, age, years of interview and birth by study of participants included in the construction of the healthy ageing scale

StudiesSample size
FemalesAge
Years of interview
Years of birth
N %%minmedmaxminmedmaxminmedmax
10/66168864.9626573110200120052010189619321944
ALSA20870.6496478103199219921993188919141927
CHARLS202735.9521957101201120112013191019541993
COURAGE107803.1571860104201120112012190319511994
ELSA179845.255195994200220022015190819441987
ENRICA25190.753606793200820092010191519411950
HAPIEE266647.853445875200220042008193219451962
HRS3632010.6561856103199219942013189019381992
H2000-1184172.4553049101200020002012190019511981
JSTAR71052.152466377200720072011193019451964
KLOSA10 2543.0574561105200620062006190119451961
LASI14130.4562153102201020102010190719561989
MHAS19 8485.8441857114200120012012189519461992
SAGE42 26812.3571858114200720072010189319491991
SHARE105 82930.8562262104200420112013190019461991
TILDA84632.556496282201020102012193019481961
All343 915100551860114199220072015188919451994

10/66, 10/66 Dementia Research Group Population-Based Cohort Study; ALSA, Australian Longitudinal Study of Ageing; CHARLS, China Health and Retirement Longitudinal Study; COURAGE, Collaborative Research on Ageing in Europe; ELSA, English Longitudinal Study of Ageing; ENRICA, Seniors-ENRICA; HAPIEE, Health, Alcohol and Psychosocial factors In Eastern Europe study; HRS, Health and Retirement Study; H2000-11, Health 2000/2011 study; JSTAR, Japanese Study of Aging and Retirement; KLOSA, Korean Longitudinal Study of Ageing; LASI, Longitudinal Aging Study in India; MHAS, Mexican Health and Aging Study; SAGE, Study on Global Ageing and Adult Health; SHARE, Survey of Health, Ageing and Retirement in 20 countries from Europe; TILDA, Irish Longitudinal study on Ageing; min, minimum; med, medium; max, maximum.

Descriptive statistics of sex, age, years of interview and birth by study of participants included in the construction of the healthy ageing scale 10/66, 10/66 Dementia Research Group Population-Based Cohort Study; ALSA, Australian Longitudinal Study of Ageing; CHARLS, China Health and Retirement Longitudinal Study; COURAGE, Collaborative Research on Ageing in Europe; ELSA, English Longitudinal Study of Ageing; ENRICA, Seniors-ENRICA; HAPIEE, Health, Alcohol and Psychosocial factors In Eastern Europe study; HRS, Health and Retirement Study; H2000-11, Health 2000/2011 study; JSTAR, Japanese Study of Aging and Retirement; KLOSA, Korean Longitudinal Study of Ageing; LASI, Longitudinal Aging Study in India; MHAS, Mexican Health and Aging Study; SAGE, Study on Global Ageing and Adult Health; SHARE, Survey of Health, Ageing and Retirement in 20 countries from Europe; TILDA, Irish Longitudinal study on Ageing; min, minimum; med, medium; max, maximum. The IRT model converged successfully with an excellent fit (RMSEA = 0.03, TLI = 0.99 and CFI = 0.99). The IRT parameter estimates showed that daily activity items had the highest values for discrimination and cognitive items had the lowest values (see Table 2).
Table 2

List of domains and items related to functioning and health and their parameter estimates from the item response theory (IRT) model

DomainsPresence or absence of difficultiesIRT parameter estimates and standard errors
DiscriminationDifficulty
CognitionMemory0.8332(0.0059)−0.5914(0.0065)
Immediate recalla0.6325(0.0049)−0.9731(0.0092)
Delayed recalla0.6714(0.0051)−1.2130(0.0100)
Verbal fluencya0.6080(0.0057)−1.6981(0.0160)
Orientation in time0.8449(0.0078)−1.7307(0.0155)
Processing speeda0.5912(0.0144)−1.9291(0.0453)
Numeracya1.0404(0.0118)−1.9586(0.0203)
Psychology symptomsSleeping0.8334(0.0052)−0.5605(0.0057)
VitalityExperiences some degree of pain1.0616(0.0059)−0.1463(0.0042)
Having high level of energy0.9119(0.0054)−0.5781(0.0054)
Urinary incontinence1.0969(0.0111)−2.3546(0.0196)
Sensory functionsNear vision0.9438(0.0061)−0.9104(0.0069)
Far vision1.2639(0.0075)−1.1091(0.0062)
Eyesight using glasses or lens as usual0.9421(0.0086)−1.5122(0.0126)
Hearing in general0.8212(0.0067)−1.9818(0.0151)
Hearing in a conversation0.8426(0.0107)−2.2121(0.0262)
Locomotion/mobilityStooping, kneeling or crouching2.4717(0.0120)−0.5059(0.0029)
Lifting or carrying weights2.7130(0.0134)−0.7834(0.0031)
Climbing stairs2.7327(0.0137)−0.7940(0.0031)
Getting up from sitting down2.4166(0.0125)−0.8256(0.0035)
Walking by yourself and without any equipment3.1335(0.0161)−0.8763(0.0030)
Pulling or pushing large objects3.1691(0.0202)−0.8848(0.0036)
Sitting for long periods2.0455(0.0114)−1.1322(0.0047)
Reaching or extending arms2.1929(0.0129)−1.4727(0.0054)
Walking speeda0.8995(0.0111)−1.6364(0.0160)
Dizziness when walking on a level surface1.3230(0.0128)−1.7363(0.0142)
Picking up things with fingers2.3139(0.0156)−1.8427(0.0068)
Activities of daily livingGetting in or out of bed3.4954(0.0239)−1.4949(0.0044)
Bathing or showering3.5997(0.0253)−1.5996(0.0045)
Getting dressed2.6935(0.0180)−1.6404(0.0055)
Moving around the home3.4746(0.0271)−1.6976(0.0053)
Using the toilet3.5668(0.0277)−1.7960(0.0054)
Eating3.0627(0.0255)−2.0989(0.0073)
Instrumental activities of daily livingDoing housework3.1663(0.0185)−1.0530(0.0035)
Shopping for groceries4.3188(0.0376)−1.3521(0.0045)
Getting out of the house3.4506(0.0377)−1.3939(0.0056)
Difficulties in preparing meals3.8589(0.0360)−1.5886(0.0058)
Using a map1.6814(0.0149)−1.6329(0.0108)
Managing money, bills or expenses2.4045(0.0266)−1.8498(0.0107)
Taking medications3.0022(0.0311)−2.0162(0.0096)
Making telephone calls2.9271(0.0332)−2.0392(0.0108)

Item was dichotomized in the first quartile indicating presence of difficulties.

List of domains and items related to functioning and health and their parameter estimates from the item response theory (IRT) model Item was dichotomized in the first quartile indicating presence of difficulties. T-scores were computed for all the individuals, with high scores indicating healthier ageing. The T-scores range was from 12 to 69, left-skewed with a mean of 50.2 and a standard deviation of 10. The model had maximum reliability of 0.975 at the T-score 35.3, with reliability over 0.90 from T-scores 23.2 to 48.5, and a model marginal reliability of 0.83. ICCs between the T-scores for the main scale and each study-specific subset of items were higher than 0.89 (see Table S1). Items with DIF were found in three or fewer studies, except for the item ‘energy’ that presented DIF in 6 studies. On the other hand, ENRICA, HRS, and MHAS were the only studies without any items with DIF. All others exhibited from 1 to 8 items with DIF. 10/66, ALSA and SAGE were the studies with the highest proportion of items with DIF (see Table S1). In the linear transformation of the equating approach, additive parameter estimates ranged from -0.15 to 0.11, and the multiplicative parameter estimates ranged from 0.90 to 1.11. Studies without items with DIF had the same scores (see Table S1). In all studies, we observed that T-scores were lower in each older group (see Figure 1). COURAGE and H2000/11 had the highest decreasing slopes, -6.2 and -7.0, respectively, which indicate the number of T-score units that decreased in each older group.
Figure 1

Trends of healthy ageing T-scores by age groups in each study

Trends of healthy ageing T-scores by age groups in each study Regarding the results of the multiple linear regression, we found that males and/or individuals with higher educational level, greater wealth and never smoking had higher T-scores. In contrast, obesity, arterial hypertension, depression, physical diseases and loneliness were associated with lower T-scores (see Table 3).
Table 3

Multiple linear regression between the T-scores of the healthy ageing scale and sociodemographic, life and health factors, adjusted by country

VariablesMean (SDb) or %Standardized coefficients95% confidence interval
Age62 (12)−4.25(-4.36, -4.15)
Year of birth1944 (13)−1.65(-1.76, -1.54)
Age x year of birth··0.30(0.28, 0.31)
Sex (reference: females)550··
 Males440.83(0.81, 0.86)
 Missing1−0.10(-0.13, -0.07)
Education (reference: less than primary)130··
 Primary220.25(0.21, 0.30)
 Secondary431.04(0.99, 1.09)
 Tertiary151.27(1.23, 1.31)
 Missing7−0.42(-0.46, -0.39)
Wealth c (reference: 1st quintile = less wealthy)180··
 2nd quintile160.18(0.15, 0.21)
 3rd quintile170.34(0.31, 0.37)
 4th quintile170.51(0.48, 0.54)
 5th quintile180.75(0.71, 0.78)
 Missing140.53(0.49, 0.58)
Smoking (reference: never)490
 Past23−0.14(-0.17, -0.11)
 Currently20−0.33(-0.36, -0.30)
 Missing80.24(0.20, 0.28)
Obesity (reference: no)660··
 Yes17−0.66(-0.69, -0.64)
 Missing170.68(0.63, 0.72)
Arterial hypertension (reference: no)540··
 Yes36−0.33(-0.35, -0.30)
 Missing10−0.35(-0.39, -0.30)
Depression (reference: no)700··
 Yes19−2.05(-2.08, -2.02)
 Missing11−1.45(-1.48, -1.41)
CVDd (reference: no)830··
 Yes14−1.23(-1.26, -1.20)
 Missing30.33(0.29, 0.36)
Respiratory diseasee (reference: no)890··
 Yes9−0.76(-0.79, -0.74)
 Missing2−0.08(-0.12, -0.04)
Diabetes (reference: no)660··
 Yes11−0.52(-0.55, -0.49)
 Missing230.52(0.48, 0.56)
Joint disordersf (reference: no)690··
 Yes22−1.70(-1.73, -1.67)
 Missing9−0.60(-0.66, -0.55)
Loneliness (reference: no)570··
 Yes17−0.55(-0.58, -0.52)
 Missing26−0.004(-0.044, 0.036)
Adjusted R-squaredg = 0.42

Variable ‘country’ is categorical with 38 countries.

SD: standard deviation.

Wealth: quantiles of household incomes and asset information from participants within their country cohort.

At least one of the following cardiovascular diseases (CVD): angina, stroke, myocardial infarction, heart attack, coronary heart disease, congestive heart failure, heart murmur, valvular disease, cerebral vascular disease.

At least one of the following respiratory diseases: asthma, chronic obstructive pulmonary disease, bronchitis, emphysema.

At least one of the following joint disorders: arthritis, rheumatism, osteoarthritis.

R-squared refers to the coefficient of determination.

Multiple linear regression between the T-scores of the healthy ageing scale and sociodemographic, life and health factors, adjusted by country Variable ‘country’ is categorical with 38 countries. SD: standard deviation. Wealth: quantiles of household incomes and asset information from participants within their country cohort. At least one of the following cardiovascular diseases (CVD): angina, stroke, myocardial infarction, heart attack, coronary heart disease, congestive heart failure, heart murmur, valvular disease, cerebral vascular disease. At least one of the following respiratory diseases: asthma, chronic obstructive pulmonary disease, bronchitis, emphysema. At least one of the following joint disorders: arthritis, rheumatism, osteoarthritis. R-squared refers to the coefficient of determination. The studies H2000/11, MHAS and TILDA did not provide mortality information for reasons of confidentiality, and LASI and SAGE had available only one wave. In the Kaplan-Meier estimations, mortality risk was higher for each lower T-score group throughout the observed time period. The group with the lowest T-scores had a 50% survival probability in 10 years and for the other groups it was in at least 20 years (see Figure 2).
Figure 2

Kaplan-Meier curves of healthy ageing T-scores categorized into four groups (<40, 40–49, 50–59, >60) and time from participant assessment to death

Kaplan-Meier curves of healthy ageing T-scores categorized into four groups (<40, 40–49, 50–59, >60) and time from participant assessment to death Graphical representations of T-scores by country show that cohorts from Switzerland (mean of 56.5), Japan (55.6) and Denmark (55.4) had the highest T-scores (see Figure 3). In contrast, cohorts from Ghana (40.4), India (40.7) and Russia (42.7) had the lowest T-scores. Correlations between T-score means by country and ecological country indicators were 0.81 with HALE and 0.58 with GDP (see Figure 4).
Figure 3

Distributions of healthy ageing T-scores by country

Figure 4

Healthy ageing T-score means by country compared with the indicators ‘healthy life expectancy’ and ‘gross domestic product’, respectively

Distributions of healthy ageing T-scores by country Healthy ageing T-score means by country compared with the indicators ‘healthy life expectancy’ and ‘gross domestic product’, respectively

Discussion

In this study, we developed a scale for measuring healthy ageing comprising 41 items of intrinsic capacity and functional ability, by employing harmonized data from 16 international ageing cohorts. The IRT model resulted in excellent reliability (>0.90) in T-scores from 23.5 to 48.5, with marginal reliability of 0.83, rendering the model adequate for group comparisons. Concurrent validity of the scale with sociodemographic, life and health factors, and predictive validity with mortality, have shown that this scale corresponds well with health status and could be potentially useful for conducting international ageing research. We found that the scale was related to HALE, which is known to differ between countries. Regarding GDP, countries’ well-being is known to be a determinant of health. The fact that the scale is sensitive to differences in GDP between countries indicates that it is a potential health outcome useful for aggregate comparisons. However, these comparisons are perhaps most interesting when examining what is left behind. For example, why do countries with a high HALE, such as Spain or South Korea, or with a high GDP, such as the USA or Ireland, show worse ageing outcomes than countries such as Denmark or Peru? Such observations provide the basis for further studies and thought-provoking hypotheses. Unfortunately, they are far beyond the scope of this study. Several scales are currently available for measuring specific aspects of health and ageing, albeit none concomitantly including the comprehensive assessment of intrinsic capacity and functional ability. For example, the WHODAS 2.0 scale has been widely used to assess individual disability at population levels or in clinical practice. However, it focuses on the individual’s functioning in interaction with the social environment, thus assessing the functional ability of a person rather than intrinsic capacity. Similarly, the Active Ageing Index, which covers diverse aspects of active and healthy ageing particularly in European populations, focuses on individuals’ functional ability as well as interaction with social and labour environments, rather than on intrinsic capacity. On the other hand, a recent composite ageing measure arising from the ELSA study included only intrinsic capacity. The latter showed predictive capacity for measuring an individual’s functioning, thus separating the concepts of intrinsic capacity and functional ability. Hence, the available scales are limited to specific populations and separate or merge some aspects of intrinsic capacity and functional ability with other domains used to describe ageing: for instance, physiological and physical health, personal perception and social environment, among others. It must be underlined that most existing cohort studies were not designed to measure intrinsic capacity and functional ability separately. For example, self-reported vision problems might reflect both intrinsic capacity (those who do not use glasses) and functional ability (those who use glasses). Therefore, it is more appropriate to incorporate all measures related to intrinsic capacity and functional ability, to capture healthy ageing when using existing cohort data. The ATHLOS scale was based on the IRT modelling framework that allows for estimating item parameters that are independent of person scores in the sample from which they are obtained. This entails that individual scores from the IRT measurement scale are comparable between individuals from different studies responding to different item subsets. Moreover, even though measurement biases can occur, IRT modelling allows testing and adjusting the effect of potential confounders on individual responses, such as specific effects of cohorts, gender and cultural factors. Using modelling differential item functioning, it is possible to obtain directly comparable measures across groups., IRT also provides the means to scores in a universal reference, yielding the possibility of rescaling individual scores obtained in a group to an arbitrary reference scale of choice., Methodological attempts to create the scale with similar subsets of items were successfully conducted by using only some specific studies within the ATHLOS project. However, the integration of multiple independent samples in the ATHLOS dataset had the challenge of obtaining and harmonizing a large number of intrinsic capacity and functional ability items that are difficult to collect with individual samples. Having information about many items makes it more feasible to capture the diverse heterogeneity of the individual’s healthy ageing. Moreover, the IRT model overcomes the potential obstacle of the presence of incomplete data, wherein some items serve as anchors for equating responses in different studies. Scores of individuals from smaller subsets of items should obtain the score estimates with the full subset of items, but with higher measurement error. Nevertheless, the ICC that was conducted showed that the scoring with different subsets of items was well correlated to scores from the main scale. Furthermore, in the case of choosing a subset of items for a new study, priority should be given to those with the highest IRT discrimination parameters, such as ADL/IADL (activities of daily living, instrumental activities of daily living) items. Using these items, it would be possible to establish a minimum set of information for measuring the ageing status of individuals. Our results must be interpreted taking into account that observational data can introduce selection and information biases. The data come from the general population and therefore the scale may under-represent people with greater dependency living in nursing homes or other facilities, or with greater cognitive impairment. These biases can be aggravated when integrating data from studies with different sampling designs, questionnaires and ways of asking for the same information. In addition, the harmonization process had to address multiple issues of heterogeneity across studies. Items had to be dichotomized to include the maximum number of items per study. For instance, similar questions with different levels required choosing the cut-off point for dichotomization. Harmonizing questions addressing the same type of difficulty (e.g. sleep was restless versus difficulty sleeping) was more difficult. For this reason, potential sources of DIF are usually studied by subgroups as a means of detecting heterogeneity when obtaining an item. Future research should address other potential sources of DIF such as sex and age, even though in preliminary analyses there were already very few items with DIF. From the list of 41 items covering the biopsychosocial aspects of health and functioning, others may be missing such as lung function, grip strength or even biomarkers of immune function. These items, which may already be quite correlated with the items included, are usually less common and would therefore add more complexity to the scale. Other items more related to depressive/anxious mental health symptoms, such as as feelings of sadness or loneliness, were considered candidates for inclusion in the scale. However, in carrying out the analyses, these elements introduced new dimensions related to psychopathology. As it would have been more difficult to interpret a single measure of ageing, we chose to exclude these items from the scale, leaving them aside for further research for the diagnosis of affective disorders in ageing studies. Healthy ageing scores have become useful tools in daily clinical practice for patient prognosis, as well as for the development of future public health strategies with the globally rapid pace of ageing. It is a fact that accuracy is the cornerstone of this kind of score, so the wide use of a scale like this in populations with divergent ethnic, genetic, social and cultural characteristics, and hence variable risk factors, could lead to a specific variability in the prediction of healthy ageing. However, through our methodological approach, the accuracy and the validity of the healthy ageing estimation models, using multinational populations and using different variables, represents an important topic in the field of healthy ageing and is the first step towards understanding the complex process of ageing. We believe that the development of this scale will make it possible to move forward the epidemiological research of healthy ageing. This single scale can then be used across studies conducted internationally. When applied to longitudinal data, these scores may provide reliable measures of healthy ageing scores over time. In this way, the ATHLOS scale can identify patterns of healthy ageing trajectories and their determinants, and critical points in time when changes in trajectories are induced, enabling the design and implementation of timely clinical and public health interventions to optimize and promote healthy ageing. All the studies included in the ATHLOS project gave permission for the secondary use of their data by the ATHLOS Consortium. Data may be shared on request to the corresponding author with permission of the ATHLOS consortium.

Supplementary Data

Supplementary data are available at IJE online

Funding

This work was supported by the 5-year Ageing Trajectories of Health: Longitudinal Opportunities and Synergies (ATHLOS) project. The ATHLOS project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 635316. See Supplementary material for more information about funding for each study. Click here for additional data file.
  34 in total

1.  Data Resource Profile: the Survey of Health, Ageing and Retirement in Europe (SHARE).

Authors:  Axel Börsch-Supan; Martina Brandt; Christian Hunkler; Thorsten Kneip; Julie Korbmacher; Frederic Malter; Barbara Schaan; Stephanie Stuck; Sabrina Zuber
Journal:  Int J Epidemiol       Date:  2013-06-18       Impact factor: 7.196

2.  [Rationale and methods of the study on nutrition and cardiovascular risk in Spain (ENRICA)].

Authors:  Fernando Rodríguez-Artalejo; Auxiliadora Graciani; Pilar Guallar-Castillón; Luz M León-Muñoz; M Clemencia Zuluaga; Esther López-García; Juan Luis Gutiérrez-Fisac; José M Taboada; M Teresa Aguilera; Enrique Regidor; Fernando Villar-Álvarez; José R Banegas
Journal:  Rev Esp Cardiol       Date:  2011-08-06       Impact factor: 4.753

3.  Is rigorous retrospective harmonization possible? Application of the DataSHaPER approach across 53 large studies.

Authors:  Isabel Fortier; Dany Doiron; Julian Little; Vincent Ferretti; François L'Heureux; Ronald P Stolk; Bartha M Knoppers; Thomas J Hudson; Paul R Burton
Journal:  Int J Epidemiol       Date:  2011-07-30       Impact factor: 7.196

4.  Cohort profile: the English longitudinal study of ageing.

Authors:  Andrew Steptoe; Elizabeth Breeze; James Banks; James Nazroo
Journal:  Int J Epidemiol       Date:  2012-11-09       Impact factor: 7.196

5.  Design and methodology of the Irish Longitudinal Study on Ageing.

Authors:  Brendan J Whelan; George M Savva
Journal:  J Am Geriatr Soc       Date:  2013-05       Impact factor: 5.562

6.  "Healthy Aging" Concepts and Measures.

Authors:  Jean-Pierre Michel; Ritu Sadana
Journal:  J Am Med Dir Assoc       Date:  2017-05-04       Impact factor: 4.669

7.  Cohort Profile: The Ageing Trajectories of Health - Longitudinal Opportunities and Synergies (ATHLOS) project.

Authors:  Albert Sanchez-Niubo; Laia Egea-Cortés; Beatriz Olaya; Francisco Félix Caballero; Jose L Ayuso-Mateos; Matthew Prina; Martin Bobak; Holger Arndt; Beata Tobiasz-Adamczyk; Andrzej Pająk; Matilde Leonardi; Ilona Koupil; Demosthenes Panagiotakos; Abdonas Tamosiunas; Sergei Scherbov; Warren Sanderson; Seppo Koskinen; Somnath Chatterji; Josep Maria Haro
Journal:  Int J Epidemiol       Date:  2019-08-01       Impact factor: 7.196

8.  Data Resource Profile: Cross-national and cross-study sociodemographic and health-related harmonized domains from SAGE plus CHARLS, ELSA, HRS, LASI and SHARE (SAGE+ Wave 2).

Authors:  Nadia Minicuci; Nirmala Naidoo; Barbara Corso; Ilaria Rocco; Somnath Chatterji; Paul Kowal
Journal:  Int J Epidemiol       Date:  2019-02-01       Impact factor: 7.196

9.  Associations of Lifestyle Behaviour and Healthy Ageing in Five Latin American and the Caribbean Countries-A 10/66 Population-Based Cohort Study.

Authors:  Christina Daskalopoulou; Artemis Koukounari; José Luis Ayuso-Mateos; Martin Prince; A Matthew Prina
Journal:  Nutrients       Date:  2018-10-30       Impact factor: 6.706

10.  The structure and predictive value of intrinsic capacity in a longitudinal study of ageing.

Authors:  John R Beard; A T Jotheeswaran; Matteo Cesari; Islene Araujo de Carvalho
Journal:  BMJ Open       Date:  2019-11-02       Impact factor: 2.692

View more
  6 in total

1.  Trajectories of healthy ageing among older adults with multimorbidity: A growth mixture model using harmonised data from eight ATHLOS cohorts.

Authors:  Hai Nguyen; Dario Moreno-Agostino; Kia-Chong Chua; Silia Vitoratou; A Matthew Prina
Journal:  PLoS One       Date:  2021-04-06       Impact factor: 3.240

2.  Development and Validation of a Multidimensional Population-Based Healthy Aging Scale: Results From the China Health and Retirement Longitudinal Study.

Authors:  Junling Gao; Jixiang Xu; Yingwei Chen; Yujie Wang; Bo Ye; Hua Fu
Journal:  Front Med (Lausanne)       Date:  2022-02-14

3.  How Healthy Aging and Contact With Children Are Associated With Satisfaction in Middle-Aged and Older Parents in China: A Mediation Analysis.

Authors:  Jiangyun Chen; Yixin Zeng; Wenjun He; Jiao Yang; Dong Xu; Haomiao Li
Journal:  Front Public Health       Date:  2022-03-14

4.  Relationship between Dietary Macronutrients Intake and the ATHLOS Healthy Ageing Scale: Results from the Polish Arm of the HAPIEE Study.

Authors:  Urszula Stepaniak; Maciej Polak; Denes Stefler; Magdalena Kozela; Martin Bobak; Albert Sanchez-Niubo; José Luis Ayuso-Mateos; Josep Maria Haro; Andrzej Pająk
Journal:  Nutrients       Date:  2022-06-14       Impact factor: 6.706

5.  Immune age and biological age as determinants of vaccine responsiveness among elderly populations: the Human Immunomics Initiative research program.

Authors:  Jaap Goudsmit; Anita Huiberdina Johanna van den Biggelaar; Wouter Koudstaal; Albert Hofman; Wayne Chester Koff; Theodore Schenkelberg; Galit Alter; Michael Joseph Mina; Julia Wei Wu
Journal:  Eur J Epidemiol       Date:  2021-06-12       Impact factor: 8.082

6.  Risk of all-cause mortality associated with chronic obstructive pulmonary disease and the role of healthy ageing trajectories: a population-based study of middle-aged and older adults.

Authors:  Ivet Bayes-Marin; Albert Sanchez-Niubo; Daniel Fernández; Josep Maria Haro; Beatriz Olaya
Journal:  BMJ Open       Date:  2021-07-28       Impact factor: 2.692

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.