Literature DB >> 34040363

Assessing Frailty with the Tilburg Frailty Indicator (TFI): A Review of Reliability and Validity.

Robbert J Gobbens1,2,3, Izabella Uchmanowicz4.   

Abstract

OBJECTIVE: The Tilburg Frailty Instrument (TFI) is an instrument for assessing frailty in community-dwelling older people. Since its development, many studies have been carried out examining the psychometric properties. The aim of this study was to provide a review of the main findings with regard to the reliability and validity of the TFI.
METHODS: We conducted a literature search in the PubMed and CINAHL databases on May 30, 2020. An inclusion criterion was the use of the entire TFI, part B, referring to the 15 components. No restrictions were placed on language or year of publication.
RESULTS: In total, 27 studies reported about the psychometric properties of the TFI. By far, most of the studies (n = 25) were focused on community-dwelling older people. Many studies showed that the internal consistency and test-retest reliability are good, which also applies for the criterion and construct validity. In many studies, adverse outcomes of interest were disability, increased health-care utilization, lower quality of life, and mortality. Regarding disability, studies predominantly show results that are excellent, with an area under the curve (AUC) >0.80. In addition, the TFI showed good associations with lower quality of life and the findings concerning mortality were at least acceptable. However, the association of the TFI with some indicators of health-care utilization can be indicated as poor (eg, visits to a general practitioner, hospitalization).
CONCLUSION: Since population aging is occurring all over the world, it is important that the TFI is available and well known that it is a user-friendly instrument for assessing frailty and its psychometric properties being qualified as good. The findings of this assessment can support health-care professionals in selecting interventions to reduce frailty and delay its adverse outcomes, such as disability and lower quality of life.
© 2021 Gobbens and Uchmanowicz.

Entities:  

Keywords:  Tilburg Frailty Indicator; frailty; older people; reliability; validity

Mesh:

Year:  2021        PMID: 34040363      PMCID: PMC8140902          DOI: 10.2147/CIA.S298191

Source DB:  PubMed          Journal:  Clin Interv Aging        ISSN: 1176-9092            Impact factor:   4.458


Introduction

The Tilburg Frailty Instrument (TFI) is an instrument for assessing frailty in community-dwelling older people. It has been developed as a self-report instrument in that older people have to complete the TFI themselves.1 The TFI is based on the following conceptual definition of frailty: ‘frailty is a dynamic state affecting an individual who experiences losses in one or more domains of human functioning (physical, psychological and social), which is caused by the influence of a range of variables and increases the risk of adverse outcomes’.2 Both the conceptual definition of frailty and the TFI derived from it consider frailty as a multidimensional concept, including physical, psychological and social functioning of older people, and emphasize the importance of an integral approach to human functioning. The World Health Organization recommends this holistic approach to take care of frail, older people.3 Paying sole attention to physical frailty can lead to fragmentation of care2,4 and possibly to a reduction of the quality of care and a decrease in the experienced quality of life in frail older people. According to Gilardi et al, a multidimensional approach to frailty can be more effective to plan and implement care services, as well as establish prevention programs for frail older people.5 The TFI contains two parts: part A, on 10 determinants of frailty, and part B, on 15 components of frailty.1 The determinants are sex, age, marital status, education, income, ethnicity, lifestyle, life events, multimorbidity and living environment. The components of frailty refer to physical frailty (eight), psychological frailty (four) and social frailty (three). Physical frailty includes physically unhealthy, unexplained weight loss, difficulty walking, difficulty maintaining balance, poor hearing, poor vision, lack of strength in the hands and physical tiredness. Psychological frailty consists of the components of memory problems, feeling down, feeling nervous or anxious and unable to cope with problems. Social frailty includes living alone, lack of social relations and lack of social support. The total score of the TFI is 0–15, with a score ranging from 0 to 8 for physical frailty, 0 to 4 for psychological frailty and 0 to 4 for social frailty. Higher scores refer to greater frailty, as older persons with a total TFI score ≥5 are considered to be frail.1 Originally, the TFI was developed in the Netherlands by Gobbens et al and based on an extensive literature search and opinions of an international group of frailty experts, including geriatricians, gerontologists, nurses and psychologists.6 Their first study examined the psychometric properties of the TFI in two Dutch samples of community-dwelling persons aged 75 years and older.1 Subsequently, the TFI was translated into several languages, including Brazilian Portuguese,7 Danish,8 Italian,9 Portuguese,10 Polish,11 German,12 Chinese,13 Spanish,14 and Turkish.15 Until now, two systematic reviews and one narrative review have been published, indicating that the TFI is very suitable for assessing frailty among community-dwelling older people.5,16,17 According to Pialoux et al, both the TFI1 and the SHARE Frailty Index18 are potentially suitable for screening frailty in older people in primary care settings.16 Sutton et al concluded that the TFI has the most robust evidence of reliability and validity of 38 frailty assessment instruments, including frequently used instruments, such as the Phenotype of Frailty19 and the Frailty Index.17 In addition, the narrative review by Gilardi et al identified the TFI as the best screening instrument to use in public health because it was the only one of the selected instruments (including the Phenotype of Frailty,19 Vulnerable Elders Survey,20 Frailty Index,21 and the SHARE Frailty Index)18 with three features: a multidimensional structure, quick and easy to use, and an accurate risk prediction of adverse outcomes of frailty.5 In addition, De Witte et al used the TFI as a gold standard for the validation of their instrument, The Comprehensive Frailty Assessment Instrument.22 Since the TFI was developed approximately 10 years ago,1 and many studies into its psychometric properties have been carried out since then,17 this study aims to provide a review of the main findings regarding this issue. We present the reliability and validity of these studies.

Methods

Literature Search

We conducted a literature search in the PubMed and CINAHL databases on May 30, 2020, using “Tilburg Frailty Indicator AND psychometric properties”, “TFI AND psychometric properties”, “Tilburg Frailty Indicator AND validity”, “TFI AND validity”, “Tilburg Frailty Indicator AND reliability”, and “TFI AND reliability”. An inclusion criterion was the use of the entire TFI, part B, referring to the 15 components. No restrictions were placed on language or year of publication. The studies were screened and selected for inclusion by the first author. In total, 27 studies were selected for the purpose of this review.

Reliability and Validity

Reliability

Four types of reliability were distinguished: internal consistency, test-retest, inter-rater reliability, and parallel forms reliability. Internal consistency refers to consistency across items of measurement. Statistical techniques used for this purpose were Cronbach’s alpha, Kuder–Richardson Formula 20 (KR-20) and item correlations. Cronbach’s alpha and KR-20 values >0.70 were considered acceptable.23,24 The higher the item correlations, the better the internal consistency of the measurement instrument. Test-retest indicates consistency among time (stability). Correlations, simple agreement, kappa (chance-corrected agreement) and intraclass correlation coefficient (ICC) were used to determine test–retest reliability. The higher the correlation, simple agreement, kappa value and ICC, the higher the concordance between the two assessments will be. The correlation coefficient was evaluated using the classification of Callegari-Jacques (weak, <0.30; moderate, 0.30–0.60; strong, 0.60–0.90; very strong, ≥0.90).25 For the interpretation of the Kappa value, we used the Landis and Koch evaluation (absent, <0.10; weak, 0.10–0.20; fair, 0.21–0.40; moderate, 0.41–0.60; substantial, 0.61–0.80; nearly perfect, 0.81–1.00).26 The ICC was evaluated using the guideline provided by Koo and Li (poor <0.50; moderate, 0.50–0.75; good, 0.75–0.90; excellent >0.90).27 Inter-rater reliability concerns consistency across different researchers. Frequently used techniques to establish inter-rater reliability are Kappa and ICC. The fourth and final type of reliability, parallel forms reliability, assesses the correlation between two equivalent versions of a measurement instrument. High correlation between the two instruments indicates high forms reliability.

Validity

Six types of validity can be distinguished: criterion, construct, content, face, structural, and known-group validity. Criterion validity concerns the relation between the score on a measurement instrument and some external criterion.28 If the measurement instrument corresponds to a criterion assessed simultaneously, the validity is considered concurrent. If the measurement instrument forecasts a criterion value in the future, the validity is labeled predictive.28 Criterion validity can be checked by determining a correlation coefficient and conducting receiver operating characteristics (ROC) curve analyses and calculating the Area Under the Curve (AUC). An AUC <0.7, 0.7–0.8, 0.8–0.9, and ≥0.9, is considered poor (no discrimination), acceptable, excellent, and outstanding, respectively.29 Construct validity refers to the simultaneous process of measure and theory validation.30 Convergent and divergent (discriminant) validities constitute construct validity. Convergent validity involves the degree by which two measures of constructs should be or are related. By contrast, divergent validity indicates whether measures that should be unrelated, are unrelated.31 Correlation tests are performed to establish convergent and divergent validity and thus, construct validity. Content validity concerns the extent in which a measurement instrument includes all necessary components of the construct to be assessed.32 According to Burns and Grove, content validity is obtained from literature, representatives of the population concerned and experts.33 A measurement instrument has face validity if it appears to assess what it is supposed to assess and that it will work.34 As well as when determining content validity, representatives of the population concerned and experts can be involved; however, establishing face validity is more informal, compared to content validity. Structural validity refers to the extent to which an instrument covers the hypothetical dimension of a construct.32 According to Souza et al, factorial analysis and structural equation modeling are the appropriate statistical techniques to assess structural validity.35 Finally, known-group validity involves an instrument’s ability to make a distinction between groups. Group differences can be determined using a chi-square test and t-test. When describing the findings on the reliability and validity of the TFI in the included studies, the classification described above has been used foremost.

Results

Characteristics of the Studies

Table 1 presents the characteristics of the 27 included studies. The first studies were performed in 2010.1,36 Thirteen studies were carried out in the Netherlands,1,36–47 of which two were part of a large study that also collected data in other European countries.46,47 Three studies were exclusively conducted in Denmark,8,48,49 and two in Poland11,50 and Brazil.7,51 By far, most of the studies were conducted among community-dwelling older people. Two exceptions were studies including older Danish people admitted to a hospital49 and one study evaluating older Dutch residents of assisted living facilities.39 Two-thirds of the studies used a cross-sectional design, two of which were qualitative studies conducted in Denmark,8,48 and the other studies were characterized by a longitudinal design. The sample size varied from 14 to 27,527 people.46,48 The most commonly used age groups were people aged ≥65 years (eight studies) and aged ≥70 years (seven studies). One study only showed the mean age of the sample.8 The highest mean age was observed among Dutch people residing at assisted living facilities (84.8 years).39
Table 1

General Characteristics of the Studies Included

AuthorsYearCountrySettingDesignnAge in years, mean ± SDPrevalence figure
Gobbens et al12010The Netherlandscommunity-dwellingcross-sectional479≥ 75 years, 80.3 ± 3.847.1%
Metzelthin et al362010The Netherlandscommunity-dwellingcross-sectional532≥ 70 years, 77.2 ± 5.540.2%
Gobbens et al422012The Netherlandscommunity-dwellinglongitudinal484≥ 75 years, 80.3 ± 3.8-
Daniels et al372012The Netherlandscommunity-dwellinglongitudinal430≥ 70 years, 77.2 ± 5.540.2%
Gobbens et al402013The Netherlandscommunity-dwellingcross-sectional1,031≥ 65 years, 73.4 ± 5.827.7%
Theou et al462013Eleven European countries, including the Netherlandscommunity-dwellinglongitudinal27,527≥ 50 years, 65.3 ± 10.529.2%***
Santiago et al72013Brazilcommunity-dwellingcross-sectional219≥ 60 years, sample 1: 69.8 ± 7.8and sample 2: 71.3 ± 8.035.6% (sample 1); 31.7% (sample 2)
Andreasen et al82014Denmarkcommunity-dwelling*****cross-sectional****13; 2175.9 ± 6.9; 80.6 ± 6.5-
Uchmanowicz et al112014Polandcommunity-dwellingcross-sectional100≥ 60 years, 68.2 ± 6.540%
Gobbens & Van Assen412014The Netherlandscommunity-dwellinglongitudinal484≥ 75 years, 80.3 ± 3.8-
Mulasso et al92015Italycommunity-dwellingcross-sectional267≥ 65 years, 73.4 ± 6.044.6%
Gobbens et al392015The Netherlandsassisted living facilitiescross-sectional221≥ 55 years, 84.8 ± 8.976.5%
Coelho et al102015Portugalcommunity-dwellingcross-sectional252≥ 65 years, 79.2 ± 7.354.8*
Andreasen et al482015Denmarkcommunity-dwelling******cross-sectional****14≥ 65 years, 80.6-
Uchmanowicz et al502016Polandcommunity-dwellingcross-sectional212≥ 60 years, 70.6 ± 7.244.1%
Freitag et al122016Germanycommunity-dwellingcross-sectional210≥ 64 years, sample 1: 76.9 ± 5.7 and sample 2: 72.5 ± 5.546.7% (sample 1); 32% (sample 2)
Dong et al132017Chinacommunity-dwellingcross-sectional917≥ 60 years, 68.6 ± 6.612.4%
Renne & Gobbens452018The Netherlandscommunity-dwellingcross-sectional241≥ 70 years, 76.5 ± 5.132.8%
Santiago et al512018Brazilcommunity-dwelling*******longitudinal963≥ 60 years, 70.5 ± 8.244.2%
Vrotsou et al142018Spaincommunity-dwellingcross-sectional856≥ 70 years, 78.1 ± 4.9-
Topcu et al152019Turkeygeriatrics outpatient cliniccross-sectional198≥ 70 years, 77.7 ± 5.563.6%
Op Het Veld et al442019The Netherlandscommunity-dwellinglongitudinal2,420≥ 65 years, 76.3 ± 6.6-
Op Het Veld et al432019The Netherlandscommunity-dwellinglongitudinal2,420≥ 65 years, 76.3 ± 6.664.8%**
Alqahtani et al522020Saudi Arabiacommunity-dwelling********cross-sectional84≥ 65 years, 72.0 ± 4.728.0%
Gobbens et al382020The Netherlandscommunity-dwellinglongitudinal180≥ 70 years, 76.3 ± 5.129.4%
Gobbens & Andreasen492020Denmarkacutely admitted patientslongitudinal1,328≥ 65 years, 76.9 ± 7.553.1%
Zhang et al472020The Netherlands, Spain, Greece, Croatia, United Kingdomcommunity-dwellingcross-sectional2,250≥ 70 years, 79.7 ± 5.7-

Notes: *Cut-off for frailty was 6; **This study was conducted among pre-frail and frail individuals; ***This study did not use the original TFI; ****Cross-sectional qualitative research; *****Acutely admitted to hospital; ******Acutely admitted discharged to their own home; *******Users of primary health care services; ********in senior-living facilities, visiting outpatient clinic.

Abbreviation: SD, standard deviation.

General Characteristics of the Studies Included Notes: *Cut-off for frailty was 6; **This study was conducted among pre-frail and frail individuals; ***This study did not use the original TFI; ****Cross-sectional qualitative research; *****Acutely admitted to hospital; ******Acutely admitted discharged to their own home; *******Users of primary health care services; ********in senior-living facilities, visiting outpatient clinic. Abbreviation: SD, standard deviation. Using the original TFI cut-off point of 5,1 prevalence figures concerning the general population of community-dwelling older people ranged from 12.4% to 47.1% in samples of Chinese and Dutch individuals.1,13 Specifically, prevalence was higher among Turkish people admitted to a geriatrics outpatient clinic (63.6%),15 residents of assisted living facilities (76.5%)39 and among physical pre-frail and frail community-dwelling older people (64.8%).43 In the latter group, the frailty status of the participants was first assessed by the Phenotype of Frailty.19 Among community-dwelling older people, the prevalence of frailty was highest in a sample of Portuguese people (54.8%), while the cut-off point was 6.10 It should be noted that seven studies did not present a prevalence figure of frailty.8,14,41,42,44,47,48 A Dutch sample including 479/484 participants and a Dutch sample consisting of 2420 participants were used in three1,41,42 and two studies,43,44 respectively. Moreover, two other Dutch studies partially used the same sample.36,37 Additional details are displayed in Table 1.

Reliability of the TFI

In total,16 studies report the reliability of the TFI. The four types of reliability observed were internal consistency, test–retest, inter-rater and parallel forms reliability. Fifteen studies determined the internal consistency reliability and one study failed to do so (see Table 2).46 The Cronbach’s alpha for the TFI total was 0.66 (lowest)9 to 0.80 (highest),45 whereas the KR-20, calculated in three studies was 0.69, 0.70 and 0.78.10,14,52 Eight studies also present the Cronbach’s alpha for physical, psychological and social frailty, ranging from 0.57 to 0.79,7,9 0.37 to 0.63,1,50 and 0.25 to 0.59,13,50 respectively. The lowest and highest values of the KR-20, with regard to physical, psychological, and social frailty, were 0.6414 and 0.75,10 0.4810 and 0.58,14 and 0.2214 and 0.49,10 respectively. Six studies examined the internal consistency reliability of the TFI using corrected item-total correlations.9,12,15,36,50,52
Table 2

Internal Consistency Reliability of the TFI

AuthorsInternal Consistency Reliability
Gobbens et al1Cronbach’s alpha: total 0.73, physical 0.70, psychological 0.63, social 0.34
Metzelthin et al36Cronbach’s alpha: total 0.79Corrected item-total correlations: correlations ranged from 0.18 to 0.58 (mean 0.39)
Gobbens et al40Cronbach’s alpha: total 0.71, physical 0.67, psychological 0.54, social 0.51
Santiago et al7Cronbach’s alpha: total 0.78, physical 0.79, psychological 0.53, social 0.38
Uchmanowicz et al11Cronbach’s alpha: total 0.72 Cronbach’s alpha reliability coefficients after the removal of an item ranged from 0.68 (coping) to 0.73 (anxiety)
Mulasso et al9Cronbach’s alpha: total 0.66, physical 0.57, psychological 0.51, social 0.36Corrected item-total correlations for each item with the domains in general, accepted values, with some exceptions (unexplained weight loss, poor hearing)
Coelho et al10KR-20: total 0.78, physical 0.75, psychological 0.48, social 0.49
Uchmanowicz et al50Cronbach’s alpha: total 0.74, physical 0.72, psychological 0.37, social 0.59Corrected item-total correlations: ranged from 0.12 to 0.55
Freitag et al12Cronbach’s alpha: total 0.67, physical 0.66, psychological 0.43, social 0.36 Cronbach’s alpha reliability coefficients after the removal of an item ranged from 0.6 (physical tiredness)–0.69 (coping)Corrected item-total correlations ranged from 0.12 (memory problems) to 0.58 (physical tiredness)
Dong et al13Cronbach’s alpha: total 0.71, physical 0.71, psychological 0.51, social 0.25
Renne & Gobbens45Cronbach’s alpha: total 0.80, physical 0.74, psychological 0.61, social 0.51
Vrotsou et al14KR-20: total 0.69, physical 0.64, psychological 0.58, social 0.22
Topcu et al15Cronbach’s alpha: total 0.68 Cronbach’s alpha reliability coefficients after the removal of an item ranged from 0.62 (physical tiredness)–0.69 (lack of social relations)Corrected item-total correlations: ranged from −0.05 (living alone) to 0.57 (physical tiredness)
Alqahtani et al52KR-20: total 0.70, physical 0.68, psychological 0.57, social 0.42; the KR-20 after the removal of an item ranged from 0.66 (coping)–0.72 (poor hearing, physical tiredness) Corrected item-total correlations ranged from 0.10 (unexplained weight loss) to 0.47 (coping)
Zhang et al47Cronbach’s alpha: varied among five countries involved: total 0.70 (Spain)–0.75 (Croatia), physical 0.60 (Spain)–0.73 (The Netherlands), psychological 0.38 (UK)–0.55 (Greece, Croatia), social 0.22 (Greece)–0.43 (The Netherlands)

Abbreviation: KR-20, Kuder–Richardson formula.

Internal Consistency Reliability of the TFI Abbreviation: KR-20, Kuder–Richardson formula. Test–retest reliability was observed in nine studies, using Pearson correlations,1,7,10 Kappa,7,10,14,50 simple agreement,7,10,14 and ICC12,13,15,52 (see Table 3). Pearson correlations, with regard to frailty total were 0.88,7 0.90,1 and 0.91,10 using a period less than 3 weeks. The Pearson correlation coefficient was 0.79 for a 1-year period.1 The correlation coefficients, with respect to the frailty domains, are detailed in Table 3. Using Kappa, the level of agreement varied greatly at item level.7,10,14,50 Obviously, the level of agreement was higher when the simple agreement technique was used7,10,14 (displayed in Table 3). For frailty total, the ICC ranged from 0.86 to 0.9915,52 in two studies involving a follow-up period of 1 week.
Table 3

Test–Retest Reliability, Inter-Rater Reliability, and Parallel Forms Reliability of the TFI

AuthorsTest-Retest Reliability
Gobbens et al1Two weeks (Pearson correlation coefficient): total 0.90, physical 0.87, psychological 0.77, social 0.86One-year period (Pearson correlation coefficient): total 0.79, physical 0.78, psychological 0.67, social 0.76
Santiago et al77–10 days (Pearson correlation coefficient): total 0.88, physical 0.88, psychological 0.67, social 0.89.7–10 days (simple agreement); items ranged from 0.63 (memory) to 1.00 (live alone, lack of support)7–10 days (Kappa coefficients): five items had nearly perfect agreement (0.81–1.00), four items had substantial agreement (0.61–0.80) agreement, two items had moderate agreement (0.41–0.60), four items had fair agreement (0.21–0.40)
Coelho et al1012–16 days: (Pearson correlation coefficient): total 0.91, physical 0.87, psychological 0.75, social 0.87.12–16 days (simple agreement): items ranged from 0.78 (depression, anxiety) to 0.97 (living alone)12–16 days (Kappa coefficients): ranged from 0.52 to 0.95
Uchmanowicz et al5010–14 days: (Kappa coefficient): a high level of agreement with regard to items was demonstrated with coefficients ranging from 0.96 to 1.00
Freitag et al1220 weeks (ICC): total 0.87, physical 0.85, psychological 0.75, social 0.84
Dong et al1310–25 days (ICC): total 0.88, physical 0.80, psychological 0.65, social 0.81
Vrotsou et al147–14 days (simple agreement): item ranged from 0.77 to 0.99, except anxious (0.66)7–14 days (Kappa coefficient): 0.98 (living alone), 0.23 to 0.34 (physical tiredness, anxious, coping, support) 0.46 to 0.57 (all other items, except unintentional weight loss)
Topcu et al15One week (ICC): 0.99
Alqahtani et al52One week (ICC): 0.86
Inter-rater reliability
Topcu et al15Two observers on the same day (ICC): 0.99
Parallel forms reliability
Theou et al46Kappa coefficients: TFI and Frailty Index 0.52, Frailty Index based on Comprehensive Geriatric Assessment 0.52, Clinical Frailty Scale 0.38, Frailty Phenotype 0.37, Edmonton Frail Scale 0.27, FRAIL scale 0.27, Groningen Frailty Indicator (GFI) 0.50
Dong et al13Kappa coefficients: ranged for TFI items and alternative measures from 0.12 (hearing problems)–1.00 (living alone)

Abbreviation: ICC, intraclass correlation coefficient.

Test–Retest Reliability, Inter-Rater Reliability, and Parallel Forms Reliability of the TFI Abbreviation: ICC, intraclass correlation coefficient. Table 3 also shows the inter-rater and parallel forms reliability of the TFI. Inter-rater reliability was identified in only one study in which, on the same day, two observers came to almost perfect agreement (ICC = 0.99).15 Finally, in two studies, parallel forms reliability was determined.13,46 One of these studies examined the agreement between the TFI (frailty total) and other validated frailty instruments. The highest agreement existed with the Frailty Index (FI) and the Comprehensive Geriatric Assessment and less agreement was found with the Edmonton Frail Scale and the Frail scale.46 The other study used Kappa to establish the level of agreement between TFI items and alternative measures resulting in low and high levels of agreement (0.12 for hearing problems and 1.00 for living alone).13

Validity of the TFI

Criterion Validity

Tables 4–6 show an overview of the validity of the TFI, which includes 24 of the 27 studies. Criterion validity was the most frequently presented type of validity, with concurrent and predictive characteristics reported in 10 and 9 studies, respectively (see Table 4). Concurrent validity was determined using different techniques: correlations, AUC and regression analyses. Frequently occurring adverse outcomes of interest were lower quality of life,1,10,39,40,45 disability,1,9,10,13,39,47 and an increase in health-care utilization.1,9,10,13,39 Using different instruments for assessing quality of life (WHOQOL-BREF,53 WHOQOL-OLD54 and EUROHIS-QOL)55, four included studies demonstrated that higher scores on the TFI were correlated with lower quality of life.1,10,40,45 Regarding disability and referring to limitations in performing activities of daily living (ADL) and/or instrumental activities of daily living (IADL), the AUCs were excellent in three studies,1,9,47 and acceptable for ADL and poor for IADL in two studies.10,13 Many different indicators of health-care utilization were used, eg, visits to a general practitioner, hospitalization and receiving nursing care. In most studies, the findings were poor9,10,13; however, Gobbens et al observed an excellent AUC for reporting personal care, and acceptable AUCs for reporting nursing and informal care.1 Three studies determined the discriminating ability of the TFI to identify frailty with other validated frailty measures using the AUC: the Groningen Frailty Indicator (GFI),10 the phenotype of frailty,10,13 Survey of Health, Ageing and Retirement in Europe-Frailty Instrument (SHARE-FI),47 and the Frailty Index (FI).13 The AUCs for GFI, SHARE-FI and the FI were excellent; however, the AUCs for the phenotype of frailty were not unequivocal. Two studies examined the correlations between the TFI and other frailty measures: the GFI,36 Sherbrooke Postal Questionnaire (SPQ),36 and the Phenotype of Frailty.14 The correlations between the TFI and GFI, SPQ and phenotype of frailty were 0.76, 0.42, and 0.49, respectively.14,36
Table 4

Criterion Validity of the TFI

AuthorsCriterion Validity
Gobbens et al1Concurrent using correlations: large for total frailty and quality of life domains physical, psychological, environmental, and medium to large for quality of life domain social assessed with the WHOQOL-BREF Concurrent using AUC: excellent for disability and reporting personal care, acceptable for reporting nursing and informal care, poor for reporting visits general practitioner and hospitalization
Metzelthin et al36Concurrent using correlations: correlation between TFI and Groningen Frailty Indicator (GFI) was 0.76; the correlation between TFI and Sherbrooke Postal Questionnaire (SPQ) was 0.42
Gobbens et al42Predictive, one and two years later, using multiple regression analyses: an increase in predictive accuracy of most adverse outcomes (disability, indicators of health care utilization, and quality of life) Predictive, one and two years later, using AUC: excellent for disability and reporting personal care, acceptable for reporting nursing, informal care, and facilities in residential care, poor for contacts with health care professionals and hospitalization, not significant for visits to a general practitioner
Gobbens et al40Concurrent using sequential regression analyses: all components of the TFI together explained the scores on quality of life domains physical health, psychological, social relations, environmental assessed with the WHOQOL-BREF
Daniels et al37Predictive, one year later, using OR unadjusted: disability 3.96, 95% CI = 2.48–6.30, mortality 3.08, 95% CI = 1.04–9.13, hospitalization 2.59, 95% CI = 1.36–4.90 Predictive, one year later, using AUC: poor for disability, mortality, and hospitalization
Theou et al46Predictive, two and five years later, using AUC: acceptable for mortality
Gobbens and Van Assen41Predictive, two and four years later, using sequential regression analyses): the items physical unhealthy, difficulty in walking, difficulty in maintaining balance, physical tiredness, feeling down, and lack of social support predicted quality of life scores assessed with the WHOQOL-BREF
Mulasso et al9Concurrent using AUC: excellent for disability, poor for falls and visits to general practitioner
Gobbens et al39Concurrent using regression analyses: all three domains (physical, psychological, social) together had an effect on disability, quality of life (physical health, psychological, social relationships, environmental), visits to a general practitioner, and falls; no effects were observed with contacts with health care professionals, hospitalization, receiving personal care, receiving nursing care, receiving informal care, and facilities in nursing home/rehabilitation center
Coelho et al10Concurrent using multiple regression analyses: the TFI domains predicted 38.7% and 42.1% of quality of life variance, assessed with EUROHIS-QOL, and WHOQOL-OLD, respectivelyConcurrent using AUC: acceptable for disability in ADL, poor for disability in IADL, and health care utilizationConcurrent using AUC: discriminating ability was excellent regarding identifying frailty by the Groningen Frailty Indicator (GFI) (0.86, 95% CI = 0.85–0.93) and acceptable for frailty assessed with the Frailty Phenotype by Fried et al (0.75, 95% CI = 0.68–0.81)
Dong et al13Concurrent using AUC: excellent for depression; acceptable for disability in ADL, and low social support; poor for disability in IADL, and for health care utilization (hospitalization, emergency use) Concurrent using AUC: discriminating ability was excellent regarding identifying frailty by the Frailty Phenotype by Fried et al (0.87, 95% CI = 0.87–0.93) and the Frailty Index (0.86, 95% CI = 0.82–0.91)
Renne and Gobbens45Concurrent using sequential multiple linear regression analyses): all fifteen items together explained 36.5% of the variance of the score of quality of life
Santiago et al51Predictive, 1 year later, using sequential logistic regression analyses: total frailty predicted mortality, adjusted for sex and age (HR = 2.72, 95% CI = 1.01–7.31); after controlling for sociodemographic variables the frailty domains (physical, psychological, social) improved the prediction of hospitalization (OR = 1.83, 95% CI = 1.10–3.06), falls (OR = 2.08, 95% CI = 1.21–3.58), disability in ADL (OR = 3.03, 95% CI = 1.45–6.29), disability in IADL (OR = 1.51, 95% CI = 1.05–2.17)
Vrotsou et al14Concurrent using correlations: the correlation between total and the Frailty Phenotype by Fried et al was 0.49
Op Het Veld et al44Predictive, 2 years later: positive predictive value 42.6% and negative predictive value 75.2% for disability in IADL
Op Het Veld et al43Predictive, 2 years later, using AUC: poor for mortality, hospitalization, and disability in IADL
Gobbens et al38Predictive, 1 year later, using linear and logistic regression analyses: the three frailty domains together predicted disability, visits general practitioner, contacts with health care professionals, receiving nursing; no effects were found on hospitalization, receiving personal care, falls (after controlling for sociodemographic characteristics and multimorbidity)Predictive, 1 year later, using AUC: excellent for total frailty with respect to disability and receiving personal care; poor for receiving nursing, falls, and hospitalization
Gobbens and Andreasen49Predictive, 6 months later, using sequential logistic regression analyses: physical and social frailty predicted readmission and mortality; psychological frailty predicted only readmission
Zhang et al47Concurrent using AUC: all AUC were excellent for SHARE-FI, and disability; all AUC were acceptable for limited function, poor mental health, and feeling lonely

Abbreviations: AUC, area under the curve; CI, confidence interval; ADL, activities of daily living; IADL, instrumental activities of daily living; OR, odds ratio; SHARE-FI, SHARE Frailty Instrument.

Table 6

Content Validity, Face Validity, Structural Validity, and Known-Groups Validity of the TFI

AuthorsContent Validity
Gobbens et al1Determined by representatives of professional disciplines and people aged ≥75 years
Theou et al46The TFI records items referring to limitations in self-rated health, nutrition, mobility, energy, cognition, mood
Andreasen et al48Determined by interviewing frail community-dwelling older people: the majority of important frailty items were covered by the TFI; pain, sleep quality, meaningful activities and spirituality are not present in the TFI
Face Validity
Gobbens et al1Checked by participants at geriatric meetings
Andreasen et al8A pretest was performed by cognitive interviewing. The TFI was translated and adapted in such a manner that it can be implemented and further tested in clinical practice
Structural Validity
Vrotsou et al14Confirmatory factor analysis) (CFA) showed that fit indexes of a second-order model of three factors (frailty domains) were acceptable
Known-Groups Validity
Vrotsou et al14Total and physical frailty scores differentiated well between frail and non-frail people defined by the GFST and the SPPB

Abbreviations: GFST, Gérontopôle Frailty Screening Tool; SPPB, Short Physical Performance Battery.

Criterion Validity of the TFI Abbreviations: AUC, area under the curve; CI, confidence interval; ADL, activities of daily living; IADL, instrumental activities of daily living; OR, odds ratio; SHARE-FI, SHARE Frailty Instrument. Construct Validity of the TFI Content Validity, Face Validity, Structural Validity, and Known-Groups Validity of the TFI Abbreviations: GFST, Gérontopôle Frailty Screening Tool; SPPB, Short Physical Performance Battery. The predictive validity was established using regression analyses and AUC. The prediction period ranged from 6 months49 to 5 years.46 Most studies only used a period of 1 or 2 years.37,38,42–44,51 In particular, adverse outcomes of interest were disability,37,38,42–44,51 mortality,37,43,46,49,51 increased health-care utilization37,38,42,43,51 and lower quality of life.41,42 Disability was predicted by the TFI; however, Gobbens et al38,42 indicated that the predictive value was excellent, while Op Het Veld et al concluded that it was poor,43 presenting a positive predictive value of 42.6% and a negative predictive value of 75.2%.44 In four studies, the findings concerning mortality were at least acceptable; only Op Het Veld et al qualified the predictive value of the TFI as poor.43 As with the determination of concurrent validity, many different indicators of health-care utilization were used as outcome variables, resulting in findings that were not unanimous. For instance, the TFI predicted hospitalization in a sample of 430 Dutch people ≥70 years (OR = 2.59, 95% CI = 1.36–4.90),37 in comparison to a poor AUC in a sample of 2420 Dutch people ≥65 years.43 Both studies that aimed to assess the predictive value of the TFI for quality of life provided evidence that the TFI predicts lower quality of life using a follow-up period of 1, 2 and 4 years.41,42

Construct Validity

The reviewed studies frequently determined the criterion validity of the TFI, as well as the construct validity. Ten studies determined the construct validity of the TFI,1,7,9,10,12–14,45,47,52 with nine of these studies found to address the issue of convergent and divergent validity, with exception of the study by Renne and Gobbens (see Table 5).45 Most of the studies found the expected correlations between total frailty, domains, items, and alternative measures, while two studies observed similar correlations between the psychological and social domains of the TFI and alternative psychological.10,13 Moreover, the Spanish study demonstrated a stronger correlation between social frailty and IADL, assessed with the Lawton scale, than between social frailty and physical frailty, which was expected.14 Finally, in the Brazilian study, the item coping was not correlated as expected.7
Table 5

Construct Validity of the TFI

AuthorsConstruct Validity
Gobbens et al1Convergent and divergent validity using correlations: the 15 single components and three domains correlated as expected with validated measures
Santiago et al7Convergent and divergent validity using correlations: the correlations between the items and their corresponding measures were as expected, except the item ‘coping’
Mulasso et al9Convergent and divergent validity using correlations: all items correlated with single corresponding frailty measures
Coelho et al10Convergent and divergent validity using correlations): physical and social domains correlated as expected with alternative measures; psychological measures showed similar correlations with the psychological and physical domains of the TFI
Freitag et al12Convergent and divergent validity using correlations: total frailty was correlated with all alternative measures of frailty. In addition, the domains correlated good with corresponding alternative measures
Dong et al13Convergent and divergent validity using correlations: the three domains correlated with alternative measures; however, psychological measures had similar correlations with the psychological and physical domain
Renne & Gobbens45Construct validity using correlations: total and all three domains correlated with all six quality of life domains assessed with the WHOQOL-OLD
Vrotsou et al14Convergent and divergent validity using correlations: the three frailty domains correlated as expected, except social frailty that had a stronger correlation with the Lawton scale than physical frailty
Alqahtani et al52Convergent and divergent validity: correlations as expected between total and six frailty-related measures
Zhang et al47Convergent and divergent validity using correlations: validity of physical, psychological and social frailty was supported by all the alternative measures in all five countries

Content and Face Validity

Table 6 presents the content, face, structural and known-groups validity of the TFI. Three studies determined the content validity of the TFI,1,46,48 showing that the TFI contains the majority of important frailty items. Based on interviews with community-dwelling older people, Andreasen et al argue that items referring sleep quality, pain, spirituality, and meaningful activities should be included in the TFI.48 Face-validity was established in only two studies.1,8 In the first study, the TFI was checked by participants at geriatric meetings and in the second study, conducted in Denmark, after translating the TFI, a pretest was performed by cognitive interviewing concluding that the TFI could be further tested in practice.

Structural Validity and Known-Groups Validity

Both structural validity and known-groups validity were only determined by Vrotsou et al (see Table 6).14 Fit indexes of a second-order model of three factors (frailty domains) were acceptable and the TFI differentiated well between frail and non-frail people, as defined by the Gérontopôle Frailty Screening Tool (GFST)56 and the Short Physical Performance Battery (SPPB).57

Discussion

The TFI is a questionnaire that is increasingly used to determine frailty in older people. Since the introduction of the TFI in 2010, many studies on its psychometric properties have been conducted. In this study, we aimed to present a review of findings regarding the reliability and validity of the TFI. The present study is the first to assess these psychometric properties of the TFI. The literature search performed through May 30, 2020, showed that 27 studies reported on the psychometric properties of the TFI, as related to reliability, validity or both. Most of the studies (n = 25) were focused on community-dwelling older people. Since the TFI was developed in the Netherlands,1 it is not surprising that 13 of the included studies were conducted in that country and there appeared to be large differences in the prevalence of frailty. The lowest and highest prevalence figures were 12.4%13 and 76.5%,39 respectively. Our review shows higher prevalence figures of frailty are closely related to greater age. This finding is supported by a systematic review containing cross‐sectional data from community‐based cohorts.58 In addition, we found higher prevalence figures among people residing in settings other than the community, eg, acutely admitted patients,49 residents of assisted living facilities,39 and people admitted to a geriatrics outpatient clinic.15 Zhang et al emphasized that the mean score on the TFI is influenced by the country of residence of the participants.47 For example, the mean score for people with an average age of 75.3 years from Greece was 5.80, while this score was 4.25 in a sample of Dutch people with an average age of 81.5 years.47 The country of residence of the participants may also be the possible explanation for the high prevalence among older Portuguese people (54.8%);10 this prevalence is exceptionally high if we consider that the TFI cut-off point of 610 was taken, not the established cut-off point of 5.1 Our study shows that the reliability of the TFI has been comprehensively assessed; 15 and nine studies examined the internal consistency and test–retest reliability, respectively. In many cases, the reliability of the TFI, reflected by Cronbach’s alpha, was >0.70, which indicates satisfactory reliability.24 The internal consistency of the individual domains of the TFI was worse, particularly for the psychological and social domains, which can be explained by the fact that these domains only contain four and three items. Test–retest reliability, reflected by correlations (>0.60) and ICC (>0.75), was good.25,27 Moreover, kappa coefficients showed a substantial or nearly perfect level of agreement concerning many individual TFI items. The two other types of reliability, inter-rater and parallel forms, have only been examined to a limited extent.13,15,46 The validity of the TFI has been established in 24 studies. In many studies focusing on criterion validity (concurrent and predictive), adverse outcomes of interest were disability, increased health-care utilization, lower quality of life and mortality. Regarding disability, studies predominantly show results that are excellent,1,9,38,42,47 with AUCs >0.80.29 The findings pertaining to an increase in health-care utilization present a less unambiguous picture and seem to depend strongly on the indicator used (eg, personal care, informal care, visits to a general practitioner or hospitalization). The TFI, however, clearly has a poor association with visits to a general practitioner1,9,42 and hospitalization.1,13,37–39,42,43 In all six studies using quality of life as the outcome, the TFI showed good associations with lower quality of life, independent of the quality of life instrument that has been used.1,10,40–42,45 Moreover, the TFI predicted mortality, with only Op Het Veld et al43 qualifying the predicted value for this outcome as low. It should be noted that follow-up periods were short, consisting of 1 or 2 years, except the study by Theou et al46 that included a follow-up of 5 years. Unfortunately, Theou et al did not use the original TFI for assessing frailty.46 Therefore, we recommend determining the prediction value of the original TFI for mortality using a follow-up period >5 years. Finally, concerning criterion validity, the TFI showed discriminating ability in regards to GFI,10 SHARE-FI,47 and FI,13 reflected by excellent AUCs (>0.80). Construct validity of the TFI, distinguishing convergent and divergent validity, was established in nine studies. In many cases, the domains and components of the TFI correlated as expected with alternative measures providing evidence for good construct validity. Content and face validity were only determined in three1,46,48 and two studies,1,8 respectively. Two of these studies were qualitative in nature,8,48 which seems ideally suited to establish these types of validity. In our opinion, to validating an instrument like the TFI, the involvement of preferably older people, is necessary, whom, from their perspective, can indicate what needs to be questioned in the context of frailty. Only one study assessed the structural and known groups validity.14 These findings were satisfactory. Some limitations of our study should be noted. First, different instruments and questions have been used to assess disability, eg, Groningen Activity Restriction Scale (GARS)59 and Katz scale,60 and indicators of health-care utilization. Differences concerning variables controlled in regression analyses also exist, which could have influenced the findings. Secondly, many of the included studies aimed to determine the reliability and validity of the TFI were conducted in samples of community-dwelling older people. More studies are needed in order to establish good reliability and validity of the TFI in other samples of older people (eg, residents of assisted living facilities, nursing homes, mental health institutions and hospitalized patients). Thirdly, many data on reliability and validity were available and it was impossible to present all these data. A selection based on the most relevant data for the present study was therefore utilized and the individual studies were referred to for more detailed information. Finally, the majority of studies have been performed in Europe and especially in the Netherlands, with less in Brazil,7,51 Saudi Arabia,52 and China.13 The determination of the psychometric properties of the TFI in the other continents of the world, such as Africa, North America and Australia, is recommended. The TFI is specifically designed for screening of multidimensional frailty among community-dwelling older people; our review provided much evidence that the TFI is ideally suited to that target group. Well-known determinants of frailty, assessed with the TFI, are higher age, being a woman, and a low socio-economic status, expressed by low-educational and low-income level.36,61,62 To prevent frailty, it is recommended to start screening among people that meet those criteria. Based on this screening, primary health-care professionals (eg, general practitioners, nurses, physiotherapists, occupational therapists) can determine, preferably in a multidisciplinary consultation, whether these people need a comprehensive geriatric assessment (CGA) and a care needs assessment. In addition, the findings of the screening of frailty using the TFI can provide primary health-care professionals a first direction to the interventions that should be conducted next. Evidence of beneficial effects of multidomain interventions compared to unidomain interventions on frailty status or score is limited but increasing.63 In conclusion, our literature search revealed 27 studies examining the reliability and/or validity of the TFI. Many studies showed that the internal consistency and test–retest reliability are noteworthy, as well as the criterion and construct validities. In contrast, the association of the TFI with some indicators of health-care utilization can be indicated as poor (eg, visits to a general practitioner or hospitalization). Knowing that population aging is occurring all over the world, the availability of the TFI is critical. In addition to the qualification of its psychometric properties as good, it is well known that the TFI is a user-friendly instrument for assessing frailty. The findings of this assessment can support health-care professionals in selecting interventions to reduce frailty and delay its adverse outcomes, such as disability and lower quality of life.
  56 in total

1.  The EUROHIS-QOL 8-item index: psychometric results of a cross-cultural field study.

Authors:  Silke Schmidt; Holger Mühlan; Mick Power
Journal:  Eur J Public Health       Date:  2005-09-01       Impact factor: 3.367

2.  Measuring frailty in Dutch community-dwelling older people: Reference values of the Tilburg Frailty Indicator (TFI).

Authors:  Marcel A L M van Assen; Esther Pallast; Fatima El Fakiri; Robbert J J Gobbens
Journal:  Arch Gerontol Geriatr       Date:  2016-07-25       Impact factor: 3.250

3.  The predictive validity of the Tilburg Frailty Indicator: disability, health care utilization, and quality of life in a population at risk.

Authors:  Robbert J J Gobbens; Marcel A L M van Assen; Katrien G Luijkx; Jos M G A Schols
Journal:  Gerontologist       Date:  2012-01-04

4.  The psychometric properties of three self-report screening instruments for identifying frail older people in the community.

Authors:  Silke F Metzelthin; Ramon Daniëls; Erik van Rossum; Luc de Witte; Wim J A van den Heuvel; Gertrudis I J M Kempen
Journal:  BMC Public Health       Date:  2010-03-31       Impact factor: 3.295

Review 5.  Construct validity: advances in theory and methodology.

Authors:  Milton E Strauss; Gregory T Smith
Journal:  Annu Rev Clin Psychol       Date:  2009       Impact factor: 18.561

6.  Psychometric properties of the Tilburg Frailty Indicator in older Spanish people.

Authors:  Kalliopi Vrotsou; Mónica Machón; Francisco Rivas-Ruíz; Estefanía Carrasco; Eugenio Contreras-Fernández; Maider Mateo-Abad; Carolina Güell; Itziar Vergara
Journal:  Arch Gerontol Geriatr       Date:  2018-06-01       Impact factor: 3.250

7.  Looking for frailty in community-dwelling older persons: the Gérontopôle Frailty Screening Tool (GFST).

Authors:  B Vellas; L Balardy; S Gillette-Guyonnet; G Abellan Van Kan; A Ghisolfi-Marque; J Subra; S Bismuth; S Oustric; M Cesari
Journal:  J Nutr Health Aging       Date:  2013-07       Impact factor: 4.075

8.  The prediction of readmission and mortality by the domains and components of the Tilburg Frailty Indicator (TFI): A prospective cohort study among acutely admitted older patients.

Authors:  Robbert J J Gobbens; Jane Andreasen
Journal:  Arch Gerontol Geriatr       Date:  2020-04-17       Impact factor: 3.250

Review 9.  Psychometric properties of multicomponent tools designed to assess frailty in older adults: A systematic review.

Authors:  Jennifer L Sutton; Rebecca L Gould; Stephanie Daley; Mark C Coulson; Emma V Ward; Aine M Butler; Stephen P Nunn; Robert J Howard
Journal:  BMC Geriatr       Date:  2016-02-29       Impact factor: 3.921

10.  The Tilburg Frailty Indicator (TFI): New Evidence for Its Validity.

Authors:  Robbert Jj Gobbens; Petra Boersma; Izabella Uchmanowicz; Livia Maria Santiago
Journal:  Clin Interv Aging       Date:  2020-02-21       Impact factor: 4.458

View more
  3 in total

1.  Association between social capital and frailty and the mediating effect of health-promoting lifestyles in Chinese older adults: a cross-sectional study.

Authors:  Shan Hu; Canhuan Jin; Shaojie Li
Journal:  BMC Geriatr       Date:  2022-03-02       Impact factor: 3.921

2.  A Comparison of Different Modeling Techniques in Predicting Mortality With the Tilburg Frailty Indicator: Longitudinal Study.

Authors:  Robbert Gobbens; Tjeerd van der Ploeg
Journal:  JMIR Med Inform       Date:  2022-03-30

3.  A Novel Digital Nutrition Diary for Geriatric Patients at High Risk of Frailty Syndrome.

Authors:  Patrick Elfert; Julia Berndt; Louisa Dierkes; Marco Eichelberg; Norbert Rösch; Andreas Hein; Rebecca Diekmann
Journal:  Nutrients       Date:  2022-01-18       Impact factor: 5.717

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.