Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Cross-cultural adaptation and psychometric evaluation of the Sinhala version of Lawton Instrumental Activities of Daily Living Scale

  • Dhammika Deepani Siriwardhana ,

    Roles Conceptualization, Formal analysis, Investigation, Writing – original draft, Writing – review & editing

    deepani.siriwardhana.15@ucl.ac.uk

    Affiliations Research Department of Primary Care and Population Health, University College London, London, United Kingdom, Department of Disability Studies, Faculty of Medicine, University of Kelaniya, Ragama, Sri Lanka

  • Kate Walters,

    Roles Conceptualization, Writing – review & editing

    Affiliation Research Department of Primary Care and Population Health, University College London, London, United Kingdom

  • Greta Rait,

    Roles Conceptualization, Writing – review & editing

    Affiliation Research Department of Primary Care and Population Health, University College London, London, United Kingdom

  • Juan Carlos Bazo-Alvarez,

    Roles Formal analysis, Writing – review & editing

    Affiliations Research Department of Primary Care and Population Health, University College London, London, United Kingdom, Centro de Estudios de Población, Universidad Católica los Ángeles de Chimbote (ULADECH-Católica), Chimbote, Peru

  • Manuj Chrishantha Weerasinghe

    Roles Conceptualization, Formal analysis, Investigation, Writing – review & editing

    Affiliation Department of Community Medicine, Faculty of Medicine, University of Colombo, Colombo, Sri Lanka

Abstract

Introduction

Instrumental activities of daily living (IADL) are cognitively complex activities related to independent living in the community. Robust IADL scales are needed, however the psychometric properties of instruments have been little evaluated. There is no validated instrument for Sri Lankan older populations. Sri Lanka has the highest proportion of older people in South Asia with rapid population ageing. Therefore, it is essential to have standard instruments to assess activity limitations. We aimed to cross-culturally adapt the original Lawton Instrumental Activities of Daily Living Scale from English to Sinhala and evaluate the psychometric properties of the Sinhala version.

Methods

Cross-cultural adaptation of the instrument was performed. The instrument was validated in a sample of 702 community-dwelling older adults aged 60 years and above in Sri Lanka. Reliability (internal consistency and inter-rater reliability) was assessed. Construct validity of the scale was evaluated by performing exploratory and confirmatory factor analysis and testing convergent and divergent validity.

Results

The Lawton IADL scale was successfully adapted to Sri Lankan context. Internal consistency of the scale was very high (Cronbach’s alpha = 0.91). Very good inter-rater reliability was observed with very good agreement for all items. Inter-class correlations for overall IADL score ranged from 0.57 to 0.91. Results of the exploratory and confirmatory factor analyses supported the unidimensionality of the scale. Goodness of fit indices in confirmatory factor analysis were in acceptable range (CFI = 0.98, SRMR = 0.06, NNFI = 0.97). Strength of associations were significant and in the expected direction. Results of the known group validity were also significant, confirming the convergent and divergent validity.

Conclusion

The Lawton IADL scale was successfully translated and culturally adapted to Sinhala language. The Sinhala version demonstrated excellent reliability and construct validity. Given good psychometric properties, this scale would be recommended for use in future research.

Introduction

‘Activities of daily living’ measurement instruments are commonly used to assess the activity limitations. Two types of activities are assessed; Basic Activities of Daily Living (BADL) and Instrumental Activities of Daily Living (IADL). BADL are cognitively less complex self-maintaining tasks which include feeding, dressing, bathing, toileting, etc. They do not require attentional processes. Conversely, IADL are more complex and require higher level cognitive functions such as memory, attention and executive functions [1, 2]. Example IADL tasks are food preparation, housekeeping tasks, taking own medication, handling finances etc. These activities are important to lead an independent life [3]. IADL limitations often present with mild cognitive impairment and early dementia [4].

A number of questionnaires are available to assess IADL [3, 5], however, no gold standard exists [5]. One of the most widely used is the Lawton Instrumental Activities of Daily Living Scale developed in 1969 [2, 6]. A few modifications to the original scale are also available in the literature; modified Lawton- Brody scale proposed in 1988 [7], Lawton IADL scale in MFA (Multidimentional Functional Assessment of Older Adults) [8] and Lawton IADL scale in MAI (Multilevel Assessment Instrument) [9]. At present no agreement on the quality of IADL questionnaires exists. Moreover, the psychometric properties of commonly used IADL questionnaires are either unavailable or do not meet the standard quality [3].

Cultural adaptability, reliability and validity of the original [1013] and Lawton IADL scale in MAI [14] have been tested in older populations (aged ≥60 or ≥65 years) in studies conducted in Iran, Spain, Greece, Singapore and Hong Kong. Study populations included patients with dementia [10], outpatients of memory clinics [12], patients who attended emergency rooms in with a hip or wrist fracture due to a fall [11], institutionalized older adults [14] and community living older adults [13].

We found three studies reporting IADL in Sri Lankan older adults [1517]. However, none of the studies reported use of standard questionnaire to assess IADL, and instead used a few selected IADL tasks. Only four IADL tasks have been assessed in two studies [15, 16] and six in the remaining study [17]. We could not identify any culturally adapted, psychometrically tested instrument available to assess instrumental activities of daily living in Sri Lanka. It is important to have a standard instrument for this purpose as Sri Lanka has the highest proportion of older adults among South Asian countries [18] and considered as one of the fastest ageing populations in the South East Asia [19]. IADL limitations are associated with both poor quality of life [20] and increased healthcare costs [21]. Understanding the current IADL limitations of older adults in Sri Lanka using a robust standardised measure will inform planning of health and social care services with anticipated rapid population ageing. Therefore, the objective of this study was to cross culturally adapt the original Lawton Instrumental Activities of Daily Living scale from English to Sinhala and to evaluate the psychometric properties of the Sinhala version.

Materials and methods

The methodology of this study comprised of two phases. Phase one involved cross cultural adaptation of the Lawton IADL scale. Phase two was evaluating the psychometric properties of the scale which included testing the reliability (internal consistency and inter-rater reliability) and validity (cross-cultural validity, structural validity, convergent and divergent validity). Fig 1 illustrates the study methodology.

Lawton Instrumental Activities of Daily Living Scale

We choose the original Lawton IADL scale for this study [2] (please see S1 File). It is a widely used instrument to measure IADL of older adults in different settings; community, clinics or hospitals [6]. Moreover, it is easy to administer (within 10–15 minutes). Most newer scales have also been derived from the original Lawton IADL scale [5]. The scale encompasses eight activities which includes ability to use telephone, shopping, food preparation, housekeeping, laundry, transport, ability to handle finances and responsibility for own medication. Each activity has varying number of response options indicating participant’s degree of ability to perform each activity starting from completely independent status to completely dependent status. Despite having number of responses under each activity, participants are classified into two categories as 1 (independent) and 0 (dependent) during the scoring. The total score of the scale ranges from 0 (dependent) to 8 (independent). Historically women were scored on all the items of the scale and men were scored for only five items of the scale excluding the food preparation, housekeeping and laundering activities. However, the current recommendation is to assess all activities with both sexes [22].

The original scale uses the self-repot/surrogate report ‘actual performance’ question stem, and later versions offered options of assessing self-repot/surrogate report ‘actual performance’ and ‘capacity’. We decided to use the self-reported ‘capacity’ question stem with the items and response structure of the original scale as in the Sri Lankan cultural context older adults are often supported by their own children and relatives. According to the recent census 17% of adults aged 60 years and above live with their own children [23]. Hence, some are not fully engaged doing certain IADL activities like housekeeping, shopping, preparing meals, handling finances even though they are fully capable of those. Sri Lanka is a country with good gender equality and we therefore used all the items in the scale with both males and females. Permission was granted from Oxford University Press to translate and republish the original scale in Sinhala language.

Phase I- Cross cultural adaptation process

We used the systematic method proposed by Beaton and colleagues [24] for the cross cultural adaptation of self-reported measures.

  1. Stage 1- Forward translation: Two independent bilingual translators who have a background in public health (DDS) and community medicine (MCW) translated the English version of entire instrument into Sinhala. The mother tongue of the both translators were Sinhala. They independently recorded the issues they had while translating the instrument.
  2. Stage 2- Synthesis of the translations: A common Sinhala version of the instrument was created using the two independent translated versions.
  3. Stage 3-Back translation: The synthesis version created at the second stage was used for back translation process. Two translators (TW, SJ different to stage 1 translators) who are fluent in both English and Sinhala languages conducted the back translations independently. Both were blind to the original instrument or original independent translated versions. Two back translated versions were compared with the original English version of the instrument for a validity check.
  4. Stage 4- Expert committee review: A panel of experts from medical, allied health science, sociological backgrounds and translators (forward and backward) reviewed the two forward translations, two backward translations with the original scale. Consultations were conducted in person or using digital technology. MCW coordinated this stage. Semantic, idiomatic, experiential and conceptual equivalence of the instrument were discussed at these meetings. Any issues raised were addressed and a preliminary version of the instrument was created and circulated among the review members.
  5. Stage 5- Pre-test: The preliminary version of the instrument was pre-tested with five male and female older adults in different age categories living in the district where the psychometric testing was planned. The pre-final version of the instrument was created to use in the psychometric evaluation.

Phase II- Psychometric evaluation

Study design and participants.

Psychometric evaluation; reliability and validity testing of the instrument was carried out as part of a large population based cross sectional study conducted in a district of Sri Lanka (Kegalle). The study population was community dwelling older adults, aged 60 years and above permanently residing in the rural sector of the district. Older adults who were unable to give the informed consent; older adults with severe dual hearing and vision impairment, aphasia following a stroke, severe stages of dementia, and those with unstable severe mental illnesses and terminally ill were excluded. The estimated sample size was 750 participants. Three-stage area probability sampling was utilized to recruit the participants. Fifty clusters were selected using probability proportionate to size technique covering entire district. Fifteen participants were recruited from each cluster based on the population demographics of rural sector of Sri Lanka [25]. According to the scale of sample size adequacy by Comrey and Lee (1992) sample of 500 considered as very good where 1000 or more considered as excellent in Exploratory Factor Analysis (EFA) [26] Five nursing graduates collected the data from the entire sample. They were given a comprehensive training on all aspects of the study. Participation for the study was voluntary and informed written consent was obtained from all the participants prior to collect data. The ethical clearance for this study was obtained from two ethics review committees at University College London (Project ID: 8155/001) and Faculty of Medicine, University of Colombo, Sri Lanka (Protocol No. EC-16-071).

Data analyses: Participants’ characteristics and distribution of Lawton IADL scale-Sinhala version scores.

The characteristics of the study sample was described with descriptive statistics. The eight items of the scale were coded to preserve the original response structure as they do not have uniform response structure (ability to use telephone (1–4), shopping (1–4), food preparation (1–4), housekeeping (1–5), laundry (1–3), transport (1–5), responsibility for own medication (1–3) and ability to handle finances (1–3). The minimum number represents the response indicating complete dependent status for each item whilst the maximum number represent the response indicating highest independent status. However, when assigning scores according to the guidelines of the scale, response for each item was coded either as 0 (dependent) or 1 (independent). Hence, the total score of the scale ranges from 0 to 8.

Reliability testing.

We assessed internal consistency (the extent to which different items measured the same construct [27, 28]) using standardized Cronbach’s alpha (as scale items do not have uniform response structure) and interpreted the same using the criteria proposed by Nunnally [29].

We assessed inter-rater reliability (IRR) [28] in a randomly selected 12% of the total sample (n = 89), representing 26 clusters. The number of participants recruited from each cluster varied from 1–5 with the modes of 3 and 4. Research assistants (5 raters) administered the IADL scale. After a gap of 2.5 to 3 hours the primary investigator (PI- DDS) re-administered the scale with the same participants. Therefore, each participant has been assessed by two raters (A/B/C/D/E and DDS).

We assessed the IRR of the each individual item considering its original response structure (ordinal) and after scoring (binary). Participants with missing values were excluded. For ordinal case, inter-rater reliability was calculated using unweighted percentage agreement coefficient, quadratic weighted Cohen’s kappa and Gwet’s AC2 with quadratic weights [30]. For binary case unweighted percentage agreement coefficient, Cohen’s kappa and Gwet’s AC1 were used. Both Gwet’s AC1 and AC2 agreement coefficients are corrected for chance agreement and adjusted for misclassification errors. Moreover, they are consistent with the percentage agreement [31]. Values of Cohen’s kappa, Gwet’s AC1 and AC2 were interpreted using criteria proposed by Landis and Koch [32]. Intra Class Correlation (ICC) was used to assess the agreement of the overall score of the scale between each rater and the PI. Single rating, absolute agreement, two way mixed effects model was used [33]. All the agreement coefficients and ICCs were computed using kappaetc user written Stata programme. Stata version 14 (StataCorp, College Station, Texas, USA) was used for the analyses. Guidelines for Reporting Reliability and Agreement Studies (GRRAS) proposed by Kottner et al were followed [34].

Validity testing.

We assessed the construct validity of the IADL scale, including cross cultural validation (as above), structural validation (using factor analysis [28, 35]) and hypothesis testing [36] to establish the convergent and divergent validity of the scale [37, 38].

Exploratory factor analysis (EFA) explores the underlying factor structure of a construct [39, 40]. We performed EFA with 702 participants to test the hypothesis that the scale is unidimensional i.e. the 8 items in the scale represents one construct (instrumental activities of daily living). Original response structure of the scale was used in the analysis. Parallel analysis (PA) was run to determine the number of factors to retain in the model. PA was carried out on polychoric (two step) correlations with permuted samples, using principal component estimation and mean eigenvalue criterion [41]. Principal axis factoring was chosen as the factor extraction method because our data is ordinal and it violates the assumption of multivariate normality [40]. Principal axis factoring is also capable of detecting weak factors [42]. The Kaiser-Meyer-Olkin (KMO) statistic and Bartlett’s test of sphericity were used to determine the appropriateness of running the factor analysis. KMO values varies from 0 to 1 and values >0.5 are acceptable [43]. Bartlett’s test requires to yield significant result (p<0.05). Communalities ≥0.4 and factor loadings ≥ 0.5 were considered as satisfactory [39]. The analysis was performed on the polychoric (two step) correlations using SPSS R-menu v2.0 [44].

Confirmatory factor analysis (CFA) was performed to explore whether the observed data fit hypothesised factor structure of the IADL scale. Analysis was performed with the original response structure. To accommodate the ordinal response structure of the scale items, CFA was performed on asymptotic covariance matrix that calculated using the polychoric correlations. Diagonally weighted least square technique was used as the estimation method, recommended use when fitting structural equation model with ordinal variables [45]. Several goodness of fit indices were evaluated to determine the model fit. Evaluated fit indices include chi-square value (Satorra-Bentler scaled chi-square) with its degrees of freedom and associated p value, Relative/normed (χ2/df) chi-square, Root Mean Square Error of Approximation (RMSEA), Non-Normed Fit Index (NNFI)/ Tucker Lewis Index (TLI), Comparative Fit Index (CFI), Standardised Root Mean Residual (SRMR) and Parsimonious Normed Fit Index (PNFI). Insignificant chi square at a threshold of 0.05 is indicative of good model fit [46]. No consensus is available for the acceptable ratio of relative chi-square. Wheaton et al suggested a value of 5.0 [47]. For RMSEA Tuker and Lewis [48] suggested a cut-off of 0.06 whereas Steiger [49] proposed a strict upper limit as 0.07. For NNFI and CFI cut-off value of ≥ 0.95 accepted as good model fit [48, 50]. For SRMR a value of ≤ 0.08 considered as appropriate [50]. No threshold level has been specified for PNFI. CFA was performed on LISREL 9.30 student edition.

Historically Lawton et al (1969) proposed using the full scale (8 items) with females and five items (excluding food preparation, housekeeping and laundry) for males [2]. However, they had not checked the structural validity of IADL scale on this aspect. Therefore, we performed both EFA and CFA for females and males separately including all items.

The Barthel index of daily living measures the disability or dependence in basic activities of daily living, which are cognitively less complex tasks than IADL. [51]. Mild cognitive impairment is also associated with impairments in IADL [52]. Montreal cognitive assessment (MoCA) is a brief screening tool for mild cognitive impairment [53]. To assess the convergent validity, we hypothesised that IADL score is positively correlated with Barthel index score and MoCA score. Spearman’s correlation coefficient was used to quantify the magnitude of the correlation. We used following criteria to interpret the size of the correlation coefficients; (0 to ±0.3) negligible, (±0.3 to ±0.5) low, (±0.5 to ±0.7) moderate, (±0.7 to ±0.9) high and (±0.9 to 1.0) as very high correlation [54].

We used known group validity method to assess the divergent validity of the IADL scale. Advanced age is associated with the limitations of IADL [55]. Therefore, we hypothesised lower IADL scores for older age groups. The participants of this study were divided into two groups based on the median age of the sample. Median IADL score of the two groups were tested using Mann-Whitney U non-parametric test since our IADL score does not follow a normal distribution. The significance level was set as 0.05. Both analyses performed using IBM SPSS 24 software (SPSS INc., Chicago, IL, USA).

Results

Cross cultural adaptation of Lawton IADL scale

  1. Stage 1- Forward translation was performed as planned. Both forward translators encountered following issues. The last response for the item 1- ‘Ability to use telephone’, in the original scale is ‘does not use telephone at all’. We felt that this response can be interpreted in different ways. A person could be not using a telephone at all since he/she does not have a one or incapable of using it. Incapability could be due to an impairment or the person has never used it before and have no skills to use it. The same issue was noted for the last response of item 6-‘Mode of transportation’. Both translators were uncertain about the identical Sinhala word to ‘instrumental’.
  2. Stage 2- PI prepared the synthesis version with the aid of both Sinhala versions.
  3. Stage 3- Backward translation was also carried out as planned. Both backward translated versions showed good agreement with the original English version.
  4. Stage 4- By considering the issues raised in the forward translation process (stage 1), panel of experts agreed to replace the last response of item 1 with the meaning of ‘incapable of using the telephone at all’ in Sinhala. However, they acknowledged that still the response could not be applicable to a person who has never used a telephone. Hence, the suggestion was to ask whether they have ever used a telephone if their response is ‘incapable of using the telephone at all’ and make a note in the questionnaire. Similarly the last response for item 6 (‘does not travel at all’) was replaced from ‘incapable of travelling at all’. Example apparels used in the second response of item 5-‘Laundry’ were replaced from ‘socks’ and ‘stockings’ to ‘small hanker chief’ and ‘small towel’, as more relevant to Sri Lankan older adults living in tropical climate. Example activities used in the final item-‘ability to handle finances’ were ‘budgets, writes checks, pays rent, bills, goes to bank’. They were replaced from making a payment for electricity or water bill and doing bank transactions. We could not find an identical Sinhala word for the word ‘instrumental’. Therefore, we substituted that word to ‘non-basic’ in Sinhala.
  5. Stage 5- No difficulties were encountered in pre-testing and the IADL Sinhala version and showed good acceptability.

Psychometric evaluation of Lawton IADL scale-Sinhala version

Socio-demographic characteristics of the study participants.

Data collection was conducted from 03rd October to 23rd December 2016. Seven hundred forty six participants were recruited for the study. Twenty three participants were excluded as they are not fully conversant in Sinhala language. Twenty one participants were excluded as they had never used a telephone and/or were completely unaware of how to cook. Five males and eight females had never used a telephone. Seven males were unaware of how to cook. One male participant was excluded for both reasons. Therefore, the effective sample was 702.

A sub sample from Sinhala speaking participants was invited to test the IRR. Six participants invited were excluded when testing for IRR. Of the six, two had not used a telephone ever and response for one item in the scale was missing for four participants in the PI’s dataset. Fig 2 demonstrates the study flow chart.

Table 1 presents the socio-demographic characteristics of study sample (n = 702) and the sub sample used to test IRR (n = 83). The percentage of the females in the study sample was 53.7%. The median (IQR) age of the sample was 67 (63, 75) years. The age of the participants ranged from 60 years to 94 years. The median (IQR) age of the sub sample used to test IRR was 68 (63, 73) years. The age of the participants was ranged from 60 years to 91 years.

thumbnail
Table 1. Socio-demographic characteristics of the study participants.

https://doi.org/10.1371/journal.pone.0199820.t001

Distribution of Lawton IADL scale-Sinhala version scores.

The frequency distribution of the responses for each item and overall score are presented in S1 and S2 Figs respectively. A negatively skewed distribution was observed for responses of all the items and overall score. Table 2 presents the median and inter quartile range for the scores of each item. None of the items’ or total score distributed normally.

thumbnail
Table 2. Item descriptive statistics of IADL scale-Sinhala version.

https://doi.org/10.1371/journal.pone.0199820.t002

Internal consistency and Inter-rater reliability.

The internal consistency of the scale with 8 items assessed by Cronbach’s alpha was 0.918. Cronbach’s alpha if an item deleted is presented in the Table 3.

thumbnail
Table 3. Cronbach’s alpha if an item deleted in IADL scale-Sinhala version.

https://doi.org/10.1371/journal.pone.0199820.t003

Table 4 presents the unweighted percentage agreement coefficient, weighted Cohen’s kappa and Gwet’s AC2 agreement coefficient for each item according to the responses in the original scale. Relatively low absolute percentage agreement was observed for the ‘Housekeeping’ item between PI and all five raters compared with the other items. However, for all the items primary investigator had a satisfactory absolute/unweighted percentage agreement coefficient ranged from 0.62 to1.00, poor to excellent weighted Cohen’s kappa (0.00 to 1.00) and substantial to almost perfect Gwet’s AC2 (0.62 to 1.00) agreement coefficient with all raters. Interestingly, weighted Cohen’s kappa was not computed when the percentage agreement too high or too low indicating the ‘kappa paradox’[56]. Please refer S1 Table for IRR for each item after scoring as binary. Rater A, C, and D showed ICC values above 0.8 indicating an excellent reliability. In overall, ICC values for all the raters were above 0.5 and indicate moderate to excellent reliability [33].

thumbnail
Table 4. Item wise inter-rater reliability with original response structure for Lawton IADL scale-Sinhala version and ICC for overall IADL score.

https://doi.org/10.1371/journal.pone.0199820.t004

Exploratory factor analysis.

Kaiser-Meyer-Olkin measure of sampling adequacy was 0.898 which is considered a ‘very good’ value [57]. The Significance value of Bartlett’s test of sphericity was <0.001, indicating that the correlations between the items were significantly different from zero. Results of the parallel analysis suggested to extract one factor (Fig 3), indicating the unidimensionality of the scale. The first factor explained 79.4% of the total variance. As shown in Table 5, the communalities of 8 items varied from 0.392 to 0.903 and factor loadings were varied from 0.626 to 0.950. Item scale correlation (corrected) for all the items were above 0.7 except for item 1. The polychoric (two step) correlations matrix is available in S2 Table.

EFA results by sex also showed a stable item structure (8 items) across both females and males. Parallel analysis suggested to extract one factor in both cases (S3 and S4 Figs). The percentage of variance explained by the first factor was 80.2% for females and 81.5% for males. The communalities of 8 items varied from 0.357 to 0.934 and 0.421 to 0.925 for females and males respectively. The factor loadings were varied from 0.598 to 0.966 for females and from 0.649 to 0.962 for males. The polychoric (two step) correlations matrixes by sex are available in S3 and S4 Tables. Please refer S5 Table for additional results.

Confirmatory factor analysis.

CFA results are presented in Table 6. The measurement model with standard factor loadings and uniquenesses are presented in S5 Fig. Standardized factor loadings were ranged from 0.660 to 0.958. Values of goodness of fit indices; NNFI, CFI and SRMR were in acceptable range indicating an excellent model fit. However, the chi-square value was significant, χ2 (20, 702) = 144.42, p<0.001. The value of relative chi-square (χ2/df) was 7.22 and not in the acceptable range. Similarly, RMSEA value was too high and neither in acceptable range.

In CFA by sex, standardized factor loadings were ranged from 0.645 to 0.973 and from 0.673 to 0.981 for females and males respectively. All the goodness of fit indices except chi-square, and RMSEA were in the acceptable range for both sexes. Please refer S6 and S7 Tables for additional results.

Item 1 (ability to use telephone) consistently demonstrated low communality, factor loading and item-scale correlation in EFA and low standardized factor loading in CFA. Furthermore, Cronbach’s alpha was slightly higher when item 1 was deleted from the scale (see Table 3). This finding was consistent even in the sex stratified analysis.

Convergent validity.

The Spearman’s correlation coefficients between Lawton IADL score and the scores of Barthel index and MoCA were 0.61 and 0.41, indicating a moderate and low strength of association respectively. Both correlation coefficients were significant at p<0.001.

Divergent validity.

The median age of the sample was 67 years. Therefore, the sample divided into two groups as age ≤67 years and >68 years. The median (IQR) IADL scores for both groups were 8 (0) and 7 (1) (Mann-Whitney U = 37,974, p<0.001) demonstrating a lower IADL score in older group as hypothesised.

Discussion

Summary of main findings

The Lawton Instrumental Activities Daily Living scale was successfully translated and culturally adapted to Sri Lankan context. The Sinhala version of the scale demonstrated excellent reliability and construct validity. Internal consistency of the scale was very high. Satisfactory agreement was observed between PI and the raters for all the items in the scale. With regard to the overall score, ICC values were between 0.57 and 0.91 which is indicative of moderate to very good agreement. Findings of EFA and CFA strongly supported the unidimensionality of the scale. In EFA, communalities and factor loadings for all the items were well above the cut-off values. Similarly all the goodness of fit indices in CFA were in the acceptable range except chi-square and RMSEA. Eight item structure scale was stable across both females and males. Results of the sex stratified EFA and CFA were also consistent with the main analysis. We observed moderate and low positive correlations between IADL score and scores of Barthel index and MoCA respectively. The scale was capable of detecting the difference of overall IADL score between age groups.

Reliability

In line with other studies, Lawton IADL-Sinhala version has demonstrated an excellent internal consistency. Cronbach’s alpha coefficient was 0.91. Of all studies Spanish version has demonstrated the highest alpha value of 0.94 [11] whilst 0.86, 0.84 in Hong Kong Chinese (Lawton IADL-CV) and Greek versions respectively [12, 14]. The lowest was observed in the Persian version (Lawton IADL-PV) [10].

We found ICC values ranging from 0.57 to 0.91 for five raters, with three raters having values above 0.8, in a relatively large sample of 83 participants. The inter-rater reliability of the original scale was 0.85, however this study was with a small sample (n = 12), interviewed by one interviewer in the presence of the second rater who did not participate in the interviewing process [2]. Two further studies have reported the inter-rater reliability, with ICCs of 0.96 [10] and 0.99 [14] in similarly small studies. In the latter, inter-rater reliability was assessed with 9 participants on video-taped IADL abilities, and this method (videos) has been shown to produce higher inter-rater reliability [58].

Unlike our study, none of the prior studies have reported the item-wise inter-rater reliability of the scale. In those studies, ICC was computed based on the total score of the scale. It does not reflect how each rater marked the response for each item based on participant’s response.

Validity

EFA results of our study strongly supported the unidimensionality of the scale and corroborate with the existing literature [11, 14]. In our study first factor explained 79.4% variance whilst 70.6% and 50.1% variances explained by the Hong Kong Chinese and Spanish versions correspondingly. Eight item structure has an excellent factorial validity across both sexes [11]. We excluded 8 male participants from the analysis since they were completely unaware of how to cook. However, this was only 2.4% of total males in the sample. In contrast, Ng et al (2006) found two strong factors underlying physical and cognitive domains of IADL in a multi-ethnic Asian population in Singapore [13]. Two factors explained 87.5% of variance. Interestingly, physical IADL domain included 5 items (grocery shopping, getting to places outside the house, doing housework/handyman work, doing laundry and preparing meals) and cognitive domain included remaining three items (using the telephone, taking medications and managing finances).

All the reported goodness of fit indices of CFA (RMSEA, TLI, and CFI) were satisfactory and all factor loadings were significant in the Spanish version [11]. Similarly all the factor loadings were significant in Sinhala version and values of TLI, CFI, and SRMR were in the acceptable range. However, our RMSEA value was not in the acceptable range. One possible explanation could be use of diagonally weighted least square technique estimation. Nye et al showed that RMSEA appears to be affected by sample size [59]. With a sample of 400, they have observed increase of cut-off value for RMSEA whilst SRMR seems to be performing relatively well which is similar to our case. While the chi-square value of our model was significant, this may often be the case with large sample sizes [60] and when data deviate from multivariate normality [61].

We observed a substantially low communality and factor loading for the first item; ‘ability to use the telephone’ in EFA and relatively low standardized factor loading in CFA. Item one has demonstrated a relatively low inter-item correlation with item 3, 4, and 5. This pattern was consistent across both sexes. Interestingly, we observed the same results for EFA with the Spanish version [11]. A possible reason could be transition of use of land/fixed telephones to the mobile devices. At present, most of the households in Sri Lanka use mobile phones. According to the statistics of Telecommunications Regulatory Commission of Sri Lanka, 12.1 and 122 fixed access and cellular mobile subscriptions per 100 inhabitants were reported respectively in 2016 [62]. Using a telephone does not only require a reasonable cognitive function. It is also affected by the sensory function and fine motor skills particularly when using the mobile phones. Therefore, the patterns of activity limitations assessing here is slightly different to other IADLs. However, it is still measuring something different than the other items as it represents the ability to communicate with the outside world, an important part of ageing well.

The Spearman’s correlation coefficient between the scores of Barthel index and Lawton IADL Spanish version was above 0.4 and mean IADL score was significantly different by age [11]. In our study, value of the same correlation was 0.61 and the median IADL scores were significantly different by age. Known group validity results of the Singapore study has also shown significantly different mean IADL scores across different age groups and gradual decrease of the mean IADL score values with increasing age [13]. In line with our findings studies conducted in Greece and Hong Kong also supported the convergent and/or divergent validity of the Lawton IADL scale by means of the strength of associations or known group validity [12, 14].

Strengths of the study

The main strengths of this study were following a comprehensive and rigorous methodology, and using advanced statistical techniques to address the structure and distribution of the data. We performed the psychometric evaluation with a large random sample of Sinhala speaking community dwelling rural older adults. According to the recent census, 99% of Sri Lankan older adults live in the community [23]. In the original validation study, Lawton and Brody (1969) had not explored the factor structure of the IADL scale [2]. Therefore, we performed both EFA and CFA. We adhered to set of guidelines and best practices available in the literature when performing and reporting cross cultural adaptation of instrument, reliability testing, EFA and CFA [24, 33, 34, 39, 63, 64].

Study limitations and recommendations

We excluded 21 participants (<2%) from the study population who reported they had never used a telephone or were completely unaware of how to cook. We could not assess the test-retest reliability and responsiveness of the scale due to inadequate resources available. The limitation of the scale itself is absence of a reference point of time. However, no guidelines exists as to the appropriate choice of reference point of time either [5].

We used the participant’s self-reported capacity of performing each activity in our Lawton IADL-Sinhala version as a measure of self-reported efficacy or capacity in performing activities. Some people may over or underestimate their true capacity and this may therefore not reflect the actual performance of these activities. Alternatively a researcher can also use the self-reported ‘actual performance’ question stem and make notes about not applicable items (where the participant may be capable but does not regularly perform the activity). In our study we used an interviewer-administered questionnaire with the respondent only. In future research self-reported and a key informant reported abilities of performing IADL tasks could also be compared. We have also not specified a reference point of time, instead the scale asks the general ability of performing each activity in day-to-day life.

Conclusions

The Lawton IADL scale was successfully translated and culturally adapted to Sinhala language. The Sinhala version demonstrated an excellent reliability and construct validity with a large representative sample of Sinhala speaking community dwelling older adults. Given good psychometric properties, it is recommended to monitor the limitations of instrumental activities of daily living of community dwelling older adults in Sri Lanka. Lawton IADL-Sinhala version can be found in S2 File.

Supporting information

S1 File. The Lawton Instrumental Activities of Daily Living (IADL) Scale.

https://doi.org/10.1371/journal.pone.0199820.s001

(PDF)

S2 File. The Lawton Instrumental Activities of Daily Living (IADL) Scale-Sinhala version.

https://doi.org/10.1371/journal.pone.0199820.s002

(PDF)

S1 Fig. The frequency distribution of the responses for each item of the Lawton IADL-Sinhala version.

https://doi.org/10.1371/journal.pone.0199820.s003

(PDF)

S2 Fig. The frequency distribution of the overall Lawton IADL score-Sinhala version.

https://doi.org/10.1371/journal.pone.0199820.s004

(PDF)

S3 Fig. Parallel analysis results for females.

https://doi.org/10.1371/journal.pone.0199820.s005

(PDF)

S4 Fig. Parallel analysis results for males.

https://doi.org/10.1371/journal.pone.0199820.s006

(PDF)

S5 Fig. Confirmatory factor analysis model with standardized factor loadings.

https://doi.org/10.1371/journal.pone.0199820.s007

(PDF)

S1 Table. Item wise inter-rater reliability when original responses for Lawton IADL-Sinhala version coded as binary.

https://doi.org/10.1371/journal.pone.0199820.s008

(PDF)

S2 Table. Polychoric (two step) correlation matrix used in EFA (entire sample).

https://doi.org/10.1371/journal.pone.0199820.s009

(PDF)

S3 Table. Polychoric (two step) correlation matrix used in EFA for females.

https://doi.org/10.1371/journal.pone.0199820.s010

(PDF)

S4 Table. Polychoric (two step) correlation matrix used in EFA for males.

https://doi.org/10.1371/journal.pone.0199820.s011

(PDF)

S5 Table. Results of exploratory factor analysis by sex.

https://doi.org/10.1371/journal.pone.0199820.s012

(PDF)

S6 Table. Results of the confirmatory factor analysis by sex.

https://doi.org/10.1371/journal.pone.0199820.s013

(PDF)

S7 Table. Goodness of fit indices of confirmatory factor analysis by sex.

https://doi.org/10.1371/journal.pone.0199820.s014

(PDF)

Acknowledgments

Authors acknowledge all the participants of this study who generously gave their time and effort to respond the questionnaire. We wish to thank the assistance provided by the Director General of Health Services, Provincial Director of Health Services-Sabaragamuwa province, District Secretary and Regional Director of Health Services-Kegalle district, all the Divisional secretaries and the Grama Niladhari officers of the respective areas investigated in this study. We would like to thank the research assistants (H.P.M. Hewavitharane, P.G.I.P. Udayakumara, D.M.N. Kumara, A.P.A. Jayasinghe, M.V.J.D. Abeyrathna) and the field assistants (K.D.S.S. Dissanayaka, W.G.C. Pussallamada, G.R.K.M. Rathnasiri, W.S.T. Weerasinghe, R.J.S. Dharmawansha, W.R.P. Wijenayake, V.P.D.I. Danarathna, R.G.R.D. Anuruddha) for their invaluable support with data collection. We also acknowledge T. Wickramarachchi, S. Jayakodi and D.P.A. Jayawardana for supporting the back translation of the scale and giving critical comments during the cultural adaptation process. We thank Department of Community Medicine, Faculty of Medicine, University of Colombo for handling finances and arranging logistics for the field study. We would also like to extend our thanks to Dr Terry K. Koo (Director & Associate Professor, Foot Levelers Biomechanics Research Laboratory, New York Chiropractic College, Seneca Falls, NY) for providing input on reliability analysis. We would also like to thank the resource persons; Dr William Summerskill and Dr Viroj Tangcharoensathien (Senior Advisor, The International Health Policy Program, Ministry of Public Health, Thailand) of Writing Workshop on Population Ageing Research in ASEAN organized by the Institute for Population and Social Research (IPSR), Mahidol University, Thailand for providing valuable feedback to improve this paper.

References

  1. 1. Sousa LB, Prieto G, Vilar M, Firmino H, Simões MR. The Adults and Older Adults Functional Assessment Inventory. Research on Aging. 2014;37(8):787–814. pmid:25651593
  2. 2. Lawton MP, Brody EM. Assessment of Older People: Self-Maintaining and Instrumental Activities of Daily Living. The Gerontologist. 1969;9(3_Part_1):179–86.
  3. 3. Sikkes SAM, de Lange-de Klerk ESM, Pijnenburg YAL, Scheltens P, Uitdehaag BMJ. A systematic review of Instrumental Activities of Daily Living scales in dementia: room for improvement. Journal of Neurology, Neurosurgery &amp; Psychiatry. 2008;80(1):7–12.
  4. 4. Farias ST, Park LQ, Harvey DJ, Simon C, Reed BR, Carmichael O, et al. Everyday Cognition in older adults: Associations with neuropsychological performance and structural brain imaging. Journal of the International Neuropsychological Society: JINS. 2013;19(4): pmid:23369894
  5. 5. Gold DA. An examination of instrumental activities of daily living assessment in older adults and mild cognitive impairment. Journal of Clinical and Experimental Neuropsychology. 2012;34(1):11–34. pmid:22053873
  6. 6. Graf C. The Lawton Instrumental Activities of Daily Living Scale. AJN The American Journal of Nursing. 2008;108(4):52–62. pmid:18367931
  7. 7. Lawton MP. Scales to measure competence in everyday activities. Psychopharmacology Bulletin. 1988;24(4):609–14. pmid:3074322
  8. 8. Pfeiffer E, Duke University. Multidimensional functional assessment, the OARS methodology: a manual. Durham, N.C.: Center for the Study of Aging and Human Development, Duke University; 1975.
  9. 9. Lawton MP, Moss M, Fulcomer M, Kleban MH. A Research and Service Oriented Multilevel Assessment Instrument. Journal of Gerontology 1982;37(1):91–9. pmid:7053405
  10. 10. Mehraban AH, Soltanmohamadi Y, Akbarfahimi M, Taghizadeh G. Validity and Reliability of the Persian Version of Lawton Instrumental Activities of Daily Living Scale in Patients with Dementia. Medical Journal of the Islamic Republic of Iran. 2014;28:25-. pmid:25250267
  11. 11. Vergara I, Bilbao A, Orive M, Garcia-Gutierrez S, Navarro G, Quintana JM. Validation of the Spanish version of the Lawton IADL Scale for its application in elderly people. Health and Quality of Life Outcomes. 2012;10(1). pmid:23110491
  12. 12. Theotoka I, Kapaki E, Vagenas V, Ilias I, Paraskevas GP, Liappas I. Preliminary Report of a Validation Study of Instrumental Activities of Daily Living in a Greek Sample. Perceptual and Motor Skills. 2007;104(3):958–60.
  13. 13. Ng TP, Niti M, Chiam PC, Kua EH. Physical and cognitive domains of the Instrumental Activities of Daily Living: validation in a multiethnic population of Asian older adults. J Gerontol A Biol Sci Med Sci. 2006;61(7):726–35. pmid:16870636
  14. 14. Tong AYC, Man DWK. The Validation of the Hong Kong Chinese Version of the Lawton Instrumental Activities of Daily Living Scale for Institutionalized Elderly Persons. OTJR: Occupation, Participation and Health. 2002;22(4):132–42.
  15. 15. Nugegoda DB, Balasuriya S. Health and social status of an elderly urban population in Sri Lanka. Social Science & Medicine. 1995;40(4):437–42.
  16. 16. Fernando DN, Seneviratna RDA. Physical health and functional ability of an elderly population in Sri Lanka. The Ceylon Journal of Medical Science. 1993;36(1):9–16.
  17. 17. Østbye T, Malhotra R, Chan A. Thirteen Dimensions of Health in Elderly Sri Lankans: Results from a National Sri Lanka Aging Survey. Journal of the American Geriatrics Society. 2009;57(8):1376–87. pmid:19549023
  18. 18. Perera ELSJ. Ageing Population of Sri Lanka. 2017.
  19. 19. Rajapakse S, Maithripala C, Ibrahim S, Weerasinghe D, A. R. Disease patterns and care of older people in Sri Lanka. Indian J Gerontol. 2012;26:351–66.
  20. 20. Fusco O, Ferrini A, Santoro M, Lo Monaco MR, Gambassi G, Cesari M. Physical function and perceived quality of life in older persons. Aging Clinical and Experimental Research. 2012;24(1):68–73. pmid:22643307
  21. 21. Hill J, Fillit H, Thomas SK, Chang S. Functional Impairment, Healthcare Costs and the Prevalence of Institutionalisation in Patients with Alzheimer’s Disease and Other Dementias. PharmacoEconomics. 2006;24(3):265–80. pmid:16519548
  22. 22. Lawton MP, Moss M, Fulcomer M, Kleban MH. Multi-level assessment instrument manual for full-length MAI. Madlyn and Leonard Abramson Center for Jewish Life: 2003.
  23. 23. Department of Census and Statistics. Increasing Elderly Population in Sri Lanka. 2015.
  24. 24. Beaton DE, Beaton D, Bombardier C, Guillemin F, Ferraz M. Guidelines for the Process of Cross-Cultural Adaptation of Self-Report Measures. Spine (Philadelphia, Pa 1976). 2000;25(24):3186–91.
  25. 25. Department of Census and Statistics: Ministry of Policy Planning and Economic Affairs. Census of Population and Housing 2012. 2015.
  26. 26. Comrey AL, Lee HB. A First Course in Factor Analysis: Hillsdale, N.J: L. Erlbaum Associates 1992.
  27. 27. Terwee CB, Bot SDM, de Boer MR, van der Windt DAWM, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. Journal of Clinical Epidemiology. 2007;60(1):34–42. pmid:17161752
  28. 28. Ramaker C, Marinus J, Stiggelbout AM, van Hilten BJ. Systematic evaluation of rating scales for impairment and disability in Parkinson's disease. Movement Disorders. 2002;17(5):867–76. pmid:12360535
  29. 29. Nunnally J. Psychometric theory. New York: McGraw-Hill; 1994.
  30. 30. Gwet KL. Computing inter-rater reliability and its variance in the presence of high agreement. British Journal of Mathematical and Statistical Psychology. 2008;61(1):29–48. pmid:18482474
  31. 31. Gwet KL. Handbook of Inter-Rater Reliability. The Definitive Guide to Measuring the Extent of Agreement Among Raters. Gaithersburg, MD 20886–2696 USA: Advanced Analytics, LLC; 2012.
  32. 32. Landis JR, Koch GG. The Measurement of Observer Agreement for Categorical Data. Biometrics. 1977;33(1):159–74. pmid:843571
  33. 33. Koo TK, Li MY. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. Journal of Chiropractic Medicine. 2016;15(2):155–63. pmid:27330520
  34. 34. Kottner J, Audigé L, Brorson S, Donner A, Gajewski BJ, Hróbjartsson A, et al. Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed. Journal of Clinical Epidemiology. 2011;64(1):96–106. pmid:21130355
  35. 35. Goodwin LD. The Role of Factor Analysis in the Estimation of Construct Validity. Measurement in Physical Education and Exercise Science. 1999;3(2):85–100.
  36. 36. Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Quality of Life Research. 2010;19(4):539–49. pmid:20169472
  37. 37. Guyatt GH, Feeny DH, Patrick DL. Measuring health-related quality of life. Annals of Internal Medicine. 1993;118(8):622–9. pmid:8452328
  38. 38. Streiner DL, Norman GR, Cairney J. Health Measurement Scales: A practical guide to their development and use. Oxford, UK: Oxford University Press; 2015.
  39. 39. Costello AB, Osborne JW. Best Practices in Exploratory Factor Analysis: Four Recommendations for Getting the Most From Your Analysis. Practical Assessment 2005;10(7):1–9.
  40. 40. Fabrigar LR, Wegener D, MacCallum R, Strahan E. Evaluating the use of exploratory factor analysis in psychological research. Psychological methods. 1999;4(3):272–99.
  41. 41. Garrido LE, Abad FJ, Ponsoda V. A new look at Horn’s parallel analysis with ordinal variables. Psychological Methods. 2012;18(4):454–74. pmid:23046000
  42. 42. de Winter JCF, Dodou D. Factor recovery by principal axis factoring and maximum likelihood factor analysis as a function of factor pattern and sample size. Journal of Applied Statistics. 2012;39(4):695–710.
  43. 43. Kaiser HF. A second generation little jiffy. Psychometrika. 1970;35(4):401–15.
  44. 44. Basto M, Pereira JM. An SPSS R-Menu for Ordinal Factor Analysis. Journal of Statistical Software. 2012;46(4).
  45. 45. Li C-H. The performance of ML, DWLS, and ULS estimation with robust corrections in structural equation models with ordinal variables. Psychological methods. 2016;21(3):369–87. pmid:27571021
  46. 46. Barrett P. Structural equation modelling: Adjudging model fit. Personality and Individual Differences. 2007;42(5):815–24.
  47. 47. Wheaton B, Muthen B, Alwin DF, Summers GF. Assessing Reliability and Stability in Panel Models. Sociological Methodology. 1977;8:84–136.
  48. 48. Tucker LR, Lewis C. A reliability coefficient for maximum likelihood factor analysis. Psychometrika. 1973;38(1):1–10.
  49. 49. Steiger JH. Understanding the limitations of global fit assessment in structural equation modeling. Personality and Individual Differences. 2007;42(5):893–8.
  50. 50. Hu Lt, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling: A Multidisciplinary Journal. 1999;6(1):1–55.
  51. 51. Mahoney FI, Barthel DW. Functional Evaluation: The Barthel Index. Maryland State Medical Journal. 1965;14:61–5.
  52. 52. Jekel K, Damian M, Wattmo C, Hausner L, Bullock R, Connelly PJ, et al. Mild cognitive impairment and deficits in instrumental activities of daily living: a systematic review. Alzheimer's Research & Therapy. 2015;7(1). pmid:25815063
  53. 53. Nasreddine ZS, Phillips NA, Bédirian V, Charbonneau S, Whitehead V, Collin I, et al. The Montreal Cognitive Assessment, MoCA: A Brief Screening Tool For Mild Cognitive Impairment. Journal of the American Geriatrics Society. 2005;53(4):695–9. pmid:15817019
  54. 54. Hinkle DE, Jurs SG, Wiersma W. Applied statistics for the behavioral sciences. Second ed. Boston: Houghton Mifflin 1988.
  55. 55. van der Vorst A, Zijlstra GAR, Witte ND, Duppen D, Stuck AE, Kempen GIJM, et al. Limitations in Activities of Daily Living in Community-Dwelling People Aged 75 and Over: A Systematic Literature Review of Risk and Protective Factors. PLOS ONE. 2016;11(10):e0165127. pmid:27760234
  56. 56. Feinstein AR, Cicchetti DV. High agreement but low Kappa: I. the problems of two paradoxes. Journal of Clinical Epidemiology. 1990;43(6):543–9. pmid:2348207
  57. 57. Hutcheson G. Factor Analysis. 1999 2017/08/08. In: The Multivariate Social Scientist [Internet]. SAGE Publications, Ltd. http://methods.sagepub.com/book/the-multivariate-social-scientist.
  58. 58. Hokoishi K, Ikeda M, Maki N, Nomura M, Torikawa S, Fujimoto N, et al. Interrater reliability of the Physical Self-Maintenance Scale and the Instrumental Activities of Daily Living Scale in a variety of health professional representatives. Aging & Mental Health. 2001;5(1):38–40. pmid:11513011
  59. 59. Nye CD, Drasgow F. Assessing Goodness of Fit: Simple Rules of Thumb Simply Do Not Work. Organizational Research Methods. 2010;14(3):548–70.
  60. 60. Bentler PM, Bonett DG. Significance tests and goodness of fit in the analysis of covariance structures. Psychological Bulletin. 1980;88(3):588–606.
  61. 61. McIntosh CN. Rethinking fit assessment in structural equation modelling: A commentary and elaboration on Barrett (2007). Personality and Individual Differences. 2007;42(5):859–67.
  62. 62. Telecommunications Regulatory Commission of Sri Lanka. Statistics 2014 [cited 2018 12 January]. http://www.trc.gov.lk/2014-05-13-03-56-46/statistics.html.
  63. 63. Jackson DL, Gillaspy JA, Purc-Stephenson R. Reporting practices in confirmatory factor analysis: an overview and some recommendations. Psychol Methods. 2009;14(1):6–23. pmid:19271845
  64. 64. Schreiber JB, Nora A, Stage FK, Barlow EA, King J. Reporting Structural Equation Modeling and Confirmatory Factor Analysis Results: A Review. The Journal of Educational Research. 2006;99(6):323–38.