Introduction

Regular daily physical activity is associated with a reduced risk of cancer development and a reduction in mortality after the diagnosis of cancer [1,2,3]. Additionally, physical activity and exercise interventions have significant beneficial effects on the level of fatigue and quality of life in patients with cancer [4,5,6]. Previous studies have also shown that the magnitude of these beneficial effects is greater in patients with worse baseline values of physical activity [6]. Correct estimation of the levels of physical activity in patients with cancer is of utmost importance to estimate its effect on various health outcomes and to estimate the effectiveness of intervention programs. Furthermore, it might be important to identify physically inactive patients correctly in order to offer these patients an exercise intervention program. Although various measurement methods are available to estimate levels of physical activity of patients with cancer, there is no gold standard for measuring physical activity in daily life on a large scale [7, 8].

Self-reported questionnaires are a frequently used measurement method to assess physical activity, both in clinical care and in a research setting [9, 10]. The “Physical Activity Scale for the Elderly” (PASE) questionnaire is a short (5 to 10 min) 13-item questionnaire, including questions on the frequency and duration of various leisure time, household, and work-related activities[11], and was developed and validated to assess physical activity in people over the age of 65 years [12]. The PASE questionnaire has previously been used in younger patients with cancer [13, 14]. Patients with cancer are often debilitated and therefore have lower levels of physical activity compared with age-matched healthy individuals [15]. Consequently, their physical activity levels may be more comparable to the elderly. It has been used in several research projects in elderly people [13, 16,17,18,19] and in patients with various types of cancer [9, 20]. Self-reported questionnaires involve minimal time investment, costs, and participant burden, which favors their use in epidemiological studies and large-scale clinical trials [20, 21]. A disadvantage of self-reported questionnaires is that they are prone to recall bias, response shift, and social desirability bias [22, 23] and are therefore likely to over- or underreport physical activity levels [24]. In contrast, accelerometers provide an objective assessment of daily physical activity based on raw accelerations [25]. Unfortunately, accelerometer assessments are expensive and labour intensive, because they have to be initialized before and read out after use, followed by data processing and analysis [26].

In order to be able to make a balanced decision on the choice of using either one or both instruments, it is important to investigate whether the PASE questionnaire and accelerometers provide similar or different insights into the level of physical activity of patients with cancer under free-living conditions. Two previous studies in relatively small samples of patients with cancer showed poor to fair agreement between the PASE questionnaire and accelerometer-assessed physical activity [20, 27]. The agreement between both measurements might differ significantly across different target populations [8], including cancer type, age [10, 28], gender [10], BMI [29], smoking status [28], and employment status [28] of the participants. Furthermore, for epidemiological purposes, it might be important to accurately distinguish a group of physically active from a group of physically inactive patients [30].

The aim of this study was to investigate the agreement between the level of physical activity assessed by the PASE questionnaire and the accelerometer in a large group of patients with different types of cancer. In addition, differences in agreement across various patient characteristics were examined, as well as the agreement between the instruments to distinguish the most and the least physically active patients.

Method

Study design and population

This study used baseline data from three studies in which both PASE and accelerometer data were collected: the Resistance and Endurance exercise After ChemoTherapy (REACT) study [31], the Exercise Intervention after Stem cell Transplantation (EXIST) study [32], and the NETherlands QUality of life and Biomedical cohort studies In head and neck Cancer (NET-QUBIC) study [33]. The REACT study evaluated the effects of a 12-week high-intensity and low-to-moderate intensity supervised resistance and endurance interval exercise intervention on physical fitness, fatigue, and health-related quality of life compared with a waiting list control group in 277 adult patients with cancer (i.e., breast, colon, ovarian, testicular, cervical cancer, and lymphoma) [31]. The EXIST study investigated the effects of an 18-week supervised high-intensity resistance and interval exercise intervention compared with usual care on the same outcomes as the REACT study in 109 patients with multiple myeloma or lymphoma recently treated with high-dose chemotherapy and autologous stem cell transplantation [32]. Patients for both the REACT- and EXIST study were recruited between 2011 and 2014. In the REACT study, baseline assessment took place 4–6 weeks after completion of cancer treatment including ((neo-)adjuvant) chemotherapy and in the EXIST study 6–14 weeks after autologous stem cell transplantation [31, 32]. The NET-QUBIC study is a longitudinal observational cohort study which aims to describe the long-term course of quality of life in 739 newly diagnosed patients with head and neck cancer (HNC) and their informal caregivers and to identify cancer-related, personal, biological, psycho-behavioral, physical, and lifestyle-related and social determinants of quality of life. Baseline assessments took place shortly after the diagnosis of head and neck cancer and before the start of treatment. The total NET-QUBIC assessment protocol involved three components: (1) patient-reported outcome measures; (2) home visit with interviews and tests (including physical fitness), during this home visit, patients were provided with materials to collect data of physical activity (accelerometer) and saliva samples; (3) collection of blood and oral rinse samples. Due to logistic reasons, not all components could always be performed (e.g., short time between diagnosis and start of treatment). Also, patients were allowed not to complete all three components, if this was too much burden. The first data release included baseline data of the first 254 patients who were included between February 2014 and June 2016. All studies were approved by the Medical Ethics Committee of the Amsterdam UMC (VU University Medical Center or Academic Medical Center) and the local ethical boards of all participating hospitals. All patients provided written informed consent prior to participating in the respective studies.

Outcome measurements

The PASE questionnaire

The PASE questionnaire assesses the duration and frequency of physical activities that have been undertaken in the past 7 days [34]. The PASE questionnaire contains 13 questions on leisure, household, and paid or unpaid work-related physical activities. First, patients are asked to estimate the frequency of a particular type of physical activity per week, for which they can choose between never, seldom (1–2 days per week), sometimes (3–4 days per week), or often (5–7 days per week). Second, patients are asked to estimate the duration of that particular type of activity, for which patients could choose between < 1 h, between 1 and 2 h, between 2 and 4 h, and > 4 h for the leisure and household activities. For the work activity, different categories for duration are used: < 1 h, between 1 and 4 h, 5 and 8 h, and > 8 h. For each different types of physical activity, there are published weightings available for the respective PASE scores, based on estimated metabolic equivalent of task (METs) and accelerometer-assessed physical activity [11]. Each physical activity–specific weighting is multiplied by a factor based on the frequency and the duration of this activity, to calculate the activity-specific subscore [35]. Finally, the total score of the PASE questionnaire was calculated by totaling the subscores of all activities [35]. Only fully completed PASE questionnaires were used for this analysis.

The accelerometer

Patients were instructed to wear an accelerometer (ActiGraph wGT3X, ActiGraph LLC, Pensacola, FL, USA) on the hip for seven (REACT and NET-QUBIC study) or five (EXIST study) consecutive days to measure physical activity. This accelerometer measures raw accelerations (i.e., epochs; the rate of change of the velocity) in three axes [7]. Vertical accelerations were converted into counts per minute, with several data reduction steps [36]. Non-wear time was defined as ≥ 60 consecutive minutes of consecutive zeros and a valid day was defined as ≥ 10 h/day of wear time [37, 38]. To be included in the analyses, a patient needed to have at least five valid days, including one weekend day. Total time spent in physical activity was defined as the total of all the time periods with ≥ 100 counts per minute [39] and was calculated for each valid wear day. The sum of all physically active minutes during all valid wear days was divided by the number of valid wear days, in order to calculate the mean number of minutes of physical activity per day. Furthermore, the accelerometer-assessed physical activity was also expressed in counts per minute, because in this way, the intensity of physical activity is also taken into account. Counts per minute were calculated by summing up the total counts and divided by the total wear time during all valid measurement days.

Potential effect modifiers

Age, gender, and cancer type were retrieved from the patients’ medical records. Body weight and height were measured and body mass index (BMI) was calculated based on these measurements (body weight/height2, kg/m2). The overall comorbidity score in the NET-QUBIC study was rated as none, mild, moderate, or severe with the use of the Adult Comorbidity Evaluation-27 (ACE-27) [40]. The comorbidity level of participants in the EXIST study was assessed by a sports physician and by a custom-made patient-reported questionnaire and was retrieved from the patients’ medical records. In the REACT study, the number of comorbidities was retrieved from the medical records and calculated as the sum of each of the following conditions: heart disease, lung disease, diseases of the digestive system, diseases of the nervous system, endocrine disease, mental disorder, rheumatism or arthritis, chronic pain, and other conditions. Subsequently, for all the studies, the number of comorbidities was dichotomized into any versus none. The highest level of education, alcohol consumption, and smoking behavior were assessed through study-specific questions. The highest level of education was dichotomized into low/moderate (primary vocational education to senior general secondary education) versus high education level (higher general secondary education, higher professional education, and university). For smoking behavior and alcohol consumption, all patients who smoked or consumed alcohol at the time of the study were defined as smokers and consumers of alcohol, respectively. Information about alcohol consumption was only available in the NET-QUBIC study.

Statistical analysis

Descriptive statistics (mean, standard deviation (SD), or numbers and percentages) were generated for demographic, clinical, lifestyle-related factors, total PASE score, and accelerometer-assessed physical activity. The agreement between the PASE questionnaire (total score) and accelerometers (minutes of physical activity per day) was analyzed with a linear mixed models regression analysis, with the total score of the PASE questionnaire as independent variable and accelerometer-assessed physical activity as dependent variable. A random intercept on study level was added to take clustering of patients from different studies into account. Standardized regression coefficients (B) and the corresponding 95% confidence intervals (CI) were reported. To check for the differences in agreement across different patient characteristics (age, gender, BMI, education, comorbidity, alcohol, and smoking), we added these variables and their interaction terms with the PASE score into the regression model separately for each characteristic. Significant effect modification was defined as a p value of < 0.05 of the interaction term in the multilevel mixed models regression analysis [41]. If significant effect modification was found, stratified analyses were performed. A standardized regression coefficient of 0.50 or higher was considered as fair agreement and a coefficient of 0.70 or higher was considered as good agreement [42]. A scatter plot was used to visualize the agreement between the PASE questionnaire and accelerometer-assessed physical activity. We performed a sensitivity analysis in which accelerometer-based physical activity was expressed in counts per minute.

To investigate the agreement between both measurement methods for the identification of the most and the least physically active patients, the highest active quartile of patients based on the accelerometer was compared with the highest active quartile of patients based on the PASE questionnaire. Crosstabs were generated to present the proportion of patients that were both in the highest quartile of the PASE score as well as in the highest quartile of the accelerometer-assessed physical activity. This process was repeated for the lowest quartiles of physical activity. All statistical analyses were performed with SPSS version 22 (IBM Corporation, Armonk, NY, USA).

Results

In total, 640 patients participated in the REACT (n = 277), EXIST (n = 109), and NET-QUBIC studies (n = 254). In 38 patients in the NET-QUBIC study, a home visit was not performed and thus had no measurements of physical activity and another 53 patients did not have enough time between diagnosis and start of treatment to measure sufficient valid wear days. Other reasons for missing data for the total group of patients were insufficient valid accelerometer wear days by participants (n = 122), technical problems with the accelerometer (n = 11), and incomplete or missing PASE questionnaire (n = 37) (Fig. 1). In total, 403 patients were included in the current analyses because they had both valid accelerometer data and a complete PASE questionnaire (Fig. 1). The mean (SD) age of these 403 patients was 56 (11) years, 58% of the patients were women, and breast cancer was the most common cancer type (36%), followed by head and neck cancer (25%) and lymphoma (13%) (Table 1). The mean (SD) score of the PASE questionnaire was 95.9 (75.1) and the mean (SD) accelerometer-assessed physical activity was 256.6 (78.8) minutes of physical activity per day (Table 1).

Fig. 1
figure 1

Flowchart of patients with valid measurements

Table 1 Baseline characteristics, PASE-score, and accelerometer-assessed physical activity

The agreement between the PASE questionnaire and accelerometer-assessed physical activity expressed in min/day was significant but poor (B = 0.36, 95%CI = 0.27; 0.44, p < 0.01) (Table 2 and Fig. 2). The agreement between the PASE score and accelerometer output expressed in counts per minute was also poor (B = 0.26, 95%CI = 0.17; 0.35, p < 0.01). No significant differences in agreement were found between the PASE questionnaire and the accelerometer-assessed physical activity across subgroups of patients (Table 2).

Table 2 Agreement between PASE questionnaire and accelerometer-assessed physical activity and potential effect modifiers
Fig. 2
figure 2

Scatter plot between the total score of the PASE questionnaire and accelerometer-assessed physical activity (min/day)

Of the 101 patients in the lowest quartile of physical activity based on accelerometers, 51 were also in the lowest quartile based on the PASE score, indicating an agreement of 50% (Table 3). Forty-four of the 100 patients in the highest quartile of physical activity based on accelerometers were also in the highest quartile based on the PASE-score, indicating an agreement of 44% (Table 3).

Table 3 Distribution of patients across lowest and highest quartiles of PASE questionnaire and accelerometer-assessed physical activity

Discussion

This study investigated the agreement between the PASE questionnaire and accelerometer-assessed physical activity in a large group of patients with cancer. Results showed that the agreement was poor in all subgroups of patients. The results are in line with the poor agreement reported in an earlier study in 48 patients with different types of cancer [27] and the poor-to-fair agreement in patients with lung cancer [20]. Furthermore, in a recent review on the agreement between accelerometer-assessed physical activity and various physical activity questionnaires among healthy persons and patients with various chronic diseases, only 11 of the 57 included studies reported an agreement (Pearson correlation coefficient) between questionnaires and accelerometers of ≥ 0.50, and for most studies only in specific subgroups [10].

The poor agreement between the PASE questionnaire and accelerometer-assessed physical activity might reflect the complexity of obtaining an integral estimation of all aspects of a highly varying behavior such as physical activity [43]. Low-intensity physical activities, which are more typical for patients with cancer compared with those for the general population [44], might be more often overestimated with self-report [43]. The poor agreement between the PASE score and accelerometer-assessed physical activity might also be caused by social desirability and recall bias in the PASE questionnaire [27]. On the other hand, accelerometers also have disadvantages, which might have influenced the agreement with the PASE questionnaire [7]. For example, the use of hip-placed accelerometers may have underestimated upper body movements and activities such as cycling, swimming, and resistance exercises [27]. In addition to the poor agreement for the absolute levels of physical activity, results also showed poor agreement in distinguishing physically active from physically inactive patients. This finding implies that both measurement methods do not measure the same construct and that they cannot be used interchangeably.

A strength of this study is the sample size of more than 400 patients with various types of cancer and treatment regimens. A limitation of the study might be that most of the patients participated in exercise intervention studies or were recently diagnosed with cancer and were all treated with curative intent, which may have resulted in an under- or overestimation of the levels of physical activity. However, considering the variation in physical activity levels, it is unlikely that this has affected the agreement between the PASE questionnaire and accelerometer-assessed physical activity. Another limitation may be that we used the PASE questionnaire that is originally developed for elderly above 65 years in a group of patients with cancer with a mean age of 56 years. However, we found no significant differences in agreement between patients below and above 65 years of age.

The low agreement between the PASE questionnaire and accelerometer output might indicate that these instruments measure different aspects of the construct physical activity. Therefore, it may be recommended to use both measures in research and in clinical practice, as they may produce complementary information. Physical activity questionnaires might especially be suitable to investigate the mode, type, and frequency of moderate-to-vigorous activities that patients perform, but are generally less accurate to estimate light-intensity physical activities [7]. On the contrary, accelerometers provide an objective estimate of the duration and intensity of physical activity, have no risk of social desirability, response shift and recall bias, but provide limited information about the type of physical activities performed [7]. For optimal use in clinical practice, it is important to reduce the labor intensity and costs of accelerometers or to search for alternatives for objective physical activity assessments [45]. Widely available smartphones, pedometers, or fitness trackers may provide such alternatives, and have shown to be feasible, and produce valid and reliable step counts in patients with cancer, when compared with accelerometers [46, 47].

In conclusion, the results of this study showed a poor agreement between the PASE questionnaire and accelerometer-assessed physical activity in patients with cancer. This indicates that they can better be used simultaneously than interchangeably to assess daily physical activity in patients with cancer, both for research and clinical purposes.