This study explored the dynamic interdependencies between poor mental health status and lifestyle factors. We used PVAR models that have lags of all endogenous variables and analysed the weekly interdependencies among variables of interest.
Procedures
We used a smartphone (iPhone, Apple inc.) app called Kurashi-app (“kurashi” means “life” in Japanese) and a wearable device (Silmee W20, TDK Co. Ltd., Japan) to collect lifelog data from 89 patients who had suffered from major depression, but were then in remission. We collected the activity diaries of the patients over a period of 1 year.
There are some advantages of having patients record their activity diaries via a smartphone. First, it is easier for patients to record their activity, regardless of time and place, compared to the conventional method of pen and paper. Secondly, utilizing the lifelogs, patients can record their activity diaries more easily. The Kurashi-app collects individual’s lifelogs via smartphone and estimates their activities based on them. It predicts 16 types of activities from lifelog data of location information, mobility information, and steps information and displays them on screen. The patients are expected to check them every day, and when the prediction is incorrect, they can rectify it. Consequently, the precision of the prediction improves. The recordings of the 16 activities on Kurashi-app can therefore be regarded as semi-automated self-reports. The system is expected to increase precision and to reduce burden on the part of the participants. When being presented with their estimated activities, users can be helped to record activity diaries even when they cannot distinctly remember certain activities from their daily lives [
13].
The Kurashi-app includes 16 kinds of activities: meeting friends or family, bath/shower, childcare/caregiving, commuting, domestic work, exercise, hospital, meal, shopping, sitting idly, sleep, study/work, hobby/entertainment/learning, TV/DVD/game/music, reading/newspaper/magazine, and other activities. These 16 activities were selected from the classification of the Basic Survey on Social Life (Shakai seikatsu kihon chosa) by the Statistics Bureau of the Ministry of Internal Affairs and Communications in Japan. Sitting idly is included because “time spending vaguely without doing anything in particular” is considered important with regard to poor mental health status. The Kurashi-app extracts clusters of activities where patients stay for more than 30 min at a time.
Simee W20 is a wrist watch type wearable device which can collect UV data automatically, in addition to location and mobility information.
We collected the activity diaries of each patient for 1 year. Outpatients were recruited at four university hospitals and their associated hospitals and clinics. Kyoto University was the central secretariat, and the 4 university hospitals were Kochi University, Hiroshima University, Nagoya City University, and Toho University. We recruited a hundred patients in total into the study between October 2016 and March 2017. Ten patients withdrew themselves from the study; while one patient did not meet the inclusion criteria and was therefore excluded. Inclusion criteria were as follows: (1) age between 22 and 69 years; (2) meet DSM-5 criteria for major depressive disorder, recurrent; (3) in remission as defined by the Beck Depression Inventory-II score of 9 or less [
14]; (4) with or without anxiety disorder or dysthymia; (5) able to use a mobile phone; (6) able and willing to participate in the study. We excluded patients with bipolar disorder, substance use disorder, psychosis, and personality disorder.
We used two screening and diagnostic tools to assess depression, K6 and the Patient Health Questionnaire-9 (PHQ-9). Using the Kurashi-app, patients completed the K6 by themselves once a week. However, such recordings are prone to lapses because they rely on self-reports on the smartphone. At the doctor’s consultation every 4 weeks, clinical study coordinators of Kyoto University assessed the patients through the PHQ-9 on the telephone. When the patients failed to visit the doctor, we contacted them by telephone to make the monthly assessments with PHQ-9 in order to minimise carelessness.
The K6 is a six-item screening instrument assessing psychological distress developed by Kessler and his colleagues [
15]. Respondents rated how frequently they had experienced the following six symptoms over the past 7 days: a) feeling nervous, b) feeling hopeless, c) feeling restless or fidgety, d) feeling depressed to the point that nothing could cheer you up, e) feeling everything was an effort, and f) feeling worthless. Respondents rated each item using a 5-point scale: 0 (“none of the time”), 1 (“a little of the time”), 2 (“some of the time”), 3 (“most of the time”), or 4 (“all of the time”). Responses to the six items were summed to yield a K6 score between 0 and 24, with higher scores indicating a greater tendency towards mental illness. Using the receiver operating characteristic curve, Prochaska et al. [
16] identified a K6 score ≥ 5 as the optimal cut-off point indicative of moderate mental distress. The coefficient of correlation between K6 and the Hamilton Depression Rating Scale was 0.516 at the 1% significant level [
17].
The PHQ-9 questionnaire asks respondents how frequently they have experienced the following nine symptoms over the past 2 weeks: a) having little interest or pleasure in doing things, b) feeling depressed or hopeless, c) having trouble staying asleep or sleeping too much, d) feeling tired, e) having poor appetite or overeating, f) feeling bad about oneself, g) having trouble concentrating on things, h) moving or speaking so slowly that other people could have noticed, i) having thoughts that you would be better off dead. Respondents rated each item using a 4-point scale: 0 (“not at all”), 1 (“several days”), 2 (“more than half the days”), or 3 (“nearly every day”). The PHQ-9 is commonly used to screen for depression with 10 as the cut-off score; a score of 10–14 indicates moderate depression, 15–19 moderately severe depression, and 20–27 severe depression. We have slightly modified the time frame for PHQ-9 in this study and asked the participants to rate their symptoms during the worst two weeks of the past month, in order to increase sensitivity to detect a depressed episode between the monthly assessments. The item responses on the PHQ-9 exhibited the same mathematical pattern as the other depression screening scales such as K6 and the Center for Epidemiological Studies Depression Scale [
18].
Daily chart of the 16 activities were visualised on the Kurashi-app, and all participants could check the data themselves at any time. The data from wearable device could be checked by connecting the device to their iPhone (this task was voluntary).
Since participants experienced some recurrence of depression, their motivations to know the sign of recurrence was high. We paid 5000 JPY (= about 47 USD) per month to the participants for 1 year. About half of the participants had their own iPhone and downloaded the app. To the remaining half, we lent our study iPhones, which were returned after the follow up period. We also lent wearable devices to all participants. The patients were expected to don the wearable device except when they bathed. We also accepted it when some patients preferred not to wear it while asleep. All data obtained from the app and wearable device were uploaded to the database server at Kyoto University. We monitored the adherence of the participants and reminded them when the adherence dropped during their monthly visits to the clinics/hospitals.
Collected data
We analysed the data of K6 score in order to observe weekly change of mood. Because K6 is a self-reported questionnaire on the smartphone, some participants forgot to enter their responses on the Kurashi-app weekly. When K6 data is missing, the lagged variables of the PVAR models do not show real-time differences. In order to avoid this problem, the researchers must supplement missing data. Assuming that data is missing at random (MAR), to supplement missing K6 data, we used the PHQ-9 score as an explanatory variable of the multiple regression equation. Daily missing data was not associated with the recurrence of depression, and we used its imputed series for weekly data series.
When dealing with MAR data, we can consider that the probability distribution of missing data is independent of that of non-observational data. We conducted Little’s CDM (covariate-dependent missingness) test [
19] as a special case of MAR. The CDM test gave a
p-value 0.102 and the hypothesis that the variables of interest are MCAR (missing completely at random) were not rejected at the 5% significance level. Therefore, the estimate is biased when one ignores the missing data [
20]. We corrected this bias by estimating the regression equation using auxiliary variables. A previous study showed that the use of many auxiliary variables may contribute to satisfy the premise of MAR [
21].
The regression equation approach may underestimate the standard deviation of the true value. However, Stata (ver. 15) cannot run PVAR models after multiple imputations. We thus estimated the multiple regression equation and imputed missing variables. The long length of activity diaries may increase the number of recurrence episodes over the sample period. Considering this, researchers must pay attention to heteroskedasticity problems. We used the number of weeks from entry in the study as the analytic weight of the regression equation. This variable is inversely proportional to the variance of observations. Lifelog data were aggregated by the week. Explanatory variables of the regression equation to impute K6 data were as follows: PHQ-9 score, mean of sleep hours during the past week, long sleep time, short sleep time, gender, educational attainment, occupational status, and marital status.
We selected the variables of the PVAR models as follows. The first candidate variables were enough time spent sleeping and exposure to UV light. These are good lifestyle factors recommended by Sarris et al. [
12]. The second candidate variables were selected using a two-sample t-test for difference of means. The homogeneity of variance was assumed. The two-sample t-test (the sample was divided between those with long sleep time/short sleep time and without) showed differences for five variables: meal, sitting idly, study/work, domestic work, and exercise. Finally, we calculated the correlation coefficient between K6 scores and these five activity variables, and selected two as explanatory variables of the PVAR models. The correlation coefficients of the two variables with K6 scores were 0.229 for sitting idly and 0.199 for the number of times lunch was not eaten.
The UV light exposure data were collected every minute by a wearable device Silmee W20. We defined a missing UV light value when collected data was below 80% of 1440 min (1152 min). Major reasons for missing out on UV light data were depleting battery life or restrained donning of the wearing device because of periodic irritation. Considering the differences in eating habits, we calculated a standardised variable of the number of times lunch was not eaten and used it as an explanatory variable of the PVAR models. We used standardised variables in order to accommodate patient heterogeneity that may account for large portion of total variances of key variables when using non-standardised variables. Since the standard deviation of the number of times lunch was not eaten was a large value of 2.46, we considered that the raw variable did not capture the differences in eating habits. Like the procedure for K6 scores, we imputed missing values of UV light and the standardised variable of the number of times lunch was not eaten.
PVAR model
As a key dependent variable of weekly K6, we considered a 5-variate PVAR of order p with panel-specific fixed effects represented by the following system of linear equations:
$$ {Y}_{it=}{Y}_{it-1}{A}_1+{Y}_{it-2}{A}_2+\cdots +{Y}_{it-p}{A}_p+{X}_{it}B+{v}_{it}+{e}_{it} $$
(1)
where
Yit is a (1 × 5) vector of dependent variables;
Xit is a (1 × q) vector of exogenous covariates;
vit and
eit are (1 × 5) vectors of dependent variable-specific fixed-effects and idiosyncratic errors, respectively. The (5 × 5) matrices
A1,
A2,…,
Ap and the (q × 5) matrix
B are parameters to be estimated.
With the presence of lagged dependent variables in the right-hand side of the system of equations, estimates would be biased even with a large
N [
22]. Fixed-effects estimation tends to underrate the predictions of the coefficient of the lagged dependent variables. Taking these problems into consideration, following the procedure of Michael-Abrigo and Love [
23], we used unbalanced panel data and estimated PVAR models by fitting a multivariate panel regression of each dependent variable on lags of itself and on exogenous variables. The estimation was done using the generalised method of moments (GMM). Because the presence of a unit root will invalidate the GMM specification, the estimates of the PVAR model must satisfy the stability condition. If all the eigenvalues lie inside the unit circle, the stability condition of the PVAR model is satisfied and the PVAR model is invertible.
We specified the PVAR model as follows. First, using the overall coefficient of determination (CD), we specified the maximum lag order to be included in the PVAR model. As explained below in section 3.1, the PVAR model consisted of five dependent variables as follows: the natural logarithm of {(K6 + 1)/square root of the number of episodes}, dummy variable of long sleep time or short sleep time, standardised variable of the number of times lunch was not eaten, natural logarithm of standardised variable of UV light, and dummy variable of sitting idly. Because the K6 repeated with relatively high frequency may cause respondent’s “learning curve” reaction, we used the square root of the number of previous episodes to control for each patient’s past experiences of depression. All the patients suffered from recurrent depression and those with increased numbers of previous episodes tended to report, on average, higher K6 scores. In order to balance the sensitivity of K6 scores across the subjects with variable number of previous episodes, we divided their natural logarithm of (K6 + 1) scores by the square root of their number of previous episodes.
Second, we confirmed the stability condition of the estimated PVAR model. Third, based on the value of CD shown in Table
1, we decided that the maximum lag order was 4. The CD captures the proportion of variation explained by the PVAR model as follows:
Table 1Maximum lag order of PVAR
Lag | CD |
1 | 0.98562 |
2 | 0.98827 |
3 | 0.98933 |
4 | 0.98972 |
5 | 0.98762 |
6 | 0.96484 |
CD = 1–(determinant of covariance matrix of idiosyncratic errors/determinant of unconstrained covariance matrix of the dependent variables).