Published on in Vol 3, No 2 (2015): Apr-Jun

Automated Personalized Feedback for Physical Activity and Dietary Behavior Change With Mobile Phones: A Randomized Controlled Trial on Adults

Automated Personalized Feedback for Physical Activity and Dietary Behavior Change With Mobile Phones: A Randomized Controlled Trial on Adults

Automated Personalized Feedback for Physical Activity and Dietary Behavior Change With Mobile Phones: A Randomized Controlled Trial on Adults

Original Paper

1Cornell University, Department of Information Science, Ithaca, NY, United States

2Northwestern University, Department of Preventive Medicine, Chicago, IL, United States

3Michigan State University, Department of Electrical and Computer Engineering, East Lansing, MI, United States

Corresponding Author:

Mashfiqui Rabbi, MS

Cornell University

Department of Information Science

219 Gates Hall

Ithaca, NY, 14850

United States

Phone: 1 6036671797

Fax:1 6072555196

Email: ms2749@cornell.edu


Background: A dramatic rise in health-tracking apps for mobile phones has occurred recently. Rich user interfaces make manual logging of users’ behaviors easier and more pleasant, and sensors make tracking effortless. To date, however, feedback technologies have been limited to providing overall statistics, attractive visualization of tracked data, or simple tailoring based on age, gender, and overall calorie or activity information. There are a lack of systems that can perform automated translation of behavioral data into specific actionable suggestions that promote healthier lifestyle without any human involvement.

Objective: MyBehavior, a mobile phone app, was designed to process tracked physical activity and eating behavior data in order to provide personalized, actionable, low-effort suggestions that are contextualized to the user’s environment and previous behavior. This study investigated the technical feasibility of implementing an automated feedback system, the impact of the suggestions on user physical activity and eating behavior, and user perceptions of the automatically generated suggestions.

Methods: MyBehavior was designed to (1) use a combination of automatic and manual logging to track physical activity (eg, walking, running, gym), user location, and food, (2) automatically analyze activity and food logs to identify frequent and nonfrequent behaviors, and (3) use a standard machine-learning, decision-making algorithm, called multi-armed bandit (MAB), to generate personalized suggestions that ask users to either continue, avoid, or make small changes to existing behaviors to help users reach behavioral goals. We enrolled 17 participants, all motivated to self-monitor and improve their fitness, in a pilot study of MyBehavior. In a randomized two-group trial, investigators randomly assigned participants to receive either MyBehavior’s personalized suggestions (n=9) or nonpersonalized suggestions (n=8), created by professionals, from a mobile phone app over 3 weeks. Daily activity level and dietary intake was monitored from logged data. At the end of the study, an in-person survey was conducted that asked users to subjectively rate their intention to follow MyBehavior suggestions.

Results: In qualitative daily diary, interview, and survey data, users reported MyBehavior suggestions to be highly actionable and stated that they intended to follow the suggestions. MyBehavior users walked significantly more than the control group over the 3 weeks of the study (P=.05). Although some MyBehavior users chose lower-calorie foods, the between-group difference was not significant (P=.15). In a poststudy survey, users rated MyBehavior’s personalized suggestions more positively than the nonpersonalized, generic suggestions created by professionals (P<.001).

Conclusions: MyBehavior is a simple-to-use mobile phone app with preliminary evidence of efficacy. To the best of our knowledge, MyBehavior represents the first attempt to create personalized, contextualized, actionable suggestions automatically from self-tracked information (ie, manual food logging and automatic tracking of activity). Lessons learned about the difficulty of manual logging and usability concerns, as well as future directions, are discussed.

Trial Registration: ClinicalTrials.gov NCT02359981; https://clinicaltrials.gov/ct2/show/NCT02359981 (Archived by WebCite at http://www.webcitation.org/6YCeoN8nv).

JMIR mHealth uHealth 2015;3(2):e42

doi:10.2196/mhealth.4160

Keywords



In 2010, the World Health Organization (WHO) attributed 63% of deaths to noncommunicable diseases that are largely preventable [1]. The Centers for Disease Control and Prevention (CDC) estimates that in the US nearly 200,000 deaths annually could be prevented based upon modifications in diet, exercise, and obesity [2]. Obesity alone affects more than one-third of the adult population [3] and burdens the US with an estimated US $190 billion annually in health care costs [4].

A rapid rise has occurred in the development of mobile phone apps and wearable devices to address diet and physical activity. While empirical data is lacking for some commercial apps and sensor-based technologies [5,6], a number of scientific studies have explored the impact of novel technology-supported behavior change strategies on physical activity [7-9]. For example, Weegen et al [10] applied behavior change theories to design a mobile app that visualized a summary of physical activity logs and gave clinicians feedback to support their promotion of physical activity. Food logging has proved to be more difficult, burdensome, and time consuming than tracking physical activity. Recent work, however, has attempted to use image-based systems to decrease burden and enhance accuracy in food tracking with some success [11-13].  The ubiquity and ever-presence of mobile phones gives them the potential to perform assessment and intervention in the right place at the right time.

Although these methods show promise, they continue to fall short by not providing context-specific, relevant, personalized help at the moment when the individual needs it to make healthier choices. The science of how to present daily physical activity and dietary intake data back to users also has been at a suboptimal state.  To date, feedback has been limited to one of three categories: (1) overall numeric summaries [7,8,14] (eg, step counts), (2) tailored suggestions that only adapt to personal characteristics (eg, age, gender) and overall behavior (eg, daily calories consumed and burned) [15], and (3) visualizations that incorporate little processing [16].  Simple goals are offered, but without actionable insights on when, where, and how to achieve them. Visualization of large amounts of minimally processed data produces a related problem—information overload without clear steps to behavior change.  Providing personalized, in-the-moment, actionable guidance that prompts smaller, but more frequent, changes in existing behavior has potential for greater impact. A deeper look into physical activity and dietary intake data can reveal patterns of both healthy and unhealthy behavior that could be leveraged for personalized feedback. With current technologies, this can be achieved automatically, without human interpretation.

Given these observations, MyBehavior was created to address some shortcomings of current mHealth interventions. MyBehavior uses a machine-learning model—multi-armed bandit (MAB)—to automatically create contextualized and personalized suggestions based on the individual’s physical activity and dietary intake data collected solely from a mobile phone. Moreover, MyBehavior is one of the very few mHealth apps designed on the basis of established behavioral theory.  As such, the system reflects and incorporates the contemporary state of the behavioral science knowledge about how to foster healthful change.  Based on effective behavior change principles, MyBehavior provides low-effort suggestions that request small changes to users’ existing repeated behaviors.  To the best of our knowledge, MyBehavior is the first mHealth app that encourages healthy behavior change by automatically providing low-effort suggestions based on the user’s context and personal information.  

The objective of this study was to evaluate a new behavior change technology—MyBehavior—using a mixed-method approach as suggested by others [17]. We focused on (1) whether the users intended to follow the automated MyBehavior suggestions, (2) early indications of behavior change empowered by automated suggestions, and (3) participant feedback that could inform user experience and guide future design of automated health feedback systems.

The outcomes from this study will be used to further refine the features and messages of MyBehavior to optimize its effect on physical activity and dietary intake.


Study Procedure

To evaluate the feasibility of MyBehavior, a small 3-week, two-group randomized control trial (RCT) was conducted. The team that supervised the trial included the builders of the MyBehavior app and authors of this paper. This team recruited participants through advertisements placed around the Cornell University campus. In the advertisement, we invited participants to test a new mobile app to help them stay on track for physical activity and food intake. Recruitment was restricted to participants who owned an Android mobile phone and had an interest in fitness. Prior to the study, the investigators arranged face-to-face meetings with the participants and acquired their informed consent. Participants also completed a brief survey to provide demographic data and information about their prior experience with mobile technologies and weight loss/fitness apps. All participants attended a training session, where they installed MyBehavior on their primary mobile phone and received basic instructions, including how to enter their gender, height, and weight and how to set up a weekly weight goal (ie, lose weight, maintain weight, or gain weight). During the first week, users received a daily summary of their activities and food intake. This baseline week was intended to resemble many modern mobile health apps [5,6] without suggestions on what behaviors to change.

After the first week, the experimenters conducted an in-depth, semistructured interview with participants about their experience to date and then randomized participants into control and experimental groups. A random number generator was used for randomization. Assignment was single blind, as the study participants did not know their condition, while experimenters had full knowledge about the assignments.

We provided MyBehavior’s personalized context-sensitive suggestions to the experimental group, while the control group received generic prescriptive recommendations generated from a pool of 42 suggestions for healthy living, such as “walk for 30 minutes” and “eat fish for dinner.” A certified fitness professional created these generic suggestions after following National Institutes of Health resources [18,19].  An external nutrition counselor also reviewed the suggestions to ensure that they were both healthy and achievable. The list of these 42 suggestions is included as Multimedia Appendix 1 in this paper. For the following 2 weeks, participants continued to log behaviors and receive their respective suggestions on their mobile phones. During the entire study period, we asked participants to complete Web-based daily diaries to better understand their experience in following the suggestions provided. At the conclusion of the 3-week period, all participants were asked to complete a brief survey about the suggestions provided and were interviewed again face-to-face about their experience with the app.

This study was approved by Cornell University Institutional Review Board (1302003617) and a protocol was registered retrospectively at ClinicalTrails.gov (NCT02359981).

Participants

We recruited 18 participants, 17 of whom completed the study. Of the 17 participants, there were 13 students (76%), 4 professionals (24%), 8 females (47%), and 9 males (53%), and all were between the ages of 18 and 49 (mean 28.3, SD 6.96, lower quartile [q25]=22, median [q50]=26.3, upper quartile [q75]=36). All participants reported low-to-moderate levels of physical activity.  The majority of participants were experienced mobile phone users—9 participants (53%) had previous experience using a food diary, and 6 participants (35%) had previously kept an exercise log. After the randomization, participants in the groups were similar in terms of level of active lifestyle and experience with using mobile-based self-management tools. Our sample size was determined based on earlier literature [17,20,21] that suggested that small studies (n≥4) are more suitable to test early feasibility of novel behavior change technologies like MyBehavior. See Figure 1 for the flow of participants in the trial.

Figure 1. Flow of participants in the MyBehavior trial.
View this figure

MyBehavior Mobile App

Overview

MyBehavior is comprised of five key modules: (1) physical activity tracking, (2) food logging, (3) life-log generation, (4) physical activity and food clustering, and (5) suggestion generation.

Physical Activity Tracking

MyBehavior uses the accelerometer and the Global Positioning System (GPS) sensor inside the mobile phone to continuously keep track of an individual’s physical activities. A number of statistical features (eg, mean, variance, zero-crossing rate) are extracted from the sensor data and a machine-learning model—Gaussian Mixture Model (GMM) [22]—is applied to map the extracted feature values into the four most common daily physical activities—walking, running, stationary (sitting or standing), and driving. The technical details of this mobile sensing framework can be found in our previous work [23]. For physical activities that cannot be automatically recognized, MyBehavior provides users with a list of about 800 different physical activities from the compendium of physical activities [24]. Users can manually select the specific physical activity from the list and record the start and end time for the activity. In addition to tracking physical activities, MyBehavior calculates the calories expended during these activities based on the standard Metabolic Equivalents of Task (METS) [25].

Food Logging

Users select food items from a database and enter the consumed quantity to get the corresponding calorie intake. The United States Department of Agriculture (USDA) [26] maintains this database, containing more than 8000 types of food.

MyBehavior provides several features to make the food logging experience easier for users. First, users can take photos of their food. These photos serve as a memory aid when users are prompted to input their food information at 9:30pm every night (see Figure 2). Second, to facilitate entry of frequently repeated food choices, MyBehavior allows users to input food which reuses a prior meal (eg, add yesterday’s breakfast) and prioritizes food items that were selected previously. Finally, MyBehavior provides users with an option to directly input calorie information taken from the label of prepackaged foods (eg, a soft drink can or yogurt cup).

Figure 2. MyBehavior app screenshots: (a) taking photo of a food item, (b) searching for foods from the USDA database, and (c) Life Log, a chronological list of activity and food log events.
View this figure
Life-Log Generation

MyBehavior generates as a “life log,” a chronological list of activity and food log events, as shown in Figure 2.  The log includes food, automatically sensed physical activity, and manually logged exercise entries as life events. To create concise and meaningful activity entries, MyBehavior processes the data into two stages.

In the first stage, activity predictions, which happen every 1 second, are aggregated every minute and labeled automatically. In the second stage, the contiguous activities having the same label are combined into a single entry. For example, if a user is stationary for 50 minutes, MyBehavior will generate a “stationary” activity entry into the life log with a 50-minute duration. Other common life events include a sequence of different activities that happen within a short time interval (ie, 15 minutes). An example might be the following:  walk to the bus stop, wait for a few minutes, ride the bus, and walk to the office after exiting the bus.  In this example, MyBehavior generates a “mixed” activity entry in the life log (eg, taking the bus from home to work) by combining the multiple activity sequences that happen within a 15-minute window.  

Physical Activity and Food Clustering

To enable suggestion generation delivery to the experimental group only, MyBehavior used the life logs to cluster similar physical activities and similar food items.  The food similarity matching process follows a simple logic—food is clustered based on similar food ingredients. For example, MyBehavior will detect if a user is repeatedly having high-calorie burgers with similar ingredients and form the cluster “burger” that groups together the same or similar types of burgers.  

Regarding clustering physical activities, manually tracked activities are clustered based on the type of activities similar to food clustering. Automatically tracked activities, tagged with location information, are clustered by places they occur. Clusters are found using unsupervised machine-learning techniques to identify similarity.  As indoor localization is often accurate up to 150 meters, any stationary activities that fall within 150 meters of each other are clustered together.  For example, a user’s stationary activities in the office are typically in close proximity to each other and, as such, MyBehavior clusters these office locations into a single unit that represents the user’s stationary behavior in the office.  Walking and running activities are more difficult to cluster because MyBehavior needs to determine whether two activity trajectories look similar and happen at a similar location.  To group similar walking or running events, MyBehavior uses an algorithm derived from the literature on handwriting recognition [27]. In handwriting recognition, the task is to find a canonical letter that matches the shape or trajectory of a handwritten letter.  The analogous task in MyBehavior is to find whether a new walking trajectory (eg, office to coffee shop) matches previous walking trajectories. Figure 3 shows some clusters generated by this technique.  The image on the left represents a user’s stationary episode in the office and home, whereas the middle and right-hand images show two walking clusters generated by two different users. The middle image represents a user’s walks near the office, while the cluster in the right-hand image represents another user’s daily walks from home to a bus stand.  

Figure 3. Clusters generated from user activities: (a) locations where user A stayed stationary, (b) location traces for user B where he walked around his office, and (c) walking traces of user A from his house to the bus stop.
View this figure
Suggestion Generation

After clustering user behaviors, MyBehavior uses an exploit-explore strategy to automatically generate suggestions based on users’ past physical activities and food intake. This suggestions-generating strategy is grounded in contemporary behavioral science theories:  (1) learning theory [28], (2) social cognitive theory [29], and (3) the Fogg Behavior Model (FBM) [30].  Behavior analysis applies learning theory first to assess whether a person has the skills needed to perform a behavior [28]. If so, the next step is to increase or decrease the target behavior’s frequency by harnessing its antecedents (ie, its setting and cues) and consequences (ie, reinforcement). For example, if a health suggestion asks a user to swim but the user can’t swim (ie, he never acquired the skills), the user will not follow the suggestion. On the other hand, if a person has performed a behavior before, even if rarely, the skills can be assumed present. The Fogg Behavior Model applies theoretical principles to technology design by creating tools to prompt low-effort actions that can be triggered even when motivation is low [30].   Thus, MyBehavior suggests (ie, cues or triggers) a frequent behavior (eg, a particular walk) that the person often does in a particular life context. This small, low-effort change simply increases the frequency of a behavior that the person already does. Sometimes instead, MyBehavior suggests an infrequent behavior (eg, bike ride) that would burn more calories and that the person has shown he/she can do, but does only rarely. Social cognitive theory [29], the most widely used behavioral theory, suggests that in order to voluntarily initiate an action, a person needs a sense of self-efficacy or confidence that he/she will be able to perform it.  The more frequently the person can be triggered to ride a bike repeatedly in a certain context where bikes are accessible, the more self-efficacy increases, the less effortful the behavior becomes, and the more likely that bike riding becomes a habit.

MyBehavior exploits the frequency principle by suggesting activities that users perform repeatedly. In addition, the algorithm favors actions that are not only frequent, but also result in higher calorie expenditure. For example, short 1-minute walks inside the office, though very frequent, are likely to be superseded in the suggestion-generation engine by a less frequent, but higher-calorie-burning, gym class. On the other hand, if the person rarely visits the gym but walks 30 minutes to work several times a week, the recommender engine will rank the walk higher than the gym since the aggregate calorie loss—frequency x calories burned each instance—is higher.

For stationary activities, the recommender engine suggests small changes, such as walking 3 minutes for every hour spent stationary. The right-most image in Figure 4 shows a prioritization order of MyBehavior suggestions where simply adding 3-minute walks to the user’s hour-long stationary episodes burns more calories compared to rarely occurring gym visits.

Exploit suggestions generated solely from users’ frequent past behavior may not generate sufficient energy expenditure to cause weight loss. Consequently, MyBehavior periodically suggests higher-calorie-burning activities to entice the user to try out and adopt.  Explore suggestions target infrequent, high-calorie-burning behaviors that the user can turn into a more regular activity.  Future behavior is only imperfectly predicted by past behavior, and it could be the case that users will increase infrequent activities if suggested. Hence, if a user walks regularly near her office but sometimes goes to the gym or takes a long walk home, MyBehavior exploits this knowledge by suggesting walking near the office most of the time and by sometimes suggesting a gym visit or a long walk. If the new suggestion sticks and the user starts going to the gym regularly as a result, then MyBehavior learns to target the gym as an exploit suggestion rather than an explore suggestion.

When generating food suggestions, a separate set of suggestions is created based on the exploit-explore strategy. First, MyBehavior distinguishes between meals and snacks. Then it takes into account both intake frequency and calories similar to the physical activity suggestions. Thus, a user’s frequent healthy low-calorie meals are exploited and are encouraged to be continued. During exploration, a random selection of infrequent low-calorie meals/snacks from the past is suggested. Here, the expectation is that users will take up some of these infrequent meals and make them frequent in the future.  

At the start of every day, MyBehavior generates 10 food and 10 activity suggestions.  Of these, 90% are from the users’ most frequent activities (ie, exploit) and 10% are from the users’ infrequent behaviors (ie, explore). This split of 90% exploit and 10% explore was heuristically chosen based on previous literature [31]. This kind of exploit-explore strategy, well grounded in artificial intelligence research, falls under a wider decision-making framework called multi-armed bandit [31]. MAB models have been well studied for modeling dynamic systems where situations can change over time. In our case, user behavior is not fixed and can change over time under MyBehavior’s influence (see Figure 4, left-most and right-most images). The exploit-and-explore strategy models this dynamic nature of human behavior effectively.  MyBehavior exploits the most common user behaviors that promote energy balance to produce short-term health gain. To target long-term health, it occasionally explores infrequent higher-energy-expending behaviors to discover actions that the user might repeat in the future, leading to sustained energy balance that could boost weight loss.

Figure 4 shows different generated suggestions that encourage the user to either continue positive activities (ie, low-calorie foods, walking, or exercise), make small changes in some situations (ie, stationary activities) (left-most image), or avoid negative activities (ie, frequent large meals) (second image). The first and third images in Figure 4 show suggestions for two different users and the first and last images show suggestions for the same user that change over time.  A video demonstrating different features of MyBehavior is included as Multimedia Appendix 2.

Figure 4. Screenshots showing recommended suggestions for exercise and food: (a) physical activity suggestions made by MyBehavior, (b) food suggestions made by MyBehavior, (c) physical activity suggestions for a different user, and (d) physical activity suggestions for the same user as in (a), but at a different point in time.
View this figure

Measures

First, we used a suggestion-rating survey to evaluate user intentions to follow the suggestions. Participants completed this survey after the 3-week study concluded. Participants rated the suggestions, by indicating on a 1-to-5 scale, whether they would be willing and able to do the recommended action on an average day—5 (Strongly Agrees that he/she can follow the suggestion), 1 (Strongly Disagrees). Each participant rated suggestions that she/he saw during the study in an online form. Experimental group participants rated 15 top-ranked—top 8 physical activity and top 7 food—personalized MyBehavior suggestions of their own. On the other hand, the control group participants rated 10 randomly chosen generic prescriptive suggestions. In addition, we quantitatively measured behavior change for all participants using logs of daily physical activity and dietary intakes.

The daily diary and the in-depth, semistructured interviews measured participant feedback regarding the suggestions. For the daily diaries, we queried (1) whether they looked at MyBehavior’s suggestions, and (2) whether they made or wanted to make any changes after seeing the suggestions. The semistructured interviews covered users’ general overall experience with MyBehavior and the quality of the suggestions. Specifically, we inquired about awareness, behavior change, and of any software improvement they would like to see. In addition, in the interview, we asked clarifying questions that explained quantitative results observed from the data.

Analysis Plan

Regarding the user’s intention of following MyBehavior’s suggestions, we gathered ratings for suggestions on a secure website and analyzed the data using RStudio. Since the ratings were in ordinal scale, we used a nonparametric Mann-Whitney U test [32] for statistical significance and effect size.

We measured behavior changes by analyzing activity and dietary logs for statistical significance using MATLAB (MathWorks, Inc) statistics toolbox and RStudio. For each user, we computed median walking length and calories per food item. We considered medians across entire weeks over other central measures since they are less susceptible to spurious noise or outliers (eg, occasional intake of very-high-calorie food or atypical, unusually lengthy walk). We did not report changes in running and manually logged exercises in the data analysis as they often require higher effort and are tough to change within the 3 weeks of the experiment. In our analysis, we first considered the number of positive changes. A positive change is defined as a downward trend in median calories in meals, or an upward trend toward longer-length walks over the first week to the third week. We used the Fisher Exact Test [32] to measure the number of positive changes as an effect of MyBehavior. Because of small sample size, the Fisher Exact Test is used instead of the chi-square test for independence. We used a two-sample independent Student’s t test to measure statistical significance for total walk lengths and total food calories consumed per day. We computed differences in walking distances instead of total number of calories burned, since a walk of a fixed distance can result in a different amount of calories burned for different individuals [25]. We calculated the effect size of walking and eating behavior changes with Cohen’s d measure.

Finally, face-to-face, semistructured interviews were audio recorded and transcribed. Interview transcripts and daily diaries were then broken down into themes using thematic analysis [33].


Adherence

A total of 17 participants completed the 3-week study, yielding almost 2.1 million recorded physical activity instances, amounting to more than 8000 hours of physical activity. During the same period, participants labeled nearly 850 images of food with annotations.

User Acceptance of MyBehavior Suggestions

In the suggestion-rating survey, the experimental group (mean 3.4, SD 1.2, q25=2.75, q50=3, q75=4), with MyBehavior suggestions, intended to follow personalized suggestions more than the control group (mean 2.5, SD 1.6, q25=1, q50=2, q75=4) intended to follow the generic suggestions. A nonparametric Mann-Whitney U test [32] found this difference to be statistically significant (P<.001, 95% CI 0-1.001, effect size = 0.99).

Physical Activity

Figure 5 shows the distribution, in the form of box plots, of walking lengths over time for the experimental (left-hand image) and the control (right-hand image) groups. For each week of the study, we computed these distributions for the different users. To ease interpretation, we joined the median per week with thick green or red lines for each user. A green line implies a positive change as discussed in the data analysis section. A red line indicates the reverse negative trend. We used a log scale for walking-length distribution since walking-length distributions have heavy tails [34].

For walking, 78% (7/9) of participants in the experimental group (Figure 5, left-hand image) showed positive trends, whereas 75% (6/8) of participants in the control group (Figure 5, right-hand image) exhibited negative trends.  A Fisher Exact Test found this ratio in the number of positive changes between the experimental and control groups statistically significant (P=.05) [35]. In addition, MyBehavior users walked an average of 10 minutes more per day within the experiment phase (ie, from the first to the third week). However, we did not observe any change for the control group. A two-sample t test found this difference in change of walking duration to be significant (t15= 2.1, P=.055, 95% CI -0.23 to 19.052, d=0.9).

Qualitative data from daily diary and face-to-face interviews largely supported this quantitative result. However, we also observed some important subtleties. First, participants in the experimental group described the activity suggestions to be actionable and relevant to their lives. Control group participants appreciated that the generic suggestions reminded them of good habits. However, they often faced problems incorporating the suggestions into their daily lives. The following quotes were taken from the daily diaries of participants.

Those suggestions are quite good, which reminds me not to sit too long in one place.
[Experimental group participant #1]
The exercise suggestions made me want to do some more activities and be less stationary. Seeing how long I have been stationary and the low frequency of activity made me want to make a change.
[Experimental group participant #5]
Try to get up from my desk more often...added “walk" notes to my calendar.
[Experimental group participant #2]
I did some walking where I normally walk. The app now shows I walked there 26 times. The app makes me feel that I can do it again since I have done the same walk many times.
[Experimental group participant #7]
The suggestions encourage me to do/plan exercises for the near future...It reminds me that some foods are better than others.
[Control group participant #1]
They seem like good generic suggestions. The kind you would read...as tips in a health magazine or some such...
[Control group participant #4]

Some MyBehavior users reported that even the nonfrequent explore suggestions were actionable and expressed interest in acting on them. For instance, experimental group participant #7 said the following in his/her daily diary:

I saw a walk to my nearest bus stand listed. Normally, I drive my car to go to my office. But looking at the extra walking I got while going to the bus stop makes me think about doing it often and making it a habit.
[Experimental group participant #7]

Results from interviews also revealed that participants at various stages of active lifestyle reacted to suggestions differently [15]. For the experimental group, participants who were considering making changes expressed that they became more self-conscious about their behavior and they were eager to follow the suggested changes (eg, starting to walk more near home, or continuing runs on treadmills). Comparatively, users likely maintaining an active lifestyle expressed that the suggestions reflected their current healthy behavior and considered them as good reinforcements. However, participants in the maintenance phase wanted to change their stationary behavior in the office with occasional small walks. For the control group, users were frustrated because the suggestions were not always feasible and did not blend with their routines and lifestyle. Control group users maintaining an active lifestyle were unaffected by generic suggestions and continued their regular behavior across weeks. For example, control group participants #7 and #8 were maintaining participants and their behavior showed no negative trends in Figure 5 (right-hand image). Control group users who did not already have a maintaining lifestyle gradually became less active or made poorer food choices after the initial phase of the study.

Finally, on a few occasions, MyBehavior suggestions were hard to follow or did not reflect user preferences. For example, one user reported in the interview that he used to play soccer with his friends but his friends recently moved to a new location. He could no longer play soccer, which MyBehavior was suggesting. In addition, often user-preferred activities are not top MyBehavior suggestions. For instance, one user preferred to swim even though she did not do it often. Finally, experimental group participant #8 (subject 8 in Figure 5, left-hand image, with negative trends) reported an inability to follow MyBehavior suggestions because of a looming work deadline during the study.

Figure 5. Box plots showing the distribution of walking lengths for the experimental group (a) and for the control group (b) over the 3-week study. We joined the medians of distributions and showed each trend as a thick green line (increasing trend) or red line (decreasing trend) for walking length.
View this figure

Dietary Behavior

Figure 6 shows the distribution, in the form of box plots, of meal calories for the experimental group (left-hand image) and the control group (right-hand image). For each week of the study, we computed these distributions for different users. Similar to walking-behavior graphs, we joined medians across weeks to show positive or negative changes for each user.

For caloric intake, 78% (7/9) of participants in the experimental group showed positive trends (green lines in Figure 6, left-hand image), and 57% (4/7) of participants in the control group showed negative trends (red lines in Figure 6, right-hand image—1 participant had insufficient data). However, a Fisher Exact Test found this to be nonsignificant (P=.15). For control group participants, we also found their average median calories per day to increase by 211 calories (mean 211.7, SD 263.07, q25=-31.25, q50=187.5, q75=429.35) from the first week to the third week. Comparatively, the experimental group showed an average calorie per day decrease of nearly 100 calories (mean -99.3, SD 481.27, q25=-527.83, q50=-37.3, q75=87.5) from the first week to the third week. This change was not significant in a two-sample t test (t12=1.3234, P=.21, 95% CI -201 to 822.96, d=0.72).

In qualitative feedback, similar to physical activity suggestions, experimental group users found the suggestions to be more actionable and reported to make more changes compared to control group users who found the suggestions to be hard to work on. This feedback is illustrated in the following quotes from participants’ daily diaries.

The pictures of my meals are very useful to keep track of what I've been eating in the past. People tend to forget about their habits, but pictures in this case are a nice way to bring your food history in front of your eyes.
[Experimental group participant #9]
The suggestions remind me that some foods are better than others.
[Control group participant #1]
It recommends me to eat stuff that I don't have at home.
[Control group participant #4]
These suggestions don't take into account my dietary restrictions.
[Control group participant #5]

Similar to activity explore suggestions, MyBehavior users often found the explore suggestions to be actionable.

I just wanted to see what it was...These ones [explore suggestions] seemed to pick up some "good" food habits.
[Experimental group participant #4]

Finally, users reported manual food logging to be time consuming in the interview. However, they also reported that this manual process made them more aware of their foods. Consequently, control group participants reported making dietary changes without personalized suggestions.

Figure 6. Box plots showing the distribution of food calories for the experimental group (a) and for the control group (b) over the 3-week study. We joined the medians of distributions and showed each trend as a thick green line (increasing trend) or red line (decreasing trend) for median food calorie intake.
View this figure

Principal Findings

To our knowledge, MyBehavior is the first system to automatically provide personalized suggestions that relate to users’ lifestyles. In the quantitative results, MyBehavior users demonstrated superior behavior changes compared to the control group. Qualitative measures from the face-to-face interviews and the daily diaries confirmed that the suggestions indeed were perceived to be personalized to their lives. This concordance of superiority in both quantitative behavior change and qualitative user perception makes MyBehavior’s automated health feedback approach very promising and provides support for longitudinal studies and future investigations into automated personalization approaches.

Specifically, in our evaluation, users rated that they could follow MyBehavior personalized suggestions more than the control condition suggestions. Results also revealed a significant change in walking behaviors for MyBehavior users. In qualitative measures, users reported MyBehavior activity suggestions to be more actionable. Interestingly, although users qualitatively reported the dietary suggestions to be more actionable, dietary behavior changes were not found to be different between the groups. This finding could be due to the manual-logging nature of food intake being sufficient for behavior change alone. The manual process of food logging might produce self-awareness and reflection. Indeed, past research demonstrates that simple logging can improve one’s food consumption behavior [16]. However, food logging is an arduous process and it is often hard to continue for an extended period. Thus, we need longer studies to determine if food logging along with suggestions could aid in sustained behavior change. Furthermore, we had a small sample in the study with inadequate statistical power. Thus, larger trials are necessary to further elucidate the effects of food logging and these types of suggestions on eating behavior.

Nonetheless, MyBehavior explores a unique space for health feedback. Earlier studies in this domain predominantly focused on overall behavior [7,14], tailoring [36], or self-tracking [33] without deeper data analysis and personalization. MyBehavior takes a data mining approach to automatically find contextualized suggestions from logged data. This automated approach also relieves users from the burden of self-analyzing their data. Thus, MyBehavior is a marked departure from previous self-monitoring programs found in the literature, where users themselves decide on how to make changes on their own [33]. MyBehavior suggestions relate to a user’s existing behaviors, making them actionable as the user is told where and when to act on them. Furthermore, unique sets of suggestions are generated for each user based on their routine and lifestyle. The literature on N-of-1 approaches [37,38,39] argue that such personalization should yield better efficacy than one-size-fits-all or tailored-suggestion approaches [15], where similar suggestions are provided to users with similar characteristics (eg, age, gender, daily calorie intake, and loss).

Despite this promising direction, the automated data-driven personalization approach of MyBehavior brings its own challenges. Manual logging of food and exercise, in addition to automated logging, are necessary for proper functioning of MyBehavior. Qualitative interviews revealed that manual food and exercise logging were often burdensome. Future iterations could use crowdsourcing-based semiautomated approaches to decrease the burden of manual food journaling [12]. Finally, interviews also highlighted the importance of considering contextual changes in users’ lives and preferences. Thus, giving users control in deciding which suggestion they want to follow is required for well-accepted personalization [7].

Limitations

An important limitation is the short-term and small-scale nature of the study, which makes it difficult to make definitive conclusions. However, the study helped us to identify the potential efficacy of MyBehavior and pinpoint design improvements for future deployments. Indeed, Klasanja et al [17] argued that such short-term studies with similar evaluation goals as in our study are often more suitable for new and untested behavior change technologies like MyBehavior. Another limitation was that the nonpersonalized suggestions were sometimes too specific, for example, “walking with a dog.” In the daily diaries, some users reported that they could not follow this suggestion since they did not own a dog. While designing generic suggestions, we tried to find suggestions that most users could follow, without being overly generic. However, there will always be exceptions where a suggestion does not fit one’s lifestyle.

Despite these limitations, this pilot study demonstrates the potential of using automated personalization for actionable health feedback. As we move into an age where increasingly more people are tracking their health with mobile and other technologies, we believe MyBehavior’s automated technique holds great potential to provide feedback that can be used to improve health outcomes at scale.

Conclusions

MyBehavior is the first mobile health app that can encourage healthy behavior change by automatically providing low-effort suggestions based on the context and personal information of users.  The pilot user study demonstrated the feasibility and acceptability of MyBehavior. Users considered MyBehavior’s personalized, contextualized suggestions to be more actionable and to require less effort to implement than generic prescriptive suggestions. Preliminary evidence of behavior change shows that a high percentage of MyBehavior users did more physical exercise, yet the potential impact on eating behaviors remains unclear. The addition of more human control over the suggestions and providing easier logging mechanisms for food and exercise were identified as key areas of improvement. Future directions should include addressing the identified shortcomings of the system and testing its effectiveness in fostering health behavior change in a larger longitudinal trial.

Acknowledgments

We acknowledge Emiliano Miluzzo, Erin Carroll, Fabian Okeke, Jean Costa, Steven Voida, and Chantelle Farmer for their early contributions to this research. This work was supported by Intel Science and Technology Center for Pervasive Computing.

Conflicts of Interest

None declared.

Multimedia Appendix 1

Generic suggestions used in the control group.

PDF File (Adobe PDF File), 40KB

Multimedia Appendix 2

A video showing how MyBehavior works.

MP4 File (MP4 Video), 42MB

Multimedia Appendix 3

CONSORT-EHEALTH checklist V1.6.1 [40].

PDF File (Adobe PDF File), 1MB

  1. Alwan A. Global Status Report on Noncommunicable Diseases 2010. Geneva, Switzerland: World Health Organization; 2011.   URL: http://www.who.int/nmh/publications/ncd_report_full_en.pdf [accessed 2015-05-02] [WebCite Cache]
  2. Centers for Disease Control and Prevention (CDC). Million hearts: strategies to reduce the prevalence of leading cardiovascular disease risk factors--United States, 2011. MMWR Morb Mortal Wkly Rep 2011 Sep 16;60(36):1248-1251 http://www.cdc.gov/mmwr/pdf/wk/mm6036.pdf [FREE Full text] [Medline]
  3. Kaplan R, Stone A. Bringing the laboratory and clinic to the community: mobile technologies for health promotion and disease prevention. Annu Rev Psychol 2013;64:471-498. [CrossRef] [Medline]
  4. Cawley J, Meyerhoefer C. The medical care costs of obesity: an instrumental variables approach. J Health Econ 2012 Jan;31(1):219-230. [CrossRef] [Medline]
  5. MyFitnessPal.   URL: http://www.myfitnesspal.com/ [accessed 2015-05-07] [WebCite Cache]
  6. Moves.   URL: https://www.moves-app.com/ [accessed 2015-05-02] [WebCite Cache]
  7. Consolvo S, McDonald DW, Toscos TT, Chen MY, Froehlich J, Harrison B, et al. Activity sensing in the wild: a field trial of ubifit garden. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. New York, NY: ACM; 2008 Presented at: SIGCHI Conference on Human Factors in Computing Systems; April 5-10, 2008; Florence, Italy p. 1797-1806. [CrossRef]
  8. James JL, Mamykina L, Lindtner S, Delajoux G, Strub HB. Fish'n'Steps: encouraging physical activity with an interactive computer game. In: Proceedings of the 8th International Conference on Ubiquitous Computing. Berlin, Germany: Springer-Verlag; 2006 Presented at: UbiComp'06: 8th International Conference on Ubiquitous Computing; September 17-21, 2006; Orange County, CA p. 261-278.
  9. Consolvo S, Everitt K, Smith I, Landay JA. Design requirements for technologies that encourage physical activity. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 2006 Presented at: SIGCHI Conference on Human Factors in Computing Systems; April 22-27, 2006; Montréal, PQ p. 457-466.
  10. van der Weegen S, Verwey R, Spreeuwenberg M, Tange H, van der Weijden T, de Witte L. The development of a mobile monitoring and feedback tool to stimulate physical activity of people with a chronic disease in primary care: a user-centered design. JMIR Mhealth Uhealth 2013;1(2):e8 [FREE Full text] [CrossRef] [Medline]
  11. Daugherty BL, Schap TE, Ettienne-Gittens R, Zhu FM, Bosch M, Delp EJ, et al. Novel technologies for assessing dietary intake: evaluating the usability of a mobile telephone food record among adults and adolescents. J Med Internet Res 2012;14(2):e58 [FREE Full text] [CrossRef] [Medline]
  12. Noronha J, Hysen E, Zhang H, Gajos KJ. Platemate: crowdsourcing nutritional analysis from food photographs. In: Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology. New York, NY: ACM; 2011 Presented at: The 24th Annual ACM Symposium on User Interface Software and Technology; October 16-19, 2011; Santa Barbara, CA p. 1-12.
  13. Thomaz E, Parnami A, Essa I, Abowd GD. Feasibility of identifying eating moments from first-person images leveraging human computation. In: Proceedings of the 4th International SenseCam and Pervasive Imaging Conference. New York, NY: ACM; 2013 Presented at: The 4th International SenseCam and Pervasive Imaging Conference; November, 18-19, 2013; San Diego, CA p. 26-33.
  14. Lane ND, Mashfiqui M, Lin M, Yang X, Lu H, Ali S, et al. Bewell: A smartphone application to monitor, model and promote wellbeing. In: Proceedings of the 5th International ICST Conference on Pervasive Computing Technologies for Healthcare. 2011 Presented at: 5th International ICST Conference on Pervasive Computing Technologies for Healthcare; May 23-26, 2011; Dublin, Ireland p. 23-26. [CrossRef]
  15. Prochaska JO, Velicer WF. The transtheoretical model of health behavior change. Am J Health Promot 1997;12(1):38-48. [Medline]
  16. Siek KA, Connelly KH, Rogers Y, Rohwer P, Lambert D, Welch JL. When do we eat? An evaluation of food items input into an electronic food monitoring application. In: Proceedings of the Pervasive Health Conference and Workshops. New York, NY: IEEE; 2006 Presented at: Pervasive Health Conference and Workshops; November 29-December 1, 2006; Innsbruck, Austria p. 1-10. [CrossRef]
  17. Predrag K, Consolvo S, Pratt W. How to evaluate technologies for health behavior change in HCI research. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. New York, NY: ACM; 2011 Presented at: SIGCHI Conference on Human Factors in Computing Systems; May 7-12, 2011; Vancouver, BC p. 3063-3072. [CrossRef]
  18. National Heart, Lung, and Blood Institute, National Institutes of Health. 2011 Sep 26. Getting started and staying active   URL: http://www.nhlbi.nih.gov/health/health-topics/topics/phys/getstarted.html [accessed 2015-05-07] [WebCite Cache]
  19. National Heart, Lung, and Blood Institute, National Institutes of Health. 2013 Feb 13. Healthy eating plans   URL: http://www.nhlbi.nih.gov/health/educational/wecan/eat-right/sample-plans.htm [accessed 2015-05-02] [WebCite Cache]
  20. Burns MN, Begale M, Duffecy J, Gergle D, Karr CJ, Giangrande E, et al. Harnessing context sensing to develop a mobile intervention for depression. J Med Internet Res 2011;13(3):e55 [FREE Full text] [CrossRef] [Medline]
  21. Dallery JJ, Cassidy RN, Raiff BR. Single-case experimental designs to evaluate novel technology-based health interventions. J Med Internet Res 2013;15(2):e22 [FREE Full text] [CrossRef] [Medline]
  22. Bishop CM. Pattern Recognition and Machine Learning. New York, NY: Springer-Verlag; 2006.
  23. Lu H, Yang J, Liu Z, Lane ND, Choudhury T, Campbell AT. The Jigsaw continuous sensing engine for mobile phone applications. In: Proceedings of the 8th ACM Conference on Embedded Networked Sensor Systems. New York, NY: ACM; 2010 Presented at: 8th ACM Conference on Embedded Networked Sensor Systems; November 3-5, 2010; Zurich, Switzerland p. 71-84. [CrossRef]
  24. Ainsworth BE, Haskell WL, Herrmann SD, Meckes N, Bassett DR, Tudor-Locke C, et al. 2011 Compendium of Physical Activities: a second update of codes and MET values. Med Sci Sports Exerc 2011 Aug;43(8):1575-1581. [CrossRef] [Medline]
  25. Wikipedia. 2015 Feb 16. Metabolic equivalent   URL: http://en.wikipedia.org/wiki/Metabolic_equivalent [accessed 2015-05-07] [WebCite Cache]
  26. USDA National Nutrient Database for Standard Reference, Release 21. Beltsville, MD: US Department of Agriculture, Agricultural Research Service; 2008.   URL: http://www.ars.usda.gov/News/docs.htm?docid=18880 [accessed 2015-05-07] [WebCite Cache]
  27. Eiter T, Mannila H. Technischen Universität Wien, Knowledge Based Systems Group. Vienna, Austria: Technischen Universität Wien; 1994 Apr 25. Computing discrete Frechet distance   URL: http://www.kr.tuwien.ac.at/staff/eiter/et-archive/cdtr9464.pdf [accessed 2015-05-07] [WebCite Cache]
  28. Madden GJ, editor. APA Handbook of Behavior Analysis. Washington, DC: American Psychological Association; Sep 16, 2013.
  29. Bandura A. Social cognitive theory: an agentic perspective. Annu Rev Psychol 2001;52:1-26. [CrossRef] [Medline]
  30. Fogg BJ. A behavior model for persuasive design. In: Proceedings of the 4th International Conference on Persuasive Technology. New York, NY: ACM; 2009 Presented at: 4th International Conference on Persuasive Technology; April 26-29, 2009; Claremont, CA   URL: http://bjfogg [CrossRef]
  31. Powell WB. Approximate Dynamic Programming: Solving the Curses of Dimensionality. 2nd edition. Hoboken, NJ: Wiley-Interscience; Sep 2011.
  32. Ott RL, Longnecker MT. An Introduction to Statistical Methods and Data Analysis. 6th edition. Belmont, CA: Brooks/Cole Cengage Learning; 2010.
  33. Braun V, Clarke V. Using thematic analysis in psychology. Qual Res Psychol 2006 Jan;3(2):77-101. [CrossRef]
  34. González MC, Hidalgo CA, Barabási AL. Understanding individual human mobility patterns. Nature 2008 Jun 5;453(7196):779-782. [CrossRef] [Medline]
  35. Fisher RA. On the interpretation of X2 from contingency tables, and the calculation of P. J R Stat Soc 1922 Jan;85(1):87-94. [CrossRef]
  36. Kukafka R. Tailored health communication. In: Lewis D, Eysenbach G, Kukafka R, Stavri PZ, Jimison HB, editors. Consumer Health Informatics. New York, NY: Springer; 2005:22-33.
  37. Wiggins JS. Agency and communion as conceptual coordinates for the understanding and measurement of interpersonal behavior, Minnesota Press, Minneapolis. In: Cicchetti D, Grove WM, editors. Thinking Clearly About Psychology: Essays in Honor of Paul E Meehl. Minneapolis, MN: University of Minnesota Press; 1991:89-113.
  38. Tashakkori A, Teddlie C. Mixed Methodology: Combining Qualitative and Quantitative Approaches. Thousand Oaks, CA: SAGE Publications, Inc; 1998.
  39. Estrin D. Small data, where n=me. Commun ACM 2014 Apr 01;57(4):32-34. [CrossRef]
  40. Eysenbach G, CONSORT-EHEALTH Group. CONSORT-EHEALTH: improving and standardizing evaluation reports of Web-based and mobile health interventions. J Med Internet Res 2011;13(4):e126 [FREE Full text] [CrossRef] [Medline]


CDC: Centers for Disease Control and Prevention
FBM: Fogg Behavior Model
GMM: Gaussian Mixture Model
GPS: Global Positioning System
MAB: multi-armed bandit
METS: Metabolic Equivalents of Task
q25: lower quartile
q50: median
q75: upper quartile
RCT: randomized control trial
USDA: United States Department of Agriculture
WHO: World Health Organization


Edited by G Eysenbach; submitted 19.12.14; peer-reviewed by J Galante; comments to author 16.01.15; revised version received 19.02.15; accepted 17.04.15; published 14.05.15

Copyright

©Mashfiqui Rabbi, Angela Pfammatter, Mi Zhang, Bonnie Spring, Tanzeem Choudhury. Originally published in JMIR Mhealth and Uhealth (http://mhealth.jmir.org), 14.05.2015.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR mhealth and uhealth, is properly cited. The complete bibliographic information, a link to the original publication on http://mhealth.jmir.org/, as well as this copyright and license information must be included.