Data and analysis code for the Masters thesis titled "Dyadic Physical Activity Planning with a Virtual Coach: Using Reinforcement Learning to Select Persuasive Strategies"

doi: 10.4121/2796f502-0610-4a7d-a8ee-ebc36639e0b1.v1
The doi above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future. For a link that will always point to the latest version, please use
doi: 10.4121/2796f502-0610-4a7d-a8ee-ebc36639e0b1
Datacite citation style:
Stefan, Andrei; Albers, Nele; Brinkman, Willem-Paul (2023): Data and analysis code for the Masters thesis titled "Dyadic Physical Activity Planning with a Virtual Coach: Using Reinforcement Learning to Select Persuasive Strategies". Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/2796f502-0610-4a7d-a8ee-ebc36639e0b1.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset

This is the data and analysis code for the Masters thesis titled "Dyadic Physical Activity Planning with a Virtual Coach: Using Reinforcement Learning to Select Persuasive Strategies." In this thesis, we used data gathered through an observational study with a virtual coach to build a reinforcement learning model for persuading people to commit to plans for walking. Once the model was created, it was analysed using the code provided in this dataset, including simulations of how the model would perform in a real-life setting.


Study

The study was conducted on the online crowdsourcing platform Prolific in June and July 2023. The Human Research Ethics Committee of Delft University of Technology granted ethical approval for the research (Letter of Approval number: 3089). 114 participants participated in a conversation with the virtual coach named Jamie. During these conversations, the virtual coach measured people's confidence in following the plan, their perceived usefulness of planning, and their attitude towards planning. Then, the virtual coach used a persuasive strategy to try to change these aspects of a person's state, after which it measured the three aspects again. This repeated until at least two persuasive strategies were used, and until the three aspects were high enough to satisfy pre-set heuristics for when the person is likely to commit to the plan. When these conditions were met, the virtual coach measured the reward signal, composed of satisfaction with the conversation, commitment to the fist two weeks and to the whole plan, and confidence in reaching the goal.


Data

We provide data on:


  1. participant characteristics (e.g., age, gender, stage of change for becoming physically active),
  2. the state-action-next state-reward samples used to create the reinforcement learning model,
  3. free-text responses to the persuasive strategies used by the virtual coach,
  4. free-text responses about what people found motivating and demotivating in motivational messages.


The file OSF pre-registration explains in detail how each variable was measured.


Analysis

We provide code to reproduce our analysis with Docker. For more information on this, please refer to the README-file in this repository.


In the case of questions, please contact Nele Albers (n.albers@tudelft.nl) or Willem-Paul Brinkman (w.p.brinkman@tudelft.nl).

history
  • 2023-11-16 first online, published, posted
publisher
4TU.ResearchData
format
.zip, .csv, .md, .Rmd, .pdf, .ipynb, .png, .html, .xslx, .txt, .gz
funding
  • Part of Perfect Fit, funded by NWO, project number 628.011.211
organizations
TU Delft, Faculty of Electrical Engineering, Mathematics and Computer Science, Department of Intelligent Systems, Interactive Intelligence Group