Addressing people’s current and future states in a reinforcement learning algorithm for persuading to quit smoking and to be physically active: Data and analysis code
doi: 10.4121/21533055
This is the data and analysis code underlying the paper "Addressing people’s current and future states in a reinforcement learning algorithm for persuading to quit smoking and to be physically active" by Nele Albers, Mark A. Neerincx, and Willem-Paul Brinkman. This paper proposes a Reinforcement Learning (RL)-algorithm for persuading people in the context of a virtual coach for quitting smoking and becoming more physically active.
Study
The paper is based on a longitudinal study on the crowdsourcing platform Prolific run between 20 May 2021 and 30 June 2021. The Human Research Ethics Committee of Delft University of Technology granted ethical approval for the research (Letter of Approval number: 1523).
In this study, daily smokers who were contemplating or preparing to quit smoking interacted with the text-based virtual coach Sam in up to five conversational sessions. In each session, participants were assigned a new preparatory activity for quitting smoking, such as thinking of and writing down reasons for quitting smoking. Since becoming more physically active may make it easier to quit smoking, half of the activities addressed becoming more physically active. The virtual coach chose from five persuasion types to persuade people to do their activity. In the first two sessions, the persuasion type was chosen uniformly at random; in the last three sessions, the persuasion type was determined by a persuasion algorithm. In the next session, participants were asked to indicate the effort they spent on their activity, which served as basis for the reward signal for the persuasion algorithm.
The study was pre-registered in the Open Science Framework (OSF): https://osf.io/k2uac. This pre-registration describes the study design, measures, etc. Note that the data we provide here is only a part of the data collected in the study, namely, the data related to studying the effectiveness of the persuasion algorithm.
Pointers to further resources:
- Data on the acceptance of the virtual coach can be found here: https://doi.org/10.4121/19934783.
- Data on users' needs for a digital smoking cessation application can be found here: https://doi.org/10.4121/20284131.
- Data on users' action plans for doing the activities (n = 469) and free-text responses to reflective questions about the activities (n = 2026) is available here: https://doi.org/10.4121/21905271.
- The implementation of the virtual coach Sam is available here: https://doi.org/10.5281/zenodo.6319356.
- The formulations for the 24 preparatory activities used in the study can be found in the supplementary material of the paper (S8 Appendix).
Data
We collected four main types of data:
- Perceived motivational impact and effort. The perceived motivational impact of the conversational sessions and the effort spent on the activities were used to evaluate the effectiveness of the persuasion algorithm. Both were measured during the conversational sessions.
- Involvement in the activities. We used people's involvement in their activities for an exploratory subgroup analysis comparing the algorithm effectiveness for people with low and high involvement.
- User characteristics (e.g., age, gender, Big-Five personality, quitter self-identity). This data was collected by means of questionnaires and from participants' Prolific profiles.
- RL-samples (states, actions, rewards). This data was collected from the conversational sessions. The actions were the five persuasion types (e.g., consensus, action planning, no persuasion), and the reward was based on the effort.
Please consult the "Data"-folder for more information on the data we collected.
Changes in V3
Fixed an error in extract_RL_samples.py: For the computation of feature means used to make the state features binary, not all the last states of participants were considered. This has now been fixed. The change does not alter the results of the paper in any way because the old and new feature means are so similar that the resulting binary state features are exactly the same.
- 2022-11-10 first online
- 2024-03-22 published, posted
- This work is part of the multidisciplinary research project Perfect Fit, which is supported by several funders organized by the Netherlands Organization for Scientific Research (NWO), program Commit2Data - Big Data & Health (project number 628.011.211).
DATA
- 3,366,150,995 bytesMD5:
76f50722ed34a1a5c3b52b04a2ae93f6
Persuasion_algorithm_dataset_v3.zip -
download all files (zip)
3,366,150,995 bytes unzipped