The one-trial-back stay-switch analysis is the most widely used method for characterizing behavior on the two-step task (Daw et al., 2011 (link); Akam, Costa, & Dayan, 2015 (link); Wunderlich et al., 2012 (link)). This method quantifies the tendency of a participant to repeat the choice made on the last trial or switch to the other choice, as a function of the outcome and transition on the previous trial. We considered four possible outcomes: Common-Rewarded (CR), Rare-Rewarded (RR), Common-Unrewarded (CU), and Rare-Unrewarded (RU). Model-based and model-free indices were computed from the stay probabilities following each outcome according to:
MF=(P(stay|CR)+P(stay|RR))(P(stay|CU)+P(stay|RU)), MB=(P(stay|CR)+P(stay|RU))(P(stay|CU)+P(stay|RR)).
We also examined whether hunger modulated other measures of simple reinforcement learning. We found no effect of hunger on changes in model-free control for second-stage choices or for action-specific, stimulus-irrelevant choices at the first-stage (Fig. S1).
Statistical analyses were implemented in MATLAB and SPSS (IBM Corp. Released 2019. IBM SPSS statistics for Windows, Version 26.0. Armonk, NY: IBM Corp.). We report the effect size with Cohen’s d and ηp2 and report the 95% confidence interval (CI) of the difference between groups.
Free full text: Click here