Investigating goal-directed actions and habit formation in mice. (A) Mice were trained with two reinforcers. In the figure, the task is illustrated with one reinforcer, cheese, delivered in the operant box contingent upon lever pressing, while the other reinforcer, sugar water, is delivered freely to the mouse in the home cage. The reinforcers shown in the figure are for illustrative purposes only. (B) Devaluation is performed over two days: Day 1, the mouse is given the reinforcer, cheese, previously earned by lever pressing (devalued condition); Day 2, the mouse receives the reinforcer, sugar water, previously freely available in its home cage (valued condition). The order of the conditions is randomized. Immediately after each feeding session, which lasts 1 h, the mouse undergoes a 5-min extinction test in the operant chamber, with the training lever extended. The numbers of presses on the training lever under the valued and devalued conditions are compared. If the mouse presses more under the valued than under the devalued condition, the behavior is classified as goal-directed. However, if the mouse presses the lever equally under both conditions, its behavior is classified as habitual. (C) The generalization test. Two levers are presented in a 5-min extinction test. If the mouse presses the training lever more than the novel lever, it is discriminating/exploiting. However, if the mouse presses both levers equally, there is significant generalization/exploration. The training lever is shown in blue and the novel lever in pink.
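The comparisons described in panels B and C can be sketched in a few lines of Python. This is only an illustrative sketch, not the analysis used by the authors: the function names, the normalized preference index, and the `tolerance` value of 0.1 that stands in for "presses equally" are all assumptions introduced here. In practice, such comparisons would typically be made with a statistical test across animals rather than a fixed cutoff on a single mouse.

```python
from typing import Optional


def preference_index(a: int, b: int) -> Optional[float]:
    """Normalized difference (a - b) / (a + b), in [-1, 1].

    Hypothetical helper: returns None when there were no presses at all,
    because the index is undefined in that case.
    """
    total = a + b
    if total == 0:
        return None
    return (a - b) / total


def _classify(a: int, b: int, more_label: str, equal_label: str,
              tolerance: float) -> str:
    """Shared decision rule: 'more' vs. 'equal' vs. not covered by the caption."""
    idx = preference_index(a, b)
    if idx is None:
        return "unclassified"
    if idx > tolerance:
        return more_label
    if abs(idx) <= tolerance:
        return equal_label
    # The caption does not describe the case b > a, so it is left unlabeled.
    return "unclassified"


def classify_devaluation(valued: int, devalued: int,
                         tolerance: float = 0.1) -> str:
    """Panel B: training-lever presses under valued vs. devalued conditions.

    `tolerance` is an assumed cutoff, not a value from the source.
    """
    return _classify(valued, devalued, "goal-directed", "habitual", tolerance)


def classify_generalization(training: int, novel: int,
                            tolerance: float = 0.1) -> str:
    """Panel C: presses on the training lever vs. the novel lever."""
    return _classify(training, novel, "discriminating", "generalizing",
                     tolerance)


if __name__ == "__main__":
    # Made-up press counts, for illustration only.
    print(classify_devaluation(valued=30, devalued=10))
    print(classify_generalization(training=25, novel=24))
```

Both panels share the same decision rule, so a single helper keeps the two classifications consistent; only the labels differ.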