Task used to study instrumental learning. A: layout of the behavioral devices. There were 2 response ports with photobeams to detect entries into the response port. The response ports flanked a central fluid dispenser that delivered 20% sucrose solution into the reward port (here a spout; photobeam not shown). Using operant methods, animals learned to make responses in the active response port to receive sucrose in the reward port. Responses in the inactive port had no consequence. The availability of sucrose was signaled by an acoustic noise stimulus. In physiological experiments, a speaker placed directly above the glass spout delivered an acoustic noise stimulus when sucrose solution was available. In behavioral devaluation experiments, a standard fluid dipper made an acoustic stimulus when activated. B and C: rats were trained over a series of sessions, using a progression of operant schedules of reinforcement, to make repetitive entries into the active response port. Rats first experienced an autoshaping session, in which sucrose was delivered into the reward port only in the presence of the acoustic stimulus, which occurred approximately every 60 s. There was no requirement for responding in the response port during the autoshaping sessions. Next, rats were required to enter the response port to earn rewards (FR1, “Fixed Ratio of 1”). Finally, rats could earn rewards only when they entered the response port after the end of a pseudorandomly chosen interval (with mean of 20 or 40 s, chosen from an exponential distribution). Sessions were run daily until the rats earned 50 (Autoshaping, FR1) or 100 sucrose rewards (RI sessions). All rats that were implanted with arrays of electrodes (n = 5) were trained using the full series of procedures for a total of 700 earned rewards, as shown in C. Rats that were trained for devaluation (n = 32) were divided into 2 groups, with half of the rats trained until they received 150 rewards (B) and the other half trained to 700 rewards (C).