Mean contribution to the public good (A), percentage of maximum possible payoff (B) and mean payoff (C) over 50 rounds of play in the control (Yellow), PN (Red), RN (Blue) and RNP (Green) experiments. All three treatments with targeted reciprocity succeed equally well at increasing contributions and percentage of maximum possible payoff relative to the control, and thus the reward treatments RN and RNP result in significantly higher actual payoffs than the punishment treatment PN. All data are analyzed at the level of the group to account for interdependence of outcomes for members of a given group. (A) Sign-rank test comparing contributions in Round 1 vs Round 50: Control, p=0.028, decrease; PN, p=0.18, no change; RN, p=0.036, increase; RNP, p=0.033, increase. (B) Ranksum comparing percentage of maximum possible payoff in the second half of the game: PN vs control, p=0.013, PN higher; RN vs control, p=0.048, RN higher; RNP vs control, p=0.023, RNP higher; PN vs RN, p=0.67; PN vs RNP, p=0.46; RN vs RNP, p=0.40. (C) Ranksum comparing mean payoff in the second half of the game: PN vs control, p=0.013, PN higher; RN vs control, p<0.001, RN higher; RNP vs control, p=0.001, RNP higher; PN vs RN, p=0.001, RN higher; PN vs RNP, p=0.005, RNP higher; RN vs RNP, p=0.40.