A policy’s effectiveness often hinges on whether it reaches the right people. Thanks to recent advances in big data and machine learning, researchers have revealed the gains from directing benefits like the Supplemental Nutrition Assistance Program (SNAP) and Social Security to the neediest recipients. While prior research on policy targeting: Directing specific interventions toward certain groups or areas to achieve optimal outcomes has emphasized static interventions: A treatment strategy where assignments remain fixed and do not adapt based on individual responses or new information. — where beneficiaries are evaluated and receive benefits once — many policies involve dynamic interventions: A treatment strategy that adapts and optimizes assignments over time based on observed responses and updated information. over multiple periods.  In this paper, the authors propose a new method to design and evaluate repeated interventions, which they apply to a randomized controlled trial (RCT): A study design where participants are randomly assigned to either a treatment group or a control group to objectively measure the effects of an intervention. on an energy rebate program. 

The authors recruited 2,400 Japanese households for their RCT, and offered participants assigned to the treatment a rebate to reduce their electricity consumption during peak hours. The RCT compares three different types of policy targeting approaches: static targeting, where pre-intervention information (like household demographics and historical electricity usage) is used to determine treatment assignments, dynamic targeting, where information gathered from the first period is used to inform treatment decisions in the second period, and non-targeting, where no information is used to differentiate among households. 

Using each of these targeting approaches, the authors assign participants to four groups: one untreated in both periods, one treated in both periods, and two that alternate between being treated in the first period and untreated in the second, and vice versa. They summarize their groups as follows: (U,U), (T,T), (U,T), and (T,U), where U denotes untreated and T signifies treated. The authors compare energy conservation across their groups, and find the following:

  • While static targeting significantly enhances welfare benefits compared to non-targeting policies, dynamic targeting significantly augments these benefits, nearly doubling the welfare gain: The increase in overall well-being or economic benefit resulting from a specific policy or intervention. from static targeting policies.
  • The authors attribute these gains to three key mechanisms: learning, where individuals who receive an intervention for multiple periods may learn to respond effectively; habit formation, where individuals develop habits of conserving electricity when exposed to the rebate program; and screening, where information gathered from earlier interventions informs later treatment allocation. Each of these mechanisms tends to vary widely across individuals.  

In an era when policies are carefully directed towards certain beneficiaries, this research demonstrates the capacity to further optimize targeting using a dynamic approach. Furthermore, the authors identify considerable variation in the learning, habit formation, and screening effects across households, and their research illustrates how to leverage these differences to devise optimal dynamic targeting strategies.