Field experiments conducted with the village, city, state, region, or even country as the unit of randomization are becoming commonplace in the social sciences. While convenient, subsequent data analysis may be complicated by the constraint on the number of clusters in treatment and control. Through a battery of Monte Carlo simulations, we examine best practices for estimating unit-level treatment effects in cluster-randomized field experiments, particularly in settings that generate short panel data. In most settings we consider, unit-level estimation with unit fixed effects and cluster-level estimation weighted by the number of units per cluster tend to be robust to potentially problematic features in the data while giving greater statistical power. Using insights from our analysis, we evaluate the effect of a unique field experiment: a nationwide tipping field experiment across markets on the Uber app. Beyond the import of showing how tipping affects aggregate market outcomes, we provide several insights on aspects of generating and analyzing cluster-randomized experimental data when there are constraints on the number of experimental units in treatment and control.

More Research From These Scholars

BFI Working Paper Apr 30, 2019

Measuring Success in Education: The Role of Effort on the Test Itself

John List, Uri Gneezy, Jeffrey A. Livingston, Xiangdong Qin, Sally Sadoff, Yang Xu
Topics:  Early Childhood Education
BFI Working Paper Oct 21, 2019

The Drivers of Social Preferences: Evidence from a Nationwide Tipping Field Experiment

Bharat Chandar, Uri Gneezy, John List, Ian Muir
Topics:  Employment & Wages
BFI Working Paper Oct 30, 2019

How Can Experiments Play a Greater Role in Public Policy? 12 Proposals from an Economic Model of Scaling

Omar Al-Ubaydli, Min Sok Lee, John List, Claire L. Mackevicius, Dana Suskind