Facial recognition technology (FRT) raises profound questions about the role of technology in society. The complex ethical and normative concerns about FRT’s impact on privacy, speech, racial equity, and the power of the state merit serious debate. Yet one requirement common to proposed legislation and regulation of FRT is the testing and assessment of operational performance - how well does FRT actually work? This poses deep challenges given the rapid uptake of FRT in many new domains, such as retail, finance, travel, and criminal justice. In this Article, we provide research-and science-grounded recommendations for how to concretely test the operational accuracy of FRT that will be central to regulation and oversight.