Representative image. Photo: Julie Viken/Pexels.
Once defined in rhetorical but ultimately meaningless terms as “the conscientious, judicious and explicit use of current best evidence in making decisions about the care of individual patients” [1], evidence-based medicine rests on certain philosophical assumptions: a singular truth, ascertainable through empirical enquiry; a linear logic of causality in which interventions have particular effect sizes; rigour defined primarily in methodological terms (especially, a hierarchy of preferred study designs and tools for detecting bias); and a deconstructive approach to problem-solving (the evidence base is built by answering focused questions, typically framed as ‘PICO’ – population-intervention-comparison-outcome) [2].
The trouble with pandemics is that these assumptions rarely hold. A pandemic-sized problem can be framed and contested in multiple ways. Some research questions around COVID-19, most notably relating to drugs and vaccines, are amenable to randomised controlled trials (and where such trials were possible, they were established with impressive speed and efficiency [3, 4]). But many knowledge gaps are broader and cannot be reduced to PICO-style questions. Were care home deaths avoidable [5]? Why did the global supply chain for personal protective equipment break down [6]? What role does health system resilience play in controlling the pandemic [7]? And so on.
Against these – and other – wider questions, the neat simplicity of a controlled, intervention-on versus intervention-off experiment designed to produce a definitive (i.e. statistically significant and widely generalisable) answer to a focused question rings hollow. In particular, upstream preventive public health interventions aimed at supporting widespread and sustained behaviour change across an entire population (as opposed to testing the impact of a short-term behaviour change in a select sample) rarely lend themselves to such a design [8, 9]. When implementing population-wide public health interventions – whether conventional measures such as diet or exercise, or COVID-19 related ones such as hand-washing, social distancing and face coverings – we must not only persuade individuals to change their behaviour but also adapt the environment to make such changes easier to make and sustain [10, 11, 12].
Population-wide public health efforts are typically iterative, locally-grown and path-dependent, and they have an established methodology for rapid evaluation and adaptation [9]. But evidence-based medicine has tended to classify such designs as “low methodological quality” [13]. Whilst this has been recognised as a problem in public health practice for some time [11], the inadequacy of the dominant paradigm has suddenly become mission-critical.
Whilst evidence-based medicine recognises that study designs must reflect the nature of question (randomised trials, for example, are preferred only for therapy questions [13]), even senior scientists sometimes over-apply its hierarchy of evidence. An interdisciplinary group of scholars from the UK’s prestigious Royal Society recently reviewed the use of face masks by the general public, drawing on evidence from laboratory science, mathematical modelling and policy studies [14]. The report was criticised by epidemiologists for being “non-systematic” and for recommending policy action in the absence of a quantitative estimate of effect size from robust randomised controlled trials [15].
Such criticisms appear to make two questionable assumptions: first, that the precise quantification of impact from this kind of intervention is both possible and desirable, and second, that unless we have randomised trial evidence, we should do nothing.
It is surely time to turn to a more fit-for-purpose scientific paradigm. Complex adaptive systems theory proposes that precise quantification of particular cause-effect relationships is both impossible (because such relationships are not constant and cannot be meaningfully isolated) and unnecessary (because what matters is what emerges in a particular real-world situation). This paradigm proposes that where multiple factors are interacting in dynamic and unpredictable ways, naturalistic methods and rapid-cycle evaluation are the preferred study design. The 20th-century logic of evidence-based medicine, in which scientists pursued the goals of certainty, predictability and linear causality, remains useful in some circumstances (for example, the drug and vaccine trials referred to above). But at a population and system level, we need to embrace 21st-century epistemology and methods to study how best to cope with uncertainty, unpredictability and non-linear causality [16].
In a complex system, the question driving scientific inquiry is not “what is the effect size and is it statistically significant once other variables have been controlled for?” but “does this intervention contribute, along with other factors, to a desirable outcome?”. Multiple interventions might each contribute to an overall beneficial effect through heterogeneous effects on disparate causal pathways, even though none would have a statistically significant impact on any predefined variable [11]. To illuminate such influences, we need to apply research designs that foreground dynamic interactions and emergence. These include in-depth, mixed-method case studies (primary research) and narrative reviews (secondary research) that tease out interconnections and highlight generative causality across the system [16, 17].
Table 1 lists some philosophical contrasts between the evidence-based medicine and complex-systems paradigms. Ogilvie et al have argued that rather than pitting these two paradigms against one another, they should be brought together [9]. As illustrated in (Fig. 1), these authors depict randomised trials (what they call the “evidence-based practice pathway”) and natural experiments (the “practice-based evidence pathway”) in a complementary and recursive relationship rather than a hierarchical one. They propose that “… intervention studies [e.g. trials] should focus on reducing critical uncertainties, that non-randomised study designs should be embraced rather than tolerated and that a more nuanced approach to appraising the utility of diverse types of evidence is required.” (page 203) [9].
In the current fast-moving pandemic, where the cost of inaction is counted in the grim mortality figures announced daily, implementing new policy interventions in the absence of randomised trial evidence has become both a scientific and moral imperative. Whilst it is hard to predict anything in real time, history will one day tell us whether adherence to “evidence-based practice” helped or hindered the public health response to COVID-19 – or whether an apparent slackening of standards to accommodate “practice-based evidence” was ultimately a more effective strategy.
Trisha Greenhalgh is a professor of primary care health sciences at Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford.
This article was published as an editorial on PLOS Medicine and has been republished here under a Creative Commons Attribution license.