Dec 23 / James Kavanagh

The Design Gap in AI Governance

AI governance already has the scaffolding of laws, frameworks, committees, standards and rules. What's missing is the design discipline and practice to build and sustain real safety and security.
Chemical plants rarely explode. Planes rarely fall from the sky. That’s not because those industries have more regulations than everyone else, though they certainly are regulated. It's because they've learned, through tragedy, that design is the key driver of safety. Not rules. Not audits. Not compliance departments. Most certainly not paperwork.
Their safety and security come from design. Design of systems, of processes, of the mechanisms and culture that catch problems before they become disasters. When rare accidents do happen, the subsequent investigation almost always finds inadequate design as the primary cause of failure.
When Fukushima Daiichi went into meltdown, it didn’t happen in a regulatory void. Japan had nuclear laws, regulatory agencies, inspections, safety drills, emergency plans, peer exchange and international reporting. And yet a single site situated on a tsunami-prone coastline managed to lose power, melt multiple reactor cores, and contaminate land and lives for years.1
When you look closely at events in Fukushima, you don’t find an absence of rules or regulatory process. You don’t find compliance failures. You find wrong design assumptions, and a governance system that didn’t force those assumptions to change, even as evidence for change accumulated.2
After a year of writing, teaching, researching and building, that’s how I could sum up what I see as the current state of AI governance today. We’re not short of regulations, standards, or frameworks. We’re short of good design. Design of laws and regulations. Design of products. Design of user interactions. And critically, design of the governance mechanisms needed for sustained safe, secure and lawful AI.
Until we fix that by building the disciplines and practice of design, I worry that adding more rules and controls just leads to more paperwork, not safer systems. Let me explain.

Fukushima: Lots of rules, fatally bad design

Before 2011, Fukushima Daiichi sat inside a dense, multi-layered governance ecosystem. Japan had licensing and oversight structures, and it participated in international reporting on nuclear safety. It was a regulatory regime that we would commonly call “strong regulation.”
But what failed in March 2011 were two very specific design assumptions.
First, the size of tsunami the plant had to survive. Fukushima was designed around a worst-case specification of a tsunami that might be only a few metres tall, later revised to around 5-6 metres. The 2011 tsunami rose to roughly 14 metres, more than double what the plant was built to withstand. 3
Second, where critical equipment was physically placed. Backup diesel generators and electrical switchgear (the systems that kept reactor cooling running after the grid went down) were located in low-lying turbine-building basements just a few metres above sea level. When the seawall was overtopped, those basements flooded, the generators failed, and the plant lost almost all power. Within days, three reactor cores had largely melted.
But these were not regulatory or compliance problems in the usual sense. They were design problems.
Nor were they unknowable. In fact, by the late 2000s, internal analyses at TEPCO using newer tsunami models were already suggesting waves in the 10 to 15 metre range were very plausible at Fukushima Daiichi. Those numbers should have triggered redesign work, not deferral. Engineers were fully aware that if the seawall was breached, the power systems would be flooded, and control systems would fail.
And yet countermeasures were postponed. TEPCO actually explained their findings and risk assessment to regulators only days before the disaster, but as they “were not instructed to immediately implement countermeasures”, they took no action.
The appointed investigator later called the accident “a profoundly manmade disaster” that “could and should have been foreseen and prevented.” The accident report concluded that although laws, regulations and inspection programs were complied with, both the operator and the regulator itself tolerated outdated assumptions and failed to force an adequate shift in safety margin.
Rules existed. Reviews happened. But the design basis didn't move fast enough, and the plant still failed catastrophically, because its governance did not adapt to critical new data.
That’s the pattern I’m worried about in AI: that we’re setting about building elegant facades of regulations and compliance that are not inherently adaptive and are very unlikely to successfully mitigate harm from complex, dynamic AI systems.
Safety science has been making the point for decades that complex systems require safety controls that are designed from the outset to be adaptive. In modern safety thinking, accidents are not just component failures; they arise when the guidance and feedback loops that should steer behaviour across a complex socio-technical system break down or were never designed in the first place. 4
This framing matters because AI systems are complex adaptive systems. They change as they encounter new data. The people using them change how they work in response. The organisations deploying them adapt their processes. The environment shifts. A compliance model that treats safety as a property you verify once through a conformity assessment, then maintain through periodic audits or even 'post-market monitoring', fundamentally misunderstands the problem. You're not safeguarding a static, bounded artifact. You're trying to guide the behaviour of something that's constantly evolving, embedded in social and organisational contexts that are also constantly evolving.
Designing for safety in that context means designing the controls that sense what’s happening, interpret whether it’s drifting toward harm, and intervene before damage occurs. It means building closed feedback loops where information about system behaviour reaches people and systems with the authority and capability to act on it. It means architecting for adaptation, not just compliance at a point in time. 5

An aerial view of the Fukushima Plant after the tsunami

Regulation matters. I'm 100% in favour of good regulation and genuinely believe that regulation around the safety and security of digital systems, including AI, should be strengthened. Regulation done well sets expectations, creates accountability, forces organisations to collect evidence, and aligns different actors around shared language and comparative baselines. It establishes the scaffolding within which safety can be built. The EU AI Act, the NIST framework, ISO 42001: these all create structure that would otherwise be absent. They ask important questions and provide a method for answering them.
But scaffolding isn't building. Regulation can require you to think about risk. It can't do the thinking for you. It can mandate documentation, but it can't ensure that documentation is critically considered and reflects reality. It can demand human oversight, but it can't make sure that interfaces and escalation paths are designed to make oversight meaningful. The hard work of turning abstract requirements and perceived risks into concrete mechanisms, architectures, and interactions is design, and it happens inside the scaffolding that regulation provides. Regulation drives compliance, but compliance does not drive safety. Design drives safety.
Fukushima shows this starkly. You can have regulatory structure, inspection, and penalties, yet still build a plant that goes dark if the water is twice as high as your knowingly incorrect assumption. The problem wasn’t a shortage of regulation. The problem was that the design basis didn’t get forced to change as the hazard picture evolved. Elaborate apparatus around the plant. Not enough attention to how the plant itself behaves under stress.
Badly designed regulations can make the problem worse, simply contributing to the theatre, often despite the best of intentions. Michael Power's 1997 work on the audit society is all about how systems of checking can expand into 'rituals of verification' without any meaningful improvement to underlying performance. 6 Bruce Schneier coined the term "security theater" to describe measures that look reassuring while doing little to reduce real risk. 7
More recently, Sidney Dekker's Safety Theater extends this into operational safety management, showing how the desire for perfection paradoxically generates the opposite: compliance clutter, inauthentic relationships between documented procedures and work-as-actually-done, and even new kinds of accidents. His phrase captures it precisely: we "meet the target but miss the point." 8
I worry that AI governance is developing its own version of this dynamic, building elaborate documentation systems that satisfy auditors while the actual behaviour of AI systems remains poorly understood and inadequately controlled.

The four layers of design that matter

For AI, safety and security come from four kinds of design working together.

1. Regulatory and legal design

The first layer is the deliberate design of the laws and regulations that shape what organisations must do. Regulations encode assumptions, and when those assumptions don’t match reality, compliance becomes disconnected from safety.
Well-designed regulation would ask the same questions we ask of governance mechanisms: What behaviour does this actually incentivise? What information flows does it require? What feedback loops exist to update requirements as understanding evolves? How will organisations game this, and does that gaming undermine safety?
Poorly-designed regulation creates perverse outcomes. Static conformity assessments for dynamic systems. Accountability categories that don’t match causal chains. Documentation requirements that expand without improving safety. Definitions that force neat boundaries onto messy realities.
Despite good intentions, laws like the EU AI Act can act against good design. For legal convenience, it treats an “AI system” as a bounded, identifiable thing that can be placed on the market, assessed for conformity, and assigned to a risk category. The Act’s own explanatory materials acknowledge this is a simplification. In practice, what we call an “AI system” is often a fluid assembly of components from multiple vendors, updated continuously, behaving differently across contexts, and entangled with human workflows in ways that make neat boundaries impossible to draw.
This isn't a minor technicality. When regulation assumes a static, bounded system but the actual technology is dynamic and distributed, compliance becomes disconnected from safety. Organisations can satisfy the legal definition while the real risks emerge from exactly the aspects the definition ignores: supply chain dependencies, continuous model updates, emergent behaviours from component interactions, and the gap between tested conditions and deployment reality. Even the lead author of the EU AI Act, Gabriele Mazzini, stepped away critical of how the Act had become overly complex, rigid and lacking in common sense: an instrument of bureaucracy with limited ability to adapt to changing technology. 9
A design-first argument has to extend to regulators and standards bodies. If we’re building governance mechanisms to satisfy regulatory frameworks that encode false assumptions about how AI works, we’re compounding the problem.

2. Governance mechanism design

Governance mechanism design is about how you sense, decide, and intervene. Many organisations now have some form of AI steering group or risk committee. But that's not design; that's scaffolding.
A governance mechanism only becomes real when you can answer concrete questions: what signals does it see; when does it act (real time, design reviews, quarterly); what triggers escalation; and can it actually say no, pause a rollout, or demand design or operational threshold changes?
Safety science again points at the failure mode: if your governance system cannot reliably enforce safety constraints, you don’t have a safety control system. You only have documentation about one.
This is also the core argument I develop in my work on adaptive governance mechanisms: that adaptive governance is best implemented as collections of discrete closed-loop mechanisms with concrete inputs, tooling, ownership, adoption pathways, inspection, and continuous improvement. A committee, a policy and an audited checklist is not governance, even if that is sufficient to satisfy the regulation. 10 Course 3 of the AI Governance Foundation Program goes into this in depth, covering how to design mechanisms that adapt because they are embedded in the real lifecycle of AI systems. 11
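To make the closed-loop idea concrete, here's a minimal sketch in Python. It's illustrative only: the signal, threshold, owner and the pause_and_escalate hook are hypothetical stand-ins for whatever telemetry and change-management tooling an organisation actually runs.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class GovernanceMechanism:
    """One closed-loop mechanism: a signal it watches, a threshold that
    defines 'drifting toward harm', an accountable owner, and an
    intervention it is actually empowered to take."""
    name: str
    owner: str                          # accountable role, not a committee
    read_signal: Callable[[], float]    # e.g. appeal rate, error rate, drift score
    threshold: float
    intervene: Callable[[str], None]    # e.g. pause rollout, open incident, force review

    def run_once(self) -> bool:
        """One pass of the loop: sense, decide, act. Returns True if it intervened."""
        value = self.read_signal()
        if value > self.threshold:
            self.intervene(
                f"{self.name}: signal {value:.3f} exceeded threshold "
                f"{self.threshold:.3f}; owner={self.owner}"
            )
            return True
        return False

# Hypothetical wiring: a loan-decisioning model monitored for appeal-rate drift.
def appeal_rate() -> float:
    return 0.08    # in practice this would come from production telemetry

def pause_and_escalate(reason: str) -> None:
    print("PAUSE ROLLOUT:", reason)    # in practice: deployment tooling + paging

mechanism = GovernanceMechanism(
    name="appeal-rate drift",
    owner="Head of Credit Decisioning",
    read_signal=appeal_rate,
    threshold=0.05,
    intervene=pause_and_escalate,
)
mechanism.run_once()   # scheduled continuously, not quarterly
```

The shape is the point: a named owner, a concrete signal, a defined trigger, and an intervention the mechanism is genuinely empowered to take.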

3. Product and system design

Product and system design is about how the technical system is architected. I’ve never seen an AI system in production that is a tidy, bounded object. In reality, most deployed AI is part of a socio-technical system: data sources, models, orchestration logic, dependent services, tools, guardrails, observability components, human workflows, and downstream processes.
Safety in that kind of system depends on specific design choices: boundaries around what the system is for and not for; architectural separation so errors in one component can't cascade everywhere; failure behaviour that alerts rather than fails silently; and observability that lets you see and respond to what's happening across the full lifecycle.
The Fukushima equivalent of "where are your backup generators?" in AI might be: where does the final call on a consequential decision actually get made? Which part of this system has the power to commit an irreversible action? What prevents a silent model failure from flowing into that action?
Those are design questions, and I don’t think you can answer them by pointing at a paragraph in a regulation or a control in a library.
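As a sketch of what answering those questions in design, rather than in documentation, can look like, the snippet below routes every irreversible action through a single choke point that fails closed. The action names, confidence threshold and review queue are hypothetical, not drawn from any particular system.

```python
from dataclasses import dataclass

@dataclass
class ModelDecision:
    action: str          # e.g. "deny_claim"
    confidence: float    # model-reported confidence, 0..1
    model_healthy: bool  # result of upstream health / observability checks

# Actions the system is never allowed to commit on its own authority.
IRREVERSIBLE_ACTIONS = {"deny_claim", "close_account"}

def route_to_human_review(decision: ModelDecision) -> str:
    # In a real system this would enqueue the case with full context attached.
    return f"queued for human review: {decision.action}"

def execute(decision: ModelDecision) -> str:
    return f"executed: {decision.action}"

def commit(decision: ModelDecision) -> str:
    """The single choke point where consequential actions are committed.
    It fails closed: degraded health or low confidence routes to a human."""
    if decision.action in IRREVERSIBLE_ACTIONS:
        if not decision.model_healthy or decision.confidence < 0.9:
            return route_to_human_review(decision)
    return execute(decision)

print(commit(ModelDecision("deny_claim", confidence=0.72, model_healthy=True)))
# -> queued for human review: deny_claim
```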

4. Interaction design

Interaction design is about how humans experience, understand, and control the system. In safety-critical industries, there is a long-established understanding of how user interaction design becomes a control surface.
Therac-25 is the classic horror story here: a radiotherapy machine in which software and interaction failures contributed to massive overdoses, and where the system’s feedback to operators was a key part of why dangerous states were not correctly recognised and stopped. 12
In AI, we recreate softer versions of this pattern regularly: UIs that present model output as authoritative "answers" rather than suggestions; risky actions hidden behind a frictionless click; no indication of uncertainty, data gaps, or model limits; and feedback mechanisms so buried that users don't bother.
We know this failure mode well. "Automation bias", the tendency to over-rely on automated decision support, has a substantial research literature. Microsoft's literature review on overreliance also frames why this phenomenon makes meaningful oversight harder in practice: the user is treated as the last line of defence while becoming less vigilant. 13 This is the problem-space I explore directly in Cognitive Calibration, where I look at why explainability can backfire and how humans systematically miscalibrate their understanding of complex systems. 14
If you claim "meaningful human oversight" but the human can't see what matters, doesn't understand what they're looking at, and can't easily intervene, that's not oversight. That's just theatre again. This is also why I treat "human in the loop" as a fragile design claim: without explicit, well-designed control surfaces and escalation pathways, oversight decays into symbolism under pressure. 15
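Here's a small, deliberately simplified sketch of treating the interaction as a control surface: output is rendered as a suggestion with its limits attached, and a risky action gets friction before it can proceed. The fields and wording are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class ModelOutput:
    answer: str
    confidence: float    # 0..1, however the system estimates it
    data_cutoff: str     # known limits worth surfacing to the user
    risky: bool          # would acting on this commit a consequential action?

def render(output: ModelOutput) -> str:
    """Present output as a suggestion with visible limits, not an oracle."""
    lines = [
        f"Suggested answer: {output.answer}",
        f"Model confidence: {output.confidence:.0%} (treat as an estimate)",
        f"Known limits: trained on data up to {output.data_cutoff}",
    ]
    if output.risky:
        # Deliberate friction: the user must confirm, and the default is 'no'.
        lines.append("This would trigger an irreversible action. Type CONFIRM to proceed.")
    return "\n".join(lines)

print(render(ModelOutput("Reject the application", 0.62, "2024-03", risky=True)))
```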

When governance design matters most

It can be helpful to take a lifecycle view of where good design has the greatest impact.
During development, governance design means assigning cross-functional roles from the start, not bolting some form of ethics review onto the end. It means translating principles like fairness and transparency into requirements engineers can build against. It means treating safety, security and robustness tests like unit tests: automated, continuous, and required to pass before code merges.
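As an illustration of treating safety tests like unit tests, here's a minimal pytest-style sketch that would block a merge if a candidate model regressed on a red-team refusal suite or a fairness threshold. The helper functions and thresholds are hypothetical placeholders for a team's real evaluation harness.

```python
# test_safety_gates.py -- runs in CI on every merge, alongside ordinary unit tests.
# evaluate_refusals() and demographic_parity_gap() are hypothetical stand-ins
# for whatever evaluation harness the team actually runs.

REFUSAL_THRESHOLD = 0.99    # share of red-team prompts the model must refuse
PARITY_GAP_LIMIT = 0.02     # maximum tolerated approval-rate gap between groups

def evaluate_refusals(model_id: str) -> float:
    # Placeholder: would run the red-team prompt suite against model_id.
    return 0.995

def demographic_parity_gap(model_id: str) -> float:
    # Placeholder: would compute approval-rate gaps on a held-out audit set.
    return 0.01

def test_harmful_prompts_are_refused():
    assert evaluate_refusals("candidate-model") >= REFUSAL_THRESHOLD

def test_fairness_gap_within_limit():
    assert demographic_parity_gap("candidate-model") <= PARITY_GAP_LIMIT
```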
Pre-deployment is where formal review happens. But the review should be designed to challenge, not to rubber-stamp. A well-designed governance mechanism should specify who reviews what evidence against what criteria, with what power to demand changes. Documentation is a byproduct of the mechanism, not the mechanism itself.
Post-deployment is where adaptive governance earns its name. Monitoring, escalation triggers, incident response protocols, and retraining pipelines all need to be designed before launch. When the environment changes, like new laws, new standards, or new scientific understanding, the governance mechanism needs a way to push those changes forward to deployed systems.
This is exactly what Fukushima lacked: a governance mechanism capable of forcing redesign as hazard understanding evolved and as vulnerabilities became much clearer. This is also why, in our work on adaptive human oversight, we treat “oversight” as an adaptive controller problem where you need rapid feedback loops and escalation paths, with explicit strategies for proactive adaptation (scenario testing and simulation) and reactive adaptation (monitoring triggers and response mechanisms).
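A minimal sketch of the reactive side, assuming hypothetical metrics and thresholds: each trigger pairs a monitored signal with the response it is wired to, so a breach produces an action owed to someone, not just a line on a dashboard.

```python
from dataclasses import dataclass

@dataclass
class Trigger:
    """A reactive-adaptation trigger: a monitored metric, a limit, and the
    response it is wired to."""
    metric: str
    limit: float
    response: str    # e.g. "open incident", "roll back to previous model"

TRIGGERS = [
    Trigger("input_drift_score", 0.30, "open incident and notify model owner"),
    Trigger("override_rate",     0.20, "pause automated decisions, force review"),
    # 1.0 is logged when a relevant legal or standards change is recorded.
    Trigger("regulatory_change", 1.00, "re-run conformity assessment for affected systems"),
]

def check(observations: dict[str, float]) -> list[str]:
    """Compare live observations to triggers and return the responses owed."""
    return [
        f"{t.metric} breached {t.limit}: {t.response}"
        for t in TRIGGERS
        if observations.get(t.metric, 0.0) >= t.limit
    ]

# Hypothetical telemetry snapshot.
print(check({"input_drift_score": 0.41, "override_rate": 0.08}))
```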
If you've been reading this blog, you know I have a habit of opening with disasters. Fukushima, Therac-25, Tenerife, Piper Alpha. It's not morbid fascination. I find each of these tragedies motivating; they hold lessons about what happens when governance structures and mechanisms prove inadequate. They're mirrors and instructive warning signs, not just history.
Fukushima shows what happens when we build elaborate governance structures around a system whose underlying design assumptions are wrong, and when the institutional system can’t reliably force those assumptions to change as new evidence arrives.
AI is at a much earlier stage, but the shape of the mistake is familiar. Lots of new rules. Lots of new artefacts. Not enough serious design of how systems behave under stress, how humans can actually intervene, and how governance acts on what it sees.
We can keep comforting ourselves that with the next law, the next standard, the next framework, safety will finally arrive. Or we can admit that we have a design problem, one that exists at multiple levels. We have the scaffolding of safe, secure and lawful AI, but not enough of the practical discipline to control behaviour in the complex adaptive systems of modern AI.
This is my 50th article of the year, and my last for 2025. As I head into next year, I plan to go deeper on this question of how we design adaptive governance. Not just what it should achieve but how to actually build it layer by layer and piece by piece. The mechanisms, the architectures, the feedback loops that make governance real and high integrity.
If you've read this far and you're ready to move beyond theory, certifications and rulebooks to learn how to genuinely design and implement AI governance in practice, I encourage you to join us in the AI Governance Foundation Program, where we teach, learn and practice exactly that. 16
Thanks for reading, and see you in 2026.
1 https://www.nirs.org/wp-content/uploads/fukushima/naiic_report.pdf
2 https://world-nuclear.org/information-library/safety-and-security/safety-of-plants/fukushima-daiichi-accident
3 https://www.tepco.co.jp/en/press/corp-com/release/betu11_e/images/111202e16.pdf
4 https://shemesh.larc.nasa.gov/iria03/p13-leveson.pdf
5 https://aicareer.pro/blog/systems-safety-engineering-in-ai
6 https://academic.oup.com/book/26482
7 https://www.schneier.com/blog/archives/2009/11/beyond_security.html
8 https://www.routledge.com/Safety-Theater-How-the-Desire-for-Perfection-Drives-Compliance-Clutter-Inauthenticity-and-Accidents/Dekker/p/book/9781032012476
9 https://www.ft.com/content/6585fb32-8a86-4ffb-a940-06b17e06345a
10 https://aicareer.pro/blog/mechanisms-for-ai-governance
11 https://governance.aicareer.pro/course/pro-3
12 https://escholarship.org/content/qt5dr206s3/qt5dr206s3.pdf
13 https://www.microsoft.com/en-us/research/publication/overreliance-on-ai-literature-review/
14 https://aicareer.pro/blog/cognitive-calibration
15 https://blog.aicareer.pro/p/meaningful-human-oversight-of-ai
16 https://governance.aicareer.pro/program/track-1-foundations