Tag: Public Sector

  • The Sovereign AI Imperative

    The Sovereign AI Imperative

    Why Nations Cannot Afford to Leave Artificial Intelligence in Private Hands


    Executive Summary

    Artificial intelligence has crossed a threshold. It is no longer a commercial technology competing for market share. It is critical national infrastructure, embedded in healthcare systems, financial services, defense procurement, and government operations across allied nations. And yet the governance frameworks applied to it remain those of an early-stage commercial technology: voluntary, fragmented, and entirely dependent on the continued goodwill of a small number of private individuals.

    This industry brief argues that governments and multi-state entities must establish sovereign AI frameworks with urgency. The case is not ideological. It rests on three converging realities.

    The first is geopolitical sovereignty. Allied governments have allowed themselves to become operationally dependent on AI systems controlled by companies subject to the policy preferences of a foreign executive branch and the governance decisions of individual chief executives. Events in February and March 2026, in which the United States government banned a leading AI provider from all federal use within hours of a contractual dispute, and a rival provider immediately moved to fill that gap on different terms, demonstrated with clarity that governments depending on these systems had no seat at any table where those decisions were made.

    The second is systemic risk. The conditions that characterized systemically important financial institutions in 2008 are now present in the AI sector: critical function concentrated in a small number of private entities, asymmetric risk distribution, and governance frameworks built for normal commercial conditions rather than systemic stress. The financial crisis established that the absence of sovereign continuity frameworks does not prevent catastrophic outcomes; it simply ensures that governments bear the cost of them without having shaped the conditions that produced them.

    The third is the demonstrated pattern at the software infrastructure layer. The GSDRF’s primary research into open-source software sustainability has documented, with empirical rigor, the consequences of critical software infrastructure being left without sovereign continuity frameworks. The AI governance argument is that same argument applied one layer above, at higher stakes.

    The central policy recommendation of this paper is not nationalization of the AI industry. It is the establishment of continuity obligations, accountability frameworks, and independent oversight structures commensurate with the systemic importance AI infrastructure has already acquired. The ownership question is a longer-term policy matter that legislators must begin examining. The continuity question is immediate.

    For industry, the message is direct: sovereign AI frameworks are not a distant possibility. The EU AI Act is already in force. Regulatory postures are hardening across the UK, Australia, and New Zealand. The window for constructive engagement in shaping these frameworks is open now. It will not remain so indefinitely.

    Section 1: The Geopolitical Sovereignty Case

    1.1 The Dependency Problem

    Allied governments, public institutions, and critical infrastructure operators have, over the past several years, embedded AI systems from a small number of US-based private companies into their operations at a pace that has significantly outrun the development of any corresponding governance framework. That dependency is now structural.

    Those companies are subject to the governance decisions of individual chief executives. They are incorporated under US law, subject to the policy preferences and executive authority of the US government, and under no obligation, legal or otherwise, to consult with, notify, or consider the interests of the governments and institutions in other jurisdictions that have come to depend on their systems.

    This is not a theoretical future risk. It is the present reality, and it became undeniable in the last days of February 2026.

    1.2 The Anthropic-Pentagon Dispute: A Case Study in Sovereign Exposure

    The dispute between Anthropic and the United States Department of Defense that culminated in late February 2026 is the most instructive case study in sovereign AI risk that has yet occurred. It warrants examination in detail, because its implications extend far beyond its immediate parties.

    Anthropic, the developer of the Claude AI system, held a contract with the US Defense Department worth up to $200 million, under which Claude had become the first AI system deployed in the military’s classified network. The company had placed two restrictions on the use of its model: it could not be used for mass surveillance of American citizens, and it could not be used to power fully autonomous weapons systems. Anthropic’s position was that these restrictions reflected both the current technical limitations of AI and its own ethical commitments regarding the safe deployment of frontier models.

    On 24 February 2026, Defense Secretary Pete Hegseth issued Anthropic CEO Dario Amodei a deadline: remove all restrictions by 5:01pm on 27 February, or face the cancellation of the contract and designation as a supply chain risk to national security. Anthropic refused. On 27 February, President Trump ordered every US government agency to immediately cease using Anthropic’s technology. Hegseth designated the company a supply chain risk, a classification normally reserved for companies associated with foreign adversaries. The designation had the practical effect of threatening the viability of Anthropic’s entire enterprise customer base, many of whose clients hold or seek government contracts.

    The Pentagon’s threats were, as Amodei noted, inherently contradictory: one labelled Anthropic a security risk, the other labelled Claude as essential to national security. The contradiction is itself instructive. A system deemed essential to national security was subject to being withdrawn from service within hours by a single executive decision, with no notice to and no consultation with any allied government or institution that had itself come to depend on it.

    Anthropic subsequently sued the Department of Defense on 9 March 2026, arguing that the supply chain risk designation was unprecedented, unlawful, and had caused the company irreparable harm. The litigation is ongoing. Talks between the company and the Pentagon have, at the time of writing, partially resumed.

    It is important to be clear about what this case illustrates for governments outside the United States. The dispute was between an AI company and the US government, argued on American legal and constitutional grounds, resolved by American political and commercial pressures. No government in Europe, no institution in Australia or New Zealand, no NATO ally that had integrated Claude into any part of its operations, had any standing in those negotiations, any advance notice of the outcome, or any remediation framework prepared for the scenario in which the system they depended on was abruptly withdrawn from service.

    That is the exposure. Not that Anthropic acted wrongly. On the assessment of this paper, Anthropic’s ethical position was defensible and its refusal to permit use for mass surveillance and autonomous weapons was reasonable. The point is that the entire determination of whether allied governments retained access to a piece of critical infrastructure was made without them, by people with no accountability to them, under pressure that had nothing to do with their interests.

    1.3 The OpenAI Parallel: The Whims of CEOs Cut Both Ways

    Within hours of the Trump administration’s ban on Anthropic, OpenAI announced that it had reached a deal with the Pentagon to provide its technology for classified networks. The terms under which OpenAI agreed to operate, as reported at the time, included exclusions preventing use for domestic surveillance or fully autonomous weapons without human approval. In substance, these were not materially different from the position Anthropic had taken.

    OpenAI CEO Sam Altman stated publicly that he believed companies should work with the military provided they adhere to legal protections and shared red lines. He framed OpenAI’s position as consistent with Anthropic’s own principles.

    The significance of this sequence for the purposes of this paper is not that OpenAI acted opportunistically or that Altman’s position was insincere. It is that, within a single working day, the AI system embedded in allied defense and intelligence infrastructure had changed, the terms under which it was provided had changed, and the company providing it had changed. None of those changes involved any consultation with, approval from, or even notification of the governments and institutions outside the United States that had operational dependencies on these systems.

    The lesson is not that AI CEOs are unreliable. Some will hold principled positions under extraordinary pressure, as Amodei did. Others will move more fluidly in response to commercial and political incentives, as the market dynamics of the situation illustrate. The lesson is that the critical infrastructure of sovereign states should not be contingent on the character of any individual chief executive. Character is not a governance framework.

    1.4 The Broader Geopolitical Frame

    The AI sector has not yet had its Lehman Brothers moment.

    The Anthropic-Pentagon dispute is the sharpest available illustration of the problem, but it sits within a much broader geopolitical reality. AI infrastructure concentration is a sovereignty risk for every nation that does not have its own frontier AI capability, and currently that means effectively every nation outside the United States and, to a limited and contested degree, China.

    The European Union, the United Kingdom, Australia, and New Zealand are among the jurisdictions most acutely exposed. Each has allowed significant operational dependency on US-based AI systems to develop across government and critical infrastructure. Each is now, in various ways, confronting the implications. The EU’s AI Act represents the most developed legislative response. Regulatory challenges and litigation in Australia and New Zealand signal that those governments are recognizing the power imbalance even if comprehensive remediation frameworks remain nascent.

    The direction of travel is clear. What is less clear is whether governments will move proactively to establish sovereign frameworks before the next systemic disruption, or reactively in response to one. The financial crisis precedent, examined in the following section, suggests that reactive responses are both more expensive and less effective than proactive frameworks. The AI sector has not yet had its Lehman Brothers moment. That is not an argument for complacency. It is an argument for acting while the window remains open.

    Section 2: The Systemic Risk Case

    2.1 Too Big to Fail: The Financial Crisis as a Governance Template

    The phrase ‘too big to fail’ entered the policy lexicon in the context of the 2008 global financial crisis. Its meaning, stripped of the political controversy that surrounded it, was precise: certain institutions had become so embedded in the functioning of the broader financial system that their disorderly failure would produce consequences disproportionate to their own commercial significance. The systemic damage would be borne not by the institutions themselves, or their shareholders, but by governments and the populations they serve.

    The conditions that produced that outcome are now recognizable in the AI sector. A small number of private entities control systems of critical importance to multiple governments and economic sectors. The benefits of those systems accrue privately. The catastrophic downside risk, the scenario in which access is abruptly lost, terms are unilaterally changed, or a provider fails commercially, is borne collectively by the governments and institutions that have allowed the dependency to develop.

    The financial crisis also established a second lesson that is directly applicable here. The absence of sovereign continuity frameworks does not prevent catastrophic outcomes. It simply ensures that when those outcomes occur, governments must respond under maximum pressure, with minimum preparation, at maximum cost. The ad hoc interventions of 2008, the emergency bailouts, the improvised resolution mechanisms, the political contortions required to justify spending public money to rescue private institutions, were the direct consequence of governance frameworks that had not kept pace with the systemic importance of the institutions they were supposed to oversee.

    The Lehman Brothers parallel is worth stating directly. In a scenario where a major AI provider faces abrupt commercial failure, is sanctioned by a foreign government, or is subjected to the kind of designation that the Trump administration applied to Anthropic, the governments depending on that provider’s systems face a choice between a managed transition, if continuity frameworks exist, and an emergency scramble, if they do not. The cost of the former is a fraction of the cost of the latter.

    2.2 The International Monetary Architecture as Governance Model

    The International Monetary Fund, the European Central Bank, and the US Federal Reserve did not come into existence because governments wanted to control their economies. They came into existence because governments recognized that systemically important economic functions required governance structures that were technically credentialed, insulated from short-term political pressure, and empowered to act in the long-term systemic interest rather than the immediate commercial or electoral one.

    The central bank independence model is the most relevant template for sovereign AI governance, and for a reason that goes beyond structural analogy. Central bank independence was designed specifically to address the problem of technically complex, systemically critical functions being governed by institutions subject to the short-term incentives of elected officials or private market participants. Neither elected officials nor private market participants are well-positioned to make governance decisions about systemically critical AI infrastructure. The same logic that produced central bank independence produces the case for an Independent AI Oversight Council.

    Such a body would not build AI systems. It would not direct their development. It would do what central banks do: establish and enforce the systemic stability conditions under which AI infrastructure can operate safely. In practical terms, this means continuity obligations, mandatory transition planning, transparency and reporting requirements, and the authority to intervene when those obligations are not met.

    The institutional form matters. A body that reports to elected governments is subject to the same political pressures that make elected governments poor governors of technically complex systemic risk. The ECB model, with a defined and limited mandate, technical independence, and democratic accountability through transparency and reporting rather than direct political control, is the appropriate template.

    2.3 The Spectrum of Sovereign Intervention

    There is an important distinction to draw, both analytically and politically, between the different forms sovereign intervention in the AI sector might take. Conflating them weakens the argument and obscures the range of achievable policy options.

    At one end of the spectrum is full state ownership: a government acquires an AI company outright, as it might acquire a failing bank or a strategic energy asset. This is not a fantasy position in all jurisdictions. France’s tradition of strategic state ownership, Germany’s willingness to take equity stakes in critical industries, and the EU’s capacity to mandate structural remedies under competition law all create contexts in which ownership is a credible policy instrument. For most Western democracies, however, full state ownership of commercial AI companies is politically and practically difficult as a default position, and this paper does not advocate for it as such.

    At the other end of the spectrum is mandatory continuity infrastructure: AI companies operating in critical sectors are required to maintain model escrow arrangements, transition plans, and structured access agreements that allow governments to manage a change of provider without operational disruption. This is analogous to the bank resolution planning requirements introduced after 2008, under which systemically important financial institutions are required to maintain ‘living wills’ detailing how they could be wound down in an orderly fashion. It is achievable, bounded, and politically defensible.

    Between these poles lies a range of intermediate options: mandatory licensing regimes, state-backed continuity funds, equity participation requirements, and structured access agreements with allied governments. The appropriate instrument will vary by jurisdiction, sector, and the specific nature of the AI dependency in question.

    The central argument of this paper is for the continuity infrastructure end of this spectrum as the immediate and achievable policy ask, with state ownership acknowledged as the backstop option that legislators in appropriate jurisdictions should begin examining. Continuity infrastructure is not a compromise position. It is the foundational governance requirement without which all other policy options are moot.

    Section 3: The Open-Source Software Continuity Bridge

    3.1 The Pattern Established at the Infrastructure Layer

    The argument for sovereign AI governance does not exist in isolation. It is the continuation of a pattern that the GSDRF’s primary research has documented with empirical rigour at the software infrastructure layer on which AI systems are built.

    The GSDRF’s analysis of GHArchive data covering the period 2015 to 2025, drawn from original BigQuery primary research, has established several findings directly relevant to this argument. Open-source software project health showed a COVID-period bounce followed by an incomplete recovery. From 2023 onward, a divergence between event activity and contributor activity emerged, consistent with AI-generated noise inflating apparent activity while genuine human contributor engagement declined. From 2024 into 2025, the decline in established OSS projects accelerated sharply.

    Crucially, the research identified a clear bifurcation in outcomes. Projects with profit-motivated institutional investment behind them, such as VS Code, Hugging Face Transformers, and NixOS, sustained their contributor ecosystems. Projects with extractively-backed or commercially indifferent support structures collapsed. The determining variable was not the technical quality of the project or its importance to the broader ecosystem. It was whether a sustained institutional commitment to its continuity existed.

    3.2 The Sovereign OSS Fund as Proof of Concept

    The GSDRF has separately developed the case for a Sovereign OSS Fund, structured around mandatory contributions scaled to deployment intensity and governed by an Independent Review Council modelled on central bank independence. The argument rests on precisely the same logic that this paper applies to the AI layer: critical infrastructure that is systemically important but commercially fragile requires sovereign continuity frameworks to protect against the consequences of its failure.

    The Sovereign OSS Fund proposal is therefore not merely analogous to the sovereign AI governance argument. It is its direct predecessor. If the case for sovereign intervention holds at the OSS infrastructure layer, the same argument applied to the AI systems built on top of that layer is not a radical extension of principle. It is the consistent application of a framework that has already been established.

    For policymakers and legislators examining sovereign AI governance proposals, the OSS fund model provides a working template. It demonstrates that the governance instruments required are not novel. They exist in adjacent policy domains and have been developed with the specific characteristics of software infrastructure in mind.

    3.3 The Compounding Risk

    The interaction between OSS decline and AI dependency is not additive. It is multiplicative. AI systems are, in significant part, built on OSS foundations. The models, the training infrastructure, the deployment tooling, the security frameworks, all have substantial open-source components. As OSS project health at the foundation layer degrades, the AI systems built on it become less reliable, less secure, and more dependent on the continued commitment of the small number of well-capitalised institutions that are sustaining those foundational projects.

    A sovereign AI governance framework that does not address the OSS layer is therefore incomplete. And an OSS continuity framework that does not extend its logic to the AI layer built on top of it is addressing the foundation while ignoring the structure. A coherent sovereign technology policy must address both layers as a connected system, or it addresses neither adequately.

    Section 4: A Policy Framework for Sovereign AI

    4.1 Core Principles

    Any credible sovereign AI governance framework must rest on four core principles if it is to be both effective and politically sustainable.

    The first is proportionality. Governance obligations should be scaled to systemic importance. AI systems embedded in critical national infrastructure carry different obligations than AI systems used in commercial applications. The framework should be tiered accordingly.

    The second is technical independence. The governance body must be insulated from both short-term political pressure and commercial influence. This is the central lesson of the central bank independence model. Credibility depends on it.

    The third is continuity primacy. The foundational obligation is continuity of access: the assurance that a government or institution depending on an AI system can manage a change of provider, or a disruption to service, without operational catastrophe. All other governance obligations flow from this.

    The fourth is international co-ordination. AI infrastructure is global. Fragmented national frameworks create regulatory arbitrage and reduce effectiveness. The EU’s demonstrated capacity to export regulatory standards through market access leverage provides the most credible vehicle for international co-ordination, and should be the anchor of any multilateral sovereign AI governance effort.

    4.2 The Tiered Intervention Model

    Sovereign AI governance should operate across a defined spectrum of intervention levels, with the appropriate level determined by the systemic importance of the AI deployment in question.

    Tier 1: Mandatory Continuity Infrastructure. All AI systems deployed in critical national infrastructure must maintain model escrow arrangements, transition plans, and structured access agreements. This is the minimum viable sovereign governance requirement and the immediate policy ask of this paper. It is analogous to bank resolution planning and is achievable through existing legislative mechanisms in most jurisdictions.

    Tier 2: Independent Oversight Accountability. AI systems of defined systemic importance must report to an Independent AI Oversight Council on continuity, security, and transparency obligations. The Council’s mandate is defined and limited; it does not direct development but enforces the systemic stability conditions under which AI infrastructure operates.

    Tier 3: Structured Access and Licensing. For AI systems of the highest systemic importance, mandatory licensing regimes and structured access agreements with participating governments provide additional resilience. State-backed continuity funds, modelled on the Sovereign OSS Fund architecture, provide a financial backstop.

    Tier 4: State Participation and Ownership. In scenarios where a provider of critical AI infrastructure faces commercial failure or is otherwise unable to meet its continuity obligations, state ownership or equity participation is the backstop of last resort. Legislators in appropriate jurisdictions should develop the legal and financial frameworks for this option before it is needed, not in response to a crisis.

    4.3 The Jurisdictional Opportunity: EU Leadership and Multilateral Co-ordination

    The European Union is the most credible anchor for a multilateral sovereign AI governance framework, for reasons that are structural rather than aspirational. The EU has demonstrated, through GDPR and now the AI Act, that it can export regulatory standards globally by virtue of market access leverage. Companies operating in EU markets must comply with EU frameworks regardless of where they are incorporated. That leverage is real, has been tested, and has proven effective.

    The EU AI Act, already in force with phased implementation obligations, establishes the foundational regulatory architecture on which a sovereign AI governance layer can be built. It addresses risk classification and prohibitions. What it does not yet fully address is continuity infrastructure and systemic risk governance of the kind this paper advocates. The extension of the AI Act framework to incorporate the continuity obligations described above is both legally achievable and politically consistent with the EU’s established regulatory posture.

    The United Kingdom, post-Brexit, has both the incentive and the regulatory capability to develop a complementary framework rather than a derivative one. A UK-EU co-ordination mechanism on AI governance continuity standards would extend the reach of both frameworks and reduce the regulatory arbitrage risk that fragmentation creates.

    Australia and New Zealand’s active and increasingly assertive regulatory posture towards large technology platforms, as evidenced by Australia’s News Media Bargaining Code and subsequent legislative interventions, signals political readiness for the kind of sovereign AI governance framework this paper describes. Their participation in a multilateral co-ordination mechanism anchored by the EU and UK would significantly extend its geographic reach and credibility.

    4.4 Timeline: This Is a Two to Three Year Question

    The framing of sovereign AI governance as a future policy concern is no longer accurate. It is a present policy urgency.

    The EU AI Act is in force. High-risk AI system obligations are phasing in. The regulatory architecture exists. The question is whether the continuity infrastructure layer is added to it through deliberate legislative action or whether it is constructed reactively in response to the first major AI continuity failure.

    The UK’s post-Brexit regulatory agenda includes AI governance as a priority. The institutional capability exists. The political will is developing. The window for proactive framework design is open now.

    Australia and New Zealand have demonstrated that they are willing to move decisively when they perceive a structural power imbalance between large technology platforms and the public interest. The Anthropic-Pentagon dispute will not have gone unnoticed in those jurisdictions.

    Industry and public sector decision makers who are treating sovereign AI governance as a five to ten year horizon question are misreading the pace of regulatory development. The more realistic planning horizon is two to three years for initial legislative frameworks in leading jurisdictions, with compliance obligations following within the standard implementation period thereafter.

    Section 5: A Briefing for Industry

    5.1 The Honest Assessment

    This section speaks directly to industry leaders in the AI sector and in the enterprise organisations that depend on AI infrastructure. The message is not adversarial. It is practical.

    Sovereign AI frameworks are coming. The political conditions that produced the EU AI Act, the regulatory assertiveness of Australia and the UK, and the object lesson of the Anthropic-Pentagon dispute have created a political environment in which legislative action on AI governance continuity is not a possibility but a matter of timing. The question for industry is not whether these frameworks will exist, but whether the industry will have shaped them or been subjected to the version written without its input.

    The history of technology regulation does not favour the second option. GDPR was written largely without meaningful industry co-design. The compliance costs, the legal uncertainty, and the regulatory friction that followed were, in significant part, the consequences of that absence. The AI Act, developed over several years with extensive consultation, is materially better calibrated. The lesson is available to be learned.

    5.2 Why Well-Designed Sovereign Frameworks Serve Industry Interests

    The reflexive industry objection to sovereign AI governance frameworks is that they chill innovation, create bureaucratic overhead, and reduce competitive agility. This objection has surface plausibility and limited analytical depth.

    Regulatory clarity reduces uncertainty. The single greatest cost of the current unregulated environment is not compliance overhead. It is the uncertainty premium that enterprise customers, institutional investors, and government procurement officials apply when they cannot assess the governance risk of the AI systems they are considering deploying. A credible sovereign governance framework, by establishing clear and consistent obligations, reduces that uncertainty premium and expands the addressable market for compliant providers.

    Independent oversight reduces litigation risk. The accumulating litigation and regulatory challenge that AI companies face across the EU, Australia, New Zealand, and now in the United States itself is not a temporary feature of an early market. It is the predictable consequence of systemically important technology operating without adequate governance frameworks. A credible independent oversight body reduces the political and legal surface area for that litigation by establishing that governance obligations exist and are being enforced.

    Continuity obligations are commercially valuable. Enterprise customers and government procurement officials are increasingly requiring assurance that the AI systems they depend on will remain available, on consistent terms, for the planning horizons they operate across. A provider that can demonstrate credible continuity infrastructure is more attractive than one that cannot, all else being equal. Sovereign governance frameworks, by mandating continuity infrastructure, level the playing field and prevent the race-to-the-bottom dynamics that make continuity assurance commercially difficult to offer unilaterally.

    5.3 What Constructive Engagement Looks Like

    Constructive engagement with sovereign AI governance framework development is not a public relations exercise. It is a strategic imperative, and it has a specific content.

    It means participating actively in legislative consultation processes, with substantive technical input rather than lobbying against the principle of governance. It means proactively developing and publishing continuity infrastructure that demonstrates the industry can meet reasonable sovereign governance expectations without waiting for legislation to mandate it. It means engaging transparently with independent oversight bodies on reporting and accountability obligations rather than treating them as adversarial.

    It also means accepting that the era in which individual AI companies could unilaterally determine the terms on which governments and their institutions access critical AI infrastructure is ending. That is not a regulatory overreach. It is the natural and appropriate consequence of AI infrastructure crossing the threshold from commercial technology to critical national infrastructure. The companies that recognise and adapt to that reality will be better positioned in the regulatory environment that is coming than those that contest it.

    Conclusion

    The argument of this paper is not about whether artificial intelligence is beneficial. It is, in most of its applications, enormously so. It is not about whether the companies developing frontier AI are acting in bad faith. The Anthropic case is instructive here: the company’s refusal to permit its technology to be used for mass surveillance of citizens or to power fully autonomous weapons is not a radical or outlying position. It is entirely consistent with the prohibited use categories established under the EU AI Act, with the stated policies of most allied defense ministries, and with the ethical frameworks that the broader AI industry has itself publicly endorsed. What was unusual in the Anthropic-Pentagon dispute was not Anthropic’s position. It was the US executive branch’s demand that a private company abandon protections that are, by any international standards measure, established norms of responsible AI deployment. That context matters for how industry leaders read the dispute: not as evidence that AI companies are ungovernable, but as evidence that governance frameworks which depend on the alignment of a single national executive with international norms are insufficient protection for anyone.

    It is tempting to read the sovereign AI governance argument as primarily a concern for non-American governments and institutions. It is not. American companies operating in sectors with any government adjacency face the same continuity and dependency risks as their international counterparts, in some respects more acutely. The supply chain risk designation applied to Anthropic in February 2026 did not distinguish between European and American enterprise customers. It threatened the operational continuity of any organization, domestic or international, that had integrated Claude into systems touching government work. Those organizations had no advance notice, no seat at the table, and no prepared response. American enterprises are inside the same governance gap as everyone else. In certain scenarios, given their direct and unmediated exposure to domestic executive authority, they are deeper inside it.

    This paper is, at its core, about closing that gap. The governance frameworks required are not punitive towards industry, and they are not designed to be. The central bank independence model, the bank resolution planning requirements, the market access leverage mechanisms that underpin EU regulatory standards, the Sovereign OSS Fund architecture: these frameworks exist to create the conditions under which systemically important institutions can operate sustainably, with the confidence of the governments and populations they serve. AI infrastructure has crossed the threshold that makes equivalent frameworks necessary. That is not a judgement on the industry. It is a description of how far it has come.

    The events of February and March 2026 have made the cost of that absence visible in a way that is difficult to dismiss. The window for proactive sovereign AI governance frameworks is open. It will not remain so indefinitely, and the frameworks constructed in response to the next major AI continuity failure will be more expensive, less well-designed, and more politically contentious than those built in anticipation of it.

    Sovereign AI frameworks are coming, on a two to three year legislative horizon in leading jurisdictions. The companies and institutions that engage constructively now, helping to shape continuity infrastructure that is workable, proportionate, and technically credible, will be better positioned than those that wait. More than that: they will have contributed to a governance environment that is better for the industry as a whole, because it is one that governments and the public can trust. That trust is the condition on which the continued expansion of AI into critical systems depends. It cannot be assumed. It has to be built. Sovereign governance frameworks are how it gets built.


    References

    The Anthropic–Pentagon Dispute
    • Axios : Anthropic says Pentagon’s ‘final offer’ is unacceptable (26 February 2026) 🔗
    • CNN Business : Anthropic rejects latest Pentagon offer (26 February 2026) 🔗
    • CNN Business : Trump administration orders military contractors and federal agencies to cease business with Anthropic (27 February 2026) 🔗
    • CNBC : Trump admin blacklists Anthropic as AI firm refuses Pentagon demands (27 February 2026) 🔗
    • NPR : OpenAI announces Pentagon deal after Trump bans Anthropic (27–28 February 2026) 🔗
    • CBS News : Anthropic CEO: We’re trying to ‘deescalate’ Pentagon AI standoff (4 March 2026) 🔗
    • CNN Business : Anthropic sues the Trump administration after it was designated a supply chain risk (9 March 2026) 🔗
    • Fortune : Anthropic sues the Pentagon after being labeled a threat to national security (9 March 2026) 🔗
    • The Washington Post : Anthropic sues Pentagon over being labeled a national security risk (9 March 2026) 🔗
    • Tech Policy Press : A Timeline of the Anthropic–Pentagon Dispute (9 March 2026) 🔗
    EU AI Act and Regulatory Framework
    • European Parliament and Council of the European Union : Regulation (EU) 2024/1689 of the European Parliament and of the Council (Artificial Intelligence Act) (2024) 🔗
    • European Commission : Guidelines on Prohibited Artificial Intelligence Practices established by Regulation (EU) 2024/1689 (4 February 2025) 🔗
    • European Parliament : Guidelines for military and non-military use of Artificial Intelligence (2021) 🔗
    • Tech Policy Press : Europe’s AI Act Leaves a Gap for Military AI Entering Civilian Life (10 March 2026) 🔗
    • Future of Privacy Forum : Red Lines under the EU AI Act: Understanding Prohibited AI Practices and their Interplay with the GDPR (2025) 🔗
    Australia and Platform Regulation
    • Australian Competition and Consumer Commission : News Media Bargaining Code (2021) 🔗
    • Parliament of Australia : Treasury Laws Amendment (News Media and Digital Platforms Mandatory Bargaining Code) Act 2021 (2021) 🔗
    • Bossio, D. et al. : Australia’s News Media Bargaining Code and the global turn towards platform regulation (2022) 🔗
    GSDRF Primary Research

    Global Software Development Research Foundation (2025–2026). “OSS Sustainability and AI Impact: Primary Research on GHArchive Data 2015–2025.” Original BigQuery analysis. Internal research dataset, GSDRF.

    Global Software Development Research Foundation (2025–2026). “Future Proofing Software Development.” GSDRF Policy Paper.

    Financial Crisis and Systemic Risk Governance
    • Financial Stability Board : Reducing the Moral Hazard posed by Systemically Important Financial Institutions (October 2010) 🔗
    • Basel Committee on Banking Supervision : Global systemically important banks: assessment methodology and the additional loss absorbency requirement (2011) 🔗
    • Council on Foreign Relations : The Defense Production Act of 1950: History, Authorities, and Considerations for Congress (2020) 🔗
    GSDRF Primary Research
    • Stop Killer Robots Campaign : What are the AI Act and the Council of Europe Convention? (2023) 🔗
    • Paul Hastings LLP : European Commission & AI: Guidelines on Prohibited Practices (2025) 🔗


    Global Software Development Research Foundation  |  Policy Research  |  March 2026

  • The Accelerating Decline of the Open Source Ecosystem

    The Accelerating Decline of the Open Source Ecosystem

    How Generative AI is Compounding a Structural Crisis and Why Governments Must Act

    Executive Summary

    Open source software is the foundational infrastructure of the global digital economy. Estimates of its total economic value exceed $8.8 trillion. It underpins financial systems, healthcare networks, national security infrastructure, and the majority of commercial software products worldwide. It is maintained, overwhelmingly, by unpaid volunteers.

    This paper argues that this arrangement was already structurally unsustainable before the emergence of generative AI tools, and that those tools have now materially accelerated this pre-existing decline. There is longitudinal evidence of a rising trend in open source project abandonment and contributor decline that pre-dates both the COVID-19 disruption and the AI adoption wave. This baseline goes a long way to establishing that the ecosystem was deteriorating under the weight of years of corporate free-riding on unpaid maintainer labor. Against this baseline, we assess three mechanisms through which generative AI is compounding the crisis:

    • the degradation of contribution quality and the associated acceleration of maintainer burnout
    • a novel and rapidly expanding security attack surface
    • the structural invisibility of both harms in the ROI framework organizations use, and through which AI adoption is currently evaluated and justified.

    The logical upshot of this data is that we make a regulatory argument. The costs of AI misuse in the open source ecosystem are external, that is they are invisible to the measurement frameworks of the organizations generating them. Thus they are forced to be absorbed by a commons with no mechanism to charge the parties responsible. The open source commons has been given a decade to demonstrate otherwise, but it has shown that voluntary correction cannot be relied upon. The voluntary participation rate among open source users in the largest funding program is a staggeringly low 0.001 per cent. Put bluntly, the voluntary model has overwhelmingly and unambiguously failed.

    The irony at the center of this crisis is not incidental, it is structural. The artificial intelligence industry, which has grown faster and extracted more commercial value from open source software than any preceding wave of technology adoption, is built almost entirely on open source foundations. The training infrastructure that powers large language models runs on open source frameworks. The model architectures are derived from openly published research with open source reference implementations. The data pipelines, evaluation harnesses, fine-tuning libraries, and deployment tooling are, with negligible exception, open source projects maintained by the same volunteer and underfunded contributor communities this paper documents as being in decline. AI companies did not merely benefit incidentally from the open source commons. They consumed it at scale, systematically, as a precondition of their existence. They are now, through the AI coding tools they sell into the developer market, one of the primary vectors for the contribution noise, quality degradation, and maintainer burnout that this paper documents as accelerating the commons’ deterioration. The industry that most depends on the commons is among those most actively degrading it. That is not a peripheral observation. It is the central justification for the policy response this paper proposes.

    That proposal is the creation of a Sovereign OSS Fund. It would be a mandatory contribution mechanism, scaled proportionately to commercial benefit and AI deployment intensity. This mechanism would be similar to the existing models of the IMF/EU Central Bank and Federal Reserve Bank for fininancial regulation and maintenance. The Sovereign OSS Fund would be governed by an independent body operating outside both government and industry control, designed to function as a collective insurance pool against the systemic risk of critical digital infrastructure failure. The fund is not a tax. It is the price of operating responsibly in a shared digital economy whose foundations are disintegrating.


    Introduction: The Open Source Bargain and Its Breaking Point

    There is an implicit bargain at the foundation of the modern digital economy. Millions of software developers, working largely without payment, maintain the libraries, frameworks, and tools upon which the world’s largest technology companies have built products worth trillions of dollars. Those companies take the outputs freely, use them in their commercial products, and in theory, contribute back to the projects they benefit from. In practice, this translates either to contributing minimally or not at all to the maintenance of the infrastructure they depend upon. This arrangement has persisted for two decades, founded on the assumption that the goodwill of volunteer maintainers is an inexhaustible resource. It is not.

    The signs of structural strain have been visible for years. Burnout rates among open source maintainers have been consistently above 50 per cent in every survey conducted since 2015. Critical infrastructure projects have been maintained by single individuals, or small groups, with no succession planning, no institutional support, and no compensation commensurate with the systems depending on their work. The Log4Shell vulnerability in 2021 exposed this dependency to the world. It was a critical flaw in a library maintained by two volunteers, that threatened hundreds of millions of systems globally. Estimates put the cost to the private sector at an estimated $100 million or more in remediation. It received the attention of governments and security agencies worldwide, and many expected corporations to take from that episode. It revealed in stark terms that they had a duty of care to the infrastructure they depended upon. Unfortunately, as is so often the case when the cost is invisible to the bottom line, this lesson was lost on the organizations in question. The voluntary participation rate in open source funding programs did not materially improve.

    Into this deteriorating landscape emerges generative AI, and the advent of the “citizen developer”. Beginning in late 2022, AI coding assistants became widely available to enterprise software developers and hobbyists alike, and their adoption has been rapid. These tools are capable of generating code, identifying bugs, and drafting documentation at unprecedented scale and speed. Critically, as the tools have improved, they have also become capable of autonomously submitting contributions to open source projects in the form of pull requests and issue reports. Their deployment at scale by large commercial organizations, and ease-of-use by amateur coders has introduced a new category of stress into a system already under strain. Maintainers now face a flood of AI-generated contributions requiring human review, often containing higher error rates, and more subtle and hard to identify errors, than human-authored code. The errors are part of the tools themselves, hallucinations, and security vulnerabilities that these AI models often produce. Unfortunately they are also arriving faster than the volunteer maintainer community can process them. Several prominent projects have already changed their governance structures or closed to external contributions in response. Others have been retired, or effectively closed their programs entirely.

    At it’s core, and based on all these factors, this paper makes three basic supporting arguments for its ultimate policy recommendation:

    1. The decline of the open source ecosystem is not a new phenomenon caused by AI, but a pre-existing structural failure that AI is now meaningfully accelerating. The distinction matters for policy: the appropriate response to an AI-caused problem is AI regulation; the appropriate response to a structural market failure that AI is worsening is structural market reform.
    2. The harm AI is causing to the open source commons is currently invisible to the corporate decision-making frameworks through which AI adoption is evaluated, and that this invisibility is the foundational reason why voluntary self-regulation cannot be expected to produce adequate results.
    3. That governments have allowed this situation to develop through a combination of deference to industry self-regulation and a failure of their own measurement frameworks, and that the time for that deference has passed.

    The policy proposal at the center of this paper, the creation of the Sovereign OSS Fund, is designed as a structural response to a structural problem. It does not restrict AI adoption, nor does it seek to regulate it. Instead, it requires that organizations adopting AI at scale bear, proportionately, the costs of maintaining the underlying infrastructure their adoption is degrading. It is governed independently of both government and industry to prevent the capture that has historically undermined similar initiatives. And it is framed, deliberately, not as a punitive instrument but as a collective insurance pool against a systemic risk that every participant in the digital economy shares.


    Argument 1: The Pre-Existing Fragility — A Decade of Structural Deterioration

    Any accurate account of what generative AI is doing to open source must begin before AI. The argument that AI is the cause of the ecosystem’s current difficulties is both analytically incorrect and politically misleading. It locates the problem in a disruptive technology, rather than in the structural conditions that made the technology’s impact so damaging. The ecosystem was already failing, with meaningful declines in participation and volume of activity on projects already in evidence since well before the advent of AI tools. Understanding why the system is in such a state, requires examining the last two decades of a system of dependency without reciprocity.

    1.1  The Scale of the Dependency and the Vacancy of the Contribution

    The economic value of open source software is difficult to overstate. A 2024 study from Harvard Business School estimated the total value of freely available open source code at $8.8 trillion. It concluded that if companies were required to replicate that software using proprietary development, the cost would be 3.5 times higher than current expenditure on commercial software globally. Just south of $30 trillion, or the entire GDP of the United States to find an applicable comparison. To say the impact of the loss of this infrastructure would be seismic is perhaps the understatement of all time.

    The maintenance of this infrastructure is provided overwhelmingly by unpaid volunteers. The 2024 Tidelift State of the Open Source Maintainer Report found that 60% of maintainers describe themselves as unpaid hobbyists. The Linux Foundation’s Census II study found that 136 individual developers were responsible for more than 80 per cent of the code committed to the top 50 open source packages. These are not secondary utilities. They include the cryptographic libraries, authentication frameworks, network protocols, and data processing tools upon which the majority of commercial software infrastructure is built.

    The contribution gap, the distance between the value extracted and the maintenance investment returned, is the structural condition from which everything else in this paper follows. It is not new, and it is not narrowing. GitHub Sponsors, the largest voluntary funding mechanism for open source maintainers, has grown to approximately 4,200 corporate participants out of an estimated 300 million companies worldwide that use open source in commercial products. This is a voluntary participation rate of approximately 0.001 per cent. The voluntary model has had a decade to demonstrate that the private sector will voluntarily maintain the commons it depends upon. It has not done so.

    1.2  The Human Cost: Burnout and the Exit of Experienced Maintainers

    The sustainability crisis is not primarily a financial one, though finance is a component. It is a human one. Maintainers are leaving open source projects, not because they have been offered better employment elsewhere, but because the work has become untenable. The 2024 Tidelift survey found that 60 per cent of maintainers had quit or seriously considered quitting due to burnout, a figure statistically unchanged from the prior year’s 58 per cent despite sustained public awareness of the problem following Log4Shell.

    The causes of burnout are consistent across surveys and qualitative accounts: the volume of inbound issue reports and pull requests; the expectation of responsiveness from commercial users of projects maintained in personal time. The exposure to harassment and entitlement from users who treat open source maintainers as unpaid support staff for commercial products. And not least of all, the psychological weight of knowing that critical infrastructure used by millions of people depends on one’s continued personal availability. These are not problems that can be solved by occasional financial contributions. They require structural changes to how the open source commons is governed and resourced.

    The loss of experienced maintainers has a compounding effect that is structurally more damaging than the headline burnout figures suggest. When a maintainer with years of context about a project’s architecture, security considerations, and dependency graph leaves, that knowledge is generally not recorded. The project does not produce a transition document. The next maintainer, if there is one, begins with a fraction of the institutional knowledge that left, and a learning curve to understand the software as it is today. This is the mechanism by which the single-maintainer dependency problem becomes a security vulnerability: not when the maintainer burns out, but when the replacement, or the absence of any replacement, creates a gap that determined adversaries can exploit. The XZ-Utils incident, in which a sophisticated multi-year social engineering campaign nearly installed a backdoor in the majority of the world’s Linux distributions through a single maintainer’s exhaustion and consequent susceptibility to a seemingly helpful new contributor, is the clearest illustration of where this leads.

    1.3  The Concentration Problem: Single Points of Failure at Scale

    The Linux Foundation’s Census II methodology identified a specific and underappreciated dimension of the fragility: it is not simply that open source is under-resourced overall, but that the under-resourcing is extreme at precisely the most critical points. The finding that 136 developers are responsible for more than 80 per cent of contributions to the top 50 packages means that the global digital economy has, in practical terms, approximately 136 single points of failure in its most critical software infrastructure. This is before any consideration of the long tail of smaller but still critical projects below that top tier.

    The concentration problem is made worse by the invisibility of dependency relationships to the organizations that have them. A financial services firm with a multi-billion-dollar technology budget may have no awareness that one of its core authentication systems depends on a library maintained by a single individual in their personal time, using personal equipment, with no institutional support and no succession plan. The SBOM (Software Bill of Materials) mandates introduced by the Biden administration’s 2021 Executive Order on Cybersecurity were designed, in part, to address this visibility gap. They are a necessary but far from sufficient response to the underlying risk.

    1.4  What Corporate Engagement Has Looked Like in Practice

    It would be inaccurate to suggest that corporations have done nothing. Several large technology companies have established open source program offices, contributed employees to the maintenance of critical projects, and made financial commitments to open source foundations. These efforts are real, and do make a meaningful difference. They are also structurally insufficient to stem the bleeding of the ecosystem as a whole. The data clearly shows that they concentrated among a small number of technology companies, weighted toward projects of direct commercial relevance to those specific companies’ products, and disconnected from the long tail of critical infrastructure that receives no equivalent support.

    The pattern that emerges from the evidence is one of extractive dependency. Corporations that have built significant portions of their product value on open source foundations, and that have not reciprocated at a level commensurate with that extraction. The Harvard HBS figure of $8.8 trillion in value, time 3.5x replacement costs, is a measure of what those corporations would need to spend to replicate what they currently receive for free. Against that figure, the aggregate corporate investment in open source maintenance is vanishingly small. This is not primarily a result of bad faith, no one wants to actively kill off open source. It is a result of a measurement framework that accounts for the value received but not the cost of providing it, and an incentive structure that rewards free-riding as long as the commons holds.

    Argument 2: The Longitudinal Abandonment Trend — A Pre-Existing and Worsening Crisis

    Establishing that open source project abandonment was rising before the AI adoption wave is the central evidentiary task of this paper. It is also the task on which the paper’s regulatory argument most directly depends. If AI caused the crisis, the appropriate policy response is AI regulation. If the crisis pre-dates AI and AI is accelerating it, the appropriate policy response is structural, and must address the market failure that made the ecosystem vulnerable, with AI governance as a component rather than the totality of the solution. This section presents existing secondary evidence, based on our own original primary research using GHArchive public data, and a framework for interpreting what the combined evidence shows.

    2.1  Existing Evidence of Pre-AI Decline

    Several peer-reviewed studies establish that open source project abandonment was rising before generative AI tools became widely available. Coelho et al. (2019) found a 16 per cent abandonment rate in a sample of 1,932 popular GitHub projects, identifying maintainer overwork and declining community engagement as the primary drivers. This study, published before any significant AI coding tool adoption, documents a baseline condition of structural fragility that existed independently of the AI question.

    The more substantial dataset comes from Xu et al. (arXiv 2507.21678, 2025), which analyzed 115,466 GitHub repositories and confirmed abandonment in 57,733 of them, using a multi-factor methodology that filtered for genuine project inactivity rather than simply the absence of recent commits. This dataset is, to date, the largest systematic abandonment analysis in the literature. Its primary limitation for the purposes of this paper is that it was published as a predictive modelling framework rather than a time-series analysis: it identifies abandoned projects but does not provide year-by-year disaggregated data that would allow a trend line to be constructed.

    The Tidelift maintainer survey series provides complementary evidence from the supply side. Rates of burnout, quitting, and seriously considered departure among open source maintainers have been consistently above 55 per cent in every Tidelift maintainer survey conducted between 2021 and 2024, a figure the surveys themselves describe as essentially unchanged year on year despite growing public awareness of the problem. The consistency of these figures across four consecutive annual surveys is itself the most damaging evidence against the voluntary action argument. Awareness of the crisis has not moved the needle. The surveys do not provide the project-level longitudinal data that would allow a contributor attrition trend line to be constructed, but they establish the human context within which the quantitative findings in Section 2.3 should be read.

    2.2  Primary Research: The GHArchive Analysis

    To establish a longitudinal trend line from primary data, this paper presents original analysis of the GHArchive public dataset, covering the period 2015 to 2025. GHArchive is a public archive of GitHub’s event stream, capturing push events, pull request events, issue events, and contributor activity across the full population of public GitHub repositories. The analysis focuses on a filtered set of high-longevity repositories selected to represent consequential open source infrastructure rather than the long tail of personal and experimental projects.

    Methodology and filtering criteria

    The filtering methodology adapts the multi-factor approach of Xu et al. to the fields available in the GHArchive event stream. Repositories were required to meet all of the following criteria to be included in the analysis: a minimum of seven years of continuous activity within the study window; a minimum of five distinct contributors at peak year engagement; a minimum of 100 activity events per year in active years (approximately two meaningful actions per week on an annualized basis) to distinguish sustained maintenance from occasional personal activity; and exclusion of known noise patterns including repository names consistent with tutorial, course, homework, dotfiles, demo, example, and curated list repositories. Repositories with a pull-request-to-push ratio below 0.01 were excluded as probable automated mirrors.

    The analysis tracks four primary metrics across each year in the study window:

    1. total activity events
    2. distinct contributor count by GitHub login
    3. push events as a proxy for direct commit activity
    4. pull request events as a proxy for external contribution volume.

    The distinction between push and pull request event trends becomes analytically significant in the 2023 data, as discussed in Section 2.3.

    The dataset covers 2015 to 2025. Data prior to 2015 exists in the GHArchive record but is treated with caution given documented inconsistencies in GitHub’s early event capture; 2015 is the more defensible starting point for the quantitative trend line.

    Known limitations

    The GHArchive event stream captures activity but cannot distinguish the origin of that activity. A pull request submitted by a human developer and one generated by an AI coding tool appear identically in the event record. This limitation is directly relevant to the interpretation of the 2023 data discussed in Section 2.3, and the paper is explicit about what can and cannot be established from aggregate event counts alone. The contributor count metric uses distinct GitHub logins per year, which will under count contributors operating under multiple accounts and over count bots or automation scripts operating under named accounts. These limitations are acknowledged and do not materially affect the directional findings.

    Note on forthcoming research: A natural extension of this analysis is tracking individual contributor logins across repositories to distinguish migration from complete exit. While it is technically feasible using the same GHArchive dataset, it is outside the scope of this paper. A targeted version, tracking the 2019 Gatsby contributor cohort against successor frameworks Next.js and Astro in 2021–2023, is planned as a companion study. The question of whether contributors who leave established projects migrate to newer ones or exit open source contribution entirely has significant implications for the policy argument. A migration suggests the contributor pool is redistributing, while a full exit confirms the net depletion the Tidelift surveys describe from the supply side. This forthcoming research is flagged here as a known gap that the current analysis cannot resolve from aggregate event data alone.

    2.3 Findings: The Longitudinal Trend

    Analysis of the filtered repository set across the full 2015–2025 study window reveals a pattern that has moved, in the period since the original draft of this section, from predictive to descriptive. The stress accumulation model advanced in Section 1 is no longer a projection of where the open source ecosystem is heading. The data documents where it has already arrived.

    Before presenting the findings it is worth establishing the analytical frame. The dataset does not show uniform collapse, and the paper does not claim that it does. It shows a clear and consistent bifurcation between two categories of project. The first category, those projects whose institutional backers have direct, ongoing, profit-motivated investment in sustaining their contributor ecosystems, shows stability or modest decline. The second category, those projects that are commercially critical but whose backers extract value from them without commensurate contributor investment, shows sustained and in many cases accelerating decline. Notably, the 2024 and 2025 data reveals a cliff that the 2015–2023 trend line had been building toward for almost a decade. The bifurcation is the finding. It is not a caveat to the argument. It is the argument.

    The pre-COVID baseline deterioration (2015–2019)

    Among projects with nine or more years of continuous activity in the dataset, the majority show declining or flat contributor counts between 2015 and 2019. Apache Spark declined from 1,165 distinct contributors and approximately 177,000 total events in 2015 to 741 contributors and 127,000 events by 2019. This was a 36 per cent reduction in the active contributor base during a period of strong and growing enterprise adoption. Elastic Kibana peaked at 2,639 distinct contributors in 2016 and had declined to 2,208 by 2019 despite substantially increased event volume, the earliest instance in the dataset of the concentration dynamic. That is when a shrinking contributor pool is absorbing a growing workload. gRPC peaked at 1,761 contributors in 2017 and had begun a decline that would prove uninterrupted through 2025. Laravel/framework peaked at 4,059 contributors in 2017 and declined steadily thereafter. Golang/go reached its contributor peak of 4,996 in 2019 before beginning a decline that would continue without interruption to 2025.

    The pre-COVID pattern is significant because it establishes that contributor attrition in established projects was underway before either the COVID disruption or the AI adoption wave. It was not caused by either, but they accelerated it.

    The 2020 disruption

    The 2020 data shows a marked increase in activity across the majority of projects in the dataset, consistent with the well-documented surge in open source contribution accompanying the shift to remote working during the COVID-19 pandemic. TensorFlow reached its peak contributor engagement of 10,796 in 2020. Flutter reached its all-time peak of 19,666. Rust reached 3,536. Homebrew peaked at 1,014. Apache Spark was the notable exception to this rule. Its contributor count continued falling through 2020 from 741 to 651, as a project whose structural decline was already too advanced for the broader ecosystem trend to reverse.

    The 2020 data is treated throughout this analysis as an external disruption rather than a structural signal. Its analytical significance lies entirely in what the recovery looks like in the years that follow.

    The incomplete recovery (2021–2022)

    Across the majority of established projects in the dataset, the 2021–2022 period does not return to 2019 levels. It settles at a lower floor. Kubernetes declined from its peak of 6,606 contributors in 2019 to 4,060 by 2022. Elastic Kibana fell to 1,517, 42% below its 2016 peak. Flutter fell from 19,666 in 2020 to 11,926 by 2022, a 39% decline in two years. TensorFlow fell from 10,796 in 2020 to 3,872 by 2022, a 64% decline in two years, despite sustained internal Google development activity throughout. Gatsby, whose collapse began before the COVID disruption, fell from 5,947 contributors at peak to 1,505 by 2022 — an 87 per cent decline driven primarily by the Netlify acquisition and the community exit that followed it.

    The system emerged from the 2020 disruption weaker than it entered it. The post-COVID floor was lower than the pre-COVID baseline across virtually every project in the declining category.

    The 2023 AI noise signal

    The 2023 data shows a pattern consistent with the AI contribution noise hypothesis advanced in Section 3. Several projects show partial stabilization or modest recovery in total event counts. Year-on-year, Kubernetes was flat at 145,152, Elastic Kibana was flat at 192,022, Rust marginally up, even while distinct contributor counts continued to fall. For Kubernetes, contributors fell from 4,060 to 3,420 despite flat event totals. TensorFlow’s push events remained stable while PR events collapsed from 5,239 to 2,421, consistent with continued corporate-driven development and a near-complete withdrawal of external community contribution.

    This event-contributor divergence, appearing consistently across multiple projects in the first full year of mass AI coding tool adoption, is consistent with AI-generated activity inflating event counts while the human contributor base continues its structural contraction. It must be very clearly stated that this cannot be established definitively from aggregate event data alone. Distinguishing AI-generated from human-generated contributions in the GHArchive event stream requires contributor-level analysis beyond the scope of this study, but the directional pattern is consistent with what AI-generated contribution noise at scale would produce, and is consistent with the qualitative evidence from named projects in Section 2.4.

    The 2024–2025 acceleration: the reckoning arrives

    The 2024 and 2025 data represent the most significant finding in this analysis. What the 2015–2023 trend line had been building toward as a projection has arrived as a measured reality. Across the majority of established projects in the dataset, 2024 and 2025 show accelerating contributor collapse that in several cases returns projects to or below their pre-growth baselines. In just two years a decade of community contribution is erased, while the commercial infrastructure built on those projects has never been larger or more deeply embedded. These are some important and illustrative examples, but the list of declines is many, many times longer than presented here.

    Kubernetes fell from 3,420 contributors in 2023 to 3,098 in 2024 and 2,365 in 2025. Meanwhile, its total events in 2025 were 93,627, almost exactly its 2015 level of 94,419. A project that grew from a standing start to become the de facto standard for enterprise container orchestration globally has returned, on contributor engagement, to where it began. The commercial dependency on Kubernetes has never been greater. The community sustaining it has never been smaller relative to that dependency.

    Flutter fell from 11,399 contributors in 2023 to 10,364 in 2024 and 7,115 in 2025. Its total events of 91,653 in 2025 were the lowest since 2018, when Flutter was a niche project with minimal adoption. Despite the demonstrable fact that it now underpins hundreds of millions of deployed applications across iOS and Android.

    Apache Spark, admittedly a project that would be considered “legacy” in some circles, but is still widely deployed in analytics workflows globally, reached 523 contributors and 18,167 total events in 2025, below its 2015 starting point of 1,165 contributors and 177,222 events on both metrics.

    Golang/go fell to 2,711 contributors in 2025, a 46 per cent decline from its 4,996 peak.

    gRPC fell to 632 contributors, a 64 per cent decline from peak, with push events rising in 2024–2025 in a pattern consistent with internal automation replacing community contribution.

    Laravel/framework fell to 1,675 contributors in 2025, its lowest recorded level.

    TensorFlow presents the most analytically complex 2024–2025 pattern in the dataset and warrants specific treatment. Total events spiked to 111,654 in 2024, the highest since 2020, while distinct contributors collapsed to 2,921. In 2025, events remained elevated at 93,418 while contributors fell further to 1,222. Push events exploded from approximately 15,000 per year across 2021–2023 to 56,756 in 2024 and 47,013 in 2025. PR events similarly spiked from around 2,400–7,300 in prior years to 31,361 in 2024 and 34,214 in 2025. The result is that 1,222 distinct contributors generated 93,418 events in 2025. That’s an average of approximately 76 events per contributor, compared to approximately 9 events per contributor at peak community health in 2019 the last “regular” year of data. This is the most extreme event-contributor divergence in the dataset. It is consistent with a project that has become primarily a vehicle for internal Google automation and AI-generated contribution activity, with its open source community having almost entirely disengaged. That Google’s primary AI framework, has reached this state is not a footnote – vast swathes of the AI ecosystem would not exist without Tensorflow and it’s overall capabilities. It is a summary of the paper’s argument in a single data series.

    WordPress/Gutenberg warrants specific attention given its infrastructural significance, and its 2024–2025 trajectory is more explicable, and more instructive, than the raw numbers alone suggest. Events of 101,447 and contributors of 2,019 in 2024 were followed by 34,422 events and 1,191 contributors in 2025. That is a 66 per cent single-year event collapse and a 41 per cent contributor decline in one year. WordPress powers approximately 40 per cent of the public web. Gutenberg is its core editing interface. The proximate context for this decline is the highly public dispute between Automattic and WP Engine in late 2024, which produced governance instability, community polarization, and a measurable withdrawal of developer engagement at precisely the moment Automattic was accelerating its AI integration strategy. The data suggests a compound mechanism: governance-driven community exit amplified by developer push-back against an AI-first strategic direction imposed by the corporate backer without community consensus. If that interpretation is correct, something subsequent qualitative research is required to confirm, then Gutenberg represents a new category of decline mechanism not previously identified in this dataset. Active contributor withdrawal in response to corporate AI encroachment on a project’s identity and direction, rather than the passive attrition through burnout and neglect that characterizes the majority of declining projects. The policy implication is that AI strategy decisions made unilaterally by corporate backers can destroy contributor communities rapidly and at scale, adding a further dimension to the governance argument developed in Section 5.

    The counter-examples and what they prove

    A handful of projects in the dataset show stability or growth in 2024–2025, and their characteristics are analytically decisive.

    Visual Studio Code — Microsoft’s code editor and the primary host for every major AI coding extension on the market shows events of 144,669 in 2024 and 150,398 in 2025, with contributors of 20,579 and 19,982 respectively. Essentially it is flat across three years ending 2025. Microsoft has direct, measurable, ongoing commercial incentive to maintain VS Code’s contributor ecosystem. It is significant that it is the delivery mechanism for GitHub Copilot, which is a significant and growing revenue line. It’s project stability is not coincidental. It is the product of active investment.

    Hugging Face Transformers — the most widely deployed model fine-tuning and serving library in the AI industry shows events of 52,959 in 2024 declining to 40,072 in 2025, with contributors of 4,904 and 3,515 respectively. A meaningful 2025 decline, but from a high base, and 2025 levels remain above 2021 and 2022 unlike other projects. The AI industry has direct commercial incentive to maintain this library’s contributor ecosystem. The relative resilience, compared to projects where that direct incentive is absent, is consistent with the pattern.

    NixOS/nixpkgs reached its all-time peak of 407,677 events and 9,644 contributors in 2024 before a modest pullback to 350,974 events and 8,896 contributors in 2025. It remains the one project in the dataset showing sustained community-driven growth without clear corporate profit motivation as the primary driver. This is a genuine exception, explicable by the combination of strong foundation governance, an ideologically committed community, and technical characteristics that attract sustained new contributor interest. It is the dataset’s proof that the structural conditions for sustainability are achievable. It is also, notably, the only project in the dataset for which that can be said in the absence of direct corporate profit motivation.

    Homebrew/homebrew-core shows a more ambiguous picture that is worth flagging as a warning signal rather than a counter-example. Total events grew through 2024 (201,202, the highest in its history) before a modest 2025 pullback to 191,936. But distinct contributors have been falling steadily and without interruption: from 3,114 in 2017 to 1,410 in 2025, a 55 per cent decline across eight years. Homebrew-core is doing substantially more work with substantially fewer people. The event volume is being maintained while the contributor base sustaining it is quietly contracting. This is the concentration dynamic in its most developed form. It is not yet a cliff, but with the preconditions for one accumulating year on year.

    Rust-lang/rust shows a 2025 anomaly that requires a methodological note. Contributors fell from 4,257 at peak to 2,926 in 2025, continuing the gradual decline visible since 2022. However, push events spiked sharply in 2025 to 12,130. That’s nearly five times the 2024 level of 2,722 and the highest push event count in the project’s recorded history. Interesting because it happens with no corresponding announcement of a large-scale internal engineering initiative in the Rust project’s published 2025 roadmap and blog record. The most simple explanation, consistent with the broader patterns observed in this dataset, is that Rust is experiencing significant AI-generated or automated push activity. Rust’s technical prestige and community prominence make it a plausible target for low-effort, AI-assisted contributions, by actors seeking visibility in a high-status project. This hypothesis will be explored in subsequent research using contributor-level analysis. For the purposes of this paper, the Rust contributor count trend, declining but not collapsing and remaining substantially above pre-COVID levels, is treated as the more reliable signal of community health, and the 2025 push event spike is flagged as an unverified anomaly rather than an organic community signal.

    The pattern and what it means

    The bifurcation across this dataset is not random and it is not explained by project age, language, domain, or corporate affiliation in the simple sense. What distinguishes the stable projects from the declining ones is a single variable: whether the institutional backer has direct, ongoing, profit-motivated incentive to invest in the contributor ecosystem specifically. The backer(s) aims to fully use the project, not merely to employ engineers who commit to it, but to actively sustain the community that maintains it.

    Microsoft invests in VS Code’s contributor ecosystem because VS Code is the distribution mechanism for Copilot revenue. Hugging Face invests in Transformers because Transformers is the product. NixOS’s community invests in itself because the community is the product. In every other case in the dataset, the institutional relationship is extraction. The project is used, depended upon, and in some cases nominally sponsored, but the contributor ecosystem is not actively maintained as an asset. It is consumed as a commons.

    The result is visible in the data across a decade. Commercial dependency without active contributor investment is not protection. It is a slow-motion depletion of a shared resource. The projects in this dataset that are declining are not failing because they are bad projects or because their maintainers are insufficiently dedicated. They are failing because the market structure that surrounds them systematically rewards extraction and systematically fails to fund maintenance. That market failure is what this paper’s policy proposals are designed to address.

    2.4  Case Studies: Projects at the Breaking Point

    The quantitative trend data is supplemented with named case studies that illustrate the mechanisms at work. Four cases are particularly instructive, each representing a different point on the failure spectrum from governance change to active security exploitation.

    Ingress NGINX was retired from active development in November 2025. It is not a peripheral project: it was the default ingress controller for Kubernetes clusters globally, underpinning enterprise infrastructure across financial services, technology, and public sector deployments. Its retirement was not caused by a lack of corporate sponsorship — it had that. It was caused by a lack of human attention: the maintainers, facing an increasing volume of AI-generated pull requests requiring review, and carrying the cognitive overhead of a project whose architectural complexity had grown beyond what a small volunteer team could sustainably manage, simply reached the end of their capacity.

    curl, one of the most widely deployed command-line tools in the world, paused its bug bounty programme in 2024 after receiving a flood of AI-generated bug reports that described non-existent vulnerabilities, consuming maintainer time without producing security improvements. The decision was documented publicly by the lead maintainer, providing a rare first-hand account of the mechanism by which AI degrades maintainer sustainability.

    tldraw, an open source drawing tool with widespread use in commercial products, temporarily blocked all external pull requests in response to what its maintainers described as a volume of AI-generated contributions that overwhelmed their capacity for review. The block was a pragmatic response to a denial-of-service attack on maintainer attention, using the language of one of the project’s own contributors.

    XZ-Utils, a data compression library present in the majority of Linux distributions, was the target of a multi-year social engineering campaign that came within a routine system update of installing a backdoor in global SSH infrastructure. The attacker exploited a maintainer in visible distress, who was overwhelmed, and publicly expressing difficulty keeping up. They did this by offering sustained, high-quality contributions over months before embedding a malicious payload. This case pre-dates AI-assisted attacks but illustrates the attack surface that AI dramatically widens. A maintainer under stress, with insufficient capacity to review contributions thoroughly, is the vector. A huge volume of low-effort AI submissions is the smokescreen. ~A single malicious contribution is the payoff.

    Argument 3: How Generative AI is Worsening the Crisis in Three Mechanisms

    This section is not an argument that generative AI is intrinsically harmful to open source. Some maintainers are using AI tools to manage their own triage, documentation, and code review workloads. Some projects have benefited from AI-assisted contributions that improve code quality. The argument is narrower and more specific. It is simply that generative AI, as deployed at scale by commercial organizations without governance frameworks adequate to the task, is net-negative for the structural health of the open source ecosystem through three distinct mechanisms.

    3.1  Contribution Quality Degradation and the Asymmetric Burden

    AI coding assistants are capable of generating a lot of very plausible-looking code, very quickly. They are significantly less capable of generating high-quality and accurate code, particularly in the context of large, complex, long-lived projects with intricate dependency relationships and non-obvious invariants. The gap between plausible and accurate is not random, rather it is systematically biased toward certain failure modes that are expensive for maintainers to detect and reject.

    The Carnegie Mellon University study analyzed 807 open source GitHub repositories that adopted Cursor between January 2024 and March 2025, tracking code quality metrics through to August 2025. It found no reversal in quality degradation trends even as the underlying models improved substantially across the same period. The GitClear analysis of 153 million lines of committed code found that code duplication increased from 8.3 per cent to 12.3 per cent between 2021 and 2024, and that refactoring activity, the deliberate improvement of code structure by developers who understand it deeply, collapsed from 25 per cent of commits to fewer than 10 per cent over the same period. Code that is duplicated is code that is not understood, Code that is not refactored is code that accumulates technical debt. Both are indicators of AI-generated contributions being accepted into codebases without the quality of human review that the original contribution volume warranted.

    The CodeRabbit analysis found that AI-generated code creates approximately 1.7 times as many code review issues as human-authored code. The asymmetry is the critical point. The entities generating the increased volume of contributions incur none of the cost of reviewing them. The commercial organization whose developer submitted a pull request using an AI tool bears no cost if that pull request requires multiple rounds of review from a volunteer maintainer. The maintainer bears the entire cost. This asymmetry is the externality structure that markets cannot self-correct.

    The phenomenon has been characterized by practitioners as ‘vibe coding’, the generation of contributions that appear to address a stated issue but that have not been verified by the submitter against the actual system, and that may contain subtle errors, incorrect assumptions, or changes to unrelated parts of the codebase. By early 2026, curl, tldraw, and Ghostty had all changed their contribution policies directly in response to this phenomenon. The Ingress NGINX retirement, noted above, represents the highest profile example of the end state of the trajectory these policy changes are attempting to forestall.

    3.2  Security Escalation: From Quality Degradation to Attack Vector

    The security implications of AI-assisted development extend beyond the quality issues discussed above. AI tools introduce a distinct category of security risk that operates at the architectural level rather than simply the code quality level, and that is substantially more difficult to defend against.

    Apiiro’s analysis of Fortune 50 enterprises found a ten-fold increase in security findings per month between December 2024 and June 2025. Research from multiple independent sources consistently finds that between 15 and 25 per cent of AI-generated code contains security vulnerabilities. A 2024 academic study found that between 5 and 22 per cent of the package names referenced in AI-generated code do not exist. This is a phenomenon that has given rise to a novel attack category. When AI models hallucinate package names and developers submit code referencing those names to open source repositories, attackers can register the hallucinated names with malicious payloads. Any project or user that subsequently installs the dependency as referenced in the accepted code receives the attacker’s payload. This attack vector, termed ‘slopsquatting’ by researchers, requires no compromise of any existing package; it exploits the hallucination behavior of AI coding tools directly.

    Georgetown University’s Center for Security and Emerging Technology has documented a more sophisticated variant already in use. The AI models themselves can be targeted with data poisoning attacks that cause them to recommend malicious packages to developers using AI coding tools. This attack operates at the model level rather than the code level, and is therefore invisible to conventional code review processes. A developer using an AI assistant that has been poisoned to recommend a particular malicious dependency may receive that recommendation across all of their coding sessions, across all projects, before any individual project maintainer has the opportunity to reject a pull request.

    The OpenSSF has explicitly noted that AI lowers the cost and sophistication barrier for supply chain attacks on open source infrastructure. The XZ-Utils attack, which required years of sustained, high-quality human engagement to execute, represents the high end of the sophistication spectrum prior to AI assistance. The tools now available to attackers make equivalent attacks substantially cheaper to mount, substantially easier to scale, and substantially harder to detect. Phylum’s analysis found more than 35,000 malicious packages in open source repositories, with an eight-fold increase in critical malware detection over the survey period.

    Key Finding: AI is not merely introducing more vulnerable code into open source repositories. It is lowering the barrier to sophisticated supply chain attacks on critical infrastructure. The combination of maintainer burnout, reduced review capacity, and AI-enabled attack sophistication creates a risk environment that is qualitatively different from the pre-AI baseline.

    3.3  The Measurement Problem: Invisible Externalities

    The third mechanism is structural rather than technical, and it is in some respects the most important. The harm that AI is causing to the open source commons is systematically invisible to the corporate frameworks through which AI adoption is evaluated and justified. This invisibility is not incidental. It is a feature of how ROI is measured in large organizations, and it is the foundational reason why the problem cannot be solved by appeals to corporate self-interest or voluntary action.

    The December 2024 IBM study on AI return on investment surveyed 2,413 IT Decision Makers at large enterprises. It found that 51 per cent reported positive ROI from AI adoption in software development contexts. The study is routinely cited by AI vendors and enterprise technology advocates as evidence that AI delivers measurable value to software development. A close examination of its methodology reveals why it cannot support that conclusion in the context of open source.

    The respondents are director-level or above, with purchasing authority over IT products. They are, structurally, the people whose professional role is to justify the purchasing decisions they have made. The metrics used to calculate ROI, in this study: faster software development, faster innovation, time savings in productivity, are self-reported estimates at the individual level. They are also measuring the output of the developer using the tool, not the total system impact including downstream costs. The study measures the value the AI tool appears to produce in the transaction where it is used. It does not and cannot measure the costs imposed on maintainers reviewing the output of that tool, on security teams addressing the vulnerabilities it introduces, or on the shared infrastructure degraded by its aggregate use across the industry.

    This is not a failure of IBM’s researchers. It is a description of the limits of ROI as a measurement framework for systemic externalities. The costs are real, they simply fall outside the boundary of the measurement. When an open source maintainer spends an additional hour reviewing an AI-generated pull request that turns out to be incorrect, that hour appears nowhere in any corporate ROI calculation. When a security team responds to a supply chain attack that exploited a hallucinated package name, the attribution to AI tool usage is invisible. The negative externalities are diffused across the commons and therefore unaccountable in any individual firm’s ledger.

    The executive-developer perception gap is the human face of this measurement failure. The Atlassian 2025 State of DevEx Survey found that 63 per cent of developers report that their leaders do not understand their pain points, a sharp increase from 44 per cent the prior year, with developers specifically identifying the disconnect between leadership enthusiasm for AI productivity and developer experience of AI-generated review burden as a primary source of frustration. The EY Work Reimagined Survey of 15,000 workers found that 64 per cent perceive increased workloads despite widespread AI adoption, with only 5 per cent using AI in genuinely transformative ways. The METR randomised controlled trial is perhaps the most damning of all for the impact of AI on software development. It found that experienced developers took 19 per cent longer on tasks when using AI tools, despite believing, in a pre-trial estimate, that AI had made them 20 per cent faster. The perception gap runs in the wrong direction even at the practitioner level; at the executive level, it is wider still. 


    Part 4: The Externality Structure — Why Markets Cannot Self-Correct

    The preceding three sections establish the empirical case: the open source ecosystem was already deteriorating before AI, AI is worsening it through identifiable mechanisms, and the harm is invisible to the measurement frameworks of the organizations causing it. This section makes the structural argument: that these conditions together constitute a classic negative externality that markets cannot self-correct, and that the historical record of voluntary action confirms this at scale.

    4.1  The Externality Defined

    A negative externality exists when the costs of a private action are borne by parties outside the transaction in which the action is taken, and where the acting party therefore has no market signal incentivizing cost reduction. Pollution is the canonical example: the factory producing pollution bears none of the cost of the health damage it causes to downstream communities, and therefore has no financial incentive to reduce pollution absent regulation. The incentive for self-regulation is structurally absent because the internal cost-benefit calculation of the polluting firm does not register the costs it is externalizing.

    The open source situation has precisely this structure. The commercial organization that deploys an AI coding tool at scale incurs the cost of the tool license and the time its developers spend using it. It receives the benefit of apparently accelerated output. It does not incur the cost of the additional review burden imposed on open source maintainers, the security vulnerabilities introduced into shared infrastructure, or the accelerated burnout of the volunteer community whose continued unpaid labor it depends upon. All of those costs are externalized onto the commons. The commons has no mechanism to charge the externalizing party. The externalizing party has no incentive to reduce its externalization because its internal metrics do not register it.

    The analogy to environmental regulation is instructive as a policy logic and not merely a rhetorical device. When private actors extract value from a shared resource while externalizing degradation costs onto that resource, the outcome in the absence of regulation is well-documented. It is the “tragedy of the commons”, in which individually rational behavior produces collectively irrational outcomes. Garrett Hardin identified the mechanism in 1968; Elinor Ostrom won the Nobel Prize in Economics in 2009 for demonstrating the governance conditions under which commons can be sustainably managed. The open source commons currently lacks those governance conditions. It has the access structure of a commons, free extraction by all, without the governance structure that prevents tragedy or abuse.

    4.2  Why Voluntary Action Has Failed and Will Continue to Fail

    The standard industry response to arguments for regulatory intervention is to appeal to the potential for voluntary self-regulation: the assertion that corporations, properly informed of the costs they are externalizing, will act in their long-term interest by contributing to the maintenance of the infrastructure they depend upon. There are two problems with this response. First, it has been tried and has demonstrably failed. Second, the structural conditions that produced that failure have gotten measurably worse.

    GitHub Sponsors has operated for several years. The voluntary participation rate among open source users is approximately 0.001 per cent. The Open Source Pledge is a more recent initiative asking companies to commit a minimum of $2,000 per developer per year to open source. While it has attracted a small number of early adopters, predominantly smaller technology companies, it is not the hyperscalers whose AI tool deployment is the primary driver of the problem this paper addresses. These initiatives are not failures of design or communication. They are demonstrations that the collective action problem inherent in commons governance does not yield to voluntary mechanisms in the absence of either binding agreements or reputational incentives strong enough to overcome the financial benefit of free-riding.

    The collective action problem takes a specific form here. Even a corporation that recognizes its dependency on open source maintenance and would, in isolation, prefer a well-maintained ecosystem to a deteriorating one has a strong competitive incentive not to contribute unilaterally. Every dollar spent on open source maintenance is a dollar not spent on competitive activities, and the benefit of that maintenance is shared equally with competitors who spend nothing. The only rational response under these conditions, for a firm maximizing its own competitive position, is to free-ride and hope that enough others contribute to keep the commons functioning. This is not bad faith! It is the predictable outcome of a measurement and incentive framework that rewards free-riding.

    Mandatory contribution solves the collective action problem that voluntary mechanisms cannot, because it eliminates the competitive disadvantage of unilateral participation. When all commercial beneficiaries of the open source commons are required to contribute proportionately, no individual firm is penalized for contributing. The cost is distributed across the industry rather than borne disproportionately by the conscientious minority.

    4.3  The Governmental Duty of Care

    Governments have a specific obligation in this context that extends beyond general market regulation. Open source infrastructure is not merely a private sector resource. It underpins financial systems, healthcare networks, energy grid management, national defense capabilities, and public communications infrastructure. Its failure is not just a private sector problem. It is a national security risk of the first order.

    The XZ-Utils near-miss in 2024 was a demonstration, in miniature, of what that failure looks like. A backdoor inserted through the social engineering of a single, overwhelmed volunteer maintainer came within a routine system update of compromising global SSH infrastructure. The attacker had nation-state level patience and sophistication. The defender had none: a single individual, unpaid, managing a critical library in personal time, with insufficient capacity to thoroughly scrutinise the contributions of a persistent and skilled adversary.

    The OpenSSF and Georgetown CSET have both documented that AI dramatically lowers the cost and sophistication barrier for equivalent attacks. An attack that required multi-year human investment to execute in the XZ-Utils case can now be partially or substantially automated. The number of potential targets that a sophisticated attacker can pursue simultaneously increases in proportion to the automation available. The maintained volunteer community that is the primary defense against such attacks is shrinking, burning out, and now absorbing additional review burden from the AI tool adoption of commercial organizations.

    The question for governments is not whether a systemic failure event is possible. Log4Shell and XZ-Utils have already demonstrated that it is. The question is at what point the expected value of the downside risk, the probability multiplied by magnitude of damage, exceeds the political cost of intervention. This paper argues that point was reached some time ago, and that generative AI has now compressed the timeline to the point where continued inaction represents not caution, but outright failure of the governmental duty of care.

    Part 5: The Policy Gap — Existing Instruments and Their Limits

    The argument for a Sovereign OSS Fund does not require that existing regulatory instruments have done nothing. Several have made meaningful contributions to the problem space. The argument is that collectively they leave a gap that is both large and consequential, and that the gap is precisely the one the fund is designed to fill.

    5.1  What Exists

    The EU Cyber Resilience Act (2024) is the most relevant existing instrument. It places mandatory security obligations on manufacturers of products with digital components, including products built substantially on open source foundations. In principle, it creates downstream liability for products that fail to meet defined security standards, establishing the precedent that commercial beneficiaries of open source have obligations that follow from that benefit. In practice, it creates compliance obligations but not maintenance funding: a manufacturer can satisfy its Cyber Resilience Act obligations by documenting its dependencies and implementing security response processes without contributing a single euro to the maintenance of the open source projects it depends upon.

    The Biden administration’s 2021 Executive Order on Cybersecurity mandated Software Bills of Materials for government procurement, requiring federal contractors to document their open source dependencies. This created the disclosure infrastructure that a contribution obligation model would require: if commercial organizations are required to know and declare what they depend on, that declaration can become the basis for a proportionate contribution obligation. The SBOM mandate is the thin end of the wedge. It has not yet been extended to create the contribution obligation that its transparency would make administratively feasible.

    The Biden National Cybersecurity Strategy (2023) shifted liability for insecure software toward organizations best positioned to reduce risk, explicitly rejecting the prevailing norm under which end users bear the cost of software vulnerabilities that developers or vendors could have prevented. The logic applies directly to the open source context: the organizations best positioned to reduce the risk of open source infrastructure failure are those large commercial entities with the resources to fund its maintenance. The strategy’s logic has not, however, been extended to create specific obligations in the open source maintenance context.

    5.2  What Is Missing

    The gap left by these instruments has three dimensions. First, none of them creates a funding mechanism. The Cyber Resilience Act creates compliance costs. The SBOM mandate creates disclosure costs. Neither generates resources for the maintenance it implicitly depends upon. Open source projects whose security and maintenance standards are assessed by the Act’s compliance framework will be judged against criteria that their volunteer maintainers have no additional resources to meet.

    Second, none of them addresses the AI dimension specifically. The acceleration of the problem documented in the preceding sections, specifically the flood of AI-generated contributions, the novel attack vectors, and the systemic invisibility of costs to corporate ROI frameworks, is entirely outside the scope of instruments designed before widespread AI coding tool adoption.

    Third, none of them addresses the governance problem. Who decides which projects are critical, and how are disbursements made against principled criteria rather than lobbying effectiveness? The absence of a credible, independent answer to this question is itself a barrier to the voluntary corporate contributions that the gap leaves as the only available mechanism. Corporations that might be willing to contribute do not have a trusted mechanism through which to do so that distributes the benefit to genuinely critical projects rather than high-profile ones.

    The Policy Gap in Summary: Existing instruments create transparency obligations (SBOM mandates), compliance obligations (Cyber Resilience Act), and liability-shifting logic (2023 National Cybersecurity Strategy). None creates a funding mechanism, none addresses the AI dimension specifically, and none resolves the governance question of how to identify critical projects and distribute resources to them credibly and independently. The Sovereign OSS Fund fills all three gaps.

    Section 6: The Sovereign OSS Fund — Architecture, Rationale, and Design

    The Sovereign OSS Fund is the central policy proposal of this paper. It is designed to address the market failure described in the preceding sections by internalizing the costs currently externalized onto the open source commons, doing so in a manner that is proportionate to commercial benefit and AI deployment scale, transparent in its governance and disbursements, and independent of both government control and industry influence in its critical designation decisions. This section sets out the fund’s rationale, its contribution structure, and its governance architecture.

    6.1  The Systemic Risk Insurance Framing

    The most rigorous case for the fund rests not on the polluter pays principle alone, though that principle is sound and applicable. It focuses instead on the systemic risk insurance framing inherent in the open source software ecosystem. Critical open source infrastructure is a public good, it is non-excludable (anyone can access it), non-rivalrous (one party’s use does not reduce another’s), and subject to free-riding at scale. It is also exposed to low-probability, high-impact failure events whose costs would be distributed across the entire digital economy without regard to any individual organization’s contribution history. This is the exact structure that insurance markets exist to address.

    Conventional insurance cannot function in this context, however, because the underlying asset is a commons. No single party owns or controls it, and no single party can be assigned the residual claimant status that insurance underwriting requires. The Sovereign OSS Fund therefore functions as a collective insurance pool, operated by an independent body, funded by proportionate mandatory contributions from commercial beneficiaries, and structured to maintain the designated assets against failure. The contribution obligation is the insurance premium. The designated projects whose maintenance the fund supports are the insured assets. The catastrophic failure events that the fund is designed to prevent, such as a Log4Shell-scale vulnerability in critical infrastructure, an XZ-Utils-style supply chain compromise of national security significance, or the accelerating abandonment of projects underpinning healthcare or financial system infrastructure, are the insured risks.

    The case for placing AI companies at the center of the mandatory contribution mechanism is not simply that they are large and profitable, though they are. It is that they occupy a uniquely paradoxical position in the open source ecosystem. They are its most intensive beneficiaries and, through the market deployment of AI coding tools, among its most consequential current stressors. The AI industry’s dependence on open source is not historical or incidental, it is ongoing and structural. Production AI systems today run on open source inference infrastructure, are evaluated against open source benchmarks, are fine-tuned using open source tooling, and are deployed through open source serving frameworks. The value of every commercial AI product on the market today is partly a function of the open source commons from which it was built and on which it continues to depend. Against this, the contribution noise, maintainer burden, and security surface expansion that AI coding tools are generating in the open source ecosystem represents a transfer of cost from the AI industry onto the maintainer communities it has already extracted value from, without commensurate return. The AI deployment levy proposed in this paper is not an external imposition on a bystander industry. It is a contribution requirement on the industry that has extracted most from the commons and is now, measurably, making it harder to sustain. Framed correctly, and the framing matters for political viability, this is not regulation. It is the belated recognition of a debt.

    This framing carries the urgency argument without alarmist leanings or hyperbole. The question put to governments is not whether catastrophic failure is possible. We are past that point, as it has already been demonstrated that it is. The question is whether the expected value of the downside risk now justifies the cost of collective insurance. The answer is yes, and it has been yes for some time. AI has not created this conclusion, it has made it more urgent and more undeniable.

    6.2  The Polluter Pays Dimension

    Alongside the insurance framing, the polluter pays principle provides the moral and legal foundation for the contribution obligation. Organizations that deploy AI coding tools at scale impose costs on the open source commons that their tools degrade. Those costs are real, measurable at the aggregate level, and uncompensated. The principle that the party generating an externality should bear the cost of it, established in environmental regulation across all major jurisdictions, cleanly and directly applies here.

    The fund is not a tax, and the paper asserts this distinction explicitly and for structural rather than merely rhetorical reasons. A tax generates general government revenue allocated through the political budget process. The Sovereign OSS Fund contributions flow directly into a ring-fenced mechanism managed by an independent body outside direct government control, disbursed according to publicly auditable criteria to specifically designated projects. No government has discretionary access to fund proceeds. The contribution is tied to a defined public benefit, literally the maintenance of infrastructure from which the contributing organizations derives direct commercial value. It is kept equitable because it is proportionate to the scale of that benefit and the scale of the harm being generated.

    6.3  The Contribution Tier Structure

    The contribution structure is designed to satisfy proportionality and administrability simultaneously. A blunt mechanism that treats a ten-person startup identically to a hyperscaler would be both inequitable and politically unworkable. A mechanism so complex that compliance costs exceed contribution value would be administratively self-defeating. The four-tier model below achieves both requirements.

    Tier 0 — Exempt:

    Organizations with annual revenue below €10 million carry no contribution obligation. The administrative cost of collection from this tier would exceed the revenue generated, and the political cost of applying the mechanism to startups and early-stage companies would undermine its broader credibility. Voluntary participation is encouraged through an Open Source Pledge-style recognition scheme. This also ensures that it does not actively stifle the startup/early stage organization with regulation and cost that did not inhibit its predecessors.

    Tier 1 — Base:

    Organizations with annual revenue between €10 million and €500 million contribute at a flat rate scaled to their declared open source dependency footprint, as documented in mandatory SBOM disclosures. The existing EU Cyber Resilience Act and US Executive Order SBOM mandates provide the disclosure infrastructure this tier requires. Since organizations already have an obligation to know and declare what they depend on, that declaration becomes the basis for a modest proportionate contribution.

    Tier 2 — Scaled:

    Organizations with annual revenue above €500 million contribute at a rate that scales with both their revenue band and their declared AI coding tool deployment intensity. AI deployment intensity is measured by declared usage of AI coding tools above defined thresholds, and easily measured. Developer seat counts using AI assistants, AI tool spend above an auditable minimum, or where the data is available and technically feasible, the volume of AI-assisted commits above a defined level. This tier reflects the reality that larger organizations have deeper open source dependencies and are beginning to deploy AI tools at a scale that materially affects the ecosystem.

    Tier 3 — Hyperscale:

    Organizations with annual revenue above €5 billion, particularly those deploying AI coding tools at scale across tens of thousands of developers, carry the maximum contribution obligation. The weighting formula at this tier places the greatest emphasis on AI deployment intensity, on the basis that these organizations are the primary generators of the maintainer burden the fund exists to address. They are simultaneously the greatest extractors of value from the open source commons, the primary deployers of the tools degrading it, and the parties with the greatest capacity to contribute to its maintenance. The moral and financial case converges at the hyperscale tier.

    6.4  The AI Deployment Levy

    The AI deployment intensity metric at Tiers 2 and 3 is the most novel element of the contribution structure and the one most directly connected to the paper’s argument. A contribution obligation based solely on open source dependency footprint, which existing proposals such as the Open Source Pledge approximate, does not distinguish between organizations that are worsening the maintenance burden through AI deployment and those that are not. The AI deployment levy closes this gap.

    The metric is imperfect, as all regulatory proxies are. AI coding tool usage is not uniformly distributed across development activities, and some AI-assisted contributions to open source are high quality and do not impose net costs. The metric measures deployment at scale rather than harm at the contribution level. That is because harm at the contribution level is not currently measurable. This is an acknowledged limitation that should be revisited as measurement capabilities develop. What the metric achieves is that the highest contribution obligations fall on the organizations whose AI deployment is most plausibly connected to the harm the fund is designed to mitigate. The hyperscalers and large SaaS companies deploying AI coding tools to tens of thousands of developers, that are generating the review burden that is burning out the maintainer community.

    This is the element of the proposal that will attract the most scrutiny from AI vendors and large technology companies, we are direct about it. The industry argument will be that AI deployment and open source harm are not causally linked at the individual contribution level, and that a levy based on deployment intensity is therefore arbitrary. The response is that causal attribution at the individual contribution level is precisely what the externality structure prevents. The harm is diffused across the commons and cannot be traced to individual actors. The levy is calibrated to deployment scale as the best available proxy for aggregate contribution to the harm, in the same way that environmental levies are calibrated to production volume or emissions rather than to individually attributable harm events. The standard for regulatory proxy metrics is workability and proportionality, not perfect causal attribution.

    6.5  Governance: The Independent Review Council

    The governance architecture of the fund is the element the paper treats with the greatest prescriptiveness, because it is where the risk of failure is highest and where the consequences of failure are most damaging to the fund’s legitimacy. Two failure modes must be structurally prevented rather than merely discouraged:

    1. capture by government, which would allow the fund’s resources to be diverted to politically convenient purposes rather than technically critical ones
    2. capture by industry, which would allow well-resourced lobbying to shape criticality designations in favor of commercially convenient projects rather than genuinely critical ones.

    The Independent Review Council is the mechanism by which both failure modes are structurally addressed. Its design rests on a foundational principle: neither a government nor a company should have unilateral authority over what is designated as critical infrastructure. The IRC holds sole decision-making power on criticality designation, and its independence from both sets of interests is structural.

    The IRC is composed of technical experts drawn from the security research, open source maintainer, and software engineering communities; academic representatives with relevant expertise in software security and infrastructure governance; and civil society and digital rights organizations. No active corporate directors/officers/principals and no government officials serve on the IRC. Membership is selected through an open nomination process with mandatory conflict-of-interest disclosure requirements. Term limits prevent entrenchment. The full deliberation records of every designation decision are published on the Public Transparency Register.

    The oversight structure above the IRC consists of a Statutory Framework Body, comprising government representatives from participating jurisdictions, which sets contribution obligation levels and provides the legal authority for the fund, but which has no operational authority over designation decisions or disbursements. Government participation is limited to non-voting observer seats on the Fund Operational Board. Industry participation is limited to a single, annually rotating non-voting observer seat. All operational authority rests with the IRC’s nominees on the Fund Operational Board.

    The precedents for this governance model in the open source context are established and successful. The Internet Security Research Group, which operates Let’s Encrypt and provides free TLS certificates to the majority of websites globally, functions as a public benefit corporation with transparent governance that is neither government-controlled nor industry-dominated. The Linux Foundation hosts critical infrastructure projects under a neutral governance umbrella that has sustained significant projects across organizational and political changes. Neither is a perfect template, but both demonstrate that the independent governance model is achievable and has worked at scale in the open source context.

    The fund’s designations, disbursements, and governance deliberations are all published in full on the Public Transparency Register, operated by an independent auditor appointed by the Statutory Framework Body. The register is not subject to modification by either government or industry participants. This transparency mechanism serves a dual purpose. It provides accountability for the IRC’s decisions, and it provides the public record that allows civil society, academic researchers, and affected communities to identify and challenge designation errors through the formal appeals process.

    6.6  Criticality Designation: Methodology and Safeguards

    The criticality designation methodology is designed to ground designation decisions in objective, publicly auditable criteria rather than political judgment. Three categories of evidence are assessed by the IRC for each designation decision.

    Dependency and usage metrics provide the quantitative foundation: download volumes across major package managers, dependency graph depth and breadth, and contributor concentration. Projects dependent on fewer than five active maintainers are flagged for heightened scrutiny regardless of their current stability, on the basis that the risk is prospective. These metrics draw on existing public data sources including package manager APIs, the OpenSSF Security Scorecard, and the Linux Foundation Census methodology. They are not proprietary to the fund and are independently verifiable.

    Infrastructure criticality assessment examines whether the project underpins systems in sectors formally designated as critical national infrastructure in participating jurisdictions: financial services, healthcare, defense supply chains, communications infrastructure, energy systems, and public administration. Projects that are material dependencies of systems in these sectors carry a different risk profile than projects of equivalent download volume in consumer applications, and that difference is reflected in the designation weighting.

    Maintenance health assessment evaluates the current and projected adequacy of the project’s maintenance regime against the risk it represents. A project that is currently stable but whose maintainer community is declining, whose burnout indicators are elevated, or whose security response timelines are lengthening is a candidate for proactive designation rather than reactive response to failure.

    All designation decisions are published in full, including the IRC’s reasoning, the data reviewed, dissenting views from IRC members, and the weighting applied to each assessment category. An appeals mechanism allows any organization to challenge a designation or non-designation through a panel drawn from outside the IRC membership. The appeals process does not give any single party veto power over designations; it provides a formal corrective for errors without creating a mechanism for strategic delay.

    The Full IRC and Fund structure: It is impossible to fully detail and document all possible structural and procedural functions of the proposed organization, and indeed we would not want to. The basic formation, by non-profit incentivized parties is the limit of the structural recommendation. The expectation is that the first actions of the founding body itself would be to set up the norms, procedures, and overall structure of the governing body, and the fund. This would far exceed the abilities of the policy statement, but should not be seen as a deficiency. Self-governance is a critical part of open source, and the goal of the IRC is to protect, not determine, and so the proposal matches that ethos.

    Section 7: Anticipated Objections and Responses

    This section addresses the objections most likely to be raised against the Sovereign OSS Fund by industry stakeholders, policy sceptics, and critics of regulatory intervention. The paper addresses them here rather than in an appendix because anticipating and answering objections is part of the policy argument, not a supplement to it. A proposal that cannot survive rigorous scrutiny should not be recommended; one that can should demonstrate this directly.

    7.1  ‘This is an AI tax by another name’

    This is the most predictable political attack on the proposal and the paper is direct in response. The distinction between the Sovereign OSS Fund contribution and a tax is structural and legally significant. A tax is a compulsory payment to government generating revenue allocated through the political budget process, with no defined beneficiary and no hypothecation to a specific purpose. The Sovereign OSS Fund contribution is a compulsory payment to a ring-fenced mechanism managed by an independent body outside direct government control, disbursed against publicly auditable criteria to specifically designated beneficiaries, with no government discretion over the allocation. These are different instruments both in law and in economic function.

    The contribution is better characterised as a mandatory insurance premium or a utility infrastructure levy: a defined payment into a collective mechanism that maintains shared infrastructure from which the paying party derives direct commercial benefit, scaled to the scale of that benefit. The political argument that this constitutes a tax on innovation is answered by the tier structure: a small company with minimal AI deployment pays nothing. A hyperscaler deploying AI coding tools to tens of thousands of developers, extracting billions of dollars in productivity value from open source infrastructure, and generating a proportionate share of the review burden degrading that infrastructure, pays a contribution commensurate with that footprint. Calling that contribution a tax on innovation inverts the actual relationship: it is the price of using innovation responsibly.

    Objection: ‘You are penalising AI adoption and will stifle the industry.’: The proposal does not restrict AI adoption. It requires that organisations adopting AI at scale contribute proportionately to the maintenance of the infrastructure their tools are degrading. The Tier 0 exemption ensures that startups and early-stage companies are unaffected. The AI deployment intensity weighting ensures that the heaviest obligations fall on the parties with the greatest capacity to bear them and the greatest connection to the harm. The alternative — unrestricted AI deployment that continues to degrade critical infrastructure until a systemic failure event produces emergency remediation costs far exceeding any contribution obligation — is the outcome that stifles industry.

    7.2  ‘Determining what is “critical” is too arbitrary and creates capture risk’

    This is a legitimate governance challenge, and it is answered by the IRC architecture rather than dismissed. The response rests on three elements.

    First, the designation methodology is objective and auditable. Criticality is assessed against quantifiable metrics drawn from publicly available data sources, not political judgment. The IRC’s reasoning for every designation decision is published in full. Any organisation can challenge a designation through a formal process. The arbitrariness risk is mitigated by methodology transparency; the capture risk is mitigated by governance structure.

    Second, the IRC’s independence is structural. No corporate employees and no government officials sit on the IRC. Neither the government observer seats nor the industry observer seat on the Operational Board carry voting rights on designation decisions. The membership selection process requires conflict-of-interest disclosure and operates through open nomination. None of these safeguards is individually perfect; collectively they make undue influence visible and resistible rather than merely prohibited.

    Third, the relevant comparison is not between the proposed IRC and a theoretically perfect independent body, but between the proposed IRC and the current situation: no governance, no accountability, no transparency, and de facto designation by default — where ‘criticality’ is determined entirely by whether a project’s failure has already caused enough damage to attract attention. The IRC is substantially more robust than the status quo, and the paper should make that comparison plainly rather than defending against a standard of perfection that no regulatory body meets.

    7.3 – Objection – ‘Companies in jurisdictions that implement this will be competitively disadvantaged relative to those in jurisdictions that do not.’

    The EU Cyber Resilience Act and GDPR have established that extraterritorial reach is achievable for digital regulation. Currently, any company serving EU customers or operating EU infrastructure is subject to EU rules regardless of its country of incorporation. The Sovereign OSS Fund would function on the same basis. Beyond jurisdictions, the systemic risk argument inverts the competitive disadvantage claim. A catastrophic failure in critical open source infrastructure damages all participants in the digital economy equally, regardless of jurisdiction, just as the software itself may be written in one jurisdiction and used in all. Collective maintenance of that infrastructure is not a competitive disadvantage – it removes the disadvantage outright form the voluntary structure that incentivizes free-riders. At this point it simply is a precondition for the market continuing to function at all.

    7.4 – Objection – ‘Governments cannot be trusted to manage technology infrastructure’

    The fund design anticipates this objection structurally. The government’s role is limited to:

    • setting the legal framework and contribution obligation levels through the Statutory Framework Body
    • providing annual public audit and parliamentary accountability
    • participating as a non-voting observer on the Fund Operational Board.

    Governments do not designate critical projects, do not control disbursement decisions, and do not have discretionary access to fund proceeds. The operational authority of the fund resides entirely with the IRC and the Fund Operational Board, operating under independent governance charters with full public transparency.

    The objection is a valid one if it were directed at a different model, one in which government directly administers the fund. That is not the model proposed. The proposed model is closer to the governance of Let’s Encrypt or the Linux Foundation’s hosted projects: public benefit, independent operation, transparent governance, government engagement limited to the legal framework and accountability oversight that any significant financial mechanism requires.


    Conclusion: A Duty of Care for the Digital Commons

    Open source software is infrastructure. It carries the same systemic importance to the digital economy that roads, power grids, and water systems carry to the physical one. We do not expect roads to be maintained by volunteers on the assumption that the drivers who use them will contribute freely. We do not leave the maintenance of power infrastructure to the discretion of the industrial consumers whose operations depend on it. We recognize that shared infrastructure requires shared governance and shared funding, and we build the regulatory and institutional frameworks that make that possible. The open source commons has been operating without those frameworks for two decades, sustained by a volunteer community whose goodwill is finite.

    Generative AI has not created this problem. It has arrived at a moment of particular structural vulnerability and added a new category of load to a system already under severe strain. The response to that addition cannot be to ask the volunteers who have been carrying the weight to carry more. It cannot be to convene another industry round table on voluntary commitments. It cannot be to wait for the next Log4Shell or the next XZ-Utils attack that does not fail, to focus governmental attention. The cost of that wait, in economic damage, national security exposure, and the loss of the infrastructure itself, will be orders of magnitude greater than the cost of the intervention this paper proposes.

    The Sovereign OSS Fund is a proportionate response to a documented market failure. It does not restrict AI. It does not punish innovation. It does not place government in control of the digital commons. It requires that the organizations that have built their commercial success on open source foundations, and that are now deploying tools that accelerate those foundations’ deterioration, bear a share of the cost of maintaining what they depend upon. It creates an independent governance body capable of making that determination credibly and transparently, insulated from both the lobbying power of industry and the political incentives of government. And it frames the contribution not as a penalty but as what it is: the collective insurance premium for an infrastructure risk that every participant in the digital economy shares.

    Time’s up. The governance gap in the digital commons has been visible for years, and the evidence that it is worsening is now substantial. Governments that continue to defer to voluntary industry action on this question are not being cautious; they are being negligent. The tools to address it exist. The regulatory precedents exist. The governance model exists. What has been missing is the political will to use them. This paper is an argument that the moment for that will has arrived, and that the cost of waiting for a better moment will be measured in infrastructure we no longer have.


    Bibliography — Working References

    Note: References updated in v0.2 to reflect completed primary GHArchive research. References [1]–[3] expanded to full Tidelift series; references [27]–[28] are new additions.

    [1]  Tidelift. (2021). State of the Open Source Maintainer Report 2021. Tidelift Inc.

    [2]  Tidelift. (2023). State of the Open Source Maintainer Report 2022/23. Tidelift Inc.

    [3]  Tidelift. (2024). State of the Open Source Maintainer Report 2024. Tidelift Inc.

    [4]  Greenstein, S. & Nagle, F. (2024). The Value of Open Source Software. Harvard Business School Working Paper.

    [5]  Linux Foundation. (2024). Census II of Free and Open Source Software — Application Libraries. Linux Foundation / LABO.

    [6]  Xu, Z. et al. (2025). Predicting Open Source Project Abandonment on GitHub. arXiv preprint arXiv:2507.21678.

    [7]  Coelho, J. et al. (2019). GitHub Pull Requests Abandoned and Merged: A Large-Scale Study. arXiv preprint arXiv:1906.08058.

    [8]  GitHub. (2020). Octoverse 2020: The State of Open Source. GitHub Inc.

    [9]  GitHub. (2025). Octoverse 2025: The State of Open Source. GitHub Inc.

    [10]  GitClear. (2024). Coding on Copilot: 2023 Data Suggests Coding Assistants Degrading Code Quality. GitClear Research.

    [11]  Carnegie Mellon University. (2025). [Full citation pending primary source access.] Study of code quality degradation across 807 GitHub repositories adopting Cursor, 2024–2025.

    [12]  CodeRabbit. (2024–2025). [Full citation pending primary source access.] Analysis of AI-generated vs human-authored pull request issue rates.

    [13]  METR. (2025). Measuring the Impact of AI Coding Assistants on Developer Productivity: A Randomised Controlled Trial. METR Research.

    [14]  Atlassian. (2025). State of Developer Experience 2025. Atlassian Corporation.

    [15]  EY. (2025). Work Reimagined Survey 2025. Ernst & Young Global Limited.

    [16]  IBM Institute for Business Value / Morning Consult. (2024). AI ROI: From Potential to Profit. IBM Corporation.

    [17]  Apiiro. (2025). [Full citation pending primary source access.] Security findings analysis, Fortune 50, December 2024–2025.

    [18]  Phylum. (2024–2025). [Full citation pending primary source access.] Open source malicious package analysis.

    [19]  Georgetown CSET. (2024). AI-Enabled Supply Chain Attacks on Open Source Software. Center for Security and Emerging Technology.

    [20]  OpenSSF. (2024–2025). [Full citation pending primary source access.] Security guidance on AI and supply chain risk.

    [21]  Hardin, G. (1968). The Tragedy of the Commons. Science, 162(3859), 1243–1248.

    [22]  Ostrom, E. (1990). Governing the Commons: The Evolution of Institutions for Collective Action. Cambridge University Press.

    [23]  European Commission. (2024). Regulation (EU) 2024/2847 on horizontal cybersecurity requirements for products with digital elements (Cyber Resilience Act).

    [24]  Executive Office of the President. (2021). Executive Order 14028 on Improving the Nation’s Cybersecurity. The White House.

    [25]  The White House. (2023). National Cybersecurity Strategy. Office of the National Cyber Director.

    [26]  Stack Overflow. (2025). Developer Survey 2025. Stack Overflow.

    [27]  GHArchive / Google BigQuery. (2026). Primary analysis of open source contributor activity trends, 2015–2023. Original research conducted for this paper. Methodology and query documentation in supplementary materials.

    [28]  Raman, N., Cao, M., Tsvetkov, Y., Kästner, C., & Vasilescu, B. (2020). Stress and burnout in open source: Toward finding, understanding, and mitigating unhealthy interactions. Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering: New Ideas and Emerging Results. https://doi.org/10.1145/3377816.3381732

    Research Notes — Outstanding Actions for Further Research

    Section 2 — Primary Research CompleteThe GHArchive BigQuery analysis has been completed covering 2015–2023 across a filtered set of high-longevity repositories. The 2024–2025 data will be incorporated in a subsequent revision. The forthcoming companion study tracking contributor migration versus exit is noted in Section 2.2 but is outside the scope of this paper.

    •       Extend GHArchive analysis to 2024–2025 to confirm post-2023 trajectory.

    •       Verify all quantitative figures from GHArchive primary research [27] against the supplementary data export before submission.

    •       Obtain full text of Carnegie Mellon Cursor study [11] and CodeRabbit analysis [12] for direct citation.

    •       Contact Xu et al. for year-disaggregated data from arXiv:2507.21678 as a cross-check against GHArchive primary findings.

    •       Systematic case study compilation: expand Section 2.4 to a minimum of 10 named projects with documented governance changes in response to AI contribution burden.

    •       Map EU Cyber Resilience Act SBOM obligations in detail to confirm they provide the dependency disclosure infrastructure the Tier 1 contribution model requires.

    •       Identify case law or regulatory precedents for hypothecated levies applied to digital infrastructure in EU and UK jurisdictions.

    •       Explore alignment of the critical infrastructure definition proposed in Section 6.6 with existing EU NIS2 Directive definitions and any divergences requiring explicit treatment.

    •       Develop the maintenance cost model for disbursement calculations in sufficient operational detail to be credible: what does it cost to maintain a critical project at the required standard, and what data sources underpin the estimate?

    •       Legal review of the IRC governance design: does the independence structure as proposed require specific treaty or statutory arrangements in the jurisdictions of intended implementation?

    •       Design primary research survey instrument for planned 2026 survey. Core questions: do you currently contribute to open source; have you reduced contributions over time; did you switch projects or stop entirely; what drove the decision. Design to be repeatable annually to build a longitudinal series comparable to the Tidelift survey line.

  • Future Proofing Software Development

    Future Proofing Software Development

    Preparing for the post AI order

    Executive Summary

    The software development workforce is approaching a structural crisis that is not yet visible in today’s hiring data, but is clearly legible in the demographic and educational trends accumulating beneath it. This paper argues that the current wave of junior developer redundancies, driven largely by anticipated productivity gains from artificial intelligence tools, represents a significant strategic miscalculation by organisations in both the private and public sectors. A miscalculation whose consequences will be felt most acutely between 2028 and forward, when the costs of reversing it will be substantially higher than the costs of avoiding it today.

    Four converging factors underpin this assessment. 

    AI tools are transforming the developer role from code writer to code architect, and doing so faster than organisations can adapt their workforce structures to accommodate the change. The measured productivity gains from AI in real production environments are, at present, significantly lower than executive perception suggests. The most rigorous independent study to date (Joel Becker, 2025) found that experienced developers using frontier AI tools in 2025 took 19% longer to complete tasks than without them, against a developer-perceived improvement of 20%. Organisational hiring decisions are being made at scale on the basis of anticipated gains that have not yet materialised at the level assumed. 

    The second factor is the rapid contraction of the developer education and training pipeline, as students and institutions respond rationally to the signal that entry-level employment has collapsed. US entry-level technology job postings fell 67% between 2023 and 2024, and computer science program enrollment deposits have declined by over 25% in the most recent academic cycle. 

    The third factor is the accelerating demographic ageing of the existing professional developer workforce.The proportion of professional developers aged 35 and above has grown from 31% to 39% in just three years, while the under-25 cohort has shrunk from 33% to 23% over the same period. The aging of the workforce and the subsequent large drop in new engineers is a clear sign that the system is becoming top heavy. Coupling this with the higher levels of age discrimination, and ageism beginning earlier, in software development, it creates a scenario where the industry loses its most skilled practitioners at precisely the point it can afford to the least.

    The fourth consideration is a significant and well-documented deterioration in the quality of the developer experience, a measure of developer’s work environment effectiveness and job satisfaction. Research shows this is accelerating attrition at the senior level precisely when retention of senior capability is most critical.

    These four factors are not independent. They form a compounding system in which each accelerates the others, and in which the window for low-cost intervention is closing. The developers currently displaced by junior hiring freezes will not remain available indefinitely. Research consistently shows that when entry-level professionals cannot gain a foothold in their trained field within a reasonable period, a meaningful proportion redirect their careers and employment permanently into adjacent sectors. Once lost to the pipeline, they do not return.

    There is a historical parallel that is instructive. The COBOL developer shortage, now costing financial and government institutions extraordinary sums in retention and recruitment, was the direct consequence of a previous generation’s failure to invest in succession. That shortage affected a single ageing language. The emerging shortage affects the foundational layer of all software capability, at a moment when AI implementation itself demands the most experienced engineering judgement available.

    The recommendation of this paper is straightforward, though counterintuitive in the current business climate. Organisations that continue to eliminate junior development roles in favour of a leaner, AI-augmented senior workforce are optimising for a present that is already changing, at the cost of a future in which they will find themselves competing in an open market for a talent cohort that was never adequately formed. The more resilient path is to recommit to building developer capability in-house, treating junior developers not as a cost to be eliminated but as the senior engineers and technical architects of the next decade. The organisations that maintain this discipline now, both private sector firms and public bodies responsible for critical digital infrastructure, will be the ones best positioned to capture the genuine long-term productivity gains that AI will eventually deliver, because they will have the experienced human judgement required to direct it.

    A compounding set of forces

    Generative AI and in particular the use of Agentic AI is changing the landscape of Software Development at an unprecedented pace. We have found in our research that it is the area with the highest development rate as well as the area of AI with the most adoption. While the jury is still out on the effectiveness overall of the AI tools and overall functionality, we already observe a direct effect of this adoption in the form of lower hiring and even a shift towards layoffs and force reduction driven by the anticipated gains of AI. While it feels an obvious choice for a business to reduce dependence on human workforce in favor of laborers that never sleep, the research points to a situation that businesses will be hard pressed to recover from.

    Companies never practice the sort of foresight that researchers can, and in doing so they tend to embrace the here and now because the economics of their financial cycles are tied to immediate performance, not long-term sustainability. The AI arms race is changing that scenario however, and companies would do well to look beyond today and tomorrow at the long-term scenario they are creating. In short, a reduction of junior-level staff today will lead to a massive uptick in hiring costs later, if there is even anyone left to hire. The stats back up the race to the endgame condition we are in, and even during the data science boom of the 2010’s and COBOL developer competitiveness years into the early 2020’s, we did not see this much of an imbalance on the horizon. The data is stark, showing a relative workforce age increase of 5 years on average, coming in just the last 3 years.

    The factors influencing this condition fall into 4 basic issues:

    • The real impact of AI tools on developers and the mismatch of perceived and realized gains. 
    • A rapid drop-off in training and education of developers because of the fear of being unable to obtain employment post education. 
    • The “aging out” of the majority of senior developers – in an industry where real age discrimination begins at 35, the majority of developers are now pushing 40.
    • The significant and precipitous decline in the quality of the developer experience, both the tools and platforms they use, and their working environment as a whole, for developers of all levels, and in all organizations,

    The solution is obvious, but is counterintuitive when it comes to the current business climate of reducing headcount on the back of AI. This behavior is an anti-pattern when it comes to survival in the long-term, however. Our recommendation is returning to the somewhat antiquated notion of building your employee’s careers “in house”, and fostering your entry level people of today into the seniors you will need tomorrow. By doing this through a combination of a quality work environment and a reasonable career path, you can hedge against the increasing pressures of AI adoption, and the inevitable arms race that is brewing for talent that can handle the increasingly complex and critical implementation of AI tools.

    Issue 1: The AI Productivity Question: Reality, Perception, and Consequence

    Artificial intelligence tools have arrived in software development with genuine capability. That much is not in dispute. GitHub Copilot users in controlled studies completed isolated coding tasks up to 55% faster than peers working without assistance. Surveys consistently show that between 67% and 90% of developers report perceiving productivity improvements in their daily work, and by 2025 approximately 84% of professional developers were using AI tools regularly, with 51% doing so on a daily basis. The technology has moved from experimental to embedded in the development workflow faster than almost any preceding tool category, and organisations are right to take it seriously.

    The problem is not that AI tools lack impact. The problem is the size of the gap between perceived impact and measured impact, and the consequential decisions being made in that gap.

    The most methodologically rigorous study conducted to date on this question is the METR randomised controlled trial published in July 2025, examining how early-2025 AI tools affected experienced open-source developers working on real production codebases. The developers in the study are all experienced practitioners with an average of five years of active contribution to their specific repositories. Before the study participants estimated that AI tools would reduce task completion time by approximately 24%. After completing the study, participants estimated the tools had reduced their time by around 20%. The measured reality was that using AI tools increased their completion time by 19%. The tools, in this real-world context, made experienced developers slower, not faster.

    This finding sits in deliberate tension with some of the more optimistic studies in the literature, and that tension is itself informative. Studies showing the largest productivity gains from AI, including figures of 56% and 65% improvement, were conducted using synthetic or simplified tasks specifically designed for automated testing. As the METR study authors note, such tasks are unrepresentative of most production software development work, and are likely to closely resemble training data used to develop the AI systems being evaluated, which creates a measurement advantage that does not exist in real codebases. A separate analysis of 800 developers by Uplevel found no significant gains in objective measurements such as cycle time or pull request throughput, and found that developers using AI coding assistants introduced a 41% increase in bugs, representing a measurable negative impact on code quality.

    The code quality question extends beyond individual organisations into the open source infrastructure on which the entire profession depends. Open source projects underpin an enormous proportion of production software in both the private and public sectors, and they are maintained largely by volunteers working under significant time constraints. The arrival of AI-assisted development has introduced a new and serious strain on this infrastructure that has no clear precedent.

    The most documented example is the curl project. Curl is a foundational internet data transfer tool embedded in billions of devices and as with many open source projects, it is maintained by a small team of volunteers. From early 2024 onwards, the project’s lead maintainer Daniel Stenberg documented a sustained increase in AI-generated vulnerability reports submitted via its bug bounty programme. By mid-2025, genuine vulnerability reports had declined from 15% of all submissions to approximately 5%, with the remaining 95% requiring triage time despite producing no actionable findings. In January 2026, the project shut down its bug bounty programme entirely. Stenberg described the volume of AI-generated submissions as effectively a denial-of-service attack on his maintainers’ time: a small number of trusted volunteers absorbing an unlimited volume of automated noise, each report superficially plausible enough to require human investigation before being dismissed.

    Curl is not an isolated case. Seth Larson, a security triage worker for multiple open source projects, documented an equivalent pattern across the projects he managed, noting that AI-generated reports “appear at first glance to be potentially legitimate and thus require time to refute.” His assessment of the systemic consequence was direct: maintainers experiencing this burden become burnt out and increasingly averse to legitimate security work, with low-quality reports functioning in practice as if they were malicious regardless of intent. 

    Django updated its security documentation to reject AI-generated reports outright. 

    The Node.js project imposed minimum quality score thresholds to filter automated submissions. 

    The libxml2 maintainer ended support for embargoed vulnerability reports entirely in June 2025, citing unsustainable triage burden as an unpaid volunteer. 

    Jeff Geerling, who manages over 300 open source projects, reported that the problem had become serious enough that GitHub introduced a feature to disable pull requests entirely. This is a capability that, as he noted, is a striking response given that pull requests are the fundamental mechanism that made GitHub central to collaborative software development in the first place.

    The open source signal-to-noise problem is directly related to the broader code quality argument. AI tools lower the cost of generating plausible-looking contributions without lowering the cost of evaluating them. That asymmetry is manageable when the humans producing AI-assisted contributions have sufficient understanding of the codebase to distinguish useful signal from noise before submission. It becomes structurally damaging when that understanding is absent. This is precisely the kind of contextual, codebase-specific understanding that junior developers acquire through structured development over time that is most difficult for AI tools to replicate and most consequential when it is missing. Currently, the data shows that entry-level developers are being displaced from formal employment. This is happening as AI tools make it easier to generate superficially convincing, but fundamentally uninformed contributions. Thus anyone hoping to contribute code can create code, whether it is objectively “good code” or not. This in turn causes a deterioration in the quality of the shared infrastructure that all software development ultimately depends upon.

    The organisational picture is consistent with this ambiguity. McKinsey’s survey of C-suite executives conducted in late 2024 found that only 19% reported revenue increases of more than 5% from AI investments, with 36% reporting no change at all. Only 1% of organisational leaders described their companies as mature in AI deployment. A comprehensive Danish study linking AI chatbot adoption across eleven occupations to administrative labour market records found essentially zero measurable effects on earnings or recorded working hours through the end of 2024, despite widespread worker-reported adoption. The gap between what AI tools appear to do in controlled settings and what they deliver at organisational scale is not a rounding error — it is the central empirical question of the current moment in AI adoption.

    None of this is an argument against AI in software development. It is an argument for proportionality. The Faros AI Productivity Paradox Report of 2025, drawing on real engineering metrics across organisations, found that even where team-level AI adoption drove measurable individual productivity gains, those gains were consistently absorbed by downstream bottlenecks in review, testing, and deployment pipelines. The organisation did not become more productive simply because individual developers touched more tasks. The gains require the surrounding system to be redesigned to capture them — a process that requires experienced engineering judgement to lead, and that takes time measured in years, not quarters.

    This matters enormously in the context of the hiring decisions now being made. The Octopus Deploy 2026 AI Pulse Report documents that 73% of organisations have already reduced their number of junior developers over the past two years, with an explicit “seniors with AI” strategy emerging as the dominant model. This strategy rests on the assumption that experienced developers augmented by AI tools can sustain output whilst the organisation reduces headcount at the junior level. Given the evidence on actual versus perceived productivity gains, this assumption is being made at a significant scale on the basis of optimism rather than measured performance. The Brynjolfsson et al. analysis of high-frequency US payroll data through September 2025 found that early-career workers aged 22 to 25 in the most AI-exposed occupations show relative employment declines on the order of 15 to 16%, while senior employment remains largely stable — confirming that the adjustment is not distributed evenly but is concentrated precisely in the entry-level cohort.

    The irony is sharp. Organisations are making irreversible structural decisions about their talent pipelines on the basis of productivity gains that, at the organisational level, have not yet materialised at the scale anticipated. The hype cycle around AI is real, well documented in the research literature, and directly traceable in the hiring data. The Danish study’s description of a “productivity J-curve”,  in which organisational reorganisation and task restructuring delay measurable economic effects, suggests that genuine productivity gains from AI may indeed arrive, but will do so on a timescale that the current pace of junior hiring reductions is not designed to accommodate. By the time the gains are real and stable, the pipeline that would have produced the next generation of senior developers will have been disrupted for long enough that recovery will require competing in an open market for a cohort that was never properly formed.

    It is worth also highlighting a hidden issue that will creep into organizations in the years ahead, as AI tools become the standard. Expertise in software engineering does not result simply from the accumulation of theoretical knowledge. In reality, it stems from the formation of “developer reflexes” born out of the repeated resolution of low-complexity difficulties. Historically, the tasks assigned to junior developers (writing unit tests, simple refactoring, first-level debugging or enhancements) served as an indispensable learning ground. It is by manipulating these fundamental building blocks that the human brain develops robust mental models, innovative strategies and, above all, technical intuition. Conversely, by systematically delegating these tasks to AI under the pretext of immediate time savings, organisations are inadvertently eliminating stages crucial to developing this technical instinct. We are witnessing a rupture in the learning chain: without the laborious practice of simple code, the capacity to supervise complex code withers.

    As Nicholas Carr notes in The Glass Cage, automation converts tacit knowledge, meaning, the intuitive ‘feel’ for code built through manual debugging, into explicit procedures. This cuts out a key part of the learning process, leaving juniors able to generate code but unable to build the internal mental models required to oversee complex systems.

    Automation creates a cognitive dependency trap. This phenomenon is analogous to the systematic use of GPS: by blindly following assisted directions to navigate one’s own city, one eventually loses the habit of memorising landmarks or constructing mental maps of journeys. Just as a driver becomes incapable of finding their way without assistance, the developer who delegates their logic to AI risks becoming lost when facing a complex system as soon as the tool is no longer able to guide them.

    If a future senior developer has never learned to identify the subtle nuances of a logical bug for themselves because a machine did it in their place during their early years, they will find themselves unable to exercise the critical judgment necessary to validate the outputs of more advanced AI. We risk seeing the emergence of a generation of “surface-level seniors.” These will be “assisted experts”, certainly capable of piloting intelligent agents, but unfortunately devoid of the depth of analysis required when the AI fails, hallucinates, or drifts outside the specified framework. Expertise cannot be reduced to a supervisory role. It must be understood that if the supervisor does not possess sufficient technical mastery relative to the tool, they will quickly be overwhelmed. Ultimately, experience cannot be downloaded; on the contrary, it is forged through effort and concrete learning.

    In the long term, this lack of direct confrontation with the raw material of code threatens the architectural innovation capacity of organisations. The understanding of large systems depends on an in-depth knowledge of their individual components. If the foundation of the teams expected to hold the skills and knowledge (including the ability to write code manually) is replaced by automated systems, however “intelligent” they may be, the overall coherence of the architecture becomes uncertain. Architectural decisions made by engineers who have never experienced the constraints of baseline implementation risks being disconnected from ground-level realities. It appears, therefore, indispensable to value the work of junior developers. The challenge is to preserve the integrity of the organisation’s global expertise and to ensure that the technical decision-makers of tomorrow possess the intellectual legitimacy (grounded in their lived experience as developers) necessary to direct increasingly opaque systems.

    Issue 2: The Collapse of the Developer Formation Pipeline

    Labour markets operate on signals. Prospective students, career changers, and educational institutions all make formation decisions based on their reading of where employment will be available, at what level, and at what compensation. When those signals turn negative, the pipeline does not slow gradually, it contracts sharply, and the contraction persists well beyond the point at which the underlying market conditions that caused it have begun to recover. The developer formation pipeline is currently receiving the most unambiguously negative set of signals it has experienced since the dot-com collapse of the early 2000s, and it is responding accordingly.

    The proximate cause is the collapse in entry-level hiring documented in Section 1. US entry-level technology job postings fell 67% between 2023 and 2024, according to Stanford Digital Economy Lab analysis of ADP payroll data. The UK experienced a 46% decline over the same period. Employment among the youngest professional software developers sat approximately 20% below its late-2022 peak by the end of 2024. The consequences of these figures for the formation pipeline are not theoretical. They are already visible in enrolment data, institutional behaviour, and the accelerating closure of alternative development pathways.

    The university pipeline is contracting at pace. MARKETview data, tracking the entire enrolment funnel across public, private, and selective national institutions in the US, recorded an 8% year-on-year decline in computer science programme deposit volumes in 2023-24, followed by a further decline of more than 25% in the most recent cycle. Application volumes fell sharply in 2024-25, and the yield, the proportion of admitted students who actually enrol,  has declined steadily for two consecutive years. The Computing Research Association’s own pulse survey of 130 computing departments, conducted in autumn 2025, found that 64% of responding academic units reported declining bachelor’s degree enrolments in computing, with 40% reporting declines of up to 10%, a further 29% reporting declines of 11 to 20%, and 31% reporting declines of more than 20%. Five institutions reported declines exceeding 35%, with some losing as much as 75% of their undergraduate computing student body in a single year. The University of California system reported a 6% decline in computer science(CS) majors in 2025, down 9% over two years. This is the first sustained retreat in that system since the dot-com bust, and the first time in over twenty years that the number has fallen.

    Critically, this decline is not occurring because students have lost interest in technology as a field. It is occurring because they have made a rational assessment that the traditional computer science degree no longer reliably produces the employment outcome it once promised. CS graduate unemployment reached 6.1% in 2025 against a general graduate unemployment rate of approximately 4%, placing computer science seventh highest among all college majors in unemployment — below philosophy, art history, and journalism. Federal Reserve Bank of New York data confirms the shift. Students are not abandoning technology; they are redirecting towards AI-specific programmes, data science, and cybersecurity, all of which carry more immediately legible employment signals. The US now offers 193 bachelor’s degree programmes in artificial intelligence and 310 AI master’s programmes, numbers that have grown rapidly as universities respond to student demand. This redirection will eventually produce a different kind of technology professional, but it will not produce the foundational software development capability that production systems require. Worth noting is that the results of this shift will not be seen for a decade at the earliest.

    The alternative formation routes have fared considerably worse. The coding bootcamp sector, which at its peak in the mid-2010s provided a substantial and inclusive parallel pipeline into entry-level development roles, has undergone a structural collapse. Between December 2023 and mid-2024, more than a dozen prominent bootcamps closed their doors across North America, including Codeup, Kenzie Academy, Momentum Learning, Rithm School, Epicodus, Code Fellows, Women Who Code, and Toronto’s Juno College. Southern New Hampshire University shuttered its coding programme in 2023, citing AI adoption as a direct factor in declining placement rates. The most symbolically significant closure was 2U’s December 2024 decision to exit the bootcamp sector entirely, shutting down rather than selling a portfolio of partnerships with over 50 universities that had produced more than 96,000 graduates. This marks a decision that followed a 40% drop in bootcamp enrolment and a 23% revenue decline in its alternative credentials segment. The company’s $750 million acquisition of Trilogy Education in 2019 was effectively written to zero. These closures represent not merely business failures but the elimination of accessible, non-degree pathways that had served career changers and those without access to traditional university education, disproportionately affecting groups already underrepresented in the profession.

    What makes this contraction structurally dangerous, as distinct from simply being a cyclical correction, is the timing mismatch between pipeline signals and market need. The students currently deciding not to pursue computer science degrees or bootcamp programmes are making decisions that will manifest as workforce absence in 2028 to 2032. The organisations currently eliminating junior roles will not begin to feel the downstream consequences of those decisions for a similar period. By the time the market signal reverses, as the BLS projection of 15% employment growth for software developers through 2034 suggests it will, the pipeline will have been suppressed for long enough that recovery will be limited. Degree programmes take three to four years to produce graduates. Experienced developers take five to ten years to mature from entry-level hires. Neither timeline accommodates the kind of short-cycle correction that organisations typically deploy in response to talent shortages. Salary competition and international recruitment, the standard tools of talent crisis management, will be available but at substantially elevated cost and without guaranteeing the depth of institutional and domain knowledge that internally developed talent provides.

    There is a further multiplier that the headline enrolment figures do not fully capture. Research on prior talent pipeline disruptions consistently shows that a proportion of displaced early-career professionals who cannot gain a foothold in their trained field within two to three years do not return to it when conditions improve. They redirect permanently into adjacent fields where their skills transfer. Areas like financial technology, defence and security, healthcare informatics, and industrial automation among them, where compensation and stability are competitive. The 67% collapse in junior developer hiring between 2023 and 2024 therefore does not simply delay the entry of that cohort into professional software development by one or two years. It permanently removes a fraction of them from the profession’s future supply. The size of that fraction is not yet measurable, because the disruption is recent, but historical precedent from comparable labour market dislocations suggests it is not trivial.

    Issue 3: The Ageing Out of the Senior Developer Cohort

    Every profession has a demographic centre of gravity. This is the age band where the largest concentration of experienced practitioners sits, where institutional knowledge is deepest, and where the capacity to mentor, lead, and make consequential architectural decisions is most reliably found. For software development, that centre of gravity has been shifting steadily upwards for a decade, and is now moving at a pace that the broader workforce planning conversation has not yet caught up with.

    The longitudinal data is unambiguous. Stack Overflow’s annual developer surveys, the most consistently sampled source in the field with responses from between 65,000 and 100,000 developers each year, show a profession that has aged measurably and continuously since at least 2015. In that year, the average developer age across respondents was approximately 29, and more than half of all developers were under 30. By 2016, analysis of the same survey noted that barely a quarter of practitioners had more than a decade of experience, leading commentators at the time to describe software development as a “field of newbies.” That description is no longer accurate. By 2018 and 2019, approximately three-quarters of professional developers were still under 35. By 2021, 48% of professional developers were concentrated in the 25 to 34 band. By 2022, 31% of all respondents were aged 35 and above. By 2023 that figure had grown to 35%, and by 2024 it had reached 39%. The 2025 survey confirms that 66% of professional developers now fall between 25 and 44, with the overall population measurably older than in any prior survey year.

    SlashData’s global developer population data, drawn from surveys of over 11,000 developers across 126 countries, corroborates the Stack Overflow trajectory with a sharper lens on the most recent period. Between early 2022 and early 2025 alone, the proportion of developers aged 18 to 24 fell from 33% to 23% of the global professional population. That’s a ten percentage point drop in three years. Over the same period, the 35 to 44 cohort grew from 22% to 26%. The direction of travel is consistent across every major data source, and the rate of change has accelerated since 2022.

    That demographic shift is unfolding within a profession that has historically skewed young, and the contrast with the broader workforce makes the trajectory more pronounced than the raw numbers alone suggest. The US occupational average for software developers currently sits at approximately 39 years, placing the profession at or near the median of the general workforce for the first time in its history. Globally the average remains younger, with the 25 to 34 bracket still constituting the largest cohort at around 32%, but the US figure is the relevant one for any organisation considering talent strategy in the world’s largest technology market. The Pave compensation management data cited by LeadDev is perhaps the most striking single data point in this regard: the average workforce age at large public tech firms rose from 34.3 years to 39.4 years between January 2023 and August 2025 alone, a shift of five years in under three real years. The junior hiring freeze has not merely slowed the pipeline, it has already produced a measurable and rapid demographic step-change in the workforce itself.

    This ageing trajectory is directly at odds with a well-documented but underacknowledged dimension of the technology labour market: the effective age threshold at which developers experience discrimination in hiring is considerably lower than in almost any other professional field. The US Age Discrimination in Employment Act sets its legal threshold at 40, and the EEOC’s own reporting confirms that the proportion of tech workers over 40 declined from 56% to 52% between 2014 and 2022. This was during a period when that same cohort represented 53% of the overall US workforce, meaning tech had moved from over-representing the 40-plus group to under-representing it. AARP’s 2023 survey found that 64% of workers aged 40 and above reported seeing or personally experiencing age discrimination, the highest level since AARP began tracking the question.

    The tech-specific threshold is, however, considerably earlier than the legal one. A 2021 study by the University of Gothenburg found that 35 is considered old by tech industry standards. The CWJobs survey of UK technology workers found that developers begin experiencing age-related discrimination at 29, a full twelve years earlier than the cross-industry average of 41. Further it found that by 38, colleagues consider a tech worker to have passed their peak. These are consistent findings across multiple geographies and multiple methodologies, and they carry a direct implication for the workforce ageing argument: the cohort most at risk of being structurally pushed out of the profession is precisely the 35 to 44 bracket that, as the SlashData survey shows, is now growing as a proportion of the overall developer population. The same demographic trend that is shifting the centre of gravity of the profession upward is simultaneously moving more developers into the age range where hiring discrimination becomes an active factor. The pipeline contraction described in Section 2 is not merely reducing the inflow of new talent, it is doing so at the moment when the outflow pressure on mid-career developers is also increasing.

    This matters for two distinct but compounding reasons. The first is volume. The cohort that is now concentrated in the 35 to 44 band, the developers who entered the profession in the early to mid-2010s and who now constitute its most experienced working layer, will begin transitioning out of hands-on development roles in significant numbers between approximately 2030 and 2040. The US Bureau of Labor Statistics projects approximately 129,200 annual openings for software developers through to 2034, and explicitly notes that a significant portion of those openings will arise from the need to replace workers who transfer to other occupations or exit the labour force entirely. In 2025, a record 4.2 million Americans reached age 65, and approximately 10,000 Baby Boomers are estimated to be reaching retirement age every day through to 2027. The science and engineering workforce, of which software development forms a substantial part, shows labour force participation beginning to decline meaningfully between ages 55 and 60, with a marked reduction by 65. The retirement wave that has been forecast for years in the broader knowledge economy is not a future event. It is occurring now, and its heaviest impacts on the software development profession are approximately five to ten years away.

    The second reason is less about volume and more about depth. Senior developers carry something that cannot be acquired quickly and cannot be transferred through documentation alone: the accumulated understanding of why systems are built the way they are, how architectural decisions made years ago interact with present requirements, and where the institutional knowledge of an organisation’s technical landscape actually lives. This is not a soft concern. It is a well-documented and quantifiable risk. Research consistently shows that fewer than 30% of organisations have a formal knowledge retention plan in place, and only 19% have formal succession plans at all. The consequence of that absence becomes visible at the moment experienced practitioners leave, and the knowledge gap is discovered rather than planned for. The NASA Apollo programme provides perhaps the most cited illustration of this dynamic: when engineers who had worked on the Saturn V rocket retired without structured knowledge transfer, the blueprints for the most powerful rocket ever built were effectively lost, and the capability had to be rebuilt from scratch at enormous cost decades later. The lesson was not learned at the organisational level. In 2025, the US Social Security Administration announced a three-year, one billion dollar AI-assisted programme to modernise its legacy COBOL codebase. It did this, not because the technology had failed, but because the human expertise required to maintain it was retiring faster than it could be replaced.

    The COBOL situation warrants extended attention in this context because it is not an aberration. It is a preview. Approximately 220 billion lines of COBOL code remain in active operation globally, powering 43% of American banking systems, 95% of ATM transactions, and the majority of government benefits processing across multiple countries. The average age of a COBOL programmer today is between 55 and 60, and approximately 10% of the remaining COBOL-literate workforce retires each year. Around 70% of universities stopped teaching COBOL decades ago, assuming the language would become obsolete. The resulting shortage has driven COBOL specialist salaries to between $121,000 and $150,000 annually, with 60% of organisations that still rely on COBOL reporting that finding skilled developers is their single biggest operational challenge. During the COVID-19 pandemic, the state of New Jersey made a public appeal for COBOL programmers to help process an unprecedented surge in unemployment claims. It was not because the state’s systems had broken down, but because there were not enough people alive who understood them well enough to extend them under pressure. Researchers at the Dutch national institute for mathematics and computer science have characterised their country’s equivalent situation as a “knowledge crisis,” noting that the real problem is not the programming language itself but the loss of system knowledge. It is the loss of the understanding of decades of accumulated business logic, regulatory adaptation, and organisational context that retires with the developers who hold it.

    The parallel to the current moment in general software development is direct and uncomfortable. COBOL’s crisis was the product of a single failure mode: a generation of practitioners was allowed to reach retirement without succession planning or structured knowledge transfer, in a field that educational institutions had stopped feeding. The emerging crisis in mainstream software development is the product of that same failure mode operating simultaneously across a much larger and more strategically critical domain. The difference is scale, not kind. The organisations now eliminating junior development roles are not simply reducing headcount. They are removing the mechanism by which institutional technical knowledge passes from one generation of practitioners to the next. When the senior cohort that currently holds that knowledge moves on, as demographic data confirms it will in increasing numbers from the late 2020s onwards, the knowledge will not transfer itself. It will simply be lost, at a cost that will be measured not in recruitment fees but in the ability to maintain, extend, and direct the AI-augmented systems that organisations will by then have become entirely dependent upon.

    Issue 4: The Decline of the Developer Experience

    Of the four factors driving the emerging talent crisis, the deterioration of the developer experience within organisations is simultaneously the least discussed in public policy terms and the most immediately actionable. It is also the factor with the most direct bearing on retention. It is the primary mechanism by which organisations either hold on to the experienced developers they currently have, or lose them into a market that will soon be competing intensely for exactly that cohort.

    Developer experience, in the research literature, encompasses everything that shapes a developer’s daily working life: the quality of tooling, the degree of autonomy over their own work, the amount of time available for focused technical contribution versus administrative overhead, the clarity of direction from leadership, the health of the codebase they inherit, and the organisational culture within which all of this operates. It is not a soft metric. McKinsey’s Developer Velocity Index research found that organisations in the top quartile of developer experience outperformed those in the bottom quartile by up to five times in revenue growth. Gartner’s 2024 research found that teams with high-quality developer experiences are 20% more likely to retain their talent. The business case is not ambiguous, and yet the data on the current state of developer experience across organisations is deeply concerning.

    The most comprehensive recent study is the State of Developer Experience Report, produced jointly by Atlassian and DX, drawing on surveys of 1,250 engineering leaders across the US, Germany, France, and Australia, and 900 developers globally. Its headline finding is stark: 69% of developers report losing eight or more hours per week to inefficiencies. That is 20% of total working capacity, or one full working day in every five, consumed not by productive development work but by the accumulated friction of poor tooling, technical debt, insufficient documentation, and unclear direction. For an organisation with 1,000 developers earning an average of $100,000 annually, the arithmetic produces a loss of approximately $18.5 million per year in wasted capacity. The principal causes identified by developers themselves are technical debt, followed by insufficient documentation, complex build processes, and lack of focused time. These are not novel problems, they are the predictable consequence of years of underinvestment in engineering foundations in favour of feature velocity.

    The technical debt dimension deserves particular attention because it compounds directly with the ageing and attrition dynamics described in Section 3. Technical debt in US organisations alone was estimated at $1.52 trillion in 2022 by the Consortium for Information and Software Quality. Some 91% of CTOs identified technical debt as their biggest operational challenge heading into 2024, according to STX Next’s Global CTO Study. McKinsey research indicates that CIOs estimate technical debt amounts to between 20% and 40% of the total value of their technology estate, with approximately 30% reporting that more than a fifth of their technology budget nominally allocated to new development is actually diverted to resolving issues arising from it. Technical debt does not remain static. It accumulates. And it accumulates fastest when the experienced developers who understand the codebase well enough to manage it strategically are stretched across expanded responsibilities following headcount reductions, or are absent because they have left.

    The layoff cycles of 2022 to 2025 have directly worsened this dynamic. LeadDev’s Engineering Leadership Report 2025, surveying 617 engineering leaders and developers, found that 65% reported expanded responsibilities following recent workforce reductions, with 40% managing more direct reports than previously. Only 3% reported a decrease in scope. The same survey found that 22% of respondents were experiencing critical levels of burnout, with a further 24% moderately burned out. This means nearly half of the engineering leadership population was operating under significant stress. The 2024 Stack Overflow survey captured a data point that had not appeared in prior years: for the first time, senior developers reported lower job satisfaction than junior developers. The profession’s most experienced practitioners, precisely the people whose retention is most critical, are the least satisfied with their working conditions.

    The Haystack Analytics study found that 83% of developers report experiencing burnout. The LeadDev survey found that 40% of engineering leaders noted their teams were less motivated than a year previously. A 2025 survey cited in the JetBrains State of the Developer Ecosystem Report found that 66% of developers do not believe current organisational metrics accurately reflect their real contributions. This is a finding that speaks to a profound disconnect between how developers experience their own work and how leadership perceives and measures it. That disconnect is not merely a morale concern. It is a structural risk, because developers who feel their work is neither understood nor fairly measured by their organisations are the most likely to leave those organisations when the labour market improves which, as the BLS projections confirm, it will.

    The interaction between poor developer experience and the junior hiring freeze deserves explicit attention. One of the most consistently documented benefits of investing in junior developers is the forcing function it creates on developer experience itself. Onboarding a new developer requires that processes be documented, codebases be navigable, tooling be coherent, and mentoring structures exist. Organisations that maintain a flow of junior developers into their teams are, whether they recognise it or not, continuously incentivised to keep their technical foundations in reasonable order. Organisations that eliminate junior hiring remove that incentive entirely. Technical debt accumulates uninhibited. Documentation degrades. Onboarding processes atrophy. The codebase becomes progressively less navigable to anyone other than the small number of senior developers who already carry its institutional history in their heads — until those developers leave, at which point the knowledge gap becomes an operational emergency rather than a planning problem.

    The 2025 State of Developer Experience report found that 63% of developers consider developer experience to be either important or very important when deciding whether to remain in their current role, and that nearly two in three would consider leaving a role if dissatisfied with their developer experience. Some 86% of engineering leaders agreed that attracting and retaining top talent will be difficult without meaningful improvement in developer experience. The gap between that stated recognition and the actual investment being made to address it is where the fourth factor becomes an accelerant of the other three. Poor developer experience drives senior attrition. Senior attrition concentrates institutional knowledge risk. Institutional knowledge risk becomes critical at exactly the moment when the formation pipeline, already suppressed by the forces described in Section 2, fails to produce the replacements needed to address it.

    Conclusion: The Counterintuitive Imperative — Building the Future In-House

    The four factors examined in this paper do not operate independently. They form an interlocking system, and the interactions between them lies the real danger. AI tools are driving a hiring freeze on junior developers. That hiring freeze is sending a market signal that is already suppressing the formation pipeline. The formation pipeline contraction will reduce the future supply of experienced developers at exactly the moment when the current senior cohort begins aging out in significant numbers. And across all of this, the deteriorating quality of the developer experience within organisations is accelerating the attrition of the experienced practitioners who are, right now, the only buffer between present stability and future crisis. Each factor makes every other factor worse. The system is moving in one direction, and it is not self-correcting.

    The timeline is the critical point for any organisation seeking to act. The consequences of the decisions being made today will not arrive as an emergency in the next quarterly earnings cycle. They will arrive quietly, between 2028 and 2034, as a realization that the experienced developers an organisation needs to direct and maintain its increasingly complex AI-augmented systems are simply not available. To be clear, it is not because those developers do not exist somewhere, but because the pipeline that should have been forming them for the past five years was deliberately closed. At that point, the organisation’s options reduce to three: pay a significant salary premium to compete for a constrained supply of experienced developers in an open market; recruit internationally at comparable cost and with no guarantee of institutional fit or domain knowledge; or attempt to rebuild internal capability from scratch, which carries both a multi-year timeline and the overhead of doing so from a degraded technical foundation encumbered by years of accumulated debt.

    None of those options is cost-free. None of them is fast. None of them delivers the depth of institutional knowledge, contextual understanding, and organisational alignment that comes from developing talent internally over time. The organisations that will be best positioned in 2030 and beyond are the ones that treated the current period not as an opportunity to reduce headcount but as an opportunity to build the bench they will need when the arms race begins in earnest.

    The recommendation of this paper is therefore straightforward in principle, even if it requires significant resolve to execute against the prevailing business climate. Organisations, both private sector firms and public bodies responsible for critical digital infrastructure, should recommit to the structured development of junior and early-career developers as a core strategic investment, not a discretionary cost. This means maintaining a deliberate inflow of entry-level talent into development teams even when AI tools appear to reduce the immediate operational need for them. It means creating structured career pathways that develop early-career developers into the mid-level and senior practitioners the organisation will need in five to ten years. It means investing in the developer experience conditions — manageable technical debt, coherent tooling, documented codebases, and protected time for deep work — that make those career pathways viable and that retain the senior developers already in place. And it means treating knowledge transfer from senior to junior developers not as an optional cultural nicety but as a formal operational continuity practice, with the same organisational seriousness applied to any other critical infrastructure risk.

    The COBOL parallel, examined in Section 3, is instructive about what this looks like when it goes wrong at scale. Organisations and governments that failed to maintain succession in that domain are now spending extraordinary sums to maintain systems they no longer fully understand, at the mercy of a talent market they no longer influence. The Social Security Administration’s one billion dollar AI-assisted modernisation programme is not the cost of upgrading old technology. It is the cost of a generation of succession planning failures, now coming due. The emerging equivalent across the broader software development profession carries the potential to be an order of magnitude larger in its impact, because the systems at stake are not legacy administrative platforms but the AI-augmented production systems on which every sector of the modern economy will depend.

    There is a further dimension that applies with particular force to public sector organisations. Private sector firms operating in competitive markets will eventually respond to talent shortages through compensation pressure. The response will manifest expensively, but it will be responsive. Public sector bodies typically cannot move at that speed. Government agencies, public health systems, defense establishments, and public utilities that allow their internal development capability to atrophy in the current period will find themselves structurally disadvantaged in any future talent market recovery, because they will be competing on compensation terms they cannot match and on employer brand propositions that the private sector will always be able to outbid. The organisations within the public sector that will retain digital resilience in the decade ahead are those that recognise now that their talent pipeline is a public infrastructure asset, not an administrative overhead, and invest in it accordingly.

    The argument for returning to in-house talent development is sometimes characterised as nostalgic. It is seen as a preference for an older and simpler model of workforce development in an era when AI has changed everything. That characterisation misreads both the evidence and the moment. The case for building talent internally is not nostalgic. It is adaptive. The organisations that will most effectively harness the genuine long-term productivity gains that AI will eventually deliver are not those that have reduced their human development capability to a skeleton crew of senior practitioners. They are the ones that have maintained the human formation pipeline through which the understanding, judgement, and institutional knowledge required to direct AI systems effectively is created and transmitted. AI does not eliminate the need for experienced human developers. It raises the stakes for having them.

    The data assembled in this paper points to a window that remains open, but is closing. The formation pipeline has contracted sharply but has not yet collapsed irreversibly. The senior cohort is aging but has not yet departed in the volumes that will define the crisis. The developer experience is deteriorating but has not yet driven the wave of senior attrition that poor conditions ultimately produce. There is still time for organisations that choose to act to do so from a position of relative strength, building the capability they will need before the market forces them to compete for it. That window will not remain open indefinitely. The organisations that act in the next two to three years will be the ones setting the terms of the talent market in 2030. Those that wait for the crisis to arrive before responding will be competing on terms they did not choose and cannot control.

    The choice, at its core, is between building and buying. Building is available now, at manageable cost, through the deliberate and sustained investment in developing junior developers into the senior practitioners of the next decade. Buying will be available later, at substantially higher cost, in a market shaped by the collective failure of organisations to build. The research is unambiguous about which of those two futures is preferable. The only question is whether organisations will act on that research before the preferable future is no longer available to them.


    List of Cited Works

    Section 1: The AI Productivity Question

    METR — Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity (July 2025) Joel Becker, David Rein, Nate Rush, Elizabeth Barnes. 2025. “Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity.” Research Paper. METR, July. https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/. 

    Uplevel — AI Coding Assistant Study: 800 Developers, Copilot Bug Rate and Productivity Analysis (2024)

    Faros AI — The AI Productivity Paradox Report (June 2025)

    Brynjolfsson, E., Chandar, B., and Chen, R. — Canaries in the Coal Mine: Early-Career Workers in AI-Exposed Occupations (2025). ADP high-frequency payroll microdata analysis through September 2025.

    Humlum, A. and Vestergaard, E. — AI Chatbot Adoption and Labour Market Outcomes: Evidence from Danish Administrative Records (2025). Difference-in-differences study across 11 occupations through December 2024.

    Doron Yeverechyahu, Raveesh Mayya, and Gal Oestreicher-Singer — GitHub Copilot Rollout Study: Commits, Pull Requests and Code Contributions (2024).

    Industry Reports and Surveys

    McKinsey & Company — AI in the Workplace: A Report for 2025. US CxO Survey, October–November 2024 (3,613 employees, 238 C-level executives).

    Octopus Deploy — AI Pulse Report 2026: Senior Developer Shortage as AI Reshapes Hiring and Productivity.

    GitHub / Microsoft and MIT Sloan School of Management — AI Developer Tools: Productivity Impact Study (500+ developers, August 2024).

    Stack Overflow — Developer Survey 2024 and 2025 (65,000+ and 49,000+ respondents respectively).

    Open Source Infrastructure and Code Quality

    Daniel Stenberg (curl project maintainer) — Bug Bounty Programme Shutdown Statement (January 2026). Public posts at daniel.haxx.se.

    Seth Larson — AI-Generated Security Reports and the Burden on Open Source Maintainers. Published writing, 2025.

    Django Security Team — Updated Security Reporting Policy Rejecting AI-Generated Reports (2025).

    Node.js Security Team — Minimum Quality Score Thresholds for Vulnerability Reports (2025).

    Nick Wellnhofer (libxml2) — End of Embargoed Vulnerability Reports Statement (June 2025).

    Jeff Geerling — AI Slop and the Collapse of Open Source Pull Request Quality (2025). Public writing.

    Section 2: The Education and Training Pipeline

    Entry-Level Hiring Collapse

    Stanford Digital Economy Lab — Analysis of ADP Payroll Data: 67% Decline in US Entry-Level Technology Job Postings, 2023–2024.

    Brynjolfsson, E., Chandar, B., and Chen, R. — Canaries in the Coal Mine (see Section 1). Early-career employment declines of 15–16% in AI-exposed occupations.

    Layoffs.fyi — Tech Layoff Tracking Data: 260,000+ layoffs in 2023, 150,000+ in 2024.

    The Glass Cage (Blog): https://fs.blog/the-glass-cage-nicholas-carr/ 

    Enrolment and Degree Completion

    MARKETview — Why is Computer Science Enrollment Declining… and What Comes Next? Data showing 8% YoY deposit decline in 2023–24, followed by 25%+ decline in 2024–25.

    National Center for Education Statistics (NCES) — CS Bachelor’s Degree Completions Data: 51,696 in 2013–14 rising to 112,720 in 2022–23.

    Computing Research Association (CRA) — Pulse Survey on CS Graduate Enrolment (2024).

    University of California System — Computer Science Undergraduate Enrolment Data (2024).

    Bootcamp and Alternative Pipeline

    Momentum Learning — Closure Announcement Citing AI Impact on Job Placement Rates (2024).

    Launch Academy — Closure Announcement Citing AI Impact on Job Placement Rates (2024).

    2U — Financial difficulties and programme closures (2024).

    Graduate Employment

    Federal Reserve / Burning Glass Institute — CS Graduate Unemployment Rate: 6.1% in 2025 versus 3.6% overall rate.

    Section 3: The Ageing Workforce

    Global Developer Demographics

    SlashData — Global Developer Population Trends 2025 (Q1 2025). 18–24 cohort: 33% in 2022 to 23% in 2025. 35–44 cohort: 22% to 26%.

    Stack Overflow — Annual Developer Survey, Longitudinal Data 2015–2025. Respondents aged 35+ grew from 31% (2022) to 35% (2023) to 39% (2024).

    Evans Data Corporation — Worldwide Developer Population and Demographic Study, 34th Edition (2023–2028).

    US-Specific Workforce Ageing

    Pave Compensation Data (via LeadDev) — Average age at large public technology firms rose from 34.3 to 39.4 between January 2023 and August 2025.

    Lemon.io — US Developer Age Statistics: Average 39.2 years (male), 38.5 years (female).

    US Bureau of Labor Statistics (BLS) — Occupational Outlook Handbook: Software Developers, Quality Assurance Analysts, and Testers. 15% employment growth projected 2024–2034; 129,200 annual openings.

    National Science Foundation — Age and Retirement of the Science and Engineering Workforce (2018). Labour force participation among S&E workers aged 60–64 rose from 69% (1993) to 74% (2015).

    Age Discrimination

    US Equal Employment Opportunity Commission (EEOC) — Technology Sector Workforce Report (late 2024). Workers aged 40+ in tech declined from 56% (2014) to 52% (2022); overall US workforce 53% over 40.

    AARP — Age Discrimination Survey (2023). 64% of workers aged 40+ report experiencing age discrimination, the highest rate since tracking began in 2003.

    University of Gothenburg — Age Discrimination in the Tech Industry: The 35-Year Threshold (2021).

    CWJobs — Age Discrimination in UK Tech Survey. Discrimination reported beginning at age 29, considered ‘over the hill’ by 38.

    COBOL Parallel

    Various industry sources — COBOL: 220 billion lines in production; 43% of banking systems; 95% of ATM transactions. Average COBOL developer age: 55–60.

    Section 4: Developer Experience Decline

    Atlassian / DX — State of Developer Experience Report 2024. 69% of developers lose 8+ hours weekly to inefficiencies; estimated $18.5M annual loss per 1,000-developer organisation.

    Consortium for Information and Software Quality (CISQ) — Cost of Poor Software Quality in the US (2022). US technical debt estimated at $1.52 trillion.

    STX Next — CTO Survey: 91% cite technical debt as their biggest challenge.

    McKinsey & Company — Technical Debt: Tackling the Hidden IT Problem (2022). Technical debt represents 20–40% of technology estate value; 30% of CIOs report more than 20% of new product budget diverted to debt resolution.

    LeadDev — Engineering Leadership Report 2025 (617 respondents). Post-layoff: 65% report expanded responsibilities, 40% managing more direct reports; 22% critical burnout, 24% moderate burnout.

    Haystack Analytics — Developer Burnout Survey. 83% of developers report experiencing burnout.

    Stack Overflow — Developer Survey 2024. First year in which senior developers report lower job satisfaction than junior developers.

    JetBrains — State of Developer Ecosystem 2025. 66% of developers believe current metrics do not reflect their true contributions; 63% consider developer experience important or very important for retention decisions.

    Note on sources

    Where a source exists in both a primary form (such as a researcher’s own published writing or an official organisational report) and in secondary coverage (such as trade press reporting), the primary source is listed above. Secondary sources consulted during research included reporting from The Register, BleepingComputer, LeadDev, and Computerworld, which provided corroborating coverage for several findings, particularly in the open source infrastructure and age discrimination sections.