Open source decline and the need for government intervention

The Accelerating Decline of the Open Source Ecosystem


How Generative AI is Compounding a Structural Crisis and Why Governments Must Act

Executive Summary

Open source software is the foundational infrastructure of the global digital economy. Estimates of its total economic value exceed $8.8 trillion. It underpins financial systems, healthcare networks, national security infrastructure, and the majority of commercial software products worldwide. It is maintained, overwhelmingly, by unpaid volunteers.

This paper argues that this arrangement was already structurally unsustainable before the emergence of generative AI tools, and that those tools have now materially accelerated this pre-existing decline. There is longitudinal evidence of a rising trend in open source project abandonment and contributor decline that pre-dates both the COVID-19 disruption and the AI adoption wave. This baseline goes a long way to establishing that the ecosystem was deteriorating under the weight of years of corporate free-riding on unpaid maintainer labor. Against this baseline, we assess three mechanisms through which generative AI is compounding the crisis:

  • the degradation of contribution quality and the associated acceleration of maintainer burnout
  • a novel and rapidly expanding security attack surface
  • the structural invisibility of both harms in the ROI framework organizations use, and through which AI adoption is currently evaluated and justified.

The logical upshot of this data is that we make a regulatory argument. The costs of AI misuse in the open source ecosystem are external, that is they are invisible to the measurement frameworks of the organizations generating them. Thus they are forced to be absorbed by a commons with no mechanism to charge the parties responsible. The open source commons has been given a decade to demonstrate otherwise, but it has shown that voluntary correction cannot be relied upon. The voluntary participation rate among open source users in the largest funding program is a staggeringly low 0.001 per cent. Put bluntly, the voluntary model has overwhelmingly and unambiguously failed.

The irony at the center of this crisis is not incidental, it is structural. The artificial intelligence industry, which has grown faster and extracted more commercial value from open source software than any preceding wave of technology adoption, is built almost entirely on open source foundations. The training infrastructure that powers large language models runs on open source frameworks. The model architectures are derived from openly published research with open source reference implementations. The data pipelines, evaluation harnesses, fine-tuning libraries, and deployment tooling are, with negligible exception, open source projects maintained by the same volunteer and underfunded contributor communities this paper documents as being in decline. AI companies did not merely benefit incidentally from the open source commons. They consumed it at scale, systematically, as a precondition of their existence. They are now, through the AI coding tools they sell into the developer market, one of the primary vectors for the contribution noise, quality degradation, and maintainer burnout that this paper documents as accelerating the commons’ deterioration. The industry that most depends on the commons is among those most actively degrading it. That is not a peripheral observation. It is the central justification for the policy response this paper proposes.

That proposal is the creation of a Sovereign OSS Fund. It would be a mandatory contribution mechanism, scaled proportionately to commercial benefit and AI deployment intensity. This mechanism would be similar to the existing models of the IMF/EU Central Bank and Federal Reserve Bank for fininancial regulation and maintenance. The Sovereign OSS Fund would be governed by an independent body operating outside both government and industry control, designed to function as a collective insurance pool against the systemic risk of critical digital infrastructure failure. The fund is not a tax. It is the price of operating responsibly in a shared digital economy whose foundations are disintegrating.


Introduction: The Open Source Bargain and Its Breaking Point

There is an implicit bargain at the foundation of the modern digital economy. Millions of software developers, working largely without payment, maintain the libraries, frameworks, and tools upon which the world’s largest technology companies have built products worth trillions of dollars. Those companies take the outputs freely, use them in their commercial products, and in theory, contribute back to the projects they benefit from. In practice, this translates either to contributing minimally or not at all to the maintenance of the infrastructure they depend upon. This arrangement has persisted for two decades, founded on the assumption that the goodwill of volunteer maintainers is an inexhaustible resource. It is not.

The signs of structural strain have been visible for years. Burnout rates among open source maintainers have been consistently above 50 per cent in every survey conducted since 2015. Critical infrastructure projects have been maintained by single individuals, or small groups, with no succession planning, no institutional support, and no compensation commensurate with the systems depending on their work. The Log4Shell vulnerability in 2021 exposed this dependency to the world. It was a critical flaw in a library maintained by two volunteers, that threatened hundreds of millions of systems globally. Estimates put the cost to the private sector at an estimated $100 million or more in remediation. It received the attention of governments and security agencies worldwide, and many expected corporations to take from that episode. It revealed in stark terms that they had a duty of care to the infrastructure they depended upon. Unfortunately, as is so often the case when the cost is invisible to the bottom line, this lesson was lost on the organizations in question. The voluntary participation rate in open source funding programs did not materially improve.

Into this deteriorating landscape emerges generative AI, and the advent of the “citizen developer”. Beginning in late 2022, AI coding assistants became widely available to enterprise software developers and hobbyists alike, and their adoption has been rapid. These tools are capable of generating code, identifying bugs, and drafting documentation at unprecedented scale and speed. Critically, as the tools have improved, they have also become capable of autonomously submitting contributions to open source projects in the form of pull requests and issue reports. Their deployment at scale by large commercial organizations, and ease-of-use by amateur coders has introduced a new category of stress into a system already under strain. Maintainers now face a flood of AI-generated contributions requiring human review, often containing higher error rates, and more subtle and hard to identify errors, than human-authored code. The errors are part of the tools themselves, hallucinations, and security vulnerabilities that these AI models often produce. Unfortunately they are also arriving faster than the volunteer maintainer community can process them. Several prominent projects have already changed their governance structures or closed to external contributions in response. Others have been retired, or effectively closed their programs entirely.

At it’s core, and based on all these factors, this paper makes three basic supporting arguments for its ultimate policy recommendation:

  1. The decline of the open source ecosystem is not a new phenomenon caused by AI, but a pre-existing structural failure that AI is now meaningfully accelerating. The distinction matters for policy: the appropriate response to an AI-caused problem is AI regulation; the appropriate response to a structural market failure that AI is worsening is structural market reform.
  2. The harm AI is causing to the open source commons is currently invisible to the corporate decision-making frameworks through which AI adoption is evaluated, and that this invisibility is the foundational reason why voluntary self-regulation cannot be expected to produce adequate results.
  3. That governments have allowed this situation to develop through a combination of deference to industry self-regulation and a failure of their own measurement frameworks, and that the time for that deference has passed.

The policy proposal at the center of this paper, the creation of the Sovereign OSS Fund, is designed as a structural response to a structural problem. It does not restrict AI adoption, nor does it seek to regulate it. Instead, it requires that organizations adopting AI at scale bear, proportionately, the costs of maintaining the underlying infrastructure their adoption is degrading. It is governed independently of both government and industry to prevent the capture that has historically undermined similar initiatives. And it is framed, deliberately, not as a punitive instrument but as a collective insurance pool against a systemic risk that every participant in the digital economy shares.


Argument 1: The Pre-Existing Fragility — A Decade of Structural Deterioration

Any accurate account of what generative AI is doing to open source must begin before AI. The argument that AI is the cause of the ecosystem’s current difficulties is both analytically incorrect and politically misleading. It locates the problem in a disruptive technology, rather than in the structural conditions that made the technology’s impact so damaging. The ecosystem was already failing, with meaningful declines in participation and volume of activity on projects already in evidence since well before the advent of AI tools. Understanding why the system is in such a state, requires examining the last two decades of a system of dependency without reciprocity.

1.1  The Scale of the Dependency and the Vacancy of the Contribution

The economic value of open source software is difficult to overstate. A 2024 study from Harvard Business School estimated the total value of freely available open source code at $8.8 trillion. It concluded that if companies were required to replicate that software using proprietary development, the cost would be 3.5 times higher than current expenditure on commercial software globally. Just south of $30 trillion, or the entire GDP of the United States to find an applicable comparison. To say the impact of the loss of this infrastructure would be seismic is perhaps the understatement of all time.

The maintenance of this infrastructure is provided overwhelmingly by unpaid volunteers. The 2024 Tidelift State of the Open Source Maintainer Report found that 60% of maintainers describe themselves as unpaid hobbyists. The Linux Foundation’s Census II study found that 136 individual developers were responsible for more than 80 per cent of the code committed to the top 50 open source packages. These are not secondary utilities. They include the cryptographic libraries, authentication frameworks, network protocols, and data processing tools upon which the majority of commercial software infrastructure is built.

The contribution gap, the distance between the value extracted and the maintenance investment returned, is the structural condition from which everything else in this paper follows. It is not new, and it is not narrowing. GitHub Sponsors, the largest voluntary funding mechanism for open source maintainers, has grown to approximately 4,200 corporate participants out of an estimated 300 million companies worldwide that use open source in commercial products. This is a voluntary participation rate of approximately 0.001 per cent. The voluntary model has had a decade to demonstrate that the private sector will voluntarily maintain the commons it depends upon. It has not done so.

1.2  The Human Cost: Burnout and the Exit of Experienced Maintainers

The sustainability crisis is not primarily a financial one, though finance is a component. It is a human one. Maintainers are leaving open source projects, not because they have been offered better employment elsewhere, but because the work has become untenable. The 2024 Tidelift survey found that 60 per cent of maintainers had quit or seriously considered quitting due to burnout, a figure statistically unchanged from the prior year’s 58 per cent despite sustained public awareness of the problem following Log4Shell.

The causes of burnout are consistent across surveys and qualitative accounts: the volume of inbound issue reports and pull requests; the expectation of responsiveness from commercial users of projects maintained in personal time. The exposure to harassment and entitlement from users who treat open source maintainers as unpaid support staff for commercial products. And not least of all, the psychological weight of knowing that critical infrastructure used by millions of people depends on one’s continued personal availability. These are not problems that can be solved by occasional financial contributions. They require structural changes to how the open source commons is governed and resourced.

The loss of experienced maintainers has a compounding effect that is structurally more damaging than the headline burnout figures suggest. When a maintainer with years of context about a project’s architecture, security considerations, and dependency graph leaves, that knowledge is generally not recorded. The project does not produce a transition document. The next maintainer, if there is one, begins with a fraction of the institutional knowledge that left, and a learning curve to understand the software as it is today. This is the mechanism by which the single-maintainer dependency problem becomes a security vulnerability: not when the maintainer burns out, but when the replacement, or the absence of any replacement, creates a gap that determined adversaries can exploit. The XZ-Utils incident, in which a sophisticated multi-year social engineering campaign nearly installed a backdoor in the majority of the world’s Linux distributions through a single maintainer’s exhaustion and consequent susceptibility to a seemingly helpful new contributor, is the clearest illustration of where this leads.

1.3  The Concentration Problem: Single Points of Failure at Scale

The Linux Foundation’s Census II methodology identified a specific and underappreciated dimension of the fragility: it is not simply that open source is under-resourced overall, but that the under-resourcing is extreme at precisely the most critical points. The finding that 136 developers are responsible for more than 80 per cent of contributions to the top 50 packages means that the global digital economy has, in practical terms, approximately 136 single points of failure in its most critical software infrastructure. This is before any consideration of the long tail of smaller but still critical projects below that top tier.

The concentration problem is made worse by the invisibility of dependency relationships to the organizations that have them. A financial services firm with a multi-billion-dollar technology budget may have no awareness that one of its core authentication systems depends on a library maintained by a single individual in their personal time, using personal equipment, with no institutional support and no succession plan. The SBOM (Software Bill of Materials) mandates introduced by the Biden administration’s 2021 Executive Order on Cybersecurity were designed, in part, to address this visibility gap. They are a necessary but far from sufficient response to the underlying risk.

1.4  What Corporate Engagement Has Looked Like in Practice

It would be inaccurate to suggest that corporations have done nothing. Several large technology companies have established open source program offices, contributed employees to the maintenance of critical projects, and made financial commitments to open source foundations. These efforts are real, and do make a meaningful difference. They are also structurally insufficient to stem the bleeding of the ecosystem as a whole. The data clearly shows that they concentrated among a small number of technology companies, weighted toward projects of direct commercial relevance to those specific companies’ products, and disconnected from the long tail of critical infrastructure that receives no equivalent support.

The pattern that emerges from the evidence is one of extractive dependency. Corporations that have built significant portions of their product value on open source foundations, and that have not reciprocated at a level commensurate with that extraction. The Harvard HBS figure of $8.8 trillion in value, time 3.5x replacement costs, is a measure of what those corporations would need to spend to replicate what they currently receive for free. Against that figure, the aggregate corporate investment in open source maintenance is vanishingly small. This is not primarily a result of bad faith, no one wants to actively kill off open source. It is a result of a measurement framework that accounts for the value received but not the cost of providing it, and an incentive structure that rewards free-riding as long as the commons holds.

Argument 2: The Longitudinal Abandonment Trend — A Pre-Existing and Worsening Crisis

Establishing that open source project abandonment was rising before the AI adoption wave is the central evidentiary task of this paper. It is also the task on which the paper’s regulatory argument most directly depends. If AI caused the crisis, the appropriate policy response is AI regulation. If the crisis pre-dates AI and AI is accelerating it, the appropriate policy response is structural, and must address the market failure that made the ecosystem vulnerable, with AI governance as a component rather than the totality of the solution. This section presents existing secondary evidence, based on our own original primary research using GHArchive public data, and a framework for interpreting what the combined evidence shows.

2.1  Existing Evidence of Pre-AI Decline

Several peer-reviewed studies establish that open source project abandonment was rising before generative AI tools became widely available. Coelho et al. (2019) found a 16 per cent abandonment rate in a sample of 1,932 popular GitHub projects, identifying maintainer overwork and declining community engagement as the primary drivers. This study, published before any significant AI coding tool adoption, documents a baseline condition of structural fragility that existed independently of the AI question.

The more substantial dataset comes from Xu et al. (arXiv 2507.21678, 2025), which analyzed 115,466 GitHub repositories and confirmed abandonment in 57,733 of them, using a multi-factor methodology that filtered for genuine project inactivity rather than simply the absence of recent commits. This dataset is, to date, the largest systematic abandonment analysis in the literature. Its primary limitation for the purposes of this paper is that it was published as a predictive modelling framework rather than a time-series analysis: it identifies abandoned projects but does not provide year-by-year disaggregated data that would allow a trend line to be constructed.

The Tidelift maintainer survey series provides complementary evidence from the supply side. Rates of burnout, quitting, and seriously considered departure among open source maintainers have been consistently above 55 per cent in every Tidelift maintainer survey conducted between 2021 and 2024, a figure the surveys themselves describe as essentially unchanged year on year despite growing public awareness of the problem. The consistency of these figures across four consecutive annual surveys is itself the most damaging evidence against the voluntary action argument. Awareness of the crisis has not moved the needle. The surveys do not provide the project-level longitudinal data that would allow a contributor attrition trend line to be constructed, but they establish the human context within which the quantitative findings in Section 2.3 should be read.

2.2  Primary Research: The GHArchive Analysis

To establish a longitudinal trend line from primary data, this paper presents original analysis of the GHArchive public dataset, covering the period 2015 to 2025. GHArchive is a public archive of GitHub’s event stream, capturing push events, pull request events, issue events, and contributor activity across the full population of public GitHub repositories. The analysis focuses on a filtered set of high-longevity repositories selected to represent consequential open source infrastructure rather than the long tail of personal and experimental projects.

Methodology and filtering criteria

The filtering methodology adapts the multi-factor approach of Xu et al. to the fields available in the GHArchive event stream. Repositories were required to meet all of the following criteria to be included in the analysis: a minimum of seven years of continuous activity within the study window; a minimum of five distinct contributors at peak year engagement; a minimum of 100 activity events per year in active years (approximately two meaningful actions per week on an annualized basis) to distinguish sustained maintenance from occasional personal activity; and exclusion of known noise patterns including repository names consistent with tutorial, course, homework, dotfiles, demo, example, and curated list repositories. Repositories with a pull-request-to-push ratio below 0.01 were excluded as probable automated mirrors.

The analysis tracks four primary metrics across each year in the study window:

  1. total activity events
  2. distinct contributor count by GitHub login
  3. push events as a proxy for direct commit activity
  4. pull request events as a proxy for external contribution volume.

The distinction between push and pull request event trends becomes analytically significant in the 2023 data, as discussed in Section 2.3.

The dataset covers 2015 to 2025. Data prior to 2015 exists in the GHArchive record but is treated with caution given documented inconsistencies in GitHub’s early event capture; 2015 is the more defensible starting point for the quantitative trend line.

Known limitations

The GHArchive event stream captures activity but cannot distinguish the origin of that activity. A pull request submitted by a human developer and one generated by an AI coding tool appear identically in the event record. This limitation is directly relevant to the interpretation of the 2023 data discussed in Section 2.3, and the paper is explicit about what can and cannot be established from aggregate event counts alone. The contributor count metric uses distinct GitHub logins per year, which will under count contributors operating under multiple accounts and over count bots or automation scripts operating under named accounts. These limitations are acknowledged and do not materially affect the directional findings.

Note on forthcoming research: A natural extension of this analysis is tracking individual contributor logins across repositories to distinguish migration from complete exit. While it is technically feasible using the same GHArchive dataset, it is outside the scope of this paper. A targeted version, tracking the 2019 Gatsby contributor cohort against successor frameworks Next.js and Astro in 2021–2023, is planned as a companion study. The question of whether contributors who leave established projects migrate to newer ones or exit open source contribution entirely has significant implications for the policy argument. A migration suggests the contributor pool is redistributing, while a full exit confirms the net depletion the Tidelift surveys describe from the supply side. This forthcoming research is flagged here as a known gap that the current analysis cannot resolve from aggregate event data alone.

2.3 Findings: The Longitudinal Trend

Analysis of the filtered repository set across the full 2015–2025 study window reveals a pattern that has moved, in the period since the original draft of this section, from predictive to descriptive. The stress accumulation model advanced in Section 1 is no longer a projection of where the open source ecosystem is heading. The data documents where it has already arrived.

Before presenting the findings it is worth establishing the analytical frame. The dataset does not show uniform collapse, and the paper does not claim that it does. It shows a clear and consistent bifurcation between two categories of project. The first category, those projects whose institutional backers have direct, ongoing, profit-motivated investment in sustaining their contributor ecosystems, shows stability or modest decline. The second category, those projects that are commercially critical but whose backers extract value from them without commensurate contributor investment, shows sustained and in many cases accelerating decline. Notably, the 2024 and 2025 data reveals a cliff that the 2015–2023 trend line had been building toward for almost a decade. The bifurcation is the finding. It is not a caveat to the argument. It is the argument.

The pre-COVID baseline deterioration (2015–2019)

Among projects with nine or more years of continuous activity in the dataset, the majority show declining or flat contributor counts between 2015 and 2019. Apache Spark declined from 1,165 distinct contributors and approximately 177,000 total events in 2015 to 741 contributors and 127,000 events by 2019. This was a 36 per cent reduction in the active contributor base during a period of strong and growing enterprise adoption. Elastic Kibana peaked at 2,639 distinct contributors in 2016 and had declined to 2,208 by 2019 despite substantially increased event volume, the earliest instance in the dataset of the concentration dynamic. That is when a shrinking contributor pool is absorbing a growing workload. gRPC peaked at 1,761 contributors in 2017 and had begun a decline that would prove uninterrupted through 2025. Laravel/framework peaked at 4,059 contributors in 2017 and declined steadily thereafter. Golang/go reached its contributor peak of 4,996 in 2019 before beginning a decline that would continue without interruption to 2025.

The pre-COVID pattern is significant because it establishes that contributor attrition in established projects was underway before either the COVID disruption or the AI adoption wave. It was not caused by either, but they accelerated it.

The 2020 disruption

The 2020 data shows a marked increase in activity across the majority of projects in the dataset, consistent with the well-documented surge in open source contribution accompanying the shift to remote working during the COVID-19 pandemic. TensorFlow reached its peak contributor engagement of 10,796 in 2020. Flutter reached its all-time peak of 19,666. Rust reached 3,536. Homebrew peaked at 1,014. Apache Spark was the notable exception to this rule. Its contributor count continued falling through 2020 from 741 to 651, as a project whose structural decline was already too advanced for the broader ecosystem trend to reverse.

The 2020 data is treated throughout this analysis as an external disruption rather than a structural signal. Its analytical significance lies entirely in what the recovery looks like in the years that follow.

The incomplete recovery (2021–2022)

Across the majority of established projects in the dataset, the 2021–2022 period does not return to 2019 levels. It settles at a lower floor. Kubernetes declined from its peak of 6,606 contributors in 2019 to 4,060 by 2022. Elastic Kibana fell to 1,517, 42% below its 2016 peak. Flutter fell from 19,666 in 2020 to 11,926 by 2022, a 39% decline in two years. TensorFlow fell from 10,796 in 2020 to 3,872 by 2022, a 64% decline in two years, despite sustained internal Google development activity throughout. Gatsby, whose collapse began before the COVID disruption, fell from 5,947 contributors at peak to 1,505 by 2022 — an 87 per cent decline driven primarily by the Netlify acquisition and the community exit that followed it.

The system emerged from the 2020 disruption weaker than it entered it. The post-COVID floor was lower than the pre-COVID baseline across virtually every project in the declining category.

The 2023 AI noise signal

The 2023 data shows a pattern consistent with the AI contribution noise hypothesis advanced in Section 3. Several projects show partial stabilization or modest recovery in total event counts. Year-on-year, Kubernetes was flat at 145,152, Elastic Kibana was flat at 192,022, Rust marginally up, even while distinct contributor counts continued to fall. For Kubernetes, contributors fell from 4,060 to 3,420 despite flat event totals. TensorFlow’s push events remained stable while PR events collapsed from 5,239 to 2,421, consistent with continued corporate-driven development and a near-complete withdrawal of external community contribution.

This event-contributor divergence, appearing consistently across multiple projects in the first full year of mass AI coding tool adoption, is consistent with AI-generated activity inflating event counts while the human contributor base continues its structural contraction. It must be very clearly stated that this cannot be established definitively from aggregate event data alone. Distinguishing AI-generated from human-generated contributions in the GHArchive event stream requires contributor-level analysis beyond the scope of this study, but the directional pattern is consistent with what AI-generated contribution noise at scale would produce, and is consistent with the qualitative evidence from named projects in Section 2.4.

The 2024–2025 acceleration: the reckoning arrives

The 2024 and 2025 data represent the most significant finding in this analysis. What the 2015–2023 trend line had been building toward as a projection has arrived as a measured reality. Across the majority of established projects in the dataset, 2024 and 2025 show accelerating contributor collapse that in several cases returns projects to or below their pre-growth baselines. In just two years a decade of community contribution is erased, while the commercial infrastructure built on those projects has never been larger or more deeply embedded. These are some important and illustrative examples, but the list of declines is many, many times longer than presented here.

Kubernetes fell from 3,420 contributors in 2023 to 3,098 in 2024 and 2,365 in 2025. Meanwhile, its total events in 2025 were 93,627, almost exactly its 2015 level of 94,419. A project that grew from a standing start to become the de facto standard for enterprise container orchestration globally has returned, on contributor engagement, to where it began. The commercial dependency on Kubernetes has never been greater. The community sustaining it has never been smaller relative to that dependency.

Flutter fell from 11,399 contributors in 2023 to 10,364 in 2024 and 7,115 in 2025. Its total events of 91,653 in 2025 were the lowest since 2018, when Flutter was a niche project with minimal adoption. Despite the demonstrable fact that it now underpins hundreds of millions of deployed applications across iOS and Android.

Apache Spark, admittedly a project that would be considered “legacy” in some circles, but is still widely deployed in analytics workflows globally, reached 523 contributors and 18,167 total events in 2025, below its 2015 starting point of 1,165 contributors and 177,222 events on both metrics.

Golang/go fell to 2,711 contributors in 2025, a 46 per cent decline from its 4,996 peak.

gRPC fell to 632 contributors, a 64 per cent decline from peak, with push events rising in 2024–2025 in a pattern consistent with internal automation replacing community contribution.

Laravel/framework fell to 1,675 contributors in 2025, its lowest recorded level.

TensorFlow presents the most analytically complex 2024–2025 pattern in the dataset and warrants specific treatment. Total events spiked to 111,654 in 2024, the highest since 2020, while distinct contributors collapsed to 2,921. In 2025, events remained elevated at 93,418 while contributors fell further to 1,222. Push events exploded from approximately 15,000 per year across 2021–2023 to 56,756 in 2024 and 47,013 in 2025. PR events similarly spiked from around 2,400–7,300 in prior years to 31,361 in 2024 and 34,214 in 2025. The result is that 1,222 distinct contributors generated 93,418 events in 2025. That’s an average of approximately 76 events per contributor, compared to approximately 9 events per contributor at peak community health in 2019 the last “regular” year of data. This is the most extreme event-contributor divergence in the dataset. It is consistent with a project that has become primarily a vehicle for internal Google automation and AI-generated contribution activity, with its open source community having almost entirely disengaged. That Google’s primary AI framework, has reached this state is not a footnote – vast swathes of the AI ecosystem would not exist without Tensorflow and it’s overall capabilities. It is a summary of the paper’s argument in a single data series.

WordPress/Gutenberg warrants specific attention given its infrastructural significance, and its 2024–2025 trajectory is more explicable, and more instructive, than the raw numbers alone suggest. Events of 101,447 and contributors of 2,019 in 2024 were followed by 34,422 events and 1,191 contributors in 2025. That is a 66 per cent single-year event collapse and a 41 per cent contributor decline in one year. WordPress powers approximately 40 per cent of the public web. Gutenberg is its core editing interface. The proximate context for this decline is the highly public dispute between Automattic and WP Engine in late 2024, which produced governance instability, community polarization, and a measurable withdrawal of developer engagement at precisely the moment Automattic was accelerating its AI integration strategy. The data suggests a compound mechanism: governance-driven community exit amplified by developer push-back against an AI-first strategic direction imposed by the corporate backer without community consensus. If that interpretation is correct, something subsequent qualitative research is required to confirm, then Gutenberg represents a new category of decline mechanism not previously identified in this dataset. Active contributor withdrawal in response to corporate AI encroachment on a project’s identity and direction, rather than the passive attrition through burnout and neglect that characterizes the majority of declining projects. The policy implication is that AI strategy decisions made unilaterally by corporate backers can destroy contributor communities rapidly and at scale, adding a further dimension to the governance argument developed in Section 5.

The counter-examples and what they prove

A handful of projects in the dataset show stability or growth in 2024–2025, and their characteristics are analytically decisive.

Visual Studio Code — Microsoft’s code editor and the primary host for every major AI coding extension on the market shows events of 144,669 in 2024 and 150,398 in 2025, with contributors of 20,579 and 19,982 respectively. Essentially it is flat across three years ending 2025. Microsoft has direct, measurable, ongoing commercial incentive to maintain VS Code’s contributor ecosystem. It is significant that it is the delivery mechanism for GitHub Copilot, which is a significant and growing revenue line. It’s project stability is not coincidental. It is the product of active investment.

Hugging Face Transformers — the most widely deployed model fine-tuning and serving library in the AI industry shows events of 52,959 in 2024 declining to 40,072 in 2025, with contributors of 4,904 and 3,515 respectively. A meaningful 2025 decline, but from a high base, and 2025 levels remain above 2021 and 2022 unlike other projects. The AI industry has direct commercial incentive to maintain this library’s contributor ecosystem. The relative resilience, compared to projects where that direct incentive is absent, is consistent with the pattern.

NixOS/nixpkgs reached its all-time peak of 407,677 events and 9,644 contributors in 2024 before a modest pullback to 350,974 events and 8,896 contributors in 2025. It remains the one project in the dataset showing sustained community-driven growth without clear corporate profit motivation as the primary driver. This is a genuine exception, explicable by the combination of strong foundation governance, an ideologically committed community, and technical characteristics that attract sustained new contributor interest. It is the dataset’s proof that the structural conditions for sustainability are achievable. It is also, notably, the only project in the dataset for which that can be said in the absence of direct corporate profit motivation.

Homebrew/homebrew-core shows a more ambiguous picture that is worth flagging as a warning signal rather than a counter-example. Total events grew through 2024 (201,202, the highest in its history) before a modest 2025 pullback to 191,936. But distinct contributors have been falling steadily and without interruption: from 3,114 in 2017 to 1,410 in 2025, a 55 per cent decline across eight years. Homebrew-core is doing substantially more work with substantially fewer people. The event volume is being maintained while the contributor base sustaining it is quietly contracting. This is the concentration dynamic in its most developed form. It is not yet a cliff, but with the preconditions for one accumulating year on year.

Rust-lang/rust shows a 2025 anomaly that requires a methodological note. Contributors fell from 4,257 at peak to 2,926 in 2025, continuing the gradual decline visible since 2022. However, push events spiked sharply in 2025 to 12,130. That’s nearly five times the 2024 level of 2,722 and the highest push event count in the project’s recorded history. Interesting because it happens with no corresponding announcement of a large-scale internal engineering initiative in the Rust project’s published 2025 roadmap and blog record. The most simple explanation, consistent with the broader patterns observed in this dataset, is that Rust is experiencing significant AI-generated or automated push activity. Rust’s technical prestige and community prominence make it a plausible target for low-effort, AI-assisted contributions, by actors seeking visibility in a high-status project. This hypothesis will be explored in subsequent research using contributor-level analysis. For the purposes of this paper, the Rust contributor count trend, declining but not collapsing and remaining substantially above pre-COVID levels, is treated as the more reliable signal of community health, and the 2025 push event spike is flagged as an unverified anomaly rather than an organic community signal.

The pattern and what it means

The bifurcation across this dataset is not random and it is not explained by project age, language, domain, or corporate affiliation in the simple sense. What distinguishes the stable projects from the declining ones is a single variable: whether the institutional backer has direct, ongoing, profit-motivated incentive to invest in the contributor ecosystem specifically. The backer(s) aims to fully use the project, not merely to employ engineers who commit to it, but to actively sustain the community that maintains it.

Microsoft invests in VS Code’s contributor ecosystem because VS Code is the distribution mechanism for Copilot revenue. Hugging Face invests in Transformers because Transformers is the product. NixOS’s community invests in itself because the community is the product. In every other case in the dataset, the institutional relationship is extraction. The project is used, depended upon, and in some cases nominally sponsored, but the contributor ecosystem is not actively maintained as an asset. It is consumed as a commons.

The result is visible in the data across a decade. Commercial dependency without active contributor investment is not protection. It is a slow-motion depletion of a shared resource. The projects in this dataset that are declining are not failing because they are bad projects or because their maintainers are insufficiently dedicated. They are failing because the market structure that surrounds them systematically rewards extraction and systematically fails to fund maintenance. That market failure is what this paper’s policy proposals are designed to address.

2.4  Case Studies: Projects at the Breaking Point

The quantitative trend data is supplemented with named case studies that illustrate the mechanisms at work. Four cases are particularly instructive, each representing a different point on the failure spectrum from governance change to active security exploitation.

Ingress NGINX was retired from active development in November 2025. It is not a peripheral project: it was the default ingress controller for Kubernetes clusters globally, underpinning enterprise infrastructure across financial services, technology, and public sector deployments. Its retirement was not caused by a lack of corporate sponsorship — it had that. It was caused by a lack of human attention: the maintainers, facing an increasing volume of AI-generated pull requests requiring review, and carrying the cognitive overhead of a project whose architectural complexity had grown beyond what a small volunteer team could sustainably manage, simply reached the end of their capacity.

curl, one of the most widely deployed command-line tools in the world, paused its bug bounty programme in 2024 after receiving a flood of AI-generated bug reports that described non-existent vulnerabilities, consuming maintainer time without producing security improvements. The decision was documented publicly by the lead maintainer, providing a rare first-hand account of the mechanism by which AI degrades maintainer sustainability.

tldraw, an open source drawing tool with widespread use in commercial products, temporarily blocked all external pull requests in response to what its maintainers described as a volume of AI-generated contributions that overwhelmed their capacity for review. The block was a pragmatic response to a denial-of-service attack on maintainer attention, using the language of one of the project’s own contributors.

XZ-Utils, a data compression library present in the majority of Linux distributions, was the target of a multi-year social engineering campaign that came within a routine system update of installing a backdoor in global SSH infrastructure. The attacker exploited a maintainer in visible distress, who was overwhelmed, and publicly expressing difficulty keeping up. They did this by offering sustained, high-quality contributions over months before embedding a malicious payload. This case pre-dates AI-assisted attacks but illustrates the attack surface that AI dramatically widens. A maintainer under stress, with insufficient capacity to review contributions thoroughly, is the vector. A huge volume of low-effort AI submissions is the smokescreen. ~A single malicious contribution is the payoff.

Argument 3: How Generative AI is Worsening the Crisis in Three Mechanisms

This section is not an argument that generative AI is intrinsically harmful to open source. Some maintainers are using AI tools to manage their own triage, documentation, and code review workloads. Some projects have benefited from AI-assisted contributions that improve code quality. The argument is narrower and more specific. It is simply that generative AI, as deployed at scale by commercial organizations without governance frameworks adequate to the task, is net-negative for the structural health of the open source ecosystem through three distinct mechanisms.

3.1  Contribution Quality Degradation and the Asymmetric Burden

AI coding assistants are capable of generating a lot of very plausible-looking code, very quickly. They are significantly less capable of generating high-quality and accurate code, particularly in the context of large, complex, long-lived projects with intricate dependency relationships and non-obvious invariants. The gap between plausible and accurate is not random, rather it is systematically biased toward certain failure modes that are expensive for maintainers to detect and reject.

The Carnegie Mellon University study analyzed 807 open source GitHub repositories that adopted Cursor between January 2024 and March 2025, tracking code quality metrics through to August 2025. It found no reversal in quality degradation trends even as the underlying models improved substantially across the same period. The GitClear analysis of 153 million lines of committed code found that code duplication increased from 8.3 per cent to 12.3 per cent between 2021 and 2024, and that refactoring activity, the deliberate improvement of code structure by developers who understand it deeply, collapsed from 25 per cent of commits to fewer than 10 per cent over the same period. Code that is duplicated is code that is not understood, Code that is not refactored is code that accumulates technical debt. Both are indicators of AI-generated contributions being accepted into codebases without the quality of human review that the original contribution volume warranted.

The CodeRabbit analysis found that AI-generated code creates approximately 1.7 times as many code review issues as human-authored code. The asymmetry is the critical point. The entities generating the increased volume of contributions incur none of the cost of reviewing them. The commercial organization whose developer submitted a pull request using an AI tool bears no cost if that pull request requires multiple rounds of review from a volunteer maintainer. The maintainer bears the entire cost. This asymmetry is the externality structure that markets cannot self-correct.

The phenomenon has been characterized by practitioners as ‘vibe coding’, the generation of contributions that appear to address a stated issue but that have not been verified by the submitter against the actual system, and that may contain subtle errors, incorrect assumptions, or changes to unrelated parts of the codebase. By early 2026, curl, tldraw, and Ghostty had all changed their contribution policies directly in response to this phenomenon. The Ingress NGINX retirement, noted above, represents the highest profile example of the end state of the trajectory these policy changes are attempting to forestall.

3.2  Security Escalation: From Quality Degradation to Attack Vector

The security implications of AI-assisted development extend beyond the quality issues discussed above. AI tools introduce a distinct category of security risk that operates at the architectural level rather than simply the code quality level, and that is substantially more difficult to defend against.

Apiiro’s analysis of Fortune 50 enterprises found a ten-fold increase in security findings per month between December 2024 and June 2025. Research from multiple independent sources consistently finds that between 15 and 25 per cent of AI-generated code contains security vulnerabilities. A 2024 academic study found that between 5 and 22 per cent of the package names referenced in AI-generated code do not exist. This is a phenomenon that has given rise to a novel attack category. When AI models hallucinate package names and developers submit code referencing those names to open source repositories, attackers can register the hallucinated names with malicious payloads. Any project or user that subsequently installs the dependency as referenced in the accepted code receives the attacker’s payload. This attack vector, termed ‘slopsquatting’ by researchers, requires no compromise of any existing package; it exploits the hallucination behavior of AI coding tools directly.

Georgetown University’s Center for Security and Emerging Technology has documented a more sophisticated variant already in use. The AI models themselves can be targeted with data poisoning attacks that cause them to recommend malicious packages to developers using AI coding tools. This attack operates at the model level rather than the code level, and is therefore invisible to conventional code review processes. A developer using an AI assistant that has been poisoned to recommend a particular malicious dependency may receive that recommendation across all of their coding sessions, across all projects, before any individual project maintainer has the opportunity to reject a pull request.

The OpenSSF has explicitly noted that AI lowers the cost and sophistication barrier for supply chain attacks on open source infrastructure. The XZ-Utils attack, which required years of sustained, high-quality human engagement to execute, represents the high end of the sophistication spectrum prior to AI assistance. The tools now available to attackers make equivalent attacks substantially cheaper to mount, substantially easier to scale, and substantially harder to detect. Phylum’s analysis found more than 35,000 malicious packages in open source repositories, with an eight-fold increase in critical malware detection over the survey period.

Key Finding: AI is not merely introducing more vulnerable code into open source repositories. It is lowering the barrier to sophisticated supply chain attacks on critical infrastructure. The combination of maintainer burnout, reduced review capacity, and AI-enabled attack sophistication creates a risk environment that is qualitatively different from the pre-AI baseline.

3.3  The Measurement Problem: Invisible Externalities

The third mechanism is structural rather than technical, and it is in some respects the most important. The harm that AI is causing to the open source commons is systematically invisible to the corporate frameworks through which AI adoption is evaluated and justified. This invisibility is not incidental. It is a feature of how ROI is measured in large organizations, and it is the foundational reason why the problem cannot be solved by appeals to corporate self-interest or voluntary action.

The December 2024 IBM study on AI return on investment surveyed 2,413 IT Decision Makers at large enterprises. It found that 51 per cent reported positive ROI from AI adoption in software development contexts. The study is routinely cited by AI vendors and enterprise technology advocates as evidence that AI delivers measurable value to software development. A close examination of its methodology reveals why it cannot support that conclusion in the context of open source.

The respondents are director-level or above, with purchasing authority over IT products. They are, structurally, the people whose professional role is to justify the purchasing decisions they have made. The metrics used to calculate ROI, in this study: faster software development, faster innovation, time savings in productivity, are self-reported estimates at the individual level. They are also measuring the output of the developer using the tool, not the total system impact including downstream costs. The study measures the value the AI tool appears to produce in the transaction where it is used. It does not and cannot measure the costs imposed on maintainers reviewing the output of that tool, on security teams addressing the vulnerabilities it introduces, or on the shared infrastructure degraded by its aggregate use across the industry.

This is not a failure of IBM’s researchers. It is a description of the limits of ROI as a measurement framework for systemic externalities. The costs are real, they simply fall outside the boundary of the measurement. When an open source maintainer spends an additional hour reviewing an AI-generated pull request that turns out to be incorrect, that hour appears nowhere in any corporate ROI calculation. When a security team responds to a supply chain attack that exploited a hallucinated package name, the attribution to AI tool usage is invisible. The negative externalities are diffused across the commons and therefore unaccountable in any individual firm’s ledger.

The executive-developer perception gap is the human face of this measurement failure. The Atlassian 2025 State of DevEx Survey found that 63 per cent of developers report that their leaders do not understand their pain points, a sharp increase from 44 per cent the prior year, with developers specifically identifying the disconnect between leadership enthusiasm for AI productivity and developer experience of AI-generated review burden as a primary source of frustration. The EY Work Reimagined Survey of 15,000 workers found that 64 per cent perceive increased workloads despite widespread AI adoption, with only 5 per cent using AI in genuinely transformative ways. The METR randomised controlled trial is perhaps the most damning of all for the impact of AI on software development. It found that experienced developers took 19 per cent longer on tasks when using AI tools, despite believing, in a pre-trial estimate, that AI had made them 20 per cent faster. The perception gap runs in the wrong direction even at the practitioner level; at the executive level, it is wider still. 


Part 4: The Externality Structure — Why Markets Cannot Self-Correct

The preceding three sections establish the empirical case: the open source ecosystem was already deteriorating before AI, AI is worsening it through identifiable mechanisms, and the harm is invisible to the measurement frameworks of the organizations causing it. This section makes the structural argument: that these conditions together constitute a classic negative externality that markets cannot self-correct, and that the historical record of voluntary action confirms this at scale.

4.1  The Externality Defined

A negative externality exists when the costs of a private action are borne by parties outside the transaction in which the action is taken, and where the acting party therefore has no market signal incentivizing cost reduction. Pollution is the canonical example: the factory producing pollution bears none of the cost of the health damage it causes to downstream communities, and therefore has no financial incentive to reduce pollution absent regulation. The incentive for self-regulation is structurally absent because the internal cost-benefit calculation of the polluting firm does not register the costs it is externalizing.

The open source situation has precisely this structure. The commercial organization that deploys an AI coding tool at scale incurs the cost of the tool license and the time its developers spend using it. It receives the benefit of apparently accelerated output. It does not incur the cost of the additional review burden imposed on open source maintainers, the security vulnerabilities introduced into shared infrastructure, or the accelerated burnout of the volunteer community whose continued unpaid labor it depends upon. All of those costs are externalized onto the commons. The commons has no mechanism to charge the externalizing party. The externalizing party has no incentive to reduce its externalization because its internal metrics do not register it.

The analogy to environmental regulation is instructive as a policy logic and not merely a rhetorical device. When private actors extract value from a shared resource while externalizing degradation costs onto that resource, the outcome in the absence of regulation is well-documented. It is the “tragedy of the commons”, in which individually rational behavior produces collectively irrational outcomes. Garrett Hardin identified the mechanism in 1968; Elinor Ostrom won the Nobel Prize in Economics in 2009 for demonstrating the governance conditions under which commons can be sustainably managed. The open source commons currently lacks those governance conditions. It has the access structure of a commons, free extraction by all, without the governance structure that prevents tragedy or abuse.

4.2  Why Voluntary Action Has Failed and Will Continue to Fail

The standard industry response to arguments for regulatory intervention is to appeal to the potential for voluntary self-regulation: the assertion that corporations, properly informed of the costs they are externalizing, will act in their long-term interest by contributing to the maintenance of the infrastructure they depend upon. There are two problems with this response. First, it has been tried and has demonstrably failed. Second, the structural conditions that produced that failure have gotten measurably worse.

GitHub Sponsors has operated for several years. The voluntary participation rate among open source users is approximately 0.001 per cent. The Open Source Pledge is a more recent initiative asking companies to commit a minimum of $2,000 per developer per year to open source. While it has attracted a small number of early adopters, predominantly smaller technology companies, it is not the hyperscalers whose AI tool deployment is the primary driver of the problem this paper addresses. These initiatives are not failures of design or communication. They are demonstrations that the collective action problem inherent in commons governance does not yield to voluntary mechanisms in the absence of either binding agreements or reputational incentives strong enough to overcome the financial benefit of free-riding.

The collective action problem takes a specific form here. Even a corporation that recognizes its dependency on open source maintenance and would, in isolation, prefer a well-maintained ecosystem to a deteriorating one has a strong competitive incentive not to contribute unilaterally. Every dollar spent on open source maintenance is a dollar not spent on competitive activities, and the benefit of that maintenance is shared equally with competitors who spend nothing. The only rational response under these conditions, for a firm maximizing its own competitive position, is to free-ride and hope that enough others contribute to keep the commons functioning. This is not bad faith! It is the predictable outcome of a measurement and incentive framework that rewards free-riding.

Mandatory contribution solves the collective action problem that voluntary mechanisms cannot, because it eliminates the competitive disadvantage of unilateral participation. When all commercial beneficiaries of the open source commons are required to contribute proportionately, no individual firm is penalized for contributing. The cost is distributed across the industry rather than borne disproportionately by the conscientious minority.

4.3  The Governmental Duty of Care

Governments have a specific obligation in this context that extends beyond general market regulation. Open source infrastructure is not merely a private sector resource. It underpins financial systems, healthcare networks, energy grid management, national defense capabilities, and public communications infrastructure. Its failure is not just a private sector problem. It is a national security risk of the first order.

The XZ-Utils near-miss in 2024 was a demonstration, in miniature, of what that failure looks like. A backdoor inserted through the social engineering of a single, overwhelmed volunteer maintainer came within a routine system update of compromising global SSH infrastructure. The attacker had nation-state level patience and sophistication. The defender had none: a single individual, unpaid, managing a critical library in personal time, with insufficient capacity to thoroughly scrutinise the contributions of a persistent and skilled adversary.

The OpenSSF and Georgetown CSET have both documented that AI dramatically lowers the cost and sophistication barrier for equivalent attacks. An attack that required multi-year human investment to execute in the XZ-Utils case can now be partially or substantially automated. The number of potential targets that a sophisticated attacker can pursue simultaneously increases in proportion to the automation available. The maintained volunteer community that is the primary defense against such attacks is shrinking, burning out, and now absorbing additional review burden from the AI tool adoption of commercial organizations.

The question for governments is not whether a systemic failure event is possible. Log4Shell and XZ-Utils have already demonstrated that it is. The question is at what point the expected value of the downside risk, the probability multiplied by magnitude of damage, exceeds the political cost of intervention. This paper argues that point was reached some time ago, and that generative AI has now compressed the timeline to the point where continued inaction represents not caution, but outright failure of the governmental duty of care.

Part 5: The Policy Gap — Existing Instruments and Their Limits

The argument for a Sovereign OSS Fund does not require that existing regulatory instruments have done nothing. Several have made meaningful contributions to the problem space. The argument is that collectively they leave a gap that is both large and consequential, and that the gap is precisely the one the fund is designed to fill.

5.1  What Exists

The EU Cyber Resilience Act (2024) is the most relevant existing instrument. It places mandatory security obligations on manufacturers of products with digital components, including products built substantially on open source foundations. In principle, it creates downstream liability for products that fail to meet defined security standards, establishing the precedent that commercial beneficiaries of open source have obligations that follow from that benefit. In practice, it creates compliance obligations but not maintenance funding: a manufacturer can satisfy its Cyber Resilience Act obligations by documenting its dependencies and implementing security response processes without contributing a single euro to the maintenance of the open source projects it depends upon.

The Biden administration’s 2021 Executive Order on Cybersecurity mandated Software Bills of Materials for government procurement, requiring federal contractors to document their open source dependencies. This created the disclosure infrastructure that a contribution obligation model would require: if commercial organizations are required to know and declare what they depend on, that declaration can become the basis for a proportionate contribution obligation. The SBOM mandate is the thin end of the wedge. It has not yet been extended to create the contribution obligation that its transparency would make administratively feasible.

The Biden National Cybersecurity Strategy (2023) shifted liability for insecure software toward organizations best positioned to reduce risk, explicitly rejecting the prevailing norm under which end users bear the cost of software vulnerabilities that developers or vendors could have prevented. The logic applies directly to the open source context: the organizations best positioned to reduce the risk of open source infrastructure failure are those large commercial entities with the resources to fund its maintenance. The strategy’s logic has not, however, been extended to create specific obligations in the open source maintenance context.

5.2  What Is Missing

The gap left by these instruments has three dimensions. First, none of them creates a funding mechanism. The Cyber Resilience Act creates compliance costs. The SBOM mandate creates disclosure costs. Neither generates resources for the maintenance it implicitly depends upon. Open source projects whose security and maintenance standards are assessed by the Act’s compliance framework will be judged against criteria that their volunteer maintainers have no additional resources to meet.

Second, none of them addresses the AI dimension specifically. The acceleration of the problem documented in the preceding sections, specifically the flood of AI-generated contributions, the novel attack vectors, and the systemic invisibility of costs to corporate ROI frameworks, is entirely outside the scope of instruments designed before widespread AI coding tool adoption.

Third, none of them addresses the governance problem. Who decides which projects are critical, and how are disbursements made against principled criteria rather than lobbying effectiveness? The absence of a credible, independent answer to this question is itself a barrier to the voluntary corporate contributions that the gap leaves as the only available mechanism. Corporations that might be willing to contribute do not have a trusted mechanism through which to do so that distributes the benefit to genuinely critical projects rather than high-profile ones.

The Policy Gap in Summary: Existing instruments create transparency obligations (SBOM mandates), compliance obligations (Cyber Resilience Act), and liability-shifting logic (2023 National Cybersecurity Strategy). None creates a funding mechanism, none addresses the AI dimension specifically, and none resolves the governance question of how to identify critical projects and distribute resources to them credibly and independently. The Sovereign OSS Fund fills all three gaps.

Section 6: The Sovereign OSS Fund — Architecture, Rationale, and Design

The Sovereign OSS Fund is the central policy proposal of this paper. It is designed to address the market failure described in the preceding sections by internalizing the costs currently externalized onto the open source commons, doing so in a manner that is proportionate to commercial benefit and AI deployment scale, transparent in its governance and disbursements, and independent of both government control and industry influence in its critical designation decisions. This section sets out the fund’s rationale, its contribution structure, and its governance architecture.

6.1  The Systemic Risk Insurance Framing

The most rigorous case for the fund rests not on the polluter pays principle alone, though that principle is sound and applicable. It focuses instead on the systemic risk insurance framing inherent in the open source software ecosystem. Critical open source infrastructure is a public good, it is non-excludable (anyone can access it), non-rivalrous (one party’s use does not reduce another’s), and subject to free-riding at scale. It is also exposed to low-probability, high-impact failure events whose costs would be distributed across the entire digital economy without regard to any individual organization’s contribution history. This is the exact structure that insurance markets exist to address.

Conventional insurance cannot function in this context, however, because the underlying asset is a commons. No single party owns or controls it, and no single party can be assigned the residual claimant status that insurance underwriting requires. The Sovereign OSS Fund therefore functions as a collective insurance pool, operated by an independent body, funded by proportionate mandatory contributions from commercial beneficiaries, and structured to maintain the designated assets against failure. The contribution obligation is the insurance premium. The designated projects whose maintenance the fund supports are the insured assets. The catastrophic failure events that the fund is designed to prevent, such as a Log4Shell-scale vulnerability in critical infrastructure, an XZ-Utils-style supply chain compromise of national security significance, or the accelerating abandonment of projects underpinning healthcare or financial system infrastructure, are the insured risks.

The case for placing AI companies at the center of the mandatory contribution mechanism is not simply that they are large and profitable, though they are. It is that they occupy a uniquely paradoxical position in the open source ecosystem. They are its most intensive beneficiaries and, through the market deployment of AI coding tools, among its most consequential current stressors. The AI industry’s dependence on open source is not historical or incidental, it is ongoing and structural. Production AI systems today run on open source inference infrastructure, are evaluated against open source benchmarks, are fine-tuned using open source tooling, and are deployed through open source serving frameworks. The value of every commercial AI product on the market today is partly a function of the open source commons from which it was built and on which it continues to depend. Against this, the contribution noise, maintainer burden, and security surface expansion that AI coding tools are generating in the open source ecosystem represents a transfer of cost from the AI industry onto the maintainer communities it has already extracted value from, without commensurate return. The AI deployment levy proposed in this paper is not an external imposition on a bystander industry. It is a contribution requirement on the industry that has extracted most from the commons and is now, measurably, making it harder to sustain. Framed correctly, and the framing matters for political viability, this is not regulation. It is the belated recognition of a debt.

This framing carries the urgency argument without alarmist leanings or hyperbole. The question put to governments is not whether catastrophic failure is possible. We are past that point, as it has already been demonstrated that it is. The question is whether the expected value of the downside risk now justifies the cost of collective insurance. The answer is yes, and it has been yes for some time. AI has not created this conclusion, it has made it more urgent and more undeniable.

6.2  The Polluter Pays Dimension

Alongside the insurance framing, the polluter pays principle provides the moral and legal foundation for the contribution obligation. Organizations that deploy AI coding tools at scale impose costs on the open source commons that their tools degrade. Those costs are real, measurable at the aggregate level, and uncompensated. The principle that the party generating an externality should bear the cost of it, established in environmental regulation across all major jurisdictions, cleanly and directly applies here.

The fund is not a tax, and the paper asserts this distinction explicitly and for structural rather than merely rhetorical reasons. A tax generates general government revenue allocated through the political budget process. The Sovereign OSS Fund contributions flow directly into a ring-fenced mechanism managed by an independent body outside direct government control, disbursed according to publicly auditable criteria to specifically designated projects. No government has discretionary access to fund proceeds. The contribution is tied to a defined public benefit, literally the maintenance of infrastructure from which the contributing organizations derives direct commercial value. It is kept equitable because it is proportionate to the scale of that benefit and the scale of the harm being generated.

6.3  The Contribution Tier Structure

The contribution structure is designed to satisfy proportionality and administrability simultaneously. A blunt mechanism that treats a ten-person startup identically to a hyperscaler would be both inequitable and politically unworkable. A mechanism so complex that compliance costs exceed contribution value would be administratively self-defeating. The four-tier model below achieves both requirements.

Tier 0 — Exempt:

Organizations with annual revenue below €10 million carry no contribution obligation. The administrative cost of collection from this tier would exceed the revenue generated, and the political cost of applying the mechanism to startups and early-stage companies would undermine its broader credibility. Voluntary participation is encouraged through an Open Source Pledge-style recognition scheme. This also ensures that it does not actively stifle the startup/early stage organization with regulation and cost that did not inhibit its predecessors.

Tier 1 — Base:

Organizations with annual revenue between €10 million and €500 million contribute at a flat rate scaled to their declared open source dependency footprint, as documented in mandatory SBOM disclosures. The existing EU Cyber Resilience Act and US Executive Order SBOM mandates provide the disclosure infrastructure this tier requires. Since organizations already have an obligation to know and declare what they depend on, that declaration becomes the basis for a modest proportionate contribution.

Tier 2 — Scaled:

Organizations with annual revenue above €500 million contribute at a rate that scales with both their revenue band and their declared AI coding tool deployment intensity. AI deployment intensity is measured by declared usage of AI coding tools above defined thresholds, and easily measured. Developer seat counts using AI assistants, AI tool spend above an auditable minimum, or where the data is available and technically feasible, the volume of AI-assisted commits above a defined level. This tier reflects the reality that larger organizations have deeper open source dependencies and are beginning to deploy AI tools at a scale that materially affects the ecosystem.

Tier 3 — Hyperscale:

Organizations with annual revenue above €5 billion, particularly those deploying AI coding tools at scale across tens of thousands of developers, carry the maximum contribution obligation. The weighting formula at this tier places the greatest emphasis on AI deployment intensity, on the basis that these organizations are the primary generators of the maintainer burden the fund exists to address. They are simultaneously the greatest extractors of value from the open source commons, the primary deployers of the tools degrading it, and the parties with the greatest capacity to contribute to its maintenance. The moral and financial case converges at the hyperscale tier.

6.4  The AI Deployment Levy

The AI deployment intensity metric at Tiers 2 and 3 is the most novel element of the contribution structure and the one most directly connected to the paper’s argument. A contribution obligation based solely on open source dependency footprint, which existing proposals such as the Open Source Pledge approximate, does not distinguish between organizations that are worsening the maintenance burden through AI deployment and those that are not. The AI deployment levy closes this gap.

The metric is imperfect, as all regulatory proxies are. AI coding tool usage is not uniformly distributed across development activities, and some AI-assisted contributions to open source are high quality and do not impose net costs. The metric measures deployment at scale rather than harm at the contribution level. That is because harm at the contribution level is not currently measurable. This is an acknowledged limitation that should be revisited as measurement capabilities develop. What the metric achieves is that the highest contribution obligations fall on the organizations whose AI deployment is most plausibly connected to the harm the fund is designed to mitigate. The hyperscalers and large SaaS companies deploying AI coding tools to tens of thousands of developers, that are generating the review burden that is burning out the maintainer community.

This is the element of the proposal that will attract the most scrutiny from AI vendors and large technology companies, we are direct about it. The industry argument will be that AI deployment and open source harm are not causally linked at the individual contribution level, and that a levy based on deployment intensity is therefore arbitrary. The response is that causal attribution at the individual contribution level is precisely what the externality structure prevents. The harm is diffused across the commons and cannot be traced to individual actors. The levy is calibrated to deployment scale as the best available proxy for aggregate contribution to the harm, in the same way that environmental levies are calibrated to production volume or emissions rather than to individually attributable harm events. The standard for regulatory proxy metrics is workability and proportionality, not perfect causal attribution.

6.5  Governance: The Independent Review Council

The governance architecture of the fund is the element the paper treats with the greatest prescriptiveness, because it is where the risk of failure is highest and where the consequences of failure are most damaging to the fund’s legitimacy. Two failure modes must be structurally prevented rather than merely discouraged:

  1. capture by government, which would allow the fund’s resources to be diverted to politically convenient purposes rather than technically critical ones
  2. capture by industry, which would allow well-resourced lobbying to shape criticality designations in favor of commercially convenient projects rather than genuinely critical ones.

The Independent Review Council is the mechanism by which both failure modes are structurally addressed. Its design rests on a foundational principle: neither a government nor a company should have unilateral authority over what is designated as critical infrastructure. The IRC holds sole decision-making power on criticality designation, and its independence from both sets of interests is structural.

The IRC is composed of technical experts drawn from the security research, open source maintainer, and software engineering communities; academic representatives with relevant expertise in software security and infrastructure governance; and civil society and digital rights organizations. No active corporate directors/officers/principals and no government officials serve on the IRC. Membership is selected through an open nomination process with mandatory conflict-of-interest disclosure requirements. Term limits prevent entrenchment. The full deliberation records of every designation decision are published on the Public Transparency Register.

The oversight structure above the IRC consists of a Statutory Framework Body, comprising government representatives from participating jurisdictions, which sets contribution obligation levels and provides the legal authority for the fund, but which has no operational authority over designation decisions or disbursements. Government participation is limited to non-voting observer seats on the Fund Operational Board. Industry participation is limited to a single, annually rotating non-voting observer seat. All operational authority rests with the IRC’s nominees on the Fund Operational Board.

The precedents for this governance model in the open source context are established and successful. The Internet Security Research Group, which operates Let’s Encrypt and provides free TLS certificates to the majority of websites globally, functions as a public benefit corporation with transparent governance that is neither government-controlled nor industry-dominated. The Linux Foundation hosts critical infrastructure projects under a neutral governance umbrella that has sustained significant projects across organizational and political changes. Neither is a perfect template, but both demonstrate that the independent governance model is achievable and has worked at scale in the open source context.

The fund’s designations, disbursements, and governance deliberations are all published in full on the Public Transparency Register, operated by an independent auditor appointed by the Statutory Framework Body. The register is not subject to modification by either government or industry participants. This transparency mechanism serves a dual purpose. It provides accountability for the IRC’s decisions, and it provides the public record that allows civil society, academic researchers, and affected communities to identify and challenge designation errors through the formal appeals process.

6.6  Criticality Designation: Methodology and Safeguards

The criticality designation methodology is designed to ground designation decisions in objective, publicly auditable criteria rather than political judgment. Three categories of evidence are assessed by the IRC for each designation decision.

Dependency and usage metrics provide the quantitative foundation: download volumes across major package managers, dependency graph depth and breadth, and contributor concentration. Projects dependent on fewer than five active maintainers are flagged for heightened scrutiny regardless of their current stability, on the basis that the risk is prospective. These metrics draw on existing public data sources including package manager APIs, the OpenSSF Security Scorecard, and the Linux Foundation Census methodology. They are not proprietary to the fund and are independently verifiable.

Infrastructure criticality assessment examines whether the project underpins systems in sectors formally designated as critical national infrastructure in participating jurisdictions: financial services, healthcare, defense supply chains, communications infrastructure, energy systems, and public administration. Projects that are material dependencies of systems in these sectors carry a different risk profile than projects of equivalent download volume in consumer applications, and that difference is reflected in the designation weighting.

Maintenance health assessment evaluates the current and projected adequacy of the project’s maintenance regime against the risk it represents. A project that is currently stable but whose maintainer community is declining, whose burnout indicators are elevated, or whose security response timelines are lengthening is a candidate for proactive designation rather than reactive response to failure.

All designation decisions are published in full, including the IRC’s reasoning, the data reviewed, dissenting views from IRC members, and the weighting applied to each assessment category. An appeals mechanism allows any organization to challenge a designation or non-designation through a panel drawn from outside the IRC membership. The appeals process does not give any single party veto power over designations; it provides a formal corrective for errors without creating a mechanism for strategic delay.

The Full IRC and Fund structure: It is impossible to fully detail and document all possible structural and procedural functions of the proposed organization, and indeed we would not want to. The basic formation, by non-profit incentivized parties is the limit of the structural recommendation. The expectation is that the first actions of the founding body itself would be to set up the norms, procedures, and overall structure of the governing body, and the fund. This would far exceed the abilities of the policy statement, but should not be seen as a deficiency. Self-governance is a critical part of open source, and the goal of the IRC is to protect, not determine, and so the proposal matches that ethos.

Section 7: Anticipated Objections and Responses

This section addresses the objections most likely to be raised against the Sovereign OSS Fund by industry stakeholders, policy sceptics, and critics of regulatory intervention. The paper addresses them here rather than in an appendix because anticipating and answering objections is part of the policy argument, not a supplement to it. A proposal that cannot survive rigorous scrutiny should not be recommended; one that can should demonstrate this directly.

7.1  ‘This is an AI tax by another name’

This is the most predictable political attack on the proposal and the paper is direct in response. The distinction between the Sovereign OSS Fund contribution and a tax is structural and legally significant. A tax is a compulsory payment to government generating revenue allocated through the political budget process, with no defined beneficiary and no hypothecation to a specific purpose. The Sovereign OSS Fund contribution is a compulsory payment to a ring-fenced mechanism managed by an independent body outside direct government control, disbursed against publicly auditable criteria to specifically designated beneficiaries, with no government discretion over the allocation. These are different instruments both in law and in economic function.

The contribution is better characterised as a mandatory insurance premium or a utility infrastructure levy: a defined payment into a collective mechanism that maintains shared infrastructure from which the paying party derives direct commercial benefit, scaled to the scale of that benefit. The political argument that this constitutes a tax on innovation is answered by the tier structure: a small company with minimal AI deployment pays nothing. A hyperscaler deploying AI coding tools to tens of thousands of developers, extracting billions of dollars in productivity value from open source infrastructure, and generating a proportionate share of the review burden degrading that infrastructure, pays a contribution commensurate with that footprint. Calling that contribution a tax on innovation inverts the actual relationship: it is the price of using innovation responsibly.

Objection: ‘You are penalising AI adoption and will stifle the industry.’: The proposal does not restrict AI adoption. It requires that organisations adopting AI at scale contribute proportionately to the maintenance of the infrastructure their tools are degrading. The Tier 0 exemption ensures that startups and early-stage companies are unaffected. The AI deployment intensity weighting ensures that the heaviest obligations fall on the parties with the greatest capacity to bear them and the greatest connection to the harm. The alternative — unrestricted AI deployment that continues to degrade critical infrastructure until a systemic failure event produces emergency remediation costs far exceeding any contribution obligation — is the outcome that stifles industry.

7.2  ‘Determining what is “critical” is too arbitrary and creates capture risk’

This is a legitimate governance challenge, and it is answered by the IRC architecture rather than dismissed. The response rests on three elements.

First, the designation methodology is objective and auditable. Criticality is assessed against quantifiable metrics drawn from publicly available data sources, not political judgment. The IRC’s reasoning for every designation decision is published in full. Any organisation can challenge a designation through a formal process. The arbitrariness risk is mitigated by methodology transparency; the capture risk is mitigated by governance structure.

Second, the IRC’s independence is structural. No corporate employees and no government officials sit on the IRC. Neither the government observer seats nor the industry observer seat on the Operational Board carry voting rights on designation decisions. The membership selection process requires conflict-of-interest disclosure and operates through open nomination. None of these safeguards is individually perfect; collectively they make undue influence visible and resistible rather than merely prohibited.

Third, the relevant comparison is not between the proposed IRC and a theoretically perfect independent body, but between the proposed IRC and the current situation: no governance, no accountability, no transparency, and de facto designation by default — where ‘criticality’ is determined entirely by whether a project’s failure has already caused enough damage to attract attention. The IRC is substantially more robust than the status quo, and the paper should make that comparison plainly rather than defending against a standard of perfection that no regulatory body meets.

7.3 – Objection – ‘Companies in jurisdictions that implement this will be competitively disadvantaged relative to those in jurisdictions that do not.’

The EU Cyber Resilience Act and GDPR have established that extraterritorial reach is achievable for digital regulation. Currently, any company serving EU customers or operating EU infrastructure is subject to EU rules regardless of its country of incorporation. The Sovereign OSS Fund would function on the same basis. Beyond jurisdictions, the systemic risk argument inverts the competitive disadvantage claim. A catastrophic failure in critical open source infrastructure damages all participants in the digital economy equally, regardless of jurisdiction, just as the software itself may be written in one jurisdiction and used in all. Collective maintenance of that infrastructure is not a competitive disadvantage – it removes the disadvantage outright form the voluntary structure that incentivizes free-riders. At this point it simply is a precondition for the market continuing to function at all.

7.4 – Objection – ‘Governments cannot be trusted to manage technology infrastructure’

The fund design anticipates this objection structurally. The government’s role is limited to:

  • setting the legal framework and contribution obligation levels through the Statutory Framework Body
  • providing annual public audit and parliamentary accountability
  • participating as a non-voting observer on the Fund Operational Board.

Governments do not designate critical projects, do not control disbursement decisions, and do not have discretionary access to fund proceeds. The operational authority of the fund resides entirely with the IRC and the Fund Operational Board, operating under independent governance charters with full public transparency.

The objection is a valid one if it were directed at a different model, one in which government directly administers the fund. That is not the model proposed. The proposed model is closer to the governance of Let’s Encrypt or the Linux Foundation’s hosted projects: public benefit, independent operation, transparent governance, government engagement limited to the legal framework and accountability oversight that any significant financial mechanism requires.


Conclusion: A Duty of Care for the Digital Commons

Open source software is infrastructure. It carries the same systemic importance to the digital economy that roads, power grids, and water systems carry to the physical one. We do not expect roads to be maintained by volunteers on the assumption that the drivers who use them will contribute freely. We do not leave the maintenance of power infrastructure to the discretion of the industrial consumers whose operations depend on it. We recognize that shared infrastructure requires shared governance and shared funding, and we build the regulatory and institutional frameworks that make that possible. The open source commons has been operating without those frameworks for two decades, sustained by a volunteer community whose goodwill is finite.

Generative AI has not created this problem. It has arrived at a moment of particular structural vulnerability and added a new category of load to a system already under severe strain. The response to that addition cannot be to ask the volunteers who have been carrying the weight to carry more. It cannot be to convene another industry round table on voluntary commitments. It cannot be to wait for the next Log4Shell or the next XZ-Utils attack that does not fail, to focus governmental attention. The cost of that wait, in economic damage, national security exposure, and the loss of the infrastructure itself, will be orders of magnitude greater than the cost of the intervention this paper proposes.

The Sovereign OSS Fund is a proportionate response to a documented market failure. It does not restrict AI. It does not punish innovation. It does not place government in control of the digital commons. It requires that the organizations that have built their commercial success on open source foundations, and that are now deploying tools that accelerate those foundations’ deterioration, bear a share of the cost of maintaining what they depend upon. It creates an independent governance body capable of making that determination credibly and transparently, insulated from both the lobbying power of industry and the political incentives of government. And it frames the contribution not as a penalty but as what it is: the collective insurance premium for an infrastructure risk that every participant in the digital economy shares.

Time’s up. The governance gap in the digital commons has been visible for years, and the evidence that it is worsening is now substantial. Governments that continue to defer to voluntary industry action on this question are not being cautious; they are being negligent. The tools to address it exist. The regulatory precedents exist. The governance model exists. What has been missing is the political will to use them. This paper is an argument that the moment for that will has arrived, and that the cost of waiting for a better moment will be measured in infrastructure we no longer have.


Bibliography — Working References

Note: References updated in v0.2 to reflect completed primary GHArchive research. References [1]–[3] expanded to full Tidelift series; references [27]–[28] are new additions.

[1]  Tidelift. (2021). State of the Open Source Maintainer Report 2021. Tidelift Inc.

[2]  Tidelift. (2023). State of the Open Source Maintainer Report 2022/23. Tidelift Inc.

[3]  Tidelift. (2024). State of the Open Source Maintainer Report 2024. Tidelift Inc.

[4]  Greenstein, S. & Nagle, F. (2024). The Value of Open Source Software. Harvard Business School Working Paper.

[5]  Linux Foundation. (2024). Census II of Free and Open Source Software — Application Libraries. Linux Foundation / LABO.

[6]  Xu, Z. et al. (2025). Predicting Open Source Project Abandonment on GitHub. arXiv preprint arXiv:2507.21678.

[7]  Coelho, J. et al. (2019). GitHub Pull Requests Abandoned and Merged: A Large-Scale Study. arXiv preprint arXiv:1906.08058.

[8]  GitHub. (2020). Octoverse 2020: The State of Open Source. GitHub Inc.

[9]  GitHub. (2025). Octoverse 2025: The State of Open Source. GitHub Inc.

[10]  GitClear. (2024). Coding on Copilot: 2023 Data Suggests Coding Assistants Degrading Code Quality. GitClear Research.

[11]  Carnegie Mellon University. (2025). [Full citation pending primary source access.] Study of code quality degradation across 807 GitHub repositories adopting Cursor, 2024–2025.

[12]  CodeRabbit. (2024–2025). [Full citation pending primary source access.] Analysis of AI-generated vs human-authored pull request issue rates.

[13]  METR. (2025). Measuring the Impact of AI Coding Assistants on Developer Productivity: A Randomised Controlled Trial. METR Research.

[14]  Atlassian. (2025). State of Developer Experience 2025. Atlassian Corporation.

[15]  EY. (2025). Work Reimagined Survey 2025. Ernst & Young Global Limited.

[16]  IBM Institute for Business Value / Morning Consult. (2024). AI ROI: From Potential to Profit. IBM Corporation.

[17]  Apiiro. (2025). [Full citation pending primary source access.] Security findings analysis, Fortune 50, December 2024–2025.

[18]  Phylum. (2024–2025). [Full citation pending primary source access.] Open source malicious package analysis.

[19]  Georgetown CSET. (2024). AI-Enabled Supply Chain Attacks on Open Source Software. Center for Security and Emerging Technology.

[20]  OpenSSF. (2024–2025). [Full citation pending primary source access.] Security guidance on AI and supply chain risk.

[21]  Hardin, G. (1968). The Tragedy of the Commons. Science, 162(3859), 1243–1248.

[22]  Ostrom, E. (1990). Governing the Commons: The Evolution of Institutions for Collective Action. Cambridge University Press.

[23]  European Commission. (2024). Regulation (EU) 2024/2847 on horizontal cybersecurity requirements for products with digital elements (Cyber Resilience Act).

[24]  Executive Office of the President. (2021). Executive Order 14028 on Improving the Nation’s Cybersecurity. The White House.

[25]  The White House. (2023). National Cybersecurity Strategy. Office of the National Cyber Director.

[26]  Stack Overflow. (2025). Developer Survey 2025. Stack Overflow.

[27]  GHArchive / Google BigQuery. (2026). Primary analysis of open source contributor activity trends, 2015–2023. Original research conducted for this paper. Methodology and query documentation in supplementary materials.

[28]  Raman, N., Cao, M., Tsvetkov, Y., Kästner, C., & Vasilescu, B. (2020). Stress and burnout in open source: Toward finding, understanding, and mitigating unhealthy interactions. Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering: New Ideas and Emerging Results. https://doi.org/10.1145/3377816.3381732

Research Notes — Outstanding Actions for Further Research

Section 2 — Primary Research CompleteThe GHArchive BigQuery analysis has been completed covering 2015–2023 across a filtered set of high-longevity repositories. The 2024–2025 data will be incorporated in a subsequent revision. The forthcoming companion study tracking contributor migration versus exit is noted in Section 2.2 but is outside the scope of this paper.

•       Extend GHArchive analysis to 2024–2025 to confirm post-2023 trajectory.

•       Verify all quantitative figures from GHArchive primary research [27] against the supplementary data export before submission.

•       Obtain full text of Carnegie Mellon Cursor study [11] and CodeRabbit analysis [12] for direct citation.

•       Contact Xu et al. for year-disaggregated data from arXiv:2507.21678 as a cross-check against GHArchive primary findings.

•       Systematic case study compilation: expand Section 2.4 to a minimum of 10 named projects with documented governance changes in response to AI contribution burden.

•       Map EU Cyber Resilience Act SBOM obligations in detail to confirm they provide the dependency disclosure infrastructure the Tier 1 contribution model requires.

•       Identify case law or regulatory precedents for hypothecated levies applied to digital infrastructure in EU and UK jurisdictions.

•       Explore alignment of the critical infrastructure definition proposed in Section 6.6 with existing EU NIS2 Directive definitions and any divergences requiring explicit treatment.

•       Develop the maintenance cost model for disbursement calculations in sufficient operational detail to be credible: what does it cost to maintain a critical project at the required standard, and what data sources underpin the estimate?

•       Legal review of the IRC governance design: does the independence structure as proposed require specific treaty or statutory arrangements in the jurisdictions of intended implementation?

•       Design primary research survey instrument for planned 2026 survey. Core questions: do you currently contribute to open source; have you reduced contributions over time; did you switch projects or stop entirely; what drove the decision. Design to be repeatable annually to build a longitudinal series comparable to the Tidelift survey line.