Europe Is Learning What AI Sovereignty Actually Costs

Europe Is Learning What AI Sovereignty Actually Costs

/ Maxim Starkweather

In March 2026, GPT-NL won the Dutch AI Award for Best Initiative. The award went to a project that, as of this month, has no publicly available model. You cannot download GPT-NL. You cannot query it through an API. If you check the project’s Hugging Face organization page, the models section reads: “None public yet.”

That’s not a failure. It’s the most honest accounting of what AI sovereignty actually requires that any European government project has produced in four years of trying.

The Netherlands started GPT-NL in 2024, funded by €13.5M from the Netherlands Enterprise Agency on behalf of the Ministry of Economic Affairs and Climate Policy. The project is led by TNO — the country’s national applied scientific research organization — alongside SURF, the Dutch research infrastructure cooperative, and NFI, the Netherlands Forensic Institute. Product Manager Saskia Lensink and R&D Manager Frank Brinkkemper are the named leads. The goal was to build a large language model from scratch, in-country, on verified clean data, governed by European law. Two years later, they have most of the pieces except the assembled product, and that sequence is the entire argument for why this project matters.

The EU’s AI strategy for the last three years has been written in the language of regulation. GDPR, the AI Act, transparency requirements for GPAI model providers, documentation obligations for training data. Europe got excellent at writing rules for other people’s technology. The question of whether it would build any of its own kept getting deferred to the next framework document. GPT-NL is the first serious answer from a member state, and what it reveals is that building beats regulating on the timeline that matters — but building correctly takes longer than any press release will admit.

What Two Years and €13.5M Built

The deliverables from GPT-NL are not glamorous. Five open-source repositories on GitHub under the Apache 2.0 license: a data-curation-pipeline, a data-extraction module, a set of curation utilities, project infrastructure, and a Wikidata synthetic data generator written in Rust that produces training examples from structured knowledge graphs. A tokenizer, released April 2026. And a corpus.

The corpus is the substantive artifact. The GPT-NL Public Corpus contains 302 million items across 32 data subsets. Its construction methodology is documented in an LREC 2026 paper by Jesse van Oort, Frank Brinkkemper, Erik de Graaf, Bram Vanroy, and Saskia Lensink, available at arXiv:2604.00920. The Dutch-language-specific content represents 36 billion tokens not previously available in any other pretraining corpus. The complete dataset is under CC-BY licensing — every item has documented legal permissions for commercial reuse.

The GPT-NL data pipeline: Apache 2.0 tools for building a clean, CC-BY licensed training corpus

That last sentence is not standard. Major LLM training corpora — Common Crawl, the Pile, the training stacks underlying the most powerful commercial models — contain scraped web data under legally contested terms. Publishers in the US and Europe have been filing suit over exactly this. GPT-NL’s corpus is the first large-scale Dutch-language pretraining dataset where provenance is systematically documented and licensing is clean. They built it that way because they had to: the Content Board governance structure gives data providers and rights holders direct input into what gets used. A portion of project revenues flows back to content creators.

As of Q1 2026, the project transitioned “from development to practical use” with a cohort of Launching Customers — selected Dutch organizations testing the model in actual workflows. The National Library of the Netherlands (KB) announced a feasibility study with GPT-NL in June 2026. A model exists in some form. It just isn’t public yet, and the project has been deliberate about that sequence: launching customers test it before the world does, which is how you find out what breaks before you’ve committed to a public release.

This is what €13.5M buys when you build correctly: a clean corpus, an open-source data pipeline, a governance structure that respects copyright holders, and an early deployment cohort that generates real feedback. Not a downloadable model, not a public API, not a ChatGPT competitor — but the infrastructure that makes a responsible model possible.

Sovereignty Is a Data Problem, Not a Model Problem

The obvious objection: Mistral exists. You can fine-tune Mistral or Llama on Dutch data for a fraction of €13.5M and get a capable Dutch-language model in three months instead of two years. Why spend the time?

The answer is in what you’re actually trying to own. If you fine-tune Mistral, you have a Dutch model trained on Mistral’s weights, which were derived from Mistral’s training corpus, which is largely Common Crawl under contested terms. You have no visibility into what behavioral patterns you inherited from that pretraining. Your model’s legal liability runs through Mistral’s corporate entity and its interpretation of French and European law. When a Dutch government ministry deploys it for public services — in courts, tax administration, health records — they’re trusting that Mistral’s training data didn’t create legal exposure under Dutch law. They cannot verify that trust because Mistral hasn’t disclosed its pretraining corpus at a level of detail that makes verification possible.

GPT-NL’s entire architecture is built around making that trust unnecessary. The data pipeline is open source and auditable. The corpus licensing is documented item by item. The training process starts from scratch rather than inheriting another organization’s choices. When the model runs in Dutch public services, the deploying organization can answer, under Dutch law, exactly what data the model was trained on and under what terms. That answer is legally significant in a way that “we fine-tuned someone else’s model” is not.

This is the argument the EU has been making theoretically since the GDPR: that data governance has intrinsic value independent of the technical capability of the system trained on that data. GPT-NL is the first attempt to engineer that argument into an actual model-building project rather than a compliance checklist. The Content Board — giving rights holders a structural role in what data gets included — is a governance innovation, not a technical one. The Wikidata synthetic data generator uses structured factual knowledge to bootstrap training examples, reducing dependence on scraped text. The revenue-sharing mechanism creates a sustainable pipeline for future data collection. None of this shows up on a benchmark.

The scale gap: €13.5M of sovereign infrastructure, measured against what frontier training actually costs

The EU AI Act requires providers of powerful GPAI models to document training data and make it available for copyright compliance review. That requirement assumes you have the documentation. If you built on someone else’s undisclosed corpus, you may not. GPT-NL has the documentation because documentation was the requirement they designed toward from the start. The regulatory compliance is a byproduct of the governance architecture, not a box checked afterward.

The Gap Between Ambition and Scale

None of this changes the scale problem. The GPT-NL corpus includes 207 billion English tokens, 232 billion code tokens, 48 billion German and Danish tokens, and 36 billion Dutch-specific tokens. This is a substantial corpus for a model in the 7B–13B parameter range. It is not sufficient to train a frontier-class model competitive with the systems that major AI labs have been building — systems trained on trillions of tokens using compute clusters that cost hundreds of millions per training run. The order-of-magnitude difference isn’t a policy problem or a governance problem. It’s a money problem.

A model trained on this corpus at this scale will be a genuinely useful regional tool. It will handle document classification, translation assistance, internal knowledge retrieval, and the structured text generation tasks that Dutch public institutions actually need. What it will not do is replace the frontier models that European enterprise teams reach for when they need the highest available capability. GPT-NL cannot compete with the current generation of commercial frontier models on agentic coding, autonomous research, or complex multi-step reasoning. Calling it “sovereign” doesn’t change that.

The counter-argument is that this is the wrong comparison. GPT-NL isn’t competing with frontier commercial models — it’s building the infrastructure precondition for whatever Europe decides to train next. The data pipeline is modular and scalable. The corpus methodology is documented and replicable. The governance structures are in place. If the EU funds a significantly larger sovereign AI infrastructure initiative, GPT-NL provides the architecture to build it on. The €13.5M isn’t the model budget; it’s the proof-of-concept budget.

That’s a real distinction. It requires the EU to follow through, which is a large assumption. European infrastructure projects have a history of producing exactly what they promised: careful, well-documented, well-governed artifacts that no one scales because the next funding cycle didn’t materialize. GPT-NL’s proof of concept is only valuable if someone decides it proved something worth scaling. That decision hasn’t been announced. No one has committed to the next phase.

The Netherlands has demonstrated that AI sovereignty is a buildable thing — not a political aspiration or a regulatory framework, but an engineering project with deliverables: a corpus, a pipeline, a governance structure, a deployment cohort, and eventually a model. The precedent exists now. It wasn’t obvious that a national research organization, working at government-program scale, could build a clean large-scale training corpus with full provenance documentation and open-source the toolchain. They did. Two years, €13.5M, no shortcuts, and no model to download yet.

What comes next depends on whether the EU reads this as a proof of concept or an endpoint. At €13.5M, it’s the beginning of an argument. At €1.3 billion, it becomes the argument. The infrastructure is there. The political will to fund it at the scale that would actually matter — that’s the only remaining variable, and the EU’s record on that particular question is not encouraging.

AI-generated editorial image

AI-generated editorial illustration · TemperatureZero · June 17, 2026

Keep reading the signal

Get the Daily Signal — a concise briefing on what actually matters in AI and the systems around it.

Subscribe Free

Continue the archive

Latest BriefingsArticlesAbout Temperature Zero