Swiss researchers just released one of the most transparent large language models yet. Called Apertus, it comes with everything: training data, code, weights, methods, alignment details. All documented. All reproducible.
Developed through the Swiss AI Initiative by teams at EPFL, ETH Zurich and the Swiss National Supercomputing Centre, the model launched on September 2, 2025. It arrives in 8-billion and 70-billion parameter versions. Both trained on more than 15 trillion tokens spanning over 1,000 languages. Forty percent of that data is non-English.
The name means “open” in Latin. The tagline on the project site puts it plainly: “Apertus is to AI as Open is to Source.” (Apertvs.ai)
This isn’t another corporate model with partial weights or vague promises. The full process stands open for inspection. That includes how the team filtered data to respect opt-out signals from websites, strip personal information and avoid unwanted memorization. Such steps align with EU AI Act requirements and Swiss data rules.
“We aim to provide a blueprint for how a trustworthy, sovereign, and inclusive AI model can be developed,” said Martin Jaggi, professor of machine learning at EPFL and member of the Swiss AI Initiative steering committee. The quote comes from the official ETH Zurich press release. (ETH Zurich)
Performance holds its own. The models compete with leading open alternatives at their respective scales. Multilingual support starts from the first version rather than as an afterthought. Low-resource languages gain real representation. Swiss German and Romansh sit alongside global tongues.
But the deeper point lies in compliance at scale. Most open models skip rigorous data hygiene. Apertus builds it in. The training corpus draws only from publicly available material. Filters catch machine-readable opt-outs, even those applied after initial collection. Personal data gets removed. Memorization risks drop.
The technical paper lays this out in detail. Titled “Apertus: Democratizing Open and Compliant LLMs for Global Language Environments,” it lists dozens of contributors from the collaborating institutions. The arXiv preprint spells out the two main gaps the project targets: data compliance and multilingual coverage. (arXiv)
Access comes easy for those ready to experiment. Download weights from Hugging Face. Chat with the model through Public AI’s free interface. Swisscom offers it on its sovereign platform for business users. The telecom provider acts as strategic partner to the initiative.
“Apertus is built for the public good. It stands among the few fully open LLMs at this scale and is the first of its kind to embody multilingualism, transparency, and compliance as foundational design principles,” added Imanol Schlag, technical lead of the project and research scientist at ETH Zurich. (ETH Zurich)
Swisscom echoed the sentiment. “Swisscom is proud to be among the first to deploy this pioneering large language model on our sovereign Swiss AI Platform,” said Daniel Dobos, research director at the company. The deployment ties into broader Swiss {ai} Weeks events that invite developers to test and build. (ETH Zurich)
From Foundation Model to Practical Tools
Early follow-up work shows the model’s flexibility. In March 2026 the team released a fine-tuned version for Ticino, the Italian-speaking Swiss region. It handles local translation needs. Then came the paper acceptance at ACL 2026. By mid-June the project dropped Apertus Mini: a collection of 16 small language models that demonstrate distillation and quantization methods. These compact versions maintain surprising capability while shrinking size. All updates appear on the official site. (Apertvs.ai)
Discussion on Hacker News lit up quickly after the initial release. Some praised the public-institution backing and full transparency. Others questioned whether the performance truly matches the hype when stacked against specialized models. One commenter noted that while the recipe looks perfect, real-world taste matters more. Threads on Reddit’s r/LocalLLaMA highlighted the 40%+ non-English training share as rare among major open releases.
Joshua Tan, lead maintainer of the Public AI Inference Utility, positioned the model as infrastructure. “Currently, Apertus is the leading public AI model: a model built by public institutions, for the public interest. It is our best proof yet that AI can be a form of public infrastructure like highways, water, or electricity.” His comment appears in the ETH release. (ETH Zurich)
Antoine Bosselut, professor and head of the Natural Language Processing Laboratory at EPFL, framed the launch differently. “The release of Apertus is not a final step, rather it’s the beginning of a journey, a long-term commitment to open, trustworthy, and sovereign AI foundations, for the public good worldwide.” He expects future versions to tackle law, climate, health and education domains while preserving openness. (ETH Zurich)
Thomas Schulthess, director of CSCS and professor at ETH Zurich, stressed the innovation angle. “Apertus is not a conventional case of technology transfer from research to product. Instead, we see it as a driver of innovation and a means of strengthening AI expertise across research, society and industry.” The project consumed more than 10 million GPU hours on CSCS’s Alps system, backed by ETH Board funding and Swisscom contributions. (ETH Zurich)
Recent chatter on X shows continued interest. Posts from June 2026 point back to the project site and Hacker News thread. Some users experiment with the 8B instruct version for local tasks. Others compare it directly against newer commercial offerings. The conversation remains lively nine months after launch.
What sets Apertus apart isn’t raw benchmark scores alone. Plenty of open models chase those. The combination does: full openness, baked-in compliance, genuine multilingual breadth and institutional backing that avoids single-company control. Enterprises wary of data leaks or regulatory fines notice. Researchers tired of black-box systems gain a canvas they can actually study.
Of course challenges remain. Running a 70B model demands serious hardware. Fine-tuning for specific industries takes expertise. And no model escapes the limits of its training data. Yet by publishing intermediate checkpoints, training code and detailed reports, the Swiss team invites the community to improve the work rather than start from scratch.
Liip, a Swiss digital agency, published a practical guide shortly after launch. It outlined four ways to try the model: through Public AI, with RAG setups, local downloads and more. Their post captured early excitement about sovereign and ethical AI. (Liip)
The Wikipedia entry, updated over time, confirms the Apache 2.0 license and core facts. It notes the model’s design for business and research use cases worldwide. (Wikipedia)
In the end, Apertus offers a different vision. One where AI foundations don’t concentrate power in a handful of labs. One where compliance isn’t an afterthought. One where languages beyond English get equal footing from day one. Whether it becomes the base for tomorrow’s most important applications remains to be seen. The ingredients, however, sit there for anyone to examine, modify and build upon.


WebProNews is an iEntry Publication