DeepTech India - Leading Deep Technology News & Insights

Two events, ten weeks apart, defined Sarvam AI's 2026. On 6 March, at the India AI Impact Summit, the Bengaluru lab released two open-weight large language models under the permissive Apache-2.0 licence. On 15 June, it closed the first tranche of a $234M Series B at a $1.5B valuation, becoming India's 130th unicorn. The sequence matters more than either milestone alone. Sarvam first proved that frontier-grade models could be trained on subsidised domestic compute, then converted that proof into the kind of capital that, until now, had bypassed Indian foundation-model labs entirely.

For a country that has spent two years debating whether it should build its own large models or simply consume American and Chinese ones, Sarvam is the most concrete answer yet to the sovereign-AI question.

The models: MoE economics meet open weights

The headline release was a pair. Sarvam-30B is a mixture-of-experts (MoE) model with 30 billion total parameters but only 2.4 billion active per forward pass, routing each token through a subset of 128 experts. Sarvam-105B is a larger, reasoning-tuned dense-equivalent model trained on 12 trillion tokens; the 30B was trained on 16 trillion.

The MoE design is the commercially interesting choice. In a dense 30B model, every parameter fires for every token, so inference cost scales with the full parameter count. An MoE activates only the relevant experts, so Sarvam-30B carries the knowledge capacity of a 30B model while costing roughly what a 2.4B dense model costs to serve. For an Indian deployment context, where per-query margins are thin and volumes are high, that gap between total and active parameters is the difference between a model that can be served profitably and one that cannot. It is the same architectural bet that underpins the strongest open models globally.

On benchmarks, the 105B reasoning model is competitive where it counts for enterprise and education use cases. Sarvam reports 98.6 on Math500 and 88.3 on AIME-25, the American Invitational Mathematics Examination set that has become a standard probe for multi-step reasoning. Both models span 22 languages across 12 scripts, which is the actual product thesis: a model that handles Tamil, Bengali, Marathi and Hindi script-switching natively is far more useful in India than a marginally higher MMLU score in English.

Releasing all of this under Apache-2.0, rather than a restricted research licence, is a deliberate ecosystem move. Apache-2.0 permits commercial use, modification and redistribution with no copyleft obligation, which means startups, banks and government departments can fine-tune and ship Sarvam derivatives without licensing friction. The play is to make Sarvam the default substrate for Indian-language AI, then monetise the layers above it: hosted inference, enterprise fine-tuning, and the agentic and voice products the company has been building.

The compute moat: trained on India's subsidised GPUs

The most strategically significant fact about these models is where they were trained. Both were built entirely on IndiaAI Mission compute, with the government providing ₹246.72 crore of support and 4,096 NVIDIA H100 GPUs for six months. At commercial cloud rates of roughly $2 to $4 per GPU-hour, a six-month allocation of that scale would run well into the tens of millions of dollars. Through the subsidised IndiaAI pool, the effective cost collapses.

This is the lever. India's industrial-policy bet is that cheap, state-backed compute lowers the capital barrier to frontier training enough that domestic labs can reach the frontier without raising hundreds of millions purely to rent GPUs. Sarvam is the first full validation of that thesis at the 100B-parameter scale. The compute grant functions as a moat in two directions: it let Sarvam train models a self-funded startup could not have afforded, and it ties the most ambitious Indian model work to domestic infrastructure rather than US hyperscalers.

The honest caveat is that this moat is policy-dependent. The economics work because the state is absorbing the compute bill; they do not yet demonstrate that Sarvam can train its next generation at full market cost and still clear a venture return. That question is what the Series B capital is meant to answer.

The unicorn round: IT services meets the frontier lab

On 15 June, Sarvam announced the first close of a $234M Series B at a $1.5B valuation, led by HCLTech with $150.7M for roughly 10.5%, alongside existing backers Bessemer Venture Partners, Khosla Ventures and Peak XV. It is the structure of this round, not just the size, that signals a shift.

HCLTech is not a generalist crossover fund chasing AI; it is a top-tier Indian IT-services company writing the anchor cheque into a frontier-model lab. That is the convergence thesis made explicit. India's services giants have decades of enterprise distribution and deep client relationships but have historically lacked frontier IP. Sarvam has the IP but lacks distribution. An HCLTech-anchored Sarvam can, in principle, embed sovereign Indian models directly into the enterprise and government engagements that the services majors already own. For investors, the moat is no longer just the technology; it is the channel.

At a $1.5B valuation against a $234M first close, the round prices Sarvam as a platform bet rather than a model-of-the-week. The comparable that matters is what it is not: the global pattern of foundation-model labs raising at $10B-plus valuations on the strength of consumer hype. This is a more grounded number, tied to a defined enterprise route to market.

The contrast that proves the thesis: Krutrim's retreat

Sarvam's trajectory is best understood against Krutrim, Ola founder Bhavish Aggarwal's AI venture and India's first GenAI unicorn at a $1B valuation. In May 2026, Krutrim paused its in-house chip-design and frontier-model programmes, pivoted to an enterprise AI-cloud business, cut more than 200 roles and withdrew its Kruti consumer app. Its last released model, Krutrim-2, was a 12B dense model.

The two stories rhyme deliberately. Krutrim attempted to do everything, custom silicon, frontier models, a consumer app, largely on its own balance sheet, and found the unit economics of pure foundation-model building unsustainable. Sarvam concentrated on open-weight models trained on subsidised state compute, then attached itself to a distribution-rich strategic backer. One retreated from the frontier; the other reached unicorn status on it within the same quarter. That divergence is the strongest available evidence for the specific path India is backing: subsidised compute, plus open weights, plus enterprise verticalisation, rather than balance-sheet-funded full-stack ambition.

Investor takeaways and the risks worth pricing

The bull case is clean. Sarvam owns the leading open-weight Indian-language model stack, trained at a fraction of market compute cost, with an Apache-2.0 ecosystem strategy and an IT-services anchor providing distribution. If even a modest share of Indian enterprise and government AI workloads standardises on sovereign models, the addressable market is large and structurally protected by language, data-residency and procurement preferences that favour a domestic player.

The risks are equally concrete. The compute moat depends on continued state subsidy; benchmark scores are largely self-reported and need independent replication; open weights mean the models themselves are not the monetisable asset, so the entire return rests on Sarvam executing the services and inference layers above them. And the frontier moves fast: a 105B model that is competitive in March may be table stakes by year-end.

What Sarvam has demonstrated is narrower but real. A sovereign frontier model can be built on Indian compute, and the capital markets, with an Indian services major in the lead, will now fund the result. For the dozen other labs in India's sovereign-model cohort, that is the template to beat.

Sarvam's Sovereign-AI Stack: From Open-Weights Frontier Models to Unicorn Status

The models: MoE economics meet open weights

The compute moat: trained on India's subsidised GPUs

The unicorn round: IT services meets the frontier lab

The contrast that proves the thesis: Krutrim's retreat

Investor takeaways and the risks worth pricing

Tags

More from Artificial Intelligence

SAP Labs India's 2026 Studio Bets on Agentic AI and Quantum-Safe Startups

Detect Technologies Lands TotalEnergies, Exporting Indian Industrial AI