Compute & Infrastructure
Top Line
Argentum AI has secured a $4.1bn AI cloud contract covering 27,000 Nvidia GB300 GPUs, signalling that large-scale, GPU-dense contracts are increasingly flowing to second-tier cloud providers rather than hyperscalers — a structural shift in how AI compute capacity is procured.
SpaceX has locked in a $6.3bn compute deal with AI startup Reflection for access to GB300s at its Colossus 2 data centre, confirming SpaceX's ambition to become a serious commercial AI infrastructure player alongside its aerospace operations.
Nvidia has announced a liquid cooling system operating at 45°C coolant temperature that claims up to 100% reduction in water consumption, directly targeting one of the most binding constraints on data centre expansion — but independent sustainability validation is still absent.
Groq has raised $650m in new growth capital for AI cloud expansion, reflecting sustained investor conviction in alternative inference hardware even as Cerebras disappointed markets with a weak 2026 revenue outlook, dropping ~10% after guidance.
Oracle confirmed 21,000 layoffs over fiscal year 2026, explicitly linking the cuts to internal AI deployment and the capital demands of its AI infrastructure buildout — a concrete example of how infrastructure ambition is reshaping enterprise workforce economics.
Key Developments
Mega-Contract Wave Validates GB300 Demand — But Concentration Risk Deepens
Two large compute contracts announced today collectively represent over $10bn in committed GPU infrastructure spend, both denominated in Nvidia GB300 hardware. Argentum AI's $4.1bn deal with an unnamed customer for 27,000 GB300s, reported by Data Centre Dynamics, and SpaceX's $6.3bn agreement with Reflection AI for GB300 access at Colossus 2, also reported by Data Centre Dynamics, together underscore that the GB300 Blackwell Ultra architecture is now the de facto unit of large-scale AI capacity procurement. The anonymity of Argentum's customer is a material unknown — whether this is a sovereign buyer, a large enterprise, or a new AI lab carries significantly different implications for market structure.
Both deals deepen the ecosystem's dependence on Nvidia as the sole credible supplier for contracts of this scale. No AMD, Intel, or custom silicon is referenced in either deal. This is not merely a market preference — it reflects supply chain reality, where TSMC's CoWoS advanced packaging for Nvidia's HBM-integrated GPUs remains the critical bottleneck. The concentration of two multi-billion-dollar contracts on a single GPU architecture, procured through non-hyperscale channels, also signals that the AI cloud market is fragmenting into a larger set of specialised infrastructure operators competing on GPU access rather than software differentiation.
Nvidia's High-Temperature Liquid Cooling Targets the Energy and Water Constraint Directly
Nvidia has announced a liquid cooling system that raises coolant inlet temperature to 45°C — above conventional direct liquid cooling thresholds — and claims to eliminate facility water consumption entirely by enabling dry cooling at the facility level, according to Tom's Hardware. The higher coolant temperature is strategically significant: it allows the rejected heat to be dissipated via air-cooled dry coolers rather than evaporative cooling towers, which is the primary mechanism by which hyperscale data centres consume millions of gallons of water annually. Reduced electricity consumption is a secondary claim, likely from eliminating chiller plant energy draw.
Tom's Hardware flags that sustainability challenges remain — a critical caveat. The claim of 100% water reduction applies to the cooling loop itself, but manufacturing the hardware, the embodied carbon of high-density GPU infrastructure, and the source of electricity powering the facility are unaddressed. This announcement should be read as a credible engineering advancement on operational water use, not a holistic sustainability solution. For infrastructure operators facing water-use restrictions in Arizona, Texas, and parts of Europe, this is a meaningful operational unlock — but regulatory and environmental scrutiny of data centre footprints is broadening, not narrowing.
Alternative AI Silicon: Groq Raises While Cerebras Disappoints — Market Bifurcation Emerging
Groq has closed $650m in new growth capital led by Disruptive and Infinitum for AI cloud expansion, per Data Centre Dynamics, while on the same day Cerebras dropped approximately 10% after issuing a 2026 revenue outlook that failed to demonstrate it is capturing a material share of the AI data centre market, per Bloomberg. These two datapoints together suggest the alternative inference silicon market is bifurcating: Groq, with its LPU architecture optimised for low-latency inference, retains investor conviction, while Cerebras — whose wafer-scale engine targets large model training — is struggling to translate architectural differentiation into commercial scale.
The Groq raise is a confirmed capital event; the Cerebras guidance disappointment is a confirmed market signal. Neither constitutes evidence that alternative silicon is displacing Nvidia at scale — the GB300 contract announcements elsewhere today make that clear. What it does indicate is that a viable inference-layer market exists for non-Nvidia architectures where latency and cost-per-token economics favour specialised designs. The strategic question is whether these players can secure long-term contracts before Nvidia's own inference-optimised products commoditise that window.
Asia-Pacific Data Centre M&A Accelerates as Institutional Capital Chases Regional AI Infrastructure
BlackRock-backed AIP, KKR, and Brookfield Asset Management are reportedly among bidders for Stack Infrastructure's Asia-Pacific data centre portfolio, according to Bloomberg. This is an announced process with named institutional participants — it is not speculative that a sale process is underway, though deal terms and ultimate buyer remain unconfirmed. The involvement of AIP — the BlackRock, Microsoft, and Global Infrastructure Partners joint vehicle explicitly created to fund AI infrastructure — alongside traditional infrastructure funds like KKR and Brookfield signals that Asia-Pacific AI compute capacity is now commanding the same institutional attention as North American and European buildout.
The strategic logic for buyers is straightforward: Asia-Pacific AI demand is growing rapidly, greenfield permitting and grid connection timelines in Singapore, Japan, and Australia are extending, and acquiring operational capacity with established power agreements is a faster path to AI-ready infrastructure than ground-up development. For Stack, this represents a potential exit or recapitalisation at a premium to pre-AI valuations. The outcome will set a pricing benchmark for Asia-Pacific AI data centre assets.
Signals & Trends
Vehicle-to-Grid and Distributed Battery Storage Are Being Positioned as AI Energy Infrastructure
CATL's Chief Manufacturing Officer, speaking at WEF Dalian, flagged two distinct propositions: deploying sodium-ion batteries for AI-related energy storage this year, and the prospect of harvesting computing power from idle electric vehicles, per Bloomberg. The sodium-ion deployment is a confirmed near-term commercial focus — relevant because sodium-ion avoids lithium supply chain dependencies and offers cost advantages for stationary grid storage, which is increasingly critical for data centres requiring behind-the-meter storage to manage grid constraints. The idle-EV compute concept is speculative and faces profound practical barriers, but its articulation by a senior executive at a global forum is a weak signal worth tracking: it reflects the industry's active search for distributed compute and energy buffers that bypass constrained utility infrastructure. The more immediate strategic implication is that battery chemistry diversification — away from lithium-dominated supply chains — is entering the AI infrastructure energy stack.
Forced Capital Substitution: AI Infrastructure Spend Is Cannibalising Enterprise Headcount at Scale
Oracle's confirmation of 21,000 layoffs in fiscal year 2026, explicitly attributed to AI automation and AI cloud infrastructure cost pressures, per Tom's Hardware, represents a concrete, large-scale instance of a pattern that infrastructure analysts should model more explicitly: the capital required to build and operate AI infrastructure is not additive to existing enterprise cost structures — it is being funded in part by displacing labour costs. Oracle's statement that layoffs will continue as internal AI deployment grows suggests this is a declared strategic posture, not a one-time restructuring. The implication for infrastructure demand forecasting is that enterprise AI infrastructure investment may be more durable than sentiment-driven models suggest, because it is structurally self-funding through workforce reduction. This dynamic also creates political risk: large-scale AI-linked layoffs at visible enterprise names will intensify regulatory scrutiny of AI adoption timelines and potentially create legislative friction for data centre permitting in jurisdictions where those workforces are concentrated.
Wide-Bandgap Power Electronics Are Emerging as a Quiet Chokepoint in Data Centre Power Delivery
AlpSemi's €17m seed raise for wide-bandgap power switches targeting AI data centres, led by Yotta Capital with participation from Navitas Semiconductor, per Data Centre Dynamics, is a small funding event with a disproportionate strategic signal. As GPU rack densities push toward and beyond 100kW per rack with GB300 and successor architectures, conventional silicon-based power conversion and distribution equipment hits efficiency and thermal limits. Wide-bandgap semiconductors — gallium nitride and silicon carbide — offer substantially higher switching frequencies and efficiency at elevated temperatures, enabling more compact, efficient power delivery. The involvement of Navitas, a GaN specialist, as a strategic investor signals this is not purely financial — it reflects supply chain positioning. Infrastructure operators and hyperscalers who are today focused on GPU supply should begin tracking power electronics supply chains as a second-order constraint: if wide-bandgap components cannot be manufactured at scale fast enough to match rack density increases, power delivery infrastructure becomes the binding constraint on dense AI data centre deployment.
Explore Other Categories
Read detailed analysis in other strategic domains