Compute & Infrastructure
Top Line
NVIDIA confirmed Vera Rubin NVL72 is in full production at Computex 2026, with CoreWeave claiming the first live deployment delivered by Dell — marking the transition from announcement to operational infrastructure for NVIDIA's most powerful AI platform to date.
Alphabet is raising $80 billion in equity capital, including a Berkshire Hathaway investment, to fund AI infrastructure spending — the largest single capital raise explicitly tied to AI compute buildout by a hyperscaler, signalling that the financing scale required is now beyond normal operating cash flows.
The US Bureau of Industry and Security closed a significant export control loophole: Chinese-owned subsidiaries incorporated outside China can no longer purchase advanced AI chips, after reports that hundreds of thousands of chips were acquired through this gap — a material tightening of the semiconductor containment regime.
At least seven Chinese universities with confirmed ties to the People's Liberation Army and defense industry are actively procuring NVIDIA H200 chips via open tender, exposing the limits of export controls even for chips technically permitted for export to China.
NVIDIA unveiled RTX Spark, an Arm-based SoC combining a 20-core Grace CPU with a Blackwell GPU, entering the consumer PC market directly and challenging Intel's x86 dominance in the premium Windows segment at a critical moment following the expiration of Qualcomm's Windows-on-Arm exclusivity agreement.
Key Developments
Vera Rubin NVL72: From Production Confirmation to First Live Deployment
NVIDIA announced at Computex 2026 that the Vera Rubin platform is now in full production — a confirmed status upgrade from the previously speculative roadmap position. CoreWeave simultaneously claimed the first operational NVL72 rack, delivered by Dell Technologies, making it the first hyperscale-adjacent operator to deploy the platform commercially. ServeTheHome and Data Center Dynamics both confirmed these announcements independently.
The infrastructure ecosystem is already mobilising around Vera Rubin. Siemens, NVIDIA, and Fluence have jointly developed a reference electrical and power architecture covering the full power path for NVL72 deployments, and Supermicro demonstrated cooling solutions using a new dielectric coolant with 1,000 times higher electrical impedance than standard fluids — a direct response to the thermal density challenges of NVL72-class racks. Data Center Dynamics reported the Siemens-Fluence collaboration, while Tom's Hardware covered Supermicro's cooling innovations.
Capital Mobilisation at Scale: Alphabet's $80 Billion Raise and the AI Infrastructure Financing Inflection
Alphabet confirmed it is raising $80 billion through equity offerings, with Berkshire Hathaway among the investors, explicitly to fund AI infrastructure spending. Bloomberg reported the deal structure includes both public equity and a direct Berkshire investment, making this the largest explicitly AI-infrastructure-linked capital raise by any single company on record. This follows HPE's strong forward guidance driven by server demand, with HPE shares surging on an 18-month sales outlook that cited AI-driven server and networking growth as the primary driver. Bloomberg confirmed the HPE guidance.
The Anthropic IPO filing, confirmed by Bloomberg, frames the AI startup competitive dynamic explicitly around compute access — the filing positions capital raised as directly determinative of who wins the compute race. This reinforces the structural dynamic: access to capital is now a direct proxy for access to compute, given the concentration of supply in NVIDIA hardware. Meanwhile, a new industry report cited by Data Center Dynamics projects an additional 250GW of data centre capacity required under an aggressive but plausible demand scenario — a figure that dwarfs current buildout commitments and frames the Alphabet raise as a necessary but insufficient response.
Export Control Enforcement: Loophole Closure and Military-Linked Procurement Exposure
The US Bureau of Industry and Security issued a formal clarification extending export controls to Chinese-owned subsidiaries incorporated in third countries, closing a structural gap through which an estimated hundreds of thousands of advanced AI chips had been acquired. Tom's Hardware confirmed the BIS clarification and the scale of prior acquisitions. The clarification is administrative, not legislative, which means its enforcement will depend on BIS's capacity to investigate complex corporate ownership structures across multiple jurisdictions — a resource-intensive challenge.
Simultaneously, a procurement records review reported by Bloomberg identified at least seven Chinese universities with documented PLA and defence industry ties actively tendering for NVIDIA H200 chips — chips that the US has categorised as permissible for export to China. This creates a direct policy contradiction: H200s are export-legal, yet they are being sought by institutions with military procurement relationships. China's concurrent expansion of trade secret rules to cover data and algorithms, confirmed by Bloomberg, adds a defensive layer to its technology strategy — seeking to prevent outbound leakage of Chinese AI assets even as it seeks inbound access to foreign compute.
Connectivity as the Next AI Bottleneck: Marvell's Computex Positioning
Marvell CEO Matt Murphy declared at Computex that connectivity infrastructure — not compute silicon — is now the binding constraint on AI data centre performance, a claim amplified by Jensen Huang publicly calling Marvell the next trillion-dollar company. Data Center Dynamics reported both statements. This is strategically significant: Marvell's core products — custom ASICs, optical DSPs, and interconnect chiplets — sit at the exact bottleneck Huang is identifying. NVIDIA endorsing Marvell's centrality to the AI stack is also a signal that NVIDIA's ecosystem strategy now explicitly depends on non-NVIDIA silicon for the networking layer.
The connectivity bottleneck claim is consistent with the infrastructure reality emerging around Vera Rubin NVL72 deployments: at rack-scale power densities exceeding 100kW, the limiting factor shifts from raw compute to how fast data can move between GPUs, between racks, and between data centres. Custom silicon for this layer — which Marvell supplies to Google and Amazon among others — becomes a second chokepoint in the AI hardware supply chain alongside NVIDIA's GPUs themselves.
NVIDIA RTX Spark: Compute Concentration Extends into the PC Market
NVIDIA's RTX Spark superchip — a single package combining a 20-core Arm-based Grace CPU and a Blackwell GPU — represents NVIDIA's first direct entry into the consumer PC SoC market, launched at Computex alongside Microsoft's Surface Laptop Ultra as the first OEM device. ServeTheHome, Tom's Hardware, and The Verge all confirmed the product specifications and Microsoft partnership. The Surface Laptop Ultra ships with up to 128GB unified memory — double what any current Windows laptop offers — positioning RTX Spark as an on-device AI inference platform, not merely a productivity chip.
Intel's public response — characterised as a healthy dose of paranoia while publicly calling RTX Spark great for the market — signals genuine competitive concern. Intel's Xeon 6+ announcement at the same event is partly a response to NVIDIA's data centre encroachment, and RTX Spark extends that threat into Intel's core PC business. The expiration of Qualcomm's Windows-on-Arm exclusivity agreement created the opening; NVIDIA's Grace CPU architecture and Blackwell GPU integration is a materially more credible Arm-on-Windows platform than Qualcomm's Snapdragon X delivered.
Signals & Trends
Power Demand Is Outrunning Grid and Capital Planning Horizons Simultaneously
The projection of an additional 250GW of data centre capacity requirement — reported by Data Center Dynamics — coincides with Schroders Greencoat explicitly targeting data centre-linked renewable assets as an investment category. These two signals together indicate that capital markets are beginning to price AI power demand as a structural infrastructure theme, not a cyclical one. However, the gap between what grid infrastructure can deliver and what AI buildout requires is growing faster than either utility investment cycles or renewable project timelines can close. The Siemens-NVIDIA-Fluence reference architecture for Vera Rubin power systems is a direct acknowledgement that standard data centre electrical design is insufficient for next-generation rack densities — and that co-development of power infrastructure alongside compute hardware is now a prerequisite for deployment, not an afterthought.
AI Memory Costs Are Creating Divergent Outcomes Across the Hardware Supply Chain
GoPro's going-concern warning — attributed explicitly to AI-fuelled memory cost inflation — is the first public casualty of the AI-driven DRAM and NAND demand surge making its way into non-AI hardware supply chains. South Korea's equity market surging to sixth globally, driven by semiconductor heavyweights, reflects the same dynamic from the other side: the memory and logic chip producers are capturing outsized value while downstream consumers of commodity memory are being squeezed out. This bifurcation — extreme profitability at the semiconductor supply layer, cost crisis for hardware assemblers outside the AI premium segment — is likely to accelerate as HBM allocation continues to prioritise AI accelerator production over consumer electronics. The sub-2nm process challenges documented by Semiconductor Engineering add a further dimension: yield and variation problems at leading-edge nodes will extend the period of constrained advanced memory supply, keeping this cost pressure elevated.
The Export Control Perimeter Is Shifting From Products to Ownership Structures
The BIS loophole closure marks a qualitative shift in US semiconductor export control methodology: the unit of control is moving from the chip product specification to the beneficial ownership of the purchasing entity. This is significantly harder to administer and verify than product-based controls, but it is also harder to circumvent via straightforward re-routing. China's simultaneous expansion of trade secret rules to cover data and algorithms suggests Beijing anticipates a tightening external environment and is building defensive legal infrastructure around its existing AI assets. The practical implication for infrastructure professionals is that due diligence on cloud and colocation customers — and on the ownership chains of data centre operators — is becoming a compliance obligation, not just a commercial one, particularly for operators with facilities in Singapore, Malaysia, UAE, and other third-country markets that have historically served as transshipment points.
Explore Other Categories
Read detailed analysis in other strategic domains