Compute & Infrastructure

11 sources analysed to give you today's brief

Top Line

A Thailand-based smuggling network allegedly routed restricted Nvidia AI GPUs through a government entity to Chinese buyers including Alibaba, representing a direct breach of US export controls and exposing a critical enforcement gap in the semiconductor supply chain.

CoreWeave issued a below-expectations revenue forecast for Q2 2026 despite heavy data centre spending, raising structural questions about whether GPU-cloud operators can convert massive capex into durable margin as rental pricing softens.

ByteDance has raised its 2026 AI infrastructure budget by 25% to 200 billion yuan ($29.4 billion), a signal that Chinese hyperscalers are accelerating domestic compute buildout even as US export restrictions tighten and memory costs rise.

Maryland is formally contesting a $2 billion grid upgrade cost allocation by PJM, crystallising the emerging regulatory and political flashpoint over who pays for transmission infrastructure required by AI data centres.

Anthropic has contracted capacity at xAI's Colossus 1 data centre in Memphis, adding confirmed inference and training capacity at a facility already notable for its scale and Elon Musk association.

Key Developments

Nvidia GPU Smuggling Network Exposed: Thailand Route to China

Executives connected to Supermicro allegedly constructed a diversion network using a Thai government entity to export restricted Nvidia AI accelerators to Chinese customers, with Alibaba named as a recipient, according to reporting by Tom's Hardware. This is the most operationally sophisticated circumvention route disclosed to date, going beyond individual grey-market resellers to implicate a state-adjacent institution as a transshipment node.

The exposure of this network has several compounding implications. First, it validates long-standing concerns among export control analysts that entity-list restrictions are being systematically gamed through Southeast Asian intermediaries — a pattern US Commerce has been attempting to close via the diffusion rule and additional country-level licensing requirements. Second, Supermicro's involvement, even if limited to executive-level actors rather than corporate policy, adds to the company's already strained compliance record. Third, if Alibaba received restricted H100-class hardware, Chinese frontier model training capacity may be materially higher than US intelligence estimates based on sanctioned trade data. Enforcement action and likely additional regulatory pressure on SE Asian transshipment hubs should be expected.

Why it matters

This incident demonstrates that hardware export controls — the primary US mechanism for constraining Chinese AI capability — have a confirmed, operationally mature evasion pathway, undermining the strategic premise of the entire semiconductor export regime.

What to watch

Whether the Bureau of Industry and Security pursues criminal referrals against named individuals, and whether Thailand tightens re-export controls under US diplomatic pressure — outcomes that will determine whether Southeast Asia remains a viable transshipment corridor.

CoreWeave's Growth Slowdown Signals GPU-Cloud Unit Economics Under Pressure

CoreWeave posted wider losses and issued a disappointing Q2 2026 revenue forecast, according to Bloomberg, despite continuing to spend aggressively on data centre capacity. The company's model — borrow capital, lease Nvidia GPUs at scale, resell compute on flexible contracts — is sensitive to two variables now moving adversely: the cost of capital has not returned to 2021 levels, and customers are negotiating harder on GPU rental rates as the H100 spot market has softened.
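The sensitivity to those two variables can be made concrete with a back-of-the-envelope margin sketch. Every figure below — GPU price, financing rate, useful life, utilisation, hourly rates, opex — is a hypothetical illustration chosen for plausibility, not CoreWeave's actual economics:

```python
# Illustrative sketch of leveraged GPU-cloud unit economics.
# All numbers are hypothetical assumptions, not reported figures.

def gpu_hour_margin(gpu_cost, financing_rate, useful_life_years,
                    utilisation, hourly_rate, opex_per_hour):
    """Rough per-GPU-hour margin: revenue minus capital and operating cost."""
    billable_hours = 8760 * utilisation  # hours per year actually sold
    # Annual capital cost: interest on the financed GPU plus straight-line depreciation.
    annual_capital_cost = gpu_cost * (financing_rate + 1 / useful_life_years)
    cost_per_hour = annual_capital_cost / billable_hours + opex_per_hour
    return hourly_rate - cost_per_hour

# Hypothetical H100-class inputs: $30k per GPU, 4-year life, 70% utilisation.
favourable = gpu_hour_margin(30_000, financing_rate=0.06, useful_life_years=4,
                             utilisation=0.70, hourly_rate=2.50, opex_per_hour=0.40)
adverse = gpu_hour_margin(30_000, financing_rate=0.11, useful_life_years=4,
                          utilisation=0.70, hourly_rate=1.60, opex_per_hour=0.40)
print(f"margin at cheap capital, firm pricing:  ${favourable:.2f}/GPU-hour")
print(f"margin at dear capital, soft pricing:   ${adverse:.2f}/GPU-hour")
```

Under these assumed terms, the combination of a higher financing rate and a softer rental rate flips the same fleet from positive to negative per-hour margin — the mechanic behind the question of whether capex converts into durable margin.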

This is a structurally important data point because CoreWeave was the canonical proof-of-concept for the independent GPU cloud model. If its growth is decelerating while capex remains elevated, it raises questions about whether the addressable market for non-hyperscaler GPU compute is as deep as venture and debt markets assumed at IPO. The contrast with ByteDance's 25% capex increase is instructive: hyperscale and large model labs are consuming compute at scale, but they are increasingly doing so through direct infrastructure ownership or bespoke deals — not through CoreWeave's spot and reserved capacity products.

Why it matters

CoreWeave's guidance miss is an early stress indicator for the independent GPU cloud segment, which absorbed tens of billions in debt financing on the assumption that AI compute demand would outpace hyperscaler build rates indefinitely.

What to watch

Customer contract renewal rates and whether Microsoft — CoreWeave's anchor customer accounting for a large share of revenue — extends or renegotiates its committed capacity agreements over the next two quarters.

Energy Cost Socialisation Becomes a Political Fight: Maryland vs. PJM

Maryland has filed a formal complaint with federal energy regulators against PJM's proposal to allocate $2 billion in grid upgrade costs to the state, arguing that the transmission infrastructure primarily serves data centres located in neighbouring Virginia and other jurisdictions, not Maryland ratepayers, according to Tom's Hardware. This is not an isolated grievance — it is the first major inter-state regulatory confrontation over how to allocate the transmission costs triggered by the data centre load boom concentrated in Northern Virginia.

The structural problem is that PJM's cost allocation methodology was designed for a geographically diffuse load growth model, not for the hyperconcentrated demand spikes created by multi-gigawatt data centre campuses. States that host the data centres capture property taxes and economic activity; states that sit downstream on the grid absorb transmission costs. Without a federal-level resolution of this allocation framework, similar disputes are likely across every major grid operator territory where data centre demand is growing — MISO, ERCOT, and SPP included. This regulatory friction will not stop buildout, but it will slow transmission permitting and add cost uncertainty to grid interconnection timelines.

Why it matters

The Maryland-PJM dispute establishes a legal and political template for challenging data centre-driven grid cost socialisation, and a FERC ruling in Maryland's favour would force a fundamental rethinking of how transmission upgrade costs are allocated across the PJM footprint.

What to watch

FERC's response to Maryland's complaint and whether other states file similar challenges, which would create regulatory pressure to revise PJM's cost allocation tariff — a process that could take years and create material uncertainty for data centre interconnection timelines in the Mid-Atlantic.

Sovereign AI Cloud and Domestic Compute: UK and China Move in Parallel

Argyll and SambaNova have launched a sovereign AI cloud in the UK, fulfilling a partnership first announced in October 2025, according to Data Centre Dynamics. SambaNova's architecture — purpose-built inference hardware rather than Nvidia GPUs — is a deliberate sovereign compute differentiator, allowing UK government and enterprise customers to process sensitive data on infrastructure not subject to US hardware export dependencies or hyperscaler terms of service.

Simultaneously, ByteDance's confirmed 25% capex increase to 200 billion yuan for 2026, reported by Bloomberg citing the South China Morning Post, reflects Beijing's implicit policy of encouraging domestic tech giants to build out China's internal AI compute base. Rising memory chip costs are noted as a driver of the budget increase, suggesting that even well-resourced Chinese operators are absorbing commodity component inflation. Together, these two developments — a mid-tier sovereign cloud in the UK and a hyperscale domestic buildout in China — illustrate the bifurcating global compute investment landscape, where geopolitical alignment increasingly determines infrastructure architecture choices.

Why it matters

The parallel emergence of sovereign cloud offerings in allied nations and accelerated domestic buildout in China signals that the AI infrastructure market is fragmenting along geopolitical lines, creating durable demand for non-Nvidia, non-US-cloud compute stacks.

What to watch

Whether the UK government formalises procurement commitments to the Argyll-SambaNova platform, which would determine whether this is a niche offering or the seed of a meaningful alternative to AWS and Azure for UK public sector AI workloads.

Advanced Packaging and Community Opposition: Two Structural Constraints on Buildout

On the hardware supply side, the emergence of hybrid 3.5D packaging — combining 2.5D interposer approaches with vertical 3D stacking — reflects the semiconductor industry's attempt to extract more performance per die without waiting for next-node lithography advances, as analysed by Semiconductor Engineering. For AI accelerator roadmaps, this matters because CoWoS capacity at TSMC remains a binding constraint on HBM-to-GPU bandwidth; 3.5D architectures could relieve some of that pressure by enabling denser integration without requiring entirely new packaging lines.

On the demand side, an Ipsos survey cited by Tom's Hardware found that 47% of Americans oppose data centre construction near their communities — a figure that directly translates into zoning opposition, extended permitting timelines, and site selection constraints. Community opposition is not a new phenomenon for energy infrastructure, but at this scale and with this level of political salience, it creates meaningful friction for operators trying to site gigawatt-scale campuses in suburban and semi-rural areas with existing grid access.

Why it matters

Advanced packaging innovation and community opposition represent opposite ends of the infrastructure bottleneck problem: the former offers a technical path to higher compute density per facility, while the latter constrains the number of facilities that can be built at all.

What to watch

TSMC's CoWoS capacity expansion timeline through 2027, and whether state-level preemption legislation — already proposed in several US states — overrides local zoning authority for data centre siting.

Signals & Trends

Export Control Evasion Is Scaling: Southeast Asia Is Now a Systemic Transshipment Layer

The Thailand-Supermicro-Alibaba case is not an anomaly — it is evidence that export control circumvention has matured from opportunistic grey-market arbitrage into structured, multi-node networks with institutional cover. The pattern — acquire hardware in a nominally compliant jurisdiction, route through a government-adjacent entity, deliver to a sanctioned end user — is replicable across Malaysia, the UAE, and other jurisdictions with less rigorous re-export enforcement. US Commerce's response has been to expand the entity list and tighten country-level licensing, but these are reactive measures. The signal for infrastructure professionals: any estimate of China's restricted AI hardware stock based solely on official trade data should be treated as a floor, not a ceiling, with the gap growing over time.

The Independent GPU Cloud Operator Model Is Entering a Stress Test

CoreWeave's guidance miss, set against ByteDance's $29 billion capex commitment and Anthropic's deal for xAI's Colossus 1 capacity, reveals a bifurcation in how AI compute is being consumed. The largest model developers and the best-capitalised hyperscalers are either building proprietary infrastructure or signing bespoke, long-duration capacity deals directly with facility operators — bypassing the independent GPU cloud layer. This structural shift compresses the addressable market for operators like CoreWeave, Lambda, and their peers, who built their businesses on the assumption that model developers would prefer flexible, metered compute. The risk is a wave of GPU cloud operators that are highly leveraged, facing softer pricing, and competing for a shrinking pool of customers willing to pay premium rates for flexibility they increasingly don't need.

Grid Cost Politicisation Will Reshape Data Centre Geography

Maryland's FERC complaint is the first formal regulatory salvo in what will become a multi-state battle over transmission cost socialisation. The underlying dynamic — data centre load growth so geographically concentrated that it creates transmission upgrade requirements spilling across grid operator boundaries — is not solvable by individual state action. But the political pressure it generates will influence where operators site future facilities. States that can offer both power availability and insulation from cross-border cost allocation disputes will gain a siting premium. This points toward jurisdictions with dedicated generation assets (large hydro in the Pacific Northwest, nuclear in the Southeast), or states with pre-negotiated data centre-specific tariff structures, emerging as the next generation of preferred buildout locations — at the expense of the historically dominant Northern Virginia corridor.
