Skip to content
AI-Daily-Builder

Tags · #semiconductors

NVIDIA Pushes Memory Makers to 16-High HBM4 by Q4 2026, Raising the Bar for AI Accelerator Supply

Read this because The constraint on AI compute is migrating from logic to memory stacking. Whoever solves 16-high bonding first controls a chokepoint that every frontier GPU now depends on.

NVIDIA asked SK hynix, Samsung and Micron for 16-high HBM4 by Q4 2026 — a stacking jump that makes advanced packaging the AI supply-chain bottleneck.

NVIDIA and TSMC Push AI Deep Into the Fabs: cuLitho Cuts Lithography Cost 20-50%, FabTwin Goes Digital

Read this because Not the capex headline: NVIDIA is selling compute into the supply chain that builds NVIDIA's own chips. cuLitho's 20-50% litho cost/cycle-time cut is the load-bearing number — it gates how fast and cheaply sub-2nm wafers reach volume. A vertical loop.

At GTC Taipei, TSMC adopted NVIDIA's CUDA-X stack across lithography, simulation and inspection, with cuLitho cutting litho cost up to 50%.

Micron joins the $1 trillion club as HBM stops being a commodity

Read this because The number to watch isn't the trillion — it's the word "structural." UBS's 204% target hike rests on the thesis that AI memory has de-cyclicalized: HBM4 sold out, DRAM up 58-63%, pricing locked through 2029. If memory has escaped its cycle, the entire semi playbook changes.

Micron crossed a $1T market cap on May 26 — the 5th chipmaker — after UBS tripled its target to $1,625 and 2026 HBM4 capacity sold out under long-term deals.

AMD's 256-core EPYC "Venice" is the first HPC chip to ramp on TSMC 2nm

Read this because Everyone watches the GPU. But AI clusters still need a host CPU to feed them, and AMD just put a 256-core server part on the most advanced node before anyone else. The lever here is efficiency at the power wall — and a quiet Arizona on-shoring story riding alongside it.

AMD is ramping EPYC "Venice" on TSMC 2nm (May 21) — a 256-core/512-thread part, the industry's first HPC product on the node, 70%+ uplift over Turin.

NVIDIA Q1 FY27: $81.6B revenue (+85%), data center nearly doubles, Q2 guide $91B

Read this because The number that resets the AI-capex debate: Q2 guide of $91B EXCLUDING China, above a $86B consensus. The bear case needed a demand crack; instead the run-rate accelerated. Hyperscaler 2026 capex tracking ~$725B (+77%) is the demand floor under it.

NVIDIA Q1 FY27 record revenue $81.6B (+85%), data center $75.2B (+92%), EPS $1.87 beat. Q2 guide $91B ex-China, above $86B consensus — 4th straight beat.

Alibaba T-Head Zhenwu M890 — 144GB domestic AI accelerator, 3x prior gen

Read this because The number that matters is 560,000 units already shipped — this is not a paper launch. China's domestic accelerator stack is at volume, and the M890's agent-workload tuning shows the decoupling is now targeting the same workloads NVIDIA sells into.

Alibaba T-Head unveiled the Zhenwu M890: 144GB memory, 800GB/s interchip, 3x the 810E. 560K Zhenwu units shipped to 400+ customers. V900 in 2027.

TSMC: $31.3B capex approved, $20B injected into Arizona, 53% to advanced nodes

Read this because Track the front-end-vs-back-end capex ratio, not the headline dollar figure. A 53% shift to leading-edge nodes is what decides whether next-gen Blackwell/MI400 ships on schedule — and therefore your 2027 inference cost curve.

5/12 board: $31.3B capex + $20B Arizona injection. Advanced front-end now 53% of capex (37% in 2024-25) — direct read on AI accelerator demand.

Tip