Skip to content
AI-Daily-Builder

2026-05-18 views $ARM · ARM Holdings · Cortex-X / Neoverse / Mali / CSS

ARM Holdings — the IP layer under every smartphone and increasingly every AI server

ARM licenses CPU IP under 99%+ smartphone SoCs and is the structural read on AI-server CPU shift (NVIDIA Grace, AWS Graviton, Apple). Arm v9 royalty runs 30-50% above v8 per chip — the v9 mandate is the pricing event.

ARM Holdings (NASDAQ: ARM) is the IP layer underneath 99%+ of smartphone SoCs and an increasing share of AI-server CPUs. The business model is pure license + royalty — ARM doesn’t fab chips, it licenses the instruction set + reference cores, then collects per-unit royalty when partner silicon ships.

What ARM actually sells

Product lineWhat it isWhere it ships
Cortex-A / Cortex-XHigh-performance application CPU coresApple A-series (architectural license), Qualcomm Snapdragon, MediaTek Dimensity — every flagship Android
Cortex-N / efficiency coresLow-power CPUsIoT, wearables, low-tier mobile
Mali / Immortalis GPUMobile GPU IPMediatek, Samsung Exynos, low-end Android
Neoverse V/N/EServer CPU IP for cloud + AINVIDIA Grace, AWS Graviton, Ampere, Microsoft Cobalt
Compute Subsystems (CSS)Pre-validated chiplet/IP packagesNew 2024+ business line — sells whole subsystems, not just cores

The structural pricing event: v9 mandate

The non-obvious story is Arm v9 royalty pricing. Where Arm v8 royalty rates ran ~1-2% of chip ASP, Arm v9 commands ~30-50% higher per-unit royalties because the new ISA mandate (SVE2 vector, MTE memory tagging, confidential compute) is non-optional for new flagship designs.

In practical terms: every iPhone, Galaxy, and Pixel SoC built in 2026+ pays ARM more per unit than the same vendor paid in 2024. The royalty escalator is automatic; ARM doesn’t have to add new customers, just wait for the existing flagship pipeline to refresh.

The AI-server angle: Neoverse + CSS

CustomerProductStatus
NVIDIAGrace CPU (paired with Hopper/Blackwell)Shipping volume to hyperscalers
AWSGraviton 4 → Graviton 5~50% of new EC2 capacity is Graviton
MicrosoftCobalt 100 (Azure)Production deployment 2025
GoogleAxion (GCP)Production 2025
AppleM4/M5 (architectural license)Mac + on-device AI workloads
AmpereAmpereOne / OneFamilySmaller share, present in Oracle Cloud

The Compute Subsystems (CSS) business sells pre-integrated chiplet packages — not just core licenses — to customers that don’t want to build a server CPU from scratch. CSS shortens time-to-silicon from ~3 years to ~12-18 months, which is what’s letting AWS, Microsoft, and Google catch up on custom silicon vs. NVIDIA Grace’s head start.

Why this matters for AI infrastructure

Two threads:

  1. Server CPU shift is real. Every cloud hyperscaler is building or buying Arm-based server CPUs for AI-adjacent workloads (control plane, data preprocessing, inference orchestration). x86 is still the volume base, but Arm share of new datacenter purchases is rising ~5-10 points/year. Each new design pays ARM royalty.
  2. The royalty tail is wide. ARM’s revenue grew 19% YoY through fiscal 2025 with ~95% gross margin — there’s no fab cost, no inventory. As long as the v9 mandate holds and Neoverse adoption keeps expanding, the model is structurally levered to total AI silicon volume without taking on fab risk.

What’s risky

Practitioner note

For AI-product builders:

The under-considered angle: ARM is the only infrastructure IP company where every iPhone, Android, server, and AI cluster pays royalty. That cross-modal exposure — mobile + server + auto + IoT — is what makes the business asymmetric vs. pure-server names like AMD or Intel. The bear case requires multiple unrelated demand pillars to break at once.


Sources

Tip