Builder Daily

Physical AI

Humanoids, embodied foundation models, and the sim-to-real gap — what crossed from research lab to factory floor this week.

2026-05-10

Fine-tuning vision-language-action models on a single DGX Spark — what works in May 2026

π0.5, OpenVLA-2, and RT-2-Edge can all be LoRA fine-tuned on a single DGX Spark (128 GB unified memory) with 100 to 300 demonstrations. A 4-hour overnight run yields a deployable robot policy.

2026-05-09

Mobile manipulation goes consumer — Stretch AI 3 ships at $19,500 with foundation-model brain

Hello Robot's Stretch AI 3 ships in May 2026 at $19,500, the first consumer-priced mobile manipulator with a built-in VLA foundation model (a Llama-Vision derivative). It targets eldercare, domestic, and small-format retail use, and comes with an open API, ROS 2 support, and a 4-hour battery.

2026-05-08

Tactile-augmented VLM agents — when robots can feel, the policy gets simpler

Three labs shipped tactile-augmented VLM policies in May 2026: pairing GelSight Mini fingers with Llama-Vision cuts per-task data needs by about 60% on contact-rich manipulation. Cheap commodity tactile sensing is now a credible add-on, not a research-only feature.

2026-05-03

Physical AI roundup — humanoid foundation models in 2026 Q2

Four humanoid foundation models shipped real-world demos in 2026 Q2: NVIDIA GR00T N2, Tesla Optimus Gen 3, Figure 03, and Physical Intelligence π0.5. The sim-to-real gap is closing — but only on dexterous tasks where teleoperation data is plentiful.

2026-05-02

World models for robotics — Cosmos, Genie 3, and V-JEPA-2 explained for builders

NVIDIA Cosmos, Google Genie 3, and Meta V-JEPA-2 each take a different bet on synthetic training data for embodied AI. Here is what each one is actually good for, and the open question of whether world models can replace teleoperation.

2026-04-30

Open-source humanoid hardware in 2026 — Unitree G1, K-Scale, 1X Neo Beta

A $16K commodity humanoid did not exist 18 months ago. Now Unitree G1 has shipped 12,000+ units to research labs, K-Scale Open Humanoid is fully open-hardware, and 1X Neo Beta is taking $20K consumer pre-orders. Here is what each platform is actually for.

2026-04-25

The teleoperation data engine — why ALOHA-2 and GELLO are the new training corpus

Embodied AI is bottlenecked on bimanual manipulation data. A $35K ALOHA-2 rig collects 50 hrs/day. A $300 GELLO rig is roughly 100x cheaper but slower. Here is the operational reality of running a teleoperation farm in 2026.
