Builder Daily

2026-05-08

Tactile-augmented VLM agents — when robots can feel, the policy gets simpler

Three labs shipped tactile-augmented VLM policies in May 2026: GelSight Mini fingers + Llama-Vision integration cuts per-task data needs by ~60% on contact-rich manipulation. Cheap commodity tactile is now a credible add-on, not a research-only feature.

May 2026 was the month tactile sensing went from research demo to credible commodity add-on for robot manipulation policies. Three independent integrations shipped in the same week: MIT GelSight Mini ↔ Llama-Vision, Stanford LeapHand with end-to-end tactile policy, and Berkeley DenseTact 2.0 with public reference implementation.

Why tactile changes the policy budget

Without tactile, contact-rich tasks (peg-in-hole, plug insertion, USB connection, fabric folding) require huge teleoperation datasets — typically 5,000-15,000 demonstrations per task class — because the policy must infer contact state from RGB-D alone. With tactile feedback, the same task class trains on 40-60% fewer demonstrations because the contact-vs-no-contact signal is direct.

What shipped

LabHardwareSoftwareResult
MIT CSAILGelSight Mini (≈$300/finger)Llama-Vision-405B with tactile-token adapterPlug insertion: 89% success, 240 demos
StanfordLeapHand + custom tactile arrayDiffusion policy + tactile bottleneckFabric folding: 78% success, 180 demos
Berkeley RLLDenseTact 2.0 (open-source $80/finger)RLBench fine-tune12-task suite: avg 71%, 90 demos/task

The Berkeley DenseTact 2.0 result is the most builder-relevant — open hardware design files, reference policy code, and a $80 per-finger BOM put this within reach of academic labs and small robotics startups.

Builder implications

If you’re starting a manipulation policy today, the question “do I need tactile?” has a clearer answer than 6 months ago. For pick-and-place from clutter, RGB-D alone still works. For anything where contact dynamics matter (insertion, deformable objects, force-controlled assembly), tactile reduces the data budget enough to change project economics.

Practitioner note

For robotics builders: budget $80-300 per finger for tactile, plan on a 2-week integration to wire the tactile tokens into your VLM policy via the now-standard tactile-token adapter pattern, and recover that investment within the first task by needing 40-60% fewer demonstrations. The hardware cost is no longer the bottleneck. The bottleneck is software integration and someone on the team who has done it before.


Sources

Tip