VibeAct: Vibration to Actions for Contact-Rich Reactive Robot Dexterity

2026-06-25 • Robotics

Robotics

AI summaryⓘ

The authors developed VibeAct, a method that helps robots better sense and handle objects using tiny microphones that pick up vibrations from touch. They collect real vibration data from a robot hand, label it using a digital model, and train the robot to understand touch and slipping from these vibrations. This approach allows the robot to learn faster and perform better in tasks like regrasping and inserting objects by relying on tactile feedback instead of just vision or joint sensing. Their method improves success when tested on a real robot hand.

dexterous manipulationpiezoelectric microphonesvibro-acoustic signalsreinforcement learningsimulation-to-realitycontact sensingslip detectionrobot handtactile feedbackin-hand manipulation

Authors

Yuemin Mao, Uksang Yoo, Jean Oh, Jonathan Francis, Jeffrey Ichnowski

Abstract

Dexterous manipulation depends on contact events that are fast, local, and often visually occluded. Piezoelectric microphones offer a compact and high-bandwidth way to sense these interactions, but the resulting vibro-acoustic signals are difficult to simulate faithfully enough for end-to-end sim-to-real policy learning on dexterous robot hands. We propose VibeAct, a framework that bridges real vibrotactile sensing and simulation-based reinforcement learning through a shared physical representation of contact and slip. In the real world, we embed piezoelectric microphones into a dexterous robot hand and collect vibro-acoustic data through teleoperation, then replay the recordings in a calibrated digital clone to automatically label per-finger contact and slip. A tactile estimator learns to predict contact and slip from real microphone waveforms, while manipulation policies are trained in simulation on the same representation computed directly from simulated contacts. This decoupling lets policies exploit rapid tactile feedback without simulating raw audio. Across five contact-rich tasks spanning regrasping, in-hand reorientation, and insertion, VibeAct consistently outperforms a proprioception-and-point-cloud baseline in simulation, with the largest gains on tasks requiring sustained reactive control, where the continuous slip-magnitude channel proves the most informative observation. The learned policies transfer to a physical dexterous hand-arm platform, improving success rates on deployed tasks. Project videos and additional details are at https://vibeact.github.io/.

View PDFOpen arXiv