Learning to unfold cloth: Scaling up world models to deformable object manipulation

2026-02-18Robotics

Robotics
AI summary

The authors focus on teaching robots how to handle cloth, which is tricky because cloth moves and folds in complicated ways. They improved a type of robot learning method called DreamerV2 by adding information about the cloth's surface details and changing how the robot learns from past experiences. Their improvements help the robot better understand cloth physics so it can unfold different cloths in the air. They tested their approach both in computer simulations and on a real robot, showing it can work on various cloth types without extra training.

robotic cloth manipulationreinforcement learningDreamerV2surface normalsworld modelreplay bufferdata augmentationzero-shot deploymentdeformable object physicsgeneralisation
Authors
Jack Rome, Stephen James, Subramanian Ramamoorthy
Abstract
Learning to manipulate cloth is both a paradigmatic problem for robotic research and a problem of immediate relevance to a variety of applications ranging from assistive care to the service industry. The complex physics of the deformable object makes this problem of cloth manipulation nontrivial. In order to create a general manipulation strategy that addresses a variety of shapes, sizes, fold and wrinkle patterns, in addition to the usual problems of appearance variations, it becomes important to carefully consider model structure and their implications for generalisation performance. In this paper, we present an approach to in-air cloth manipulation that uses a variation of a recently proposed reinforcement learning architecture, DreamerV2. Our implementation modifies this architecture to utilise surface normals input, in addition to modiying the replay buffer and data augmentation procedures. Taken together these modifications represent an enhancement to the world model used by the robot, addressing the physical complexity of the object being manipulated by the robot. We present evaluations both in simulation and in a zero-shot deployment of the trained policies in a physical robot setup, performing in-air unfolding of a variety of different cloth types, demonstrating the generalisation benefits of our proposed architecture.