Soft Grip: A Vision-Based Wrist for Delicate Robotics

Author: Denis Avetisyan


Researchers have developed a low-cost soft robotic wrist that uses computer vision to sense contact forces, enabling more nuanced and adaptable manipulation.

ShapeForce presents a cost-effective and easily integrated sensing system that delivers force-like signals, allowing robots to perform tasks requiring frequent contact with performance comparable to that achieved with traditional force-torque sensors.
ShapeForce presents a cost-effective and easily integrated sensing system that delivers force-like signals, allowing robots to perform tasks requiring frequent contact with performance comparable to that achieved with traditional force-torque sensors.

ShapeForce provides force-like sensing using deformation analysis and vision, achieving performance comparable to traditional force-torque sensors at a significantly lower cost.

Reliable contact feedback is crucial for robust robotic manipulation, yet conventional six-axis force-torque sensors remain prohibitively expensive and fragile for many applications. This limitation motivates the work presented in ‘ShapeForce: Low-Cost Soft Robotic Wrist for Contact-Rich Manipulation’, which introduces a novel soft wrist leveraging vision-based deformation sensing to provide comparable performance at significantly reduced cost. By focusing on relative changes in force rather than absolute values, ShapeForce delivers effective contact feedback without complex calibration or specialized electronics. Could this approach unlock wider adoption of contact-rich manipulation in resource-constrained settings and accelerate the development of more adaptable robotic systems?


The Inevitable Limitations of Positional Control

Conventional robotic systems often falter when confronted with tasks demanding delicate physical interaction, such as assembling intricate components or precisely inserting objects into tight spaces. This difficulty stems from a reliance on positional control, where robots move based on pre-programmed coordinates, rather than responding to tactile sensations. While effective for repetitive, open-loop operations, this approach lacks the adaptability needed when encountering variations in part alignment, unexpected resistance, or the subtle forces inherent in maintaining stable contact. Consequently, even seemingly simple actions – fastening a screw, plugging in a USB cable, or even wiping a surface – present significant challenges, limiting the broader application of robotics in dynamic, real-world scenarios requiring nuanced physical dexterity.

Current force-torque sensors, while capable of measuring forces and torques at a robot’s end-effector, frequently present limitations for intricate manipulation tasks. Traditional designs often involve bulky hardware and significant expense, hindering their integration into robots requiring dexterity and sensitivity. More critically, these sensors frequently lack the resolution and bandwidth necessary to capture the subtle, high-frequency interactions crucial for successful contact-rich maneuvers – like feeling for a keyhole or detecting the initial resistance during a press-fit assembly. This deficiency means robots struggle to ‘feel’ their way through tasks, relying instead on pre-programmed trajectories and lacking the adaptability needed when encountering unexpected variations in the environment or object properties. Consequently, achieving truly robust and versatile manipulation-essential for deployment in real-world, unstructured settings-remains a significant challenge.

The successful execution of seemingly simple manipulations – inserting a USB drive, joining a peg and board, or even wiping a whiteboard – demands a level of tactile sensitivity that currently challenges robotic systems. These actions aren’t merely about applying force; they require discerning subtle changes in contact, identifying surface textures, and responding to unexpected resistance. A robot attempting these tasks needs to ‘feel’ when the USB connector is properly aligned, when the peg is nearing its hole, or when sufficient pressure is applied to clean a whiteboard surface. This necessitates robust contact feedback mechanisms capable of detecting not just the magnitude of force, but also its direction, location, and the nuanced characteristics of the interacting surfaces, effectively bridging the gap between programmed motion and skillful, adaptive manipulation.

The inability of robots to reliably navigate and interact with the complexities of real-world settings presents a significant barrier to widespread adoption. Current robotic systems, hampered by limited tactile sensing and manipulation capabilities, struggle with tasks demanding adaptability and finesse. This deficiency restricts their deployment beyond highly structured environments, like factory assembly lines, and impedes progress in applications requiring nuanced interaction – from assisting in healthcare and eldercare to performing in-home tasks or responding to disaster scenarios. Consequently, the full potential of robotics remains unrealized until these limitations are overcome, enabling robots to operate effectively and safely amidst the inherent uncertainties of unstructured, dynamic environments.

A Ufactory xArm7 robotic arm equipped with ShapeForce and an Intel RealSense D435 provides both manipulation and visual perception for contact information acquisition.
A Ufactory xArm7 robotic arm equipped with ShapeForce and an Intel RealSense D435 provides both manipulation and visual perception for contact information acquisition.

ShapeForce: A Pragmatic Approach to Tactile Sensing

ShapeForce is a soft robotic wrist system engineered for applications requiring frequent physical contact, such as assembly, manipulation, and human-robot interaction. The device is designed for ease of integration, featuring a plug-and-play interface that minimizes setup time and programming requirements. Cost reduction was a primary design goal, achieved through the utilization of readily available materials and a simplified manufacturing process. This allows for deployment in scenarios where expensive or delicate force/torque sensors are impractical, and provides a robust sensing solution for tasks involving variable contact forces and unpredictable environments.

ShapeForce’s central sensing mechanism relies on a deformable core constructed from a compliant material. When external forces are applied to the wrist, this core undergoes measurable deformation. This deformation is not directly measured; instead, changes in the core’s shape are correlated with the magnitude and direction of the applied force. The system is designed such that the relationship between deformation and force is consistent and repeatable, allowing for reliable force estimation. This approach avoids the need for force-sensitive materials or complex internal instrumentation, simplifying the design and reducing manufacturing costs.

ShapeForce distinguishes itself from conventional force and tactile sensors by employing passive compliance, eliminating the need for active control or substantial energy input. Traditional sensors often require complex calibration procedures to account for drift, temperature sensitivity, and hysteresis; ShapeForce’s inherent mechanical properties minimize these issues. The absence of integrated electronics and active components also simplifies the system, reducing both manufacturing costs and potential failure points, and obviates the need for an external power supply during operation. This design approach results in a sensor that is inherently robust, readily deployable, and simplifies integration into robotic systems.

Marker-based pose tracking within the ShapeForce system utilizes a series of retroreflective markers affixed to the robotic wrist. These markers are tracked using an external optical motion capture system, providing six degrees of freedom (6DoF) pose data – position and orientation in three-dimensional space. This data is then processed to accurately estimate the wrist’s configuration, including joint angles and end-effector pose. The resulting pose information is integrated with force-like signals derived from the compliant core deformation, enabling a comprehensive understanding of the interaction between the wrist and its environment. This approach achieves sub-millimeter positional accuracy and enhances the system’s ability to discern nuanced contact dynamics.

ShapeForce demonstrates adaptability by successfully transferring control from a wrist-mounted setup to the fingertips of a dexterous hand, highlighting its potential for use with diverse robotic end-effectors.
ShapeForce demonstrates adaptability by successfully transferring control from a wrist-mounted setup to the fingertips of a dexterous hand, highlighting its potential for use with diverse robotic end-effectors.

Learning to Feel: Validating Performance with Rich Contact Feedback

ShapeForce provides critical contact feedback for learning-based manipulation policies by delivering dense, geometrically-interpretable contact information. This feedback mechanism allows policies, utilizing architectures such as Action Chunking Transformers and Diffusion Policies, to effectively predict and execute complex manipulation tasks. Unlike traditional methods relying solely on joint position or velocity, ShapeForce provides signals directly correlated to contact wrench and deformation, demonstrated by a high $R^2$ value of 0.9577. This results in improved performance across a range of tasks, including Peg Insertion, USB Insertion, Whiteboard Wiping, and Toy Desk Assembly, with success rates exceeding those achieved using conventional force-torque sensors in certain scenarios – notably, an 80% success rate for Toy Desk Assembly compared to 65% with force-torque sensors.

Both search-and-control and learning-based robotic policies benefit from the integration of force-like signals derived from ShapeForce. Search-and-control policies utilize these signals within their planning and execution loops to react to contact and adjust trajectories in real-time. Learning-based policies, conversely, incorporate these signals as input features during training, allowing the models to learn mappings between observed contact and appropriate actions. This shared reliance on contact feedback demonstrably improves performance across a range of manipulation tasks, enabling more robust and adaptable robotic behavior compared to systems operating without such sensory input.

Learning-based manipulation policies within the ShapeForce framework employ advanced architectures for action prediction, specifically Action Chunking Transformers and Diffusion Policies. Action Chunking Transformers decompose complex tasks into a sequence of manageable action chunks, facilitating more efficient learning and generalization. Diffusion Policies, conversely, model the policy as a diffusion process, allowing the generation of diverse and robust action sequences. These architectures enable the policies to learn directly from rich contact feedback provided by ShapeForce, predicting subsequent actions based on observed wrench deformations and achieving performance comparable to, or exceeding, traditional force-torque sensing methods.

ShapeForce demonstrates high performance across a range of manipulation tasks, achieving success rates exceeding 90% for Peg Insertion, USB Insertion, and Whiteboard Wiping. These results are statistically comparable to those obtained using traditional force-torque sensors, indicating that ShapeForce provides a viable alternative for obtaining contact feedback. The consistency of high success rates across these diverse tasks highlights the robustness of the ShapeForce methodology and its potential for broad application in robotic manipulation.

During Toy Desk Assembly tasks, policies utilizing ShapeForce feedback achieved an 80% success rate, demonstrably exceeding the 65% success rate obtained with traditional force-torque sensors. This performance difference indicates ShapeForce provides more effective contact information for this specific manipulation challenge, potentially due to its ability to capture nuanced contact details not readily available through standard force-torque measurements. The 15% improvement suggests a significant advantage in robustness and precision when assembling the toy desk using ShapeForce-guided learning policies.

The fidelity of the ShapeForce contact feedback system is quantitatively demonstrated by a high $R^2$ value of 0.9577. This metric indicates a strong linear relationship between the applied wrench – a measure of force and torque – and the resulting deformation of the manipulated object. An $R^2$ value approaching 1.0 signifies that approximately 95.77% of the variance in object deformation can be explained by the applied wrench, validating the system’s ability to accurately model contact mechanics and provide reliable feedback for policy learning.

ShapeForce integrates marker-based pose tracking with both search-and-control and imitation-learning policies to translate observed deformation into force-like signals, enabling the execution of complex contact-rich tasks through a mechanically designed system validated by finite element analysis.
ShapeForce integrates marker-based pose tracking with both search-and-control and imitation-learning policies to translate observed deformation into force-like signals, enabling the execution of complex contact-rich tasks through a mechanically designed system validated by finite element analysis.

Beyond the Hype: Practical Implications and Future Directions

ShapeForce presents a significant departure from conventional force sensing methodologies, offering a streamlined and economically advantageous approach to robotic interaction. Traditional force sensors, often reliant on strain gauges or complex mechanical linkages, can be both expensive and challenging to integrate into robotic systems. In contrast, ShapeForce leverages the inherent compliance of specifically designed structures – typically utilizing flexible materials – to infer applied forces through measurable deformation. This simplification not only reduces the bill of materials and assembly time but also enhances robustness by eliminating delicate components prone to failure. The resulting system is lighter, more adaptable, and requires less calibration, paving the way for wider deployment of force-sensitive robots in diverse applications, from delicate assembly tasks to safe human-robot collaboration.

The integration of ShapeForce sensing with learning-based robotic policies unlocks capabilities previously challenging for automated systems. By providing nuanced tactile feedback, ShapeForce empowers robots to adapt to the inherent uncertainties of real-world tasks – a critical advancement beyond traditional, rigidly-programmed approaches. This synergy was demonstrated through the successful execution of a toy desk assembly task, where the robot skillfully manipulated parts despite variations in alignment and contact forces. The system learns to correlate tactile readings with successful task completion, improving performance over time and opening avenues for tackling more complex and delicate manipulations. This approach not only enhances reliability but also reduces the need for precise calibration and detailed environmental modeling, promising more robust and versatile robotic solutions.

A critical component of ShapeForce’s efficacy lies in the detailed characterization of its compliant core’s linear stiffness. Through rigorous testing and analysis, researchers determined the precise relationship between applied force and resulting deformation within the core material. This understanding enables the development of accurate dynamic models, allowing for precise prediction of sensor behavior and improved control algorithms. By quantifying this key mechanical property – typically expressed as $k$ in the equation $F = kx$, where $F$ is force and $x$ is displacement – the system can compensate for variations in manufacturing or environmental factors, ensuring consistent and reliable force sensing across a wide range of robotic manipulations and ultimately expanding the robot’s ability to perform delicate and complex tasks.

Continued development anticipates a significant leap in robotic capabilities through the fusion of ShapeForce with advanced visual perception systems. By enabling robots to ‘see’ and understand their environment, alongside tactile sensing from the compliant core, future iterations will move beyond pre-programmed routines. Researchers intend to implement sophisticated control algorithms, potentially utilizing reinforcement learning and model predictive control, to allow for real-time adaptation to unforeseen circumstances and increased robustness in dynamic environments. This synergistic approach promises to unlock more complex and nuanced task performance, ultimately enabling robots to handle unstructured scenarios and interact with the world with greater dexterity and intelligence – moving beyond toy assembly to tackle real-world challenges requiring both vision and touch.

Proportional scaling demonstrates that our force-like signal accurately represents essential contact information during toy desk assembly, closely aligning with ground-truth force measurements.
Proportional scaling demonstrates that our force-like signal accurately represents essential contact information during toy desk assembly, closely aligning with ground-truth force measurements.

The pursuit of elegant solutions in robotics invariably encounters the brutal reality of deployment. ShapeForce, with its vision-based deformation sensing, represents another attempt to sidestep the expense of traditional force-torque sensors. It’s a clever approach, certainly, but one should remember that any system relying on visual interpretation will eventually face conditions that break the underlying assumptions. As David Hilbert famously stated, “We must be able to answer definite questions.” The paper demonstrates impressive performance in controlled settings, yet the question remains: how robust is this ‘low-cost’ sensing when subjected to the unpredictable forces of a real-world, contact-rich environment? Documentation, of course, will paint a rosier picture, but production invariably finds the cracks.

So, What Breaks First?

ShapeForce, predictably, attempts to solve a problem production will inevitably redefine. Low-cost sensing is perpetually desirable, of course, until someone needs it to reliably palletize fragile produce in a humid environment. The reliance on vision, while elegantly sidestepping expensive hardware, introduces a new class of failure modes. Expect the next iteration to involve a frantic search for IR illumination robust enough to penetrate dust, grease, and the existential dread of a 24/7 operation. The comparison to traditional force-torque sensors is… optimistic. Current benchmarks likely involve carefully calibrated lab setups.

The true test won’t be grasping known objects, but the unknown – the oddly shaped, unexpectedly heavy, or simply sticky things the real world throws at robots. Further work will undoubtedly focus on improving robustness to occlusion and lighting changes, and the inevitable integration with reinforcement learning to mask the underlying sensor noise. The real question isn’t whether ShapeForce works in a simulation, but how quickly it degrades under sustained, unglamorous use.

Ultimately, this work is a temporary reprieve, a slightly slower accrual of tech debt. Everything new is old again, just renamed and still broken. The search for the perfect sensor continues, and production, as always, will be the final, unforgiving judge.


Original article: https://arxiv.org/pdf/2511.19955.pdf

Contact the author: https://www.linkedin.com/in/avetisyan/

See also:

2025-11-27 05:21