Talks and presentations

Plan-Guided Reinforcement Learning for Whole-Body Manipulation

October 01, 2023

Invited Talk, Workshop on Leveraging Models for Contact-Rich Manipulation at IROS 2023, Detriot, Michigan, USA

Synthesizing complex whole-body manipulation behaviors has fundamental challenges due to the rapidly growing combinatorics inherent to contact interaction planning. While model-based methods have shown promising results in solving long-horizon manipulation tasks, they often work under strict assumptions, such as known model parameters, oracular observation of the environment state, and simplified dynamics, resulting in plans that cannot easily transfer to hardware. Learning-based approaches, such as imitation learning (IL) and reinforcement learning (RL), have been shown to be robust when operating over in-distribution states; however, they need heavy human supervision. Specifically, model-free RL requires a tedious reward-shaping process. IL methods, on the other hand, rely on human demonstrations that involve advanced teleoperation methods. In this work, we propose a plan-guided reinforcement learning (PGRL) framework to combine the advantages of model-based planning and reinforcement learning. Our method requires minimal human supervision because it relies on plans generated by model-based planners to guide the exploration in RL. In exchange, RL derives a more robust policy thanks to domain randomization. We test this approach on a whole-body manipulation task on Punyo, an upper-body humanoid robot with compliant, air-filled arm coverings, to pivot and lift a large box. Our preliminary results indicate that the proposed methodology is promising to address challenges that remain difficult for either model- or learning-based strategies alone.

STOCS: Simultaneous Trajectory Optimization and Contact Selection for Contact-Rich Manipulation

June 01, 2023

Invited Talk, Conference on Optimization 2023, Seattle, Washington

Contact-implicit trajectory optimization is an effective method to plan complex trajectories for various contact-rich systems including manipulation and locomotion. These methods formulate contact as complementarity constraints and require solving a mathematical program with complementarity constraints (MPCC). However, MPCC solve times increase steeply with the number of variables and complementarity constraints, which limits their applicability to problems with low geometric complexity. This paper introduces the simultaneous trajectory optimization and contact selection (STOCS) method that embeds the detection of salient contact points and contact times inside trajectory optimization. Because the number of active contact points is usually small, this approach minimize the number of MPCC variables and constraints, which makes solving manipulation trajectories for objects with complex, non-convex geometry computationally tractable. The proposed approach is validated on pivoting and sliding problems in simulation and on a 6 DoF manipulator arm.

Semi-Infinite Programming’s Application in Robotics

October 29, 2021

Invited Talk, Robotics Seminar @ Illinois, Urbana, Illinois

In optimization theory, semi-infinite programming (SIP) is an optimization problem with a finite number of variables and an infinite number of constraints, or an infinite number of variables and a finite number of constraints. In this talk, I will introduce our work which uses SIP to solve the problems in the field of robotics.
In the semi-infinite program with complementarity constraints (SIPCC) work, we use SIP to address the problem that contact is an infinite phenomenon involving continuous regions of interaction. Our method enables a gripper to find a feasible pose to hold (non-)convex objects while ensuring force and torque balance. In the non-penetration iterative closest points for single-view multi-object 6D pose estimation work, we use SIP to solve the penetration between (non-)convex objects. Through introducing non-penetration constraints to the framework of iterative closest points (ICP), we improve the pose estimation result’s accuracy of deep neural network based methods. Also, our method outperforms the best result on the IC-BIN dataset in the Benchmark for 6D Object Pose Estimation.

Mengchao Zhang

Talks and presentations

Plan-Guided Reinforcement Learning for Whole-Body Manipulation

STOCS: Simultaneous Trajectory Optimization and Contact Selection for Contact-Rich Manipulation

Semi-Infinite Programming’s Application in Robotics