Effective Offline Robot Learning with Structured Task Graph

, , ,

Offline reinforcement learning (RL) has shown great potential in many robotic tasks, where doing trial-and-error with the environment is risky, costly, or time-consuming. However, it is still hard to succeed in long-horizon tasks especially when given suboptimal and multimodal offline datasets. Nevertheless, existing RL methods rarely consider the structured information in offline datasets, which are commonly found in many robotic tasks. To address these challenges, we propose a novel offline RL approach that combines the techniques of dataset augmentation and subtask relabeling. Specifically, we first extract the subtasks and build the task graph based on the structured information in offline datasets. We then use the task graph to sample and generate an augmented dataset, which is more suitable for offline RL learning. After that, we relabel the dataset according to the task graph and finally learn a subtask-conditioned policy to complete the task. By doing so, we decompose the task of reaching a long-horizon goal state into a sequence of easier subtasks. This is not only useful for handling the long-horizon problem, but also reduces the error introduced by the offline dataset. We conducted extensive experiments in both the D4RL benchmark dataset and real-world robot with complex manipulation tasks. The experimental results show that our method significantly advances the state-of-the-art baselines in most tasks, particularly in longhorizon manipulation tasks with limited human demonstrations.

» Read on
Yiwen Hou, Jinming Ma, Haoyuan Sun, Feng Wu. Effective Offline Robot Learning with Structured Task Graph. IEEE Robotics and Automation Letters (RAL), :1-8, 2024.
Save as file
@article{HMSWral24,
 author = {Yiwen Hou and Jinming Ma and Haoyuan Sun and Feng Wu},
 doi = {10.1109/LRA.2024.3354620},
 journal = {IEEE Robotics and Automation Letters (RAL)},
 pages = {1-8},
 title = {Effective Offline Robot Learning with Structured Task Graph},
 year = {2024}
}