Transferring Optimal Contact Skills to Flexible Manipulators by Reinforcement Learning

Wenjun Xu, Anqi Pan, Hongliang Ren
DOI: https://doi.org/10.1007/s41315-019-00101-7
IF: 1.7
2019-01-01
International Journal of Intelligent Robotics and Applications
Abstract: Flexible/soft manipulators have the potential to maneuver in confined spaces and reach deep-seated targets via curved trajectories, and are therefore increasingly popular in the minimally invasive surgery (MIS) community. We aim to automate palpation, an important procedure for disease diagnosis, for this type of robot; the task requires multiple force and pose requirements to be met simultaneously. Accurate models are difficult to obtain because of the system's inherent nonlinearities and actuation hysteresis. Moreover, the unknown contact transitions and high dimensionality specific to the palpation task pose great challenges to deriving optimal task policies. We employ model-free reinforcement learning to learn palpation skills through a deterministic policy gradient method, with a reward function carefully shaped to accommodate all the task objectives. In addition, we design a safety check routine to avoid undesirable collisions and a dedicated initialization process for generalization to various environmental conditions. We demonstrate successful implementation of the learning framework in simulation and in the real world. The trained policy succeeds in automating the designed tasks.