SGD for robot motion? The effectiveness of stochastic optimization on a new benchmark for biped locomotion tasks

Martim Brandao,Kenji Hashimoto,Atsuo Takanishi
DOI: https://doi.org/10.48550/arXiv.1710.03029
2017-10-09
Abstract:Trajectory optimization and posture generation are hard problems in robot locomotion, which can be non-convex and have multiple local optima. Progress on these problems is further hindered by a lack of open benchmarks, since comparisons of different solutions are difficult to make. In this paper we introduce a new benchmark for trajectory optimization and posture generation of legged robots, using a pre-defined scenario, robot and constraints, as well as evaluation criteria. We evaluate state-of-the-art trajectory optimization algorithms based on sequential quadratic programming (SQP) on the benchmark, as well as new stochastic and incremental optimization methods borrowed from the large-scale machine learning literature. Interestingly we show that some of these stochastic and incremental methods, which are based on stochastic gradient descent (SGD), achieve higher success rates than SQP on tough initializations. Inspired by this observation we also propose a new incremental variant of SQP which updates only a random subset of the costs and constraints at each iteration. The algorithm is the best performing in both success rate and convergence speed, improving over SQP by up to 30% in both criteria. The benchmark's resources and a solution evaluation script are made openly available.
Robotics,Machine Learning,Optimization and Control
What problem does this paper attempt to address?
The paper attempts to address the following issues: 1. **Lack of Benchmarking**: In the field of robot posture generation and trajectory optimization, there is currently a lack of a public benchmarking environment, making it difficult to compare different algorithms. 2. **Avoiding Local Optima**: In robot posture generation and trajectory optimization, due to the non-convexity of the problem and the presence of multiple local optima, optimization algorithms are prone to getting stuck in local optima. To address these issues, the authors propose the following contributions: - **Developed a New Robot Challenge**: LegOpt Benchmark, a public benchmarking environment for evaluating static posture generation and trajectory optimization algorithms in bipedal robots. - **Evaluated Different Optimization Algorithms**: In the proposed benchmarking environment, existing trajectory optimization algorithms (such as those based on sequential quadratic programming) as well as new stochastic and incremental optimization methods borrowed from large-scale machine learning were evaluated. - **Demonstrated the Effectiveness of Stochastic Optimization Algorithms**: The study shows that some stochastic and incremental methods based on stochastic gradient descent (SGD) have a higher success rate than traditional sequential quadratic programming (SQP) methods when dealing with optimization problems with complex collision constraints and initial conditions. - **Proposed a New Incremental SQP Algorithm**: This algorithm updates only a random subset of the cost and constraints in each iteration, and the results show that it outperforms traditional SQP methods in terms of success rate and convergence speed, with improvements of up to 30%. Through these contributions, the paper aims to advance research in the field of robot posture generation and trajectory optimization and provide a public benchmarking environment for researchers to better compare the performance of different algorithms.