Abstract:In recent years, imitation learning using neural networks has enabled robots to perform flexible tasks. However, since neural networks operate in a feedforward structure, they do not possess a mechanism to compensate for output errors. To address this limitation, we developed a feedback mechanism to correct these errors. By employing a hierarchical structure for neural networks comprising lower and upper layers, the lower layer was controlled to follow the upper layer. Additionally, using a multi-layer perceptron in the lower layer, which lacks an internal state, enhanced the error feedback. In the character-writing task, this model demonstrated improved accuracy in writing previously untrained characters. In the character-writing task, this model demonstrated improved accuracy in writing previously untrained characters. Through autonomous control with error feedback, we confirmed that the lower layer could effectively track the output of the upper layer. This study represents a promising step toward integrating neural networks with control theories.

What problem does this paper attempt to address?

The main problem that this paper attempts to solve is: **How to correct the errors in the neural network output by introducing a feedback mechanism in imitation learning based on bilateral control, thereby improving the accuracy of robots in performing tasks**. Specifically, traditional imitation learning methods based on bilateral control use feedforward neural networks, and this structure does not have a mechanism for compensating output errors. Therefore, once an error occurs during autonomous operation, the system cannot perform effective correction. To solve this problem, the author proposes a model with an error - feedback mechanism, aiming to correct output errors through error feedback between the upper and lower layers in the hierarchical structure. ### Detailed Explanation 1. **Problem Background**: - **Imitation Learning**: In recent years, imitation learning (Imitation Learning) enables robots to perform complex tasks through neural networks. In particular, imitation learning based on bilateral control can replicate human operations by synchronizing the position and force information of two robots. - **Limitations of Feedforward Neural Networks**: However, traditional neural networks adopt a feedforward structure and lack a memory mechanism for processing time - series data, resulting in the inability to effectively compensate for output errors. 2. **Proposed Solution**: - **Hierarchical Model**: The author designs a hierarchical neural network model, including an upper layer (upper layer) and a lower layer (lower layer). The upper layer is responsible for long - term prediction, while the lower layer is responsible for short - term prediction, and the error feedback is enhanced through a multi - layer perceptron (MLP). - **Error - feedback Mechanism**: By calculating the error between the outputs of the upper and lower layers and feeding the error information back to the lower layer, the lower layer can adjust its prediction to minimize the error with the upper - layer prediction. 3. **Experimental Verification**: - **Character - writing Task**: The author verifies the effectiveness of this model through a character - writing task. The experimental results show that after using the error - feedback mechanism, the model can significantly improve the writing accuracy and trajectory - tracking performance for both learned and unlearned characters. ### Formula Representation The error calculation formula is as follows: \[ \text{Error} = f_{\text{upper}}^{k + 1}- \hat{f}_{\text{lower}}^{k + 1} \] where \( f_{\text{upper}}^{k + 1} \) is the state predicted by the upper layer, and \( \hat{f}_{\text{lower}}^{k + 1} \) is the state predicted by the lower layer. The formula for feeding the error back to the lower - layer input is: \[ f_{\text{upper}} = f_{\text{upper}}^{k + 10}+(f_{\text{upper}}^{k + 1}- \hat{f}_{\text{lower}}^{k + 1}) \] ### Summary This research successfully solves the problem that the neural network in traditional imitation learning based on bilateral control cannot effectively compensate for output errors by introducing an error - feedback mechanism, improving the accuracy and robustness of robots in performing tasks.

Error-Feedback Model for Output Correction in Bilateral Control-Based Imitation Learning

Motion Generation Using Bilateral Control-Based Imitation Learning with Autoregressive Learning

Near Optimal Neural Network-based Output Feedback Control of Affine Nonlinear Discrete-Time Systems

Online Reinforcement Learning-based Neural Network Controller Design for Affine Nonlinear Discrete-time Systems.

Model Reference Output Feedback Control Using Episodic Natural Actor-Critic

Imitation Learning Based on Bilateral Control for Human-Robot Cooperation

An Automatic Control Model for Rat-Robot

Self-adaptive Control of Robotic Manipulator Based on Modeling Error Compensated by RBF

Neural Network Control of a Rehabilitation Robot by State and Output Feedback

Robust Integral Of Neural Network And Sign Of Tracking Error Control Of Uncertain Nonlinear Affine Systems Using State And Output Feedback

Adaptive neural network iterative learning control of long-stroke hybrid robots with initial errors and full state constraints

Iterative Learning Control Of Varying Trajectories For Robot Manipulators

Learning Online from Corrective Feedback: A Meta-Algorithm for Robotics

Learning from Adaptive Neural Output Feedback Control of Robot Manipulators.

Output Feedback Neural Network Adaptive Robust Control Of A Class Of Siso Nonlinear Systems

Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback

A differential Hebbian framework for biologically-plausible motor control

Composite Learning Enhanced Neural Control for Robot Manipulator With Output Error Constraints

A Robust Iterative Learning Control with Neural Networks for Robot

Imitation Learning for Object Manipulation Based on Position/Force Information Using Bilateral Control

Feedback Favors the Generalization of Neural ODEs