scispace - formally typeset
Search or ask a question
Author

Fumio Miyazaki

Bio: Fumio Miyazaki is an academic researcher from Osaka University. The author has contributed to research in topics: Robot & Mobile robot. The author has an hindex of 31, co-authored 268 publications receiving 8933 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: A betterment process for the operation of a mechanical robot in a sense that it betters the nextoperation of a robot by using the previous operation's data is proposed.
Abstract: This article proposes a betterment process for the operation of a mechanical robot in a sense that it betters the next operation of a robot by using the previous operation's data. The process has an iterative learning structure such that the (k + 1)th input to joint actuators consists of the kth input plus an error increment composed of the derivative difference between the kth motion trajectory and the given desired motion trajectory. The convergence of the process to the desired motion trajectory is assured under some reasonable conditions. Numerical results by computer simulation are presented to show the effectiveness of the proposed learning scheme.

3,222 citations

Proceedings ArticleDOI
13 May 1990
TL;DR: The control rule and limiting method proposed are robot independent and hence can be applied to various kinds of mobile robots with a dead reckoning ability and was implemented on the autonomous mobile robot Yamabico-11.
Abstract: A stable tracking control rule is proposed for nonholonomic vehicles. The stability of the rule is proved through the use of a Liapunov function. Inputs to the vehicle are a reference posture (x/sub r/, y/sub r/, theta /sub r/)/sup t/ and reference velocities ( nu /sub r/, omega /sub r/)/sup t/. The major objective of this study is to propose a control rule to find reasonable target linear and rotational velocities ( nu , omega )/sup t/. Linearizing the system's differential equation is useful for deciding parameters for critical dumping for a small disturbance. In order to avoid slippage, a velocity/acceleration limitation scheme is introduced. Several simulation results are presented with or without the velocity/acceleration limiter. The control rule and limiting method proposed are robot independent and hence can be applied to various kinds of mobile robots with a dead reckoning ability. This method was implemented on the autonomous mobile robot Yamabico-11. Experimental results obtained are close to the results with the velocity/acceleration limiter. >

1,363 citations

Proceedings ArticleDOI
01 Dec 1984
TL;DR: In this paper, a new concept called "betterment process" is proposed for the purpose of giving a learning ability of autonomous construction of a better control input to a class of multi-input multi-output servomechanism or mechatronics systems such as mechanical robots.
Abstract: A new concept called "betterment process" is proposed for the purpose of giving a learning ability of autonomous construction of a better control input to a class of multi-input multi-output servomechanism or mechatronics systems such as mechanical robots. It is assumed that those dynamic systems can be operated repeatedly at low cost and in a relatively short time under invariant initial physical conditions, but the knowledge on precise description of their dynamics is not required. The betterment process is composed of a simple iteration rule that generates autonomously a present actuator input better than the previous one, provided a desired output response is given. The convergence of iteration is proved for a simple betterment process where the k+1th input is composed of the kth input plus an increment of the derivative error between the kth output response and given desired response. Discussions on potential applications of the proposed theory to controlling robots or other mechanical systems are presented together with future subjects to be investigated.

382 citations

Journal ArticleDOI
03 Jan 1988
TL;DR: To make a robot track a given desired motion trajectory, a learning control scheme is proposed which is based on the repeatability of robot motion and it is demonstrated that the input torque pattern that generates the desired motion can be formed without estimating the physical parameters of robot dynamics.
Abstract: To make a robot track a given desired motion trajectory, a learning control scheme is proposed which is based on the repeatability of robot motion. In this scheme the robot obtains a desired motion by repeating trials (test motion). A merit of this control scheme is that the input torque pattern that generates the desired motion can be formed without estimating the physical parameters of robot dynamics. In practice, to allow the robot motion to approach the desired one in each trial, the input torque given to the robot at the present trial is modified only by the velocity signal of the real robot motion at the previous trial and the desired one. The convergence to the desired motion is theoretically proved for a linear time-varying mechanical system, which is an approximate representation of nonlinear robot dynamics in the vicinity of the desired motion. The effectiveness of this control scheme is demonstrated through actual experiments in which a revolute-type manipulator with three degrees of freedom is used, and the desired motion trajectory is given not only in terms of joint-angle coordinates but also in terms of task-oriented coordinates. >

276 citations

Journal ArticleDOI
TL;DR: In this article, the effects of state disturbances, output noise, and errors in initial conditions on a class of learning control algorithms are investigated, and bounds on the asymptotic trajectory errors for the learned input and the corresponding state and output trajectories are obtained.
Abstract: The authors investigate the effects of state disturbances, output noise, and errors in initial conditions on a class of learning control algorithms. They present a simple learning algorithm and exhibit, via a concise proof, bounds on the asymptotic trajectory errors for the learned input and the corresponding state and output trajectories. Furthermore, these bounds are continuous functions of the bounds on the initial condition errors, state disturbances, and output noise, and the bounds are zero in the absence of these disturbances. >

275 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: A betterment process for the operation of a mechanical robot in a sense that it betters the nextoperation of a robot by using the previous operation's data is proposed.
Abstract: This article proposes a betterment process for the operation of a mechanical robot in a sense that it betters the next operation of a robot by using the previous operation's data. The process has an iterative learning structure such that the (k + 1)th input to joint actuators consists of the kth input plus an error increment composed of the derivative difference between the kth motion trajectory and the given desired motion trajectory. The convergence of the process to the desired motion trajectory is assured under some reasonable conditions. Numerical results by computer simulation are presented to show the effectiveness of the proposed learning scheme.

3,222 citations

Journal ArticleDOI
TL;DR: Though beginning its third decade of active research, the field of ILC shows no sign of slowing down and includes many results and learning algorithms beyond the scope of this survey.
Abstract: This article surveyed the major results in iterative learning control (ILC) analysis and design over the past two decades. Problems in stability, performance, learning transient behavior, and robustness were discussed along with four design techniques that have emerged as among the most popular. The content of this survey was selected to provide the reader with a broad perspective of the important ideas, potential, and limitations of ILC. Indeed, the maturing field of ILC includes many results and learning algorithms beyond the scope of this survey. Though beginning its third decade of active research, the field of ILC shows no sign of slowing down.

2,645 citations

Journal ArticleDOI
TL;DR: This article attempts to strengthen the links between the two research communities by providing a survey of work in reinforcement learning for behavior generation in robots by highlighting both key challenges in robot reinforcement learning as well as notable successes.
Abstract: Reinforcement learning offers to robotics a framework and set of tools for the design of sophisticated and hard-to-engineer behaviors. Conversely, the challenges of robotic problems provide both inspiration, impact, and validation for developments in reinforcement learning. The relationship between disciplines has sufficient promise to be likened to that between physics and mathematics. In this article, we attempt to strengthen the links between the two research communities by providing a survey of work in reinforcement learning for behavior generation in robots. We highlight both key challenges in robot reinforcement learning as well as notable successes. We discuss how contributions tamed the complexity of the domain and study the role of algorithms, representations, and prior knowledge in achieving these successes. As a result, a particular focus of our paper lies on the choice between model-based and model-free as well as between value-function-based and policy-search methods. By analyzing a simple problem in some detail we demonstrate how reinforcement learning approaches may be profitably applied, and we note throughout open questions and the tremendous potential for future research.

2,391 citations

Journal ArticleDOI
TL;DR: A direct adaptive tracking control architecture is proposed and evaluated for a class of continuous-time nonlinear dynamic systems for which an explicit linear parameterization of the uncertainty in the dynamics is either unknown or impossible.
Abstract: A direct adaptive tracking control architecture is proposed and evaluated for a class of continuous-time nonlinear dynamic systems for which an explicit linear parameterization of the uncertainty in the dynamics is either unknown or impossible. The architecture uses a network of Gaussian radial basis functions to adaptively compensate for the plant nonlinearities. Under mild assumptions about the degree of smoothness exhibit by the nonlinear functions, the algorithm is proven to be globally stable, with tracking errors converging to a neighborhood of zero. A constructive procedure is detailed, which directly translates the assumed smoothness properties of the nonlinearities involved into a specification of the network required to represent the plant to a chosen degree of accuracy. A stable weight adjustment mechanism is determined using Lyapunov theory. The network construction and performance of the resulting controller are illustrated through simulations with example systems. >

2,254 citations

Journal ArticleDOI
TL;DR: In this paper, an adaptive robot control algorithm is derived, which consists of a PD feedback part and a full dynamics feed for the compensation part, with the unknown manipulator and payload parameters being estimated online.
Abstract: A new adaptive robot control algorithm is derived, which consists of a PD feedback part and a full dynamics feedfor ward compensation part, with the unknown manipulator and payload parameters being estimated online. The algorithm is computationally simple, because of an effective exploitation of the structure of manipulator dynamics. In particular, it requires neither feedback of joint accelerations nor inversion of the estimated inertia matrix. The algorithm can also be applied directly in Cartesian space.

2,117 citations