scispace - formally typeset
Open AccessJournal ArticleDOI

Teleoperation Control Based on Combination of Wave Variable and Neural Networks

TLDR
A novel control scheme is developed for a teleoperation system, combining the radial basis function (RBF) neural networks (NNs) and wave variable technique to simultaneously compensate for the effects caused by communication delays and dynamics uncertainties.
Abstract
In this paper, a novel control scheme is developed for a teleoperation system, combining the radial basis function (RBF) neural networks (NNs) and wave variable technique to simultaneously compensate for the effects caused by communication delays and dynamics uncertainties. The teleoperation system is set up with a TouchX joystick as the master device and a simulated Baxter robot arm as the slave robot. The haptic feedback is provided to the human operator to sense the interaction force between the slave robot and the environment when manipulating the stylus of the joystick. To utilize the workspace of the telerobot as much as possible, a matching process is carried out between the master and the slave based on their kinematics models. The closed loop inverse kinematics (CLIK) method and RBF NN approximation technique are seamlessly integrated in the control design. To overcome the potential instability problem in the presence of delayed communication channels, wave variables and their corrections are effectively embedded into the control system, and Lyapunov-based analysis is performed to theoretically establish the closed-loop stability. Comparative experiments have been conducted for a trajectory tracking task, under the different conditions of various communication delays. Experimental results show that in terms of tracking performance and force reflection, the proposed control approach shows superior performance over the conventional methods.

read more

Content maybe subject to copyright    Report

Cronfa - Swansea University Open Access Repository
_____________________________________________________________
This is an author produced version of a paper published in:
IEEE Transactions on Systems, Man, and Cybernetics: Systems
Cronfa URL for this paper:
http://cronfa.swan.ac.uk/Record/cronfa39979
_____________________________________________________________
Paper:
Yang, C., Wang, X., Li, Z., Li, Y. & Su, C. (2017). Teleoperation Control Based on Combination of Wave Variable and
Neural Networks. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 47(8), 2125-2136.
http://dx.doi.org/10.1109/TSMC.2016.2615061
_____________________________________________________________
This item is brought to you by Swansea University. Any person downloading material is agreeing to abide by the terms
of the repository licence. Copies of full text items may be used or reproduced in any format or medium, without prior
permission for personal research or study, educational or non-commercial purposes only. The copyright for any work
remains with the original author unless otherwise specified. The full-text must not be sold in any format or medium
without the formal permission of the copyright holder.
Permission for multiple reproductions should be obtained from the original author.
Authors are personally responsible for adhering to copyright and publisher restrictions when uploading content to the
repository.
http://www.swansea.ac.uk/library/researchsupport/ris-support/

Teleoperation Control based on Combination of
Wave Variable and Neural Networks
Chenguang Yang, Xingjian Wang, Zhijun Li, Yanan Li, Chun-Yi Su
Abstract—In this paper, a novel control scheme is developed for
a teleoperation system, combining the radial basis function (RBF)
neural networks (NNs) and wave variable technique to simultane-
ously compensate for the effects caused by communication delays
and dynamics uncertainties. The teleoperation system is set up
with a TouchX joystick as the master device and a simulated
Baxter robot arm as the slave robot. The haptic feedback is
provided to the human operator to sense the interaction force
between the slave robot and the environment when manipulating
the stylus of the joystick. To utilize the workspace of the telerobot
as much as possible, a matching process is carried out between
the master and the slave based on their kinematics models. The
closed loop inverse kinematics (CLIK) method and RBF NN
approximation technique are seamlessly integrated in the control
design. To overcome the potential instability problem in the
presence of delayed communication channels, wave variables and
their corrections are effectively embedded into the control system,
and Lyapunov based analysis is performed to theoretically
establish the closed-loop stability. Comparative experiments have
been conducted for a trajectory tracking task, under the different
conditions of various communication delays. Experimental results
show that in terms of tracking performance and force reflection,
the proposed control approach shows superior performance over
the conventional methods.
Index Terms—Neural Networks; Teleoperation Control; Wave
Variable; Time-Varying Delay
I. INTRODUCTION
In the past decades, robotic technologies have been devel-
oped rapidly in a wide range of engineering fields. The teler-
obot operation as one of the most attractive and challenging
topic in robotics has been used in various applications such
as telesurgery, search and rescue, 3D game development and
so on [1]. A typical teleoperation system usually comprises 5
This work was partially supported by National Nature Science Founda-
tion (NSFC) under Grants 61473120, 61573147 and 91520201, Guangdong
Provincial Natural Science Foundation 2014A030313266 and International
Science and Technology Collaboration Grant 2015A050502017, Science and
Technology Planning Project of Guangzhou 201607010006 and the Funda-
mental Research Funds for the Central Universities under Grant 2015ZM065.
C. Yang, X. Wang, Z. Li and C.-Y. Su are with Key Laboratory of
Autonomous Systems and Networked Control, College of Automation Science
and Engineering, South China University of Technology, Guangzhou, 510640
China. (Email: cyang@ieee.org; xj.wang
scut@qq.com; zjli@ieee.org;chun-
yi.su@concordia.ca). C. Yang is also with Zienkiewicz Centre for Computa-
tional Engineering, Swansea University, SA1 8EN, UK. C.-Y. Su is on leave
from Concordia University, Canada.
Y. Li is with Department of Bioengineering, Imperial College London, SW7
2AZ London, UK. (Email: yanan.li@imperial.ac.uk)
parts: human operator, master device, communication chan-
nels, slave robot and environment [2]. Usually, the human
operator controls the motion of a master device, which is
physically in contact with human. The master device generates
commanding trajectories passed through the communication
channels, which are passed to the slave robot that acts on
the target environment and completes the task. The interaction
force between the slave robot and the environment is fedback
to the master, according to which the human operator could
control the robot more effectively.
In this paper, we use a Geomagic
r
TouchX as the master
device, which is designed by the SensAble Technologies Inc.
TouchX is a haptic feedback device including both hardware
drives and software packages (OpenHaptics
r
toolkit) [1]. The
TouchX arm includes 3 rotational joints, each joint is equipped
with a motor to generate the feedback force. A 3 degrees of
freedom (DOFs) gimbal joint stylus is installed at the end of
the manipulator to provide the orientation motion. As a slave
robot, there are 7 revolute joints in each arm of Baxter, which
make it easy to move in the 3-D space [3]. In order to grasp
and handle the objects, a rotational gripper is installed at the
end-effector of each arm. The MATLAB Robotics Toolbox [4]
is used to establish the kinematics and dynamics models of the
Baxter robot arm, which is used as the slave telerobot to test
the proposed method.
As we known, the communication channels play significant
roles in a teleoperation systems, and time delays in the
channels may cause system unstable in the presence of force
feedback [5]. Much effort has been made to handle the effect
of time delays [6], especially in the bilateral teleoperation
system [7], [8]. The notion of wave variable was proposed
and it has been established in [9] that the wave variable could
guarantee the stability of the communication with time delays.
The influence of time-varying delays to the stability of a
teleoperation system was studied in [10]. In [2], the method
of integrating the corrected wave was investigated to remove
the distortion caused by the transmission of wave variables. A
wave variable based control was proposed in [11] to handle
the problem in the bilateral time-varying system. Based on this
idea, a novel approach was presented for controlling the time-
varying delayed teleoperation system with a PD controller
[12].
Due to the existence of uncertainties in practical application-

s [13], the research on controlling the uncertain robot system
becomes significantly important. The adaptive control method
for robots has been studied in a considerable number of works
[14]–[16], which could be used in the situations of unknown
parameters and time-varying parameters in the robot model
[17], [18]. Time-varying delays and uncertainties of the robot
model have been studied together with adaptive control in
[19], [20]. In [21]–[23], adaptive fuzzy control was used for
identification of the unknown nonlinear control system. Fuzzy
control was used for studying uncertain nonlinear systems in
[24], [25]. In [26], fuzzy control was adopted to improve
the performance of the automobile cruise system. In recent
years, the applications of NN to the robot control system have
become increasingly popular [27]–[31], due to the fact that
the NN has the ability to emulate complicated nonlinearity
and uncertain functions [32]–[34]. The RBF NN is a highly
effective method and has been extensively used for control
design of uncertain robot systems [35]. Adaptive RBF NN
based control has been investigated in [36], [37] to deal
with the deadzone and uncertain robotic model. In [38], an
adaptive NN method was applied to achieve control of the
uncertain marine vessel system. In [39], a controller was
designed for dual-arm coordination of a humanoid robot based
on the adaptive neural control. The tracking performance of
the adaptive NN control for a discrete-time system was studied
in [40]. In [41], the effectiveness of the NN control was firstly
considered in the Prandtl-Ishlinskii (PI) hysteresis system. The
RBF NN was investigated in [42] as a compensator to solve
the non-linearities problem that a standard PD controller could
not handle. In [43], the RBF NN was used to learn the robot
behavior, and in [44] the RBF NN was investigated to improve
the behaviour of the non-linear actuator. In [45], trajectory
control of a fruit and vegetable picking robot was studied,
and in [46] the RBF NN was used to compensate for the
deadzone of the non-linear system. In [47], the RBF NN has
been discussed in detail for compensation for the tracking error
in controlling mobile robots. In this paper, a NN controller
based on the PD control is applied to the slave robot with
7 DOFs, which guarantees more accurate trajectory tracking
than the conventional PD controller.
The reminder of the paper is organized as follows: in Section
II, the computational model of the master-slave teleoperation
system is analysed. In Section III, the PD control on the master
and slave is first discussed, and the nonlinear uncertainties
of the model of the slave robot are then analysed. The
CLIK method is used for avoiding kinematic singularities and
numerical drifts. The RBF NN control is designed for the
slave robot. Finally, the convergence of the tracking error and
the stability of the teleoperation system with the time-varying
delays are established. Comparative experiments are carried
out in Section IV and conclusions and possible future work
are discussed in Section V.
Fig. 1. The system framework
Throughout this paper, the following notions are used
0
m×n
stands for an m × n dimensional zero matrix.
k·k denotes the Euclidean norm of vectors and induced
norm of matrices.
A := B means that B is defined as A.
sup means the least upper bound of a partially ordered
set.
λ(·) stands for eigenvalue of a matrix.
(·) stands for real part of a complex number.
L
2
is a function space and the functions in L
2
are
quadratically integrable. L
is a function space and the
functions in L
are essentially bounded measurable.
tr stands for trace of a matrix.
II. MATHEMATICAL MODEL OF TELEOPERATION
SYSTEM
A. Illustration of Teleoperation System
The teleoperation system in this paper is shown in Fig. 1.
As we see, the human operator holds the stylus of the haptic
device and drives the motion of the master device, which will
be regarded as position commands. Through processing by the
master computer, the new commands will be generated by the
master computer and then passed to the slave computer and
received by the slave controller. The Baxter robot will move in
accordance with the commands from the slave controller. The
manipulator end-effector interacts with the environment, and
the interaction force is passed to the haptic device to be sensed
by the human operator, which will lead to a new movement
and new control commands. In next five subsections we will
analyze the mathematical models of the components of the
teleoperation system.
B. Kinematics and Dynamics of Master Robot Arm
The haptic device Geomagic TouchX not only sends com-
mands of movement to the master device, but also returns the
interaction force between the telerobot and the environment,
and this is very useful for the operator to regulate the contact
force [48]. The mathematical model of the master includes the
kinematics model and the dynamics model.
The kinematics model of TouchX is built based on its
structure, as shown in Fig. 2. With 6 revolute joints, three
of them are equipped with motors, and the other three are
gimbal joint stylus considered as an end-effector, making it
flexible to move within the workspace. For more concrete and

Fig. 2. The structure of TouchX
Fig. 3. Comparison between two kinematics models of TouchX based on
standard DH parameters and modified DH parameters
intuitive representation of its structure, the Denavit Hartenburg
(DH) parameters are used to build the kinematics model [3].
There are two representations for DH parameters, the stan-
dard DH convention [49] and the modified DH convention
[50], and the latter representation is used for the kinematics
modelling of the TouchX joystick in this work. According to
the standard DH convention, the origins of the coordinates
relevant to joint 4 and joint 5 as shown in the left panel of
Fig. 3 are the same. Consequently, the simulated robot mod-
eled by the MATLAB Robotics Toolbox should be modified.
Specifically, a
i1
and d
i
are used to represent the link length
and the link offset, respectively, where i represents the ith
joint of the master device. α
i1
and θ
i
are used to represent
the joint twist angle and joint angle, respectively. All the 6
joints of the master device are revolute, and the modified DH
parameters of the TouchX are obtained in Tab. I.
The DH parameters in Tab. I represent the structure char-
acteristics of the master device, from which the kinematics
model could be obtained. According to [3], the homogeneous
transformation between two adjacent coordinates in Fig. 1
TABLE I
DH PARAMET ERS (MODIFIED CONVENTION) OF THE MASTER DEVICE
Link i θ
i
(angle limit(deg)) d
i
a
i1
α
i1
(rad)
1 q
1
(-6060) 0 0 0
2
q
2
(0105) 0 0 π/2
3
q
3
(-180180) 0 L
m1
0
4
q
4
(-145145) L
m2
0 π/2
5 q
5
(-7070) 0 0 π/2
6
q
6
(-145145) 0 0 π/2
could be formulated using DH parameters as follows:
i1
A
i
(θ
i
, d
i
, a
i
, α
i
) =
i
i
i
i
i
a
i
i
i
i
i
i
i
a
i
i
0
i
i
d
i
0 0 0 1
(1)
where c is short for trigonometric function cos and s
is short for sin .
Moreover, the relationship between the position of the end
effector and the base could be calculated as follows:
n
X
0
=
0
A
1
1
A
2
...
n1
A
n
· X
n
(2)
where n is 6 for the master device, X = [x, y, z, 1] represents
the position of the related joint, and
i
A
i+1
(i = 0, 1, ..., n 1)
represents the adjacent coordinate in (1).
The dynamics model of the master device reveals the
relationship between the driving torque or related force and
joint motion, and could be represented as follows:
M
m
(q
m
)¨q
m
+ h
m
(q
m
, ˙q
m
) = J
T
m
F
h
τ
m
+ f
m
(3)
where
h
m
(q
m
, ˙q
m
) = C
m
(q
m
, ˙q
m
) ˙q
m
+ G
m
(q
m
) (4)
with the subscript m used to indicate master. For a robot
manipulator with n-DOF serial links and all the joints revolute,
q
m
, ˙q
m
and ¨q
m
R
n
are the joint position, velocity and
acceleration, respectively. M
m
(q
m
)¨q
m
R
n×n
is the inertia
matrix. h
m
(q
m
, ˙q
m
) represents the nonlinear coupling term
of the centripetal force, Coriolis force and the gravity force.
f
m
represents coulomb friction, load changes, time-delayed
jamming and other disturbances. J
m
is the Jacobian matrix
and J
T
m
is its transpose. F
h
is the force exerted by the human
operator and τ
m
is the torque control signal, both of which
will be applied to the master device. The terms on the left
hand side of Eqs. (3) and (4) satisfy the following properties
[51]:
Property 1: The matrix M
m
(q
m
) R
n×n
is a symmetric
positive-definite matrix.
Property 2: The matrix
˙
M
m
(q
m
)2C
m
(q
m
, ˙q
m
) is a skew-
symmetric matrix, i.e., z
T
˙
M
m
2C
m
z = 0, z R
n
.
Property 3: M
m
(q
m
) and G
m
(q
m
) are bounded, and
C
m
(q
m
, ˙q
m
) satisfies that q
m
, ˙q
m
R
n
, K
cm
R
>0
such
that kC
m
(q
m
, ˙q
m
)k K
cm
|˙q
m
|
2
.
C. Model of the Slave Robot
Fig. 4 shows the structure of the Baxter robot, which is
a dual-arm robot with 7-DOFs per arm. In this paper, the
simulated left arm of the Baxter robot is taken as the slave
telerobot.
The standard DH parameters are used to describe the
structure of the left arm of the Baxter robot, as shown in
Tab. II. The lengths mentioned in Fig. 4 and Tab. II are L
s0
= 0.27m, L
s1
= 0.069m, L
s2
= 0.364m, L
s3
= 0.069m, L
s4

Fig. 4. The structure of the Baxter robot
TABLE II
DH PARAMETERS (STANDARD CONV ENTION) OF THE SLAVE ROBOT
Link i θ
i
(anglelimit(deg)) d
i
a
i
α
i
(rad)
1 q
1
(-97.597.5) L
s0
L
s1
π/2
2
q
2
+
π
2
(-12360)
0 0 π/2
3
q
3
(-175 175) L
s2
L
s3
π/2
4
q
4
(2.865 150) 0 0 π/2
5
q
5
(-175.27 175.27) L
s4
L
s5
π/2
6
q
6
(-90 120) 0 0 π/2
7
q
7
(-175.27 175.27) L
s6
0 0
= 0.375m, L
s5
= 0.01m and L
s6
= 0.28m. According to Eqs.
(1) and (2), the forward kinematics model of the Baxter robot
could be obtained, and n used in (2) is 7.
In order to achieve a precise tracking of the position com-
manded by the master joystick, a workspace matching between
the master joystick and the slave telerobot is essential. The
Monte Carlo method used in [3] was applied to approximate
the workspace for the master and the slave. In order to make
sure that the transformed workspace of the master device is
constrained within that of the slave robot, the workspace of
the master is scaled in a fixed proportion [52]. Fig. 5 shows
the workspace of the master and that of the slave after the
matching process. The top left panel of Fig. 5 shows the
enveloped surface, generated as convex hull of the 3D clouds
of the workspaces of both master and slave after the matching
process. Similar to the workspace transformation developed in
[3] between the Baxter robot arm and the Omni joystick, in
this work, the workspace transformation between the Baxter
robot arm and the TouchX joystick is given as below:
x
s
y
s
z
s
=
cosδ sinδ 0
sinδ cosδ 0
0 0 1
×
S
x
0 0
0 S
y
0
0 0 S
z
x
m
y
m
z
m
+
T
x
T
y
T
z
(5)
where [x
s
, y
s
, z
s
]
T
, [x
m
, y
m
, z
m
]
T
represent the Cartesian co-
ordinates of the end-effectors of Baxter and TouchX joystick,
respectively. δ is the rotation angle of Z axis for the base
of the master device, [S
x
, S
y
, S
z
]
T
and [T
x
, T
y
, T
z
]
T
are the
proportionality factors and offset correction terms about the
X, Y and Z axes, respectively. According to [3], the matching
parameters of (6) are given by
δ =
π
4
,
S
x
S
y
S
z
=
0.0041
0.0040
0.0041
,
T
x
T
y
T
z
=
0.701
0.210
0.129
1.5
1
0.5
Y
0
-0.5
-11
0.5
X
0
-0.5
-1
0.5
1
1.5
-0.5
0
Z
(a)
X
-0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1
Y
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
0.8
1
1.2
(b)
X
-0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1
Z
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
0.8
1
1.2
1.4
(c)
Y
-0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1 1.2
Z
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
0.8
1
1.2
1.4
(d)
Fig. 5. Workspace matching. (a) The 3-D envelope surface of the master and
the slave. (b) Workspace matching in the X-Y plane. (c) Workspace matching
in the X-Z plane. (d) Workspace matching in the Y-Z plane
The dynamics model of the telerobot could be represented
as follows
M
s
(q
s
)¨q
s
+ h
s
(q
s
, ˙q
s
) = τ
s
J
T
s
F
e
+ f
s
(6)
h
s
(q
s
, ˙q
s
) = C
s
(q
s
, ˙q
s
) ˙q
s
+ G
s
(q
s
) (7)
where the subscript s is used to indicate slave. The denota-
tions of the components on the left hand side of Eqs. (6) and
(7) are similar to those of Eqs. (3) and (4). F
e
is the interaction
force between the environment and the telerobot, and τ
s
is the
control input to the slave.
D. Model of the Human Operator
Early studies have shown that the muscle property of
the human hand could be modelled as a spring. A more
complete mass spring damper model is proposed in [53]. In
the teleoperation system, the human hand holds the stylus of
the master joystick, and gives the corresponding position and
velocity commands. Even if the master device is subjected
to the environmental force transmitted from the slave, the
operator could also adjust the output of the hand to make the
master track the desired movement of the operator. Therefore,
the human hand could actually be regarded as controlled by
an intelligent proportional-integral (PI) controller, which could
adjust the output force of the hand according to the error
between the actual position x
m
and the desired position x
md
of the master described by
F
h
= K
hp
(x
md
x
m
) + K
hi
Z
t
t
0
(x
md
x
m
)dt (8)
where K
hp
and K
hi
are the proportional gain and integral
gain of the human hand, respectively. The notations of t
0
and t are the initial time instant and the current time instant,
respectively.

Citations
More filters
Journal ArticleDOI

Neural Control of Bimanual Robots With Guaranteed Global Stability and Motion Precision

TL;DR: In order to extend the semiglobal stability achieved by conventional neural control to global stability, a switching mechanism is integrated into the control design and effectiveness of the proposed control design has been shown through experiments carried out on the Baxter Robot.
Journal ArticleDOI

Adaptive Parameter Estimation and Control Design for Robot Manipulators With Finite-Time Convergence

TL;DR: A robot control/identification scheme to identify the unknown robot kinematic and dynamic parameters with enhanced convergence rate was developed, and the information of parameter estimation error was properly integrated into the proposed identification algorithm, such that enhanced estimation performance was achieved.
Journal ArticleDOI

Adaptive Neural Network Control of a Robotic Manipulator With Time-Varying Output Constraints

TL;DR: A disturbance observer is devised to estimate the unknown disturbance from humans and environment and a neural network which utilizes a radial basis function is used to solve the uncertain dynamics of the robotic manipulator.
Journal ArticleDOI

Robot Learning System Based on Adaptive Neural Control and Dynamic Movement Primitives

TL;DR: This paper proposes an enhanced robot skill learning system considering both motion generation and trajectory tracking, and a neural-network-based controller is designed for the robot to track the trajectories generated from the motion model.
Journal ArticleDOI

Robot manipulator control using neural networks: A survey

TL;DR: The problem foundation of manipulator control and the theoretical ideas on using neural network to solve this problem are analyzed and then the latest progresses on this topic in recent years are described and reviewed in detail.
References
More filters
Book

Applied Nonlinear Control

TL;DR: Covers in a progressive fashion a number of analysis tools and design techniques directly applicable to nonlinear control problems in high performance systems (in aerospace, robotics and automotive areas).
Book

A Mathematical Introduction to Robotic Manipulation

TL;DR: In this paper, the authors present a detailed overview of the history of multifingered hands and dextrous manipulation, and present a mathematical model for steerable and non-driveable hands.
Journal ArticleDOI

Universal approximation using radial-basis-function networks

TL;DR: It is proved thatRBF networks having one hidden layer are capable of universal approximation, and a certain class of RBF networks with the same smoothing factor in each kernel node is broad enough for universal approximation.
Journal ArticleDOI

Time-delay systems: an overview of some recent advances and open problems

TL;DR: Some open problems are discussed: the constructive use of the delayed inputs, the digital implementation of distributed delays, the control via the delay, and the handling of information related to the delay value.
Journal ArticleDOI

Bilateral control of teleoperators with time delay

TL;DR: In this paper, a control law for teleoperators is presented which overcomes the instability caused by time delay by using passivity and scattering theory, a criterion is developed which shows why existing bilateral control systems are unstable for certain environments, and why the proposed bilateral control law is stable for any environment and any time delay.
Related Papers (5)