Journal Article•DOI•

A Comparison of Virtual and Physical Training Transfer of Bimanual Assembly Tasks

Q: What are the contributions in "A comparison of virtual and physical training transfer of bimanual assembly tasks" ?

In this paper, the authors present a study that compares the effectiveness of virtual training and physical training for teaching a bimanual assembly task. After training, the authors conducted immediate tests in which participants were asked to solve a physical version of the puzzles. The authors measured performance through success rates and assembly completion testing times. The authors discuss the implications of the results and highlight the validity of virtual reality systems in training. Their results show that the performance of virtually trained participants was promising.

Maria Murcia-Lopez¹, Anthony Steed¹•Institutions (1)

University College London¹

01 Apr 2018-IEEE Transactions on Visualization and Computer Graphics (IEEE)-Vol. 24, Iss: 4, pp 1574-1583

TL;DR: A study that compares the effectiveness of virtual training and physical training for teaching a bimanual assembly task and highlights the validity of virtual reality systems in training.

read less

Abstract: As we explore the use of consumer virtual reality technology for training applications, there is a need to evaluate its validity compared to more traditional training formats. In this paper, we present a study that compares the effectiveness of virtual training and physical training for teaching a bimanual assembly task. In a between-subjects experiment, 60 participants were trained to solve three 3D burr puzzles in one of six conditions comprised of virtual and physical training elements. In the four physical conditions, training was delivered via paper- and video-based instructions, with or without the physical puzzles to practice with. In the two virtual conditions, participants learnt to assemble the puzzles in an interactive virtual environment, with or without 3D animations showing the assembly process. After training, we conducted immediate tests in which participants were asked to solve a physical version of the puzzles. We measured performance through success rates and assembly completion testing times. We also measured training times as well as subjective ratings on several aspects of the experience. Our results show that the performance of virtually trained participants was promising. A statistically significant difference was not found between virtual training with animated instructions and the best performing physical condition (in which physical blocks were available during training) for the last and most complex puzzle in terms of success rates and testing times. Performance in retention tests two weeks after training was generally not as good as expected for all experimental conditions. We discuss the implications of the results and highlight the validity of virtual reality systems in training.

...read moreread less

Summary (3 min read)

Jump to: [1 INTRODUCTION] – [2 RELATED WORK] – [4.2 Materials] – [4.3 Physical training environment] – [4.4 Virtual training environment] – [4.5 Procedure] – [5.1 Types of errors] – [5.2 First session 5.2.1 Training times] – [5.2.2 Immediate testing success rates] – [Dimension Question] – [5.2.3 Immediate testing completion times] – [5.2.4 Subjective questionnaire ratings] – [5.3.1 Participants] – [5.3.4 Subjective questionnaire ratings] – [6 DISCUSSION] and [7 CONCLUSION]

1 INTRODUCTION

Section 3 presents the experimental design and hypotheses.
Section 6 discusses the results, limitations and future work.

4.2 Materials

A virtual replica of the laboratory was modeled for the virtual enviornment used in the virtual experimental conditions.
An Oculus Rift Consumer Version 1, two Oculus Touch controllers and two Oculus sensors were used for the virtual experimental conditions.
Preassembled blocks for the first and second puzzles were glued together.

4.3 Physical training environment

For those experimental conditions in which the physical blocks were available during training (PB and PV I B) these were initially placed on the table following the same configuration as the paper instructions.
Preassembled puzzles were placed behind the blocks.

4.4 Virtual training environment

All interactions in the virtual training environment could be equally carried out using either hand and participants could concurrently complete one interaction with each hand.
A participant could grab and rotate the assembled pieces with one hand and grab the next block to attach with the other hand.
The green highlight indicates on the block is colliding with its preview block and within twenty degrees from the correct orientation.
By releasing the trigger button of the Oculus Touch controller the virtual block would snap into its correct location.

4.5 Procedure

After a waiting period of two weeks, participants returned to the lab for the second session.
In this session participants were asked to complete a paper version of the Vandenberg and Kuse Mental Rotations Test [22] .
They then completed the retention test for each of the three puzzles, in which they were asked to solve the three burr puzzles from the first session without a training phase, in the same order and in a maximum of three minutes.
They completed the same questionnaire from the first session at the end of each retention trial (see Table 3 ).
After completing all retention trials they were interviewed regarding strategies used throughout the session.

5.1 Types of errors

Unsuccessful puzzle completions during immediate and retention testing were due to one of two reasons.
In most cases, participants did not complete the 3D puzzles within the given maximum time (180s).
On the other hand, a low number of participants decided to stop the time before the upper limit thinking that they had successfully solved the puzzle.
Close inspection showed that they had not correctly assembled the pieces.
Completion time values for both immediate and retention testing were corrected by assigning the upper time limit (180s) to all unsuccessful attempts.

5.2 First session 5.2.1 Training times

The post hoc analysis revealed statistically significant differences in training times for the first puzzle.
There was a statistically significant difference between P (mean rank = 15.
The post hoc analysis revealed statistically significant differences in training times for the second puzzle.

5.2.2 Immediate testing success rates

The model suggested that participants in the P experimental condition were 0.074 times as likely to successfully assemble the third puzzle than participants in the reference category (PV I B).
The model suggested that participants in the PV I experimental condition were 0.028 times as likely to successfully assemble the third puzzle than participants in the reference category (PV I B).

Dimension Question

Likert scale extremes Difficulty Please rate the difficulty of the task you just completed.
It is important to note that all participants in this condition successfully completed the third puzzle.
The model suggested that participants who succeeded at correctly assembling the second puzzle were 9.687 times as likely to successfully assemble the third puzzle than participants in the reference category (PV I B).
For the third puzzle, the binomial logistic regression model with the highest percentage of correctly classified observations was the one that ascertained the effect of both experimental condition and successful completion of the previous puzzle.
Test statistics using Dunn's procedure [4] for immediate testing times between the different experimental conditions.

5.2.3 Immediate testing completion times

The post hoc analysis revealed statistically significant differences in immediate testing times for the third puzzle.
The analysis of immediate testing completion times shows some support for H2 and H3.

5.2.4 Subjective questionnaire ratings

There was a statistically significant difference in ease of use of the training environment (F(5,54) = 3.044, p = 0.017) between groups as determined by one-way ANOVA for the third puzzle.
No other significant interactions were found for the third puzzle.

5.3.1 Participants

A total of 56 participants that completed the first part session returned to complete the second session two weeks later (average number of days between training session and retention session: 14.16, SD = 0.918).
Overall, retention testing performance was lower than expected for all conditions both in terms of success rates and completion times.

5.3.4 Subjective questionnaire ratings

There was no statistically significant difference in rated difficulty and seriousness between groups as determined by one-way ANOVA for any of the three puzzles.
Tukey post hoc tests showed no significant interactions.

6 DISCUSSION

One of the limitations in their design was the high complexity of the puzzles.
Overall, retention testing resulted in lower performance than the authors had expected and they believe this is due to the difficulty associated with remembering the process to solve the three puzzles two weeks after the training.
This was further validated by verbal feedback from their participants during the second session.
The authors previous piloting of the task had not shown this effect.
Future studies should further evaluate the suitability of the task for retention.

7 CONCLUSION

The authors analysed performance in terms of success rates as well as immediate testing times and retention testing times.
The authors results show that the performance of virtually trained participants was promising.
A statistically significant difference wasn't found between condition V E A and the best performing physical condition (PV I B, in which physical blocks and animated instructions were available during training) for the last and most complex puzzle in terms of success rates and immediate testing times.
Retention testing performance was unexpectedly low due to the high complexity of the task.
The authors believe that the results of this study further validate the effectiveness of virtual training for bimanual assembly tasks.

Did you find this useful? Give us your feedback

Figures (1)

Table 5. Test statistics using Dunn’s procedure [4] for immediate testing times between the different experimental conditions. Significance values have been adjusted by the Bonferroni correction for multiple tests.

Content maybe subject to copyright Report

1574 IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 24, NO. 4, APRIL 2018

Manuscript received 11 Sept. 2017; accepted 8 Jan. 2018.

Date of publication 19 Jan. 2018; date of current version 18 Mar. 2018.

For information on obtaining reprints of this article, please send e-mail to:

reprints@ieee.org, and reference the Digital Object Identiﬁer below.

Digital Object Identiﬁer no. 10.1109/TVCG.2018.2793638

This work is licensed under a Creative Commons Attribution 3.0 License. For more information, see http://creativecommons.org/licenses/by/3.0/

A Comparison of Virtual and Physical Training Transfer

of Bimanual Assembly Tasks

Mar

ıa Murcia-L

opez and Anthony Steed

Abstract

—As we explore the use of consumer virtual reality technology for training applications, there is a need to evaluate its validity

compared to more traditional training formats. In this paper, we present a study that compares the effectiveness of virtual training and

physical training for teaching a bimanual assembly task. In a between-subjects experiment, 60 participants were trained to solve three

3D burr puzzles in one of six conditions comprised of virtual and physical training elements. In the four physical conditions, training was

delivered via paper- and video-based instructions, with or without the physical puzzles to practice with. In the two virtual conditions,

participants learnt to assemble the puzzles in an interactive virtual environment, with or without 3D animations showing the assembly

process. After training, we conducted immediate tests in which participants were asked to solve a physical version of the puzzles.

We measured performance through success rates and assembly completion testing times. We also measured training times as well

as subjective ratings on several aspects of the experience. Our results show that the performance of virtually trained participants

was promising. A statistically signiﬁcant difference was not found between virtual training with animated instructions and the best

performing physical condition (in which physical blocks were available during training) for the last and most complex puzzle in terms of

success rates and testing times. Performance in retention tests two weeks after training was generally not as good as expected for all

experimental conditions. We discuss the implications of the results and highlight the validity of virtual reality systems in training.

Index Terms—Learning transfer, virtual reality, assembly, training

1INTRODUCTION

The availability of consumer virtual reality technology has raised the

manufacturing industry’s interest in virtual training for manual assem-

bly tasks. Virtual environments could deliver cost-efﬁcient, safe and

potentially effective training. If proven adequate, virtual training would

also allow for the completion of operator instruction prior to the in-

stallation of physical workstations, tools and components. This would

accelerate the end-to-end manufacturing process and, consequently,

increase efﬁciency of production. However, more evidence is needed

to ascertain the effectiveness of virtual environments for training as

opposed to more traditional forms of training.

In this paper, we present a study that compares the effectiveness

of virtual and traditional paper- and video-based training transfer of

a bimanual assembly task, motivated by previous research [3, 10]. In

a between-subjects experimental design, participants were trained to

solve three six-piece burr puzzles in a virtual training environment or a

physical training environment. The conditions were designed to account

for situations in which the physical puzzle blocks are available or not

during training. The conditions were also devised to include static

instructions (paper) or combinations of static and animated instructions

(video or 3D animations). Table 1 introduces the experimental condition

types, acronyms and deﬁnitions. Table 2 shows a classiﬁcation of

the experimental conditions according to instruction type and block

availability during training.

Following training, participants were asked to solve physical ver-

sions of the puzzles (referred to as immediate testing). Participants then

completed a retention session, two weeks after the training (referred

to as retention testing). During the course of the study, participants

answered mental rotations tests and questionnaires measuring several

aspects of the experience.

We tested three hypotheses about the effectiveness of training in each

of the conditions being compared in the study. The ﬁrst hypothesis

• Mar

ıa Murcia-L

opez is with University College London. E-mail:

maria.murcia.13@ucl.ac.uk.

• Anthony Steed is with University College London. E-mail:

a.steed@ucl.ac.uk.

(H1) was that conditions in which the physical blocks were available

during training (PB and PV

B) would yield a higher number of success-

ful puzzle completions during immediate and retention testing. The

second hypothesis (H2) was that the conditions in which both static

and animated instructions were available during training (PV

, PV

and

) would result in lower assembly times during immediate and

retention testing. The third hypothesis (H3) was that condition PV

with animated instructions (video) and physical blocks, would yield the

highest performance as measured by immediate and retention success

rates and assembly testing times. Although we expected some condi-

tions to deliver worse or better performance, we had no hypothesis on

the full order so all the analysis presented in this paper is two-tailed.

Immediate testing results showed some support for the ﬁrst hypothe-

sis, some support for the second hypothesis and some support for the

third hypothesis. Retention performance was lower than expected for

all conditions both in terms of success rates and completion times and

did not provide evidence to support any of the three hypotheses.

The remainder of this paper is organised as follows. In Section 2

we review related work on learning transfer in immersive mixed reality

systems. Section 3 presents the experimental design and hypotheses.

In Section 4 we introduce the methodology and experimental setup. In

Section 5 we report the results of the study. Section 6 discusses the

results, limitations and future work. Section 7 concludes.

ELATED WORK

Previous research has highlighted the effectiveness of immersive mixed

reality training in different disciplines, including military training, medi-

cal training and vehicle driving simulators [17,21], as well as navigation

and spatial knowledge training [8, 23], amongst others. Despite the

recognised success in the aforementioned ﬁelds, studies on immersive

virtual training transfer of procedural and assembly tasks have reported

contrasting results.

Hall and Horwitz compared retention of procedural knowledge of

equipment operation in an immersive virtual environment and in a 2D

computer environment and found no signiﬁcant differences [7]. They

claimed that virtual reality training may not be superior to conventional

electronic media for training certain skills. Gavish et al. evaluated the

use of virtual reality and augmented reality technology for industrial

maintenance and assembly task training [5]. They concluded that an

augmented reality platform was more suitable for training of this type of

tasks and encouraged further evaluation of virtual reality based training.

Fig. 1. One of the three 3D printed burr puzzles used in the study.

In a more recent study Gonzalez-Franco et al. compared collabo-

rative conventional face-to-face training with a mixed reality training

setup for a manufacturing procedure of an aircraft door [6]. Their re-

sults indicated that performance levels yielded by the immersive mixed

reality training system were not signiﬁcantly different from the conven-

tional face-to-face training format. Rose et al. evaluated the transfer

from a virtual environment to the real world of a simple sensorimotor

task [16]. Overall, virtual training resulted in equivalent or even bet-

ter real world performance than real or physical training for the task.

However, they advise that their ﬁndings may not apply to other types

of training tasks.

Sowndararajan et al. found an effect of level of immersion in memo-

rising a complex procedure [20]. In their study, participants trained in

the system with the higher level of immersion (a large L-shaped projec-

tion display) completed tasks signiﬁcantly faster and with fewer errors

than participants trained in the system with lower level of immersion

(using a typical laptop display).

Other studies have shown effective learning transfer in virtual en-

vironments with the addition of haptic force-feedback devices. For

instance, Adams et al. conducted a study to explore the beneﬁts of

haptic feedback for virtual training of a manual task [1]. They reported

that force-feedback was a requirement for higher learning transfer in

virtual environments.

Our study is inspired by the work of Carlson et al. in 2015 [3], it-

self motivated by previous work [10, 13, 19]. In a between-subjects

experimental design, Carlson et al. compared the effectiveness of vir-

tual bimanual haptic training versus traditional physical training of

an assembly task consisting of a six-piece burr puzzle. Their results

indicated that physically trained participants initially outperformed

virtually trained participants. However, virtually trained participants

improved their testing times after two weeks. Results also showed that

virtual training was enhanced by using coloured blocks as they helped

participants remember the assembly process. We run a similar task

comparing paper- and video-based training with virtual training in the

absence of a haptic force-feedback device.

We agree with Carlson et al. in that 3D burr puzzles are suitable

proxy tasks or abstractions of context-speciﬁc manual assembly tasks,

such as engine assembly operations at vehicle manufacturing plants. We

therefore decided to use the same type of task in our study. Following

their reported methods, we complemented the training task with a series

of mental rotation tests to distribute participants amongst the condition

groups in our between-subjects experimental design [2, 14, 22]. We

also decided to colour-code the puzzle blocks and instructions as well

as to use a semi-transparent virtual representation of the hands in the

virtual environment [11, 12], amongst other recommendations made by

the authors which are further explained in Section 3.

Our study extends and builds on previous work by comparing a

number of virtual and physical training formats, the latter representing

the most common formats (video and paper instructions) in current

assembly process training programmes. The main aim of this research

is to verify whether exposure to a virtual training environment is sufﬁ-

cient for effective training. We are speciﬁcally interested in situations

in which haptic devices are not available and when the physical compo-

nents and tools used in the process are not accessible during training.

XPERIMENTAL DESIGN AND HYPOTHESES

Inspired by previous research [3], in our study we used three different

colour-coded versions of a six-piece burr puzzle for the assembly task

(see Figure 1). Burr puzzles have been commonly used for assembly

task training studies in the past because they provide a recognisable and

adequately complex model in which participants must follow a speciﬁc

procedure in order to solve them [3, 10]. However, our study differs

from previous work in that no haptic devices were used. In addition, we

are interested in whether consumer virtual reality systems are sufﬁcient

for effective training.

In our study, participants were trained and tested in assembling three

versions of a six-piece burr puzzle. To provide increasing difﬁculty, the

ﬁrst three blocks had been preassembled for the ﬁrst puzzle, the ﬁrst

two for the second and none for the third. This meant that participants

had to remember a higher number of steps in the assembly process over

the course of the experimental task for each puzzle.

Following a between-subjects experimental design, participants were

trained to solve each puzzle by adding the corresponding unassembled

blocks in one of six experimental conditions (see Table 1). Experimen-

tal conditions were designed to account for scenarios in which blocks

are not available (P and PV

), physical blocks are available (PB and

B) or virtual blocks are available (

and

) during training (see

Table 2 for a classiﬁcation of the experimental conditions). The physi-

cal experimental conditions (P, PB, PV

and PV

B) were designed to

encompass combinations of paper- and video- based instructions. The

virtual experimental conditions (

and

) involved a virtual ver-

sion of the paper instructions, with or without 3D animations showing

how to correctly assemble the puzzle, and always with virtual blocks

to practice during training. All instructions (static and animated) were

colour-coded to match the physical puzzle blocks.

Following training and after a short break, participants were asked

to assemble a 3D printed physical version of the corresponding puzzle

within a given time. Participants were asked to attend a retention

session, two weeks after the training, in which they were asked to

solve the same puzzles in the same order and within the same time

constraints. We measured success rates as well as training and testing

times. Sessions were complemented by a series of mental rotations

tests as well as questionnaires and debrief interviews.

As part of their recommendations for future work, Carlson et al. sug-

gested adding a snap-to-ﬁt function or constraint system [18] to alleviate

the time that virtually trained participants spent attempting to ﬁt and

assemble the virtual blocks [3]. We followed this recommendation

and added such functionality in the virtual training environment. We

also followed their recommendation to make the selection of a block

in the virtual environment to cause a change of colour instead of just

causing a change in transparency, as participants in their study reported

that it was difﬁcult to discern transparent pieces against the transparent

virtual representation of the glove. In their discussion they mentioned

individual differences for interaction between the two hands, as some

participants showed a preference for the haptic device or the glove for

predominant use. We therefore decided to make interaction ambidex-

trous, meaning all operations were designed to be performed equally

by the left hand and the right hand.

We made the following hypotheses:

H1:

The conditions in which the physical blocks were available during

training (PB and PV

B) would yield a higher number of successful

puzzle completions during immediate and retention testing. This

relates to the experience (or lack of) built around manipulating

and assembling the physical blocks during training.

H2:

The conditions in which static and animated instructions (video

or 3D animations) were available during training (PV

, PV

B and

) would result in lower assembly times during immediate

and retention testing, as participants would have received richer

visualisation on how to assemble the blocks during training.

H3:

Condition PV

B, with physical blocks and animated instructions

(video), would yield the best performance as measured by imme-

diate and retention success rates and assembly testing times. This

hypothesis is based on H1 and H2.

MURCIA-L´OPEZ AND STEED: A COMPARISON OF VIRTUAL AND PHYSICAL TRAINING TRANSFER OF BIMANUAL ASSEMBLY TASKS 1575