1. Introduction
In the mechanical engineering industry, assembly processes belong to activities whose final quality is significantly influenced by accuracy of work instructions, experience of the worker, and his ability to correctly interpret technical documentation. This influence is of special significance in conditions of a single-piece, small serial, or variant production, where assembly tasks consist of a high degree of manual labor, variability of operations and where they place requirements for increased spatial imagination of the user.
Traditional assembly procedures in industrial practice are based especially on technical drawings, bills of material, instructional texts, and verbal instruction. Advantage of paper documentation are its easy availability, low technical requirements, and longstanding use in industrial environment.
However, in the training of new workers or inexperienced users, its utilization might pose a problem, as it requires independent interpretation of 2D technical documentation, correct parts identification, and understanding of their spatial orientation. In the case of an unfamiliar mechanism, this manner of work may prolong assembly time, lead to occurrence of errors, and to increased need for assistance [
3,
4].
Another limitation of paper documentation is its static nature. The user is forced to repeatedly switch attention between a drawing, a written instruction, physical components and the assembly itself, which may increase cognitive load. Increased cognitive load may manifest in slower or less fluent execution of the assembly procedure [
5,
6].
Figure 1 is a diagram showing a difference between traditional work with paper documentation and the assembly supported by technologies of mixed reality. Display of work information directly in the assembly space may reduce the need for switching attention and contribute to better orientation of the user.
In the context of developing concepts of Industry 4.0, smart manufacturing and human-centered manufacturing, digital work instructions are the object of ever-increasing interest as a support tool of manual activities. Digital procedures enable presenting work information in the form of a text, an image, a video, sound or 3D models and provide the user with continuous guidance throughout the task execution [
7,
8]. Their benefit lies especially in their ability to guide the user step by step and to provide visual or checking support directly in the course of assembly. Thus, it may contribute to standardization of work procedures in intelligent production and a more efficient knowledge transfer between the more and the less experienced workers [
9,
10,
11].
Augmented and mixed reality technologies represent a significant approach to implementing digital work instructions directly in the environment of real assembly. Assembly information may be displayed in the context of a workstation, thus reducing the need for interpretation of abstract technical documentation and, at the same time, improving orientation of the user.
In industrial practice, these technologies are applied especially in assembly, maintenance and servicing, as well as in training of workers and in expert know-how transfer. Their utilization makes it possible to link digital content and real work environment, thus creating a prerequisite for more effective execution of manual operations.
Research into digital, augmented, and mixed work instructions conducted so far has focused especially on their effect on the efficiency of manual assembly processes. Attention is focused especially on indicators such as assembly time, rate of error, cognitive load, usability of technology and the degree of user independence. Several studies point to the significance of supporting the user by guiding him through work sequence and spatial orientation in the course of assembly [
1,
2]. Other works emphasize the benefit of digital and AR instructions in terms of error rate reduction and increased work efficiency, especially involving less experienced users [
3,
4,
5,
6]. At the same time, research results suggest that the efficacy of digital work instructions is significantly influenced by the manner of their implementation and quality of user interface.
A significant portion of research focuses on the use of digital and AR technologies in the area of workers training and practical skills transfer. These solutions enable a more effective conveyance of assembly and maintenance procedures while supporting adoption through visual and interactive user guidance. Studies point to the fact that the AR training systems may improve adoption of practical skills and increase independence of users in executing their work tasks [
7,
8,
9,
10]. At the same time, importance of user interface design and the manner of information presentation is emphasized, as they directly affect the adoption efficiency and the quality of executed work [
11].
An important area of research is creation and management of digital and AR work instructions. Research focuses on how to effectively capture expert know-how and transform it into structured, user-friendly work procedures. Approaches based on „programming-by-demonstration“ enable creation of instructions directly from execution of work tasks, thus reducing requirements for manual creation of documentation [
12,
13]. At the same time, it appears that efficiency of AR instructions depends on particular functions of the system, such as visual augmentation of steps, guidance of the user or checking of correct execution of operations [
14,
15,
16]. Significant role is also played by visual elements optimization and the amount of information presented, since too much information may expose the user to cognitive load [
17,
18,
19].
Systematic and overview studies point to the fact that efficiency of AR, VR, and MR work instructions depends on several factors, especially on the nature of the task, on the type of the imaging device, and on the characteristics of the user group [
20,
21,
22]. Results of those studies hint at the fact that benefit stemming from digital instructing systems cannot be deemed to be universal. Rather, it needs to be assessed in the context of a particular application. For this reason, experimental verification of individual solutions under real conditions is crucial for assessing their practical utilization.
Direct comparison of paper and digital or AR work instructions point to the fact that digital support may result in reduced assembly time and error rate, especially when involving lesser-known, spatially demanding, or sequence-sensitive tasks. Several studies’ results suggest that visual and interactive user guidance enables closer adherence to assembly sequence and reduces the risk of incorrect interpretation of work steps [
23,
24,
25]. This effect is especially pronounced in users lacking prior experience with the task at hand.
At the same time, experimental studies emphasize the significance of correct components identification, parts orientation, and adherence to assembly sequence, which have a crucial impact on the success of assembly operations. Digital and AR work instructions may provide effective support in this respect through visual guidance and contextual display of information, thus promoting error reduction and improving workflow [
26,
27,
28,
29].
Other research covers digital work instructions assessment in various spheres of application, including time-critical assignments, training scenarios, and specific industrial conditions. Results of these studies speak of the importance of the context of use and, at the same time, confirm that digital instructions may improve user orientation, the course of work, and reliability of executed operations [
30,
31,
32,
33,
34,
35].
It follows from available research that digital and AR work instructions have the potential of improving efficiency of assembly processes, especially in terms of assembly time, error rate, and user support. However, this benefit cannot be deemed universal, as the end result depends on the context of a particular application and the nature of the assembly task.
That is why it is important to verify specific technological solutions experimentally in conditions corresponding to real deployment in the settings of mechanical engineering. In this context, it is relevant to examine the use of the tool Vuforia Capture, which enables the creation of digital work procedures directly from real execution of assembly operations.
The aim of the paper is to compare the traditional paper documentation with digital work procedure created with the use of Vuforia Capture. Focus is directed at key indicators of assembly significance, namely the assembly time, the number of errors, the need for additional assistance and the usability of the technology for the training of its users.
Based on the aim above, the following research questions have been postulated:
RQ1: Does the use of digital work instructions created in Vuforia Capture result in reduced assembly time compared to paper documentation?
RQ2: Is there a smaller number of assembly errors using Vuforia Capture involving users lacking prior experience?
RQ3: Does digital work procedure reduce the need for additional assistance in the course of assembly?
RQ4: Is the technology of Vuforia Capture a suitable training support tool in practical conditions of mechanical engineering?