2. Materials and Methods
MMD-MSD was designed and collected for the needs of the ErgoResearch project, which among other goals aimed at the creation of resources (datasets, software, models) in support of research and technology development (RTD) that helps for the prevention of MSD. The data collection was based on photogrammetry as it provides a nonintrusive way to assess the human body posture in various scenarios and a wearable wristband for the acquisition of peripheral physiological signals.
The focus of the MMD-MSD data collection process aimed at implementing an experimental setup and scenarios, which allow for assessing the performance of a person while working in a typical office scenario. This involves a person sitting on an office chair next to an office desk equipped with a desktop computer configuration with standard interface devices, such as a monitor, keyboard, and mouse. The dataset comprises scenarios involving people sitting in a spontaneous posture, the one that a sitting person takes naturally while concentrating on computer-bound tasks, in contrast to the scenario when the body position is manually adjusted by a medical doctor, who specialized in the field of MSD rehabilitation, to sitting in the recommended ergonomic health-friendly posture. Besides, each participant was instructed on how to keep a proper ergonomic posture while working with the specific desktop computer configuration. For capturing the person-specific body structure, the dataset also registered spontaneous and corrected standing positions, which were adjusted with the help of a physiotherapist in enface and profile perspectives.
To elicit spontaneous sitting positions, we engaged the volunteers in various computer-based tasks, starting with filling in a questionnaire, resolving the Stroop test, as well as with a discussion about their experiences during the data collection process. Here, we made use of the Stroop test as a low-complexity mental task, which does not induce a high cognitive load but requires concentration and attention. In the Stroop tests, we started with a low-complexity stimulus and subsequently alternated episodes with lower and higher difficulty to motivate higher attention of the volunteers.
Each participant implemented the Stroop test twice -- when the person was sitting on an office chair without and with an air-cushion sitting disk placed on the chair. The sitting disk requires the sitting person to maintain balance, which is feasible only when sitting in a proper working posture. Thus, here we consider the scenario when people were sitting on the air-cushioned disk as enforced to a proper sitting posture. We recorded each volunteer's performance on the Stoop test when sitting in spontaneous and manually corrected body positions.
Finally, here we ought to mention that for the selected set of 100 volunteers involved in the database creation, who were mostly students and university staff, we deem that the observed differences in individual performance were primarily due to differences in their concentration and attention during the Stroop tests and shall not be interpreted as linked to their person-specific cognitive capacity.
In the following subsections, we outline the data collection setup, protocol, types of data and their tagging, and the dataset organization and file formats.
2.1. Data Collection Setup
The MMD-MSD data collection setup consists of a typical non-adjustable office desk with a chair, a desktop computer configuration with standard computer peripherals, and a wall grid. The office desk height is 73 cm, the chair height is 42 cm, and the 24’ monitor's bottom edge was set to 13 cm from the desk. During a major part of the data collection process, we considered a sitting person in a spontaneous working posture while using a 24’ monitor, a typical 89-button keyboard, and a right-handed 3-button computer mouse. An air-cushioned stability disk (The THERABAND Stability Disc) is used for forcing an active sitting position. Two cameras (Go Plus F800 1080P HD 30fps) were used to register images and overhead video recordings during the data collection campaign. An Empatica E4 wristband was used for the acquisition of peripheral physiological signals, such as BVP (Blood Volume Pulse), Electro-Dermal Activity (EDA), and Skin Temperature (ST), as well as three-axis accelerometer data. These physiological signals have different dynamics and were registered with different sampling rates (
Table 1).
A purposely created software tool implemented the stimulus generation and kept logs of the success rates during the color-to-text matching Stroop test [
24]. Our Stoop test consisted of 20 questions divided equally into two groups – with low and with higher complexity. In the first group of questions, the font color matched the written word, and in the second group, the color differed from the color specified by the written word. Each participant chose the correct answer from among three possible answers. In a participant-specific log file, the responses and reaction time for each stimulus were recorded.
In
Figure 1, we show the arrangement of equipment during the creation of MMD-MSD along with typical body postures registered during the subsequent steps of the data collection process.
A software environment was purposely developed for the recording of physiological signals and the temporal synchronization of all recording channels. The timestamp synchronization was implemented through the generation of specific markers that serve as reference points for the beginning and end of every episode during the data collection process. The team, that implemented the dataset collection and annotation, consisted of:
a data collection supervisor, who welcomed and instructed the volunteers and guided them throughout the data collection process;
an assistant with expertise in ergonomics, who designed and administrated the consensus forms and questionaries;
a technical assistant, who ensured that the equipment was operational, assisted the participants with the equipment use, and also served as a photographer;
a medical doctor with a specialty physiotherapist, who assisted in correcting the standing and sitting postures of participants during the data collection campaign;
a senior medical doctor with habilitation in MSD prevention, who implemented the data annotation process.
2.2. Dataset Collection Protocol
The MMD-MSD dataset was recorded in a single 45-minute session per participant, with short breaks between the data collection episodes. The data collection protocol aimed at the acquisition of motion data, pictures, videos, and physiological signals. We used a common data acquisition workflow (cf.
Figure 2) implemented for each of the 100 volunteers who were recruited as participants in the data collection campaign. None of the volunteers reported previous experience with an ergonomics-oriented data collection campaign and nobody had previous exposure to the Stroop test.
Each participant was admitted individually to the premises of the laboratory accompanied only by the team specified above, which is directly involved in implementing the data collection process. The access to the laboratory by other people was restricted to lower the degree of discomfort and distraction during the data acquisition.
Timestamp synchronization between all recording channels was implemented through a universal marker event, which was generated by the software tool at the beginning and the end of each data collection step. As shown in
Figure 2, the data collection starts with a briefing about the data collection process, filling in a consent declaration, and the entry questionnaire. The questionnaire collected general demographic information about the volunteers, their activity/sport/sleep habits as well as self-reported pain issues, which were later used as self-assessment tags.
After the entry questionnaire, each participant was invited to stand next to the wall with grid wallpaper, and photographs were taken in spontaneous and corrected standing positions. Pictures were taken of each person standing
en face (front-on view) and profile (left side view) (cf.
Figure 1), including spontaneous and corrected posture. The spontaneous standing posture for each participant corresponds to the uncorrected comfortable position that the participant takes naturally, while the corrected ones refer to the positions after the professional physiotherapist manually adjusts the body posture of each person according to the established MSD prevention recommendations.
Next, the participant was asked to sit on an office chair next to the office desk with a computer configuration and perform several tasks using the computer. The technical assistant helps each participant to put on the Empatica E4 wristband. The recording sessions begin with a baseline recording and each participant is instructed on how to perform the Stroop test. This corresponds to the start of the core part of the data collection, beginning with the initial baseline recording of physiological signals and finishing with the end of the second Stroop test. These data collection steps were registered from an overhead perspective using a video camera. The video recordings capture each participant's positions and sitting behavior during the implementation of the assigned computer-bound tasks.
All participants in the experiment were asked to solve the Stroop test twice – sitting on a chair in their usual comfortable work posture and sitting on a stability disc placed on the chair. When performing the Stroop Test #1, each participant is sitting in an office chair in their usual working position with a computer. While working on the Stroop Test #1 assignment, a picture of the spontaneous sitting posture is taken from the perspective of the left profile. At some time, the medical doctor corrects the sitting position of the participant to correspond to the ergonomic body position and then another picture of the corrected sitting posture (i.e. condition “corrected”) is taken.
When performing the Stroop Test #2, the participant is sitting on the stability disk placed on the chair, which forces active sitting and helps to maintain the ergonomic body posture. While working on the Stroop Test #2, another picture of the participant is taken from the perspective of the left profile.
After the Stoop Test#2, we proceed with the recording of the second baseline recording of the physiological signals. At this stage, each participant shares experiences and observations during the experiment and answers questions about whether they feel any desk sitting-related pain in general.
2.3. Tagging of Signals and Pictures
The pictures of each of the 100 participants, standing and sitting in various positions were post-processed and tagged by two medical doctors, who have specialization in physiotherapy and have practiced this profession for more than 10 years. They manually placed body markers on the pictures at precisely specified locations on the human body, such as the head, neck, shoulder, thorax, and lower back (cf.
Figure 3).
As shown in
Figure 3, two markers were used for the
en face position, and ten markers were used for the profile positions. The precise marker positions in standing and sitting body postures are specified in
Table 2.
Based on the manually positioned body markers, we calculated various postural angles. These angles are useful for the postural assessment of the head, neck, shoulder, thorax, and lower back (cf.
Figure 4) as well as in the analysis of the reasons for self-reported pain if reported in the entry questionnaire.
In
Table 3 we summarize the postural angles of interest and their reference values. These measurements provide crucial insights into postural alignment and serve as benchmarks for evaluating ergonomic practices.
2.4. Dataset Organisation
The MMD-MSD dataset is organized into six main folders, which contain the raw data (physiological signals, pictures, videos), annotated data (pictures with markers, timestamps for the physiological recordings), and data descriptors (angles and features). The folder names correspond to the type of information they contain:
Physiological_Signals_raw – with 100 subfolders, one folder per participant;
Pictures_raw – pictures in .jpg SOOC (Straight Out Of Camera).
Pictures_with_markers – either 10 or 2 posture-specific markers, manually placed;
Video – overhead recorded videos;
Angles – computed based on manually adjusted markers;
Features – computed from the physiological signals PPG, EDA, ST.
Besides, the root folder also contains three files:
StrooptestPerformance.csv – Stoop tests data, including participant ID; question number; task complexity (L-low; H-high); participant’s response; correct answer; reaction time (test1; test2);
Dataset Description.docx – technical documentation of MMD-MSD.
Details about the file names and formats are available in
Appendix A.
2.5. Participants - Demographic Information
The MMD-MSD dataset was collected with the help of 100 volunteer participants, among which 64 males and 36 females. The demographic information of these participants is summarized in
Table 4. Most of the male participants were students and teaching assistants and few were technical staff. The mean age of males is 24.71±7.17 years, weight 82.32±17.06 kg, and height 177.5±8.3 centimeters. Approximately half of the female participants were students, and the others were teaching or administrative staff at the Technical University of Varna. The mean age of female participants is 34.53±14.03 years, a weight of 63.06±12.9 kg, and a height of 169.0±11.02 centimeters. As seen in the pictures, the non-adjustable office desk was not comfortable for the tallest and heaviest participants. The same holds for the shortest females.
2.6. Questionnaire Self-Reported Data
A summary of the self-reported information provided by the participants concerning their weekly frequency of sports activity and hours of computer use on a per-day basis is presented in
Table 5 and
Table 6. Two-thirds of the participants reported that they are engaged in some sports activities, and only one-third reported that they get involved in sports activities at most once per week or not at all. However, there are some differences in the sports activity of men (73.4% active) and women (55.6% active). As shown in
Table 4, the distribution of hours spent working with computers is nearly uniform among the four selected categories for both male and female participants. The percentages concerning hours of computer use are nearly identical for men and women.