Submitted:
26 September 2024
Posted:
27 September 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
- A database of eye movements at varying levels of handling qualities when operating an eVTOL simulator;
- An analysis of the impact of perceived task workload on subjective handling qualities ratings, supplemented by statistical data mining to reveal key indicators for handling qualities assessment;
- A framework for assessing handling qualities objectively, supplementing the existing HQRM.
2. Background
2.1. Physiological Measurements in Aviation
3. Material and Methods
3.1. Handling Qualities Assessment Framework
3.2. Dataset
3.2.1. Participants
3.2.2. Simulator Setup
3.2.3. Mission Task Elements
- Vertical step: From a stable hover at 10 feet, the eVTOL ascends to a reference altitude (40-50 feet), stabilizes for at least 2 seconds, then descends back to hover at 10 feet;
- Acceleration/deceleration (Acc/Dec): Starting from a stable hover, participants rapidly increase speed to 50 knots, then decelerate back to a hover, adjusting pitch to maintain altitude;
- Sidestep: From a stable hover, the aircraft moves laterally to a set point, maintaining constant altitude throughout the maneuver;
- Diagonal hover to stop (Diagonal): The aircraft moves diagonally while maintaining altitude, beginning from a stable hover with the longitudinal axis set at a 45° angle to a reference line;
- Slalom: Performs a series of smooth turns along the centerline of the test route at 500-ft intervals;
- Takeoff and transition (Takeoff): The eVTOL takes off from a stable hover, climbs vertically to 100 feet, accelerates, and transitions into wingborne mode while staying above marked boundaries;
- Re-transition and landing (Landing): From wingborne mode at 80 knots, the aircraft decelerate to transition mode, following marked boundaries until coming to a hover and landing;
- Hover turn: Execute a 180° turn from a stable hover at an altitude under 20 feet;
- Pirouette: The aircraft moves laterally around a 100-feet radius circle while keeping the nose pointing toward the center and maintaining at 10 feet.
3.2.4. Eye Data Collection
3.2.5. Subjective Measurements
3.2.6. Experimental Procedures
3.3. Data Mining
3.3.1. Experimental Procedures
3.3.2. Deep Learning Networks
- Subjective Handling Qualities Score (HQS): the rated scores of CHR (9-classes);
- Subjective Handling Qualities Level (HQL): Set CHR scores 1-3/4-6/7-9 as level 1/2/3 (3-classes);
- Subjective Task Workload Score (TWS): the standardized NASA-RTLX scores (9-classes);
- Subjective Task Workload Level (TWL): Set NASA-RTLX scores 1-3/4-6/7-9 as level 1/2/3 (3-classes);
- Subjective Overall Workload Level (OWL): the ranking results at the end of the experiment (3-classes);
- Pre-defined Handling Qualities Level (PHQL): the pre-defined HQ level according to the task difficulty (3-classes).
4. Results
4.1. Subjective Measurements
4.2. Eye Measurements
4.2.1. Statistical Differences
4.2.2. Associations with Handling Qualities
4.2.3. Gaze Heat Maps
4.3. Deep Networks
5. Discussions
5.1. The Associations Between Handling Qualities and Task Workload
5.2. Data Mining of Eye Measurements
5.3. Gaze Heat Maps
5.4. The Proposed LSTM Model
5.5. Comparison With Existing HQ Assessment Methods
6. Case Study
- The operators perform several pre-designed MTEs and r provides handling qualities ratings based on CHR, as described in current HQRM;
- The recorded flight trajectory is analyzed to assess flight performance, corresponding to the adequate performance and desired performance criteria described in the CHR scale;
- Gaze heatmaps are examined in terms of their depth and width to verify if they align with the operator's CHR ratings. Wider and darker heatmaps typically indicate higher workload, which should correspond to lower CHR ratings;
- Eye-tracking data is processed through the proposed LSTM model. The model outputs for each interval are averaged to provide an overall output for each MTE;
- Any inconsistencies between the gaze heatmap analysis, model outputs, and the initial CHR ratings are identified and compared. This step highlights any discrepancies that may require further investigation;
- Qualitative interview feedback is reviewed to analyze inconsistencies between the gaze data, model outputs, and the initial subjective CHR ratings. These interviews provide insight into the reasoning behind the operator's subjective assessments;
- Based on qualitative feedback and data mining results, the CHR rating for the Landing task was adjusted from 1 to 4. This adjustment is accompanied by suggestions for improving the HMI and incorporating a "dead zone" design to mitigate aircraft deficiencies and improve pilot control.
7. Limitations
8. Conclusion
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A. The indicators output by Varjo XR 3.

Appendix B. The calculated eye indicators.

References
- Zhang, J.; Liu, Y.; Zheng, Y. Overall eVTOL aircraft design for urban air mobility. Green Energy Intell. Transp. 2024, 3, 100–150. [Google Scholar] [CrossRef]
- European Union Aviation Safety Authority. Means of Compliance with the Special Condition VTOL. 2021.
- Harper, R.P.; Cooper, G.E. Handling qualities and pilot evaluation. J. Guid. Control Dyn. 1986, 9, 500–505. [Google Scholar] [CrossRef]
- Klyde, D.H.; Lampton, A.K.; Mitchell, D.G.; Berka, C.; Rhinehart, M. A new approach to aircraft handling qualities prediction. In Proceedings of the AIAA SciTech 2021 Forum, VIRTUAL EVENT, 11–15 & 19–21 January 2021. [Google Scholar] [CrossRef]
- Bailey, R.E.; Jackson, E.B.; Bilimoria, K.D.; Mueller, E.R.; Frost, C.R.; Alderete, T.S. Cooper-Harper Experience Report for Spacecraft Handling Qualities Applications. NASA Center for AeroSpace Information, 2009.
- Harris, D. Measurement of pilot opinion when assessing aircraft handling qualities. Meas. Control 2000, 33, 1–5. [Google Scholar] [CrossRef]
- Ji, H.L.; Chen, R.L.; Li, P. Distributed Atmospheric Turbulence Model for Helicopter Flight Simulation and Handling-Quality Analysis. J. Aircr. 2017, 54, 190–198. [Google Scholar] [CrossRef]
- Dussart, G.; Lone, M.; Bailey, R. Development of a Multi-Directional Manoeuvre for Unified Handling Qualities Investigation. Aerospace 2019, 6, 700–710. [Google Scholar] [CrossRef]
- Brieger, O.; Kerr, M.; Leissling, D.; Postlethwaite, I.; Sofrony, J.; Turner, M.C. Flight testing of a rate saturation compensation scheme on the ATTAS aircraft. Aerosp. Sci. Technol. 2009, 13, 92–104. [Google Scholar] [CrossRef]
- Kong, X.W. Evaluation of Flight Test Data Quality Based on Rough Set Theory. In Proceedings of the 13th International CISP-BMEI, Chengdu, China, 17–19 October 2020. [Google Scholar] [CrossRef]
- Zhang, X.; Yao, S.; Zhu, H. Handling Quality Evaluation Method in Aircraft Cockpit Based on Pilot Performance. In Proceedings of the International Conference on Applied Human Factors and Ergonomics, Orlando, FL, USA, 2018., 21–25 July. [CrossRef]
- Tan, W.Q.; Wu, Y.; Qu, X.J.; Efremov, A.V. A Method for Predicting Aircraft Flying Qualities Using Neural Networks Pilot Model. In Proceedings of the 2ND ICSAI, Shanghai, China; 2014. [Google Scholar] [CrossRef]
- Herrington, S.M.; Zahed, M.J.H.; Fields, T.D. Handling Qualities Assessment and Performance Evaluation for Unmanned Aerial Systems and Pilots. Unmanned Syst. 2024, 12, 99–131. [Google Scholar] [CrossRef]
- Cooper, G.E.; Harper, R.P. The use of pilot rating in the evaluation of aircraft handling qualities. National Aeronautics and Space Administration, 1969.
- Wilson, D.; Riley, D. Cooper-harper pilot rating variability. In Proceedings of the 16th Atmospheric Flight Mechanics Conference, Boston, MA, USA, 14-16 August 1989. [Google Scholar] [CrossRef]
- Memon, W.A.; White, M.D.; Padfield, G.D.; Cameron, N.; Lu, L. Helicopter Handling Qualities: A study in pilot control compensation. Aeronaut. J. 2023. [Google Scholar] [CrossRef]
- Cunningham, K.; Cox, D.; Murri, D.; Riddick, S. A piloted evaluation of damage accommodating flight control using a remotely piloted vehicle. In Proceedings of the AIAA Guidance, Navigation, and Control Conference, Portland, Oregon, 08–11 August 2011. [Google Scholar] [CrossRef]
- Li, Y. H.; Zhang, S. G.; He, R. C.; Holzapfel, F. Objective Detection of Trust in Automated Urban Air Mobility: A Deep Learning-Based ERP Analysis. Aerospace 2024, 11, 174. [Google Scholar] [CrossRef]
- Charles, R.L.; Nixon, J. Measuring mental workload using physiological measures: A systematic review. Appl. Ergon. 2019, 74, 221–232. [Google Scholar] [CrossRef] [PubMed]
- Mahanama, B.; Jayawardana, Y.; Rengarajan, S.; Jayawardena, G.; Chukoskie, L.; Snider, J.; Jayarathna, S. Eye Movement and Pupil Measures: A Review. Front. Comput. Sci. 2022, 3, e733531. [Google Scholar] [CrossRef]
- Peissl, S.; Wickens, C.D.; Baruah, R. Eye-Tracking Measures in Aviation: A Selective Literature Review. Int. J. Aerosp. Psychol. 2018, 28, 98–112. [Google Scholar] [CrossRef]
- Skvarekova, I.; Skultety, F. Objective Measurement of Pilot's Attention Using Eye Track Technology during IFR Flights. In Proceedings of the TRANSCOM, Novy Smokovec, SLOVAKIA; 2019. [Google Scholar] [CrossRef]
- Greiwe, D.H.; Friedrich, M. Gaze Movements of Helicopter Pilots during Real and Simulated Take-Off and Landing Maneuvers. Aerospace 2024, 11, 429–429. [Google Scholar] [CrossRef]
- Lu, T.J.; Lou, Z.S.; Shao, F.; Li, Y.; You, X.Q. Attention and Entropy in Simulated Flight with Varying Cognitive Loads, Aerosp. Med. Hum. Perform. 2020, 91, 489–495. [Google Scholar] [CrossRef] [PubMed]
- Harris, D.J.; Arthur, T.; de Burgh, T.; Duxbury, M.; Lockett-Kirk, R.; McBarnett, W.; Vine, S.J. Assessing Expertise Using Eye Tracking in a Virtual Reality Flight Simulation. Int. J. Aerosp. Psychol. 2023, 33, 153–173. [Google Scholar] [CrossRef]
- Niu, Y.F.; Zhou, T.Y.; Bai, L. Research on color coding of fighter jet head-up display key information elements in air-sea flight environment based on eye-tracking technology. Proc. Inst. Mech. Eng. Part G J. Aerosp. Eng. 2022, 236, 2010–2030. [Google Scholar] [CrossRef]
- Lyu, M.T.; Li, F.; Xu, G.Y.; Su, H. Leveraging eye-tracking technologies to promote aviation safety- A review of key aspects, challenges, and future perspectives. Saf. Sci. 2023, 168. [Google Scholar] [CrossRef]
- Di Stasi, L.L.; McCamy, M.B.; Martinez-Conde, S.; Gayles, E.; Hoare, C.; Foster, M.; Macknik, S.L. Effects of long and short simulated flights on the saccadic eye movement velocity of aviators. Physiol. Behav. 2016, 153, 91–96. [Google Scholar] [CrossRef]
- Pathirana, P.; Senarath, S.; Meedeniya, D.; Jayarathna, S. Eye gaze estimation: A survey on deep learning-based approaches. Expert Syst. Appl. 2022, 199, e116894. [Google Scholar] [CrossRef]
- Franceschiello, B.; Di Noto, T.; Bourgeois, A.; Murray, M.M.; Minier, P.; Pouget, J.; Richiardi, P.; Bartolomeo, F.; Anselmi, F. Machine learning algorithms on eye tracking trajectories to classify patients with spatial neglect. Comput. Methods Programs Biomed. 2022, 221. [Google Scholar] [CrossRef] [PubMed]
- Li, F.; Chen, C.H.; Liu, Y.S.; Chang, D.N.; Cui, J.; Sourina, O. Autoencoder-enabled eye-tracking data analytics for objective assessment of user preference in humanoid robot appearance design. Expert Syst. Appl. 2024, 249. [Google Scholar] [CrossRef]
- Zintl, M.; Marb, M.M.; Wechner, M.A.; Seiferth, D.; Holzapfel, F. Development of a virtual reality simulator for eVTOL flight testing. In Proceedings of the AIAA Aviation 2022 Forum, Chicago, IL & Virtual, 2022., June 27–July 1. [CrossRef]
- Dollinger, D.; Reiss, P.F.; Angelov, J.; Loebl, D.; Holzapfel, F. Control Inceptor Design for Onboard Piloted Transition VTOL Aircraft Considering Simplified Vehicle Operation. In Proceedings of the AIAA SciTech Forum VIRTUAL EVENT, 11–15 & 19–21 January 2021. [Google Scholar] [CrossRef]
- Zintl, M.; Kimura, S.; Holzapfel, F. A Mixed Reality Research Flight Simulator for Advanced Air Mobility Vehicles. In Proceedings of the AIAA AVIATION FORUM AND ASCEND 2024, Las Vegas, Nevada, USA, 2024., 30 July–1 August. [CrossRef]
- Liedtke, S.; Zintl, M.; Holzapfel, F.; Klinker, G. A Mixed Reality Setup for Prototyping Holographic Cockpit Instruments. In Virtual Reality and Mixed Reality. EuroXR 2023. Lecture Notes in Computer Science, vol 14410; Springer, Cham, 2023. [CrossRef]
- Baskett, B.J.; Daniel, D.L.O. Aeronautical design standard performance specification handling qualities requirements for military rotorcraft. United States Army Aviation and Missile Command, 2000.
- Wechner, M.A.; Marb, M.M.; Zintl, M.; Seiferth, D.; Holzapfel, F. Design, Conduction and Evaluation of Piloted Simulation Mission Task Element Tests for Desired Behavior Validation of an eVTOL Flight Control System. In Proceedings of the AIAA Aviation 2022 Forum, Chicago, IL & Virtual, 2022., June 27–July 1. [CrossRef]
- Varjo. Varjo XR3. Available online: https://developer.varjo.com/docs/native/eye-tracking (accessed on [01 January 2024]).
- Friesen, D.; Brost, C.; Pavel, M.D.; Masarati, P.; Mulder, M. Human-automation interaction for helicopter flight: Comparing two decision-support systems for navigation tasks. Aerosp. Sci. Technol. 2022, 129, 107719. [Google Scholar] [CrossRef]
- Abdi, H.; Williams, L.J. Tukey’s honestly significant difference (HSD) test. Encycl. Res. Des. 2010, 3, 1–5. [Google Scholar]
- Jiang, H.Y.; Gao, M.D.; Huang, J.Q.; Tang, C.; Zhang, X.Q.; Liu, J. DCAMIL: Eye-tracking guided dual-cross-attention multi-instance learning for refining fundus disease detection. Expert Syst. Appl. 2024, 243. [Google Scholar] [CrossRef]
- Lazzara, M.; Chevalier, M.; Colombo, M.; Garcia, J.G.; Lapeyre, C.; Teste, O. Surrogate modelling for an aircraft dynamic landing loads simulation using an LSTM AutoEncoder-based dimensionality reduction approach. Aerosp. Sci. Technol. 2022, 126. [Google Scholar] [CrossRef]
- Yin, Y.; Juan, C.; Chakraborty, J.; McGuire, M.P. Classification of Eye Tracking Data Using a Convolutional Neural Network. In Proceedings of the 17th IEEE International Conference on Machine Learning and Applications (ICMLA), Orlando, FL, USA, 17–20 December 2018. [Google Scholar] [CrossRef]
- Lee, Y.; Shin, C.; Plopski, A.; Ltoh, Y.; Piumsomboon, T.; Dey, A.; Lee, G.; Kim, S.; Billinghurst, M. Estimating Gaze Depth Using Multi-Layer Perceptron. In Proceedings of the 2017 International Symposium on Ubiquitous Virtual Reality (ISUVR), Nara, Japan, 27–29 June 2017. [Google Scholar] [CrossRef]
- Strenzke, R.; Uhrmann, J.; Benzler, A.; Maiwald, F.; Rauschert, A.; Schulte, A. Managing cockpit crew excess task load in military manned-unmanned teaming missions by dual-mode cognitive automation approaches. In Proceedings of the AIAA Guidance, Navigation, and Control Conference, Portland, Oregon, 08–11 August 2011. [Google Scholar] [CrossRef]
- Das, S.; Maiti, J. Assessment of cognitive workload based on information theory enabled eye metrics. Saf. Sci. 2024, 176. [Google Scholar] [CrossRef]
- Wickens, C.D.; Alexander, A. Attentional Tunneling and Task Management in Synthetic Vision Displays. Int. J. Aviat. Psychol. 2009, 19, 182–199. [Google Scholar] [CrossRef]













| Method | Examples | Material | Advantages | Disadvantages |
|---|---|---|---|---|
| Subjective methods | [5,6] | CHR / Bedford scale / NASA TLX | Human-centered evaluation;???Easy to understand and apply. | Relies on the pilot's subjective feelings;???Difficult to quantify. |
| Flight model | [7,8] | flight test / simulation data | Adopting quantitative indicators makes the assessment objective. | Largely influenced by model accuracy.???Ignore human factor. |
| Flight test | [9,10] | flight test reports | Assessment results are realistic and comprehensive. | Costly and risky; ???Affected by pilots' ability to fly and assess. |
| Pilot model | [11,12] | motion data & subjective report | Qualitative and quantitative evaluation. | Influenced by pilot subjectivity and environmental factors. |
| ID | 01 | 02 | 03 | 04 | 05 | 06 | 07 | 08 | 09 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | ave |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Correlation | -0.03 | 0.66 | 0.45 | 0.44 | 0.94 | 0.52 | 0.55 | 0.88 | 0.60 | 0.41 | 0.82 | 0.58 | 0.90 | 0.68 | 0.06 | 0.59 | 0.60 |
| Regression | R-squared=0.361, F-statistic=80.22 | ||||||||||||||||
| Feature | HQ | TW | Feature | HQ | TW | ||||
|---|---|---|---|---|---|---|---|---|---|
| Score | P value | Score | P value | Score | P value | Score | P value | ||
| Focus distance | 2.94e4 | 0.00 | 1.63e4 | 0.00 | Stability | 1.85e4 | 0.00 | 1.99e4 | 0.00 |
| Gaze forward x | 1.53e2 | 1.04e-29 | 4.95e2 | 7.52e-33 | Gaze forward y | 1.14e3 | 4.25e-242 | 1.70e3 | 8.34e-285 |
| Gaze projected to left view y | 5.19e2 | 7.53e-108 | 7.65e2 | 5.79e-160 | Gaze projected to left view x | 5.45e1 | 1.90e-11 | 2.14e2 | 7.86e-42 |
| Gaze projected to right view y | 5.19e2 | 7.53e-108 | 7.65e2 | 5.79e-160 | Gaze projected to right view x | 6.79e1 | 3.96e-12 | 2.10e2 | 5.88e-41 |
| Gaze forward z | 5.92e2 | 1.20e-123 | 4.16e2 | 8.37e-85 | Left forward x | 2.34e2 | 5.69e-47 | 4.34e2 | 9.65e-89 |
| Left forward y | 1.48e3 | 0.00 | 1.48e3 | 0.00 | Left forward z | 1.54e2 | 7.38e-30 | 6.63e1 | 2.69e-11 |
| Left origin x | 5.20e-1 | 9.99e-1 | 3.99e-2 | 1.00e0 | Left pupil size | 1.10e3 | 8.18e-234 | 2.65e3 | 4.77e-280 |
| Left projected x | 8.85e-1 | 9.96e-1 | 1.98e0 | 9.82e-1 | Left projected y | 1.86e1 | 9.56e-3 | 1.52e1 | 5.58e-2 |
| Right forward x | 2.16e2 | 4.33e-43 | 6.10e2 | 1.40e-126 | Right forward y | 9.21e2 | 1.35e-194 | 8.37e2 | 2.12e-175 |
| Right forward z | 1.55e2 | 4.37e-30 | 6.53e1 | 4.13e-11 | Right origin x | 3.26e-2 | 1e0 | 5.62e-1 | 1.00e0 |
| Right pupil size | 1.25e3 | 1.15e-266 | 2.95e3 | 0.00 | Right projected x | 3.92e0 | 8.57e-1 | 4.27e0 | 8.32e-1 |
| Right projected y | 6.89e-1 | 9.98e-1 | 4.13e-1 | 9.89e-1 | Interpupillary distance | 2.84e4 | 0.00 | 7.06e4 | 0.00 |
| Left iris diameter | 3.11e4 | 0.00 | 1.52e4 | 0.00 | Left pupil diameter | 9.17e2 | 1.08e-193 | 2.49e3 | 0.00 |
| Left iris/pupil ratio | 4.04e2 | 4.09e-83 | 1.86e3 | 0.00 | Left eye openness | 1.75e3 | 4.20e-266 | 2.47e3 | 0.00 |
| Right iris diameter | 6.33e4 | 0.00 | 6.87e4 | 0.00 | Right pupil diameter | 9.26e2 | 1.18e-195 | 2.67e3 | 0.00 |
| Right iris/pupil ratio | 4.64e2 | 4.93e-96 | 1.10e3 | 5.06e-232 | Right eye openness | 1.67e3 | 7.30e-263 | 5.81e3 | 3.52e-120 |
| Gaze entropy | 3.95e3 | 0.00 | 5.63e3 | 0.00 | ACF values | 3.08e4 | 0.00 | 9.41e4 | 0.00 |
| Fixations | 6.94e3 | 0.00 | 2.21e4 | 0.00 | Saccadic distance | 3.78e2 | 1.17e-77 | 5.46e2 | 1.08e-112 |
| Model | Metrics | HQS | HQL | TWS | TWL | OTW | PHQL |
|---|---|---|---|---|---|---|---|
| Proposed LSTM | Accuracy | 0.94 | 0.97 | 0.93 | 0.98 | 0.94 | 0.90 |
| Recall | 0.94 | 0.97 | 0.93 | 0.97 | 0.93 | 0.90 | |
| Precision | 0.94 | 0.97 | 0.93 | 0.97 | 0.94 | 0.89 | |
| Convolutional Neural Network (CNN) [43] | Accuracy | 0.89 | 0.93 | 0.89 | 0.94 | 0.91 | 0.90 |
| Recall | 0.89 | 0.92 | 0.89 | 0.93 | 0.91 | 0.90 | |
| Precision | 0.90 | 0.92 | 0.89 | 0.94 | 0.91 | 0.89 | |
| Multi-Layer Perceptron (MLP) [44] | Accuracy | 0.83 | 0.89 | 0.81 | 0.91 | 0.86 | 0.87 |
| Recall | 0.84 | 0.88 | 0.81 | 0.91 | 0.86 | 0.86 | |
| Precision | 0.83 | 0.87 | 0.83 | 0.90 | 0.85 | 0.85 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).