Submitted:
23 August 2024
Posted:
28 August 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction And Motivation
- We describe the theoretical background of OMS detection incorporating a generator of synthetic images.
- We devise a novel multi-input OMS detection pipeline that facilitates monitoring of any pre-trained classifier.
- We demonstrate the robustness of our method against distributional shifts and adversarial attacks.
- Through an ablation study, we evaluate the sensitivity of the OMS monitor to the various inputs.
- We show that our pipeline achieves SOTA results in the computer vision domain.
2. Related Work
2.1. Functional Safety Versus Computer Vision
2.2. OOD and OMS
2.2.1. Out of Distribution
2.2.2. Out of Model Scope
- Covariate shift: altering certain image aspects, such as brightness, to a degree that causes the classifier to fail.
- Semantic shift: introducing semantic content that has not been reviously encountered in our domain .
- Adversarial attack: incorporating malicious perturbations in the input data, imperceptible to human eyes.
2.3. Oms Metrics And Monitors
- feature-based
- probability-based
- logit-based
- feature and logit-based
2.3.1. Metrics
2.3.2. Monitors
2.3.3. Other Evaluation Metrics
3. Oms Monitoring Pipeline
- The input space cannot be infinite, as described in [14].
- The Generator should have high reconstruction capability for samples within the Encoder’s scope and lose this ability for samples outside the scope : to achieve this, during the generator’s training, we minimize the classification loss between the Encoder’s prediction on the original and generated images from the model scope.
- To be able to compare the contribution of our multi-input OMS pipeline, we investigate the performance fluctuation of the Monitor with the ablation study.
- To increase the robustness we use Dropout directly on the inputs provided to the OMS monitor, namely x,, and z.
| Algorithm 1 OMS Monitor during inference. |
|
4. Experiments
4.1. Ims And OMS Datasets
4.2. Training
4.2.1. Encoder
4.2.2. Generator
- generator’s architecture and size,
- dimension of the latent feature space z,
- class/feature-embedding method,
- combination of intermediate layers building the latent feature space z,
- implementation of the latent space wrapper L,
- and integration of a CCE between ground truth label y and the classified generated image.
4.2.3. Latent Space Wrapper
4.2.4. OMS Monitor
4.3. Evaluation of Encoder and Generator
4.4. Evaluation OMS Detection
4.5. Ablation Study Of The OMS Monitor
5. Conclusions and Future Work
- Adjustment of our OMS detection pipeline to other tasks in computer vision and other fields of machine learning, such as Natural language processing.
- Research of a possible approximation of the intermediate encoders’ layers and investigation of end-to-end training which we found based on the definition of IMS, currently not applicable.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
- Throughout our work we used publicly available datasets which are correctly cited.
- FashionMNIST: https://github.com/zalandoresearch/fashion-mnist
Acknowledgments
Conflicts of Interest
Abbreviations
| OOD | Out Of Distribution |
| OMS | Out of Model Scope |
| IMS | In Model Scope |
| SOTIF | Safety of the Intended Functionality |
| SOTA | State-Of-The-Art |
| SIl | Safety-Integrity-Level |
| HW | HardWare |
| CV | Computer Vision |
| MAHA | Mahalanobis distance |
| DNN | Deep Neural Network |
| MSP | Max Softmax Probability |
| ODIN | Out-of-Distribution detector for Neural Networks |
| AUROC | Area Under the Receiver Operating Characteristic |
| AUPR | Area Under the Precision-Recall curve |
| FP | False-Positive |
| FN | False-Negative |
| TN | True-Negative |
| TNR | True Negative Rate |
| TPR | True Positive Rate |
| GAN | Generative Adversarial Networks |
| FSGM | Fast Gradient Sign Method |
| PGD | Projected Gradient Method |
| UMAP | Uniform Manifold Approximation and Projection |
| CCE | Categorical Cross Entropy |
| CCLDM | Class Conditional Latent Diffusion Model |
| WGAN | Wasserstein-GAN |
| IS | Inception Score |
| FID | Frechet-Inception Distance |
Appendix A
Appendix A.1. Latent Space Wrapper

Appendix A.2. Performance of Generator
| ↑IS | ↓FID | ||
| avgPool | 7.87 | 29.22 | 90.78 |
| maxPool | 7.92 | 28.04 | 91.96 |
| learnable | 6.89 | 32.07 | 78.45 |
| Method / Hyper-parameter | Reference | Reasoning | |
| Encoders depth | Small | Table[4] | Deploying more complex ResNet models on the CIFAR-10 dataset didn’t improve the accuracy of the encoder, nor did it enhance the performance of the decoder. |
| Encoders FS regularization | Middle | Table[5] | Training with Dropout and Feature regularization influences FS’s compactness and consequently improves the Generators’ training performance. |
| Generators Architecture | High | Table[3] | Almost similar results were achieved by the bigGAN model and Stable Diffusion model with latent class embedding. Compared to bigGAN, the WGAN achieved lower performance by 21%. |
| FS dimension | Middle | Figure[6a] | Grid search method settled around the dimension of size 64. Smaller and higher feature space dimension results in lower . |
| FS layers combination | Middle | Figure[6b] | Combination of features from more than one and deeper layers results in higher . |
| Trainable Latent Wrapper | Small | Table[A1] | The best results were achieved by finetuning the latentwrapper with Max Pooling Layers. The combination with trainable Conv2d layer didn’t improve the . |
References
- Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research 2014, 15, 1929–1958. [Google Scholar]
- Der Kiureghian, A.; Ditlevsen, O. Aleatory or epistemic? Does it matter? Structural safety 2009, 31, 105–112. [Google Scholar] [CrossRef]
- Byerly, A.; Kalganova, T.; Dear, I. No routing needed between capsules. Neurocomputing 2021, 463, 545–553. [Google Scholar] [CrossRef]
- Rao, Q.; Frtunikj, J. Deep Learning for Self-Driving Cars: Chances and Challenges. In Proceedings of the 2018 IEEE/ACM 1st International Workshop on Software Engineering for AI in Autonomous Systems (SEFAIAS), 2018, pp. 35–38.
- Vaze, S.; Han, K.; Vedaldi, A.; Zisserman, A. Open-Set Recognition: a Good Closed-Set Classifier is All You Need? In Proceedings of the International Conference on Learning Representations, 2022.
- Guérin, J.; Delmas, K.; Ferreira, R.; Guiochet, J. Out-of-distribution detection is not all you need. In Proceedings of the Proceedings of the AAAI Conference on Artificial Intelligence, 2023, Vol. 37, pp. 14829–14837.
- ISO. Road vehicles - Safety of the intended functionality. Standard, International Organization for Standardization, 2019.
- ISO. Road vehicles – Functional safety, 2011.
- Lin, Z.; Roy, S.D.; Li, Y. Mood: Multi-level out-of-distribution detection. In Proceedings of the Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, 2021, pp. 15313–15323.
- Wang, H.; Li, Z.; Feng, L.; Zhang, W. ViM: Out-of-Distribution With Virtual-Logit Matching. In Proceedings of the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2022, pp. 4921–4930.
- Gall, H. Functional safety IEC 61508 / IEC 61511 the impact to certification and the user. In Proceedings of the AICCSA. IEEE Computer Society, 2008, pp. 1027–1031.
- Kendall, A.; Gal, Y. What uncertainties do we need in bayesian deep learning for computer vision? Advances in neural information processing systems 2017, 30. [Google Scholar]
- Hendrycks, D.; Gimpel, K. A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks. Proceedings of International Conference on Learning Representations 2017. [Google Scholar]
- Fang, Z.; Li, Y.; Lu, J.; Dong, J.; Han, B.; Liu, F. Is out-of-distribution detection learnable? Advances in Neural Information Processing Systems 2022, 35, 37199–37213. [Google Scholar]
- Yang, J.; Zhou, K.; Li, Y.; Liu, Z. Generalized Out-of-Distribution Detection: A Survey. CoRR 2021, abs/2110.11334, [2110.11334].
- Lee, K.; Lee, K.; Lee, H.; Shin, J. A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks. In Proceedings of the Advances in Neural Information Processing Systems; Bengio, S.; Wallach, H.; Larochelle, H.; Grauman, K.; Cesa-Bianchi, N.; Garnett, R., Eds. Curran Associates, Inc., 2018, Vol. 31.
- Yang, J.; Wang, P.; Zou, D.; Zhou, Z.; Ding, K.; PENG, W.; Wang, H.; Chen, G.; Li, B.; Sun, Y.; et al. OpenOOD: Benchmarking Generalized Out-of-Distribution Detection. In Proceedings of the Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2022.
- Aggarwal, C.C.; Yu, P.S. Outlier detection for high dimensional data. In Proceedings of the Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001, pp. 37–46.
- Hodge, V.; Austin, J. A survey of outlier detection methodologies. Artificial intelligence review 2004, 22, 85–126. [Google Scholar] [CrossRef]
- Chalapathy, R.; Chawla, S. DEEP LEARNING FOR ANOMALY DETECTION: ASURVEY 2019.
- Pang, G.; Shen, C.; Cao, L.; Hengel, A.V.D. Deep learning for anomaly detection: A review. ACM computing surveys (CSUR) 2021, 54, 1–38. [Google Scholar] [CrossRef]
- Chen, V.; Yoon, M.K.; Shao, Z. Task-aware novelty detection for visual-based deep learning in autonomous systems. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2020, pp. 11060–11066.
- Henzinger, T.A.; Lukina, A.; Schilling, C. Outside the Box: Abstraction-Based Monitoring of Neural Networks. In Proceedings of the ECAI; Giacomo, G.D.; Catalá, A.; Dilkina, B.; Milano, M.; Barro, S.; Bugarín, A.; Lang, J., Eds. IOS Press, 2020, Vol. 325, Frontiers in Artificial Intelligence and Applications, pp. 2433–2440. [CrossRef]
- Liu, W.; Wang, X.; Owens, J.; Li, Y. Energy-based out-of-distribution detection. Advances in neural information processing systems 2020, 33, 21464–21475. [Google Scholar]
- Mahalanobis, P.C. On the generalized distance in statistics. Sankhyā: The Indian Journal of Statistics, Series A (2008-) 2018, 80, S1–S7. [Google Scholar]
- Lee, K.; Lee, K.; Lee, H.; Shin, J. A simple unified framework for detecting out-of-distribution samples and adversarial attacks. Advances in neural information processing systems 2018, 31. [Google Scholar]
- Feinman, R.; Curtin, R.R.; Shintre, S.; Gardner, A.B. Detecting adversarial samples from artifacts. arXiv preprint arXiv:1703.00410 2017.
- Lee, K.; Lee, H.; Lee, K.; Shin, J. TRAINING CONFIDENCE-CALIBRATED CLASSIFIERS FOR DETECTING OUT-OF-DISTRIBUTION SAMPLES. stat 2017, 1050, 1. [Google Scholar]
- Liang, S.; Li, Y.; Srikant, R. Enhancing the reliability of out-of-distribution image detection in neural networks. In Proceedings of the 6th International Conference on Learning Representations, ICLR 2018, 2018.
- Hendrycks, D.; Basart, S.; Mazeika, M.; Mostajabi, M.; Steinhardt, J.; Song, D.X. Scaling Out-of-Distribution Detection for Real-World Settings. In Proceedings of the International Conference on Machine Learning, 2022.
- Sun, Y.; Guo, C.; Li, Y. React: Out-of-distribution detection with rectified activations. Advances in Neural Information Processing Systems 2021, 34, 144–157. [Google Scholar]
- Davis, J.; Goadrich, M. The relationship between Precision-Recall and ROC curves. In Proceedings of the Proceedings of the 23rd international conference on Machine learning, 2006, pp. 233–240.
- Chung, I.; Kim, D.; Kwak, N. Maximizing cosine similarity between spatial features for unsupervised domain adaptation in semantic segmentation. In Proceedings of the Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022, pp. 1351–1360.
- Heusel, M.; Ramsauer, H.; Unterthiner, T.; Nessler, B.; Hochreiter, S. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium. In Proceedings of the Advances in Neural Information Processing Systems; Guyon, I.; Luxburg, U.V.; Bengio, S.; Wallach, H.; Fergus, R.; Vishwanathan, S.; Garnett, R., Eds. Curran Associates, Inc., 2017, Vol. 30.
- Salimans, T.; Goodfellow, I.J.; Zaremba, W.; Cheung, V.; Radford, A.; Chen, X. Improved Techniques for Training GANs. CoRR 2016, abs/1606.03498, [1606.03498].
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial networks. Communications of the ACM 2020, 63, 139–144. [Google Scholar] [CrossRef]
- Ho, J.; Jain, A.; Abbeel, P. Denoising Diffusion Probabilistic Models. In Proceedings of the Advances in Neural Information Processing Systems; Larochelle, H.; Ranzato, M.; Hadsell, R.; Balcan, M.; Lin, H., Eds. Curran Associates, Inc., 2020, Vol. 33, pp. 6840–6851.
- Krizhevsky, A.; Hinton, G.; et al. Learning multiple layers of features from tiny images. University of Toronto 2009.
- LeCun, Y. The MNIST database of handwritten digits. http://yann. lecun. com/exdb/mnist/ 1998.
- Xiao, H.; Rasul, K.; Vollgraf, R. Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms. CoRR 2017, abs/1708.07747, [1708.07747].
- Clanuwat, T.; Bober-Irizar, M.; Kitamoto, A.; Lamb, A.; Yamamoto, K.; Ha, D. Deep learning for classical japanese literature. arXiv preprint arXiv:1812.01718 2018.
- Netzer, Y.; Wang, T.; Coates, A.; Bissacco, A.; Wu, B.; Ng, A.Y. Reading Digits in Natural Images with Unsupervised Feature Learning. In Proceedings of the NIPS Workshop on Deep Learning and Unsupervised Feature Learning 2011, 2011.
- Cimpoi, M.; Maji, S.; Kokkinos, I.; Mohamed, S.; .; Vedaldi, A. Describing Textures in the Wild. In Proceedings of the Proceedings of the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2014.
- Yuan, X.; He, P.; Zhu, Q.; Li, X. Adversarial examples: Attacks and defenses for deep learning. IEEE transactions on neural networks and learning systems 2019, 30, 2805–2824. [Google Scholar] [CrossRef] [PubMed]
- Goodfellow, I.; Shlens, J.; Szegedy, C. Explaining and Harnessing Adversarial Examples. In Proceedings of the International Conference on Learning Representations, 2015.
- Madry, A.; Makelov, A.; Schmidt, L.; Tsipras, D.; Vladu, A. Towards Deep Learning Models Resistant to Adversarial Attacks. In Proceedings of the International Conference on Learning Representations, 2018.
- Moosavi-Dez li, S.; Fawzi, A.; Frossard, P. DeepFool: A Simple and Accurate Method to Fool Deep Neural Networks. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Los Alamitos, CA, USA, jun 2016; pp. 2574–2582. [CrossRef]
- Croce, F.; Hein, M. Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks. In Proceedings of the Proceedings of the 37th International Conference on Machine Learning; III, H.D.; Singh, A., Eds. PMLR, 13–18 Jul 2020, Vol. 119, Proceedings of Machine Learning Research, pp. 2206–2216.
- Kim, H. Torchattacks: A pytorch repository for adversarial attacks. arXiv preprint arXiv:2010.01950 2020.
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 770–778. [CrossRef]
- Kingma, D.; Ba, J. Adam: A Method for Stochastic Optimization. In Proceedings of the International Conference on Learning Representations (ICLR), San Diega, CA, USA, 2015.
- Paszke, e.a. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In NEURIPS; Curran Associates, Inc., 2019; pp. 8024–8035.
- McInnes, L.; Healy, J.; Saul, N.; Großberger, L. UMAP: Uniform Manifold Approximation and Projection. Journal of Open Source Software 2018, 3, 861. [Google Scholar] [CrossRef]
- Kullback, S.; Leibler, R.A. On Information and Sufficiency. The Annals of Mathematical Statistics 1951, 22, 79–86. [Google Scholar] [CrossRef]
- Gulrajani, I.; Ahmed, F.; Arjovsky, M.; Dumoulin, V.; Courville, A.C. Improved Training of Wasserstein GANs. In Advances in Neural Information Processing Systems 30; Guyon, I.; Luxburg, U.V.; Bengio, S.; Wallach, H.; Fergus, R.; Vishwanathan, S.; Garnett, R., Eds.; Curran Associates, Inc., 2017; pp. 5767–5777.
- Radford, A.; Metz, L.; Chintala, S. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 2015.
- Zhang, H.; Goodfellow, I.; Metaxas, D.; Odena, A. Self-attention generative adversarial networks. In Proceedings of the 36th International Conference on Machine Learning, ICML 2019. International Machine Learning Society (IMLS), 2019, pp. 12744–12753.
- Brock, A.; Donahue, J.; Simonyan, K. Large Scale GAN Training for High Fidelity Natural Image Synthesis. In Proceedings of the International Conference on Learning Representations, 2018.
- Karras, T.; Laine, S.; Aila, T. A Style-Based Generator Architecture for Generative Adversarial Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 2021, 43, 4217–4228. [Google Scholar] [CrossRef] [PubMed]
- Miyato, T.; Kataoka, T.; Koyama, M.; Yoshida, Y. Spectral Normalization for Generative Adversarial Networks. In Proceedings of the International Conference on Learning Representations, 2018.
- Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18. Springer, 2015, pp. 234–241.
- Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal Loss for Dense Object Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 2020, 42, 318–327. [Google Scholar] [CrossRef] [PubMed]
- Salimans, T.; Goodfellow, I.; Zaremba, W.; Cheung, V.; Radford, A.; Chen, X. Improved Techniques for Training GANs, 2016, [arXiv:cs.LG/1606.03498].
- Provost, F.J.; Fawcett, T.; Kohavi, R. The Case against Accuracy Estimation for Comparing Induction Algorithms. In Proceedings of the Proceedings of the Fifteenth International Conference on Machine Learning, San Francisco, CA, USA, 1998; ICML ’98, p. 445–453.
- Hendrycks, D.; Basart, S.; Mazeika, M.; Mostajabi, M.; Steinhardt, J.; Song, D. A Benchmark for Anomaly Segmentation. CoRR 2019, abs/1911.11132, [1911.11132].
- Chan, R.; Rottmann, M.; Gottschalk, H. Entropy maximization and meta classification for out-of-distribution detection in semantic segmentation. In Proceedings of the Proceedings of the ieee/cvf international conference on computer vision, 2021, pp. 5128–5137.
- Sun, Y.; Li, Y. DICE: Leveraging Sparsification for Out-of-Distribution Detection. In Proceedings of the Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXIV, Berlin, Heidelberg, 2022; p. 691–708. [CrossRef]
- Ren, J.; Fort, S.; Liu, J.Z.; Roy, A.G.; Padhy, S.; Lakshminarayanan, B. A Simple Fix to Mahalanobis Distance for Improving Near-OOD Detection. CoRR 2021, abs/2106.09022, [2106.09022].
- Zhang, J.; Fu, Q.; Chen, X.; Du, L.; Li, Z.; Wang, G.; xiaoguang Liu.; Han, S.; Zhang, D. Out-of-Distribution Detection based on In-Distribution Data Patterns Memorization with Modern Hopfield Energy. In Proceedings of the The Eleventh International Conference on Learning Representations, 2023.






| IMS | OMS | ||
|---|---|---|---|
| Semantic | Covariate | Adversarial | |
| CIFAR-10 |
MNIST fashionMNIST kMNIST DTD SVHN |
blurring brightness rotation noise |
FGSM PGD DeepFool AutoAttack |
| ResNet | train. | val. | test | Dropout | FR |
| 18 | 98.94 | 88.64 | 87.01 | 88.69 | 74.06 |
| 34 | 99.48 | 89.88 | 88.18 | 88.99 | 73.51 |
| 50 | 98.89 | 89.19 | 87.73 | 87.76 | 72.12 |
| 101 | 98.35 | 89.10 | 86.71 | 77.72 | 70.09 |
| model G | ↑IS | ↓FID | Layers Combination | E-regularization | Learnable L | ||
| W-GAN | 5.71 | 38.21 | 70.50 | 64 | L1+L2+L3+L4 | with Dropout | False-Avg Pool |
| bigGAN | 7.92 | 28.04 | 91.96 | 64 | L1+L2+L3+L4 | with Dropout | False-Max Pool |
| CCLDM | 8.01 | 19.87 | 89.89 | 128 | L1+L2+L3+L4 | with Dropout | False-Max Pool |
| model E | WGAN | bigGAN | CCLDM |
| 18 | 69.14 | 90.15 | 79.60 |
| 34 | 70.50 | 91.96 | 78.69 |
| 50 | 65.12 | 85.25 | 73.50 |
| 101 | 63.87 | 86.74 | 68.28 |
| regularization | ↑IS | ↓FID | |
| no regularization | 6.57 | 30.01 | 90.61 |
| dropout | 7.92 | 28.04 | 91.96 |
| feature regularization | 5.84 | 35.85 | 86.38 |
| ↑Precision | ↑Recall | ↑F1-score | Support OMS + IMS | |
| 67.42 | 98.76 | 80.14 | 67014 + 4914 |
|
Training OMS datasets |
Testset balanced accuracy in [%] | |||
| covariate | adversarial | semantic | ++combined | |
| covariate | 98.95 | 93.04 | 97.99 | 97.93 |
| adversarial | 99.18 | 98.58 | 97.28 | 97.96 |
| semantic | 97.18 | 83.50 | 99.73 | 95.56 |
| combined | 99.21 | 92.25 | 97.76 | 99.52 |
| Deactivated branch | delta acc. [%] to no deactivation |
|---|---|
| Input image x | -25.20 |
| Feature space z | -4.32 |
| Generated image | -3.20 |
| + | -37.36 |
| + | -22.94 |
| + | -32.43 |
| No deactivation | 97.56 |
| Total Population | Predicted cls. | ||
| 71928 | Positive (IMS) | Negative (OMS) | |
| True cls. | Positive (IMS) | ↑ TP: IMS as IMS 4701 |
↓ FN: IMS as OMS 213 |
| Negative (OMS) | ↓ FP: OMS as IMS 2289 |
↑ TN: OMS as OMS 64725 |
|
| Monitor | ↑AUROC | ↑AUPR | ↓FPR95TPR |
| MSP [13] | 61.28 | 95.50 | 100 |
| ODIN [29] | 73.32 | 97.71 | 100 |
| Mahalanobis [26] | 93.74 | 99.52 | 39.50 |
| KLMatching [65] | 66.17 | 95.81 | 83.70 |
| MaxLogit [30] | 61.44 | 95.49 | 100 |
| EnergyBased [24] | 61.30 | 95.42 | 100 |
| Entropy [66] | 61.60 | 95.55 | 100 |
| DICE [67] | 61.60 | 95.50 | 100 |
| RMD [68] | 81.49 | 98.16 | 63.93 |
| ReAct [31] | 62.05 | 95.53 | 99.98 |
| ViM [10] | 74.22 | 97.80 | 99.80 |
| SHE [69] | 56.13 | 95.19 | 100 |
| ours | 99.46 | 93.94 | 3.42 |
Short Biography of Authors
![]() |
Václav Diviš Autonomous Driving Senior Expert with a proven track record in the automotive industry, namely ARRK Engineering GmbH and ZF Friedrichshafen A.G. Skilled in Automotive SPICE, Functional Safety, Software Development Process, Computer Vision, and Artificial Neural Networks. Currently working on his PhD thesis, focused on Automotive Engineering Technology at the University of West Bohemia, Pilsen, Czech Republic. |
![]() |
Bastian Spatz Machine Learning Engineer at ARRK Engineering GmbH with a M.Sc. in mathematics from the Technical University of Munich. His main research interests are computer vision, robotics, programming, and scripting. |
![]() |
Marek Hrúz received a Ph.D. degree in computer science from the Faculty of Applied Sciences, University of West Bohemia in Pilsen, Czech Republic, in 2012. He is currently an Assistant Professor at the Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen, and a Senior Researcher with the New Technologies for the Information Society. His main research interests are computer vision, machine learning, deep learning, signal processing, and multi-modal signal processing and their application in document analysis, traffic analysis, composite materials welds analysis, and sign language translation. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).


