Submitted:
21 May 2026
Posted:
22 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
- MQTT4SSN Ontology: Further development of the previous version, providing a comprehensive representation that covers all MQTT 5.0 control packets, including their complete contents and structure. Enrichment of the network infrastructure with session context.
- MQTT2RDF Semantic Integration and Analytics Framework: A modular framework that integrates heterogeneous MQTT message streams into RDF-based knowledge graphs in real time. The framework supports semantic querying, provenance-aware analytics, anomaly detection, and explainable analysis across distributed IoT environments.
- Ontology and Framework Evaluation: A thorough evaluation of the ontology based on its coverage of the MQTT 5.0 specification [3] and a use case in the industrial context, complemented by runtime analyses of the framework, evaluating semantic processing overhead and practical applicability.
2. Motivating Industrial Usage Scenario
- Message Content Analytics: First, the meaning of the transmitted payloads is not explicit. Transmitted data can only be correctly interpreted if the receiving system already knows the payload structure, the unit, the associated sensor or actuator, and the production context. If a new sensor vendor introduces, e.g., a different payload format, the integration logic must often be adapted manually.
- Semantic Topic Analytics Second, MQTT topics are typically defined according to local or vendor-specific naming conventions. For example, a topic may indicate a production line, a robot, a sensor, or a measured property, but MQTT itself does not define the semantics of the topic levels. As a result, systems cannot reliably discover which topics provide observations of a certain property or which topics carry actuation commands for a specific robot arm without additional application-specific knowledge.
- Transport and Provenance Analytics Third, quality problems in the production process are difficult to trace across distributed systems. If a robot grips a package incorrectly, several causes are possible: the vision sensor may be malfunctioning, the edge device may have calculated incorrect values, the actuation command may have been delayed, or a message may have been lost or malformed during transmission. Conventional MQTT-based infrastructures transport messages but do not provide an integrated semantic representation that connects product identifiers (e.g., RFID), sensor observations, actuation commands, topics, clients, brokers, and available transport metadata. Semantic end-to-end traceability can support debugging and accountability analysis, for example, through Message Transport Provenance [11,12].
3. Related Work
3.1. Related Data Models
3.2. Semantic Integration and Knowledge Graph Population
4. Methodology
4.1. Requirements Definition
4.2. Ontology Engineering Process
4.3. Semantic Integration and Analytics Framework Design
4.4. Publication and Reusability
5. MQTT4SSN Ontology
5.1. Core Concept
- Class and Property Specialization and Generalization: We define special classes and properties for a general class, respectively, to achieve more semantic depth. In the other direction, we have generalized some classes or properties for general queries or to cluster common semantic backgrounds.
- Class and Property Disjointness: We explicitly define class and property disjointness among subclasses and, respectively, among subproperties to enhance ontological clarity and enable more precise reasoning and querying over instances.
- Inverse Propery: Almost every object property received an inverse, allowing more targeted queries in both directions.
- Equivalences: The original MQT4SSN ontology integrated the concepts of MQV [18] through equivalent classes and properties with owl:equivalentClass and owl:equivalentProperty, respectively. Due to the draft’s incompleteness, the ontology is not discussed further in this work. However, the MQV ontology remains in MQTT4SSN for possible future developments.
- Semantic Annotation: To provide a clear and reusable ontology, we enriched all classes, properties, and the ontology itself with rdfs:label and rdfs:comment annotations. In addition, we annotated all MQTT-related datatype properties with skos:example based on the MQTT 5.0 Specification [3], helping with better understanding at the instantiation.
5.2. Ontology Description



- 3. Payload and Application Message: Figure 8 illustrates the ontology design for modeling MQTT payloads and their relation to the transferred application message.

- 4. Network Infrastructure: Figure 10 illustrates the ontology design for modeling the MQTT network infrastructure.

- 5. Topic Subject: Figure 11 illustrates the ontology design for modeling MQTT topics and their semantic alignment with SOSA.

6. MQTT2RDF Semantic Integration and Analytics Framework
- Containerized deployment environment (deployment layer)
- MQTT broker (edge ingestion layer)
- Node-RED processing flows (stream processing layer)
- RML mapping definitions and execution scripts (semantic transformation layer)
- MQTT4SSN ontology (semantic model layer)
- RDF triplestore (storage layer)
- Semantic Analytics Dashboard using SPARQL queries (analytics layer)
7. Evaluation
7.1. Dataset


7.2. Ontology Verification
| CQ | Question | Entities |
|---|---|---|
| CQ1.01g | What types of Control Packets exist? | |
| CQ1.01u | Which MQTT Control Packet has which Fixed Header / Variable Header / Payload? | |
| CQ2.01g | What types of Fixed Header Flags exists? | |
| CQ2.03u | Are RETAIN/DUP flags set in PUBLISH? | |
| CQ3.02g | How is the Application Message linked to SOSA? | |
| CQ3.04u | Who has sent which Application Messages? | |
| CQ4.02g | Which relations does the Topic have? | |
| CQ4.01u | Which SOSA sensors publish to which MQTT Topics? | |
| CQ5.01g | Which Network Entities does the Network Infrastructure have? | |
| CQ5.02u | Which MQTT Clients are connected to which Broker? |
7.3. Ontology Validation and Framework Evaluation
8. Conclusion and Future Work
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| CQ | Competency Question |
| ETL | Extract, Transform, Load |
| FAIR | Findable, Accessible, Interoperable, and Reusable |
| GDB | Graph Database |
| IoT | Internet of Things |
| MQTT | Message Queuing Telemetry Transport |
| M2M | machine-to-machine |
| ODP | Ontology Design Pattern |
| OOPS! | OntOlogy Pitfall Scanner! |
| QoS | Quality of Service |
| QUDT | Quantities, Units, Dimensions, and Data Types |
| RDB | Relational Database |
| SSN | Semantic Sensor Network |
| WOT | Web of Things |
References
- Eren, E. Edge Computing Applications in Industrial IoT: A Literature Review. In Proceedings of the Economics of Grids, Clouds, Systems, and Services (GECON 2022). Springer, 2022, Vol. 13430, Lecture Notes in Computer Science, pp. 153–167. [CrossRef]
- Piccialli, F.; et al. Edge Intelligence for Industrial IoT: Opportunities and Limitations. Procedia Computer Science 2024, 246, 1–10. [CrossRef]
- Banks, A.; Briggs, E.; Borgendale, K.; Gupta, R. MQTT Version 5.0. https://docs.oasis-open.org/mqtt/mqtt/v5.0/os/mqtt-v5.0-os.html, 2019. OASIS Standard. Latest version: https://docs.oasis-open.org/mqtt/mqtt/v5.0/mqtt-v5.0.html.
- Mohammed, A.; et al. Performance Evaluation Framework of MQTT Client Libraries for IoT Applications. Internet of Things 2024, 26, 100000. [CrossRef]
- Fysarakis, K.; et al. On the Security and Privacy of MQTT in Industrial Applications. In Proceedings of the Proceedings of the Information Security and Cryptography Conference (ISCC), 2017.
- Janowicz, K.; Haller, A.; Cox, S.J.D.; Le Phuoc, D.; Lefrançois, M. SOSA: A Lightweight Ontology for Sensors, Observations, Samples, and Actuators. Journal of Web Semantics 2019, 56, 1–10. [CrossRef]
- Haller, A.; Janowicz, K.; Cox, S.J.D.; Le Phuoc, D.; Taylor, K.; Lefrançois, M. Semantic Sensor Network Ontology. W3c recommendation, W3C and OGC, 2017. Accessed May 23, 2026.
- W3C Spatial Data on the Web Interest Group. SSN/SOSA Ontology Draft. https://w3c.github.io/sdw-sosa-ssn/ssn/, 2024. Draft version.
- Sandeep, M.; Chandavarkar, B.R.; Khatri, S. Heterogeneous Data Format Integration and Conversion (HDFIC) Using Ontology-Based Techniques in IoT. Evolving Systems 2024, 15, 375–396. [CrossRef]
- Doerner, N.; Maleshkova, M. MQTT4SSNOntology: MQTT4SSN v1.0: CEUR Workshop Proceedings release, 2025. [CrossRef]
- Markovic, M.; Corsar, D.; Asif, W.; Edwards, P.; Rajarajan, M. Towards Transparency of IoT Message Brokers. In Proceedings of the Provenance and Annotation of Data and Processes (IPAW 2018). Springer, 2018, Vol. 11017, Lecture Notes in Computer Science, pp. 200–203. [CrossRef]
- Asif, W.; Corsar, D.; Markovic, M.; Edwards, P.; Rajarajan, M. Enhancing Transparency of MQTT Brokers for IoT Applications Through Semantic Provenance. In Proceedings of the Proceedings of the Workshop on Semantic Web for IoT, 2021.
- Haller, A.; Janowicz, K.; Cox, S.J.D.; Le Phuoc, D.; Taylor, K.; Lefrançois, M. Semantic Sensor Network Ontology. https://www.w3.org/TR/vocab-ssn/, 2017. W3C Recommendation 19 October 2017.
- Taylor, K.; Haller, A.; Lefrançois, M.; Cox, S.J.D.; Janowicz, K.; Garcia-Castro, R.; Le-Phuoc, D.; Lieberman, J.; Atkinson, R.; Stadler, C. The Semantic Sensor Network Ontology, Revamped. In Proceedings of the Proceedings of the Journal Track co-located with the 18th International Semantic Web Conference (ISWC 2019). CEUR-WS.org, 2019, Vol. 2576, CEUR Workshop Proceedings, pp. 1–9.
- Cox, S.J.D.; Lefrançois, M.; Warren, R.; Atkinson, R.; Moreira de Sousa, L.; Schleidt, K.; Grellet, S.; Haller, A.; Janowicz, K.; Le Phuoc, D.; et al. Semantic Sensor Network Ontology – 2023 Edition. W3C First Public Working Draft WD-vocab-ssn-2023-20250916, World Wide Web Consortium, 2025. W3C First Public Working Draft, 16 September 2025.
- QUDT.org. Quantities, Units, Dimensions and Types (QUDT) Ontology Version 1.1. https://www.linkedmodel.org/doc/qudt/1.1/, 2011. Accessed May 23, 2026.
- QUDT.org. QUDT Ontologies Overview. https://qudt.org/pages/QUDToverviewPage.html, 2024. Overview of quantity kinds, units, dimensions, and data types in QUDT. Accessed May 23, 2026.
- W3C Web of Things. MQTT-to-RDF Ontology and Binding Templates. https://w3c.github.io/wot-binding-templates/bindings/protocols/mqtt/ontology.html, 2024.
- Doerner, N.; Maleshkova, M. MQTT4SSN: An Ontology for the MQTT Message Protocol. In Proceedings of the Joint Proceedings of the 16th Workshop on Ontology Design and Patterns (WOP 2025) and the 1st Workshop on Bridging Hybrid Intelligence and the Semantic Web (HAIBRIDGE 2025), co-located with the 24th International Semantic Web Conference (ISWC 2025); Novakazi, F.; Dalal, A.S., Eds., Nara, Japan, 2025; Vol. 4093, CEUR Workshop Proceedings, pp. 57–70. [CrossRef]
- Blomqvist, E.; Gangemi, A.; Presutti, V. Experiments on pattern-based ontology design. In Proceedings of the Proceedings of the Fifth International Conference on Knowledge Capture, New York, NY, USA, 2009; K-CAP ’09, p. 41–48. [CrossRef]
- Das, S.; Sundara, S.; Cyganiak, R. R2RML: RDB to RDF Mapping Language. W3c recommendation, World Wide Web Consortium (W3C), 2012.
- Meester, B.D.; Heyvaert, P.; Delva, T.; Dimou, A.; Sande, M.V. RDF Mapping Language (RML). Unofficial draft, RML.io / Knowledge Graph Construction community, 2024.
- Michel, F.; Djimenou, L.; Faron-Zucker, C.; Montagnat, J. xR2RML: Relational and Non-Relational Databases to RDF Mapping Language (Version 5). https://webusers.i3s.unice.fr/~fmichel/xr2rml_specification_v5.html, 2017. Specification v5, I3S Laboratory, Université Côte d’Azur, CNRS, Inria.
- Canim, M.; Cornelio, C.; Farrell, R.; Fokoue, A.; Gao, K.; Gunnels, J.; Iyengar, A.; Musa, R.; Rodriguez-Muro, M.; Uceda-Sosa, R. A knowledge and reasoning toolkit for cognitive applications. In Proceedings of the Proceedings of the fifth ACM/IEEE Workshop on Hot Topics in Web Systems and Technologies, San Jose California, 2017; pp. 1–10. [CrossRef]
- Xiao, G.; Ding, L.; Cogrel, B.; Calvanese, D. Virtual Knowledge Graphs: An Overview of Systems and Use Cases. Data Intelligence 2019, 1, 201–223. [CrossRef]
- Xiao, G.; Lanti, D.; Kontchakov, R.; Komla-Ebri, S.; Güzel-Kalaycı, E.; Ding, L.; Corman, J.; Cogrel, B.; Calvanese, D.; Botoeva, E. The Virtual Knowledge Graph System Ontop. In Proceedings of the The Semantic Web – ISWC 2020; Pan, J.Z.; Tamma, V.; d’Amato, C.; Janowicz, K.; Fu, B.; Polleres, A.; Seneviratne, O.; Kagal, L., Eds., Cham, 2020; pp. 259–277.
- Heyvaert, P.; Dimou, A.; Chaves-Fraga, D. RML Implementation Report. Unofficial draft, RML.io / Knowledge Graph Construction community, 2022.
- Heyvaert, P.; Van Assche, D.; De Meester, B.; Haesendonck, G.; de Vleeschauwer, E.; Oo, S.M. RMLMapper-JAVA. [CrossRef]
- Ivanovic, P.; Burbach, S.; Maleshkova, M. MontoFlow – A Maritime Ontology Framework for Modeling Ship Sensory Systems. In Proceedings of the The Semantic Web – ISWC 2025; Garijo, D.; Kirrane, S.; Salatino, A.; Shimizu, C.; Acosta, M.; Nuzzolese, A.G.; Ferrada, S.; Soulard, T.; Kozaki, K.; Takeda, H.; et al., Eds., Cham, 2026; pp. 276–294.
- Listl, F.G.; Fischer, J.; Sohr, A.; Dittler, D.; Jazdi, N.; Weyrich, M. Utilizing ISA-95 in an Industrial Knowledge Graph for Material Flow Simulation - Semantic Model Extensions and Efficient Data Integration. Procedia CIRP 2023, 120, 1558–1563. [CrossRef]
- Foundation, O.; Contributors. Node-RED.
- Scrocca, M.; Grassi, M.; Carenini, A.; Calbimonte, J.P.; Anicic, D.; Celino, I. A DataOps Toolbox Enabling Continuous Semantic Integration of Devices for Edge-Cloud AI Applications, 2025. arXiv:2508.02708 [cs]. [CrossRef]
- Bender, M.; Kirdan, E.; Pahl, M.O.; Carle, G. Open-Source MQTT Evaluation. In Proceedings of the 2021 IEEE 18th Annual Consumer Communications & Networking Conference (CCNC), Las Vegas, NV, USA, 2021; pp. 1–4. [CrossRef]
- Light, R.A. Mosquitto: server and client implementation of the MQTT protocol. Journal of Open Source Software 2017, 2, 265. [CrossRef]
- Inc., E.T. EMQX: The Unified MQTT Platform for AI & IoT Data Streaming — emqx.com. https://www.emqx.com/en. Accessed May 23, 2026.
- Aguiar, C.Z.; Souza, V.E.S. SABiOx: the extended systematic approach for building ontologies, 2024.
- Musen, M.A. The Protégé project: A look back and a look forward. AI Matters 2015, 1. [CrossRef]
- Poveda-Villalón, M.; Gómez-Pérez, A.; Suárez-Figueroa, M.C. OOPS! (OntOlogy Pitfall Scanner!): An On-line Tool for Ontology Evaluation. International Journal on Semantic Web and Information Systems (IJSWIS) 2014, 10, 7–34.
- Lohmann, S.; Link, V.; Marbach, E.; Negru, S. WebVOWL: Web-based Visualization of Ontologies. 04 2015, Vol. 8982, pp. 154–158. [CrossRef]
- Garijo, D. WIDOCO: a wizard for documenting ontologies. In Proceedings of the International Semantic Web Conference. Springer, Cham, 2017, pp. 94–102. [CrossRef]
- Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data 2016, 3. [CrossRef]
- Biron, P.V.; Malhotra, A. XML Schema Part 2: Datatypes Second Edition. https://www.w3.org/TR/xmlschema-2/, 2004. W3C Recommendation, 28 October 2004.
- Merkel, D. Docker: lightweight linux containers for consistent development and deployment. Linux journal 2014, 2014, 2.
- GraphDB Downloads and Resources — graphdb.ontotext.com. https://graphdb.ontotext.com/. Accessed May 23, 2026.
- Wireshark Foundation. Wireshark: Network Protocol Analyzer. https://www.wireshark.org/, 2024. Accessed: May 23, 2026.
- Wireshark Foundation. TShark: Command-Line Network Protocol Analyzer. https://www.wireshark.org/docs/man-pages/tshark.html, 2024. Accessed: May 23, 2026.
- Doerner, N.; Maleshkova, M. MQTT2RDF v0.1: Pre-publication release, 2025. [CrossRef]
| 1 | |
| 2 | |
| 3 | |
| 4 | |
| 5 | |
| 6 | |
| 7 | |
| 8 |










| Data () | Preproc. | Batching | Saving | Node Total | Mapping | Upload | Shell Total | All Total | |
|---|---|---|---|---|---|---|---|---|---|
| 0,00 | 221,00 | 3,00 | 224,00 | 2.035,00 | 63,00 | 2.130,00 | 2.354,00 | ||
| 3,00 | 240,00 | 3,00 | 246,00 | 2.853,00 | 55,00 | 2.962,00 | 3.208,00 | ||
| 5,00 | 255,00 | 2,00 | 262,00 | 2.402,00 | 80,00 | 2.510,00 | 2.772,00 | ||
| 0,90 | 221,40 | 3,30 | 225,60 | 2.132,00 | 55,90 | 2.212,50 | 2.438,10 | ||
| 2,30 | 224,80 | 3,50 | 230,60 | 2.345,30 | 85,60 | 2.456,00 | 2.686,60 | ||
| 6,70 | 234,20 | 2,50 | 243,40 | 2.401,60 | 98,20 | 2.522,60 | 2.766,00 | ||
| 1,10 | 224,51 | 3,61 | 229,22 | 2.464,79 | 72,86 | 2.565,65 | 2.794,87 | ||
| 1,62 | 220,53 | 3,03 | 225,18 | 2.184,91 | 70,76 | 3.301,29 | 3.526,47 | ||
| 6,22 | 217,62 | 2,90 | 226,74 | 2.903,64 | 256,07 | 4.928,84 | 5.155,58 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).