Submitted:
15 April 2024
Posted:
16 April 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Technical Background and Literature Overview
2.1. Technical Background
2.2. Related Work
3. Methodology
4. Qualitative Evaluation
- Traditional DL without the proposed metadata enrichment mechanism.
- DL with a semantic metadata enrichment mechanism [15]
5. Experimental Assessment
5.1. Design of Experiments
5.2. Experimental Results
6. Discussion and Conclusions
- • Granular Insights: A DM enables individual teams to own their data, allowing them to use software analytics methods unique to their software systems. This strategy offers fine-grained insights into usage patterns, team-specific utilization performance, and other pertinent indicators.
- • Contextual Awareness: By utilizing DM principles, teams increase their awareness of the context of the data they produce and how it relates to their software processes and products. This context gives a greater understanding of the variables affecting software performance and behavior, which improves the usefulness of software analytics.
- • Rapid Iteration and Improvement: A DM provides teams with control and autonomy over their data, allowing them to iterate and enhance software products and procedures quickly using the knowledge gleaned from software analytics. Continuous improvement and agility are fostered by this iterative methodology.
References
- >Rodríguez-Mazahua, L.; Rodriguez-Enriquez; C.A., Sánchez-Cervantes J.L.; Cervantes J.; García-Alcaraz, J.L.; Alor-Hernández, G.: A general perspective of Big Data: applications, tools, challenges, and trends. The Journal of Supercomputing 2016, 72, 3073–3113. [CrossRef]
- Chen, M.; Mao, S.; Liu, Y. : Big data: A survey. Mobile networks and applications 2014, 19, 171–209. [Google Scholar] [CrossRef]
- Mehboob, T.; Ahmed, I. A.; Afzal, A. : Big Data Issues, Challenges and Techniques: A Survey. Pakistan Journal of Engineering and Technology 2022, 5(2), 216–220. [Google Scholar] [CrossRef]
- Gandomi, A.; Haider, M. : Beyond the hype: Big Data Concepts, methods, and analytics. International Journal of Information Management 2015, 35, 137–144. [Google Scholar] [CrossRef]
- Howarth, J. : 30+ incredible big data statistics (2023). Available online: https://explodingtopics.com/blog/big-data-stats (accessed on 10 February 2024).
- Kościelniak, H.; Puto, A. : Big data in decision making processes of enterprises. Procedia Computer Science 2015, 2015 65, 1052–1058. [Google Scholar] [CrossRef]
- Santos, M. Y.; & Costa, C. ; & Costa, C. River Publishers 2020, 9–36. [Google Scholar]
- Luckow, A., Kennedy, K., Manhardt, F., Djerekarov, E., Vorster, B., Apon, A.: Automotive Big Data: Applications, workloads and infrastructures. IEEE International Conference on Big Data (2015). [CrossRef]
- Kim, Y.; You, E.; Kang, M; , Choi, J. : Does big data matter to value creation? : Based on oracle solution case. Journal of the Korea society of IT services 2012, 11, 39–48. [Google Scholar] [CrossRef]
- Herschel, R.; Miori, V.M. : Ethics & Big Data. Technology in Society 2017, 2017 49, 31–36. [Google Scholar] [CrossRef]
- Blazquez, D.; Domenech, J. : Big data sources and methods for social and economic analyses. Technological Forecasting and Social Change 2018, 130, 99–113. [Google Scholar] [CrossRef]
- Machado, I.A. , Costa, C., Santos, M.Y.: Data Mesh: Concepts and principles of a paradigm shift in data architectures. Procedia Computer Science. 196, 263–271 (2022). [CrossRef]
- Eichler, R. , Giebler, C., Gröger, C., Hoos, E., Schwarz, H., Mitschang, B.: Enterprise-wide metadata management. Business Information Systems. 269–279 (2021). [CrossRef]
- Dehghani, Z. : Data Mesh: Delivering data-driven value at scale. Available online: https://www.thoughtworks.com/insights/books/data-mesh (accessed on 12 February 2024).
- Pingos, M.; Andreou, A. A Data Lake Metadata Enrichment Mechanism via Semantic Blueprints. In 17th International Conference on Evaluation of Novel Approaches to Software Engineering, 2022, pp. 186–196. [CrossRef]
- Xin, R. S.; Rosen, J.; Zaharia, M.; Franklin, M. J.; Shenker, S.; Stoica, I. Shark: SQL and rich analytics at scale. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of data 13-24. [Google Scholar] [CrossRef]
- Leclerc, B. What is a Data Lake? Available online: https://aws.amazon.com/big-data/datalakes-and-analytics/what-is-a-data-lake/, (accessed on 12 March 2024).
- Khine, P.P.; Wang, Z.S. : Data Lake: A new ideology in Big Data Era. ITM Web of Conferences 2018, 17, 03025. [Google Scholar] [CrossRef]
- Sawadogo, P. , Darmont, J. : On Data Lake Architectures and metadata management. Journal of Intelligent Information Systems 2020, 56, 97–120. [Google Scholar] [CrossRef]
- Dehghani, Z. : How to move beyond a Monolithic Data Lake to a distributed data mesh. https://martinfowler.com/articles/data-monolith-to-mesh.html.
- Dehghani, Z. Data Mesh Principles and logical architecture. Available online: https://martinfowler.com/articles/data-mesh-principles.html (accessed on 15 March 2024).
- Dehghani,Z. Available online: https://www.oreilly.com/library/view/datmesh/9781492092384, (accessed on 23 March 2024).
- Papazoglou, M.P. , Elgammal, A.: The Manufacturing Blueprint Environment: Bringing Intelligence into manufacturing. International Conference on Engineering, Technology and Innovation 2017 (ICE/ITMC). [CrossRef]
- Pingos, M.; Christodoulou, P.; Andreou, A. DLMetaChain: An IoT Data Lake Architecture Based on the Blockchain. In 13th International Conference on Information, Intelligence, Systems & Applications (IISA), July 2022, pp. 1–8. IEEE. [CrossRef]
- Pingos, M.; Andreou, A.S. 2022. A smart manufacturing data lake metadata framework for process mining. ICSEA 2022, 11. [Google Scholar] [CrossRef]
- Pingos, M.; Andreou, A.S. Exploiting Metadata Semantics in Data Lakes Using Blueprints. In International Conference on Evaluation of Novel Approaches to Software Engineering. Publisher: Springer Nature Switzerland, 2022, pp. 220–242. [CrossRef]
- Jung, C. : From Data Lakes to data mesh: A guide to the latest Enterprise Data Architecture. Available online: https://towardsdatascience.com/from-data-lakes-to-data-mesh-a-guide-to-the-latest-enterprise-data-architecture-d7a266a3bc33 (accessed on 28 March 2024).







| Characteristic | Low | Medium | High |
| Data Domain readiness and alignment | 4-5 actions | 2-3 actions | 1 action maximum |
| Granularity | 1 level | 2 levels | 3 or more levels |
| Decentralization | none or limited | normal | unlimited |
| Agility | none or limited | normal | unlimited |
| Approach | Data Domain Readiness and Alignment | Granularity | Decentrali-zation | Agility |
| Traditional DL without the proposed metadata enrichment mechanism | Low | Low | Low | Low |
| DL with the proposed metadata enrichment mechanism [15] | Medium | High | Medium | High |
| DM proposed architecture | High | High | High | High |
| Structure Architecture with Levels |
Number of Sources | Creation Time (s) |
|---|---|---|
| DL with Ponds & Puddles / Data Mesh Level 2 |
100 | 0.0229 |
| 10000 | 3.9879 | |
| 100000 | 29.9931 | |
| Data Mesh Level 3 | 100 | 0.0673 |
| 10000 | 7.0856 | |
| 100000 | 72.3088 | |
| Data Mesh Level 4 | 100 | 0.1075 |
| 10000 | 6.6337 | |
| 100000 | 70.0903 | |
| Data Mesh Level 5 | 100 | 0.0906 |
| 10000 | 9.7921 | |
| 100000 | 93.5849 | |
| Data Mesh Level 7 | 100 | 0.2379 |
| 10000 | 15.8697 | |
| 100000 | 166.9752 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).