Submitted:
18 April 2024
Posted:
22 April 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
Benefits of Distributed Database Management System DDBMS
2. Literature Review

3. Transparency Levels in DDBMS
3.1. Data Independence
-
Logical Data IndependenceConsider a situation in which a biomedical research database needs to be updated to include new genomic biomarkers that are becoming useful for personalized medicine approaches.The database structure has been changed to integrate the additional genetic biomarkers as new characteristics. Logical data independence ensures that existing programs used by researchers and clinicians to query patient genetic information or track treatment outcomes do not need to be upgraded or made aware of schema changes unless they are specifically modified to accommodate the new data. This ensures that the clinical decision support systems will continue to work properly even if the database schema evolves to include increasingly detailed genetic data.
-
Physical Data IndependenceDue to the growing number of complicated genomic data, database managers decide to move the data storage to a more powerful database system designed for large-scale genetic sequences and structured clinical data.Physical data independence enables the IT team to complete the migration and optimization of data storage without disrupting the frontend apps used by researchers and physicians. This modification comprises the implementation of new data compression algorithms and the use of more efficient indexing systems to accelerate data retrieval. Applications continue to execute data queries without modification, and users do not perceive any service disruptions or changes in how they interact with the system.
3.2. Network Transparency
-
Location TransparencyThe cloud-based document storage solution enables users to save, view, and manage documents without knowing their physical location. Users interact with the system using a simple interface, which allows them to upload and retrieve documents. Underneath the scenes, the service may distribute and store documents in several data centers around the world, increasing accessibility and redundancy. However, the user can access these documents as if they were all stored on their local device, demonstrating location transparency.
-
Naming TransparencyIn the same cloud service, each document is assigned a unique identification that does not correspond to its physical storage location or the server on which it is hosted. This identity enables users to view their files from any internet-connected device, regardless of where the document is really stored in the backend systems. This configuration ensures that even if the service provider reorganizes the data across its servers or relocates it to multiple data centers to improve load balancing and performance, the user experience stays consistent and unaffected.
3.3. Mobility (Migration) Transparency
3.4. Replication Transparency
-
Push ReplicationAfter a modification is made to the data, the originating DP node notifies the replica nodes of the modifications to ensure that the data is immediately updated. This type of replication prioritizes data consistency. However, due to the latency required to assure data consistency across all nodes, data availability is diminished.
-
Pull ReplicationAfter updating data, the originating DP node notifies the replica nodes via "messages" that the update has occurred. The replica nodes are responsible for determining when updates are applied to their local fragment. This replication method delays the propagation of data modifications to replicas. Priority is given to guaranteeing data availability. Nevertheless, this form of replication permits momentary inconsistencies in the data. Replication offers numerous benefits, including enhanced data availability, increased tolerance for data failures, improved load distribution, and reduced query costs. However, it also contributes to the processing overhead of DDBMSes as the system is required to maintain each data clone.
3.5. Concurrency Transparency
3.6. Failure Transparency
3.7. Fragmentation Transparency
3.8. Performance Transparency
3.9. Scaling Transparency
Who is Responsible for Providing Transparency?
4. Summary
5. Future Advice
- DDBMS that adjusts itself: In order to handle shifting workloads and urgent data needs, researchers have concentrated on developing database management systems (DBMS) that dynamically adapt their performance and resource allocation.
- Distributed Environment Security: DDBMS security measures have been reinforced, particularly to fend off the recent wave of cyberattacks and adhere to stringent data protection regulations.
- Hyper-Transactional/Analytical Processing (HTAP): There is continuous work to create adaptable systems that can handle transactional and analytical tasks with ease, guaranteeing efficacy and efficiency.
- DDBMS Integration with IoT: The project has looked at several architectures and techniques for integrating DDBMS with the Internet of Things (IoT) in order to manage the massive volume of data coming in from many devices.
- Localization and Data Sovereignty: The difficult legal and technological issues involved in managing data internationally and satisfying the data residency requirements of global DDBMS installations are addressed in this field of research.
Conclusion
References
- Noah Webster. Merriam-Webster’s Collegiate Dictionary: Eleventh Edition. Logos Bible Software. Merriam-Webster, Incorporated, 2004. ISBN 9780877798095. URL https://books.google.com.tr/books?id=TAnheeIPcAEC.
- S.K. Rahimi and F.S. Haug. Distributed Database Management Systems: A Practical Approach. Wiley, 2010. ISBN 978-0-470-60236-2. URL https://books.google.com.tr/books?id=0TqLzJBGImcC.
- A.S. Tanenbaum and M. van Steen. Distributed Systems. CreateSpace Independent Publishing Platform, 2017. ISBN 978-1-5430-5738-6. URL https://books.google.com.tr/books?id=c77GAQAACAAJ.
- Okungbowa Aihanuwa Augusta Seji Happy Alfred and Lambo Kenibanemi Egi. Impact of distributed database management system to individuals, institutions and organizations all over the world. BW Academic Journal, 1(1):10, 2022. URL https://www.bwjournal.org/index.php/bsjournal/article/view/758.
- Ramez Elmasri and Sham Navathe. Fundamentals of database systems. Pearson, seventh edition edition, 2016. ISBN 0-13-397077-9 978-0-13-397077-7 978-1-292-09761-9 1-292-09761-2.
- Okardi Biobele and Asagba O. Prince. Overview of distributed database system. International Journal of Computer Techniques, 2021. ISSN 2394-2231. URL http://www.ijctjournal.org.
- Philip A. Bernstein and Eric Newcomer. Principles of Transaction Processing (Second Edition). The Morgan Kaufmann Series in Data Management Systems. Morgan Kaufmann, San Francisco, second edition edition, 2009. ISBN 978-1-55860-623-4. doi: https://doi.org/10.1016/B978-1-55860-623-4.00013-5. URL https://www.sciencedirect.com/science/article/pii/B9781558606234000135.
- M. Tamer Özsu and Patrick Valduriez. Principles of Distributed Database Systems. Springer, 4th edition, 2020.
- Edgar F. Codd. A relational model of data for large shared data banks. Communications of the ACM, 13(6):377–387, 1970. [CrossRef]
- Stefano Ceri and Giuseppe Pelagatti. Distributed Databases: Principles and Systems. McGraw-Hill, Inc., 1984.
- Andrew S. Tanenbaum and Maarten Van Steen. Distributed Systems: Principles and Paradigms. Prentice Hall, 1994.
- Theo Härder, Erhard Rahm, and Harald Schöning. Data management in the cloud: Challenges and opportunities. it - Information Technology, 53(4):181–185, 2011.
- Alejandro Corbellini, Cristian Mateos, Alejandro Zunino, Daniela Godoy, and Silvia Schiaffino. Handling big data: Research challenges and future directions. Journal of Computer Science & Technology, 18(2):e33, 2018. [CrossRef]
- Ira J. Kalet. List of figures. In Principles of Biomedical Informatics (Second Edition). Academic Press, San Diego, second edition edition, 2014. ISBN 978-0-12-416019-4. doi: https://doi.org/10.1016/B978-0-12-416019-4.00018-4. URL https://www.sciencedirect.com/science/article/pii/B9780124160194000184.
- George Coulouris, Jean Dollimore, and Tim Kindberg. Distributed Systems: Concepts and Design. Addison-Wesley, 5th edition, 2011.
- Sudheer R Mantena. Transparency in distributed systems. The University of Texas at Arlington, 2019.
- Vera Goebel. Distributed database systems. Department Informatics, University of Oslo, Oslo, 2011.
- Ajay D. Kshemkalyani and Mukesh Singhal. Distributed Computing: Principles, Algorithms, and Systems. Cambridge University Press, 2011.





| Trend | Expected Impact on DDBMS |
|---|---|
| Integration with AI | Improved automated data curation and management to enhance data analytics capabilities, resulting in more intelligent along with self-managing systems. |
| Quantum Computing | There is a potential revolution in data processing speeds that could have a significant impact on the way DDBMS deal with large-scale computations as well as intricate operations. |
| Edge Computing | With the increasing decentralization of data processing to edge devices, there is a growing need for DDBMS to prioritize latency-sensitive as well location-aware data management. |
| Blockchain Technology | With a focus on security and data integrity, we provide reliable and transparent solutions for storing data and conducting transactions. |
| Sustainability Focus | Implementing green computing methods in DDBMS operations to reduce energy consumption alongside with carbon footprint. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).