Submitted:
08 May 2024
Posted:
08 May 2024
Read the latest preprint version here
Abstract
Keywords:
1. Introduction
2. Background and Concepts
2.1. Human-Data Interaction (HDI)
- Data Accessibility: HDI prioritizes ease of access to data through user-friendly interfaces, supporting diverse user needs and contexts [11].
- Data Understanding: This involves presenting data in ways that are easy to understand and analyze, leveraging advanced visualization techniques to clarify complex information [12].
- Data Interaction: HDI focuses on interactive systems that allow users to manipulate and explore data to derive meaningful insights [13].
- Privacy and Ethics: Ensuring that data systems uphold ethical standards and protect user privacy is a cornerstone of HDI [14].
HDI’s Role in Various Sectors
- Healthcare: HDI integrates real-time data from wearables and sensors, enhancing diagnostic capabilities and enabling personalized medicine. Challenges include improving the design and usability of electronic health records (EHRs) [15].
- Business and Marketing: Companies utilize HDI to analyze consumer behavior, optimize marketing strategies, and enhance decision-making through data visualization and analytics [16].
- Education: Through learning analytics, HDI provides insights into student performance, supporting personalized learning and adaptive educational systems [17].
- Government: HDI-driven data insights help in policy making and service delivery, with smart cities using real-time data for urban planning and management [18].
- Finance: HDI underpins financial models that detect fraud and aid in decision-making processes, with increasing reliance on algorithmic trading and risk assessment tools [19].
- Everyday Life: From social media to navigation apps, HDI influences daily activities by tailoring digital interactions based on user data, significantly shaping our consumption patterns and daily routines [20].
2.2. Data Virtualization: Key Concepts
3. Enhancing HDI from Data Virtualization
4. Case Studies
5. Empowering Real-World Applications: In-Depth Use Cases of DV in Healthcare, Finance, and Education
5.1. Healthcare: Enhanced Patient Care through Integrated Data Systems
5.1.1. HDI Objectives and Study Hypotheses
- Data Legibility: Ensure that the integrated patient data is presented in an intuitive and understandable format for medical professionals, regardless of their technical skills. Visualization tools like patient health dashboards can display complex health data (like real-time vitals and historical health trends) in a clear and actionable manner.
- Data Accessibility: Assess if DV tools provide healthcare professionals with more accessible and comprehensive patient data.
- Data Agency: Evaluate whether DV tools empower healthcare professionals by enabling more control over data manipulation and interpretation. This enables them to view their own health data, contribute information, and make informed decisions about their treatment options.
- Data Negotiability: Determine if DV tools facilitate better negotiation between data sources and users, enhancing the transparency and customization of data interactions. This can be supported through digital consent tools that allow patients to manage who has access to their data and for what purpose.
5.1.2. Methodology
- Diagnostic Accuracy: Comparing the correctness of diagnoses made using DV and traditional systems.
- Speed of Diagnosis: Time taken to arrive at a diagnosis using both systems.
- User Satisfaction: Based on the ease of use, accessibility, and control over data.
5.1.3. Data Flow
- Data Collection: Patient data from various hospitals and clinics are collected. This includes historical health records, real-time monitoring through wearables, and results from recent medical tests.
- Data Integration: DV software consolidates data from these varied sources without needing to store it in a central repository, respecting the privacy and security regulations like HIPAA.
- Data Usage: Physicians access a comprehensive dashboard that provides a holistic view of a patient’s health status, enabling better diagnosis and personalized treatment plans.
5.1.4. Results
- Improved Data Accessibility: Professionals using DV had quicker access to comprehensive patient data, leading to faster and more accurate diagnoses.
- Enhanced Data Agency: DV tools allowed more direct manipulation of data, giving healthcare providers greater confidence in their diagnostic decisions.
- Effective Data Negotiability: DV facilitated a more flexible interaction with data sources, allowing for customized views and reports tailored to individual patient needs.
- Improved Diagnostic Accuracy: Access to comprehensive, real-time patient data helps in diagnosing diseases earlier and more accurately.
- Personalized Treatment: Insights derived from a unified data view allow for treatments tailored to individual patient needs and conditions.
- Efficient Care Delivery: Reduces the time doctors spend gathering information, allowing more time to focus on patient care.
5.1.5. Discussion
5.1.6. Conclusion
5.2. Finance: Fraud Detection and Risk Management
5.2.1. HDI Objectives and Study Hypotheses
- Data Legibility: Design user interfaces for fraud detection systems that allow financial analysts to easily navigate and interpret transaction data from various sources. Use graphical representations to highlight patterns and anomalies that may indicate fraudulent activities.
- Data Accessibility: Determine if DV tools enhance the accessibility of financial data across different platforms and systems for real-time fraud detection.
- Data Agency: Assess whether DV empowers analysts by providing more control over data queries and manipulations. rovide customers with tools to monitor their own transaction activities and report suspicious actions directly through banking apps. This increases user engagement and enhances the effectiveness of fraud detection systems.
- Data Negotiability: Explore if DV improves the ability of analysts to negotiate and customize how data is presented and analyzed for better fraud detection outcomes. Implement transparent policies regarding how customer data is used for fraud detection and risk assessments. Offer customers options to opt-in or opt-out of certain data collection practices, reinforcing trust and compliance
5.2.2. Methodology
- Fraud Detection Rates: Accuracy and number of fraud cases detected.
- Response Times: Time efficiency in detecting and responding to potential fraud.
- User Satisfaction: Surveyed based on the adaptability, efficiency, and user control of the data system.
5.2.3. Data Flow
- Data Aggregation: Transaction data from various bank branches and online banking services are integrated in real time.
- Anomaly Detection: Advanced analytics tools, running on top of the virtualized data layer, identify unusual patterns that suggest potential fraud.
5.2.4. Results
- Enhanced Data Accessibility: Analysts using DV systems accessed comprehensive transaction data more swiftly, enabling quicker responses to fraudulent activities.
- Increased Data Agency: DV tools provided analysts with enhanced capabilities to manipulate and analyze data on-the-fly, fostering proactive fraud detection strategies.
- Improved Data Negotiability: DV allowed for better customization of data views and analytic models, which tailored the fraud detection process to specific needs of the institution.
- Enhanced Fraud Detection: Real-time data access enables quicker response to fraudulent activities, reducing potential losses.
- Dynamic Risk Assessment: Continuously updated data allows for more accurate and timely risk assessments, improving the bank’s financial stability.
- Regulatory Compliance: Easier compliance with global financial regulations through centralized monitoring and reporting of transaction data.
5.2.5. Discussion
5.2.6. Conclusion
5.3. Education: Personalized Learning and Performance Tracking
5.3.1. HDI Objectives and Study Hypotheses
- Data Legibility: Ensure that data collected from various educational tools is integrated and presented back to both students and educators in an accessible format. Dashboards that show academic progress, areas of strength, and areas needing improvement can help make educational decisions more data-informed.
- Data Accessibility: Assess whether DV tools provide educators and students with easier access to comprehensive learning data.
- Data Agency: Determine if DV empowers educators to manipulate and apply data in ways that enhance personalized learning. Allow students to interact with their own performance data and set personal academic goals. Provide tools that let them customize their learning experience, such as choosing elective topics based on performance trends.
- Data Negotiability: Explore how DV enables educators and students to negotiate the terms of data usage, customizing data interactions to better meet individual learning needs. Create channels for students and educators to provide feedback on the data collection and analytics processes. This can help refine the data integration to better serve educational goals and adapt to the needs of diverse student populations.
5.3.2. Methodology
- Personalization Effectiveness: The ability of educators to tailor learning experiences based on integrated data insights.
- Learning Outcomes: Measured improvements in student performance and engagement.
- User Satisfaction: Feedback from educators and students on their experiences with the data systems.
5.3.3. Data Flow
- Data Integration: Collect and integrate data from classroom interactions, online quizzes, and feedback sessions.
- Personalized Feedback: Use data analysis to provide real-time feedback and personalized learning pathways for students.
- Performance Monitoring: Teachers and administrators use integrated data to monitor and analyze student performance over time.
5.3.4. Results
- Improved Data Accessibility: DV systems allowed for real-time access to a wide array of educational data, facilitating a quicker adaptation of learning strategies to student needs.
- Enhanced Data Agency: Educators using DV reported greater control over data, enabling them to design more effective, personalized instructional methods.
- Increased Data Negotiability: DV provided customizable interfaces and analytics, allowing both students and educators to interact with data in ways that best supported their unique learning and teaching styles.
- Tailored Learning Experiences: Students receive learning materials and tasks suited to their individual learning pace and style, enhancing engagement and understanding.
- Improved Educational Outcomes: Data-driven insights allow educators to intervene early with students who may need additional support, potentially increasing overall academic success.
- Efficient Resource Allocation: Insights from data help allocate educational resources more effectively, improving the learning environment.
5.3.5. Discussion
5.3.6. Conclusion
5.4. Overview and Benefits of the Three Studies
6. Toward an HDI Framework: Insights from Data Virtualization Use Cases
6.1. Rationale for an HDI Framework
- Enhancing Data Legibility: The framework would offer guidelines on designing intuitive visualizations and interfaces that make complex data comprehensible for all users, irrespective of their technical background. This is crucial in sectors like healthcare, where data comprehensibility can directly influence patient care outcomes.
- Empowering with Data Agency: Guidelines on enabling user interaction with data, allowing individuals to view, modify, and control their personal or professional data. In financial sectors, this can translate to tools that let users actively monitor and control their transaction data to prevent fraud.
- Facilitating Data Negotiability: Recommendations for fostering transparent communication between data users and providers. This involves creating mechanisms for users to negotiate what data is collected, how it is used, and under what circumstances, particularly in educational contexts where data privacy is paramount.
6.2. Implications for Implementation
7. Research Challenges
8. Conclusions and Future Works
Author Contributions
Funding
Conflicts of Interest
References
- Mortier, R.; Haddadi, H.; Henderson, T.; McAuley, D.; Crowcroft, J. Human-Data Interaction: The Human Face of the Data-Driven Society. SSRN Electronic Journal 2014. [Google Scholar] [CrossRef]
- Stonebraker, M.; Cetintemel, U. "One size fits all": an idea whose time has come and gone. 21st International Conference on Data Engineering (ICDE’05), 2005, pp. 2–11. [CrossRef]
- Gandomi, A.; Haider, M. Beyond the hype: Big data concepts, methods, and analytics. International Journal of Information Management 2015, 35, 137–144. [Google Scholar] [CrossRef]
- Tene, O.; Polonetsky, J. Big Data for All: Privacy and User Control in the Age of Analytics. Northwestern Journal of Technology and Intellectual Property 2012, 11, 239. [Google Scholar]
- Mittelstadt, B.; Floridi, L. The Ethics of Big Data: Current and Foreseeable Issues in Biomedical Contexts. Science and engineering ethics 2015. [Google Scholar] [CrossRef]
- Nissenbaum, H. Privacy as contextual integrity. Wash. L. Rev. 2004, 79, 119. [Google Scholar]
- Bowen, G.M.; Bartley, A. Helping students make sense of the “real world” data mess. Science Activities 2020, 57, 143–153. [Google Scholar] [CrossRef]
- Burrell, J. How the machine ‘thinks’: Understanding opacity in machine learning algorithms. Big Data & Society 2016, 3, 2053951715622512. [Google Scholar] [CrossRef]
- Andrejevic, M. Big Data, Big Questions| The Big Data Divide. International Journal of Communication 2014, 8. [Google Scholar]
- Pasquale, F. The black box society: The secret algorithms that control money and information; Harvard University Press, 2015.
- Kun, P.; Mulder, I.; Kortuem, G. Developing a Design Inquiry Method for Data Exploration. Interaction Design and Architecture(s). [CrossRef]
- Few, S. Now You See It: Simple Visualization Techniques for Quantitative Analysis, 1st ed.; Analytics Press: Oakland, CA, USA, 2009. [Google Scholar]
- Heer, J.; Shneiderman, B. Interactive dynamics for visual analysis. Commun. ACM 2012, 55, 45–54. [Google Scholar] [CrossRef]
- Shilton, K. Values Levers: Building Ethics Into Design. Science, Technology, & Human Values 2013, 38, 374–397. [Google Scholar] [CrossRef]
- Pine, M.; Sonneborn, M.; Schindler, J.; Stanek, M.; Maeda, J.L. Harnessing the power of enhanced data for healthcare quality improvement: lessons from a Minnesota Hospital Association Pilot Project. Journal of Healthcare Management 2012, 57, 406–418. [Google Scholar] [CrossRef] [PubMed]
- Chen, H.; Chiang, R.H.L.; Storey, V.C. Business Intelligence and Analytics: From Big Data to Big Impact. MIS Q. 2012, 36, 1165–1188. [Google Scholar] [CrossRef]
- Siemens, G.; Baker, R. Learning analytics and educational data mining: Towards communication and collaboration. ACM International Conference Proceeding Series 2012. [Google Scholar] [CrossRef]
- Batty, M.; Axhausen, K.; Giannotti, F.; Pozdnoukhov, A.; Bazzani, A.; Wachowicz, M.; Ouzounis, G.; Portugali, Y. Smart cities of the future. The European Physical Journal Special Topics 2012, 214, 481–518. [Google Scholar] [CrossRef]
- MacKenzie, D. ‘Making’, ‘taking’ and the material political economy of algorithmic trading. Economy and Society 2018, 47, 501–523. [Google Scholar] [CrossRef]
- Resnick, P.; Varian, H.R. Recommender systems. Commun. ACM 1997, 40, 56–58. [Google Scholar] [CrossRef]
- Davis, J.R.; Eve, R. Data virtualization: going beyond traditional data integration to achieve business agility; Nine Five One Press, 2011.
- Brynjolfsson, E.; Hitt, L.; Kim, H. Strength in Numbers: How Does Data-Driven Decisionmaking Affect Firm Performance? SSRN Electronic Journal 2011, 1. [Google Scholar] [CrossRef]
- Ferguson, M. Data Virtualization–Flexible Technology for the Agile Enterprise. Intelligent Business Strategies www. sas. com 2014. [Google Scholar]
- Shilton, K. Values and Ethics in Human-Computer Interaction. Foundations and Trends® in Human-Computer Interaction 2018, 12, 107–171. [Google Scholar] [CrossRef]
- Bellahsene, Z.; Benbernou, S.; Jaudoin, H.; Pinet, F.; Pivert, O.; Toumani, F. A flexible data integration system based on data semantics. ACM Sigmod Record 2010, 39, 11–18. [Google Scholar] [CrossRef]
- Catarci, T.; Santucci, G.; Da Silva, S.L. An Interactive Visual Exploration of Medical data for Evaluating Health Centres. Journal of Research and Practice in Information Technology 2003, 35, 99–119. [Google Scholar]
- Castano, S.; De Antonellis, V.; De Capitani di Vimercati, S. Global Viewing of Heterogeneous Data Sources. IEEE Trans. on Knowl. and Data Eng. 2001, 13, 277–297. [Google Scholar] [CrossRef]
- Raghupathi, W.; Raghupathi, V. Big data analytics in healthcare: Promise and potential. Health Information Science and Systems 2014, 2, 3. [Google Scholar] [CrossRef] [PubMed]
- Hai, R.; Quix, C.; Zhou, C. Query rewriting for heterogeneous data lakes. Advances in Databases and Information Systems: 22nd European Conference, ADBIS 2018, Budapest, Hungary, September 2–5, 2018, Proceedings 22. Springer, 2018, pp. 35–49.
- Cappiello, C.; Matera, M.; Picozzi, M. A UI-Centric Approach for the End-User Development of Multidevice Mashups. ACM Trans. Web 2015, 9. [Google Scholar] [CrossRef]
- Börner, K.; Bueckle, A.; Ginda, M. Data visualization literacy: Definitions, conceptual frameworks, exercises, and assessments. Proceedings of the National Academy of Sciences 2019, 116, 1857–1864. [Google Scholar] [CrossRef]
- Burger, M.; Smith, K.T.; Smith, L.M.; Wood, J. An Examination of Fraud Risk at Oil and Gas Companies. Journal of Forensic and Investigative Accounting 2022, 14, 74–85. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).