ARTICLE | doi:10.20944/preprints201706.0115.v1
Subject: Mathematics & Computer Science, Information Technology & Data Management Keywords: big data; Hadoop; visualization; model
Online: 26 June 2017 (06:07:51 CEST)
In the era of ever-expanding data and knowledge, we lack a centralized system that maps faculty members to their research works. This problem has not been addressed in the past, and it is challenging for students to connect with the right faculty in their domain. Given the large number of colleges and faculty members, this falls into the category of big data problems. In this paper, we present a model that runs in a distributed computing environment to tackle big data. The proposed model uses Apache Spark as the execution engine and Apache Hive as the database. The results are visualized with Tableau, which connects to Apache Hive to achieve distributed computing.
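The core aggregation such a model computes, grouping each faculty member's publications under their name, can be sketched on a single machine as follows. The record layout, names, and titles are illustrative assumptions (the paper's actual Hive schema is not given); in the proposed model this grouping would run distributed on Spark over Hive tables:

```python
from collections import defaultdict

# Illustrative records: (faculty_name, domain, publication_title).
# This schema is an assumption for the sketch, not the paper's schema.
records = [
    ("Dr. Rao",   "big data",         "Scaling Hive Queries"),
    ("Dr. Rao",   "big data",         "Spark Join Strategies"),
    ("Dr. Mehta", "machine learning", "CNNs for Text"),
]

def map_to_faculty(record):
    """Map phase: emit a (faculty, publication) pair per record."""
    faculty, _domain, title = record
    return (faculty, title)

def reduce_by_faculty(pairs):
    """Reduce phase: collect publications under each faculty."""
    index = defaultdict(list)
    for faculty, title in pairs:
        index[faculty].append(title)
    return dict(index)

faculty_index = reduce_by_faculty(map(map_to_faculty, records))
print(faculty_index["Dr. Rao"])
```

The map/reduce split mirrors how a Spark job would express the same grouping (`map` followed by `groupByKey` or `reduceByKey`) across many executors.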
ARTICLE | doi:10.20944/preprints201904.0281.v1
Subject: Mathematics & Computer Science, Information Technology & Data Management Keywords: cluster computing; big data; Spark; Hadoop
Online: 25 April 2019 (11:22:27 CEST)
The article provides detailed information about the cluster-computing technologies Hadoop and Apache Spark. An experimental task, processing logistic regression with these technologies, is considered. Findings comparing the cluster-computing performance of Hadoop and Apache Spark are presented and substantiated.
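The benchmarked workload, logistic regression trained by iterative gradient descent, is the canonical Spark-versus-Hadoop comparison task because every iteration re-reads the full dataset. A minimal single-machine sketch, with a toy one-dimensional dataset and learning rate that are assumptions of this illustration:

```python
import math

# Toy 1-D dataset: points below 0.5 are class 0, above are class 1.
data = [(0.1, 0), (0.2, 0), (0.3, 0), (0.7, 1), (0.8, 1), (0.9, 1)]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

w, b = 0.0, 0.0
lr = 1.0
for _ in range(500):                    # each iteration scans the full
    gw = gb = 0.0                       # dataset; Spark caches it in memory,
    for x, y in data:                   # while Hadoop MapReduce re-reads it
        err = sigmoid(w * x + b) - y    # from disk every iteration
        gw += err * x
        gb += err
    w -= lr * gw / len(data)
    b -= lr * gb / len(data)

def predict(x):
    return 1 if sigmoid(w * x + b) >= 0.5 else 0

print(predict(0.2), predict(0.8))  # → 0 1
```

The comment in the loop states the usual explanation for Spark's advantage on this task: iterative algorithms reuse the same input, so in-memory caching removes the per-iteration disk I/O that dominates a MapReduce implementation.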
ARTICLE | doi:10.20944/preprints201810.0618.v1
Subject: Mathematics & Computer Science, Information Technology & Data Management Keywords: classification; machine learning; chaos-based cryptography; Hadoop; data clustering; biometrics
Online: 26 October 2018 (05:50:53 CEST)
Authentication systems based on biometric characteristics and data represent one of the most important trends in the evolution of our world. In the near future, biometric systems will be everywhere in society: government, education, smart cities, banks, etc. Due to the uniqueness of biometric data, such systems will also be vulnerable, with privacy being one of the most important challenges. Classic cryptographic primitives are not sufficient to ensure a strong level of privacy protection. This paper presents the main cryptographic techniques and algorithms that can raise the level of privacy protection, and discusses their strengths and weaknesses. We demonstrate how the most common and well-known techniques and algorithms can be used to achieve maximum efficiency and a high level of integrity for biometric data.
REVIEW | doi:10.20944/preprints202211.0161.v1
Subject: Mathematics & Computer Science, Information Technology & Data Management Keywords: High Performance Computing (HPC); big data; High Performance Data Analytics (HPDA); convergence; data locality; Spark; Hadoop; design patterns; process mapping; in-situ data analysis
Online: 9 November 2022 (01:38:34 CET)
Big data has revolutionised science and technology, leading to the transformation of our societies. High Performance Computing (HPC) provides the necessary computational power for big data analysis using artificial intelligence methods. Traditionally, HPC and big data have focused on different problem domains and have grown into two different ecosystems. Efforts have been underway for the last few years to bring the best of both paradigms into HPC and big data converged architectures. Designing HPC and big data converged systems is a hard task, requiring careful placement of data, analytics, and other computational tasks so that the desired performance is achieved with the least amount of resources. Energy efficiency has become the biggest hurdle in the realisation of HPC, big data, and converged systems capable of delivering exascale and beyond performance. Data locality is a key parameter of High Performance Data Analytics (HPDA) system design, as moving even a byte becomes increasingly costly in both time and energy as system size grows. Performance in terms of time and energy is the most important concern for users, particularly energy, since it is the major hurdle in high-performance system design and green energy systems are an increasing focus due to environmental sustainability. Data locality is a broad term that encapsulates different aspects, including bringing computations to data, minimising data movement through efficient exploitation of cache hierarchies, reducing intra- and inter-node communication, locality-aware process and thread mapping, and in-situ and in-transit data analysis. This paper provides an extensive review of the state of the art on data locality in HPC, big data, and converged systems. We review the literature on data locality in HPC, big data, and converged environments and discuss challenges, opportunities, and future directions.
Subsequently, using the knowledge gained from this extensive review, we propose a system architecture for future HPC and big data converged systems. To the best of our knowledge, there is no such review on data locality in converged HPC and big data systems.
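One of the data-locality aspects the review lists, exploiting cache hierarchies by matching traversal order to memory layout, can be illustrated with a toy example. The matrix size is an arbitrary assumption, and in pure Python the timing effect is muted compared with compiled code, but the difference in access pattern is the same one that matters at scale:

```python
N = 300
matrix = [[i * N + j for j in range(N)] for i in range(N)]

def sum_row_major(m):
    """Traverse each row contiguously: consecutive accesses touch
    neighbouring elements, the cache-friendly order."""
    total = 0
    for row in m:
        for value in row:
            total += value
    return total

def sum_column_major(m):
    """Jump to a different row on every access: the same arithmetic,
    but with poor spatial locality."""
    total = 0
    for j in range(N):
        for i in range(N):
            total += m[i][j]
    return total

# Both orders compute the same result; only the memory-access
# pattern (and hence cache behaviour in compiled code) differs.
print(sum_row_major(matrix) == sum_column_major(matrix))  # → True
```

This is the simplest instance of "bringing computations to data": the work is identical, and performance differences come entirely from where the next byte to be read is sitting.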