Version 1
: Received: 24 April 2019 / Approved: 25 April 2019 / Online: 25 April 2019 (11:22:27 CEST)
How to cite:
Elagin, V.; Karpov, V.; Kravchenko, A.; Goldstein, A.; Vladyko, A. Choice of Cluster Computing System Hadoop and Apache Spark for Network Systems. Preprints2019, 2019040281 (doi: 10.20944/preprints201904.0281.v1).
Elagin, V.; Karpov, V.; Kravchenko, A.; Goldstein, A.; Vladyko, A. Choice of Cluster Computing System Hadoop and Apache Spark for Network Systems. Preprints 2019, 2019040281 (doi: 10.20944/preprints201904.0281.v1).
Cite as:
Elagin, V.; Karpov, V.; Kravchenko, A.; Goldstein, A.; Vladyko, A. Choice of Cluster Computing System Hadoop and Apache Spark for Network Systems. Preprints2019, 2019040281 (doi: 10.20944/preprints201904.0281.v1).
Elagin, V.; Karpov, V.; Kravchenko, A.; Goldstein, A.; Vladyko, A. Choice of Cluster Computing System Hadoop and Apache Spark for Network Systems. Preprints 2019, 2019040281 (doi: 10.20944/preprints201904.0281.v1).
Abstract
The article provides detailed information about the new technologies based on cluster computing Hadoop and Apache Spark. The experimental task of processing logistic regression with the help of these technologies is considered. The findings on the comparison of the performance of cluster computing of Hadoop and Apache Spark are revealed and substantiated.
Subject Areas
Cluster computing, Big Data, Spark, Hadoop.
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.