Version 1
: Received: 13 June 2018 / Approved: 13 June 2018 / Online: 13 June 2018 (16:19:48 CEST)
How to cite:
Di Tria, F.; Lefons, E.; Tangorra, F. A Proposal of Methodology for Designing Big Data Warehouses. Preprints2018, 2018060219. https://doi.org/10.20944/preprints201806.0219.v1
Di Tria, F.; Lefons, E.; Tangorra, F. A Proposal of Methodology for Designing Big Data Warehouses. Preprints 2018, 2018060219. https://doi.org/10.20944/preprints201806.0219.v1
Di Tria, F.; Lefons, E.; Tangorra, F. A Proposal of Methodology for Designing Big Data Warehouses. Preprints2018, 2018060219. https://doi.org/10.20944/preprints201806.0219.v1
APA Style
Di Tria, F., Lefons, E., & Tangorra, F. (2018). A Proposal of Methodology for Designing Big Data Warehouses. Preprints. https://doi.org/10.20944/preprints201806.0219.v1
Chicago/Turabian Style
Di Tria, F., Ezio Lefons and Filippo Tangorra. 2018 "A Proposal of Methodology for Designing Big Data Warehouses" Preprints. https://doi.org/10.20944/preprints201806.0219.v1
Abstract
Big Data warehouses are a new class of databases that largely use unstructured and volatile data for analytical purpose. Examples of this kind of data sources are those coming from the Web, such as social networks and blogs, or from sensor networks, where huge amounts of data may be available only for short intervals of time. In order to manage massive data sources, a strategy must be adopted to define multidimensional schemas in presence of fast-changing situations or even undefined business requirements. In the paper, we propose a design methodology that adopts agile and automatic approaches, in order to reduce the time necessary to integrate new data sources and to include new business requirements on the fly. The data are immediately available for analyses, since the underlying architecture is based on a virtual data warehouse that does not require the importing phase. Examples of application of the methodology are presented along the paper in order to show the validity of this approach compared to a traditional one.
Keywords
Big data technology; Business intelligence; Data integration; System virtualization.
Subject
Computer Science and Mathematics, Information Systems
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.