Preprint Article Version 1 This version is not peer-reviewed

A Proposal of Methodology for Designing Big Data Warehouses

Version 1 : Received: 13 June 2018 / Approved: 13 June 2018 / Online: 13 June 2018 (16:19:48 CEST)

How to cite: Di Tria, F.; Lefons, E.; Tangorra, F. A Proposal of Methodology for Designing Big Data Warehouses. Preprints 2018, 2018060219 (doi: 10.20944/preprints201806.0219.v1). Di Tria, F.; Lefons, E.; Tangorra, F. A Proposal of Methodology for Designing Big Data Warehouses. Preprints 2018, 2018060219 (doi: 10.20944/preprints201806.0219.v1).

Abstract

Big Data warehouses are a new class of databases that largely use unstructured and volatile data for analytical purpose. Examples of this kind of data sources are those coming from the Web, such as social networks and blogs, or from sensor networks, where huge amounts of data may be available only for short intervals of time. In order to manage massive data sources, a strategy must be adopted to define multidimensional schemas in presence of fast-changing situations or even undefined business requirements. In the paper, we propose a design methodology that adopts agile and automatic approaches, in order to reduce the time necessary to integrate new data sources and to include new business requirements on the fly. The data are immediately available for analyses, since the underlying architecture is based on a virtual data warehouse that does not require the importing phase. Examples of application of the methodology are presented along the paper in order to show the validity of this approach compared to a traditional one.

Subject Areas

Big data technology; Business intelligence; Data integration; System virtualization.

Readers' Comments and Ratings (0)

Leave a public comment
Send a private comment to the author(s)
Rate this article
Views 0
Downloads 0
Comments 0
Metrics 0
Leave a public comment

×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.