Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Data Preprocessing Method and API for Mining Processes from Cloud-Based Application Event Logs

Version 1 : Received: 13 April 2022 / Approved: 15 April 2022 / Online: 15 April 2022 (07:37:06 CEST)

A peer-reviewed article of this Preprint also exists.

El-Gharib, N.M.; Amyot, D. Data Preprocessing Method and API for Mining Processes from Cloud-Based Application Event Logs. Algorithms 2022, 15, 180. El-Gharib, N.M.; Amyot, D. Data Preprocessing Method and API for Mining Processes from Cloud-Based Application Event Logs. Algorithms 2022, 15, 180.

Abstract

Background: Process mining (PM) exploits event logs to obtain meaningful information about the processes that produced them. As the number of applications developed on cloud infrastructures is increasing, it becomes important to study and discover their underlying processes. However, many current PM technologies face challenges in dealing with complex and large event logs from cloud applications, especially when they have little structure (e.g., clickstreams). Methods: Using Design Science Research, this paper introduces a new method, called Cloud Pattern API – Process Mining (CPA-PM), that enables discovering and analyzing cloud-based application processes using PM in a way that addresses many of these challenges. CPA-PM exploits a new application programming interface (API), with an R implementation, for creating repeatable scripts that preprocess event logs collected from such applications. Results: Applying CPA-PM to a case with real and evolving event logs related to the trial process of a Software-as-a-Service cloud application led to useful analyses and insights, with reusable scripts. Conclusion: CPA-PM helps producing executable scripts for filtering event logs from clickstream and cloud-based applications, where the scripts can be used in pipelines while minimizing the need for error-prone and time-consuming manual filtering.

Keywords

API; clickstream; cloud applications; process mining; scripting

Subject

Computer Science and Mathematics, Software

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.