Nirdizati: A Web-Based Tool for Predictive Process Monitoring

Similar documents
A General Framework for Predictive Business Process Monitoring

20775A: Performing Data Engineering on Microsoft HD Insight

Minimizing Overprocessing Waste in Business Processes via Predictive Activity Ordering

20775: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

20775 Performing Data Engineering on Microsoft HD Insight

Process Mining & Predictive Process Monitoring

Managing Data in Motion with the Connected Data Architecture

Integrating MATLAB Analytics into Enterprise Applications The MathWorks, Inc. 1

: 20776A: Performing Big Data Engineering on Microsoft Cloud Services

SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform

CASE STUDY Delivering Real Time Financial Transaction Monitoring

Adobe and Hadoop Integration

This topic focuses on how to prepare a customer for support, and how to use the SAP support processes to solve your customer s problems.

Teradata IntelliSphere

Adobe and Hadoop Integration

SOLUTION SHEET End to End Data Flow Management and Streaming Analytics Platform

Live Video Analytics at Scale with Approximation and Delay-Tolerance

Configurable Policy Enforcement. Automated Remedy Actions. Granular Reporting - Scheduled and On-Demand

Pre-Requisites A good understanding of Azure data services A basic knowledge of the Microsoft Windows operating system and its core functionality

Infosys Real Time Streams

Building a Secure, Approved AMI Factory Process Using Amazon EC2 Systems Manager (SSM), AWS Marketplace, and AWS Service Catalog

Statistics & Optimization with Big Data

Cloud Management Platform Overview First Published On: Last Updated On:

Introduction to SEND Assurance Tool

Automated Service Builder

Comprehensive Enterprise Solution for Compliance and Risk Monitoring

CA UIM Log Analytics. Gain Full Stack Visibility With Contextual Log Insights. Mark Tukh Principal Presale Consultant CA NESS AT

zdata Solutions BI / Advanced Analytic Platform and Pilot Programs

Introduction to the IBM MessageSight appliance for Mobile Messaging and M2M

Top Tier Storage Systems. Case Study Top Tier Storage Systems

Business Process Monitoring and Mining

Analytics for All Your Data: Cloud Essentials. Pervasive Insight in the World of Cloud

Solution Brief Patent insight Pro Enterprise

GROW WITH BIG DATA. Third Eye Consulting Services & Solutions LLC.

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

Driving digital transformation through (big) data

Large US Bank Boosts Insider Threat Detection by 5X with StreamAnalytix

e-issn: p-issn:

ezsupport for ebay What is it? ezsupport for ebay Demo ezsupport for ebay Demo HostedSupport.com 1

Extensions for Alfresco Content Services & Process Services

IT portfolio management template

Integrating MATLAB Analytics into Enterprise Applications

GPU ACCELERATED BIG DATA ARCHITECTURE

Frameworx 11 Certification Report Business Process Framework Release 9.0

Meeting the New Standard for AWS Managed Services

Oracle Enterprise Data Quality Product Roadmap and Statement of Direction. October 2016

New generation of Operational Lease Management

Super Schlumberger Scheduler

Server Configuration Monitor

Bot Insight is here. Improve your company s top-and-bottom-line with powerful, real-time RPA Analytics Go be great.

SoCalGas ENVOY. Understanding Maintenance Schedules Informational Postings User Guide

IoT ANALYTICS IN THE ENTERPRISE WITH FUNL

1% + 99% = AI Popularization

Optimizing Service Assurance with Vitria Operational Intelligence

CHAPTER 7 DESIGN AND IMPLEMENTATION OF AN APPLICATION LAYER ORIENTED ARCHITECTURE FOR RFID SYSTEMS

Driving digital transformation through analytics

Workflow-Processing and Verification for Safety- Critical Engineering: Conceptual Architecture Deliverable D6.1

IBM Message Hub. James Bennett Offering Manager, IBM Cloud Integration IBM Corporation

DOWNTIME IS NOT AN OPTION

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

Ampliando MATLAB Analytics con Kafka y Servicios en la Nube

Course 20535A: Architecting Microsoft Azure Solutions

ALGORITHMIA ENTERPRISE SECURITY & COMPLIANCE OVERVIEW

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

Stateful Services on DC/OS. Santa Clara, California April 23th 25th, 2018

Automating System i Disk Management Better System Performance & Improved Resource Utilization

5th Annual. Cloudera, Inc. All rights reserved.

Datacenter Resource Management First Published On: Last Updated On:

Scaling up MATLAB Analytics with Kafka and Cloud Services

SOFTWARE FOR FINANCIAL ADVISERS

Oracle Policy Automation The modern enterprise advice platform

MiCloud Engage Contact Center

ML Methods for Solving Complex Sorting and Ranking Problems in Human Hiring

2015 The MathWorks, Inc. 1

Scaling up MATLAB Analytics with Kafka and Cloud Services

IBM Watson IoT Strategy

Data Viz Helping Your Staff Become a Data Wiz. Charlie Farah Director Market Development, Healthcare & Public Sector, Qlik APAC August 2017

Microsoft Azure Essentials

Queue Mining: Service Perspectives in Process Mining (Extended Abstract)

Connected Vehicles Accelerator

Financial Services Compliance

Clarity PPM On Demand. Zane Schafer, SVP Software Engineering

Push IIoT Data from Sensor to Cloud Without Getting Lost Along the Way

Hadoop and Analytics at CERN IT CERN IT-DB

WIN BIG WITH GOOGLE CLOUD

vsom vsphere with Operations

SVAP 2.0 Onboarding Workbook. Updated: February 2018

Server Configuration Monitor

Databricks Cloud. A Primer

Architecting Microsoft Azure Solutions

Hortonworks Data Platform

Configurable Process Models: How to Adopt Standard Practices in Your Own Way?

Goya Inference Platform & Performance Benchmarks. Rev January 2019

Building a Robust Analytics Platform

Digital Drive Train Services. Enabling an increase in productivity and safety

Virtual Integrated Plant Simulator (VIPS)

NICE Customer Engagement Analytics - Architecture Whitepaper

AUTOMATING HEALTHCARE CLAIM PROCESSING

Transcription:

Nirdizati: A Web-Based Tool for Predictive Process Monitoring Kerwin Jorbina 1, Andrii Rozumnyi 1,IlyaVerenich 1,2, Chiara Di Francescomarino 3, Marlon Dumas 1, Chiara Ghidini 3, Fabrizio Maria Maggi 1, Marcello La Rosa 3, and Simon Raboczi 3 1 University of Tartu, Estonia {kerwin.jorbina,andriiro,marlon.dumas,f.m.maggi}@ut.ee 2 Queensland University of Technology, Australia {ilya.verenich,m.larosa,simon.raboczi}@qut.edu.au 3 FBK IRST, Trento, Italy {dfmchiara,ghidini}@fbk.eu Abstract. This paper introduces Nirdizati: A web-based application for generating predictions about running cases of a business process. Nirdizati is a configurable full-stack web application that supports users in selecting and tuning prediction methods from a list of implemented algorithms and enables the continuous prediction of various performance indicators at runtime. The tool can be used to predict the outcome, the next events, the remaining time, or the overall workload per day of each case of a process. For example, in a lead-to-order process, Nirdizati can predict which customer leads will convert to purchase orders and when. In a claim handling process, it can predict if a claim decision will be made on time or late. The predictions, as well as real-time summary statistics about the process executions, are presented in a dashboard that o ers multiple visualization options. Based on these predictions, process participants can act proactively to resolve or mitigate potential process performance violations. The target audience of this demonstration includes process mining researchers as well as practitioners interested in exploring the potential of predictive process monitoring. Keywords: Process Mining, Predictive Process Monitoring, Machine Learning 1 Introduction Predictive Process Monitoring [1] is an emerging paradigm based on the continuous generation of predictions about the future values of user-specified performance indicators of a currently running process execution. In this paradigm, a user defines the type of predictions they are interested in and provides a set of historical execution traces. Based on the analysis of these traces, the idea of predictive monitoring is to continuously provide the user with predictions and estimated values of the performance indicators. Such predictions generally depend both on: (i) the sequence of activities executed in a given case; and (ii) the values of data attributes after each activity execution in the case.

2 Jorbina et al. Fig. 1. High-level overview of Nirdizati. As an example, consider a business process for managing loan applications; the applicant can be advised about the combinations of loan amount and duration that are the most likely to lead to an acceptance of the application, given the history of the application until now and the personal data of the applicant (e.g., age, salary, etc.). Several approaches have been proposed in the literature to tackle common predictive process monitoring tasks. However, so far, these approaches have largely remained in the academic domain and have not been widely applied in real-time scenarios where users require a continuous predictive support. In this paper, we present Nirdizati, a pioneering open-source web-based predictive monitoring tool, which is able to fill this gap between research and practice, by providing business analysts with a highly flexible instrument for the selection, generation and analysis of di erent predictive models, and end-users with continuous runtime predictions. Nirdizati consists of two components: Nirdizati Training and Nirdizati Runtime (Fig. 1). Nirdizati Training takes as input a business process event log and produces one or more predictive models, which can then be deployed in Nirdizati Runtime. Once a model is deployed, Nirdizati Runtime listens to a stream of events coming from an information system supporting the process, and produces a stream of predictions. These predictions are then visualized in a continuously updated web dashboard. 2 Nirdizati Training Nirdizati Training is the component of Nirdizati that allows users to produce predictive models later used by Nirdizati Runtime for making predictions on a stream of events. It provides several algorithms for generating predictive models suitable for di erent types of predictions and tailored to each specific dataset. For example, it is able to build predictive models for predicting remaining time, the next activity to be performed, whether a certain outcome will be achieved

Nirdizati: A Web-Based Tool for Predictive Process Monitoring 3 Fig. 2. Front-end of Nirdizati Training. or not and the overall workload per day. To this aim, the training component of Nirdizati relies on two phases: a training and a validation phase. In the former, one or more predictive models are built; in the latter, their suitability to the specific dataset is evaluated, so as to support the user in selecting the predictive model that ensures the best results. Nirdizati Training is composed of a front-end application (Fig. 2), which allows users to select the prediction methods and to assess the goodness-offit of the built models, and a back-end application responsible for the actual training and validation. The back-end application is, in turn, composed of four submodules shown in Fig. 3. A Log Manager is in charge of managing the logs. Uploading and retrieving the logs are the basic operations of this module. The Encoder is responsible for parsing the logs, labeling them according to the desired type of prediction, and preparing the data for the training phase. In the Training Fig. 3. High-level overview of Nirdizati Training.

4 Jorbina et al. case outcome slow probability Information system process events prefix collator event prefixes case outcome reject probability predictions join events to predictions events with predictions Predictive dashboard Text Kafka processor Kafka topic Text Stream of Text JSON objects remaining time Fig. 4. High-level data flow diagram of Nirdizati Runtime. submodule, the encoded data is split into training and validation set used for evaluation purposes, and the predictive models are built from the training data. Finally, the Evaluation submodule tests the validation data against the created model(s) to get accuracy measures with respect to the ground truth, available from the complete traces in the validation data. The back-end application comes with a storage module for saving the uploaded logs and the generated predictive models. 3 Nirdizati Runtime Once the predictive models have been created, they are used by the Runtime component to make predictions on ongoing cases. Nirdizati Runtime takes a stream of events produced by an information system, transforms it into a stream of predictions, and visualizes those predictions in a web-based dashboard. The transformation is implemented using the dataflow pipeline in Fig. 4. The pipeline is built on top of the open-source Apache Kafka stream processing platform. The components of the pipeline are the predictive models from Nirdizati Training. The topic components are network-accessible queues of JSON messages with publisher/subscriber support. This allows the computationally intense work of the s to be distributed across a cluster of networked computers, providing scalability and fault-tolerance. The collator component accumulates the sequence of events-to-date for each case, such that the prediction is a stateless function of the trained predictive model and of the case history. This statelessness is what allows the s to be freely duplicated and distributed. The joiner component composes the original events with the various predictions, ready for display on the dashboard. The dashboard provides a list of both currently ongoing cases (colored in gray) as well as completed cases (colored in green), as shown in Fig. 5. For each case, it is also possible to visualize a range of summary statistics including the number of events in the case, its starting time and the time when the latest event in the case has occurred. For the ongoing cases, Nirdizati Runtime provides the predicted values of the performance indicators the user wants to predict. For completed cases, instead, it shows the actual values of the indicators. In addition to the table view, the dashboard o ers other visualization options, such as pie charts for case outcomes and bar charts for case durations.

Nirdizati: A Web-Based Tool for Predictive Process Monitoring 5 Fig. 5. Main view of Nirdizati Runtime. Process workers and operational managers typical users of Nirdizati can set some process performance targets and subscribe to a stream of warnings and alerts generated whenever these targets are predicted to be violated. Thus, they will be capable of making informed, data-driven decisions to get a better control of the process executions. This is especially beneficial for processes where process participants have more leeway to make corrective actions (for example, in a lead management process). 4 Conclusion Nirdizati is a configurable full-stack web application that supports users in selecting and tuning prediction methods from a list of implemented algorithms and that enables the continuous prediction of various performance indicators at runtime. The predictions are presented in a dashboard that o ers multiple visualization options. A video demo of Nirdizati can be found at http://youtu.be/0nr14lx04-i. A public release of Nirdizati Training and Nirdizati Runtime is available at http://training.nirdizati.com and http://dashboard.nirdizati.com,respectively. The source code is available under the Lesser GNU Public License (LGPL) at http://github.com/nirdizati. References 1. Maggi, F.M., Di Francescomarino, C., Dumas, M., Ghidini, C.: Predictive monitoring of business processes. In: Advanced Information Systems Engineering - 26th International Conference, CAiSE 2014. Proceedings. pp. 457 472 (2014)