FLINK IN ZALANDO S WORLD OF MICROSERVICES JAVIER LOPEZ MIHAIL VIERU

Size: px
Start display at page:

Download "FLINK IN ZALANDO S WORLD OF MICROSERVICES JAVIER LOPEZ MIHAIL VIERU"

Transcription

1 FLINK IN ZALANDO S WORLD OF MICROSERVICES JAVIER LOPEZ MIHAIL VIERU

2 AGENDA Zalando s Microservices Architecture Saiki - Data Integration and Distribution at Scale Flink in a Microservices World Stream Processing Use Cases: o o Business Process Monitoring Continuous ETL Future Work 2

3 ABOUT US Mihail Vieru Big Data Engineer, Business Intelligence Javier López Big Data Engineer, Business Intelligence 3

4 4

5 One of Europe's largest online fashion retailers 15 countries ~19 million active customers ~3 billion revenue ,500 brands 150,000+ products 11,000+ employees in Europe 5

6 ZALANDO TECHNOLOGY TECHNOLOGISTS Rapidly growing international team 6

7 VINTAGE ARCHITECTURE

8 VINTAGE BUSINESS INTELLIGENCE Dev Business Logic Business Logic Business Logic Business Logic DBA Database Database Database Database Classical ETL process BI Data Warehouse (DWH) 8

9 VINTAGE BUSINESS INTELLIGENCE Exasol DWH Oracle 9

10 RADICAL AGILITY

11 RADICAL AGILITY AUTONOMY MASTERY PURPOSE 11

12 RADICAL AGILITY - AUTONOMY 12 Technologies Operations Teams

13 REST API REST API REST API REST API REST API SUPPORTING AUTONOMY: MICROSERVICES Business Logic Business Logic Business Logic Database Database Database Business Logic Business Logic Database Database 13

14 REST API SUPPORTING AUTONOMY: MICROSERVICES Team A Business Logic public Internet REST API Business Logic Team B Database Database Applications communicate using REST APIs Databases hidden behind the walls of AWS VPC 14

15 REST API SUPPORTING AUTONOMY: MICROSERVICES Team A Business Logic public Internet REST API Business Logic Team B Database Database Classical ETL process is impossible! 15

16 App A REST API App B REST API App C REST API App D REST API SUPPORTING AUTONOMY: MICROSERVICES Business Logic Business Logic Business Logic Business Logic Database Database Database Database Business Intelligence 16

17 SAIKI

18 SAIKI DATA PLATFORM App A App B App C App D BI Data Warehouse SAIKI 18

19 SAIKI DATA INTEGRATION & DISTRIBUTION App A App B App C App D BI Data Warehouse E.g. Forecast DB SAIKI REST API Exporter Stream Processing via Apache Flink Data Lake. AWS S3 19

20 Data sources Extraction SAIKI SUMMARY Data sources Connections Data sources Technologies Data Delivery B E F O R E Δ J D B C A F T E R REST 20

21 FLINK IN A MICROSERVICES WORLD

22 OPPORTUNITIES FOR NEXT GEN BI Cloud Computing - Distributed ETL - Scale Access to Real Time Data - All teams publish data to central event bus Hub for Data Teams - Data Lake provides distributed access and fine grained security - Data can be transformed (aggregated, joined, etc.) before delivering it to data teams Semi-Structured Data General-purpose data processing engines like Flink or Spark let you define own data types and functions. - Fabian Hueske, dataartisans 22

23 THE RIGHT FIT STREAM PROCESSING 23

24 THE RIGHT FIT STREAM PROCESSING ENGINE Candidates: Storm & Samza ruled out because of batch processing requirement 24

25 25

26 26 SPARK VS. FLINK DIFFERENCES Feature Apache Spark Apache Flink Processing mode micro-batching tuple at a time Temporal processing support processing time event time, ingestion time, processing time Latency seconds sub-second Back pressure handling manual configuration implicit, through system architecture State access full state scan for each microbatch value lookup by key Operator library neutral ++ (split, windowbycount..) Support neutral ++ (mailing list, direct contact & support from data Artisans)

27 APACHE FLINK true stream processing framework process events at a consistently high rate with low latency scalable great community and on-site support from Berlin/ Europe university graduates with Flink skills 27

28 FLINK ON AWS - OUR APPLIANCE MASTER ELB EC2 Docker Flink Shadow Master EC2 Docker Flink Master WORKERS ELB EC2 Docker Flink Worker EC2 Docker Flink Worker EC2 Docker Flink Worker 28

29 USE CASES

30 BUSINESS PROCESS MONITORING

31 BUSINESS PROCESS A business process is in its simplest form a chain of correlated events: start event completion event ORDER_CREATE ALL_PARCELS_SHIPPED D Business Events from the whole Zalando platform flow through Saiki => opportunity to process those streams in near real time 31

32 REAL-TIME BUSINESS PROCESS MONITORING Check if business processes in the Zalando platform work Analyze data on the fly: o Order velocities o Delivery velocities o Control SLAs of correlated events, e.g. parcel sent out after order 32

33 ARCHITECTURE BPM Operational Systems App A App B Nakadi Event Bus Kafka2Kafka Unified Log Stream Processing Elasticsearch App C Saiki BPM Cfg Service UI Alert Svc OAUTH PUBLIC INTERNET 33

34 34 HOW WE USE FLINK IN BPM Event Types; 1 Event Type -> 1 Kafka topic Analyze processes with correlated event types (Join & Union) Enrich data based on business rules Sliding Windows (1min to 48hrs) for Platform Snapshots State for alert metadata Generation and processing of Complex Events (CEP lib)

35 STREAMING ETL

36 Extract Transform Load (ETL) Traditional ETL process: Batch processing No real time ETL tools Heavy processing on the storage side 36

37 WHAT CHANGED WITH RADICAL AGILITY? Data comes in a semi-structured format (JSON payload) Data is distributed in separate Kafka topics There would be peak times, meaning that the data flow will increase by several factors Data sources number increased by several factors 37

38 ARCHITECTURE STREAMING ETL Operational Systems App A App B App C Importer Oracle DWH Nakadi Event Bus Kafka2Kafka ` Unified Log Exporter Stream Processing Saiki Streaming ETL 38

39 HOW WE (WOULD) USE FLINK IN STREAMING ETL Transformation of complex payloads into simple ones for easier consumption in Oracle DWH Combine several topics based on Business Rules (Union, Join) Pre-Aggregate data to improve performance in the generation of reports (Windows, State) Data cleansing Data validation 39

40 FUTURE USE CASES

41 COMPLEX EVENT PROCESSING FOR BPM Cont. example business process: Multiple PARCEL_SHIPPED events per order Generate complex event ALL_PARCELS_SHIPPED, when all PARCEL_SHIPPED events received (CEP lib, State) 41

42 DEPLOYMENTS FROM OTHER BI TEAMS Flink Jobs from other BI Teams Requirements: manage and control deployments isolation of data flows 42 o prevent different jobs from writing to the same sink resource management in Flink o share cluster resources among concurrently running jobs StreamSQL would significantly lower the entry barrier

43 REPLACE KAFKA2KAFKA COMPONENT Python app extracts events from REST API Nakadi Event Bus writes them to our Kafka cluster Idea: Create Nakadi consumer/producer to enable stream processing with Flink to other internal users 43 (first POC done)

44 OTHER FUTURE TOPICS New use cases for Real Time Analytics/ BI o o Sales monitoring Price monitoring Fraud detection for payments (evaluation) Contact customer according to variable event pattern (evaluation) 44

45 CONCLUSION Flink proved to be the right fit for our current stream processing use cases. It enables us to build Zalando s Next Gen BI platform. 45

46 THANK YOU

MapR Pentaho Business Solutions

MapR Pentaho Business Solutions MapR Pentaho Business Solutions The Benefits of a Converged Platform to Big Data Integration Tom Scurlock Director, WW Alliances and Partners, MapR Key Takeaways 1. We focus on business values and business

More information

Introduction to Stream Processing

Introduction to Stream Processing Introduction to Processing Guido Schmutz DOAG Big Data 2018 20.9.2018 @gschmutz BASEL BERN BRUGG DÜSSELDORF HAMBURG KOPENHAGEN LAUSANNE guidoschmutz.wordpress.com FRANKFURT A.M. FREIBURG I.BR. GENF MÜNCHEN

More information

Confidential

Confidential June 2017 1. Is your EDW becoming too expensive to maintain because of hardware upgrades and increasing data volumes? 2. Is your EDW becoming a monolith, which is too slow to adapt to business s analytical

More information

Real-time Streaming Applications on AWS Patterns and Use Cases

Real-time Streaming Applications on AWS Patterns and Use Cases Real-time Streaming Applications on AWS Patterns and Use Cases Christian Deger, Chief Architect, AutoScout24 Dr. Steffen Hausmann, Solutions Architect, AWS May 18, 2017 2016, Amazon Web Services, Inc.

More information

Architecting for Real- Time Big Data Analytics. Robert Winters

Architecting for Real- Time Big Data Analytics. Robert Winters Architecting for Real- Time Big Data Analytics Robert Winters About Me 2 ROBERT WINTERS Head of Business Intelligence, TravelBird Ten years experience in analytics, five years with Vertica and big data

More information

Operational Hadoop and the Lambda Architecture for Streaming Data

Operational Hadoop and the Lambda Architecture for Streaming Data Operational Hadoop and the Lambda Architecture for Streaming Data 2015 MapR Technologies 2015 MapR Technologies 1 Topics From Batch to Operational Workloads on Hadoop Streaming Data Environments The Lambda

More information

TechArch Day Digital Decoupling. Oscar Renalias. Accenture

TechArch Day Digital Decoupling. Oscar Renalias. Accenture TechArch Day 2018 Digital Decoupling Oscar Renalias Accenture !"##$ oscar.renalias@acenture.com @oscarrenalias https://www.linkedin.com/in/oscarrenalias/ https://github.com/accenture THE ERA OF THE BIG

More information

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

Enterprise Analytics Accelerating Your Path to Value with an Open Analytics Platform

Enterprise Analytics Accelerating Your Path to Value with an Open Analytics Platform Enterprise Analytics Accelerating Your Path to Value with an Open Analytics Platform Federico Pozzi @fedealbpozzi Mathias Coopmans @macoopma Characteristics of a badly managed platform No clear data

More information

20775: Performing Data Engineering on Microsoft HD Insight

20775: Performing Data Engineering on Microsoft HD Insight Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

TechValidate Survey Report. Converged Data Platform Key to Competitive Advantage

TechValidate Survey Report. Converged Data Platform Key to Competitive Advantage TechValidate Survey Report Converged Data Platform Key to Competitive Advantage TechValidate Survey Report Converged Data Platform Key to Competitive Advantage Executive Summary What Industry Analysts

More information

Big Data Introduction

Big Data Introduction Big Data Introduction Who we are Experts At Your Service Over 50 specialists in IT infrastructure Certified, experienced, passionate Based In Switzerland 100% self-financed Swiss company Over CHF8 mio.

More information

How In-Memory Computing can Maximize the Performance of Modern Payments

How In-Memory Computing can Maximize the Performance of Modern Payments How In-Memory Computing can Maximize the Performance of Modern Payments 2018 The mobile payments market is expected to grow to over a trillion dollars by 2019 How can in-memory computing maximize the performance

More information

Building event-driven Microservices with Kafka Ecosystem

Building event-driven Microservices with Kafka Ecosystem Building event-driven Microservices with Kafka Ecosystem Guido Schmutz London, 30.5.2018 @gschmutz guidoschmutz.wordpress.com BASEL BERN BRUGG DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. GENF HAMBURG KOPENHAGEN

More information

20775 Performing Data Engineering on Microsoft HD Insight

20775 Performing Data Engineering on Microsoft HD Insight Duración del curso: 5 Días Acerca de este curso The main purpose of the course is to give students the ability plan and implement big data workflows on HD. Perfil de público The primary audience for this

More information

SCALING ZALANDO

SCALING ZALANDO SCALING ARCHITECTURE @ ZALANDO FELIX MÜLLER @FMUELLER_BLN 02-02-2017 ABOUT ME Software Architect @ Zalando Software Engineer by heart 8 years in IT 2 years as an Architect Twitter: @fmueller_bln 2 ZALANDO

More information

Datametica. The Modern Data Platform Enterprise Data Hub Implementations. Why is workload moving to Cloud

Datametica. The Modern Data Platform Enterprise Data Hub Implementations. Why is workload moving to Cloud Datametica The Modern Data Platform Enterprise Data Hub Implementations Why is workload moving to Cloud 1 What we used do Enterprise Data Hub & Analytics What is Changing Why it is Changing Enterprise

More information

EXPERIENCE EVERYTHING

EXPERIENCE EVERYTHING EXPERIENCE EVERYTHING RAPID. OPEN. SECURE. Jigar Bhansali VP Solution & Architecture, Asia & China INNOVATION TOUR 2018 April 26 Singapore 2018 Software AG. All rights reserved. For internal use only HYBRID

More information

Common Customer Use Cases in FSI

Common Customer Use Cases in FSI Common Customer Use Cases in FSI 1 Marketing Optimization 2014 2014 MapR MapR Technologies Technologies 2 Fortune 100 Financial Services Company 104M CARD MEMBERS 3 Financial Services: Recommendation Engine

More information

Building data-driven applications with SAP Data Hub and Amazon Web Services

Building data-driven applications with SAP Data Hub and Amazon Web Services Building data-driven applications with SAP Data Hub and Amazon Web Services Dr. Lars Dannecker, Steffen Geissinger September 18 th, 2018 Cross-department disconnect Cross-department disconnect Cross-department

More information

SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform

SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform CREATE STREAMING ANALYTICS APPLICATIONS IN MINUTES WITHOUT WRITING CODE The increasing growth

More information

DOAG Big Data Days 2018 DWH Modernization

DOAG Big Data Days 2018 DWH Modernization DOAG Big Data Days 2018 DWH Modernization Do I need a data lake? If yes, why? Jan Ott @jan_ott_ch https://janottblog.com BASEL BERN BRUGG DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. GENF HAMBURG KOPENHAGEN

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Course Details Course Code: Duration: Notes: 20775A 5 days This course syllabus should be used to determine whether the course is appropriate

More information

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. Course Content Course Description: The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. At Course Completion: After competing this course,

More information

CASE STUDY Delivering Real Time Financial Transaction Monitoring

CASE STUDY Delivering Real Time Financial Transaction Monitoring CASE STUDY Delivering Real Time Financial Transaction Monitoring Steve Wilkes Striim Co-Founder and CTO Background Customer is a US based Payment Systems Provider Large Network of ATM and Cashier Operated

More information

Configurable Data Pipelines Towards a Reference Architecture. E. U. Syed Philips Lighting Research Sunday, June 18, 2017

Configurable Data Pipelines Towards a Reference Architecture. E. U. Syed Philips Lighting Research Sunday, June 18, 2017 Configurable Data Pipelines Towards a Reference Architecture E. U. Syed Philips Lighting Research Sunday, June 18, 2017 A Little About Me Ekhtiar Syed Scientist/ Data Engineer, Philips Lighting Research

More information

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Data Analytics Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Last 15 years IT-centric Traditional Analytics Traditional Applications Rigid Infrastructure Internet Next

More information

SOLUTION SHEET End to End Data Flow Management and Streaming Analytics Platform

SOLUTION SHEET End to End Data Flow Management and Streaming Analytics Platform SOLUTION SHEET End to End Data Flow Management and Streaming Analytics Platform CREATE STREAMING ANALYTICS APPLICATIONS IN MINUTES WITHOUT WRITING CODE The increasing growth of data, especially data-in-motion,

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Duration: 5 days; Instructor-led Implement Spark Streaming Using the DStream API. Develop Big Data Real-Time Processing Solutions with Apache

More information

How to Build Your Data Ecosystem with Tableau on AWS

How to Build Your Data Ecosystem with Tableau on AWS How to Build Your Data Ecosystem with Tableau on AWS Moving Your BI to the Cloud Your BI is working, and it s probably working well. But, continuing to empower your colleagues with data is going to be

More information

Real-time IoT Big Data-in-Motion Analytics Case Study: Managing Millions of Devices at Country-Scale

Real-time IoT Big Data-in-Motion Analytics Case Study: Managing Millions of Devices at Country-Scale Real-time IoT Big Data-in-Motion Analytics Case Study: Managing Millions of Devices at Country-Scale Real-time IoT Big Data-in-Motion Analytics Case Study: Managing Millions of Devices at Country-Scale

More information

Managing Data in Motion with the Connected Data Architecture

Managing Data in Motion with the Connected Data Architecture Managing in Motion with the Connected Architecture Dmitry Baev Director, Solutions Engineering Doing It Right SYMPOSIUM March 23-24, 2017 1 Hortonworks Inc. 2011 2016. All Rights Reserved 4 th Big & Business

More information

Your Top 5 Reasons Why You Should Choose SAP Data Hub INTERNAL

Your Top 5 Reasons Why You Should Choose SAP Data Hub INTERNAL Your Top 5 Reasons Why You Should Choose INTERNAL Top 5 reasons for choosing the solution 1 UNIVERSAL 2 INTELLIGENT 3 EFFICIENT 4 SCALABLE 5 COMPLIANT Universal view of the enterprise and Big Data: Get

More information

Secure information access is critical & more complex than ever

Secure information access is critical & more complex than ever WHITE PAPER Purpose-built Cloud Platform for Enabling Identity-centric and Internet of Things Solutions Connecting people, systems and things across the extended digital business ecosystem. Secure information

More information

A Reference Architecture for Hybrid Integration. Peter Broadhurst Senior Technical Staff Member for IBM App Connect

A Reference Architecture for Hybrid Integration. Peter Broadhurst Senior Technical Staff Member for IBM App Connect A Reference Architecture for Hybrid Integration Peter Broadhurst Senior Technical Staff Member for IBM App Connect Common use cases for hybrid integration within digital transformation A. Join the API

More information

ProSiebenSat1 s Digital Media Transformation AWS Enterprise Summit 2015

ProSiebenSat1 s Digital Media Transformation AWS Enterprise Summit 2015 ProSiebenSat1 s Digital Media Transformation AWS Enterprise Summit 2015 Xiaoqun Clever, CTO Frankfurt March 24, 2014 Page 1 Who and where are we? Page 2 P7S1 Group a leading media corporation in Europe

More information

WELCOME TO. Cloud Data Services: The Art of the Possible

WELCOME TO. Cloud Data Services: The Art of the Possible WELCOME TO Cloud Data Services: The Art of the Possible Goals for Today Share the cloud-based data management and analytics technologies that are enabling rapid development of new mobile applications Discuss

More information

MapR Streams A global pub-sub event streaming system for big data and IoT

MapR Streams A global pub-sub event streaming system for big data and IoT MapR Streams A global pub-sub event streaming system for big data and IoT Ben Sadeghi Data Scientist, APAC IDA Forum on IoT Jan 18, 2016 2015 MapR Technologies 2015 MapR Technologies MapR Streams: Vision

More information

Integrating the Enterprise. How Business Leaders are Implementing Digital Integration

Integrating the Enterprise. How Business Leaders are Implementing Digital Integration Integrating the Enterprise How Business Leaders are Implementing Digital Integration Today s Session In Review Business Value of IoT Building an IoT Backbone Integrating the Enterprise Market Potential

More information

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

ORACLE DATA INTEGRATOR ENTERPRISE EDITION ORACLE DATA INTEGRATOR ENTERPRISE EDITION Oracle Data Integrator Enterprise Edition delivers high-performance data movement and transformation among enterprise platforms with its open and integrated E-LT

More information

ADVANCED ANALYTICS & IOT ARCHITECTURES

ADVANCED ANALYTICS & IOT ARCHITECTURES ADVANCED ANALYTICS & IOT ARCHITECTURES Presented by: Orion Gebremedhin Director of Technology, Data & Analytics Marc Lobree National Architect, Advanced Analytics EDW THE RIGHT TOOL FOR THE RIGHT WORKLOAD

More information

Maturing IoT solutions with Microsoft Azure. Glenn Colpaert Azure/IoT Domain

Maturing IoT solutions with Microsoft Azure. Glenn Colpaert Azure/IoT Domain Maturing IoT solutions with Microsoft Azure Glenn Colpaert Azure/IoT Domain Lead @GlennColpaert Who we are 2000 Belgium 2004 France 2013 Portugal 2016 Switzerland 2016 UK 2016 The Netherlands 2017 Malta

More information

Alexander Klein. ETL meets Azure

Alexander Klein. ETL meets Azure Alexander Klein ETL meets Azure Thanks to our sponsors: Who am I? Independent BI Consultant > 15 years experience of SQL Server Focus on Microsoft BI Stack & AI & Azure a.klein@consulting-bi.de @SQL_Alex

More information

REDEFINE BIG DATA. Zvi Brunner CTO. Copyright 2015 EMC Corporation. All rights reserved.

REDEFINE BIG DATA. Zvi Brunner CTO. Copyright 2015 EMC Corporation. All rights reserved. 1 REDEFINE BIG DATA Zvi Brunner CTO 2 2020: A NEW DIGITAL WORLD 30B DEVICES 7B PEOPLE Millions OF NEW BUSINESSES Source: Gartner Group, 2014 DIGITIZATION IS ALREADY BEGINNING PRECISION FARMING DRESS THAT

More information

RHAPSODY VISION & ROADMAP FIORA AU, TIM WHITTINGTON OCT 2017

RHAPSODY VISION & ROADMAP FIORA AU, TIM WHITTINGTON OCT 2017 RHAPSODY VISION & ROADMAP FIORA AU, TIM WHITTINGTON OCT 2017 Rhapsody is evolving alongside healthcare Healthcare data integration continues to get larger scale and more demanding MORE data and interfaces,

More information

Combine Microservices Framework for Flexible, Scalable, High Availability Big Data Analytics

Combine Microservices Framework for Flexible, Scalable, High Availability Big Data Analytics Combine Microservices Framework for Flexible, Scalable, High Availability Big Data Analytics Dan Widdis, Principal Operations Research Analyst May 10, 2016 Approved for public release; distribution is

More information

The Need For Speed: Fast Data Development Trends Insights from over 2,400 developers on the impact of Data in Motion in the real world

The Need For Speed: Fast Data Development Trends Insights from over 2,400 developers on the impact of Data in Motion in the real world The Need For Speed: Fast Data Development Trends 2017 Insights from over 2,400 developers on the impact of Data in Motion in the real world 1 About This Report The digitization of the world has fueled

More information

When Big Data Meets Fast Data

When Big Data Meets Fast Data 15 November 2016 When Big Data Meets Fast Data - London 2016 Ted Orme VP Technology EMEA When Big Data Meets Fast Data The Evolution of Hadoop Enterprise ready From batch to real-time Now add Cloud It

More information

Amsterdam. (technical) Updates & demonstration. Robert Voermans Governance architect

Amsterdam. (technical) Updates & demonstration. Robert Voermans Governance architect (technical) Updates & demonstration Robert Voermans Governance architect Amsterdam Please note IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice

More information

Data Integration for the Real-Time Enterprise

Data Integration for the Real-Time Enterprise Solutions Brief Data Integration for the Real-Time Enterprise Business Agility in a Constantly Changing World Executive Summary For companies to navigate turbulent business conditions and add value to

More information

C3 Products + Services Overview

C3 Products + Services Overview C3 Products + Services Overview AI CLOUD PREDICTIVE ANALYTICS IoT Table of Contents C3 is a Computer Software Company 1 C3 PaaS Products 3 C3 SaaS Products 5 C3 Product Trials 6 C3 Center of Excellence

More information

EXECUTIVE BRIEF. Successful Data Warehouse Approaches to Meet Today s Analytics Demands. In this Paper

EXECUTIVE BRIEF. Successful Data Warehouse Approaches to Meet Today s Analytics Demands. In this Paper Sponsored by Successful Data Warehouse Approaches to Meet Today s Analytics Demands EXECUTIVE BRIEF In this Paper Organizations are adopting increasingly sophisticated analytics methods Analytics usage

More information

CI in the cloud. Case Talend & Nordcloud Ilja Summala, Group CTO, Nordcloud Anubhav Sharma, Senior Software Architect, Talend Deutschland

CI in the cloud. Case Talend & Nordcloud Ilja Summala, Group CTO, Nordcloud Anubhav Sharma, Senior Software Architect, Talend Deutschland BERLIN 2015 CI in the cloud Case Talend & Nordcloud Ilja Summala, Group CTO, Nordcloud Anubhav Sharma, Senior Software Architect, Talend Deutschland About Nordcloud Nordcloud is leading European AWS consultancy

More information

St Louis CMG Boris Zibitsker, PhD

St Louis CMG Boris Zibitsker, PhD ENTERPRISE PERFORMANCE ASSURANCE BASED ON BIG DATA ANALYTICS St Louis CMG Boris Zibitsker, PhD www.beznext.com bzibitsker@beznext.com Abstract Today s fast-paced businesses have to make business decisions

More information

Intro to Big Data and Hadoop

Intro to Big Data and Hadoop Intro to Big and Hadoop Portions copyright 2001 SAS Institute Inc., Cary, NC, USA. All Rights Reserved. Reproduced with permission of SAS Institute Inc., Cary, NC, USA. SAS Institute Inc. makes no warranties

More information

New Big Data Solutions and Opportunities for DB Workloads

New Big Data Solutions and Opportunities for DB Workloads New Big Data Solutions and Opportunities for DB Workloads Hadoop and Spark Ecosystem for Data Analytics, Experience and Outlook Luca Canali, IT-DB Hadoop and Spark Service WLCG, GDB meeting CERN, September

More information

Data Strategy: How to Handle the New Data Integration Challenges. Edgar de Groot

Data Strategy: How to Handle the New Data Integration Challenges. Edgar de Groot Data Strategy: How to Handle the New Data Integration Challenges Edgar de Groot New Business Models Lead to New Data Integration Challenges Organisations are generating insight Insight is capital 3 Retailers

More information

IoT ANALYTICS IN THE ENTERPRISE WITH FUNL

IoT ANALYTICS IN THE ENTERPRISE WITH FUNL INNOVATION PLATFORM WHITE PAPER 1 The plethora of IoT devices is already adding to the exponentially increasing volumes, variety, and velocity of Big Data. This paper examines IoT analytics and provides

More information

AWS Architecture Case Study: Real-Time Bidding. Tom Maddox, Solutions Architect

AWS Architecture Case Study: Real-Time Bidding. Tom Maddox, Solutions Architect AWS Architecture Case Study: Real-Time Bidding Tom Maddox, Solutions Architect Who am I? Gardener (Capacity Planning) Motorcyclist (Agility) Mobile App Writer Problem Solver Technology Geek Solutions Architect

More information

Modern Analytics Architecture

Modern Analytics Architecture Modern Analytics Architecture So what is a. Modern analytics architecture? Machine Learning AI Open source Big Data DevOps Cloud In-memory IoT Trends supporting Next-Generation analytics Source: Next-Generation

More information

Insights-Driven Operations with SAP HANA and Cloudera Enterprise

Insights-Driven Operations with SAP HANA and Cloudera Enterprise Insights-Driven Operations with SAP HANA and Cloudera Enterprise Unleash your business with pervasive Big Data Analytics with SAP HANA and Cloudera Enterprise The missing link to operations As big data

More information

DOWNTIME IS NOT AN OPTION

DOWNTIME IS NOT AN OPTION DOWNTIME IS NOT AN OPTION HOW APACHE MESOS AND DC/OS KEEPS APPS RUNNING DESPITE FAILURES AND UPDATES 2017 Mesosphere, Inc. All Rights Reserved. 1 WAIT, WHO ARE YOU? Engineer at Mesosphere DC/OS Contributor

More information

IBM Message Hub. James Bennett Offering Manager, IBM Cloud Integration IBM Corporation

IBM Message Hub. James Bennett Offering Manager, IBM Cloud Integration IBM Corporation IBM Message Hub James Bennett Offering Manager, IBM Cloud Integration 2016 IBM Corporation The continued growth of PaaS 2 Use cases 1 Hub for asynchronously connecting services inside Bluemix or beyond

More information

Cask Data Application Platform (CDAP) Extensions

Cask Data Application Platform (CDAP) Extensions Cask Data Application Platform (CDAP) Extensions CDAP Extensions provide additional capabilities and user interfaces to CDAP. They are use-case specific applications designed to solve common and critical

More information

Integrating MATLAB Analytics into Enterprise Applications The MathWorks, Inc. 1

Integrating MATLAB Analytics into Enterprise Applications The MathWorks, Inc. 1 Integrating Analytics into Enterprise Applications 2015 The MathWorks, Inc. 1 Agenda Example Problem Access and Preprocess Data Develop a Predictive Model Integrate Analytics with Production Systems Build

More information

Data Analytics Use Cases, Platforms, Services. ITMM, March 5 th, 2018 Luca Canali, IT-DB

Data Analytics Use Cases, Platforms, Services. ITMM, March 5 th, 2018 Luca Canali, IT-DB Data Analytics Use Cases, Platforms, Services ITMM, March 5 th, 2018 Luca Canali, IT-DB 1 Analytics and Big Data Pipelines Use Cases Many use cases at CERN for analytics Data analysis, dashboards, plots,

More information

Introducing Infor Xi/Ming.le for M3

Introducing Infor Xi/Ming.le for M3 Introducing Infor Xi/Ming.le for M3 Merit Consulting AS Sandnes/Norway karsten.hesselager@infor.com 1 2 Agenda Introducing Infor Xi Tech Stack Why have Infor developed Xi? What is included in Xi Demo of

More information

EBOOK: Cloudwick Powering the Digital Enterprise

EBOOK: Cloudwick Powering the Digital Enterprise EBOOK: Cloudwick Powering the Digital Enterprise Contents What is a Data Lake?... Benefits of a Data Lake on AWS... Building a Data Lake on AWS... Cloudwick Case Study... About Cloudwick... Getting Started...

More information

Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Copyright 2013, Oracle and/or its affiliates. All rights reserved. 1 Oracle Fusion Middleware Next-Generation Application Platform Web Social Mobile Business Process Management Service Integration User Engagement Content Management Identity Management Business Intelligence

More information

Processing Big Data with Pentaho. Rakesh Saha Pentaho Senior Product Manager, Hitachi Vantara

Processing Big Data with Pentaho. Rakesh Saha Pentaho Senior Product Manager, Hitachi Vantara Processing Big Data with Pentaho Rakesh Saha Pentaho Senior Product Manager, Hitachi Vantara Agenda Pentaho s Latest and Upcoming Features for Processing Big Data Batch or Real-time Process big data visually

More information

Industrial IoT Solution Architecture Design From Connectivity to Data

Industrial IoT Solution Architecture Design From Connectivity to Data Industrial IoT Solution Architecture Design From Connectivity to Data Cheryl Hsu Program Manager Strategic Engagement & Industrial IoT, Microsoft IoT Enables a Digital Feedback Loop The benefits are profound

More information

CA UIM Log Analytics. Gain Full Stack Visibility With Contextual Log Insights. Mark Tukh Principal Presale Consultant CA NESS AT

CA UIM Log Analytics. Gain Full Stack Visibility With Contextual Log Insights. Mark Tukh Principal Presale Consultant CA NESS AT CA UIM Log Analytics Gain Full Stack Visibility With Contextual Log Insights Mark Tukh Principal Presale Consultant CA Division @ NESS AT Analytics is the New Battleground > 50% large organizations globally

More information

Simplifying Your Modern Data Architecture Footprint

Simplifying Your Modern Data Architecture Footprint MIKE COCHRANE VP Analytics & Information Management Simplifying Your Modern Data Architecture Footprint Or Ways to Accelerate Your Success While Maintaining Your Sanity June 2017 mycervello.com Businesses

More information

Business is being transformed by three trends

Business is being transformed by three trends Business is being transformed by three trends Big Cloud Intelligence Stay ahead of the curve with Cortana Intelligence Suite Business apps People Custom apps Apps Sensors and devices Cortana Intelligence

More information

Spotlight Sessions. Nik Rouda. Director of Product Marketing Cloudera, Inc. All rights reserved. 1

Spotlight Sessions. Nik Rouda. Director of Product Marketing Cloudera, Inc. All rights reserved. 1 Spotlight Sessions Nik Rouda Director of Product Marketing Cloudera @nrouda Cloudera, Inc. All rights reserved. 1 Spotlight: Protecting Your Data Nik Rouda Product Marketing Cloudera, Inc. All rights reserved.

More information

Big data is hard. Top 3 Challenges To Adopting Big Data

Big data is hard. Top 3 Challenges To Adopting Big Data Big data is hard Top 3 Challenges To Adopting Big Data Traditionally, analytics have been over pre-defined structures Data characteristics: Sales Questions answered with BI and visualizations: Customer

More information

What companies are looking for

What companies are looking for What companies are looking for Reduce costs and inefficiencies Increase revenue Create new business models The Internet of Things helps to reach these goals Insights Speed & Efficiency Innovation & new

More information

Edge Analytics for IoT Device Intelligence

Edge Analytics for IoT Device Intelligence Edge Analytics for IoT Device Intelligence 1. IoT Trends 2. IoT Analytics 3. Edge Analytics Platform: Kanga 4. Future Direction 2017. 3. 10 IoT Trends - Business/Technology (1/3) Google : IoT Solution

More information

Databricks Cloud. A Primer

Databricks Cloud. A Primer Databricks Cloud A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to

More information

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand Paper 2698-2018 Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand ABSTRACT Digital analytics is no longer just about tracking the number

More information

Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions

Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions Hector Garcia Tellado Program Manager Lead, Azure IoT Suite #IoTinActionMS #IoTinActionMS Agenda Customers using IoT today Microsoft

More information

NOT ALL CLOUDS ARE CREATED EQUAL

NOT ALL CLOUDS ARE CREATED EQUAL NOT ALL CLOUDS ARE CREATED EQUAL Saran Sankar Technology CoE Thursday, February 15 1 connecting senior-level financial executives since 1931 CPE Credits Today s webinar is worth 1 Continuing Professional

More information

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW TOPICS COVERED 1 2 Fundamentals of Big Data Platforms Major Big Data Tools Scaling Up vs. Out SCALE UP (SMP) SCALE OUT (MPP) + (n) Upgrade

More information

Dell IT Proven Dell IT s Big Data Journey with Big Possibilities

Dell IT Proven Dell IT s Big Data Journey with Big Possibilities Dell IT Proven Dell IT s Big Data Journey with Big Possibilities Darryl Smith Chief Data Platform Architect, DE Patrick O Sullivan Sr Director, Analytics & BI Agenda 1 Why are we here / current state challenges

More information

COURSE OUTLINE: Implementing a Data Warehouse with SQL Server Implementing a Data Warehouse with SQL Server 2014

COURSE OUTLINE: Implementing a Data Warehouse with SQL Server Implementing a Data Warehouse with SQL Server 2014 Course Name Course Duration Course Structure Course Overview Course Outcome Course Details 20463 Implementing a Data Warehouse with SQL Server 2014 5 Days Instructor-Led (Classroom) This course describes

More information

Processing and Analyzing Streams. CDRs in Real Time

Processing and Analyzing Streams. CDRs in Real Time Processing and Analyzing Streams of CDRs in Real Time Streaming Analytics for CDRs 2 The V of Big Data Velocity means both how fast data is being produced and how fast the data must be processed to meet

More information

Stream Processing. Kai Wähner. as Game Changer for the Internet of

Stream Processing. Kai Wähner. as Game Changer for the Internet of Stream Processing as Game Changer for the Internet of Things Kai Wähner kwaehner@tibco.com @KaiWaehner www.kai-waehner.de LinkedIn / Xing Please connect! Key Messages Streaming Analytics processes Data

More information

Device Data in IoT: Use it Early and Often. Josh Pederson, Director of Product Management Ayla Networks July 25, 2017

Device Data in IoT: Use it Early and Often. Josh Pederson, Director of Product Management Ayla Networks July 25, 2017 Device Data in IoT: Use it Early and Often Josh Pederson, Director of Product Management Ayla Networks July 25, 2017 The IoT is Huge and Growing The Internet of Things is Impacting Every Market 2020 20.4B

More information

Financial Data Enterprise Platform

Financial Data Enterprise Platform BCC Group 2017 DEC Rel. 3.2 EN Financial Data Enterprise Platform 1 Financial Data Enterprise Platform ONE [Cloud] Financial Data [Cloud] Platform 2 Financial Data Enterprise Platform ONE " You never change

More information

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden Leveraging Oracle Big Data Discovery to Master CERN s Data Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden Manuel Martin Marquez Intel IoT Ignition Lab Cloud and

More information

Using Mesos Schedulers with Amazon EC2 Container Service

Using Mesos Schedulers with Amazon EC2 Container Service Using Mesos Schedulers with Amazon EC2 Container Service Ryosuke Iwanaga Solutions Architect, Amazon Web Services Japan July 2016, LinuxCon+ContainerCon Japan 2016, Amazon Web Services, Inc. or its Affiliates.

More information

GUIDE The Enterprise Buyer s Guide to Public Cloud Computing

GUIDE The Enterprise Buyer s Guide to Public Cloud Computing GUIDE The Enterprise Buyer s Guide to Public Cloud Computing cloudcheckr.com Enterprise Buyer s Guide 1 When assessing enterprise compute options on Amazon and Azure, it pays dividends to research the

More information

Transforming Big Data to Business Benefits

Transforming Big Data to Business Benefits Transforming Big Data to Business Benefits Automagical EDW to Big Data Migration BI at the Speed of Thought Stream Processing + Machine Learning Platform Table of Contents Introduction... 3 Case Study:

More information

Oracle Retail Data Model (ORDM) Overview

Oracle Retail Data Model (ORDM) Overview Oracle Retail Data Model (ORDM) Overview May, 2014 Content Retail Business Intelligence Key Trends Retail Industry Findings Foundation for Business Information Flows Retail is being Redefined Challengers

More information

The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure

The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure The Internet of Things Wind Turbine Predictive Analytics Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure Big Data and Tribo-Analytics Today we will see how Fluitec solved real-world challenges

More information

Scaling up MATLAB Analytics with Kafka and Cloud Services

Scaling up MATLAB Analytics with Kafka and Cloud Services Scaling up Analytics with Kafka and Cloud Services Olof Larsson 2015 The MathWorks, Inc. 1 Agenda 1 Access and Explore Data 2 Preprocess Data 3 Develop Predictive Models Integrate with Production 5 Visualize

More information

PORTFOLIO AND TECHNOLOGY DIRECTION ARMISTEAD SAPP & RANDY GUARD

PORTFOLIO AND TECHNOLOGY DIRECTION ARMISTEAD SAPP & RANDY GUARD PORTFOLIO AND TECHNOLOGY DIRECTION ARMISTEAD SAPP & RANDY GUARD FOCUS MARKETS SAS Addressable Market Size $US Billions $14.7 2015 2019 $10.6 $9.6 $7.0 $7.9 $5.0 $2.6 $3.7 $5.7 $4.4 $3.0 $4.2 BUSINESS INTELLIGENCE

More information

Taking the Pain Out of Developing and Deploying Streaming Applications. Craig Blitz, Senior Product Director Gerard Maas, Senior SW Engineer

Taking the Pain Out of Developing and Deploying Streaming Applications. Craig Blitz, Senior Product Director Gerard Maas, Senior SW Engineer Taking the Pain Out of Developing and Deploying Streaming Applications Craig Blitz, Senior Product Director Gerard Maas, Senior SW Engineer Disclaimer Lightbend Inc 2018. All Rights Reserved The information

More information

Azure Data Factory Hybrid data integration, at global scale. Erika Harris Senior Program Manager AzureCAT

Azure Data Factory Hybrid data integration, at global scale. Erika Harris Senior Program Manager AzureCAT Azure Data Factory Hybrid data integration, at global scale Erika Harris Senior Program Manager AzureCAT Data Cloud AI There are barriers to getting value from data Data silos Incongruent data types Complexity

More information

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud

More information