Making Analytics Viable in Enterprises: Potential routes for Industry 4.0

Size: px
Start display at page:

Download "Making Analytics Viable in Enterprises: Potential routes for Industry 4.0"

Transcription

1 Making Analytics Viable in Enterprises: Potential routes for Industry 4.0 Jorge Sanz Anusha Choori Business Analytics Center National University of Singapore

2 Agenda and Goals Business Analytics as an enabler for Industry 4.0 Cases from the field, typical challenges and lessons-learnt Viable roadmaps for Industrial Sector companies Potential opportunities for Luxembourg Conclusions

3 Industry 4.0 Industry 4.0 The Multi-Faceted Goal Framework Core Capability Key Dimensions in the Framework Source: 2016 Global Industry Survey Industry 4.0: Building the digital enterprise - PWC

4 Some Key Enablers of Industry Capture and process large data sources Collect, retrieve and query data Analyze data (from reporting to prediction) 2. Integrate conventional IT silos more deeply Integrate Information-based Insights into Process Lifecycle 3. Reduce cost-of-ownership for viability Cloud and Everything-as-a-Service

5 The Role of Business Analytics A relatively new discipline that addresses the key enablers Not only for Industrial Segments but also for most other industries realizing some critical capabilities for Industry 4.0: Large-data analytics and enterprise architecture enable a new thinking of production management and factory management Analytical algorithms (some capable of learning from data) will be able to achieve more flexibility and robustness in manufacturing, supply chain and distribution Different forms of cognitive systems to support decision-making are part of the emerging jargon (back to AI, NLP, etc. powered up by big data)

6 Key Imperative: Shorten the solution cycle and reduce costs 1 Exploit new opportunities based on business analytics (from production improvements to new business models) 2 Collect, transmit, analyze large data from devices to monitor and predict / anticipate service needs 3 Shorten the Ideate-to-Monetize value-generation path and reduce cost-of-ownership

7 Knowledge Areas building the BA Domain Business Competence Business Competence Finance, accounting, marketing, supply chain, HR, channels, IT, customer relationship Industry-specific competences: underwriting, fraud, claim life-cycle, product design, wealth management, traffic Modeling and Technology Extending & Emerging Professions Organization Processes Organization Processes The design and transformation of work processes Decision-making processes Strategy processes Operational processes How information improves and innovate processes Modeling and Technology Stochastic Models, Operations Research (and tools: R, SAS, SPSS, ) Data generation sources (eg: Mobile messaging, GPS locator, Surveillance cameras, ATMs, etc) Systems in support of Cognitive and Information processes (Watson, HANA, etc)

8 Business Analytics Applications (Most Active Markets) Source: IDC

9 Projects from NUS Business Analytics Center, 2016 (I) - Text Mining - Social Network and Geospatial Analytics in context of Insurance - Motor Pricing KPIs - Travel Pricing KPIs and exploratory analysis - Cross-sell and up-sell in insurance - POS transactional data - News recommendation engine for high net worth customers - Economic Scenario Stress Testing - PnL Analytics - Cyber Security - Anti Money Laundering - Fraud analytics - Risk assessment model for investigation program - The Future of Audit (1) - The Future of Audit (2) - Optimising maintenance schedule for fleet management - Analysis of Customer Queuing Time & Headcount planning - Predicting High Risk Churn Segments Via Product Usage Data - BlueMix & Watson Analytics - Breakout detection for Hep C patients. - An Analytics Approach to Improve Subscription Rate for Nursing Course (prelim title) - Case Study on Global FP&A Transformation - Balance Sheet Forecasting - IT Tools Comparison - Case Study on Global FP&A Transformation - Sales Forecasting and Tools for Predictive Analytics 9

10 Projects from NUS Business Analytics Center, 2016 (II) - Global logistics cost optimisation - Social Media - Automatic Rostering System - Understanding Family Attitudes and Social Support Networks through Analytics - Deriving Insights from NEHR (National Electronic Health Record) - Developing an accurate model to provide estimates on how long a job should take given the characteristics of the job - ALM Roll-Tagging Prediction - Applications of Analytics to AML Proposing a Risk Classification Model - Analysing High Risk Segments in Auto Loan Portfolio - Supply Chain Optimization Customer-Money Life Cycle - Marketplace analysis - Customer credit risk analysis - Emergency Medical Service (EMS) Ambulance Demand Analytics & Prediction - Sales Management Analytics - HR Analytics - Pricing Assessment Tool based on Analytics - Healthcare analytics - Frequent Attenders to the Emergency Department - Online Analytics - Analysis on overtime cost - Analyzing cancer claims for policy holders 10

11 Projects from NUS Business Analytics Center, 2016 (III) - IoT / Event Analytics in Manufacturing - Data Lake architecture to deliver a virtualization layer for disparate data sources - Market Research - Social Media /Digital Marketing/PR - CRM / Markets - Optimising Endowment Portfolio Performance - Determining optimal level of markdowns through customer segmentation for revenue maximization - Predicting ebay Auction Sales for Laptops - Predicting Airbnb New User Bookings - Predict which hotel type will an Expedia customer book - Analysing Residential Property Prices 11

12 Industry 4.0 Manufacturing Prevailing Challenges Manufacturers could improve preventive and predictive maintenance of different production assets Many manufacturing systems are finding it difficult to collect, aggregate and benefit from data originating from large data sources because of the lack of novel analytical tools and appropriate infrastructure For example, unplanned and excessive downtime of equipment increases this directly affects the operational cost and throughput This requires the utilization of advance prediction tools and algorithms so that data can be systematically processed into information that can explain the uncertainties, breakdowns, failures, short-stops and can thereby make more informed decisions By introducing analytics and more flexible production techniques, manufacturers could boost their productivity by as much as 30 percent promises promises Source: Industrial Insights Report, Accenture Industrial-Internet-Changing-Competitive-Landscape-Industries-2015

13 From Physical to Digital to Analytics Physical World (Entities) Learn & synchronize from physical world: Knowledge extraction & accumulation 1. Cyber Physical Interaction Feedback to the physical world: Production scheduling Maintenance & Adaptation 2. Machine Health Awareness Analytics Computational Space 3. Optimal Decision Support Analytics 2. Machine Health Awareness Analytics Computational Space 3. Optimal Decision Support Analytics 2. Machine Health Awareness Analytics Computational Space 3. Optimal Decision Support Analytics An ensemble of digital life-cycles of entities deployed in different settings

14 yielding opportunities for new business models Industrial companies are moving towards greater digital value-creation, from augmented products to serving digital ecosystems Augmented Digital Product Player Focus on products digitally-endowed (like sensors or communication devices) Complete Solution/ Service Provider Focus on digital products and dataservices; which provide a complete solution for the customer Data Analytics, Content & Platform Integrator Focus on data analytics services; Give access to customers via a dedicated (online) platform (APIs) Integrated Digital Ecosystem Provider Integration of thirdparty partner or competitor products and control systems in a complete customer ecosystem Data Intensity Asset Intensity Source: 2016 Global Industry Survey Industry 4.0: Building the digital enterprise - PwC

15 Predictive Maintenance Analytics -as-a-service Data Transfer Bank and / or owner of ATMs Other Internal motors Receipt Printer Keypad Card Reader Cash Dispenser Cash Deposit Unit Asset data aggregation Monitoring/ Maintenance Embedded Sensors

16 Other forms of Analytics-as-a-Service Internal light sensor Combustion sensor Engine related sensors Front light sensor Exhaust Sensor Mobile Application Manufacturing / Assembly Gas station w/ intelligent appliance Analytics Cloud Repository Dashboard Monitoring

17 Gas Generator Temperature Water Pressure Sensor data from the equipment Humidity Speed Influencing variables Power Gas concentration Response variable Factory equipment 200 sensors on every equipment on the shop floor Sensors emit data at every 500 millisecond interval Temperature Water Pressure Predictive Analytics Gas Concentration Predictive Analytics being performed to understand the underlying the data patterns and to predict the abnormality of gas concentration in equipment. Power

18 But Q.: Are companies ready for more predictive and innovative kinds of solutions? A.: Not yet Current capabilities of Industry 4.0 segments in Analytics 58% of the companies have capabilities to collect data and analyze it Only 40% of the companies can predict based on existing data Fewer still, 36% only, can optimize operations from data insights Source: Based on a survey conducted by GE and Accenture, 2015

19 Three-Tier Scenarios in Business Analytics Analytics and Reporting Real-Time, Near Real-time, Batch / Off-line EDW Analytics & Reporting In-Memory Depending on acceptable latency of decision-making and cost-of-ownership Low-Latency Integration Analytics & Reporting Higher-Latency Integration Different analytical situations in training mode from production Sensor and Other Data Sources ERP Data

20 Managing Industrial data sources and analytics infrastructure - Open Source Infrastructure - Business Analytics Types of analysis Depending on the use-case, the type of analytical approaches differ: Offline Analysis is performed on static data Data Lake Analytics (or Data Store Analytics) Online Analysis is performed on data that is streaming in Edge Analytics Analytics Data Lake Analytics Edge Analytics Hadoop for distributed storage Apache Spark for distributed computing Types of analytics tools (Open Source) Apache Spark Streaming for low latency analytics

21 Managing Industrial Big Data and Analytics Infrastructure Case-Study - Open Source Infrastructure - Industry 4.0 opportunities in the manufacturing unit of a leading packaging and processing company Problem Statement The infrastructure in the organization comprises of ERP (Enterprise Resource Planning) systems, Business Warehouse (BW) units, and traditional transactional databases (MES Manufacturing Execution Systems) for capturing and analyzing sensor data and operations data Data ingestion, storage and processing are all performed in their current environment which consists of traditional data stores Architecture gives very little scope to perform analysis on massive data and near-real time analytics. By adding more BW support, databases, compute power, they run into the risk of paying a HUGE cost Current setup lacks: Infrastructure to ingest/ store massive data Framework to perform large-scale data analytics

22 Industry 4.0 Business Analytics A case-study in the manufacturing unit of a leading packaging and processing company Solution Overview Existing infrastructure Proposed Infrastructure for Business Analytics Status Quo Descriptive Diagnostic Predictive Prescriptive Proposed architecture Processing and Analysis (Business Warehouse) Processing (Central + Distributed) - Spark Storage and EDW Storage(Central + Distributed) HDFS Data Input (Batch into SQL Server) Data Ingestion (Kafka) Machine and Sensor data On-Line Basic Processing Machine Data and Sensor data The right-hand side depicts the overall solution overview to analyze both offline (historical data) and near-real time data

23 Industry 4.0 Business Analytics A case-study in the manufacturing unit of a leading packaging and processing company Solution Overview Alert to Operator Sensor logs Proposed End-to-End Infrastructure

24 Current Infrastructure Pressure Temperature Water temp Why Apache Hadoop? MES (Manufacturing Execution System) Shop floor system Data reflected after 24 hours Data Acquisition System SQL Server Humidity Speed Ambient concentration - Bounded by the actual size of the database - May need to perform truncation - Cannot support unstructured data - Scope for analytics is reduced Appropriately routed to reach HDFS cluster in Hadoop Scalable No license fees Distributed storage Supports structured and unstructured data Proposed Infrastructure

25 What is Apache Hadoop? What is - Apache Hadoop? Apache Hadoop is an open source framework which was built for: Distributed Storage HDFS (Hadoop Distributed File System) Distributed Processing Map / Reduce HDFS stores large files (structured and unstructured data) across several machines (laptops), PC s and commodity servers Even though the data is spread across several machines in the cluster, the user is still guaranteed a universal view of the data this is possible via a single management interface Inexpensive Storage Without the hassle of purchasing or licensing specialized hardware Having the capacity to store structured and unstructured data originating from sensors Ability to scale easily No compromise on data storage No truncation of data points originally captured Preliminary Analysis Seamless integration with developer systems Universal access of stored data within the cluster

26 Analytics Infrastructure Why Hadoop? Problem Statement The existing infrastructure cannot store petabytes of sensor data (there are about sensor tags in each part of the equipment on the shop floor with the sensors emitting data signals for every 500 milliseconds) As a result, it becomes challenging to perform even simple off-line analysis on ALL the sensor data In addition, the organization does not want to incur additional licensing fees and while cloud subscription fees are more affordable, data security concerns and latency of the needed data upstreaming to perform analytics are caveats Proposed Solution The COST of storing all the individual sensor values from all the factory equipment on the shop floor needs an inexpensive storage which also has the ability to scale as and when needed Moreover, since the organization does not want to incur additional costs of purchasing special hardware to host petabytes of sensor data, the best solution would be to choose an open source distributed data sink which can be easily installed on commodity, inexpensive hardware and which can inherently scale up as more machines are added to the cluster Hence, Apache Hadoop was chosen as the data store to host sensor data and to perform OFFLINE analytics

27 Analytics Infrastructure Why Apache Spark? Problem Statement Hadoop can store petabytes of machine sensor data (both structured and unstructured) but when it comes to complex data analytics over that massive VOLUME of distributed data, Hadoop falls short in performing an efficient and quick computation The existing MapReduce Operations do not fare well when the user is joining two or more large datasets with several complex join conditions. Overall, MapReduce tasks generate a lot of overhead by re-reading and parsing data which reduces its overall computational efficiency to a LARGE extent even for off-line analysis In addition, building complex models or applications in Hadoop requires deep Java programming skillsets Proposed Solution Keeping the volume of distributed data in mind, the natural choice to pick an open source framework to perform efficient, parallelized computing is APACHE SPARK The main advantage of adopting Spark is that it can also run on commodity hardware by pooling ALL the memory of the available machines in the cluster and assigning jobs to them and orchestrates the execution in parallel thereby saving time, cost and improving its efficiency and lowering its execution time.

28 Apache Spark for Business Analytics in Industry 4.0 What is Apache Spark? Apache Spark is an open source cluster-computing framework which was built to overcome the limitations of Hadoop s MapReduce computing framework Spark was mainly built to achieve: Parallelism in data operations Distributed computing across a cluster of RAM s which are available in the cluster Fault tolerance Scalability Blends in with Hadoop Apache Spark Indispensable Components Spark SQL Spark MLlib GraphX Spark Streaming Spark Core

29 Apache Spark Streaming BA Infrastructure for Industry 4.0 Problem Statement Data in a Streaming Analytics environment is processed (on-line) before it lands in a database Currently, in the manufacturing unit of a leading packaging and processing industry, machine sensor data is being stored after a significant change is detected in their data acquisition system In addition, this data is truncated sensor readings which may suggest the working status of the equipment in the future are lost as a process when the data hits the database Lack of real-time streaming analytics to predict alerts with more anticipation Proposed Solution What if this sensor data is analyzed as it is streaming in? Spark Streaming And, what if decisions were made before it hits the database? Spark Streaming Analyzed industrial big-data can then be made to flow into a database of their choice (SQL Server), a distributed database (Cassandra, HBase) or into a distributed file system (HDFS).

30 Apache Spark Streaming Apache Spark Streaming Analyzing data streams in real time, streams of real-time sensor data instead of large, dataintensive batch jobs on a daily basis

31 Problem Statement Apache Kafka BA Infrastructure for Industry 4.0 Use-case in a real life scenario: The leading packaging and processing firm has about 200 sensors in every equipment and machinery they own. These innumerable sensors emit data at every 500 millisecond interval this data has to be correctly captured, queued, analyzed and stored for further analysis to happen In cases there are many real-time applications that consume data from 1000s of these sensors for reporting and analytics, it becomes a criss-crossed and random way of requesting data from sensors Add to that, there is a risk of losing data mid-way and listening to the wrong sensor reading or listening to the messages which are coming out-of-order Proposed Solution Apache Kafka can simplify the current messaging architecture by using a Producer-Consumer approach and orchestrates messaging services by acting as a broker. The coordination, replication, fault-tolerance, partitioning and parallelism of this architecture are taken care of by the Kafka server entirely. Producers publish to TOPICS Kafka orchestrates messaging Subscribers listen to these TOPICS

32 Producers Kafka Broker Zookeeper Apache Kafka Topic 1 Topic 2 Partition Consumers Streaming applications 0 1 Partition 2 Database Partition Database Partition Analytics Modelling

33 Apache Kafka s role in Manufacturing Apache Kafka for Industry 4.0/ IoT Use cases in IIoT (Industrial Internet of Things) - Real time stream processing (coupled with Spark Streaming) - General purpose message bus - Collecting user activity data - Collecting operational metrics from sensors, applications, servers or devices - Log aggregation - Change data capture - Maintaining a commit log for distributed systems Source: Confluent

34 Cloud Strategy Reduce cost-of-ownership, simplify management of IT operations, and shorten the path from invention to delivery of new applications Develop a new business model opportunity by creating a domain-specific service-suite accessible to subscribers or pay-per-use through APIs Cloud for Analytics Capabilities is a very important option to manage the complexity of Business Analytics infrastructure Scalability, High Availability Data Model Flexibility, Data Mobility Seamless work with an ecosystem of apps and tools Built-in analytical tools support for faster and efficient data analysis on-line and off-line CRITICAL: smooth integration with ERP capabilities, thus facilitating better bridges between process management and information life-cycle in the industry 4.0 enterprise

35 Offerings for key open source BA infrastructure IBM BigInsights Available on-premises, on-cloud, and integrated with other systems in use today IBM BigInsights On-premises version IBM BigInsights Analyst Big SQL BigSheets IBM BigInsights Data Scientist Text Analytics Machine Learning on Big R Big R (R support) Big SQL BigSheets... Free Quick Start (non production): IBM Open Platform BigInsights Analyst, Data Scientist features Community support IBM BigInsights Enterprise Management POSIX Distributed File system Multi-workload, multi-tenant scheduling IBM Open Platform with Apache Hadoop (HDFS, YARN, MapReduce, Ambari, Flume, HBase, Hive, Kafka, Knox, Oozie, Pig, Slider, Solr, Spark, Sqoop, Zookeeper)

36 The Open Source on Cloud by SAP SAP HANA Vora SAP HANA + Vora + Hadoop (also on premises) SAP HANA Vora integrates SAP HANA data with data lakes(hadoop) Seamless integration with HANA + Spark + Hadoop One can archive ERP data from HANA to Hadoop Integrated BI

37 More ICT offerings for key BA infrastructure on Cloud MapR, Hortonworks, Cloudera On premises and Cloud On premises and Cloud On premises and Cloud Microsoft Azure HDInsight Cloud Service

38 Offerings for key Open Source BA infrastructure Microsoft Cloud Service Azure HDInsight: Components offered: Apache Hadoop/ YARN Apache Tez Apache Pig Apache Hive Apache Hbase Apache Sqoop Apache Oozie Apache Zookeeper Apache Storm Apache Mahout Apache Spark

39 Amazon Web Services Support for Open Source Capabilities on the Cloud Hadoop Elastic Map Reduce Apache Spark Elastic Map Reduce HDFS is automatically installed with Hadoop on Amazon s EMR(Elastic Map Reduce) cluster EMR = Managed service Hadoop Framework by Amazon Spark is also supported by Amazon EMR cluster The in-memory caching, optimized execution, general batch processing, streaming analytics, machine learning, graph databases and ad-hoc queries are all supported on cloud by Amazon EMR and EC2(Elastic Cloud Compute) Amazon EMR is easy to tune in for clusters and helps reduce infrastructure maintenance and operational costs Supports multiple data stores Since it is elastic, one can provision 100s and 1000s of compute instances to process large datasets (increase and decrease the number of instances) Amazon Elastic Cloud Compute(EC2) is a web service that provides resizable compute capacity in the cloud Source: Source:

40 Commercial vehicles for delivering viable BA solutions IBM Predictive Maintenance Predict Asset Failure/ Extend Life: Determine failure based on usage characteristics Identify conditions that lead to high failure Predict Part Quality Detect anomalies within the process Conduct in-depth root cause analysis

41 Analytics on dispersed sources of structured and unstructured data IBM Watson Explorer On premises and in cloud

42 Commercial vehicles for delivering viable BA solutions IBM Streams On premises and in cloud

43 Commercial vehicles for delivering viable BA solutions SAP HANA and Analytics SAP offers near-real time in-memory computing capabilities and efficient reporting through HANA Applications Cloud Applications Analytics Excel IoT Mobile/ Web API SAP HANA Platform Web Server JavaScript, SQLScript, SQL Spatial Search Text Mining Stored Procedure & Data Models Application & UI services Business Function Library Predictive Analytics Library Database Services Planning Engine Rules Engine Planning Engine Transaction Unstructured Machine Hadoop Real-time Location Other Apps

44 Commercial vehicles for delivering viable BA solutions SAP Smart Data Streaming On premises

45 Opportunity for Luxembourg: High specialization in selected capabilities leading to new ICT Solutions with impact to Industry 4.0 Business Analytics Reference Architectures Infrastructure Research and Innovation Security of Network Systems Legal Framework Architectures integrating process and big data. Achitectures for Cloudbased Applications and Services Affordable options for big data and analytics needs Create a sandbox of new and custom algorithms for quick PoCs. Define an API App Cloud for Industry 4.0 How to use data analytics for better security GDPR and data confidentiality assurance in Industry 4.0 Training and Education Others Programs in BA for executives, managers and engineers Proper funding for collaboration between start-ups / R&D / industry Key Topics

46 Paths to make BA viable for Industry Focus on specific areas where large data sources and analytics may lead to operational savings and new business Do not boil the ocean by making exotic mega-plans or tough ROI cases Start simple with quick wins for exec management buy-in 2. Get help to assess the viability of the initiative very quickly (technically and financially) If your organization does not have the specific skills, do not rely on internal-only assessments (good IT or Engineering does not mean that they will know BA) Partnership with an R&D organization that can help you assess (for example, the FEDER project in LIST) 3. Use ICT third-party rented infrastructure to discover and validate solutions, architecture, options in depth 4. If you can afford to do some BA work based on internal infrastructure, test ideas by using simple tools Open tools are appealing for zero-cost license but the skills needed to use them properly and maintain an informal development environment are very specialized Partner with an organization that can help you define a good architecture for the solution toward a fast No / No-go PoC (i.e., fail fast and cheaply)

47 Open source software - Considerations Programming Language Support Runtime Considerations Platform Support OS Hadoop Java Distributed disk access No firewall between intended systems JDK Java Linux Windows Mac OS Spark Scala, Java, Python, R via SparkR Distributed memory access No firewall between intended systems ICMP protocol should not be blocked JDK Java Linux Windows Mac OS Spark Streaming Scala, Java, Python Distributed memory access System ports should not be blocked by firewall JDK Java Linux Windows Mac OS Kafka Several including Scala, Java, Python, stdin/stdout System ports should not be blocked by firewall JDK Java Zookeeper Linux Windows Mac OS SAP Smart Data Streaming CCL Smart Data Streaming should be hosted on a separate server [TBD] Linux

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

Spotlight Sessions. Nik Rouda. Director of Product Marketing Cloudera, Inc. All rights reserved. 1

Spotlight Sessions. Nik Rouda. Director of Product Marketing Cloudera, Inc. All rights reserved. 1 Spotlight Sessions Nik Rouda Director of Product Marketing Cloudera @nrouda Cloudera, Inc. All rights reserved. 1 Spotlight: Protecting Your Data Nik Rouda Product Marketing Cloudera, Inc. All rights reserved.

More information

EXAMPLE SOLUTIONS Hadoop in Azure HBase as a columnar NoSQL transactional database running on Azure Blobs Storm as a streaming service for near real time processing Hadoop 2.4 support for 100x query gains

More information

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Rohit Bakhshi, Solution Architect, Hortonworks Jim Walker, Director Product Marketing, Talend Page 1 About Us Rohit Bakhshi Solution

More information

5th Annual. Cloudera, Inc. All rights reserved.

5th Annual. Cloudera, Inc. All rights reserved. 5th Annual 1 The Essentials of Apache Hadoop The What, Why and How to Meet Agency Objectives Sarah Sproehnle, Vice President, Customer Success 2 Introduction 3 What is Apache Hadoop? Hadoop is a software

More information

Business is being transformed by three trends

Business is being transformed by three trends Business is being transformed by three trends Big Cloud Intelligence Stay ahead of the curve with Cortana Intelligence Suite Business apps People Custom apps Apps Sensors and devices Cortana Intelligence

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

E-guide Hadoop Big Data Platforms Buyer s Guide part 1 Hadoop Big Data Platforms Buyer s Guide part 1 Your expert guide to Hadoop big data platforms for managing big data David Loshin, Knowledge Integrity Inc. Companies of all sizes can use Hadoop, as vendors

More information

Big data is hard. Top 3 Challenges To Adopting Big Data

Big data is hard. Top 3 Challenges To Adopting Big Data Big data is hard Top 3 Challenges To Adopting Big Data Traditionally, analytics have been over pre-defined structures Data characteristics: Sales Questions answered with BI and visualizations: Customer

More information

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Roger Ding Cloudera February 3rd, 2018 1 Agenda Hadoop History Introduction to Apache Hadoop

More information

Hortonworks Connected Data Platforms

Hortonworks Connected Data Platforms Hortonworks Connected Data Platforms MASTER THE VALUE OF DATA EVERY BUSINESS IS A DATA BUSINESS EMBRACE AN OPEN APPROACH 2 Hortonworks Inc. 2011 2016. All Rights Reserved Data Drives the Connected Car

More information

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. Course Content Course Description: The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. At Course Completion: After competing this course,

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Duration: 5 days; Instructor-led Implement Spark Streaming Using the DStream API. Develop Big Data Real-Time Processing Solutions with Apache

More information

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW TOPICS COVERED 1 2 Fundamentals of Big Data Platforms Major Big Data Tools Scaling Up vs. Out SCALE UP (SMP) SCALE OUT (MPP) + (n) Upgrade

More information

MapR: Solution for Customer Production Success

MapR: Solution for Customer Production Success 2015 MapR Technologies 2015 MapR Technologies 1 MapR: Solution for Customer Production Success Big Data High Growth 700+ Customers Cloud Leaders Riding the Wave with Hadoop The Big Data Platform of Choice

More information

20775: Performing Data Engineering on Microsoft HD Insight

20775: Performing Data Engineering on Microsoft HD Insight Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

How In-Memory Computing can Maximize the Performance of Modern Payments

How In-Memory Computing can Maximize the Performance of Modern Payments How In-Memory Computing can Maximize the Performance of Modern Payments 2018 The mobile payments market is expected to grow to over a trillion dollars by 2019 How can in-memory computing maximize the performance

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Course Details Course Code: Duration: Notes: 20775A 5 days This course syllabus should be used to determine whether the course is appropriate

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information

Spark and Hadoop Perfect Together

Spark and Hadoop Perfect Together Spark and Hadoop Perfect Together Arun Murthy Hortonworks Co-Founder @acmurthy Data Operating System Enable all data and applications TO BE accessible and shared BY any end-users Data Operating System

More information

Cloud Based Analytics for SAP

Cloud Based Analytics for SAP Cloud Based Analytics for SAP Gary Patterson, Global Lead for Big Data About Virtustream A Dell Technologies Business 2,300+ employees 20+ data centers Major operations in 10 countries One of the fastest

More information

Modernizing Your Data Warehouse with Azure

Modernizing Your Data Warehouse with Azure Modernizing Your Data Warehouse with Azure Big data. Small data. All data. Christian Coté S P O N S O R S The traditional BI Environment The traditional data warehouse data warehousing has reached the

More information

Preface About the Book

Preface About the Book Preface About the Book We are living in the dawn of what has been termed as the "Fourth Industrial Revolution" by the World Economic Forum (WEF) in 2016. The Fourth Industrial Revolution is marked through

More information

IBM Analytics Unleash the power of data with Apache Spark

IBM Analytics Unleash the power of data with Apache Spark IBM Analytics Unleash the power of data with Apache Spark Agility, speed and simplicity define the analytics operating system of the future 1 2 3 4 Use Spark to create value from data-driven insights Lower

More information

BIG DATA AND HADOOP DEVELOPER

BIG DATA AND HADOOP DEVELOPER BIG DATA AND HADOOP DEVELOPER Approximate Duration - 60 Hrs Classes + 30 hrs Lab work + 20 hrs Assessment = 110 Hrs + 50 hrs Project Total duration of course = 160 hrs Lesson 00 - Course Introduction 0.1

More information

20775 Performing Data Engineering on Microsoft HD Insight

20775 Performing Data Engineering on Microsoft HD Insight Duración del curso: 5 Días Acerca de este curso The main purpose of the course is to give students the ability plan and implement big data workflows on HD. Perfil de público The primary audience for this

More information

Big Data Introduction

Big Data Introduction Big Data Introduction Who we are Experts At Your Service Over 50 specialists in IT infrastructure Certified, experienced, passionate Based In Switzerland 100% self-financed Swiss company Over CHF8 mio.

More information

Common Customer Use Cases in FSI

Common Customer Use Cases in FSI Common Customer Use Cases in FSI 1 Marketing Optimization 2014 2014 MapR MapR Technologies Technologies 2 Fortune 100 Financial Services Company 104M CARD MEMBERS 3 Financial Services: Recommendation Engine

More information

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?

More information

WELCOME TO. Cloud Data Services: The Art of the Possible

WELCOME TO. Cloud Data Services: The Art of the Possible WELCOME TO Cloud Data Services: The Art of the Possible Goals for Today Share the cloud-based data management and analytics technologies that are enabling rapid development of new mobile applications Discuss

More information

Maturing IoT solutions with Microsoft Azure. Glenn Colpaert Azure/IoT Domain

Maturing IoT solutions with Microsoft Azure. Glenn Colpaert Azure/IoT Domain Maturing IoT solutions with Microsoft Azure Glenn Colpaert Azure/IoT Domain Lead @GlennColpaert Who we are 2000 Belgium 2004 France 2013 Portugal 2016 Switzerland 2016 UK 2016 The Netherlands 2017 Malta

More information

Insights-Driven Operations with SAP HANA and Cloudera Enterprise

Insights-Driven Operations with SAP HANA and Cloudera Enterprise Insights-Driven Operations with SAP HANA and Cloudera Enterprise Unleash your business with pervasive Big Data Analytics with SAP HANA and Cloudera Enterprise The missing link to operations As big data

More information

Hortonworks Data Platform

Hortonworks Data Platform Hortonworks Data Platform An open-architecture platform to manage data in motion and at rest Highlights Addresses a range of data-at-rest use cases Powers real-time customer applications Delivers robust

More information

Monetizing Data. Creating Wealth through Analytics Powered Digital Culture. Narayanan Ramanathan (NR) Chief Digital Officer & Global Head

Monetizing Data. Creating Wealth through Analytics Powered Digital Culture. Narayanan Ramanathan (NR) Chief Digital Officer & Global Head Monetizing Data Creating Wealth through Analytics Powered Digital Culture Narayanan Ramanathan (NR) Chief Digital Officer & Global Head Restricted Circulation L&T Technology Services 2018 Exciting Facts

More information

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand Paper 2698-2018 Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand ABSTRACT Digital analytics is no longer just about tracking the number

More information

Outline of Hadoop. Background, Core Services, and Components. David Schwab Synchronic Analytics Nov.

Outline of Hadoop. Background, Core Services, and Components. David Schwab Synchronic Analytics   Nov. Outline of Hadoop Background, Core Services, and Components David Schwab Synchronic Analytics https://synchronicanalytics.com Nov. 1, 2018 Hadoop s Purpose and Origin Hadoop s Architecture Minimum Configuration

More information

Cask Data Application Platform (CDAP)

Cask Data Application Platform (CDAP) Cask Data Application Platform (CDAP) CDAP is an open source, Apache 2.0 licensed, distributed, application framework for delivering Hadoop solutions. It integrates and abstracts the underlying Hadoop

More information

Big Data The Big Story

Big Data The Big Story Big Data The Big Story Jean-Pierre Dijcks Big Data Product Mangement 1 Agenda What is Big Data? Architecting Big Data Building Big Data Solutions Oracle Big Data Appliance and Big Data Connectors Customer

More information

Insights to HDInsight

Insights to HDInsight Insights to HDInsight Why Hadoop in the Cloud? No hardware costs Unlimited Scale Pay for What You Need Deployed in minutes Azure HDInsight Big Data made easy Enterprise Ready Easier and more productive

More information

Microsoft Big Data. Solution Brief

Microsoft Big Data. Solution Brief Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,

More information

MapR Pentaho Business Solutions

MapR Pentaho Business Solutions MapR Pentaho Business Solutions The Benefits of a Converged Platform to Big Data Integration Tom Scurlock Director, WW Alliances and Partners, MapR Key Takeaways 1. We focus on business values and business

More information

Big Data and Business Analytics: Accelerating Digital Transformation in Enterprises and Industries

Big Data and Business Analytics: Accelerating Digital Transformation in Enterprises and Industries Big Data and Business Analytics: Accelerating Digital Transformation in Enterprises and Industries James PANG, Ph.D. Visiting Associate Professor School of Computing National University of Singapore Jorge

More information

TechArch Day Digital Decoupling. Oscar Renalias. Accenture

TechArch Day Digital Decoupling. Oscar Renalias. Accenture TechArch Day 2018 Digital Decoupling Oscar Renalias Accenture !"##$ oscar.renalias@acenture.com @oscarrenalias https://www.linkedin.com/in/oscarrenalias/ https://github.com/accenture THE ERA OF THE BIG

More information

Apache Hadoop in the Datacenter and Cloud

Apache Hadoop in the Datacenter and Cloud Apache Hadoop in the Datacenter and Cloud The Shift to the Connected Data Architecture Digital Transformation fueled by Big Data Analytics and IoT ACTIONABLE INTELLIGENCE Cloud and Data Center IDMS Relational

More information

The Importance of good data management and Power BI

The Importance of good data management and Power BI The Importance of good data management and Power BI The BI Iceberg Visualising Data is only the tip of the iceberg Data Preparation and provisioning is a complex process Streamlining this process is key

More information

Building data-driven applications with SAP Data Hub and Amazon Web Services

Building data-driven applications with SAP Data Hub and Amazon Web Services Building data-driven applications with SAP Data Hub and Amazon Web Services Dr. Lars Dannecker, Steffen Geissinger September 18 th, 2018 Cross-department disconnect Cross-department disconnect Cross-department

More information

Our Emerging Offerings Differentiators In-focus

Our Emerging Offerings Differentiators In-focus Our Emerging Offerings Differentiators In-focus Agenda 1 Dotbits 2 Dotbits@US ; Dotbits@India 3 Differentiators and Key Trends 4 Solutions and Service Offerings 5 Representative Experiences Page 2 Dotbits

More information

Analytics Platform System

Analytics Platform System Analytics Platform System Big data. Small data. All data. Audie Wright, DW & Big Data Specialist Audie.Wright@Microsoft.com Ofc 425-538-0044, Cell 303-324-2860 Sean Mikha, DW & Big Data Architect semikha@microsoft.com

More information

SAP Predictive Analytics Suite

SAP Predictive Analytics Suite SAP Predictive Analytics Suite Tania Pérez Asensio Where is the Evolution of Business Analytics Heading? Organizations Are Maturing Their Approaches to Solving Business Problems Reactive Wait until a problem

More information

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia MapR: Converged Data Pla3orm and Quick Start Solu;ons Robin Fong Regional Director South East Asia Who is MapR? MapR is the creator of the top ranked Hadoop NoSQL SQL-on-Hadoop Real Database time streaming

More information

Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved.

Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved. Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved. 1 SAP Big Data Webinar Series Big Data - Introduction to SAP Big Data Technologies Big Data - Streaming Analytics Big Data - Smarter

More information

Active Analytics Overview

Active Analytics Overview Active Analytics Overview The Fourth Industrial Revolution is predicated on data. Success depends on recognizing data as the most valuable corporate asset. From smart cities to autonomous vehicles, logistics

More information

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN From Information to Insight: The Big Value of Big Data Faire Ann Co Marketing Manager, Information Management Software, ASEAN The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT

More information

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud

More information

Azure IoT Suite. Secure device connectivity and management. Data ingestion and command + control. Rich dashboards and visualizations

Azure IoT Suite. Secure device connectivity and management. Data ingestion and command + control. Rich dashboards and visualizations Azure IoT Suite Secure device connectivity and management Data ingestion and command + control Rich dashboards and visualizations Business workflow integration Move beyond building blocks with pre-configured

More information

Industrial IoT Solution Architecture Design From Connectivity to Data

Industrial IoT Solution Architecture Design From Connectivity to Data Industrial IoT Solution Architecture Design From Connectivity to Data Cheryl Hsu Program Manager Strategic Engagement & Industrial IoT, Microsoft IoT Enables a Digital Feedback Loop The benefits are profound

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 3

E-guide Hadoop Big Data Platforms Buyer s Guide part 3 Big Data Platforms Buyer s Guide part 3 Your expert guide to big platforms enterprise MapReduce cloud-based Abie Reifer, DecisionWorx The Amazon Elastic MapReduce Web service offers a managed framework

More information

Databricks Cloud. A Primer

Databricks Cloud. A Primer Databricks Cloud A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to

More information

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved. Apache Spark 2.0 GA The General Engine for Modern Analytic Use Cases 1 Apache Spark Drives Business Innovation Apache Spark is driving new business value that is being harnessed by technology forward organizations.

More information

SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform

SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform CREATE STREAMING ANALYTICS APPLICATIONS IN MINUTES WITHOUT WRITING CODE The increasing growth

More information

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect 2005 Concert de Coldplay 2014 Concert de Coldplay 90% of the world s data has been created over the last two years alone 1 1. Source

More information

Bluemix Overview. Last Updated: October 10th, 2017

Bluemix Overview. Last Updated: October 10th, 2017 Bluemix Overview Last Updated: October 10th, 2017 Agenda Overview Architecture Apps & Services Cloud Computing An estimated 85% of new software is being built for cloud deployment Cloud Computing is a

More information

Hybrid Data Management

Hybrid Data Management Kelly Schlamb Executive IT Specialist, Worldwide Analytics Platform Enablement and Technical Sales (kschlamb@ca.ibm.com, @KSchlamb) Hybrid Data Management IBM Analytics Summit 2017 November 8, 2017 5 Essential

More information

Intel Public Sector 3

Intel Public Sector 3 Intel technologies features and benefits depend on system configuration and may require enabled hardware, software or service activation. Performance varies depending on system configuration. No computer

More information

Access, Transform, and Connect Data with SAP Data Services Software

Access, Transform, and Connect Data with SAP Data Services Software SAP Brief SAP s for Enterprise Information Management SAP Data Services Access, Transform, and Connect Data with SAP Data Services Software SAP Brief Establish an enterprise data integration and data quality

More information

SAP Big Data. Markus Tempel SAP Big Data and Cloud Analytics Services

SAP Big Data. Markus Tempel SAP Big Data and Cloud Analytics Services SAP Big Data Markus Tempel SAP Big Data and Cloud Analytics Services Is that Big Data? 2015 SAP AG or an SAP affiliate company. All rights reserved. 2 What if you could turn new signals from Big Data into

More information

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics ADVANCED ANALYTICS & IOT ARCHITECTURES Presented by: Orion Gebremedhin Director of Technology, Data & Analytics Marc Lobree National Architect, Advanced Analytics EDW THE RIGHT TOOL FOR THE RIGHT WORKLOAD

More information

Angat Pinoy. Angat Negosyo. Angat Pilipinas.

Angat Pinoy. Angat Negosyo. Angat Pilipinas. Angat Pinoy. Angat Negosyo. Angat Pilipinas. Four megatrends will dominate the next decade Mobility Social Cloud Big data 91% of organizations expect to spend on mobile devices in 2012 In 2012, mobile

More information

IoT ENABLED INTELLIGENT FLEET MANAGEMENT. Kalman Tiboldi Chief Business Innovation Officer

IoT ENABLED INTELLIGENT FLEET MANAGEMENT. Kalman Tiboldi Chief Business Innovation Officer IoT ENABLED INTELLIGENT FLEET MANAGEMENT Kalman Tiboldi Chief Business Innovation Officer TVH GROUP > 5600 colleagues worldwide Consolidated turnover 1,3 billion SMART LOGISTICS PART OF INDUSTRY 4.0 Smart

More information

Two offerings which interoperate really well

Two offerings which interoperate really well Microsoft Two offerings which interoperate really well On-premises Cortana Intelligence Suite SQL Server 2016 Cloud IAAS Enterprise PAAS Cloud Storage Service 9 SQL Server 2016: Everything built-in built-in

More information

Architecture Overview for Data Analytics Deployments

Architecture Overview for Data Analytics Deployments Architecture Overview for Data Analytics Deployments Mahmoud Ghanem Sr. Systems Engineer GLOBAL SPONSORS Agenda The Big Picture Top Use Cases for Data Analytics Modern Architecture Concepts for Data Analytics

More information

Predictive Analytics Reimagined for the Digital Enterprise

Predictive Analytics Reimagined for the Digital Enterprise SAP Brief SAP BusinessObjects Analytics SAP BusinessObjects Predictive Analytics Predictive Analytics Reimagined for the Digital Enterprise Predicting and acting in a business moment through automation

More information

Transforming IIoT Data into Opportunity with Data Torrent using Apache Apex

Transforming IIoT Data into Opportunity with Data Torrent using Apache Apex CASE STUDY Transforming IIoT Data into Opportunity with Data Torrent using Apache Apex DataTorrent delivers better business outcomes for customers using industrial of things (IIoT) data Challenge The industrial

More information

AZURE HDINSIGHT. Azure Machine Learning Track Marek Chmel

AZURE HDINSIGHT. Azure Machine Learning Track Marek Chmel AZURE HDINSIGHT Azure Machine Learning Track Marek Chmel SESSION AGENDA Understanding different scenarios of Hadoop Building an end to end pipeline using HDInsight Using in-memory techniques to analyze

More information

Analytic Alphabet Soup: IoT, AI & ESP Big Data Analytics is a game changer in our Connected World

Analytic Alphabet Soup: IoT, AI & ESP Big Data Analytics is a game changer in our Connected World Analytic Alphabet Soup: IoT, AI & ESP Big Data Analytics is a game changer in our Connected World Eric Hunley Director Enterprise Solutions SAS Great leaders are almost always great simplifiers, who can

More information

SUSiEtec The Application Ready IoT Framework. Create your path to digitalization while predictively addressing your business needs

SUSiEtec The Application Ready IoT Framework. Create your path to digitalization while predictively addressing your business needs SUSiEtec The Application Ready IoT Framework Create your path to digitalization while predictively addressing your business needs Industry 4.0 trends and vision Transform every aspect of the manufacturing

More information

Operational Hadoop and the Lambda Architecture for Streaming Data

Operational Hadoop and the Lambda Architecture for Streaming Data Operational Hadoop and the Lambda Architecture for Streaming Data 2015 MapR Technologies 2015 MapR Technologies 1 Topics From Batch to Operational Workloads on Hadoop Streaming Data Environments The Lambda

More information

Konica Minolta Business Innovation Center

Konica Minolta Business Innovation Center Konica Minolta Business Innovation Center Advance Technology/Big Data Lab May 2016 2 2 3 4 4 Konica Minolta BIC Technology and Research Initiatives Data Science Program Technology Trials (Technology partner

More information

Transforming Analytics with Cloudera Data Science WorkBench

Transforming Analytics with Cloudera Data Science WorkBench Transforming Analytics with Cloudera Data Science WorkBench Process data, develop and serve predictive models. 1 Age of Machine Learning Data volume NO Machine Learning Machine Learning 1950s 1960s 1970s

More information

Data Science, realizing the Hype Cycle. Luigi Di Rito, Director Data Science Team, SAP Center of Excellence

Data Science, realizing the Hype Cycle. Luigi Di Rito, Director Data Science Team, SAP Center of Excellence Data Science, realizing the Hype Cycle. Luigi Di Rito, Director Data Science Team, SAP Center of Excellence Data Science, Machine Learning and Artificial Intelligence Deep Learning AREAS OF AI Rule-based

More information

ADVANCED ANALYTICS & IOT ARCHITECTURES

ADVANCED ANALYTICS & IOT ARCHITECTURES ADVANCED ANALYTICS & IOT ARCHITECTURES Presented by: Orion Gebremedhin Director of Technology, Data & Analytics Marc Lobree National Architect, Advanced Analytics EDW THE RIGHT TOOL FOR THE RIGHT WORKLOAD

More information

Enterprise Analytics Accelerating Your Path to Value with an Open Analytics Platform

Enterprise Analytics Accelerating Your Path to Value with an Open Analytics Platform Enterprise Analytics Accelerating Your Path to Value with an Open Analytics Platform Federico Pozzi @fedealbpozzi Mathias Coopmans @macoopma Characteristics of a badly managed platform No clear data

More information

SAP Cloud Platform Big Data Services EXTERNAL. SAP Cloud Platform Big Data Services From Data to Insight

SAP Cloud Platform Big Data Services EXTERNAL. SAP Cloud Platform Big Data Services From Data to Insight EXTERNAL FULL-SERVICE BIG DATA IN THE CLOUD, a fully managed Apache Hadoop and Apache Spark cloud offering, form the cornerstone of many successful Big Data implementations. Enterprises harness the performance

More information

From Data Deluge to Intelligent Data

From Data Deluge to Intelligent Data SAP Data Hub From Data Deluge to Intelligent Data Orchestrate Your Data for an Intelligent Enterprise Data for Intelligence, Speed, and With Today, corporate data landscapes are growing increasingly diverse

More information

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA.

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA. ABOUT THIS TRAINING: The world of Hadoop and Big Data" can be intimidating - hundreds of different technologies with cryptic names form the Hadoop ecosystem. This comprehensive training has been designed

More information

Cask Data Application Platform (CDAP) Extensions

Cask Data Application Platform (CDAP) Extensions Cask Data Application Platform (CDAP) Extensions CDAP Extensions provide additional capabilities and user interfaces to CDAP. They are use-case specific applications designed to solve common and critical

More information

NICE Customer Engagement Analytics - Architecture Whitepaper

NICE Customer Engagement Analytics - Architecture Whitepaper NICE Customer Engagement Analytics - Architecture Whitepaper Table of Contents Introduction...3 Data Principles...4 Customer Identities and Event Timelines...................... 4 Data Discovery...5 Data

More information

Louis Bodine IBM STG WW BAO Tiger Team Leader

Louis Bodine IBM STG WW BAO Tiger Team Leader Louis Bodine IBM STG WW BAO Tiger Team Leader Presentation Objectives Discuss the value of Business Analytics Discuss BAO Ecosystem Discuss Transformational Solutions http://www.youtube.com/watch?v=eiuick5oqdm

More information

Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer

Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer Redefine Big Data: EMC Data Lake in Action Andrea Prosperi Systems Engineer 1 Agenda Data Analytics Today Big data Hadoop & HDFS Different types of analytics Data lakes EMC Solutions for Data Lakes 2 The

More information

Five Advances in Analytics

Five Advances in Analytics Five Advances in Analytics Fern Halper TDWI Director of Research for Advanced Analytics @fhalper March 26, 2015 Sponsor 2 Speakers Fern Halper Research Director for Advanced Analytics, TDWI Mike Watschke

More information

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake White Paper Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake Motivation for Modernization It is now a well-documented realization among Fortune 500 companies

More information

Lesson 3 Cloud Platform as a Service usages for accelerated Design and Deployment of IoTs

Lesson 3 Cloud Platform as a Service usages for accelerated Design and Deployment of IoTs Lesson 3 Cloud Platform as a Service usages for accelerated Design and Deployment of IoTs 1 Large and Big Data platform Oracle IOT PaaS For delivering, integrating, securing and retrieving For analysing

More information

Analyze Big Data Faster and Store it Cheaper. Dominick Huang CenterPoint Energy Russell Hull - SAP

Analyze Big Data Faster and Store it Cheaper. Dominick Huang CenterPoint Energy Russell Hull - SAP Analyze Big Data Faster and Store it Cheaper Dominick Huang CenterPoint Energy Russell Hull - SAP ABOUT CENTERPOINT ENERGY, INC. Publicly traded on New York Stock Exchange Headquartered in Houston, Texas

More information

Cloudera, Inc. All rights reserved.

Cloudera, Inc. All rights reserved. 1 Data Analytics 2018 CDSW Teamplay und Governance in der Data Science Entwicklung Thomas Friebel Partner Sales Engineer tfriebel@cloudera.com 2 We believe data can make what is impossible today, possible

More information

Contents at a Glance COPYRIGHTED MATERIAL. Introduction... 1 Part I: Getting Started with Big Data... 7

Contents at a Glance COPYRIGHTED MATERIAL. Introduction... 1 Part I: Getting Started with Big Data... 7 Contents at a Glance Introduction... 1 Part I: Getting Started with Big Data... 7 Chapter 1: Grasping the Fundamentals of Big Data...9 Chapter 2: Examining Big Data Types...25 Chapter 3: Old Meets New:

More information

Actionable Insights with PI Integrators

Actionable Insights with PI Integrators Actionable Insights with PI Integrators Hans Otto Weinhold, Partner Solutions Architect Joy Wang, Product Manager Agenda Introduction to PI Integrators Learn about Integrators and how they add value to

More information

Confidential

Confidential June 2017 1. Is your EDW becoming too expensive to maintain because of hardware upgrades and increasing data volumes? 2. Is your EDW becoming a monolith, which is too slow to adapt to business s analytical

More information

Hortonworks Powering the Future of Data

Hortonworks Powering the Future of Data Hortonworks Powering the Future of Simon Gregory Vice President Eastern Europe, Middle East & Africa 1 Hortonworks Inc. 2011 2016. All Rights Reserved MASTER THE VALUE OF DATA EVERY BUSINESS IS A DATA

More information

The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure

The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure The Internet of Things Wind Turbine Predictive Analytics Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure Big Data and Tribo-Analytics Today we will see how Fluitec solved real-world challenges

More information

Digital transformation is the next industrial revolution

Digital transformation is the next industrial revolution Digital transformation is the next industrial revolution Steam, water, mechanical production equipment Division of labor, electricity, mass production Electronics, IT, automated production Blurring the

More information

WHITEPAPER. Unlocking Your ATM Big Data : Understanding the power of real-time transaction monitoring and analytics.

WHITEPAPER. Unlocking Your ATM Big Data : Understanding the power of real-time transaction monitoring and analytics. Unlocking Your ATM Big Data : Understanding the power of real-time transaction monitoring and analytics www.inetco.com Summary Financial organizations are heavily investing in self-service and omnichannel

More information