ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics

Similar documents
ADVANCED ANALYTICS & IOT ARCHITECTURES

BIG DATA & ADVANCED ANALYTICS ROADSHOW

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW

Azure Data Analytics & Machine Learning Seminar. Daire Cunningham: BI Practice Area Manager

20775A: Performing Data Engineering on Microsoft HD Insight

20775: Performing Data Engineering on Microsoft HD Insight

Microsoft Azure Essentials

20775A: Performing Data Engineering on Microsoft HD Insight

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

Business is being transformed by three trends

Azure ML Data Camp. Ivan Kosyakov MTC Architect, Ph.D. Microsoft Technology Centers Microsoft Technology Centers. Experience the Microsoft Cloud

20775 Performing Data Engineering on Microsoft HD Insight


Big data is hard. Top 3 Challenges To Adopting Big Data

Advanced Analytics in Azure

The Importance of good data management and Power BI

Measure Consume. Store. Data Governance

Jason Virtue Business Intelligence Technical Professional

HDInsight - Hadoop for the Commoner Matt Stenzel Data Platform Technical Specialist

Making Realtime Reporting a Reality

Boston Azure Cloud User Group. a journey of a thousand miles begins with a single step

Spotlight Sessions. Nik Rouda. Director of Product Marketing Cloudera, Inc. All rights reserved. 1

Business Intelligence in Azure Alex Whittles

Alexander Klein. ETL meets Azure

SAP Predictive Analytics Suite

AZURE HDINSIGHT. Azure Machine Learning Track Marek Chmel

Azure Offerings for Big data. In Kee Paek Cloud Data Solution Architect Microsoft Korea October. 2016

Common Customer Use Cases in FSI

Digital transformation is the next industrial revolution

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC

Architecting an Open Data Lake for the Enterprise

INTRODUCTION TO R FOR DATA SCIENCE WITH R FOR DATA SCIENCE DATA SCIENCE ESSENTIALS INTRODUCTION TO PYTHON FOR DATA SCIENCE. Azure Machine Learning

Building a Modern Data Warehouse in Azure for Power BI

Big Data at PennDOT (ISTO DW-BI Team)

Cask Data Application Platform (CDAP) Extensions

Confidential

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect

Two offerings which interoperate really well

aka.ms/ uber-selfies

HPE Flexible Capacity with Microsoft Azure & Azure Stack

5th Annual. Cloudera, Inc. All rights reserved.

How In-Memory Computing can Maximize the Performance of Modern Payments

Integrating the Enterprise. How Business Leaders are Implementing Digital Integration

Course 20535A: Architecting Microsoft Azure Solutions

1% + 99% = AI Popularization

Your Big Data to Big Data tools using the family of PI Integrators

The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure

Azure Data Factory Hybrid data integration, at global scale. Erika Harris Senior Program Manager AzureCAT

Operational Hadoop and the Lambda Architecture for Streaming Data

Actionable Insights with PI Integrators

Incorporating Predictive Models for Operational Intelligence

Modern Analytics Architecture

Your Top 5 Reasons Why You Should Choose SAP Data Hub INTERNAL

Industrial IoT Solution Architecture Design From Connectivity to Data

Power BI. Melissa Coates. Atlanta SQLSaturday BI Edition 1/9/2016. Blog: sqlchick.com

Five Advances in Analytics

Architecting Microsoft Azure Solutions

IIOT Data Access with the PI System

Big Data Introduction

How to create an Azure subscription

Turn Data into Business Value

Control Anything. Gain Insights. Connect Things. Action. 10% of the data on earth will come from IoT by B connected devices by 2020

DevOps och IoT. Infrastructure Architect - Avanade DK Blog:

IIoT Data Access with the PI System

Fast Innovation requires Fast IT

Using Technology and Big Data to Provide Customers with a Passenger Experience. IBTTA September 12, 2017

Apache Hadoop in the Datacenter and Cloud

Building data-driven applications with SAP Data Hub and Amazon Web Services

Angat Pinoy. Angat Negosyo. Angat Pilipinas.

Security Solutions in Azure

OSIsoft Super Regional Transform Your World

MapR Pentaho Business Solutions

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Azure Data Lake How to organize. Jan Cordtz, Microsoft Denmark Cloud Solution Architect

Analytics Platform System

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

Architecting for Real- Time Big Data Analytics. Robert Winters

CASE STUDY Delivering Real Time Financial Transaction Monitoring

TAP Air Portugal. in Real Time TÍTULO. Subtítulo. Rui Monteiro - February 19. Data da apresentação

Data Lake Organization A Hadoop Eco-System. Jan Cordtz, Microsoft Denmark Cloud Solution Architect

Analyzing Data with Power BI

Big Data The Big Story

Application Performance Management for Microsoft Azure and HDInsight

Sunnie Chung. Cleveland State University

Architecting Microsoft Azure Solutions

MapR: Solution for Customer Production Success

Modernizing Your Data Warehouse with Azure

Data Analytics for Semiconductor Manufacturing The MathWorks, Inc. 1

Cloud & Datacenter Monitoring with System Center Operations Manager

Real-time Streaming Insight & Time Series Data Analytic For Smart Retail

Analytics in Action transforming the way we use and consume information

30 min. Close. Facilitating innovation with IoT. Digital Transformation. Microsoft portfolio for product development

Course Outline (10996A)

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

Experiences in the Use of Big Data for Official Statistics

Analyzing Data with Power BI (20778)

Ventana Research. Big Data, Analytics & Cloud There s No Free Lunch! David Menninger SVP & Research Director Ventana

Responsive enterprise the future of the enterprise PERSPECTIVE

Data Science, realizing the Hype Cycle. Luigi Di Rito, Director Data Science Team, SAP Center of Excellence

Databricks Cloud. A Primer

Transcription:

ADVANCED ANALYTICS & IOT ARCHITECTURES Presented by: Orion Gebremedhin Director of Technology, Data & Analytics Marc Lobree National Architect, Advanced Analytics

EDW THE RIGHT TOOL FOR THE RIGHT WORKLOAD RDBMS Data Stores SSIS Local Data Sources Unstructured data Flat File Upload Azure SQL DW HDInsight Storage blob Azure SQL database On-Premises Reporting & Analytics SSIS Excel (Direct Access)

ESA EDW REFERENCE ARCHITECTURE HYBRID BIG DATA PROCESSING On-Demand-Compute Direct Access/ Report Model Level Integration Cloud Storage Cube PowerShell AZCopy SSIS SSIS Data Layer Level Integration Tabular

ON PREMISES BIG DATA IMPLEMENTATIONS

USE CASE: ETL OFFLOADING Have you outgrown your data delivery SLAs? Is your business frustrated with data delays? Get the right data at the right time.

Neudesic partnered with one of the nation s largest utility companies that recently deployed Smar Utility Meters for power customers, nearly a million meters sending usage data every 15 minutes. The result: an Azure hybrid big data processing solution that enabled the customer to perform gap analytics: a process for identifying gaps that exist in the power usage readings, over 7x faster than their previous solution! Billions of Smart Meter reads get processed to identify the nature and duration of the gaps to mitigate revenue losses.

USE CASE: REAL-TIME ANALYSIS Got end users that need data now? Provide business units the data they need at the time they need it.

REAL TIME TRAFFIC MANAGEMENT Toll Data EventsHub StreamAnalytics Toll Way Event Generator Toll Violations Reference Data Vehicle Registration Toll Violation Tickets

Real-Time Analysis On-premises Using Data Lake to capture all data for everyone. OLTP Kafka Spark MLlib Kafka Logs HDFS PM B DM MDM Machine learning

USE CASE: INTERNET OF THINGS What action does your IoT device drive? Help guide end-users to the action they are looking to take.

VENDING MACHINE MANAGEMENT Vending Machine EventsHub StreamAnalytics Vending Machine Vending Transactions EventsHub Batch Predictions Real-time Notifications Machine learning EventsHub Vehicle Location Info

REAL TIME TRAFFIC MANAGEMENT Toll Data EventsHub StreamAnalytics Toll Way Event Generator Toll Violations Reference Data Vehicle Registration Toll Violation Tickets

IOT WEARABLE MANAGEMENT Processing device data in real time. HD Insight Spark SQL Analyze Device API Azure Event Hub or IOT Hub Azure Stream Analytics Power BI Dataset Temporal Power BI Dashboards

USE CASE: ITERATIVE EXPLORATION What can we do with all of this data? Mine for answers-one question at a time.

ITERATIVE EXPLORATION Build expert systems, move to supervised learning, and evolve to reinforced learning. Web Service used for Orchestration HD Insight Azure Machine Learning API End Point Azure Data Warehouse Power BI

ITERATIVE EXPLORATION Monitor and remove noise from textual data. Web Service used for Orchestration Azure SQL DB Keyword Analytics Power BI Dataset Statistical Media Services Power BI Dashboards Machine Learning API End Point Event Hubs Stream Analytics Power BI Dataset Temporal

USE CASE: SELF SERVICE Are your reports only telling half the story? Quickly deliver large datasets for ad hoc analysis.

SELF SERVICE Allowing business to fulfill their analytics needs. Semi-structured Files Apache Hadoop Spark SQL Analyze Service Bus SQL Server

HYBRID SELF SERVICE

HYBRID SELF SERVICE

USE CASE: DATA AS A SERVICE Got savvy end users that need more data? Provide data scientists with what they need while making it easy for the business user.

Data-as-a-Service USING AZURE Using Data Lake to capture all data for everyone. Data Sources Loading Data Lake Raw Data Lake Building Data Streams Self-Service Catalog SQL Data Factory Click Stream Logs Data Factory Azure ML Azure Data Catalog Data Historian (PI Server) App Service Azure Data Lake Store Data Factory Azure Data Lake Store HDInsight Hive or Spark Power BI Dashboards Device API Azure Event Hub or IOT Hub Azure Stream Analytics Azure Blob Storage

Advanced Analytics Methodology

Solution Development Process Business Objective Understanding Data Understanding Data Model Creation + Testing Integration in Data Strategy Model Creation + Testing Data Acquisition Visual Analysis Model(s) Selection Model Comparison Integration in Data Strategy Build Model + Web Service Location for SQL query Consumption Layer

Model Selection: Supervised (we know the response). Parametric Regression Linear Polynomial Stepwise Binomial Splines Partial Least Squares Generalized Linear Models Classification Logistic Linear / Quadratic Discriminant Analysis Non Parametric K Nearest Neighbors Decision Trees Random Forests Boosting Neural Network Support Vector Machines Generalized Additive Models Forecasting Moving Averages Exponential Smoothing ARIMA Regressions *Some models can change (parametric/nonparametric) and (regression/classification)

Model Selection: MAPE & RMSE & R^2 Mean Average Percent Error Root Mean Square Error Variation explained by Predictor We want to choose the model that reduces the test error and has a high percent value for how much the predictors explains the response

Examining Weather and Active Meters in the System by Time Temperature by time Active Metes by time Seasonality of temperature Constant increase of active meters

Usage & Temp Usage by Day of Week & Verse Temperature Day of Week Trend Hourly Usage Trends Day of Month Temp = Red Usage = Blue

Auto-Regressive Integrated Moving Average ARIMA(p,d,q)x(P,D,Q)[m] AR(p) = number of seasonal autoregressive terms I(d) = number of differencing terms MA(q) = number of seasonal moving average terms m = periods inside frequency Stationary Mean & Variance Avg. Temperature Time Series

NEXT STEP BECOME THE BI SUPERHERO Information Management Big Data Storage Apache Hadoop Real-time intelligence Machine learning IoT Dashboards and Visualizations and more! Ideate, chart your quick wins, ask questions and get answers to your real Big Data challenges. It s insightful, it s easy and can be done from the comfort of your conference room. www.neudesic.com/meetneat

BIG DATA & Advanced Analytics Roadshow Questions? Orion Gebremedhin Orion.Gebremedhin@Neudesic.com Twitter: @oriongm Marc Lobree Marc.Lobree@Neudesic.com