The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure

Similar documents
Architecture Overview for Data Analytics Deployments

MapR Pentaho Business Solutions

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Analyzing Data with Power BI

Microsoft Azure Essentials

Cask Data Application Platform (CDAP)

Enterprise Architecture for Digital Business

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

Let s distribute.. NOW: Modern Data Platform as Basis for Transformation and new Services

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

Welcome to. enterprise-class big data and financial a. Putting big data and advanced analytics to work in financial services.

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

MapR: Solution for Customer Production Success

Operational Hadoop and the Lambda Architecture for Streaming Data

IoT Enabled Technologies Delivering Actionable Insights for the Telecom Industry

Your Big Data to Big Data tools using the family of PI Integrators

ETL on Hadoop What is Required

Common Customer Use Cases in FSI

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Real-time Streaming Insight & Time Series Data Analytic For Smart Retail

Audience Profile The course will likely be attended by SQL Server report creators who are interested in alternative methods of presenting data.

PI Integrator for Business Analytics

Streaming Analytics, Data Lakes and PI Integrators

E-Guide THE EVOLUTION OF IOT ANALYTICS AND BIG DATA

How to Build Your Data Ecosystem with Tableau on AWS

The Internet of Things Platform

Oracle Enterprise Data Quality Product Roadmap and Statement of Direction. October 2016

Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics

ALEXANDER PIQUER ANDERSON AMARAL. OSIsoft. OSIsoft

Analyzing Data with Power BI

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN

Copyright - Diyotta, Inc. - All Rights Reserved. Page 2

Big Data The Big Story

Safe Harbor Statement

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC

: 20776A: Performing Big Data Engineering on Microsoft Cloud Services

InfoSphere Warehouse. Flexible. Reliable. Simple. IBM Software Group

Azure IoT Suite. Secure device connectivity and management. Data ingestion and command + control. Rich dashboards and visualizations

Simplifying Your Modern Data Architecture Footprint

Stuck with Power BI? Get Pyramid Starting at $0/month. Start Moving with the Analytics OS

Building Your Big Data Team

EBOOK: Cloudwick Powering the Digital Enterprise

PaaS na prática para soluções B2B Oracle Customer Experience Cloud

Hortonworks Powering the Future of Data

20332B: Advanced Solutions of Microsoft SharePoint Server 2013

WHITE PAPER SPLUNK SOFTWARE AS A SIEM

Integrating MATLAB Analytics into Enterprise Applications

Jason Virtue Business Intelligence Technical Professional

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

SAP Big Data. Markus Tempel SAP Big Data and Cloud Analytics Services

Hybrid Data Management

Bringing the Power of SAS to Hadoop Title

ETL challenges on IOT projects. Pedro Martins Head of Implementation

GADD platform Overview

"Charting the Course... MOC A: Architecting Microsoft Azure Solutions. Course Summary

Modernizing Data Integration

Cloud Based Analytics for SAP

System log analysis using InfoSphere BigInsights and IBM Accelerator for Machine Data Analytics

Oracle BI 12c: Create Analyses and Dashboards Ed 1

The Rise of Engineering-Driven Analytics

Oracle Autonomous Data Warehouse Cloud

Oracle SCM Cloud. Integration and Extensibility Strategy. Jon Chorley. CSO and Group Vice President Oracle SCM Product Strategy.

Business Intelligence in Azure Alex Whittles

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden

Cloud Integration and the Big Data Journey - Common Use-Case Patterns

ActualTests.C Q&A C Foundations of IBM Big Data & Analytics Architecture V1

RealTests.P questions

CONDITION MONITORING SOLUTIONS

IoT Analytics. in Manufacturing: equipment condition. plant utilization. supplier performance. physical environment

Extending Enterprise to the Edge

HPE Flexible Capacity with Microsoft Azure & Azure Stack

Cisco Connected Asset Manager for IoT Intelligence

Machina Research White Paper for ABO DATA. Data aware platforms deliver a differentiated service in M2M, IoT and Big Data

FLINK IN ZALANDO S WORLD OF MICROSERVICES JAVIER LOPEZ MIHAIL VIERU

Internet of Things. Point of View. Turn your data into accessible, actionable insights for maximum business value.

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data?

Konica Minolta Business Innovation Center

Pool Data: 2/18/2018. Best Practices and Practical Considerations. Do you have the Moneyball Mindset at your pool?

House Keeping. You are in Listen Only Mode. Azure 101: Azure Overview. Azure 201: How to do a Cost Estimate for Virtual Machines

Big Data and Machine Learning for Predictive Maintenance

Oracle Autonomous Data Warehouse Cloud

Boston Azure Cloud User Group. a journey of a thousand miles begins with a single step

THE DATA WAREHOUSE EVOLVED: A FOUNDATION FOR ANALYTICAL EXCELLENCE

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect

Azure Offerings for Big data. In Kee Paek Cloud Data Solution Architect Microsoft Korea October. 2016

SALESFORCE CERTIFIED DATA ARCHITECTURE AND MANAGEMENT DESIGNER

The Internet of Things Services and People Next level solutions

The ABCs of. CA Workload Automation

Transforming IIoT Data into Opportunity with Data Torrent using Apache Apex

Advanced Machine Monitoring. Whitepaper

Data Governance and Data Quality. Stewardship

Automating the Integration Factory. Nagaraj Sastry HCL

K4U Knowledge for Success Internet of Things Webinar Series

INTRODUCING BIRST INFOR S GO-FORWARD BUSINESS INTELLIGENCE SOLUTION

Digitalisieren Sie Ihr Unternehmen mit dem Internet der Dinge Michael Epprecht Microsoft GBB IoT

Introducing Infor Xi/Ming.le for M3

HP SummerSchool TechTalks Kenneth Donau Presale Technical Consulting, HP SW

IBM SPSS & Apache Spark

Transcription:

The Internet of Things Wind Turbine Predictive Analytics Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure

Big Data and Tribo-Analytics Today we will see how Fluitec solved real-world challenges Data wrangling on a Big Data scale Predictive modeling and data mining of complex, heterogeneous data sources Utilization of cloud resources Integration of Big Data tools such as Hadoop with MySQL and Java Use of network connected devices to manage complex machinery

Internet of Things Devices embedded with electronics, software and sensors... network-connected... enabling communication and data transfer... incorporating automated analytics

Tribo-Analytics and IoT Automated feeds from sensors on wind turbine components Incorporated into central repository Automated analytics produce warnings and recommendations WTG 89172: Add 1000 mg P to GB oil WTG 00811: Yaw motor likely to fail < 3 mos

M&S Consulting Big Data Architecture Machine Learning Data Wrangling (ETL) Hybrid Cloud

Fluitec International

Fluitec Fluitec International - Tribology experts Fluitec Wind - Decreasing the cost of wind energy through advanced data analytics

Fluitec Wind

Fluitec Wind Tribo-Analytics SaaS platform Help turbine operators utilize existing data to reduce costs and guide decision making Aggregates data from thousands of turbines globally Recommend Maintenance Plans Identify At-Risk Components

Value Proposition for Customers Using data that is already being collected, predictive analytics can be used to identify which components will fail and when Customers can then either take preventive action or at least plan for replacement Enables condition-based oil changes

Why Wind Turbines? Expensive to build $2 MM/MW On-shore, $6 MM/MW Off-shore Expensive to maintain $50,000 / WTG per year $7,500 / WTG oil change (every 6 years) $100,000 Gearbox replacement cost Often remote = Difficult logistics Huge amounts of data already being collected

Often Remote

Big and Expensive

Really Big...

Exposed to the Elements

Maintenance can be Tricky

Distracted Staff

Not So Simple - Lots of Connected "Things"

Predict Component Failure Initial Focus on Gearboxes Very Expensive Difficult to Replace One of the most frequent causes of failure in wind turbines

Gearboxes

Damaged Bearing

Gear Micropitting

Types of Data Used Sensors on gearbox major components Ambient conditions Oil analysis Technician s reports

The Data 5,000 WTGs (Wind Turbine Generators) 30,000 oil samples Make, model, rated capacity, etc. Billions of rows of sensor data (to start) Bearing temperature Shaft RPM Vibration alerts Many more...

Collecting and Preparing Data Collect data historical + ongoing feeds Traditional ETL on low-volume data MDM is important Preprocessed high-volume data ETL Derived data (statistics)

High-Volume Data Flow Continuous File Processing Secure Landing Zone For Data Uploads From Customers File-level Validation Cataloging Transfer to HDFS Repository Copy to Backup Storage Hadoop Repository Cleansing Transformation Pre- Aggregation Calculation of Statistics Data Feed Object Catalog and Lineage Repository Cheap Storage (Raw File Repository)

Complete Data Flow (generic) Data Sources (examples) Real-Time Routing Data Cleansing & Restructuring Pre- Processing Delivery (PCs, tablets, phones) Internal financial data in Oracle Machine Sensor data (flat files) Internal CRM data in SQL Server Semi-structured (customer clicks on website) Unstructured (technician incident report) One or more applications which determine which ETL path the data will follow (batch, real-time, or both). The data may be sent to more than one downstream system. For example, fastmoving financial markets data may be streamed to a real-time analytics app for immediate use but would also be sent to the data warehouse for more in-depth analysis and inclusion in the historical data repository. Batch Batch High-volume ETL in Hadoop Cluster Low/Mediumvolume ETL and MDM in Relational Database Real-time Pre-processing and light-weight validation Reporting and Analytics Processing (traditional BI - calculation of KPIs, summaries, dashboards) Advanced Analytics (machine learning, predictive analytics, NLP, etc.) Real-Time Analytics Decision Makers (Dashboards Self-service BI Download to Excel) Analysts (Data Mining, R,SAS, SPSS, Predictive Models) Dashboards, Alerts, Reports Reports, Alerts

Hybrid Cloud Architecture Secure Customer Upload AWS Customer #1 Hadoop Data Repository + MDM (copy) MySQL Pre-Processed Data For Model Ingestion Cataloging Data Lineage MDM On-Premise Secure Customer Upload Hadoop Development Customer #2

Data Collection and Pre-Processing Systems Landing Zone - hybrid Internal and AWS-based Linux servers Secure, encrypted transfer to Hadoop cluster in cloud Pre-Processing in Hadoop Pre-Aggregated Values and Statistics Sqoop ed to MySQL

Machine Learning Methodology Identify patterns associated with gearbox failure modes using training set (create model) Validate model using N-fold cross-validation Using model based on training set, predict TTF for: New wind turbines Existing turbines using updated data

Anomaly Detection Sensor Data

Oil Anomalies

Oil Contamination

Side Note... Big Data tools are not just for big data Powerful, open-source software Data Mining Exploring data in new ways Create KPIs for automated monitoring

Predictive Model Selection Ran millions of scenarios Used automated model generator 128 scenarios run in parallel on multiple cloud servers Cloud servers shutdown when not running scenarios = $0 cost

Challenges and Lessons Learned Data Quality Semantics Data Volume Identifying Distinct Modes of Failure Identifying Failures Validate Peer Groups Should be clear that model works or doesn t

Key Features Repository for Data Mining Predictive Model Data Catalog Model Logs Automated Model Generation and Assessment No Lost Information

Key Technologies MySQL Hadoop Java R Python

Led to New Lines of Business Breeze Oil Never change the oil But need monitoring to establish credibility Condition-based oil changes With monitoring and Fluitec expert analysis, only change oil when needed Additional additives sales Top up additives instead of complete oil change

M&S Consulting Process and Technology Consulting National clientele

Q&A Questions welcomed...