Evaluation of Machine Learning Algorithms for Satellite Operations Support


Julian Spencer-Jones, Spacecraft Engineer, Telenor Satellite AS
Greg Adamski, Member of Technical Staff, L3 Technologies Telemetry and RF Products

© 2017 by L3 Technologies and Telenor Satellite AS. Published by The Aerospace Corporation with permission.

Agenda
- Goals and background
- Machine learning tools overview
- Archiving subsystem - ease of data access
- Machine learning for telemetry data - concepts
- Analyzing data from the Telenor fleet: techniques, algorithms and results
- Lessons learned and future development

Problem statement and background
- Goal: predict battery voltage levels through the eclipse season to prevent a catastrophic power failure.
- We analyze data from the past 7 years and use machine learning algorithms to attempt a realistic prediction.
- Secondary goal: gain experience with machine learning algorithms and their applicability to satellite telemetry data analysis.
- Ease of use of the algorithm was of paramount importance:
  - Quick implementation
  - Results that can be explained
  - Good defaults and as little data preparation as possible
- Lack of data science background: the work was done by a satellite engineer and a software engineer.
  - We expect this is true for many organizations.

Tools
- Machine learning - scikit-learn: open source library for machine learning
  - Powerful and includes a wide range of algorithms and tools
  - Used for all the work in this project
- Software stack: scikit-learn, NumPy, SciPy, matplotlib, Python
- Many alternatives available in a variety of languages (not tested)
  - Especially in Python and R
  - Also: TensorFlow (Google), Theano, Keras
  - Including cloud-based services such as Amazon AI

Data Storage - Nebula Archiver
From proprietary to open
- Legacy archiving
  - Based on closed, proprietary file formats
  - Data access possible only through proprietary APIs or GUIs
- Nebula Archiver - power to the user
  - Based on the Cassandra NoSQL database (other DBs supported as needed)
  - Almost unlimited scalability
  - Lightning fast data access
  - Data laid out to match typical access patterns
  - Ease of access: just a few lines of Python code needed to retrieve and plot data
  - Easily accessible from any programming platform
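As a rough illustration of the "few lines of Python" point, the sketch below reads one telemetry point from Cassandra with the DataStax `cassandra-driver` package and plots it with matplotlib. The keyspace `nebula`, the table `telemetry_by_point`, its column names, the contact address and the point name `BATT_CELL_01_V` are all hypothetical; the actual Nebula Archiver schema is not described in these slides.

```python
from datetime import datetime

# Hypothetical table layout: one partition per telemetry point, rows ordered by time.
QUERY = (
    "SELECT sample_time, value FROM telemetry_by_point "
    "WHERE point_id = %s AND sample_time >= %s AND sample_time < %s"
)


def fetch_series(session, point_id, start, end):
    """Return (times, values) for one telemetry point in [start, end)."""
    rows = session.execute(QUERY, (point_id, start, end))
    times, values = [], []
    for row in rows:
        times.append(row.sample_time)
        values.append(row.value)
    return times, values


def main():
    # Imported here so fetch_series can be exercised without a live cluster.
    from cassandra.cluster import Cluster
    import matplotlib.pyplot as plt

    cluster = Cluster(["127.0.0.1"])     # assumed contact point
    session = cluster.connect("nebula")  # assumed keyspace name
    times, values = fetch_series(
        session, "BATT_CELL_01_V", datetime(2016, 1, 1), datetime(2017, 1, 1)
    )
    plt.plot(times, values)
    plt.xlabel("time")
    plt.ylabel("cell voltage [V]")
    plt.show()
    cluster.shutdown()
```

Calling `main()` requires a reachable cluster that actually has such a table. `fetch_series` only needs an object with an `execute(query, parameters)` method, so it can be tried against a stub.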

Nebula Archiver - high level architecture
[Architecture diagram. Labels: real time feed from operational fleet, InControl, real time data, Cassandra driver (CQL calls), historical reports, happy operator, historical data for analytics, legacy data store, source reader, 3rd party data source, Nebula Importer, data updates, Cassandra cluster, Nebula Archiver, data access for ML, machine learning/analytics tools, other system, data updates/import.]

Nebula Archiver - Benefits
- For the operator
  - Quick overview of large portions of data (view 1 year of data in as little as a few seconds)
  - Reporting has no impact on real-time processing
- For the engineer
  - Very fast and simple access to stored data using open tools (Python/Java/JS)
  - Use analytical tools without impacting the real-time system
  - Almost unlimited scalability
- For the manager
  - Happy operator/engineer
  - Reduced hardware cost and ability to expand the storage subsystem as needs grow

Machine learning - concepts
- Machine learning: data analytics technique that uses historical data to predict future system behavior
- Supervised learning: using previously collected labeled data (classification) or numerical values (regression) to predict system behavior in new circumstances
- Unsupervised learning: using unlabeled data to infer system behavior (e.g. clustering)
- Classification: automatic assignment of an event to a class based on historical data and other inputs
- Overfitting: learning the individual training data points, noise included, rather than the trends in the data
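The overfitting concept can be made concrete with a small scikit-learn sketch. The data below is synthetic (a noisy sine wave, not satellite telemetry) and the decision tree models are chosen only for illustration. An unrestricted tree memorizes the training points, so its training score is near perfect while its score on held-out data is noticeably lower. A depth-limited tree usually shows a much smaller gap between the two.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

# Synthetic data: a smooth trend plus measurement noise.
rng = np.random.default_rng(0)
X = rng.uniform(0.0, 10.0, size=(300, 1))
y = np.sin(X[:, 0]) + rng.normal(0.0, 0.3, size=300)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)

# Unrestricted tree: free to fit every training point, noise included.
deep = DecisionTreeRegressor(random_state=0).fit(X_train, y_train)
# Depth-limited tree: forced to capture only the broad trend.
shallow = DecisionTreeRegressor(max_depth=3, random_state=0).fit(X_train, y_train)

scores = {
    "deep": (deep.score(X_train, y_train), deep.score(X_test, y_test)),
    "shallow": (shallow.score(X_train, y_train), shallow.score(X_test, y_test)),
}
for name, (train_r2, test_r2) in scores.items():
    print(f"{name:8s} train R^2 = {train_r2:.2f}, test R^2 = {test_r2:.2f}")
```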

Regression Algorithms
Several different regression algorithms were tested:
- Linear regression: stochastic gradient descent
- Support vector machine (SVM): linear, polynomial and radial basis function (RBF) kernels
- Random forest: extra trees regressor
The extra trees regressor produced the best results in most cases tested, especially for time series. SVM with an RBF kernel is faster and sometimes a good alternative.
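A comparison of this kind can be set up in scikit-learn roughly as follows. This is a minimal sketch: the data is a synthetic stand-in for telemetry, and the hyperparameters, the feature scaling and the random train/test split are assumptions rather than the settings used in the study. The scores it prints say nothing about how the algorithms ranked on the real fleet data.

```python
import numpy as np
from sklearn.ensemble import ExtraTreesRegressor, RandomForestRegressor
from sklearn.linear_model import SGDRegressor
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR

# Synthetic stand-in: voltage dips and recovers over a 45-day season.
rng = np.random.default_rng(0)
n = 400
day = rng.uniform(0.0, 45.0, n)
start_voltage = rng.uniform(3.9, 4.1, n)
X = np.column_stack([day, start_voltage])
y = start_voltage - 0.3 * np.sin(np.pi * day / 45.0) + rng.normal(0.0, 0.005, n)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)


def scaled(model):
    """SGD and SVM are sensitive to feature scale, so standardize first."""
    return make_pipeline(StandardScaler(), model)


models = {
    "SGD linear regression": scaled(SGDRegressor(random_state=0)),
    "SVM linear": scaled(SVR(kernel="linear", epsilon=0.01)),
    "SVM polynomial": scaled(SVR(kernel="poly", degree=3, epsilon=0.01)),
    "SVM RBF": scaled(SVR(kernel="rbf", epsilon=0.01)),
    "Random forest": RandomForestRegressor(n_estimators=100, random_state=0),
    "Extra trees": ExtraTreesRegressor(n_estimators=100, random_state=0),
}

scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = model.score(X_test, y_test)  # R^2 on held-out samples
    print(f"{name:22s} R^2 = {scores[name]:.3f}")
```

Because every estimator exposes the same `fit`/`score` interface, swapping algorithms is a one-line change. That fits the stated emphasis on quick implementation and good defaults.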

Regression Use Cases
- Monitoring battery cell voltage through eclipse seasons.
- Dynamic limits, potentially usable for any predictable telemetry point.
- Predicting the attitude disturbance of momentum wheel unloads, based on the momentum stored on the wheels.

Regression Example - Battery Cell Voltages
Training
- Input:
  - Voltage parameters at the beginning of the eclipse season
  - Up to 15 days taken into account
  - Calculated daily average for each of the input parameters
  - Day of the eclipse season
  - Predicted duration of eclipse
- Output:
  - Battery cell voltage for each day of the eclipse season
Cross-validation:
- One (or more) seasons set aside to validate model performance
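The training setup above can be sketched in plain Python. The parameter names, the data layout and the helper functions are hypothetical and the numbers are toy values. The sketch also assumes every season supplies the same number of pre-season days, so that all feature vectors have equal length.

```python
from statistics import mean

MAX_PRE_SEASON_DAYS = 15  # the slides state that up to 15 days are used


def daily_averages(samples):
    """Average (day_index, value) samples per day, returned in day order."""
    by_day = {}
    for day, value in samples:
        by_day.setdefault(day, []).append(value)
    return [mean(by_day[d]) for d in sorted(by_day)]


def build_season_rows(pre_season_samples, eclipse_minutes, cell_voltage):
    """Build one (features, target) pair per day of an eclipse season.

    pre_season_samples: {parameter name: [(day_index, value), ...]}
    eclipse_minutes:    predicted eclipse duration for each day of the season
    cell_voltage:       measured cell voltage per day (the training target)
    """
    start_features = []
    for name in sorted(pre_season_samples):
        averages = daily_averages(pre_season_samples[name])
        start_features.extend(averages[-MAX_PRE_SEASON_DAYS:])
    rows = []
    pairs = zip(eclipse_minutes, cell_voltage)
    for day, (minutes, voltage) in enumerate(pairs, start=1):
        rows.append((start_features + [day, minutes], voltage))
    return rows


def hold_out_season(rows_by_season, held_out):
    """Return (training rows, validation rows) with one season set aside."""
    train = [
        row
        for season, rows in rows_by_season.items()
        if season != held_out
        for row in rows
    ]
    return train, rows_by_season[held_out]


# Toy example with two pre-season days and a two-day "season".
pre = {
    "bus_voltage": [(0, 50.0), (0, 50.2), (1, 50.4)],
    "cell_voltage": [(0, 4.0), (1, 4.1)],
}
season_rows = build_season_rows(pre, eclipse_minutes=[10.0, 25.0],
                                cell_voltage=[3.95, 3.90])
rows_by_season = {"season_a": season_rows, "season_b": season_rows}
train_rows, validation_rows = hold_out_season(rows_by_season, "season_b")
```

Splitting by whole season, rather than by random sample, keeps days from the validation season out of the training set.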

Regression Example - Battery Cell Voltages
- The more varied the seasons in the training data, the better the predictions.
- No model had previously been found to reliably predict these voltages.

Regression Example - Battery Cell Voltages
- The model does not extrapolate well to new curves.

Regression Example - Battery Cell Voltages
- Using data collected for a number of days into the eclipse season allows the model to re-align.

Regression Example - Dynamic Limits
- Predicts the trends ahead and sets limits.
- Prediction accuracy deteriorates with time.
- More work needed to reduce false alarms.
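One simple way to turn a predicted trend into dynamic limits is a band of a fixed number of standard deviations around the prediction, widened with the prediction horizon. The sketch below illustrates that idea only and is not the method used in the presentation. The function names, the multiplier `k`, the widening rate `growth_per_day` and the sample values are all assumptions.

```python
from statistics import pstdev


def dynamic_limits(predicted, residuals, k=3.0, growth_per_day=0.05):
    """Return (lower, upper) limit pairs around a predicted trend.

    predicted:      predicted values, one per day ahead
    residuals:      past (measured - predicted) errors used to size the band
    k:              band half-width in standard deviations (assumed value)
    growth_per_day: fractional widening per day ahead (assumed value),
                    reflecting that accuracy deteriorates with time
    """
    sigma = pstdev(residuals)
    limits = []
    for days_ahead, value in enumerate(predicted):
        half_width = k * sigma * (1.0 + growth_per_day * days_ahead)
        limits.append((value - half_width, value + half_width))
    return limits


def out_of_limits(measured, limits):
    """Indices of the days whose measurement falls outside its limits."""
    return [
        i
        for i, (m, (low, high)) in enumerate(zip(measured, limits))
        if not low <= m <= high
    ]


predicted = [4.00, 3.98, 3.95, 3.93]
residuals = [0.01, -0.01, 0.01, -0.01]
limits = dynamic_limits(predicted, residuals)
alarms = out_of_limits([4.01, 3.97, 3.80, 3.94], limits)
print(alarms)  # only the third day (index 2) is outside its band
```

A wider band (larger `k`) raises fewer false alarms but detects real deviations later, which is the trade-off behind the false-alarm remark above.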

Other research - classification
- Monitoring battery charging
  - Supervised training used to identify nominal charge characteristics for the various modes and states.
  - More work is needed for a useful tool.
- Identifying anomalous behaviour
  - Successfully identifies known anomalies.
  - Other methods are more practical.
  - Not enough events for thorough training.

Lessons Learned and Future Directions
Lessons learned:
- The challenge is to define the task and preprocess the data.
- Some useful tools were developed.
- Too few anomalies: it is hard to train algorithms without failures.
Future development:
- Expand and improve current tools
  - Plenty of other potential applications
- Anomaly detection
  - Dynamic limits, out-of-family behaviour, component degradation
- Investigate techniques for more general learning

Questions?

Thank You!