Real-time IoT Big Data-in-Motion Analytics Case Study: Managing Millions of Devices at Country-Scale

Similar documents
Data: Foundation Of Digital Transformation

Microsoft Azure Essentials

Analytics for All Your Data: Cloud Essentials. Pervasive Insight in the World of Cloud

Datametica. The Modern Data Platform Enterprise Data Hub Implementations. Why is workload moving to Cloud

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Copyright - Diyotta, Inc. - All Rights Reserved. Page 2


Hortonworks Connected Data Platforms

Hybrid Data Management

BIG DATA TRANSFORMS BUSINESS. Copyright 2013 EMC Corporation. All rights reserved.

Insights-Driven Operations with SAP HANA and Cloudera Enterprise

Analytics for All Data

How In-Memory Computing can Maximize the Performance of Modern Payments

Introduction to Stream Processing

Big Data Introduction

WHITE PAPER SPLUNK SOFTWARE AS A SIEM

Managing explosion of data. Cloudera, Inc. All rights reserved.

Business is being transformed by three trends

Insights to HDInsight

Active Analytics Overview

Building data-driven applications with SAP Data Hub and Amazon Web Services

Architecture Overview for Data Analytics Deployments

DLT AnalyticsStack. Powering big data, analytics and data science strategies for government agencies

Spotlight Sessions. Nik Rouda. Director of Product Marketing Cloudera, Inc. All rights reserved. 1

EXECUTIVE BRIEF. Successful Data Warehouse Approaches to Meet Today s Analytics Demands. In this Paper

REDEFINE BIG DATA. Zvi Brunner CTO. Copyright 2015 EMC Corporation. All rights reserved.

Advancing your Big Data Strategy

Cloud-Scale Data Platform

Savvius and Splunk: Network Insights for Operational Intelligence

CASE STUDY Delivering Real Time Financial Transaction Monitoring

POWER REAL-TIME TELCO NETWORK OPERATIONS WITH EXTREME ANALYTICS

The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

Oracle Infinity TM. Key Components

Azure ML Data Camp. Ivan Kosyakov MTC Architect, Ph.D. Microsoft Technology Centers Microsoft Technology Centers. Experience the Microsoft Cloud

The Importance of good data management and Power BI

20775A: Performing Data Engineering on Microsoft HD Insight

Why an Open Architecture Is Vital to Security Operations

Deliver Always-On, Real-Time Insights at Scale. with DataStax Enterprise Analytics

20775 Performing Data Engineering on Microsoft HD Insight

Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions

Amsterdam. (technical) Updates & demonstration. Robert Voermans Governance architect

Operational Hadoop and the Lambda Architecture for Streaming Data

Establishing Self-Driving Infrastructure Operations

Industrial IoT Solution Architecture Design From Connectivity to Data

GPU ACCELERATED BIG DATA ARCHITECTURE

Data Integration for the Real-Time Enterprise

20775: Performing Data Engineering on Microsoft HD Insight

Hortonworks Data Platform

5th Annual. Cloudera, Inc. All rights reserved.

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

Cloudera Data Science and Machine Learning. Robin Harrison, Account Executive David Kemp, Systems Engineer. Cloudera, Inc. All rights reserved.

TechValidate Survey Report. Converged Data Platform Key to Competitive Advantage

AppDynamics Launches Business iq

Making Realtime Reporting a Reality

Meta-Managed Data Exploration Framework and Architecture

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

CONNECTED TRAFFIC CLOUD. A New Approach to Intelligent Traffic Management

The Internet of Everything and the Research on Big Data. Angelo E. M. Ciarlini Research Head, Brazil R&D Center

SAP Cloud Platform Big Data Services EXTERNAL. SAP Cloud Platform Big Data Services From Data to Insight

20775A: Performing Data Engineering on Microsoft HD Insight

H2O Powers Intelligent Product Recommendation Engine at Transamerica. Case Study

Real-time Streaming Applications on AWS Patterns and Use Cases

IoT ANALYTICS IN THE ENTERPRISE WITH FUNL

SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform

Application Performance Management for Microsoft Azure and HDInsight

Microsoft FastTrack For Azure Service Level Description

DRAFT ENTERPRISE TECHNICAL REFERENCE FRAMEWORK ETRF WHITE PAPER

Cognitive Data Warehouse and Analytics

Transforming IIoT Data into Opportunity with Data Torrent using Apache Apex

Building a Data Lake on AWS EBOOK: BUILDING A DATA LAKE ON AWS 1

SOLUTION SHEET End to End Data Flow Management and Streaming Analytics Platform

Gain Insights. Control Anything. Take Action. Connect Things. 10% of the data on earth will come from IoT by B connected devices by 2020

Turn Data into Business Value

Preparing for the analytics economy Why and how Telecom operators should adapt

Big Data Platform Implementation

SUSiEtec The Application Ready IoT Framework. Create your path to digitalization while predictively addressing your business needs

Modernizing Your Data Warehouse with Azure

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

Building a Data Lake on AWS

: Boosting Business Returns with Faster and Smarter Data Lakes

Increasing the Value Density of Data with Logtrust Event Lake TM

Oracle Stream Analytics

Statistics & Optimization with Big Data

Angat Pinoy. Angat Negosyo. Angat Pilipinas.

Enterprise Analytics Accelerating Your Path to Value with an Open Analytics Platform

EXPERIENCE EVERYTHING

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia

Advanced Analytics in Service Operation Management WHAT IF? Data Analytics For IOT

Oracle 全数据平台解决方案 : 打破技术壁垒, 释放数据能量. Sally Piao 甲骨文公司全球研发副总裁

Scaling up MATLAB Analytics with Kafka and Cloud Services

Data. Does it Matter?

Azure IoT Suite. Secure device connectivity and management. Data ingestion and command + control. Rich dashboards and visualizations

IoT ENABLED INTELLIGENT FLEET MANAGEMENT. Kalman Tiboldi Chief Business Innovation Officer

Predictive Analytics Reimagined for the Digital Enterprise

Maturing IoT solutions with Microsoft Azure. Glenn Colpaert Azure/IoT Domain

BIG DATA TRANSFORMS BUSINESS

Luxoft and the Internet of Things

Let s distribute.. NOW: Modern Data Platform as Basis for Transformation and new Services

Transcription:

Real-time IoT Big Data-in-Motion Analytics Case Study: Managing Millions of Devices at Country-Scale

Real-time IoT Big Data-in-Motion Analytics Case Study: Managing Millions of Devices at Country-Scale SUMMARY... 3 CUSTOMER CHALLENGES... 4 The Need for Speed... 4 Country-Scale Continuous Real-time Operations... 4 OUR SOLUTION... 5 Time-to-Value Counts... 5 Single Platform to Analyze Past and Present Events... 6 A Flat Ultra Low Latency Platform... 6 Always Hot Event Lake and Event Brokering... 7 Live Interactive Visual Analytics... 8 ABOUT LOGTRUST... 9 Any data, any volume, in real-time, all the time... 9 What makes us unique... 10 2

Real-time IoT Big Data-in-Motion Analytics Case Study: Managing Millions of Devices at Country Scale This white paper is intended for all audiences involved with IoT Big Data Monitoring and Analytics such as Security Admin/Manager, IT Manager, Network Ops, Incident Responder, Forensics, Hunting Team, Compliance Officer, Field Operations, DevOps, CISO/CSO, CIO, CDO. SUMMARY We will describe a real-world case study running in production at Logtrust where a major telecom operator is adopting a modern architecture to manage millions of devices at country-scale to improve the quality of experience of its digital TV and Video On-Demand service. Our customer is leveraging Logtrust s unique platform capability to unify historical and present data as-they-come to provide real-time, all-the-time insights on Fast and Big Data Analytics. As a result, our customer is benefiting from the following unique competitive advantages: Understanding data in-the-moment for real-time insights enabling actionable corrective actions. In the field of security, it could be real-time threat hunting where intruder cyber kill chains can be observed, and in marketing it would enable real-time digital marketing personalization. Understanding data historically such as performing ultra-fast retrospective pattern matching and infering the likelihood of failures ahead. In the field of operations this is core to predictive conditions-based maintenance and in the field of security it is all about forensics. 3

CUSTOMER CHALLENGES The Need for Speed Real-time Ingestion, Classification, Retention Managing millions of devices generating gigabytes (GB) of data per second requires a special data pipeline, compute and storage architecture capable of ingesting continuously millions to billions of records per second of raw events. As data arrive often unbounded in size, loading the data in memory is not realistic because the data stream may be potentially too large to store in a local storage. For example, a turbine engine with more than 100 sensors can send 200+ GB per day of data; then depending upon how accurately you must detect deviations from the past, you may need to retain petabytes (PB) of data per turbine. In our real-world use case scenario, the telecom operator must manage millions of set-top boxes and digital TV vital signs. Each of these devices can generate anywhere between 1GB to 100GB of data per day. Depending upon the amount of historical data you must retain per device, each device may require more than 5 terabytes (TB) of data retained for statistical baselining. Ultra-fast data exploration and interaction To make sense out of billions of events you must perform ultra-fast investigative searches and queries back in time and compare them with real-time events as-they-come. Since data sources may well be disconnected, you will likely need to perform in-fly enrichment of data to gain additional attributes and perform aggregation and statistical operations. In our real-world use case, the telecom operator field operations need to: Manage many events occurring on devices of the same type Detect a few number of events-of-interest of a few types in a selected time window Deal with rates of a thousand events per minute Country-Scale Continuous Real-time Operations Today just to deliver on the basic 24x7 service level agreements (SLAs), the telecom operator is leveraging its connected infrastructure for both Live TV and Video On Demand for fault 4

detection and diagnostics via continuous remote monitoring. The challenge is determining when you do need be on site and perform corrective actions at country-scale. There are certainly various operating models, but in this particular case the telecom has gained flexibility by establishing partnerships in each country/region to deliver a service through a virtual organization mode that represents the telecom operator as a single entity. This has been made possible through the use of Logtrust s platform Cloud architecture which enables near instantaneous deployment operations in any region or country with the necessary company portal personalization. OUR SOLUTION Time-to-Value Counts At Logtrust we specialize in real-time insights from high volume, high velocity data. Our platform increases the value density of data which can be defined as # of business relevant data per GB. What is critical to understand is that the value density of data has a tight link between the age of data and its arrival time. Data has the greatest value as it enters the data pipeline in real-time and is fresh, whereas historical data, or data at rest loses value with the passage of time. However, when you enrich historical data with real-time data, new insights can be gleaned, and the value density of the historical data increases. The key is that real-time data and historical need to be combined to extract the greatest value. For example, if you need continuous telemetry on a device but need to combine the history of its configuration setting changes to trigger a real-time alert, every second matters now. 5

At Logtrust we have a platform that unifies historical data and real-time data in a single architecture and hence is able to perform real-time search and correlation continuously for every query type. Single Platform to Analyze Past and Present Events Logtrust provides a single platform to analyze past and present data. A unified architecture for historical data and real-time streaming is key for real-time performance at scale, and also for cost efficiency. Current technologies such as LAMBDA will recommend a dual architecture one for real-time streaming such as Kafka, Spark and one for historical analysis in batch mode such as HDFS with Over Hadoop SQL query. This dual type of architecture requires heavy compute and storage and lots of engineers to run and maintain it. In our real-world production use case, the telecom operator uses Logtrust s single platform to analyze both historical data and real-time data streaming in from millions of set-top boxes without any performance degradation. A Flat Ultra Low Latency Platform Logtrust has built a modern platform that not only embraced Cloud Hyper Scalability in both compute and storage but also native secure multi-tenancy. It is based on a combination of a time- 6

series compute paradigm and multi-stage indexing to perform ultra-low latency, massive parallel queries with linear performance per core. As a result, Logtrust has unique capabilities of real-time ingestion per core of +150,000 EPS/core, +1,000,000 EPS/core for real-time search and +65,000 EPS/core for complex event processing. Logtrust s platform addresses the scalability wall, when the volume of data begins to rise above 100 million events per second. Always Hot Event Lake and Event Brokering When managing millions of devices at country-scale, it is all about patterns of events management. Event objects contain information that needs to be extracted. Events-of-interests are a set of events whose sequence or concurrence within the same window of time may have significance. Leveraging its open REST API architecture and always hot cloud storage, Logtrust holds the results of aggregated events or pre-computed event processing in a hot mode ready to be either passed to another event process or visualized via an interactive dashboard. With Logtrust Event Lake TM and Event Brokering capabilities using REST, the whole connected IoT devices infrastructure can be organized around a central broker hub. The hub dispatches precomputed events-of-interests between event producers and event consumers; though event producers and consumers do not know each other, both can subscribe to their own events-ofinterests and cross-communicate about what matters most. Logtrust Event Lake and Event Brokering enables highly scalable solutions without dependencies between event producers and event consumers. 7

Live Interactive Visual Analytics Using simple point and click navigation, business analysts and data scientists can build powerful interactive visualizations that are being refreshed in real-time. Logtrust has democratized graphbased visualizations such as Force-Directed Graph, flow-based diagrams such as Sankey, and spatial density-based diagrams such as Voronoi so that hidden data relationships from within billions of data points can be detected, explored and investigated on-the-fly. 8

ABOUT LOGTRUST Any data, any volume, in real-time, all the time Logtrust understands that businesses must be real-time or be obsolete. Founded in May 2011 by a team of professionals with over 15 years of experience in Fast Data and Big Data management and protection for financial services, Logtrust is a Real-Time Big Data-in-Motion platform offering Fast Data, Big Data analytics through a solution that enables real-time analytics for operations, fraud, security, marketing, IoT and other aspects of business. Logtrust provides the 9

ability to ingest, store and analyze massive, varied and dynamic data sets at high speed through its flexible cloud, on-premise and hybrid deployments. Recognized as a Gartner Cool Vendor 2016, our Mission is to democratize real-time Big Data tools for companies of any size and sector, allowing them to maximize their business value via security intelligence, infrastructure, monitoring, compliance, customer behavior, analytics, and business monitoring solutions. Funded by Atlantic Bridge (ABVen), Kibo Ventures, and Investing Profit Wisely (IPW), venture funds specializing in accelerating the scale up and internationalization of technology companies, Logtrust has a senior advisory board with extensive industry experience and proven leadership in growing companies internationally. Logtrust is located at the epicenter of Silicon Valley in Sunnyvale, CA, and further serves its global clients through offices in New York and Madrid. What makes us unique Because every second counts and time-to-insight matters, Logtrust has designed a time machine that ingests time-series logs in real-time and correlates with ultra-low-latency queries. Businesses can travel back in time at lightning speed and fast forward to the present to gain real-time data insights. Our industry first Fast Data and Big Data platform uses a FULL (Flat-Ultra-Low- Latency) elastic architecture capable of processing over 150,000 events per second per core or querying over 1 million events per second per core. This enables customers to perform real-time data-in-motion analytics and historical data queries to gain insight as-data-come delivering live data exploration and advanced visualization on an Always Hot, Always-On architecture. Major European telecommunication providers, cyber security service providers, and banks are using the Logtrust platform for real-time QoS TV streams monitoring, real-time adaptive defense against intrusion, and real-time applications-to-network monitoring. To learn more, visit www.logtrust.com, email info@logtrust.com or phone +1 866 242-1700 or +34 913088331. Logtrust and Logtrust Event Lake are trademarks of Logtrust, Inc. 10