Monetizing the Lake. Kirk Haslbeck, Hortonworks Dan Kernaghan, Pitney Bowes

Similar documents
Microsoft Azure Essentials

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

Common Customer Use Cases in FSI

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia

TechValidate Survey Report. Converged Data Platform Key to Competitive Advantage

SAP Predictive Analytics Suite

Building Your Big Data Team

Insights to HDInsight

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden

Business is being transformed by three trends

Bringing the Power of SAS to Hadoop Title

WHITE PAPER. Results Delivers Value

LEVERAGING DATA ANALYTICS TO GAIN COMPETITIVE ADVANTAGE IN YOUR INDUSTRY

Azure ML Data Camp. Ivan Kosyakov MTC Architect, Ph.D. Microsoft Technology Centers Microsoft Technology Centers. Experience the Microsoft Cloud

SAP Big Data. Markus Tempel SAP Big Data and Cloud Analytics Services

Exelon Utilities Data Analytics Journey

Hybrid Data Management

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand

5th Annual. Cloudera, Inc. All rights reserved.

Pitney Bowes Spectrum Technology Platform. Deliver actionable customer and location intelligence when and where you need it

Cloud Integration and the Big Data Journey - Common Use-Case Patterns

GET MORE VALUE OUT OF BIG DATA

Microsoft Big Data. Solution Brief

Customer Value Analytics for Banking & Capital Markets

Hadoop and Analytics at CERN IT CERN IT-DB

Your Big Data to Big Data tools using the family of PI Integrators

Big Data The Big Story

DLT AnalyticsStack. Powering big data, analytics and data science strategies for government agencies

Customer Value Analytics for Banking & Capital Markets

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Big Data Analytics for Retail with Apache Hadoop. A Hortonworks and Microsoft White Paper

Brian Macdonald Big Data & Analytics Specialist - Oracle

Cloud Based Analytics for SAP

Pentaho 8.0 Overview. Pedro Alves

Integrated Social and Enterprise Data = Enhanced Analytics

Nouvelle Génération de l infrastructure Data Warehouse et d Analyses

Architecture Overview for Data Analytics Deployments

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

Hortonworks Powering the Future of Data

Intel Public Sector 3

Oracle Big Data Cloud Service

ETL challenges on IOT projects. Pedro Martins Head of Implementation

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

IBM SPSS Modeler Personal

Spark, Hadoop, and Friends

Open Banking Approach with SmartVista Technologies. Peter Theunis. BPC Banking Technologies 2017 Mexico City

: Boosting Business Returns with Faster and Smarter Data Lakes

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS)

Transforming Big Data to Business Benefits

Big Data Monetisation : Selected Success Stories

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

Let s distribute.. NOW: Modern Data Platform as Basis for Transformation and new Services

20775: Performing Data Engineering on Microsoft HD Insight

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Why Big Data Matters? Speaker: Paras Doshi

Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics

Oracle Big Data Discovery The Visual Face of Big Data

Real-time Streaming Insight & Time Series Data Analytic For Smart Retail

WHITE PAPER SPLUNK SOFTWARE AS A SIEM

Session 30 Powerful Ways to Use Hadoop in your Healthcare Big Data Strategy

Governing Big Data and Hadoop

Integrating MATLAB Analytics into Enterprise Applications

Oracle Autonomous Data Warehouse Cloud

Comprehensive Enterprise Solution for Compliance and Risk Monitoring

Big Data Trends Arató Bence. BI Consulting

EMC IT Big Data Analytics Journey. Mahmoud Ghanem Sr. Systems Engineer

OpenText Captiva. Redefine Your Business Through Intelligent Enterprise Capture

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW

Data Analytics for Semiconductor Manufacturing The MathWorks, Inc. 1

STAR Network Overview

Top 3 Strategies for Modernizing Enterprise Data Management C L O U D A N A L Y T I C S D I G I T A L S E C U R I T Y

CA UIM Log Analytics. Gain Full Stack Visibility With Contextual Log Insights. Mark Tukh Principal Presale Consultant CA NESS AT

OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT

DIGITAL BSS CORE Solution Overview

In-Memory Analytics: Get Faster, Better Insights from Big Data

Operational Hadoop and the Lambda Architecture for Streaming Data

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics

Customer Relationship Management Solutions for Vehicle Captive Finance. An Oracle White Paper October 2003

The Evolution of Big Data

Trusted by more than 150 CSPs worldwide.

Verint Engagement Management Solution Brief. Overview of the Applications and Benefits of

At the Heart of Maximizing Ancillary Revenues

Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved.

Service Virtualization

Cask Data Application Platform (CDAP)

HP SummerSchool TechTalks Kenneth Donau Presale Technical Consulting, HP SW

Accelerating Cloud Value through Analytics

The Alpine Data Platform

Responsive enterprise the future of the enterprise PERSPECTIVE

Welcome to. enterprise-class big data and financial a. Putting big data and advanced analytics to work in financial services.

IBM Big Data Summit 2012

Reduce Money Laundering Risks with Rapid, Predictive Insights

How to improve your AML detection? Christopher Ghenne Principal Manager Fraud & Security Intelligence EMEA

Apply Big Data Analytics and Machine Learning in Real Time to Disrupt Business Models. OOP 2017 (Munich, Germany)

Copyright - Diyotta, Inc. - All Rights Reserved. Page 2

Conquering big data challenges

Konica Minolta Business Innovation Center

Evolution to Revolution: Big Data 2.0

Maximize Your Big Data Investment with Self-Service Analytics Presented by Michael Setticasi, Sr. Director, Business Development

Transcription:

Monetizing the Lake Kirk Haslbeck, Hortonworks Dan Kernaghan, Pitney Bowes

Hadoop is Lower Cost and more Scalable 14000 Cost Per Terabyte 12000 10000 8000 6000 4000 2000 0 HDP Oracle X Teradata Netezza Hortonworks #REF! 2 Hortonworks Inc. 2011 2016. All Rights Reserved

Cost Drivers The Big Picture Insights Produce more valuable and more holistic insights Security - Apply Security Policies in one place instead of repeating them in each Silo Collaborate - Curate Feature Vectors for our Data Scientists and Promote Collaboration Time Get models into production faster. Human time still the most costly Storage Store data in an accessible file system at the lowest cost Time-to- Market Insights Storage Security Collaborate 3 Hortonworks Inc. 2011 2016. All Rights Reserved

Various Data Types Structured Time-Series Unstructured First_Name SSN Net_Worth Joe 233-33 100,000 Mark 456-77 200,000 40 35 30 25 20 15 10 5 0 12:05 12:08 12:11 12:14 12:17 12:20 Best Buy released their earnings this quarter and beat analyst expectations. Earnings per share increased by 0.02 DB2, Oracle KDB File System 4 Hortonworks Inc. 2011 2016. All Rights Reserved

HDP Stack Attack the Data with the Right Tool 5 Hortonworks Inc. 2011 2016. All Rights Reserved

Limitations of Building a Model on a Traditional Platform If you need a lot of data to build a good model, what tools can you use? Data volumes can eliminate the possibility of desktop tools R, Eclipse all limited to 8G of Ram on the desktop machine Sampling? Well we better get an even distribution of true and false positives in each sample, but wait that requires data munging, back to what tools can we use. Security Concerns? Extracting data from it s secure resting place and pushing it into other environments, often times unsecure files or desktops where Matlab or R can be installed. Collaboration Push processing to the data using modern distributed tooling. 6 Hortonworks Inc. 2011 2016. All Rights Reserved

Web-based Notebook for interactive analytics Apache Zeppelin Features Ad-hoc experimentation Deeply integrated with Spark + Hadoop Supports multiple language backends Incubating at Apache Use Case Data exploration and discovery Visualization Interactive snippet-at-a-time experience Modern Data Science Studio 7 Hortonworks Inc. 2011 2016. All Rights Reserved

Data Science Notebooks - Collaborate 8 Hortonworks Inc. 2011 2016. All Rights Reserved

Insider Trading 9 Hortonworks Inc. 2011 2016. All Rights Reserved

10 Hortonworks Inc. 2011 2016. All Rights Reserved

Banking: Credit Card Fraud Detection 11 Hortonworks Inc. 2011 2016. All Rights Reserved

Discovery Gathered all Credit Card Transactions Problem is they didn t make sense No identifiable patterns, no log normal curves Gas $45, Chipotle $8.50, Steak dinner $88, Amazon shoes $55 Classification 12 Hortonworks Inc. 2011 2016. All Rights Reserved

Outlier Detection: identify abnormal patterns Example: identify anomalies Features: - Time frequency - Category - Amount - Distance 13 Hortonworks Inc. 2011 2016. All Rights Reserved

Hortonworks Data Flow 14 Hortonworks Inc. 2011 2016. All Rights Reserved Page 14

Pitney Bowes and Hortonworks Spatially Enabling the Data Lake 15 Hortonworks Inc. 2011 2016. All Rights Reserved

6 Pitney Bowes Data Global Coverage Global coverage built on a legacy of accuracy and precision Recognized leader for LI Data and capabilities. 16 AMER EMEA APAC 764 3079 719 Datasets Datasets Datasets Hortonworks Inc. 2011 2016. All Rights Reserved Local datasets for 240 Countries

Pitney 17 Bowes Partner Program Overview February 14, 2017 Pitney Bowes Data Unparalleled Depth 17 Hortonworks Inc. 2011 2016. All Rights Reserved

Risk of Relying Solely on Public Data 5 / 5 / Incorrect information for this property: Last sale date Last sale price # of bedrooms # of rooms Finished basement # of spaces (garage) Structure type Lot width Parcel boundary $207,000 July 1997 Unfinished /13 July 1997 $207,000 / 2 18 Hortonworks Inc. 2011 2016. All Rights Reserved 75

Easy to Deploy and Use Spatial Visualization Reporting Big Data Ecosystem Tools Analytics Custom Applications Client Applications Spectrum Data Quality for Big Data Spectrum Addressing for Big Data Spectrum Spatial for Big Data Spectrum Geocoding for Big Data Spectrum Routing for Big Data Pitney Bowes Data Products Distributed Cluster NoSQL Database HDFS Reference Datasets Hive Spark Pitney Bowes April 19, 2017 19 Hortonworks Inc. 2011 2016. All Rights Reserved

Enriching Data with a Location Stack For a given location: POI (carries attributes) Retail (Business) Footprint poly Building Footprint Parcel (Lot) Isochrone(travel time) Demographics, lifestyle attributes, financial and consumer vitality, etc. 20 Hortonworks Inc. 2011 2016. All Rights Reserved

Hydrating the Spatial Data Lake Property Data 180M+ Property Addresses Geocode Property Attributes Risk Data Property Boundaries Distance to Water Flood Risk Wild Fire Risk Market Data GeoDemographics Neighborhood Boundaries Zip Code Boundaries Points of Interest Property Data Risk Data Market Data Wild Fire Risk Walkability Scores Plus Transactions IOT Sensors Social Media 21 Hortonworks Inc. 2011 2016. All Rights Reserved

22 Case studies: Drive superior business outcomes and gain a deeper understanding of customers. Online Mortgage Loan Provider By consolidating data and running real-time address validation, they gained a complete view of customers, enabling more effective marketing, accelerated mortgage origination to enable loan processing in days not weeks. Financial service firm gains richer profiles Restored missing address data through data standardization, data augmentation and geocoding. Enabled firm to run targeted multichannel promotions via web and smartphone apps. Global US Based Wealth Management Organization Increase customer lifetime value and provide ideal customer experience by optimizing every contact with its mass-affluent customers, with 35% increase in revenues and 55% improvement in client satisfaction Pitney 22 Bowes Partner Program Overview April 19, 2017 Hortonworks Inc. 2011 2016. All Rights Reserved

Large US Online Loan Provider Property Analytics Case Study Business Challenge: Close loans more quickly, improve client experience while mitigating lender s risk This lender, unlike most others, relies on wholesale funding to make its loans and uses online applications rather than a system of branches. Close Loans Faster Lender found many specific requirements delayed loan funding and closure, causing clients to abandon online process. Integration if Pitney Bowes data through the pb key enabled the analysis of loan requests to provided an accurate qualification of the property for a loan, reducing abandoned rates and accelerating revenue. Mitigating Risk The accurate and complete attributes provided by the spatial data lake, correctly assessed the risks associated with a loan, enabling more accurate pricing and profitability. Desired Outcomes Improved real-time and long-term decisions Access to accurate date for 180M properties in the US Sharing information with partners (e.g. Fannie Mae) Complete picture of property, risk and market Benefits Accurate qualification of property for a particular loan type Faster loan processing and closure Improved risk assessment of loan to particular property. 23 Hortonworks Inc. 2011 2016. All Rights Reserved

24 Five reasons to modernize with Pitney Bowes Big Data SDKs 1 They re easy Simple and intuitive user experience Program in SQL to run processes in the Hortonworks Spatial Data Lake 2 3 4 5 They re powerful Take advantage of more data Answer questions that were too big before They re incredibly fast Process enormous amounts of data in a fraction of the time They re practical Avoid large capital outlays They ll run in the cloud They re secure Extend and enforce your Hadoop permissions Easy to manage and configure Pitney Bowes April 19, 2017 24 Hortonworks Inc. 2011 2016. All Rights Reserved