Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved.

Size: px
Start display at page:

Download "Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved."

Transcription

1 Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved. 1

2 SAP Big Data Webinar Series Big Data - Introduction to SAP Big Data Technologies Big Data - Streaming Analytics Big Data - Smarter Data Virtualization Big Data - Gain New Insight from Hadoop Big Data - Spatial Data Processing for Richer Insights Big Data - Text Analytics

3 Speaker Introduction Yuvaraj Athur Raghuvir, is a Senior Director in the SAP HANA Platform Solution Management at SAP. He leads the Big Data Analytics portfolio including SAP Real-time Data Platform. Yuvaraj has over 14 years of experience spanning Business Applications, Business Analytic Solutions, Architecture and Engineering SAP AG or an SAP affiliate company. All rights reserved. 3

4 SAP Big Data Webinar Series Gain New insight from Hadoop Presented by: Yuvaraj Athur Raghuvir, SAP HANA Platform July

5 SHOPPERS GET FASHION ADVICE THAT FITS THEIR STYLE

6 Big Data Economics Streaming Data incl. Sensors, Social and Mobile Predictive incl. data mining and machine learning Urgent Need Unstructured Analysis Incl. text, media, spatial etc. Scalable Storage 2013 SAP AG or an SAP affiliate company. All rights reserved. 6

7 Open Source Community Gift: Apache Hadoop Streaming Data Apache Hadoop incl. Sensors, Social and Mobile Predictive [Commons] incl. data mining and machine learning Urgent Need [Projects] Unstructured Analysis Incl. text, [Distributions] media, spatial etc. [Data Scientists] Scalable Storage 2013 SAP AG or an SAP affiliate company. All rights reserved. 7

8 Hadoop Possibilities Amazon has reported that it started 2 million Elastic MapReduce (EMR) clusters in a single year. Dynamic Community with over Hadoop related projects in GitHub. Growing Popularity among Vendors Storage Compute Network High Performance Computing Vendors Pushing New Boundaries! Source: 1) Gartner Blog retrieved on July ) Gartner Blog retrieved on July ) GitHub query Hadoop retrived on July SAP AG or an SAP affiliate company. All rights reserved. 8

9 Hadoop Community Complexity! Community Projects!! Highly Dynamic & Evolving Ecosystem Projects & Alternatives Gartner Infographic Source: Gartner Blog Source: GitHub query Hadoop retrived on July Source: GigaOm Infographic, Mar 5, retrieved on July retreived on July SAP AG or an SAP affiliate company. All rights reserved. GigaOm Infographic 9

10 The Big Data Phenomenon Big Data is more than just Hadoop Business Trends Technology Trends Exploding data volumes Increasing data variety Accelerating data velocity Storage / Memory / CPU advances Hadoop & distributed MPP Data Mining/Predictive analysis In-memory computing Complex event processing Enterprise Big Data 2013 SAP AG or an SAP affiliate company. All rights reserved. 10

11 Big Data Challenges Business and technical needs Business Needs Quick insights from all business-relevant data Pick the right action among many choices Technical Challenges Cost of store vs. cost to process data considerations Pressure to gain insight quickly from data Action 1 Action 2 Action 3 All Relevant Big Data Quick Insight Diversity of data formats makes it difficult to analyze Right Action Need to have disparate technologies interoperate across enterprise Determine the right action among many valuable insights across variety, volume, velocity and technology is difficult 2013 SAP AG or an SAP affiliate company. All rights reserved. 11

12 How to Capitalize on the Big Data Opportunity and Address Big Data Technical Challenges? To deploy an integrated data processing framework Optimize data management in each phase of the information lifecycle process Regardless of data source, processing technologies, latency challenges, number of user demands To enable real-time, actionable insights in business process context Marry business process insights from structured data analysis with deep pattern, behavior analysis of unstructured data Enable decision making based on multi-factor considerations, not just instinct/experience To derive new value from INFORMATION Focus on deriving new value from data by enabling new business and technology use cases previously not feasible Augment existing business scenarios with new data insights to enable better decision 2013 SAP AG or an SAP affiliate company. All rights reserved. 12

13 Enterprise Scenarios with Hadoop Hadoop as a flexible data store Streaming Data Social Media Reference Data Transaction Data Enterprise Data SAP Business Suite Other SAP solutions Hadoop as a simple database Hadoop SAP Solutions Data warehouse/database (SAP HANA, SAP Sybase IQ/SAP Sybase ASE) SAP Data Services Computation Engine(s) Job Management Data storage (Hadoop Distributed File system) Hadoop as a processing engine Hadoop for data analytics Non-SAP solutions BI and analytics software from SAP In-memory Analytic engine and/or... Disk-based data ware-house (SAP Sybase IQ) Analytic engine 2013 SAP AG or an SAP affiliate company. All rights reserved. 13

14 Hadoop as a Simple Database SAP Business Suite SAP Solutions Focus: Storage and Retrieval of data from Hadoop typically using interfaces provided by HIVE or direct HDFS access Other SAP solutions Data warehouse/database (SAP HANA, SAP Sybase IQ/SAP Sybase ASE) Hadoop as a flexible data store Streaming Data Social Media Reference Data Transaction Data Enterprise Data Hadoop as a simple database Hadoop SAP Data Services Computation Engine(s) Job Management Data storage (Hadoop Distributed File system) Scenarios of Use include: Extract-Transform-Load from other systems to Hadoop. SAP Data Services provides ETL support from Hadoop to SAP HANA. Store & Retrieve Structured data based on projects like Hive. Depending on the scenario, scalable analytical systems like Sybase IQ can also be considered. Handle large documents as Blobs to do retrievals or analytics later. Use as a near-line store for offloading data that is considered cold or frozen Data lifecycle management is typically manual SAP AG or an SAP affiliate company. All rights reserved. 14

15 Hadoop as a Processing Engine SAP Business Suite SAP Solutions Focus: Distributed Compute leveraging the Map-Reduce computation framework of Hadoop on large distributed data sets Other SAP solutions Data warehouse/database (SAP HANA, SAP Sybase IQ/SAP Sybase ASE) Hadoop as a flexible data store Streaming Data Social Media Reference Data Transaction Data Enterprise Data Hadoop SAP Data Services Computation Engine(s) Job Management Data storage (Hadoop Distributed File system) Hadoop as a processing engine Scenarios of Use include: Data Enrichment. Push down of Text Data Transforms from Data Services is an example Data Pattern Analysis. This is an emerging space across new data forms. Convergence between procedural data science and declarative access patterns are evolving SAP AG or an SAP affiliate company. All rights reserved. 15

16 Hadoop for Data Analytics BI and analytics software from SAP Analytic engine In-memory and/or... Disk-based data ware-house (SAP Sybase IQ) Analytic engine Focus: A combination of storage and delegated analytics supporting two approaches: Two-Phase Analytics: Background processing engine refine and feeding data Federated Queries: Client side federation across data stores Hadoop as a flexible data store Streaming Data Social Media Reference Data Transaction Data Enterprise Data Hadoop for data analytics Hadoop Computation Engine(s) Job Management Data storage (Hadoop Distributed File system) Scenarios of Use include: Cross DM analytics. Practical only when performance from Hadoop is acceptable Stand-alone analytics. Emerging area to use Hadoop as the direct data store for analytics 2013 SAP AG or an SAP affiliate company. All rights reserved. 16

17 SAP HANA SP6 - smart data access capability Data virtualization for on-premise and hybrid cloud environments New HANA Tables Transactions + Analytics SAP HANA Virtual Tables Benefits Enables access to remote data access just like local table Provides SAP HANA to SAP HANA queries Smart query processing including query decomposition with predicate push-down, functional compensation Supports data location agnostic development No special syntax to access heterogeneous data sources Non-disruptive evolution Teradata IQ Heterogeneous data sources SAP HANA to Hadoop (Hive) Teradata SAP Sybase ASE SAP Sybase IQ Hadoop ASE SAP HANA 2013 SAP AG. All rights reserved. 17

18 Example Reference Architecture : Machine-to-Machine Infrastructure Run-Time Architecture Device End User Apps Web App Mobile App Dashboard Edge Cloud / Backend Device Management Stream M2M Services Data Acquisition & Processing Batch M2M Application Server Core Services Application and Analytical Services Industry-Specific Services SQLA / UltraLite Data Persistence Synchronization Real-Time Data Platform ESP Event Processing SQLA Data Synchronization HANA Hot Data Data Models Predictive Models ASE / IQ Warm and Cold Data Hadoop Big Data Sets 2013 SAP AG. All rights reserved. 18

19 Beyond Business Networks Internet of Things Meters, Drills MRI, PDAs Generators Turbines Healthcare Industrial Windmills Implants, Surgical Equipment UPS Batteries Pumps, Monitors Public Sector Fuel Cells, etc. Pumps, Valves, Vats, Conveyors, Pipelines Telemedicine, etc. Motors, Drives, Converting, Fabrication Annual smart meter shipments to surpass 140 million units worldwide by 2016, representing a CAGR of 32.9%.[3] Tanks, Fighter Jets Battlefield Comms Jeeps, Cars, Ambulances Breakdown, Lone Worker Homeland Security Tolls, etc. Environ. Monitoring, etc. Planes, Signage Assembly/Packaging, Vessels/Tanks, etc. Vehicles, Lights, Ships Meter Data Management will exceed $420 Million by 2020 with a CAGR of 16.8%[2] Cellular M2M revenue opportunity projected to reach $1.2 Trillion by 2020 Picture: Beecham Research 1. GSMA; 2.Pike Research; 3. IDC Energy Insights

20 DRIVERS DIVERTED BEFORE FATAL ACCIDENTS HAPPEN

21 SAP Big Data Webinar Series Thank You! Presented by: Yuvaraj Athur Raghuvir, SAP HANA Platform 2013 SAP AG or an SAP affiliate company. All rights reserved.

22 Hadoop Commons : Core / Common Components Hadoop Distributed File System: HDFS, the storage layer of Hadoop, is a distributed, scalable, Java-based file system adept at storing large volumes of unstructured data. MapReduce: MapReduce is a software framework that serves as the compute layer of Hadoop. MapReduce jobs are divided into two (obviously named) parts. The Map function divides a query into multiple parts and processes data at the node level. The Reduce function aggregates the results of the Map function to determine the answer to the query. Hive: Hive is a Hadoop-based data warehousing-like framework originally developed by Facebook. It allows users to write queries in a SQL-like language caled HiveQL, which are then converted to MapReduce. This allows SQL programmers with no MapReduce experience to use the warehouse and makes it easier to integrate with business intelligence and visualization tools such as Microstrategy, Tableau, Revolutions Analytics, etc. Pig: Pig Latin is a Hadoop-based language developed by Yahoo. It is relatively easy to learn and is adept at very deep, very long data pipelines (a limitation of SQL.) HBase: HBase is a non-relational database that allows for low-latency, quick lookups in Hadoop. It adds transactional capabilities to Hadoop, allowing users to conduct updates, inserts and deletes. EBay and Facebook use HBase heavily. Flume: Flume is a framework for populating Hadoop with data. Agents are populated throughout ones IT infrastructure inside web servers, application servers and mobile devices, for example to collect data and integrate it into Hadoop. Oozie: Oozie is a workflow processing system that lets users define a series of jobs written in multiple languages such as Map Reduce, Pig and Hive -- then intelligently link them to one another. Oozie allows users to specify, for example, that a particular query is only to be initiated after specified previous jobs on which it relies for data are completed. Source: retrieved on Jul SAP AG or an SAP affiliate company. All rights reserved. 22

23 Hadoop Commons : Core / Common Components Ambari: Ambari is a web-based set of tools for deploying, administering and monitoring Apache Hadoop clusters. It's development is being led by engineers from Hortonwroks, which include Ambari in its Hortonworks Data Platform. Avro: Avro is a data serialization system that allows for encoding the schema of Hadoop files. It is adept at parsing data and performing removed procedure calls. Mahout: Mahout is a data mining library. It takes the most popular data mining algorithms for performing clustering, regression testing and statistical modeling and implements them using the Map Reduce model. Sqoop: Sqoop is a connectivity tool for moving data from non-hadoop data stores such as relational databases and data warehouses into Hadoop. It allows users to specify the target location inside of Hadoop and instruct Sqoop to move data from Oracle, Teradata or other relational databases to the target. HCatalog: HCatalog is a centralized metadata management and sharing service for Apache Hadoop. It allows for a unified view of all data in Hadoop clusters and allows diverse tools, including Pig and Hive, to process any data elements without needing to know physically where in the cluster the data is stored. BigTop: BigTop is an effort to create a more formal process or framework for packaging and interoperability testing of Hadoop's sub-projects and related components with the goal improving the Hadoop platform as a whole. Source: retrieved on Jul SAP AG or an SAP affiliate company. All rights reserved. 23

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Rohit Bakhshi, Solution Architect, Hortonworks Jim Walker, Director Product Marketing, Talend Page 1 About Us Rohit Bakhshi Solution

More information

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

E-guide Hadoop Big Data Platforms Buyer s Guide part 1 Hadoop Big Data Platforms Buyer s Guide part 1 Your expert guide to Hadoop big data platforms for managing big data David Loshin, Knowledge Integrity Inc. Companies of all sizes can use Hadoop, as vendors

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information

Microsoft Big Data. Solution Brief

Microsoft Big Data. Solution Brief Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,

More information

Big Data The Big Story

Big Data The Big Story Big Data The Big Story Jean-Pierre Dijcks Big Data Product Mangement 1 Agenda What is Big Data? Architecting Big Data Building Big Data Solutions Oracle Big Data Appliance and Big Data Connectors Customer

More information

20775: Performing Data Engineering on Microsoft HD Insight

20775: Performing Data Engineering on Microsoft HD Insight Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

SAP Big Data. Markus Tempel SAP Big Data and Cloud Analytics Services

SAP Big Data. Markus Tempel SAP Big Data and Cloud Analytics Services SAP Big Data Markus Tempel SAP Big Data and Cloud Analytics Services Is that Big Data? 2015 SAP AG or an SAP affiliate company. All rights reserved. 2 What if you could turn new signals from Big Data into

More information

Cask Data Application Platform (CDAP)

Cask Data Application Platform (CDAP) Cask Data Application Platform (CDAP) CDAP is an open source, Apache 2.0 licensed, distributed, application framework for delivering Hadoop solutions. It integrates and abstracts the underlying Hadoop

More information

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

ORACLE DATA INTEGRATOR ENTERPRISE EDITION ORACLE DATA INTEGRATOR ENTERPRISE EDITION Oracle Data Integrator Enterprise Edition delivers high-performance data movement and transformation among enterprise platforms with its open and integrated E-LT

More information

MapR: Solution for Customer Production Success

MapR: Solution for Customer Production Success 2015 MapR Technologies 2015 MapR Technologies 1 MapR: Solution for Customer Production Success Big Data High Growth 700+ Customers Cloud Leaders Riding the Wave with Hadoop The Big Data Platform of Choice

More information

Operational Hadoop and the Lambda Architecture for Streaming Data

Operational Hadoop and the Lambda Architecture for Streaming Data Operational Hadoop and the Lambda Architecture for Streaming Data 2015 MapR Technologies 2015 MapR Technologies 1 Topics From Batch to Operational Workloads on Hadoop Streaming Data Environments The Lambda

More information

ETL challenges on IOT projects. Pedro Martins Head of Implementation

ETL challenges on IOT projects. Pedro Martins Head of Implementation ETL challenges on IOT projects Pedro Martins Head of Implementation Outline What is Pentaho Pentaho Data Integration (PDI) Smartcity Copenhagen Example of Data structure without an OLAP schema Telematics

More information

Evolution to Revolution: Big Data 2.0

Evolution to Revolution: Big Data 2.0 Evolution to Revolution: Big Data 2.0 An ENTERPRISE MANAGEMENT ASSOCIATES (EMA ) White Paper Prepared for Actian March 2014 IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Table of Contents

More information

Why Big Data Matters? Speaker: Paras Doshi

Why Big Data Matters? Speaker: Paras Doshi Why Big Data Matters? Speaker: Paras Doshi If you re wondering about what is Big Data and why does it matter to you and your organization, then come to this talk and get introduced to Big Data and learn

More information

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia

MapR: Converged Data Pla3orm and Quick Start Solu;ons. Robin Fong Regional Director South East Asia MapR: Converged Data Pla3orm and Quick Start Solu;ons Robin Fong Regional Director South East Asia Who is MapR? MapR is the creator of the top ranked Hadoop NoSQL SQL-on-Hadoop Real Database time streaming

More information

Analyze Big Data Faster and Store it Cheaper. Dominick Huang CenterPoint Energy Russell Hull - SAP

Analyze Big Data Faster and Store it Cheaper. Dominick Huang CenterPoint Energy Russell Hull - SAP Analyze Big Data Faster and Store it Cheaper Dominick Huang CenterPoint Energy Russell Hull - SAP ABOUT CENTERPOINT ENERGY, INC. Publicly traded on New York Stock Exchange Headquartered in Houston, Texas

More information

Common Customer Use Cases in FSI

Common Customer Use Cases in FSI Common Customer Use Cases in FSI 1 Marketing Optimization 2014 2014 MapR MapR Technologies Technologies 2 Fortune 100 Financial Services Company 104M CARD MEMBERS 3 Financial Services: Recommendation Engine

More information

Big Business Value from Big Data and Hadoop

Big Business Value from Big Data and Hadoop Big Business Value from Big Data and Hadoop Page 1 Topics The Big Data Explosion: Hype or Reality Introduction to Apache Hadoop The Business Case for Big Data Hortonworks Overview & Product Demo Page 2

More information

HP SummerSchool TechTalks Kenneth Donau Presale Technical Consulting, HP SW

HP SummerSchool TechTalks Kenneth Donau Presale Technical Consulting, HP SW HP SummerSchool TechTalks 2013 Kenneth Donau Presale Technical Consulting, HP SW Copyright Copyright 2013 2013 Hewlett-Packard Development Development Company, Company, L.P. The L.P. information The information

More information

Jason Virtue Business Intelligence Technical Professional

Jason Virtue Business Intelligence Technical Professional Jason Virtue Business Intelligence Technical Professional jvirtue@microsoft.com Agenda Microsoft Azure Data Services Azure Cloud Services Azure Machine Learning Azure Service Bus Azure Stream Analytics

More information

Building Your Big Data Team

Building Your Big Data Team Building Your Big Data Team With all the buzz around Big Data, many companies have decided they need some sort of Big Data initiative in place to stay current with modern data management requirements.

More information

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN From Information to Insight: The Big Value of Big Data Faire Ann Co Marketing Manager, Information Management Software, ASEAN The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT

More information

MapR Pentaho Business Solutions

MapR Pentaho Business Solutions MapR Pentaho Business Solutions The Benefits of a Converged Platform to Big Data Integration Tom Scurlock Director, WW Alliances and Partners, MapR Key Takeaways 1. We focus on business values and business

More information

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?

More information

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud

More information

Real-time Streaming Insight & Time Series Data Analytic For Smart Retail

Real-time Streaming Insight & Time Series Data Analytic For Smart Retail Real-time Streaming Insight & Time Series Data Analytic For Smart Retail Sudip Majumder Senior Director Development Industry IoT & Big Data 10/5/2016 Economic Characteristics of Data Data is the New Oil..then

More information

1. Intoduction to Hadoop

1. Intoduction to Hadoop 1. Intoduction to Hadoop Hadoop is a rapidly evolving ecosystem of components for implementing the Google MapReduce algorithms in a scalable fashion on commodity hardware. Hadoop enables users to store

More information

IBM Big Data Summit 2012

IBM Big Data Summit 2012 IBM Big Data Summit 2012 12.10.2012 InfoSphere BigInsights Introduction Wilfried Hoge Leading Technical Sales Professional hoge@de.ibm.com twitter.com/wilfriedhoge 12.10.1012 IBM Big Data Strategy: Move

More information

Hybrid Data Management

Hybrid Data Management Kelly Schlamb Executive IT Specialist, Worldwide Analytics Platform Enablement and Technical Sales (kschlamb@ca.ibm.com, @KSchlamb) Hybrid Data Management IBM Analytics Summit 2017 November 8, 2017 5 Essential

More information

Welcome to. enterprise-class big data and financial a. Putting big data and advanced analytics to work in financial services.

Welcome to. enterprise-class big data and financial a. Putting big data and advanced analytics to work in financial services. Welcome to enterprise-class big data and financial a Putting big data and advanced analytics to work in financial services. MapR-FSI Martin Darling We reinvented the data platform for next-gen intelligent

More information

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved. Apache Spark 2.0 GA The General Engine for Modern Analytic Use Cases 1 Apache Spark Drives Business Innovation Apache Spark is driving new business value that is being harnessed by technology forward organizations.

More information

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect 2005 Concert de Coldplay 2014 Concert de Coldplay 90% of the world s data has been created over the last two years alone 1 1. Source

More information

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS)

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS) ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS) Hadoop Storage-as-a-Service ABSTRACT This White Paper illustrates how Dell EMC Elastic Cloud Storage (ECS ) can be used to streamline

More information

ETL on Hadoop What is Required

ETL on Hadoop What is Required ETL on Hadoop What is Required Keith Kohl Director, Product Management October 2012 Syncsort Copyright 2012, Syncsort Incorporated Agenda Who is Syncsort Extract, Transform, Load (ETL) Overview and conventional

More information

Deloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward

Deloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward Deloitte School of Analytics Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward February 2018 Agenda 7 February 2018 8 February 2018 9 February 2018 8:00 9:00 Networking

More information

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Data Analytics Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Last 15 years IT-centric Traditional Analytics Traditional Applications Rigid Infrastructure Internet Next

More information

Big Data & Hadoop Advance

Big Data & Hadoop Advance Course Durations: 30 Hours About Company: Course Mode: Online/Offline EduNextgen extended arm of Product Innovation Academy is a growing entity in education and career transformation, specializing in today

More information

COPYRIGHTED MATERIAL. 1Big Data and the Hadoop Ecosystem

COPYRIGHTED MATERIAL. 1Big Data and the Hadoop Ecosystem 1Big Data and the Hadoop Ecosystem WHAT S IN THIS CHAPTER? Understanding the challenges of Big Data Getting to know the Hadoop ecosystem Getting familiar with Hadoop distributions Using Hadoop-based enterprise

More information

Session 30 Powerful Ways to Use Hadoop in your Healthcare Big Data Strategy

Session 30 Powerful Ways to Use Hadoop in your Healthcare Big Data Strategy Session 30 Powerful Ways to Use Hadoop in your Healthcare Big Data Strategy Bryan Hinton Senior Vice President, Platform Engineering Health Catalyst Sean Stohl Senior Vice President, Product Development

More information

The Intersection of Big Data and DB2

The Intersection of Big Data and DB2 The Intersection of Big Data and DB2 May 20, 2014 Mike McCarthy, IBM Big Data Channels Development mmccart1@us.ibm.com Agenda What is Big Data? Concepts Characteristics What is Hadoop Relational vs Hadoop

More information

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB Data Analytics and CERN IT Hadoop Service CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB 1 Data Analytics at Scale The Challenge When you cannot fit your workload in a desktop Data

More information

SAP Predictive Analytics Suite

SAP Predictive Analytics Suite SAP Predictive Analytics Suite Tania Pérez Asensio Where is the Evolution of Business Analytics Heading? Organizations Are Maturing Their Approaches to Solving Business Problems Reactive Wait until a problem

More information

Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications

Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications Copyright 2015 Cask Data, Inc. All Rights Reserved. February

More information

Hadoop and Analytics at CERN IT CERN IT-DB

Hadoop and Analytics at CERN IT CERN IT-DB Hadoop and Analytics at CERN IT CERN IT-DB 1 Hadoop Use cases Parallel processing of large amounts of data Perform analytics on a large scale Dealing with complex data: structured, semi-structured, unstructured

More information

How Data Science is Changing the Way Companies Do Business Colin White

How Data Science is Changing the Way Companies Do Business Colin White How Data Science is Changing the Way Companies Do Business Colin White BI Research July 17, 2014 Sponsor 2 Speakers Colin White President, BI Research Bill Franks Chief Analytics Officer, Teradata 3 How

More information

DLT AnalyticsStack. Powering big data, analytics and data science strategies for government agencies

DLT AnalyticsStack. Powering big data, analytics and data science strategies for government agencies DLT Stack Powering big data, analytics and data science strategies for government agencies Now, government agencies can have a scalable reference model for success with Big Data, Advanced and Data Science

More information

SAS & HADOOP ANALYTICS ON BIG DATA

SAS & HADOOP ANALYTICS ON BIG DATA SAS & HADOOP ANALYTICS ON BIG DATA WHY HADOOP? OPEN SOURCE MASSIVE SCALE FAST PROCESSING COMMODITY COMPUTING DATA REDUNDANCY DISTRIBUTED WHY HADOOP? Hadoop will soon become a replacement complement to:

More information

E-Guide THE EVOLUTION OF IOT ANALYTICS AND BIG DATA

E-Guide THE EVOLUTION OF IOT ANALYTICS AND BIG DATA E-Guide THE EVOLUTION OF IOT ANALYTICS AND BIG DATA E nterprises are already recognizing the value that lies in IoT data, but IoT analytics is still evolving and businesses have yet to see the full potential

More information

In-Memory Analytics: Get Faster, Better Insights from Big Data

In-Memory Analytics: Get Faster, Better Insights from Big Data Discussion Summary In-Memory Analytics: Get Faster, Better Insights from Big Data January 2015 Interview Featuring: Tapan Patel, SAS Institute, Inc. Introduction A successful analytics program should translate

More information

PI Integrator for Business Analytics

PI Integrator for Business Analytics PI Integrator for Business Analytics Big Data Analytics with the PI System Presented by LP Page-Morin, Sr. Systems Engineer 2 high volume, velocity, and/or variety information assets that demand cost-effective,

More information

The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure

The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure The Internet of Things Wind Turbine Predictive Analytics Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure Big Data and Tribo-Analytics Today we will see how Fluitec solved real-world challenges

More information

OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT

OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT WHITEPAPER OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT A top-tier global bank s end-of-day risk analysis jobs didn t complete in time for the next start of trading day. To solve

More information

Business In The Moment: From Reactive to Proactive. Timo Elliott, May 2012

Business In The Moment: From Reactive to Proactive. Timo Elliott, May 2012 Business In The Moment: From Reactive to Proactive Timo Elliott, May 2012 1 Legal Disclaimer The information in this presentation is confidential and proprietary to SAP and may not be disclosed without

More information

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics ADVANCED ANALYTICS & IOT ARCHITECTURES Presented by: Orion Gebremedhin Director of Technology, Data & Analytics Marc Lobree National Architect, Advanced Analytics EDW THE RIGHT TOOL FOR THE RIGHT WORKLOAD

More information

Cloud Based Analytics for SAP

Cloud Based Analytics for SAP Cloud Based Analytics for SAP Gary Patterson, Global Lead for Big Data About Virtustream A Dell Technologies Business 2,300+ employees 20+ data centers Major operations in 10 countries One of the fastest

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 3

E-guide Hadoop Big Data Platforms Buyer s Guide part 3 Big Data Platforms Buyer s Guide part 3 Your expert guide to big platforms enterprise MapReduce cloud-based Abie Reifer, DecisionWorx The Amazon Elastic MapReduce Web service offers a managed framework

More information

Oracle Enterprise Data Quality Product Roadmap and Statement of Direction. October 2016

Oracle Enterprise Data Quality Product Roadmap and Statement of Direction. October 2016 Oracle Enterprise Data Quality Product Roadmap and Statement of Direction October 2016 Oracle Confidential Internal/Restricted/Highly Restricted 2 Safe Harbor Statement The following is intended to outline

More information

Ray M Sugiarto MAPR Champion Indonesia

Ray M Sugiarto MAPR Champion Indonesia Ray M Sugiarto MAPR Champion Indonesia 0815 167 2882 2015 MapR Technologies 2015 MapR Technologies 1 Why Big Data? University of Texas: The median Fortune 1000 company could increase its revenue by more

More information

Boston Azure Cloud User Group. a journey of a thousand miles begins with a single step

Boston Azure Cloud User Group. a journey of a thousand miles begins with a single step Boston Azure Cloud User Group a journey of a thousand miles begins with a single step 3 Solution Architect at Slalom Boston Business Intelligence User Group Leader I am a bit shy but passionate. BI Architect

More information

Big Data Job Descriptions. Software Engineer - Algorithms

Big Data Job Descriptions. Software Engineer - Algorithms Big Data Job Descriptions Software Engineer - Algorithms This position is responsible for meeting the big data needs of our various products and businesses. Specifically, this position is responsible for

More information

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Pentaho 8.0 and Beyond Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our

More information

Architecture Overview for Data Analytics Deployments

Architecture Overview for Data Analytics Deployments Architecture Overview for Data Analytics Deployments Mahmoud Ghanem Sr. Systems Engineer GLOBAL SPONSORS Agenda The Big Picture Top Use Cases for Data Analytics Modern Architecture Concepts for Data Analytics

More information

Copyright - Diyotta, Inc. - All Rights Reserved. Page 2

Copyright - Diyotta, Inc. - All Rights Reserved. Page 2 Page 2 Page 3 Page 4 Page 5 Humanizing Analytics Analytic Solutions that Provide Powerful Insights about Today s Healthcare Consumer to Manage Risk and Enable Engagement and Activation Industry Alignment

More information

Big Data Management Best Practices for Data Lakes Philip Russom, Ph.D.

Big Data Management Best Practices for Data Lakes Philip Russom, Ph.D. Big Data Management Best Practices for Data Lakes Philip Russom, Ph.D. Senior Research Director, TDWI October 27, 2016 Sponsor 2 Speakers Philip Russom Senior Research Director for Data Management, TDWI

More information

Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics

Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics Key takeaways Analytic Insights Module for self-service analytics Automate data ingestion into Isilon Data Lake Three methods

More information

Pentaho 8.0 Overview. Pedro Alves

Pentaho 8.0 Overview. Pedro Alves Pentaho 8.0 Overview Pedro Alves Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our current intended product direction. It is provided for information

More information

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11 Top 5 Challenges for Hadoop MapReduce in the Enterprise Whitepaper - May 2011 http://platform.com/mapreduce 2 5/9/11 Table of Contents Introduction... 2 Current Market Conditions and Drivers. Customer

More information

The Rise of Engineering-Driven Analytics

The Rise of Engineering-Driven Analytics The Rise of Engineering-Driven Analytics Roy Lurie, Ph.D. Vice President Engineering, MATLAB Products 2015 The MathWorks, Inc. 1 The Rise of Engineering-Driven Analytics 2 The Rise of Engineering-Driven

More information

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden Leveraging Oracle Big Data Discovery to Master CERN s Data Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden Manuel Martin Marquez Intel IoT Ignition Lab Cloud and

More information

Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions

Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions Hector Garcia Tellado Program Manager Lead, Azure IoT Suite #IoTinActionMS #IoTinActionMS Agenda Customers using IoT today Microsoft

More information

Hortonworks Data Platform. Buyer s Guide

Hortonworks Data Platform. Buyer s Guide Hortonworks Data Platform Buyer s Guide Hortonworks Data Platform (HDP Completely Open and Versatile Hadoop Data Platform 2 2014 Hortonworks, Inc. All rights reserved. Hadoop and the Hadoop elephant logo

More information

Enterprise Architecture for Digital Business

Enterprise Architecture for Digital Business Enterprise Architecture for Digital Business Dave Chappelle Enterprise Architect Global EA Program October 26, 2015. Safe Harbor Statement The following is intended to outline our general product direction.

More information

SAP Business One OnDemand. SAP Business One OnDemand Solution Overview

SAP Business One OnDemand. SAP Business One OnDemand Solution Overview SAP Business One OnDemand SAP Business One OnDemand Solution Overview SAP Business One OnDemand Table of Contents 4 Executive Summary Introduction SAP Business One Today 8 A Technical Overview: SAP Business

More information

Berkeley Data Analytics Stack (BDAS) Overview

Berkeley Data Analytics Stack (BDAS) Overview Berkeley Analytics Stack (BDAS) Overview Ion Stoica UC Berkeley UC BERKELEY What is Big used For? Reports, e.g., - Track business processes, transactions Diagnosis, e.g., - Why is user engagement dropping?

More information

The IoT Solutions Space: Edge-Computing IoT architecture, the FAR EDGE Project John Professor Athens Information

The IoT Solutions Space: Edge-Computing IoT architecture, the FAR EDGE Project John Professor Athens Information The IoT Solutions Space: Edge-Computing IoT architecture, the FAR EDGE Project John Soldatos (jsol@ait.gr, @jsoldatos), Professor Athens Information Technology Contributor: Solufy Blog (http://www.solufy.com/blog)

More information

Oracle Big Data Cloud Service

Oracle Big Data Cloud Service Oracle Big Data Cloud Service Delivering Hadoop, Spark and Data Science with Oracle Security and Cloud Simplicity Oracle Big Data Cloud Service is an automated service that provides a highpowered environment

More information

Let s distribute.. NOW: Modern Data Platform as Basis for Transformation and new Services

Let s distribute.. NOW: Modern Data Platform as Basis for Transformation and new Services Let s distribute.. NOW: Modern Data Platform as Basis for Transformation and new Services Matthias Kupczak, Michael Probst; SAP June, 2017 Agenda 09:30-09:55 Coffee 09:55-10:00 Welcome Message T-Systems

More information

Bringing Big Data to Life: Overcoming The Challenges of Legacy Data in Hadoop

Bringing Big Data to Life: Overcoming The Challenges of Legacy Data in Hadoop 0101 001001010110100 010101000101010110100 1000101010001000101011010 00101010001010110100100010101 0001001010010101001000101010001 010101101001000101010001001010010 010101101 000101010001010 1011010 0100010101000

More information

Hortonworks Powering the Future of Data

Hortonworks Powering the Future of Data Hortonworks Powering the Future of Simon Gregory Vice President Eastern Europe, Middle East & Africa 1 Hortonworks Inc. 2011 2016. All Rights Reserved MASTER THE VALUE OF DATA EVERY BUSINESS IS A DATA

More information

Integrating MATLAB Analytics into Enterprise Applications

Integrating MATLAB Analytics into Enterprise Applications Integrating MATLAB Analytics into Enterprise Applications David Willingham 2015 The MathWorks, Inc. 1 Run this link. http://bit.ly/matlabapp 2 Key Takeaways 1. What is Enterprise Integration 2. What is

More information

ActualTests.C Q&A C Foundations of IBM Big Data & Analytics Architecture V1

ActualTests.C Q&A C Foundations of IBM Big Data & Analytics Architecture V1 ActualTests.C2030-136.40Q&A Number: C2030-136 Passing Score: 800 Time Limit: 120 min File Version: 4.8 http://www.gratisexam.com/ C2030-136 Foundations of IBM Big Data & Analytics Architecture V1 Hello,

More information

Konica Minolta Business Innovation Center

Konica Minolta Business Innovation Center Konica Minolta Business Innovation Center Advance Technology/Big Data Lab May 2016 2 2 3 4 4 Konica Minolta BIC Technology and Research Initiatives Data Science Program Technology Trials (Technology partner

More information

Microsoft FastTrack For Azure Service Level Description

Microsoft FastTrack For Azure Service Level Description ef Microsoft FastTrack For Azure Service Level Description 2017 Microsoft. All rights reserved. 1 Contents Microsoft FastTrack for Azure... 3 Eligible Solutions... 3 FastTrack for Azure Process Overview...

More information

Smart Mortgage Lending

Smart Mortgage Lending Whitepaper Smart Mortgage Lending Advanced Business Intelligence for the Residential Mortgage Industry Contents 1 Introduction 1 Analytics Today 2 From Data Warehouse to Data Lake 2 Machine Learning and

More information

BIG DATA and DATA SCIENCE

BIG DATA and DATA SCIENCE Integrated Program In BIG DATA and DATA SCIENCE CONTINUING STUDIES Table of Contents About the Course...03 Key Features of Integrated Program in Big Data and Data Science...04 Learning Path...05 Key Learning

More information

Sr. Sergio Rodríguez de Guzmán CTO PUE

Sr. Sergio Rodríguez de Guzmán CTO PUE PRODUCT LATEST NEWS Sr. Sergio Rodríguez de Guzmán CTO PUE www.pue.es Hadoop & Why Cloudera Sergio Rodríguez Systems Engineer sergio@pue.es 3 Industry-Leading Consulting and Training PUE is the first Spanish

More information

Hadoop Integration Deep Dive

Hadoop Integration Deep Dive Hadoop Integration Deep Dive Piyush Chaudhary Spectrum Scale BD&A Architect 1 Agenda Analytics Market overview Spectrum Scale Analytics strategy Spectrum Scale Hadoop Integration A tale of two connectors

More information

Your Big Data to Big Data tools using the family of PI Integrators

Your Big Data to Big Data tools using the family of PI Integrators 1 Your Big Data to Big Data tools using the family of PI Integrators Presented by Martin Bryant Field Service Engineer @osisoft PI Integrators PI Integrator for Business Analytics PI Integrator for Business

More information

THE MAGIC OF DATA INTEGRATION IN THE ENTERPRISE WITH TIPS AND TRICKS

THE MAGIC OF DATA INTEGRATION IN THE ENTERPRISE WITH TIPS AND TRICKS THE MAGIC OF DATA INTEGRATION IN THE ENTERPRISE WITH TIPS AND TRICKS DATA HOLDS ALL THE POTENTIAL TO HELP BUSINESSES WIN CUSTOMERS INCREASE REVENUE GAIN COMPETITIVE ADVANTAGE STREAMLINE OPERATIONS BUT

More information

Digitalisieren Sie Ihr Unternehmen mit dem Internet der Dinge Michael Epprecht Microsoft GBB IoT

Digitalisieren Sie Ihr Unternehmen mit dem Internet der Dinge Michael Epprecht Microsoft GBB IoT Digicomp Microsoft Evolution Day 2015 1 Digitalisieren Sie Ihr Unternehmen mit dem Internet der Dinge Michael Epprecht Microsoft GBB IoT michael.epprecht@microsoft.com @fastflame Partner: Becoming a digital

More information

The Alpine Data Platform

The Alpine Data Platform The Alpine Data Platform TABLE OF CONTENTS ABOUT ALPINE.... 2 ALPINE PRODUCT OVERVIEW... 3 PRODUCT ARCHITECTURE.... 5 SYSTEM REQUIREMENTS.... 6 ABOUT ALPINE DATA ADVANCED ANALYTICS FOR THE ENTERPRISE Alpine

More information

LEVERAGING DATA ANALYTICS TO GAIN COMPETITIVE ADVANTAGE IN YOUR INDUSTRY

LEVERAGING DATA ANALYTICS TO GAIN COMPETITIVE ADVANTAGE IN YOUR INDUSTRY LEVERAGING DATA ANALYTICS TO GAIN COMPETITIVE ADVANTAGE IN YOUR INDUSTRY Unlock the value of your data with analytics solutions from Dell EMC ABSTRACT To unlock the value of their data, organizations around

More information

Architected Blended Big Data With Pentaho. A Solution Brief

Architected Blended Big Data With Pentaho. A Solution Brief Architected Blended Big Data With Pentaho A Solution Brief Introduction The value of big data is well recognized, with implementations across every size and type of business today. However, the most powerful

More information

FORIS Business Intelligence. Innovative Analytics

FORIS Business Intelligence. Innovative Analytics Innovative Analytics FORIS BI V5 is the latest version of SITRONICS Business intelligence platform developed as a proactive, real time business Intelligence suite that enables company management a complete

More information

Operational Intelligence in Industrial Environments

Operational Intelligence in Industrial Environments Copyright 2014 Splunk Inc. Operational Intelligence in Industrial Environments Brian Gilmore, Splunk bgilmore@splunk.com @brianmgilmore Safe Harbor Statement During the course of this presentation, we

More information

Data Lake or Data Swamp?

Data Lake or Data Swamp? Data Lake or Data Swamp? Keeping the Data Lake from Becoming a Data Swamp. 1 INTRODUCTION Increasingly, businesses of all kinds are beginning to see their data as an important asset that can help make

More information

Application Integrator Automate Any Application

Application Integrator Automate Any Application Application Integrator Automate Any Application BMC Control-M by applications BMC Control-M by platforms ERP Business Intelligence Data Integration / ETL OS Platform SAP Oracle ebusiness Suite PeopleSoft

More information

PCM Update. Jérémie Brunet Solution Management Dave Parsons Engineering

PCM Update. Jérémie Brunet Solution Management Dave Parsons Engineering PCM Update Jérémie Brunet Solution Management Dave Parsons Engineering Disclaimer The information in this document is confidential and proprietary to SAP and may not be disclosed without the permission

More information