By: Shrikant Gawande (Cloudera Certified)


1 By: Shrikant Gawande (Cloudera Certified)

2 What is Big Data? For every 30 minutes of flying time, an airline jet collects 10 terabytes of sensor data. NYSE generates about one terabyte of new trade data per day, which is used for stock trading analytics to determine trends for optimal trades.

3 Facebook Example. Facebook users spend 10.5 billion minutes (almost 20,000 years) online on the social network. An average of 3.2 billion likes and comments are posted on Facebook every day.

4 Twitter Example. Twitter has over 500 million registered users. The USA accounts for 27.4 percent of all Twitter users, well ahead of Brazil, Japan, the UK and Indonesia. 79% of US Twitter users are more likely to recommend brands they follow. 67% of US Twitter users are more likely to buy from brands they follow. 57% of all companies that use social media for business use Twitter.

5 Hadoop Is Being Used Across Industries. Figure: industries using Hadoop (Source: Karmasphere).

6 Why Learn Big Data?

7 What Big Companies Have To Say..

8 Data Volume Is Growing Exponentially. Estimated global data volume: 1.8 ZB in 2011; 7.9 ZB in 2015. The world's information doubles every two years. Over the next 10 years: the number of servers worldwide will grow by 10x; the amount of information managed by enterprise data centers will grow by 50x; the number of files enterprise data centers handle will grow by 75x. Source: based on the 2011 IDC Digital Universe Study.
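
As a rough check on the figures in this slide, the sketch below projects the 2011 estimate forward under the stated "doubles every two years" rule. The function name `projected_volume_zb` is illustrative and not from the slides, and pure exponential doubling is an assumption.

```python
def projected_volume_zb(base_zb: float, base_year: int, year: int,
                        doubling_period_years: float = 2.0) -> float:
    """Project data volume, assuming it doubles every `doubling_period_years`."""
    elapsed_years = year - base_year
    return base_zb * 2 ** (elapsed_years / doubling_period_years)


# Start from the 2011 estimate of 1.8 ZB quoted on the slide.
volume_2015 = projected_volume_zb(1.8, 2011, 2015)
print(f"Projected 2015 volume: {volume_2015:.1f} ZB")
```

Two doublings between 2011 and 2015 give about 7.2 ZB, slightly below the 7.9 ZB estimate on the slide, so the quoted growth is a little faster than a strict two-year doubling.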

9 IBM's Definition. Big Data characteristics: a collection of large and complex data sets that are difficult to process using common database management tools or traditional data processing applications. Big Data is an amount of data beyond the storage and processing capabilities of a single physical machine. Data that has extra-large volume, comes from a variety of sources in a variety of formats, and arrives at great velocity is normally referred to as Big Data.

10 It's More Unstructured Data than Structured Data

11 A Traditional Approach Under Pressure

12 Why Big Data? Enterprise data: ERP and CRM data (a few TBs). Data we have been adding in the last 3-4 years (hundreds of TBs): customer experience (click streams); online campaigns (banner ads, capturing every click); user-entered data (search, in-product search); social media, to understand general sentiment.

13 Common Business Applications (industry: use case (types of data))
Financial Services: New Account Risk Screens (Text, Server Logs); Trading Risk (Server Logs); Insurance Underwriting (Geographic, Sensor, Text)
Telecom: Call Detail Records (CDR) (Machine, Geographic); Infrastructure Investment (Machine, Server Logs); Real-Time Bandwidth Allocation (Server Logs, Text, Social)
Retail: 360 Degree View of Customer (Clickstream, Text); Localized, Personal Promotion (Geographic); Website Optimization (Clickstream)
Manufacturing: Supply Chain and Logistics (Sensor); Assembly Line Quality Assurance (Sensor); Crowdsourced Quality Assurance (Social)
Healthcare: Use Genomic Data in Medical Trials (Structured); Monitor Patient Vitals in Real Time (Sensor)
Pharmaceuticals: Recruit and Retain Patients for Drug Trials (Social, Clickstream); Improve Prescription Adherence (Social, Unstructured, Geographic)
Oil and Gas: Unify Exploration and Production Data (Sensor, Unstructured, Geographic); Monitor Rig Safety in Real Time (Sensor, Unstructured)

14 How can we find products that customers are interested in BUT DON'T BUY?

15 Leveraging ALL Business Data How to Extract Insights from 9TBs of Web Logs? How do you make sense of this?

16 Leveraging ALL Business Data. How to Extract Insights from 9TBs of Web Logs? What did users do when they came to our web site? Which products did they view? Which products were seen but not purchased, and why? What new offerings can be based on past data? In the first line, a user has viewed a product with a particular ID.

17 Leveraging ALL Business Data. How to Extract Insights from 9TBs of Web Logs? (Contd.) The visitor views a 2nd product. We want to do this not just for one customer but for all customers.
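
The question these slides pose (which products were viewed but not purchased, across all customers) can be sketched on a toy scale as follows. The record layout `(visitor_id, action, product_id)` and the sample events are assumptions for illustration; the slides do not show the real log format, and actual web logs would need parsing first.

```python
from collections import defaultdict

# Hypothetical, simplified click-stream records: (visitor_id, action, product_id).
events = [
    ("v1", "view", "p100"),
    ("v1", "view", "p200"),
    ("v1", "purchase", "p200"),
    ("v2", "view", "p100"),
    ("v2", "view", "p300"),
]


def viewed_not_purchased(events):
    """Return {product_id: number of visitors who viewed it but did not buy it}."""
    viewed = defaultdict(set)
    purchased = defaultdict(set)
    for visitor, action, product in events:
        if action == "view":
            viewed[product].add(visitor)
        elif action == "purchase":
            purchased[product].add(visitor)
    # Keep only products with at least one viewer who never purchased.
    return {
        product: len(visitors - purchased[product])
        for product, visitors in viewed.items()
        if visitors - purchased[product]
    }


result = viewed_not_purchased(events)
print(result)
```

On a single machine this works only for small inputs; at the 9 TB scale mentioned on the slide, the same grouping logic would have to be distributed across a cluster.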

18 Hidden Treasure. Insight into data can provide business advantage. Some key early indicators can mean fortunes to a business. More precise analysis with more data. New offerings to the customer.

19 Limitations of Existing Data Analytics Architecture

20 Solution: A Combined Storage Computer Layer

21 Differentiating factors

22 Some of the Hadoop Users

23 Why DFS?

24 What is Hadoop? Apache Hadoop is a framework that allows for the distributed processing of large data sets across clusters of commodity computers using a simple programming model. It is an open-source data management framework with scale-out storage and distributed processing.

25 Hadoop Key Characteristics

26 Hadoop History

27 Hadoop Eco-System

28 Hadoop Core Components. HDFS, the Hadoop Distributed File System (storage): distributed across nodes; natively redundant; the NameNode tracks locations; self-healing, high-bandwidth clustered storage. MapReduce (processing): splits a task across processors near the data and assembles the results.
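
To make the "split a task, then assemble results" idea concrete, here is a minimal single-process sketch of the MapReduce pattern, using word counting as the task. It is plain Python rather than the Hadoop API; the function names `map_phase`, `shuffle`, and `reduce_phase` and the sample input are illustrative assumptions.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, as a framework would between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values emitted for one key into a single result."""
    return key, sum(values)


# Each string stands in for an input split that would live on a different node.
splits = ["big data big insight", "data near the processors"]
mapped = [pair for line in splits for pair in map_phase(line)]
counts = dict(reduce_phase(key, values) for key, values in shuffle(mapped).items())
print(counts)
```

In a real cluster the map calls would run on the nodes that hold each split, and only the grouped intermediate pairs would travel over the network.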

29 Hadoop Core Components (contd.)

30 HDFS Architecture

31 Main Components of HDFS. NameNode: the master of the system; it maintains and manages the blocks that are present on the DataNodes. DataNodes: slaves that are deployed on each machine and provide the actual storage; they are responsible for serving read and write requests from clients.

32 NameNode and Datanode

33 NameNode Metadata. Metadata in memory: the entire metadata is held in main memory, with no demand paging of file system metadata. Types of metadata: list of files; list of blocks for each file; list of DataNodes for each block; file attributes, e.g. access time and replication factor. A transaction log records file creations, file deletions, etc.
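
The kinds of metadata listed on this slide can be pictured as a few in-memory maps plus an append-only log. The sketch below is a deliberately simplified model, not the actual HDFS implementation; the class names `FileEntry` and `NameNodeMetadata`, the block IDs, and the file path are hypothetical, and the host names H1 to H4 are borrowed from the next slide.

```python
from dataclasses import dataclass, field


@dataclass
class FileEntry:
    replication_factor: int
    access_time: float
    blocks: list = field(default_factory=list)  # ordered block IDs for the file


@dataclass
class NameNodeMetadata:
    files: dict = field(default_factory=dict)            # path -> FileEntry
    block_locations: dict = field(default_factory=dict)  # block ID -> DataNode names
    transaction_log: list = field(default_factory=list)  # file creations and deletions

    def create_file(self, path, replication_factor, now):
        self.files[path] = FileEntry(replication_factor, now)
        self.transaction_log.append(("create", path))

    def add_block(self, path, block_id, datanodes):
        self.files[path].blocks.append(block_id)
        self.block_locations[block_id] = list(datanodes)

    def delete_file(self, path):
        entry = self.files.pop(path)
        for block_id in entry.blocks:
            self.block_locations.pop(block_id, None)
        self.transaction_log.append(("delete", path))

    def locate(self, path):
        """Return (block ID, DataNodes) pairs a client would need to read the file."""
        return [(b, self.block_locations[b]) for b in self.files[path].blocks]


meta = NameNodeMetadata()
meta.create_file("/logs/web.log", replication_factor=3, now=0.0)
meta.add_block("/logs/web.log", "blk_1", ["H1", "H2", "H3"])
meta.add_block("/logs/web.log", "blk_2", ["H2", "H3", "H4"])
locations = meta.locate("/logs/web.log")
print(locations)
```

Because every lookup is a dictionary access in memory, locating a file's blocks is fast, which is the point of keeping the whole metadata set in main memory.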

34 Storage: NameNode and DataNodes. Processing: JobTracker and TaskTrackers. Hosts: H1, H2, H3, H4.

35 Poll - 01

36 Poll - 02

37 Poll - 03

38 Poll - 04

39 Poll - 05

40

41 Hadoop Courses and Their Fees Across Major Training Institutes

42 Hadoop Course fee at Cloudera Cloudera Hadoop Training :

43 Hadoop Course fee at HortonWorks and Edureka $ 2,795 = Rs. 1,73,290

44 My Contact Details:

45 Thank You


More information

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Roger Ding Cloudera February 3rd, 2018 1 Agenda Hadoop History Introduction to Apache Hadoop

More information

Measure Consume. Store. Data Governance

Measure Consume. Store. Data Governance Collect Process Manage Measure Consume Store Data Governance Big Data Sources (Raw, Unstructured) Azure Machine Learning Business Insights Sensors Devices Intelligent Systems Service Hadoop on Windows

More information

Managing explosion of data. Cloudera, Inc. All rights reserved.

Managing explosion of data. Cloudera, Inc. All rights reserved. Managing explosion of data 1 Customer experience expectations are converging on the brand, not channel Consistent across all channels and lines of business Contextualized to present location and circumstances

More information

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect 2005 Concert de Coldplay 2014 Concert de Coldplay 90% of the world s data has been created over the last two years alone 1 1. Source

More information

Jisc. Jisc Group Structure. Jisc Services Ltd. Jisc Group Sales. Research & Education. Private & Public Enterprise. Association of Colleges

Jisc. Jisc Group Structure. Jisc Services Ltd. Jisc Group Sales. Research & Education. Private & Public Enterprise. Association of Colleges October 16 1 Jisc Group Structure Universities UK Guild HE Association of Colleges Institutional Members Funded by HEFCE (BIS), DFE, DCLG & Institutional Subscriptions Jisc Board of Trustees Board = Jisc

More information

BIG DATA TRANSFORMS BUSINESS

BIG DATA TRANSFORMS BUSINESS BIG DATA TRANSFORMS BUSINESS Johannes Fellner November 7 th, 2012 1 IN 2000 THE WORLD GENERATED TWO EXABYTES OF NEW INFORMATION Sources: How Much Information? Peter Lyman and Hal Varian, UC Berkeley,.

More information

OVERVIEW MAPR: THE CONVERGED PLATFORM FOR RETAIL

OVERVIEW MAPR: THE CONVERGED PLATFORM FOR RETAIL OVERVIEW MAPR: THE CONVERGED PLATFORM FOR RETAIL 1 CREATING LASTING EXPERIENCES FOR SHOPPERS The retail market is being redefined by changing shopper behaviors. As digital technologies spread and the shopper

More information

HP SummerSchool TechTalks Kenneth Donau Presale Technical Consulting, HP SW

HP SummerSchool TechTalks Kenneth Donau Presale Technical Consulting, HP SW HP SummerSchool TechTalks 2013 Kenneth Donau Presale Technical Consulting, HP SW Copyright Copyright 2013 2013 Hewlett-Packard Development Development Company, Company, L.P. The L.P. information The information

More information

Big Data Anwendungsfälle aus dem Bereich der digitalen Medien

Big Data Anwendungsfälle aus dem Bereich der digitalen Medien Presented by Kate Tickner Date 12 th October 2012 Big Data Anwendungsfälle aus dem Bereich der digitalen Medien Using Big Data and Smarter Analytics to Increase Consumer Engagement Dramatic forces affecting

More information

DATA SCIENCE: HYPE AND REALITY PATRICK HALL

DATA SCIENCE: HYPE AND REALITY PATRICK HALL DATA SCIENCE: HYPE AND REALITY PATRICK HALL About me SAS Enterprise Miner, 2012 Cloudera Data Scientist, 2014 Do you use Kolmogorov Smirnov often? Statistician No, I mix my martinis with gin. Data Scientist

More information

Charter Global. Digital Solutions and Consulting Services. Digital Solutions. QA Testing

Charter Global. Digital Solutions and Consulting Services. Digital Solutions. QA Testing Charter Global Digital Solutions and Consulting Services IT Strategy and Assessment Digital Solutions Big Data Mobility Application Development QA Testing Infrastructure Management Services Professional

More information

Datasheet FUJITSU Integrated System PRIMEFLEX for Hadoop

Datasheet FUJITSU Integrated System PRIMEFLEX for Hadoop Datasheet FUJITSU Integrated System PRIMEFLEX for Hadoop FUJITSU Integrated System PRIMEFLEX for Hadoop is a powerful and scalable platform analyzing big data volumes at high velocity FUJITSU Integrated

More information

SSRG International Journal of Civil Engineering ( SSRG IJCE ) Volume 4 Issue 10 October 2017

SSRG International Journal of Civil Engineering ( SSRG IJCE ) Volume 4 Issue 10 October 2017 Big Data Analytics in Civil Engineering: The Case of China YouseokKang 1, JiayanYu 2, JiaruiChang 3 1 Hanyang University (Korea), 2 Johns Hopkins University (USA), 3 Dalian No. 24 (China) Abstract China

More information

A REVIEW ON HADOOP ARCHITECTURE FOR BIG DATA

A REVIEW ON HADOOP ARCHITECTURE FOR BIG DATA International Journal of Research in Engineering, Technology and Science, Volume VI, Special Issue, July 2016 www.ijrets.com, editor@ijrets.com, ISSN 2454-1915 A REVIEW ON HADOOP ARCHITECTURE FOR BIG DATA

More information

The Intersection of Big Data and DB2

The Intersection of Big Data and DB2 The Intersection of Big Data and DB2 May 20, 2014 Mike McCarthy, IBM Big Data Channels Development mmccart1@us.ibm.com Agenda What is Big Data? Concepts Characteristics What is Hadoop Relational vs Hadoop

More information

SOLUTION SHEET End to End Data Flow Management and Streaming Analytics Platform

SOLUTION SHEET End to End Data Flow Management and Streaming Analytics Platform SOLUTION SHEET End to End Data Flow Management and Streaming Analytics Platform CREATE STREAMING ANALYTICS APPLICATIONS IN MINUTES WITHOUT WRITING CODE The increasing growth of data, especially data-in-motion,

More information

Apache Hadoop in the Datacenter and Cloud

Apache Hadoop in the Datacenter and Cloud Apache Hadoop in the Datacenter and Cloud The Shift to the Connected Data Architecture Digital Transformation fueled by Big Data Analytics and IoT ACTIONABLE INTELLIGENCE Cloud and Data Center IDMS Relational

More information

YASHAJIT SAHA & ABHISHEK SHARMA, SUBJECT MATTER EXPERTS, RESEARCH & ANALYTICS ADVANCED ANALYTICS: A REMEDY FOR COMMERCIAL SUCCESS IN PHARMA.

YASHAJIT SAHA & ABHISHEK SHARMA, SUBJECT MATTER EXPERTS, RESEARCH & ANALYTICS ADVANCED ANALYTICS: A REMEDY FOR COMMERCIAL SUCCESS IN PHARMA. YASHAJIT SAHA & ABHISHEK SHARMA, SUBJECT MATTER EXPERTS, RESEARCH & ANALYTICS ADVANCED ANALYTICS: A REMEDY FOR COMMERCIAL SUCCESS IN PHARMA wns wns ADVANCED ANALYTICS: A REMEDY FOR COMMERCIAL SUCCESS IN

More information

SAS & HADOOP ANALYTICS ON BIG DATA

SAS & HADOOP ANALYTICS ON BIG DATA SAS & HADOOP ANALYTICS ON BIG DATA WHY HADOOP? OPEN SOURCE MASSIVE SCALE FAST PROCESSING COMMODITY COMPUTING DATA REDUNDANCY DISTRIBUTED WHY HADOOP? Hadoop will soon become a replacement complement to:

More information

SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform

SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform SOLUTION SHEET Hortonworks DataFlow (HDF ) End-to-end data flow management and streaming analytics platform CREATE STREAMING ANALYTICS APPLICATIONS IN MINUTES WITHOUT WRITING CODE The increasing growth

More information

Business Intelligence, 4e (Sharda/Delen/Turban) Chapter 1 An Overview of Business Intelligence, Analytics, and Data Science

Business Intelligence, 4e (Sharda/Delen/Turban) Chapter 1 An Overview of Business Intelligence, Analytics, and Data Science Business Intelligence, 4e (Sharda/Delen/Turban) Chapter 1 An Overview of Business Intelligence, Analytics, and Data Science 1) Computerized support is only used for organizational decisions that are responses

More information

The Rise of Engineering-Driven Analytics. Richard Rovner VP Marketing

The Rise of Engineering-Driven Analytics. Richard Rovner VP Marketing The Rise of Engineering-Driven Analytics Richard Rovner VP Marketing MathWorks @RichardRovner The Rise of Engineering-Driven Analytics The Rise of Engineering-Driven Analytics Limited users, scope & technology

More information