IBM BigInsights - Hadoop jako rozwiązanie korporacyjne. Tomasz Zawadzki Dyrektor Zarządzający Atom-tech

Size: px
Start display at page:

Download "IBM BigInsights - Hadoop jako rozwiązanie korporacyjne. Tomasz Zawadzki Dyrektor Zarządzający Atom-tech"

Transcription

1 IBM BigInsights - Hadoop jako rozwiązanie korporacyjne Tomasz Zawadzki Dyrektor Zarządzający Atom-tech

2 IBM BigInsights - Hadoop jako rozwiązanie korporacyjne Tomasz Zawadzki Dyrektor Zarządzający Atom-tech

3 Information is at the Center of a New Wave of Opportunity 44x as much Data and Content Over Coming Decade zettabytes And Organizations Need Deeper Insights 1 in 3 Business leaders frequently make decisions based on information they don t trust, or don t have Business leaders say they don t have access to the information 1 in 2 they need to do their jobs ,000 petabytes 80% Of world s data is unstructured 83% of CIOs cited Business intelligence and analytics as part of their visionary plans to enhance competitiveness 60% of CEOs need to do a better job capturing and understanding information rapidly in order to make swift business decisions

4 Big Data Presents Big Opportunities Extract insight from a high volume, variety and velocity of data in a timely and cost-effective manner Variety: Velocity: Volume: Manage and benefit from diverse data types and data structures Analyze streaming data and large volumes of persistent data Scale from terabytes to zettabytes

5 Customer Engagements Common use cases Analyze machine data (logs, sensors, etc.) to reduce operational risk, predict / identify / correct problems, improve customer service, etc. Analyze social media data and customer behavior for to improve customer retention, brand management, lead generation, operations, etc.... Text, Blog, Weblog Click streams Log & transactions Technology mandates Extract business insight from large volumes of raw data (often outside operational systems) Integrate with existing software Ready for enterprise use Consumer Insight Multi-channel sales Next Gen Fraud Models Text Analytics Biological Sequences Operational system & streams data sources New Business Development Statistical Model Building

6 IBM Big Data Platform Strategy Integrate and manage the full variety, velocity and volume of Big Data Apply advanced analytics to information in its native form Visualize all available data for ad-hoc analysis Development environment for building new analytic applications Support workload optimization and scheduling Provide for security and governance Integrate with enterprise software Analytic Applications BI / Reporting Exploration / Visualization Functional App IBM Big Data Platform Visualization & Discovery Hadoop System Accelerators Industry App Application Development Stream Computing Predictive Analytics Information Integration & Governance BI Content / Reporting Analytics Systems Management Data Warehouse

7 Enterprise class Platform for volume, variety, velocity Enhanced Hadoop foundation Analytics Text analytics & tooling Application accelerators Usability Web console Spreadsheet-style tool Ready-made apps Enterprise Class Storage, security, cluster management Integration Connectivity to Netezza, DB2, JDBC databases, etc InfoSphere BigInsights Apache Hadoop Basic Edition Free download Integrated install Online InfoCenter BigData Univ. Enterprise Edition Licensed Application accelerators Pre-built applications Text analytics Spreadsheet-style tool RDBMS, warehouse connectivity Administrative tools, security Eclipse development tools Performance enhancements.... Breadth of capabilities

8 Function BigInsights Content Version Basic Edition Integrated Install Inc Inc Hadoop (including common utilities, HDFS, MapReduce framework) Inc Inc Jaql (programming / query language) Inc Inc Pig (programming / query language) Inc Inc Flume (data collection/aggregation) Inc Inc Hive (data summarization/querying) Inc Inc Lucene (text search)* Inc Inc Zookeeper (process coordination) Inc Inc Avro (data serialization) Inc Inc HBase (real time read/write) Inc Inc HCatalog (table and storage management service) Inc Inc Sqoop (RDBMS bulk data transfer) Inc Inc Oozie (workflow/ job orchestration) Inc Inc Online documentation Inc Inc Integration with JDBC sources through general-purpose Jaql module Inc Inc Integration with DB2 (sample functions to submit jobs, read data) Inc Inc Enterprise Edition

9 Function BigInsights Content (cont d) Basic Edition Integration with R (Jaql module to invoke R statistical capabilities from BigInsights) n/a Inc Integration with Netezza, DB2 LUW with DPF from Jaql n/a Inc LDAP authentication, Guardium support, etc. n/a Inc Integrated Web Console n/a Inc Business process accelerators (social data, machine data analytics) n/a Inc Platform performance enhancements (Adaptive MapReduce, large scale indexing, efficient processing of compressed text files, flexible job scheduler, etc.) Text analytics n/a Inc Eclipse tools for text analytic development, Jaql, Hive, Java n/a Inc Applications for data import/export, Web crawl, machine learning, etc. n/a Inc Web-based application catalog n/a Inc Spreadsheet-like analytical tool n/a Inc IBM support Opt Inc Streams, Data Explorer, Cognos BI (limited use licenses) n/a Inc Unlimited storage n/a Inc n/a Enterprise Edition Inc

10 Support for other Hadoop Distributions Installation option for BigInsights Enterprise Edition Support Hadoop distributions from third parties, beginning with Cloudera Configures key BigInsights technologies to work with Cloudera foundation Easy way for Cloudera customers to leverage leading BigInsights capabilities Text analytics BigSheets (spreadsheet-style tool for business analysts) Eclipse plug-ins for rapid application development Ready-made sample applications for Web crawl, social media data collection, data import/export, ad hoc query, etc. Administrative tooling

11 BigInsights: Value Beyond Open Source Open source components Enterprise Capabilities Visualization & Exploration Development Tools Advanced Engines Connectors Workload Optimization Administration & Security IBM-certified Apache Hadoop or or Key differentiators Built-in analytics Text engine, annotators, Eclipse tooling Interface to project R (statistical platform) Enterprise software integration Spreadsheet-style analysis Integrated installation of supported open source and other components Web Console for admin and application access Platform enrichment: additional security, performance features,... World-class support Full open source compatibility Business benefits Quicker time-to-value due to IBM technology and support Reduced operational risk Enhanced business knowledge with flexible analytical platform Leverages and complements existing software

12 BigInsights and the data warehouse Big Data analytic applications Traditional analytic tools Data warehouse BigInsights Filter Transform Aggregate

13 BigInsights and the data warehouse Traditional analytic tools Big Data analytic applications BigInsights Data Warehouse Query-ready archive for cold warehouse data

14 About the BigInsights Platform Flexible, enterprise-class support for processing large volumes of data Based on Google s MapReduce technology Inspired by Apache Hadoop; compatible with its ecosystem and distribution Well-suited to batch-oriented, read-intensive applications Supports wide variety of data Enables applications to work with thousands of nodes and petabytes of data in a highly parallel, cost effective manner CPU + disks = node Nodes can be combined into clusters New nodes can be added as needed without changing Data formats How data is loaded How jobs are written

15 "Map" step: Input split into pieces The MapReduce Programming Model Worker nodes process individual pieces in parallel (under global control of the Job Tracker node) Each worker node stores its result in its local file system where a reducer is able to access it "Reduce" step: Data is aggregated ( reduced from the map steps) by worker nodes (under control of the Job Tracker) Multiple reduce tasks can parallelize the aggregation

16 Logical MapReduce Example: Word Count map(string key, String value): // key: document name // value: document contents for each word w in value: EmitIntermediate(w, "1"); reduce(string key, Iterator values): // key: a word // values: a list of counts int result = 0; for each v in values: result += ParseInt(v); Emit(AsString(result)); Content of Input Documents Hello World Bye World Hello IBM Map 1 emits: < Hello, 1> < World, 1> < Bye, 1> < World, 1> Map 2 emits: < Hello, 1> < IBM, 1> Reduce (final output): < Bye, 1> < IBM, 1> < Hello, 2> < World, 2>

17 Input Documents MapReduce Processing Hello World Bye World Hello IBM Map 1 emits: < Hello, 1> < World, 1> < Bye, 1> < World, 1> Map 2 emits: < Hello, 1> < IBM, 1> Reduce (final output): < Bye, 1> < IBM, 1> < Hello, 2> < World, 2>

18 So What Does This Result In? Easy To Scale Fault Tolerant and Self-Healing Data Agnostic Extremely Flexible

19 Seamless process for single node and cluster environments Integrated installation of all selected components Post-install validation of IBM and open source components Get up and running quickly! Web Installation Tool No need to iteratively download, configure, and test multiple open source projects and pre-requisite software.

20 Manage BigInsights Inspect /monitor system health Add / drop nodes Start / stop services Run / monitor jobs (applications) Explore / modify file system Create custom dashboards... Launch applications Spreadsheet-like analysis tool Pre-built applications (IBM supplied or user developed) Publish applications Monitor cluster, applications, data, etc. Web Console

21 Web-based analysis and visualization Spreadsheet-style Analysis Spreadsheet-like interface Define and manage long running data collection jobs Analyze content of the text on the pages that have been retrieved

22 Big Data Application Ecosystem Data integration scenario: Pre-defined work flows simplify loading data from various sources Work flows can be configured, deployed, executed and scheduled Application scenarios (web log, , social media, ): Samples provide starting point, speed time to value Big Data Web Console Publish Eclipse Development tooling: Text analytics MapReduce Query languages... App Development App library MapReduce, Text Analytics Query App Development Code application program, and generate associated App Deploy Apps to Enterprise Manager

23 Quick start sample applications 20+ software assets based on common customer needs Useful for starting point for various applications Accessible through Web console Available assets Data movement From relational DBMS, files, REST-based sources To relational DBMS, files Web crawler, social media data collectors, etc. Ad hoc queries for Jaql, Hive, Pig Machine learning algorithms and data processing Data sampling and subsetting TeraGen-TeraSort, WordCount sample applications

24 Running Applications from the Web Console Import & Export Data Database & Files Web and Social Analyze and Query Predictive Analytics Text Analytics SQL/Hive, Jaql, Pig, HBase

25 Build a Big Data Program MapReduce example Eclipse tools For Jaql, Hive, Pig Java MapReduce, BigSheets plug-ins, text analytics, etc.

26 Quickly drag and drop to create new applications

27 Visualize results through dashboards Built-in dashboards for monitoring system health, application status, distributed file system, etc. Easy to customize.... Add, group, or remove widgets for: BigSheets collections and charts Cluster/system Monitoring HDFS monitoring MapReduce metrics Third party Widgets or Open Social Gadgets can be added to a dashboard Create new, custom dashboards to suit your needs!

28 BigInsights and Text Analytics Distills structured info from unstructured text Sentiment analysis Consumer behavior Illegal or suspicious activities Parses text and detects meaning with annotators Unstructured text (document, , etc) Football World Cup 2010, one team distinguished themselves well, losing to the eventual champions 1-0 in the Final. Early in the second half, Netherlands striker, Arjen Robben, had a breakaway, but the keeper for Spain, Iker Casillas made the save. Winger Andres Iniesta scored for Spain for the win. Understands the context in which the text is analyzed Features pre-built extractors for names, addresses, phone numbers, etc. Classification and Insight Built-in support for English, Spanish, French, German, Portuguese, Dutch, Japanese, Chinese

29 BigInsights Text Analytics Development

30 Example Analysis : Extraction from Twitter messages Extract intent, interests, life events and micro segmentation attributes Monetizable Intent Relocation Name, Birth Day Location I had an iphone, but it's (I've no idea where it's)!want a blackberry im moving to miami in 3 months. i look foward to the new good!!! U shouldnt! Think about the important stuff, like ur birthday ;) btw happy birthday Sylvia ;) I'm at Mickey's Irish Pub Downtown (206 3rd St, Court Ave, Des Moines) w/ 2 others While accounting for less relevant messages Subtle Spam, Advertising I think deserves his 2 AMAZING songs in top ten!!! Buy them on itunes Looking to buy a phone? WiFi Cell Phones, Windows Mobile Sarcasm, Wishful Gotta do more research my Versace term paper 2day. Before I die, I want a versace purple diamond tiara. Im just sayin>lol had so much fun today! I want to buy a million dollar house with a wrap around porch wading river on the long island sound, ha i wish!

31 Unified tooling environment for the data analytics lifecycle Sample data (if desired) Develop, test, and deploy analytic applications 1. Sample your Data 2. Develop your application using BigInsights tools Administer, execute, and monitor the deployed applications 5. Deploy your application on the cluster 3. Test your application 4. Package and publish your application

32 Application Accelerators Quickly build, deploy custom applications in high-value areas IBM Accelerator for Social Data Analytics B2C businesses Sample applications: Customer acquisition / retention, Customer Segmentation or Micro Segmentation, Marketing Campaign Optimization, Lead generation, Brand Management or Surveillance Ships with BigInsights v2 and Streams v3 IBM Accelerator for Machine Data Analytics Cross-industry: manufacturing, oil & gas, energy and utility, healthcare, travel and transportation, CPG, Retail, etc. Operational efficiency monitoring, security incident investigation. proactive maintenance, troubleshooting, outage prevention, efficiency tracking, etc Ships with BigInsights v2 IBM Accelerator for Telco Event Data Analytics Telcos Campaign management, real-time promotion, fraud detection, service assurance and network monitoring, Ships with Streams v3, but works with BigInsights or PureSparta for Analytics (a.k.a. Netezza)

33 Flexible job scheduler option Performance enhancements Optimize response time for small jobs Available in addition to FAIR, FIFO scheduling Adaptive MapReduce Speeds up a class of jobs (e.g., jobs that process small files) Accomplished by changing how certain MapReduce tasks executed Mappers can decide at runtime to take on more work (until it doesn t make sense anymore). Communication via ZooKeeper. Enabled through Jaql option, MapReduce job property setting Efficient processing of compressed text data Use multiple Map tasks (vs Hadoop default of 1) for processing compressed text files Enabled through BigInsights LZO-based compression technology Automatic with Jaql; programming option with Java MapReduce

34 BigInsights Connectivity to DBMS / Warehouse DB2 LUW, IW with DPF Netezza BigInsights BigInsights drives RDBMS work DBMS drives BigInsights work JDBC DBMS

35 On Big Data, BigInsights and Atom-tech... Big Data is our strategic initiative as an application of cloud Broad experience in cloud & infrastructure projects Vendor-independent hypervisor vision of IT IBM InfoSphere BigInsights & Atom-tech Tool to exploit growing variety, velocity, and volume of data Delivers diverse range of analytics to our customers Leverages and extends open source aligned with our vision of business Provides enterprise-class features and supporting services we like it Complement existing software investments and commercial offerings Available in basic (free) and enterprise editions we can easily provide POCs & POTs Atom-tech has a strong partnership with IBM to deliver efficient solutions

36 Pytania? Tomasz Zawadzki Dyrektor Zarządzający Atom-tech

37 Dziękuję! Tomasz Zawadzki Dyrektor Zarządzający Atom-tech

IBM Big Data Summit 2012

IBM Big Data Summit 2012 IBM Big Data Summit 2012 12.10.2012 InfoSphere BigInsights Introduction Wilfried Hoge Leading Technical Sales Professional hoge@de.ibm.com twitter.com/wilfriedhoge 12.10.1012 IBM Big Data Strategy: Move

More information

IBM s InfoSphere BigInsights: Smart Analytics for Big Data

IBM s InfoSphere BigInsights: Smart Analytics for Big Data An IBM Proof of Technology IBM s InfoSphere BigInsights: Smart Analytics for Big Data Meridee Lowry < BigInsights & Streams Technical Specialist meridee@us.ibm.com 2013 IBM Corporation IBM Disclaimer Information

More information

Big Data Platform Overview

Big Data Platform Overview Big Data Platform Overview Alex Hay (athay@us.ibm.com), Big Data CTP Meridee Lowry (meridee@us.ibm.com), Big Data CTP April 30 th, 2014 Big Data is a Concept Big Data 2 IBM Big Data and Analytics Offerings

More information

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN From Information to Insight: The Big Value of Big Data Faire Ann Co Marketing Manager, Information Management Software, ASEAN The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT

More information

Hadoop Course Content

Hadoop Course Content Hadoop Course Content Hadoop Course Content Hadoop Overview, Architecture Considerations, Infrastructure, Platforms and Automation Use case walkthrough ETL Log Analytics Real Time Analytics Hbase for Developers

More information

Big Data The Big Story

Big Data The Big Story Big Data The Big Story Jean-Pierre Dijcks Big Data Product Mangement 1 Agenda What is Big Data? Architecting Big Data Building Big Data Solutions Oracle Big Data Appliance and Big Data Connectors Customer

More information

Angat Pinoy. Angat Negosyo. Angat Pilipinas.

Angat Pinoy. Angat Negosyo. Angat Pilipinas. Angat Pinoy. Angat Negosyo. Angat Pilipinas. Four megatrends will dominate the next decade Mobility Social Cloud Big data 91% of organizations expect to spend on mobile devices in 2012 In 2012, mobile

More information

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake White Paper Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake Motivation for Modernization It is now a well-documented realization among Fortune 500 companies

More information

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Rohit Bakhshi, Solution Architect, Hortonworks Jim Walker, Director Product Marketing, Talend Page 1 About Us Rohit Bakhshi Solution

More information

Smarter Analytics for Big Data

Smarter Analytics for Big Data Smarter Analytics for Big Data Anjul Bhambhri IBM Vice President, Big Data February 27, 2011 The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT The resulting explosion of information

More information

5th Annual. Cloudera, Inc. All rights reserved.

5th Annual. Cloudera, Inc. All rights reserved. 5th Annual 1 The Essentials of Apache Hadoop The What, Why and How to Meet Agency Objectives Sarah Sproehnle, Vice President, Customer Success 2 Introduction 3 What is Apache Hadoop? Hadoop is a software

More information

The Intersection of Big Data and DB2

The Intersection of Big Data and DB2 The Intersection of Big Data and DB2 May 20, 2014 Mike McCarthy, IBM Big Data Channels Development mmccart1@us.ibm.com Agenda What is Big Data? Concepts Characteristics What is Hadoop Relational vs Hadoop

More information

EXAMPLE SOLUTIONS Hadoop in Azure HBase as a columnar NoSQL transactional database running on Azure Blobs Storm as a streaming service for near real time processing Hadoop 2.4 support for 100x query gains

More information

BIG DATA AND HADOOP DEVELOPER

BIG DATA AND HADOOP DEVELOPER BIG DATA AND HADOOP DEVELOPER Approximate Duration - 60 Hrs Classes + 30 hrs Lab work + 20 hrs Assessment = 110 Hrs + 50 hrs Project Total duration of course = 160 hrs Lesson 00 - Course Introduction 0.1

More information

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Roger Ding Cloudera February 3rd, 2018 1 Agenda Hadoop History Introduction to Apache Hadoop

More information

Intro to Big Data and Hadoop

Intro to Big Data and Hadoop Intro to Big and Hadoop Portions copyright 2001 SAS Institute Inc., Cary, NC, USA. All Rights Reserved. Reproduced with permission of SAS Institute Inc., Cary, NC, USA. SAS Institute Inc. makes no warranties

More information

Big Data Live selbst analysieren

Big Data Live selbst analysieren Big Data Live selbst analysieren Hands on Workshop zu IBM InfoSphere Big Insights Harald Gröger Wilfried Hoge Gerhard Wenzel IBM 2013 IBM Corporation Agenda 15:00-15:10 Einführung IBM Big Data Plattform

More information

Optimizing Outcomes in a Connected World: Turning information into insights

Optimizing Outcomes in a Connected World: Turning information into insights Optimizing Outcomes in a Connected World: Turning information into insights Michael Eden Management Brand Executive Central & Eastern Europe Vilnius 18 October 2011 2011 IBM Corporation IBM celebrates

More information

Modernizing Your Data Warehouse with Azure

Modernizing Your Data Warehouse with Azure Modernizing Your Data Warehouse with Azure Big data. Small data. All data. Christian Coté S P O N S O R S The traditional BI Environment The traditional data warehouse data warehousing has reached the

More information

DataAdapt Active Insight

DataAdapt Active Insight Solution Highlights Accelerated time to value Enterprise-ready Apache Hadoop based platform for data processing, warehousing and analytics Advanced analytics for structured, semistructured and unstructured

More information

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

GET MORE VALUE OUT OF BIG DATA

GET MORE VALUE OUT OF BIG DATA GET MORE VALUE OUT OF BIG DATA Enterprise data is increasing at an alarming rate. An International Data Corporation (IDC) study estimates that data is growing at 50 percent a year and will grow by 50 times

More information

IBM InfoSphere BigInsights V2.0 delivering enterprise Hadoop capabilities with easy-to-use analytic tools and visualization

IBM InfoSphere BigInsights V2.0 delivering enterprise Hadoop capabilities with easy-to-use analytic tools and visualization IBM United States Software Announcement 212-442, dated November 13, 2012 IBM InfoSphere BigInsights V2.0 delivering enterprise Hadoop capabilities with easy-to-use analytic tools and visualization Table

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information

Louis Bodine IBM STG WW BAO Tiger Team Leader

Louis Bodine IBM STG WW BAO Tiger Team Leader Louis Bodine IBM STG WW BAO Tiger Team Leader Presentation Objectives Discuss the value of Business Analytics Discuss BAO Ecosystem Discuss Transformational Solutions http://www.youtube.com/watch?v=eiuick5oqdm

More information

SAS and Hadoop Technology: Overview

SAS and Hadoop Technology: Overview SAS and Hadoop Technology: Overview SAS Documentation September 19, 2017 The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2015. SAS and Hadoop Technology: Overview.

More information

Cask Data Application Platform (CDAP)

Cask Data Application Platform (CDAP) Cask Data Application Platform (CDAP) CDAP is an open source, Apache 2.0 licensed, distributed, application framework for delivering Hadoop solutions. It integrates and abstracts the underlying Hadoop

More information

Exploring Big Data and Data Analytics with Hadoop and IDOL. Brochure. You are experiencing transformational changes in the computing arena.

Exploring Big Data and Data Analytics with Hadoop and IDOL. Brochure. You are experiencing transformational changes in the computing arena. Brochure Software Education Exploring Big Data and Data Analytics with Hadoop and IDOL You are experiencing transformational changes in the computing arena. Brochure Exploring Big Data and Data Analytics

More information

Cask Data Application Platform (CDAP) Extensions

Cask Data Application Platform (CDAP) Extensions Cask Data Application Platform (CDAP) Extensions CDAP Extensions provide additional capabilities and user interfaces to CDAP. They are use-case specific applications designed to solve common and critical

More information

MapR: Solution for Customer Production Success

MapR: Solution for Customer Production Success 2015 MapR Technologies 2015 MapR Technologies 1 MapR: Solution for Customer Production Success Big Data High Growth 700+ Customers Cloud Leaders Riding the Wave with Hadoop The Big Data Platform of Choice

More information

Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM

Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM May, 2012 Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM 12+ TBs of tweet data every day 30 billion RFID tags today (1.3B in 2005)

More information

Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved.

Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved. Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved. 1 SAP Big Data Webinar Series Big Data - Introduction to SAP Big Data Technologies Big Data - Streaming Analytics Big Data - Smarter

More information

WELCOME TO. Cloud Data Services: The Art of the Possible

WELCOME TO. Cloud Data Services: The Art of the Possible WELCOME TO Cloud Data Services: The Art of the Possible Goals for Today Share the cloud-based data management and analytics technologies that are enabling rapid development of new mobile applications Discuss

More information

Datametica. The Modern Data Platform Enterprise Data Hub Implementations. Why is workload moving to Cloud

Datametica. The Modern Data Platform Enterprise Data Hub Implementations. Why is workload moving to Cloud Datametica The Modern Data Platform Enterprise Data Hub Implementations Why is workload moving to Cloud 1 What we used do Enterprise Data Hub & Analytics What is Changing Why it is Changing Enterprise

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

E-guide Hadoop Big Data Platforms Buyer s Guide part 1 Hadoop Big Data Platforms Buyer s Guide part 1 Your expert guide to Hadoop big data platforms for managing big data David Loshin, Knowledge Integrity Inc. Companies of all sizes can use Hadoop, as vendors

More information

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA.

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA. ABOUT THIS TRAINING: The world of Hadoop and Big Data" can be intimidating - hundreds of different technologies with cryptic names form the Hadoop ecosystem. This comprehensive training has been designed

More information

OSIsoft Super Regional Transform Your World

OSIsoft Super Regional Transform Your World OSIsoft Super Regional Transform Your World Copyright 208 OSIsoft, LLC OSIsoft Vision & Roadmap Chris Nelson, VP Software Development 2 st August, 208 Copyright 208 OSIsoft, LLC Copyright 208 OSIsoft,

More information

BigInsights on Cloud. Mike Nobles Executive, BigInsights Solution Specialist WW Technical Sales, Cloud Data Services

BigInsights on Cloud. Mike Nobles Executive, BigInsights Solution Specialist WW Technical Sales, Cloud Data Services BigInsights on Cloud Mike Nobles Executive, BigInsights Solution Specialist WW Technical Sales, Cloud Data Services For questions about this presentation contact Mike Nobles at mnobles@us.ibm.com 2015

More information

Hortonworks Data Platform

Hortonworks Data Platform Hortonworks Data Platform An open-architecture platform to manage data in motion and at rest Highlights Addresses a range of data-at-rest use cases Powers real-time customer applications Delivers robust

More information

Confidential

Confidential June 2017 1. Is your EDW becoming too expensive to maintain because of hardware upgrades and increasing data volumes? 2. Is your EDW becoming a monolith, which is too slow to adapt to business s analytical

More information

Hadoop Integration Deep Dive

Hadoop Integration Deep Dive Hadoop Integration Deep Dive Piyush Chaudhary Spectrum Scale BD&A Architect 1 Agenda Analytics Market overview Spectrum Scale Analytics strategy Spectrum Scale Hadoop Integration A tale of two connectors

More information

Realising Value from Data

Realising Value from Data Realising Value from Data Togetherwith Open Source Drives Innovation & Adoption in Big Data BCS Open Source SIG London 1 May 2013 Timings 6:00-6:30pm. Register / Refreshments 6:30-8:00pm, Presentation

More information

How In-Memory Computing can Maximize the Performance of Modern Payments

How In-Memory Computing can Maximize the Performance of Modern Payments How In-Memory Computing can Maximize the Performance of Modern Payments 2018 The mobile payments market is expected to grow to over a trillion dollars by 2019 How can in-memory computing maximize the performance

More information

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?

More information

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Pentaho 8.0 and Beyond Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our

More information

Trifacta Data Wrangling for Hadoop: Accelerating Business Adoption While Ensuring Security & Governance

Trifacta Data Wrangling for Hadoop: Accelerating Business Adoption While Ensuring Security & Governance 575 Market St, 11th Floor San Francisco, CA 94105 www.trifacta.com 844.332.2821 1 WHITEPAPER Trifacta Data Wrangling for Hadoop: Accelerating Business Adoption While Ensuring Security & Governance 2 Introduction

More information

Building a Flexible Information Platform. Mark McConnell Business Unit Executive Enterprise Data Management IBM Software Group, Asia Pacific.

Building a Flexible Information Platform. Mark McConnell Business Unit Executive Enterprise Data Management IBM Software Group, Asia Pacific. Building a Flexible Information Platform Mark McConnell Business Unit Executive Enterprise Data Management IBM Software Group, Asia Pacific. The World is Becoming Smarter Every Day 2.5 billion RFID tags

More information

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

ORACLE DATA INTEGRATOR ENTERPRISE EDITION ORACLE DATA INTEGRATOR ENTERPRISE EDITION Oracle Data Integrator Enterprise Edition delivers high-performance data movement and transformation among enterprise platforms with its open and integrated E-LT

More information

By: Shrikant Gawande (Cloudera Certified )

By: Shrikant Gawande (Cloudera Certified ) By: Shrikant Gawande (Cloudera Certified ) What is Big Data? For every 30 mins, a airline jet collects 10 terabytes of sensor data (flying time) NYSE generates about one terabyte of new trade data per

More information

InfoSphere Warehouse. Flexible. Reliable. Simple. IBM Software Group

InfoSphere Warehouse. Flexible. Reliable. Simple. IBM Software Group IBM Software Group Flexible Reliable InfoSphere Warehouse Simple Ser Yean Tan Regional Technical Sales Manager Information Management Software IBM Software Group ASEAN 2007 IBM Corporation Business Intelligence

More information

Why Big Data Matters? Speaker: Paras Doshi

Why Big Data Matters? Speaker: Paras Doshi Why Big Data Matters? Speaker: Paras Doshi If you re wondering about what is Big Data and why does it matter to you and your organization, then come to this talk and get introduced to Big Data and learn

More information

IBM Digital Analytics Accelerator

IBM Digital Analytics Accelerator IBM Digital Analytics Accelerator On-premises web analytics solution for high-performance, granular insights Highlights: Efficiently capture, store, and analyze online data Benefit from highly scalable

More information

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved. Apache Spark 2.0 GA The General Engine for Modern Analytic Use Cases 1 Apache Spark Drives Business Innovation Apache Spark is driving new business value that is being harnessed by technology forward organizations.

More information

Introduction to Stream Processing

Introduction to Stream Processing Introduction to Processing Guido Schmutz DOAG Big Data 2018 20.9.2018 @gschmutz BASEL BERN BRUGG DÜSSELDORF HAMBURG KOPENHAGEN LAUSANNE guidoschmutz.wordpress.com FRANKFURT A.M. FREIBURG I.BR. GENF MÜNCHEN

More information

Operational Hadoop and the Lambda Architecture for Streaming Data

Operational Hadoop and the Lambda Architecture for Streaming Data Operational Hadoop and the Lambda Architecture for Streaming Data 2015 MapR Technologies 2015 MapR Technologies 1 Topics From Batch to Operational Workloads on Hadoop Streaming Data Environments The Lambda

More information

ActualTests.C Q&A C Foundations of IBM Big Data & Analytics Architecture V1

ActualTests.C Q&A C Foundations of IBM Big Data & Analytics Architecture V1 ActualTests.C2030-136.40Q&A Number: C2030-136 Passing Score: 800 Time Limit: 120 min File Version: 4.8 http://www.gratisexam.com/ C2030-136 Foundations of IBM Big Data & Analytics Architecture V1 Hello,

More information

Apache Hadoop in the Datacenter and Cloud

Apache Hadoop in the Datacenter and Cloud Apache Hadoop in the Datacenter and Cloud The Shift to the Connected Data Architecture Digital Transformation fueled by Big Data Analytics and IoT ACTIONABLE INTELLIGENCE Cloud and Data Center IDMS Relational

More information

NICE Customer Engagement Analytics - Architecture Whitepaper

NICE Customer Engagement Analytics - Architecture Whitepaper NICE Customer Engagement Analytics - Architecture Whitepaper Table of Contents Introduction...3 Data Principles...4 Customer Identities and Event Timelines...................... 4 Data Discovery...5 Data

More information

Cognitive Data Warehouse and Analytics

Cognitive Data Warehouse and Analytics Cognitive Data Warehouse and Analytics Hemant R. Suri, Sr. Offering Manager, Hybrid Data Warehouses, IBM (twitter @hemantrsuri or feel free to reach out to me via LinkedIN!) Over 90% of the world s data

More information

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11 Top 5 Challenges for Hadoop MapReduce in the Enterprise Whitepaper - May 2011 http://platform.com/mapreduce 2 5/9/11 Table of Contents Introduction... 2 Current Market Conditions and Drivers. Customer

More information

Microsoft Big Data. Solution Brief

Microsoft Big Data. Solution Brief Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,

More information

Uncovering the Hidden Truth In Log Data with vcenter Insight

Uncovering the Hidden Truth In Log Data with vcenter Insight Uncovering the Hidden Truth In Log Data with vcenter Insight April 2014 VMware vforum Istanbul 2014 Serdar Arıcan 2014 VMware Inc. All rights reserved. VMware Strategy To help customers realize the promise

More information

Common Customer Use Cases in FSI

Common Customer Use Cases in FSI Common Customer Use Cases in FSI 1 Marketing Optimization 2014 2014 MapR MapR Technologies Technologies 2 Fortune 100 Financial Services Company 104M CARD MEMBERS 3 Financial Services: Recommendation Engine

More information

Amsterdam. (technical) Updates & demonstration. Robert Voermans Governance architect

Amsterdam. (technical) Updates & demonstration. Robert Voermans Governance architect (technical) Updates & demonstration Robert Voermans Governance architect Amsterdam Please note IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice

More information

Architecture Optimization for the new Data Warehouse. Cloudera, Inc. All rights reserved.

Architecture Optimization for the new Data Warehouse. Cloudera, Inc. All rights reserved. Architecture Optimization for the new Data Warehouse Guido Oswald - @GuidoOswald 1 Use Cases This image cannot currently be displayed. This image cannot currently be displayed. This image cannot currently

More information

InfoSphere Warehousing 9.5

InfoSphere Warehousing 9.5 IBM Software Group Optimised InfoSphere Warehousing 9.5 Flexible Simple Phil Downey InfoSphere Warehouse Technical Marketing 2007 IBM Corporation Information On Demand End-to-End Capabilities Optimization

More information

ETL on Hadoop What is Required

ETL on Hadoop What is Required ETL on Hadoop What is Required Keith Kohl Director, Product Management October 2012 Syncsort Copyright 2012, Syncsort Incorporated Agenda Who is Syncsort Extract, Transform, Load (ETL) Overview and conventional

More information

What You Can Accomplish with IBM Content Analytics*

What You Can Accomplish with IBM Content Analytics* What You Can Accomplish With (IBM) Content Analytics Bruce S. Tannenbaum Managing Consultant, IBM Text Analytics Group btannenb@us.ibm.com What You Can Accomplish with IBM Content Analytics* *Currently

More information

Enterprise Analytics Accelerating Your Path to Value with an Open Analytics Platform

Enterprise Analytics Accelerating Your Path to Value with an Open Analytics Platform Enterprise Analytics Accelerating Your Path to Value with an Open Analytics Platform Federico Pozzi @fedealbpozzi Mathias Coopmans @macoopma Characteristics of a badly managed platform No clear data

More information

Transforming Analytics with Cloudera Data Science WorkBench

Transforming Analytics with Cloudera Data Science WorkBench Transforming Analytics with Cloudera Data Science WorkBench Process data, develop and serve predictive models. 1 Age of Machine Learning Data volume NO Machine Learning Machine Learning 1950s 1960s 1970s

More information

SAP Cloud Platform Big Data Services EXTERNAL. SAP Cloud Platform Big Data Services From Data to Insight

SAP Cloud Platform Big Data Services EXTERNAL. SAP Cloud Platform Big Data Services From Data to Insight EXTERNAL FULL-SERVICE BIG DATA IN THE CLOUD, a fully managed Apache Hadoop and Apache Spark cloud offering, form the cornerstone of many successful Big Data implementations. Enterprises harness the performance

More information

APAC Big Data & Cloud Summit 2013

APAC Big Data & Cloud Summit 2013 APAC Big Data & Cloud Summit 2013 Big Data Analytics & Hadoop Use Cases Eddie Toh Server Marketing Manager 21 August 2013 From the dawn of civilization until 2003, we humans created 5 Exabyte of information.

More information

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration KnowledgeSTUDIO Advanced Modeling for Better Decisions Companies that compete with analytics are looking for advanced analytical technologies that accelerate decision making and identify opportunities

More information

Information Architecture: Leveraging Information in an SOA Environment. David McCarty IBM Software IT Architect. IBM SOA Architect Summit

Information Architecture: Leveraging Information in an SOA Environment. David McCarty IBM Software IT Architect. IBM SOA Architect Summit Information Architecture: Leveraging Information in an SOA Environment David McCarty IBM Software IT Architect 2008 IBM Corporation SOA Architect Summit Roadmap What is the impact of SOA on current Enterprise

More information

Cloud Based Analytics for SAP

Cloud Based Analytics for SAP Cloud Based Analytics for SAP Gary Patterson, Global Lead for Big Data About Virtustream A Dell Technologies Business 2,300+ employees 20+ data centers Major operations in 10 countries One of the fastest

More information

More information for FREE VS ENTERPRISE LICENCE :

More information for FREE VS ENTERPRISE LICENCE : Source : http://www.splunk.com/ Splunk Enterprise is a fully featured, powerful platform for collecting, searching, monitoring and analyzing machine data. Splunk Enterprise is easy to deploy and use. It

More information

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand Paper 2698-2018 Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand ABSTRACT Digital analytics is no longer just about tracking the number

More information

MicroStrategy 10. Adam Leno Technical Architect NDM Technologies

MicroStrategy 10. Adam Leno Technical Architect NDM Technologies MicroStrategy 10 Adam Leno Technical Architect NDM Technologies aleno@ndm.net Other analytics solutions Agility or Governance Great for the Business User or Great for IT Ease of Use or Enterprise 10 Agility

More information

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. Course Content Course Description: The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. At Course Completion: After competing this course,

More information

Business is being transformed by three trends

Business is being transformed by three trends Business is being transformed by three trends Big Cloud Intelligence Stay ahead of the curve with Cortana Intelligence Suite Business apps People Custom apps Apps Sensors and devices Cortana Intelligence

More information

Databricks Cloud. A Primer

Databricks Cloud. A Primer Databricks Cloud A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to

More information

Enterprise-Scale MATLAB Applications

Enterprise-Scale MATLAB Applications Enterprise-Scale Applications Sylvain Lacaze Rory Adams 2018 The MathWorks, Inc. 1 Enterprise Integration Access and Explore Data Preprocess Data Develop Predictive Models Integrate Analytics with Systems

More information

HADOOP ADMINISTRATION

HADOOP ADMINISTRATION HADOOP ADMINISTRATION PROSPECTUS HADOOP ADMINISTRATION UNIVERSITY OF SKILLS ABOUT ISM UNIV UNIVERSITY OF SKILLS ISM UNIV is established in 1994, past 21 years this premier institution has trained over

More information

Pentaho 8.0 Overview. Pedro Alves

Pentaho 8.0 Overview. Pedro Alves Pentaho 8.0 Overview Pedro Alves Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our current intended product direction. It is provided for information

More information

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop

More information

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Duration: 5 days; Instructor-led Implement Spark Streaming Using the DStream API. Develop Big Data Real-Time Processing Solutions with Apache

More information

Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer

Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer Redefine Big Data: EMC Data Lake in Action Andrea Prosperi Systems Engineer 1 Agenda Data Analytics Today Big data Hadoop & HDFS Different types of analytics Data lakes EMC Solutions for Data Lakes 2 The

More information

New Big Data Solutions and Opportunities for DB Workloads

New Big Data Solutions and Opportunities for DB Workloads New Big Data Solutions and Opportunities for DB Workloads Hadoop and Spark Ecosystem for Data Analytics, Experience and Outlook Luca Canali, IT-DB Hadoop and Spark Service WLCG, GDB meeting CERN, September

More information

Analytics Platform System

Analytics Platform System Analytics Platform System Big data. Small data. All data. Audie Wright, DW & Big Data Specialist Audie.Wright@Microsoft.com Ofc 425-538-0044, Cell 303-324-2860 Sean Mikha, DW & Big Data Architect semikha@microsoft.com

More information

Hortonworks Connected Data Platforms

Hortonworks Connected Data Platforms Hortonworks Connected Data Platforms MASTER THE VALUE OF DATA EVERY BUSINESS IS A DATA BUSINESS EMBRACE AN OPEN APPROACH 2 Hortonworks Inc. 2011 2016. All Rights Reserved Data Drives the Connected Car

More information

Active Analytics Overview

Active Analytics Overview Active Analytics Overview The Fourth Industrial Revolution is predicated on data. Success depends on recognizing data as the most valuable corporate asset. From smart cities to autonomous vehicles, logistics

More information

Investor Presentation. Fourth Quarter 2015

Investor Presentation. Fourth Quarter 2015 Investor Presentation Fourth Quarter 2015 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences

More information

Machine-generated data: creating new opportunities for utilities, mobile and broadcast networks

Machine-generated data: creating new opportunities for utilities, mobile and broadcast networks APPLICATION BRIEF Machine-generated data: creating new opportunities for utilities, mobile and broadcast networks Electronic devices generate data every millisecond they are in operation. This data is

More information

Real-Time Streaming: IMS to Apache Kafka and Hadoop

Real-Time Streaming: IMS to Apache Kafka and Hadoop Real-Time Streaming: IMS to Apache Kafka and Hadoop - 2017 Scott Quillicy SQData Outline methods of streaming mainframe data to big data platforms Set throughput / latency expectations for popular big

More information

20775 Performing Data Engineering on Microsoft HD Insight

20775 Performing Data Engineering on Microsoft HD Insight Duración del curso: 5 Días Acerca de este curso The main purpose of the course is to give students the ability plan and implement big data workflows on HD. Perfil de público The primary audience for this

More information

IBM Business Intelligence and Business Analytics

IBM Business Intelligence and Business Analytics IBM Business Intelligence and Business Analytics Ganesh 1 Kedari IBM India Software Labs, Pune #1 concern Business Analytics 83% Virtualization 76% Risk Management & Compliance 71% Mobility Solutions 68%

More information

IBM Cognos What s New? Webinar. December 12,

IBM Cognos What s New? Webinar. December 12, IBM Cognos 10.2 What s New? Webinar December 12, 2012 www.senturus.com 1 Helping Companies Learn From the Past, Manage the Present and Shape the Future GoToWebinar Control Panel Submit questions here Click

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Course Details Course Code: Duration: Notes: 20775A 5 days This course syllabus should be used to determine whether the course is appropriate

More information