IBM Big Data Summit 2012

Size: px
Start display at page:

Download "IBM Big Data Summit 2012"

Transcription

1 IBM Big Data Summit

2 InfoSphere BigInsights Introduction Wilfried Hoge Leading Technical Sales Professional twitter.com/wilfriedhoge

3 IBM Big Data Strategy: Move the Analytics Closer to the Data New analytic applications drive the requirements for a big data platform Integrate and manage the full variety, velocity and volume of data Apply advanced analytics to information in its native form Visualize all available data for adhoc analysis Development environment for building new analytic applications Workload optimization and scheduling Security and Governance BI / Exploration / Functional Industry Predictive Reporting Visualization App App Analytics Visualization & Discovery Hadoop System Analytic Applications IBM Big Data Platform Application Development Accelerators Stream Computing Content Analytics Systems Management Data Warehouse Information Integration & Governance

4 BigInsights analytical platform for persistent Big Data Based on open source & IBM technologies Distinguishing characteristics Built-in analytics... enhances business knowledge Enterprise software integration... complements and extends existing capabilities Production-ready platform with tooling for analysts, developers, and administrators... speeds time-to-value and simplifies development/maintenance IBM advantage Combination of software, hardware, services and advanced research BI / Exploration / Functional Industry Predictive Reporting Visualization App App Analytics Visualization & Discovery Hadoop System Analytic Applications IBM Big Data Platform Application Development Accelerators Stream Computing Content Analytics Systems Management Data Warehouse Information Integration & Governance

5 About the BigInsights Platform Flexible, enterprise-class support for processing large volumes of data Based on Google s MapReduce technology Inspired by Apache Hadoop; compatible with its ecosystem and distribution Well-suited to batch-oriented, read-intensive applications Supports wide variety of data Enables applications to work with thousands of nodes and petabytes of data in a highly parallel, cost effective manner CPU + disks = node Nodes can be combined into clusters New nodes can be added as needed without changing Data formats How data is loaded How jobs are written

6 Hadoop Explained Map Reduce Hadoop computation model Data stored in a distributed file system spanning many inexpensive computers Bring function to the data Distribute application to the compute resources where the data is stored Scalable to thousands of nodes and petabytes of data public static class TokenizerMapper extends Mapper<Object,Text,Text,IntWritable> { private final static IntWritable one = new IntWritable(1); private Text word = new Text(); Hadoop Data Nodes public void map(object key, Text val, Context StringTokenizer itr = new StringTokenizer(val.toString()); while (itr.hasmoretokens()) { word.set(itr.nexttoken()); context.write(word, one); } } } public static class IntSumReducer extends Reducer<Text,IntWritable,Text,IntWrita private IntWritable result = new Intritable(); public void reduce(text key, Iterable<IntWritable> val, Context context){ int sum = 0; for (IntWritable v : val) { sum += v.get();... MapReduce Application Distribute map tasks to cluster Shuffle 1. Map Phase (break job into small parts) 2. Shuffle (transfer interim output for final processing) 3. Reduce Phase (boil all output down to a single result set) Result Set Return a single result set

7 BigInsights Value Beyond Open Source Technical differentiators Built-in analytics Text processing engine, annotators, Eclipse tooling Statistical and predictive analysis Interface to project R (statistical platform) Enterprise software integration (DBMS, warehouse) Spreadsheet-style analytical tool for analysts Ready-made business process accelerators Integrated installation of supported open source and IBM components Web Console for administration and application access Platform enrichment: additional security, performance features,... Standard IBM licensing agreement and world-class support Business benefits Quicker time-to-value due to IBM technology and support Reduced operational risk Enhanced business knowledge with flexible analytical platform Leverages and complements existing software assets

8 Zookeeper IBM LZO Compression Avro InfoSphere BigInsights Embrace and Extend Hadoop Analytics ML Analytics Text Analytics BigSheets Interface Web console Application Pig Hive Jaql MapReduce AdaptiveMR FLEX BigIndex Oozie Lucene Monitor cluster health Add / remove nodes Start / stop services Inspect job status Inspect workflow status Deploy apps Launch apps / jobs Work with distrib. file system Work with spreadsheet interface Support REST-based API... Storage HDFS HBase GPFS-SNC Eclipse plug-ins Data Sources/ Connectors Streams Netezza BoardReader R Text analytics MapReduce programming Jaql development Hive query development Data Stage DB2 CSV / XML / JSON SPSS Flume JDBC Web Crawler IBM Open Source

9 Web Installation Tool Seamless process for single node and cluster environments Integrated installation of all selected components Post-install validation of IBM and open source components No need to iteratively download, configure, and test multiple open source projects and their pre-requisite software.

10 Web Console Manage BigInsights Inspect system health Add / drop nodes Start / stop services Run / monitor jobs (applications) Explore / modify file system Launch applications Spreadsheet-like analysis tool Pre-built applications (IBM supplied or user developed) Publish applications Leverage community resources

11 Quick start applications or apps Reusable software assets based on customer engagements Useful for starting point for various applications Can be customized by BigInsights application developers as needed Accessible through Web console Available assets Data export (to relational DBMS, files, HBase) Data import (from relational DBMS, files) Web crawler, Twitter crawler Boardreader.com support (Web forum search engine) Ad hoc queries for Jaql, Hive, Pig TeraGen-TeraSort, WordCount sample applications

12 Running Applications from the Web Console

13 DEMO web console

14 BigSheets BigSheets is a visual tool for data manipulation and prototyping Allows more users to do more work, more quickly Simply stated, growing an army of MapReduce developers is not cost effective In your BI environments you have a ratio of 30+ report users for every complex SQL developer. We need to support the same ratios with BigInsights Sample Uses Data exploration and visualization Visual job creation

15 BigSheets Spreadsheet-style Data Analysis and Discovery

16 BigSheets Visualization

17 DEMO BigSheets

18 Text Analytics in BigInsights Text analytics Distill structured information from unstructured data Rich annotator library supports multiple languages Declarative Information Extraction (IE) system based on an algebraic framework Richer, cleaner rule semantics Better performance through optimization Developed at IBM Research since 2004 Embedded in several IBM products Lotus Notes Cognos Consumer Insights InfoSphere Streams Compose operators to build complex annotators

19 Text Analytics highly accurate analysis of textual content How it works Parses text and detects meaning with annotators Understands the context in which the text is analyzed Hundreds of pre-built annotators for names, addresses, phone numbers, along others Accuracy Highly accurate in deriving meaning from complex text Performance AQL language optimized for MapReduce Unstructured text (document, , etc) Football World Cup 2010, one team distinguished themselves well, losing to the eventual champions 1-0 in the Final. Early in the second half, Netherlands striker, Arjen Robben, had a breakaway, but the keeper for Spain, Iker Casillas made the save. Winger Andres Iniesta scored for Spain for the win. Classification and Insight

20 BigInsights Text Analytics Development AQL

21 Text Analytics Tooling AQL Editor Result Viewer Runtime Explain

22 DEMO Text Analytics

23 Ways to get started with BigInsights In the Cloud Via RightScale, or directly on Amazon, Rackspace, IBM Smart Enterprise Cloud, or on private clouds. Pay only for the resources used. In the Virtual Classroom Free Hadoop Fundamentals training course e.g. BD105EN - Text Analytics Essentials On Your Cluster Download Basic Edition from ibm.com. In the Classroom Enroll in the InfoSphere BigInsights Essentials course.

24 Visit the BigInsights technical portal.... Free links to papers, demos, discussion forum, and more

25 IBM Big Data Summit

IBM BigInsights - Hadoop jako rozwiązanie korporacyjne. Tomasz Zawadzki Dyrektor Zarządzający Atom-tech

IBM BigInsights - Hadoop jako rozwiązanie korporacyjne. Tomasz Zawadzki Dyrektor Zarządzający Atom-tech IBM BigInsights - Hadoop jako rozwiązanie korporacyjne Tomasz Zawadzki Dyrektor Zarządzający Atom-tech IBM BigInsights - Hadoop jako rozwiązanie korporacyjne Tomasz Zawadzki Dyrektor Zarządzający Atom-tech

More information

Big Data Platform Overview

Big Data Platform Overview Big Data Platform Overview Alex Hay (athay@us.ibm.com), Big Data CTP Meridee Lowry (meridee@us.ibm.com), Big Data CTP April 30 th, 2014 Big Data is a Concept Big Data 2 IBM Big Data and Analytics Offerings

More information

IBM s InfoSphere BigInsights: Smart Analytics for Big Data

IBM s InfoSphere BigInsights: Smart Analytics for Big Data An IBM Proof of Technology IBM s InfoSphere BigInsights: Smart Analytics for Big Data Meridee Lowry < BigInsights & Streams Technical Specialist meridee@us.ibm.com 2013 IBM Corporation IBM Disclaimer Information

More information

BigInsights on Cloud. Mike Nobles Executive, BigInsights Solution Specialist WW Technical Sales, Cloud Data Services

BigInsights on Cloud. Mike Nobles Executive, BigInsights Solution Specialist WW Technical Sales, Cloud Data Services BigInsights on Cloud Mike Nobles Executive, BigInsights Solution Specialist WW Technical Sales, Cloud Data Services For questions about this presentation contact Mike Nobles at mnobles@us.ibm.com 2015

More information

Big Data Live selbst analysieren

Big Data Live selbst analysieren Big Data Live selbst analysieren Hands on Workshop zu IBM InfoSphere Big Insights Harald Gröger Wilfried Hoge Gerhard Wenzel IBM 2013 IBM Corporation Agenda 15:00-15:10 Einführung IBM Big Data Plattform

More information

WELCOME TO. Cloud Data Services: The Art of the Possible

WELCOME TO. Cloud Data Services: The Art of the Possible WELCOME TO Cloud Data Services: The Art of the Possible Goals for Today Share the cloud-based data management and analytics technologies that are enabling rapid development of new mobile applications Discuss

More information

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Rohit Bakhshi, Solution Architect, Hortonworks Jim Walker, Director Product Marketing, Talend Page 1 About Us Rohit Bakhshi Solution

More information

The Intersection of Big Data and DB2

The Intersection of Big Data and DB2 The Intersection of Big Data and DB2 May 20, 2014 Mike McCarthy, IBM Big Data Channels Development mmccart1@us.ibm.com Agenda What is Big Data? Concepts Characteristics What is Hadoop Relational vs Hadoop

More information

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN From Information to Insight: The Big Value of Big Data Faire Ann Co Marketing Manager, Information Management Software, ASEAN The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

Microsoft Big Data. Solution Brief

Microsoft Big Data. Solution Brief Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,

More information

BIG DATA AND HADOOP DEVELOPER

BIG DATA AND HADOOP DEVELOPER BIG DATA AND HADOOP DEVELOPER Approximate Duration - 60 Hrs Classes + 30 hrs Lab work + 20 hrs Assessment = 110 Hrs + 50 hrs Project Total duration of course = 160 hrs Lesson 00 - Course Introduction 0.1

More information

IBM InfoSphere BigInsights V2.0 delivering enterprise Hadoop capabilities with easy-to-use analytic tools and visualization

IBM InfoSphere BigInsights V2.0 delivering enterprise Hadoop capabilities with easy-to-use analytic tools and visualization IBM United States Software Announcement 212-442, dated November 13, 2012 IBM InfoSphere BigInsights V2.0 delivering enterprise Hadoop capabilities with easy-to-use analytic tools and visualization Table

More information

Angat Pinoy. Angat Negosyo. Angat Pilipinas.

Angat Pinoy. Angat Negosyo. Angat Pilipinas. Angat Pinoy. Angat Negosyo. Angat Pilipinas. Four megatrends will dominate the next decade Mobility Social Cloud Big data 91% of organizations expect to spend on mobile devices in 2012 In 2012, mobile

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 3

E-guide Hadoop Big Data Platforms Buyer s Guide part 3 Big Data Platforms Buyer s Guide part 3 Your expert guide to big platforms enterprise MapReduce cloud-based Abie Reifer, DecisionWorx The Amazon Elastic MapReduce Web service offers a managed framework

More information

SAS and Hadoop Technology: Overview

SAS and Hadoop Technology: Overview SAS and Hadoop Technology: Overview SAS Documentation September 19, 2017 The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2015. SAS and Hadoop Technology: Overview.

More information

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Roger Ding Cloudera February 3rd, 2018 1 Agenda Hadoop History Introduction to Apache Hadoop

More information

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration KnowledgeSTUDIO Advanced Modeling for Better Decisions Companies that compete with analytics are looking for advanced analytical technologies that accelerate decision making and identify opportunities

More information

Hadoop Course Content

Hadoop Course Content Hadoop Course Content Hadoop Course Content Hadoop Overview, Architecture Considerations, Infrastructure, Platforms and Automation Use case walkthrough ETL Log Analytics Real Time Analytics Hbase for Developers

More information

DataAdapt Active Insight

DataAdapt Active Insight Solution Highlights Accelerated time to value Enterprise-ready Apache Hadoop based platform for data processing, warehousing and analytics Advanced analytics for structured, semistructured and unstructured

More information

ETL on Hadoop What is Required

ETL on Hadoop What is Required ETL on Hadoop What is Required Keith Kohl Director, Product Management October 2012 Syncsort Copyright 2012, Syncsort Incorporated Agenda Who is Syncsort Extract, Transform, Load (ETL) Overview and conventional

More information

EXAMPLE SOLUTIONS Hadoop in Azure HBase as a columnar NoSQL transactional database running on Azure Blobs Storm as a streaming service for near real time processing Hadoop 2.4 support for 100x query gains

More information

Big Data & Hadoop Advance

Big Data & Hadoop Advance Course Durations: 30 Hours About Company: Course Mode: Online/Offline EduNextgen extended arm of Product Innovation Academy is a growing entity in education and career transformation, specializing in today

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

E-guide Hadoop Big Data Platforms Buyer s Guide part 1 Hadoop Big Data Platforms Buyer s Guide part 1 Your expert guide to Hadoop big data platforms for managing big data David Loshin, Knowledge Integrity Inc. Companies of all sizes can use Hadoop, as vendors

More information

5th Annual. Cloudera, Inc. All rights reserved.

5th Annual. Cloudera, Inc. All rights reserved. 5th Annual 1 The Essentials of Apache Hadoop The What, Why and How to Meet Agency Objectives Sarah Sproehnle, Vice President, Customer Success 2 Introduction 3 What is Apache Hadoop? Hadoop is a software

More information

Big Data und Hadoop. BI/DW Modernisierungs-Szenarien auf System z

Big Data und Hadoop. BI/DW Modernisierungs-Szenarien auf System z Big Data und Hadoop BI/DW Modernisierungs-Szenarien auf System z Eberhard Hechler Executive Architect, Member IBM Academy of Technology IBM Germany R&D Lab Trademarks The following are trademarks of the

More information

Hadoop Integration Deep Dive

Hadoop Integration Deep Dive Hadoop Integration Deep Dive Piyush Chaudhary Spectrum Scale BD&A Architect 1 Agenda Analytics Market overview Spectrum Scale Analytics strategy Spectrum Scale Hadoop Integration A tale of two connectors

More information

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud

More information

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA.

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA. ABOUT THIS TRAINING: The world of Hadoop and Big Data" can be intimidating - hundreds of different technologies with cryptic names form the Hadoop ecosystem. This comprehensive training has been designed

More information

IBM SmartCloud public images with selected software

IBM SmartCloud public images with selected software IBM SmartCloud public images with selected software Current as of September 1, 2011. To find out how your organization can leverage the IBM SmartCloud, visit our IBM SmartCloud Enterprise website. PAYG:

More information

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11

Top 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11 Top 5 Challenges for Hadoop MapReduce in the Enterprise Whitepaper - May 2011 http://platform.com/mapreduce 2 5/9/11 Table of Contents Introduction... 2 Current Market Conditions and Drivers. Customer

More information

Why Big Data Matters? Speaker: Paras Doshi

Why Big Data Matters? Speaker: Paras Doshi Why Big Data Matters? Speaker: Paras Doshi If you re wondering about what is Big Data and why does it matter to you and your organization, then come to this talk and get introduced to Big Data and learn

More information

IBM Analytics Unleash the power of data with Apache Spark

IBM Analytics Unleash the power of data with Apache Spark IBM Analytics Unleash the power of data with Apache Spark Agility, speed and simplicity define the analytics operating system of the future 1 2 3 4 Use Spark to create value from data-driven insights Lower

More information

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?

More information

GET MORE VALUE OUT OF BIG DATA

GET MORE VALUE OUT OF BIG DATA GET MORE VALUE OUT OF BIG DATA Enterprise data is increasing at an alarming rate. An International Data Corporation (IDC) study estimates that data is growing at 50 percent a year and will grow by 50 times

More information

Datametica. The Modern Data Platform Enterprise Data Hub Implementations. Why is workload moving to Cloud

Datametica. The Modern Data Platform Enterprise Data Hub Implementations. Why is workload moving to Cloud Datametica The Modern Data Platform Enterprise Data Hub Implementations Why is workload moving to Cloud 1 What we used do Enterprise Data Hub & Analytics What is Changing Why it is Changing Enterprise

More information

Cognitive Data Warehouse and Analytics

Cognitive Data Warehouse and Analytics Cognitive Data Warehouse and Analytics Hemant R. Suri, Sr. Offering Manager, Hybrid Data Warehouses, IBM (twitter @hemantrsuri or feel free to reach out to me via LinkedIN!) Over 90% of the world s data

More information

Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM

Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM May, 2012 Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM 12+ TBs of tweet data every day 30 billion RFID tags today (1.3B in 2005)

More information

Intro to Big Data and Hadoop

Intro to Big Data and Hadoop Intro to Big and Hadoop Portions copyright 2001 SAS Institute Inc., Cary, NC, USA. All Rights Reserved. Reproduced with permission of SAS Institute Inc., Cary, NC, USA. SAS Institute Inc. makes no warranties

More information

1. Intoduction to Hadoop

1. Intoduction to Hadoop 1. Intoduction to Hadoop Hadoop is a rapidly evolving ecosystem of components for implementing the Google MapReduce algorithms in a scalable fashion on commodity hardware. Hadoop enables users to store

More information

Trifacta Data Wrangling for Hadoop: Accelerating Business Adoption While Ensuring Security & Governance

Trifacta Data Wrangling for Hadoop: Accelerating Business Adoption While Ensuring Security & Governance 575 Market St, 11th Floor San Francisco, CA 94105 www.trifacta.com 844.332.2821 1 WHITEPAPER Trifacta Data Wrangling for Hadoop: Accelerating Business Adoption While Ensuring Security & Governance 2 Introduction

More information

Big Data The Big Story

Big Data The Big Story Big Data The Big Story Jean-Pierre Dijcks Big Data Product Mangement 1 Agenda What is Big Data? Architecting Big Data Building Big Data Solutions Oracle Big Data Appliance and Big Data Connectors Customer

More information

Lesson 3 Cloud Platform as a Service usages for accelerated Design and Deployment of IoTs

Lesson 3 Cloud Platform as a Service usages for accelerated Design and Deployment of IoTs Lesson 3 Cloud Platform as a Service usages for accelerated Design and Deployment of IoTs 1 Large and Big Data platform Oracle IOT PaaS For delivering, integrating, securing and retrieving For analysing

More information

Big Data Analytics met Hadoop

Big Data Analytics met Hadoop Big Data Analytics met Hadoop Jos van Dongen Arno Klijnman What is Distributed storage and processing of (big) data on large clusters of commodity hardware HDFS Map/Reduce HDFS - Distributed storage for

More information

Mobile Application Developer

Mobile Application Developer Mobile Application Developer The Mobile Application Developer career path prepares students to develop, test, debug and deploy hybrid mobile applications. This will require skills in application development

More information

IBM PureData System for Analytics Overview

IBM PureData System for Analytics Overview IBM PureData System for Analytics Overview Chris Jackson Technical Sales Specialist chrisjackson@us.ibm.com Traditional Data Warehouses are just too complex They do NOT meet the demands of advanced analytics

More information

Enterprise-Scale MATLAB Applications

Enterprise-Scale MATLAB Applications Enterprise-Scale Applications Sylvain Lacaze Rory Adams 2018 The MathWorks, Inc. 1 Enterprise Integration Access and Explore Data Preprocess Data Develop Predictive Models Integrate Analytics with Systems

More information

David Taylor

David Taylor Sept 10, 2013 What s New! IBM Cognos Business Intelligence 10.2.1.1 (released Sept 10, 2013) Analytic Catalyst TM1 10.2 Cognos Insight David Taylor david.taylor@us.ibm.com Agenda Overview of innovations

More information

MapR Pentaho Business Solutions

MapR Pentaho Business Solutions MapR Pentaho Business Solutions The Benefits of a Converged Platform to Big Data Integration Tom Scurlock Director, WW Alliances and Partners, MapR Key Takeaways 1. We focus on business values and business

More information

Cask Data Application Platform (CDAP) Extensions

Cask Data Application Platform (CDAP) Extensions Cask Data Application Platform (CDAP) Extensions CDAP Extensions provide additional capabilities and user interfaces to CDAP. They are use-case specific applications designed to solve common and critical

More information

Hortonworks Data Platform

Hortonworks Data Platform Hortonworks Data Platform An open-architecture platform to manage data in motion and at rest Highlights Addresses a range of data-at-rest use cases Powers real-time customer applications Delivers robust

More information

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

ORACLE DATA INTEGRATOR ENTERPRISE EDITION ORACLE DATA INTEGRATOR ENTERPRISE EDITION Oracle Data Integrator Enterprise Edition delivers high-performance data movement and transformation among enterprise platforms with its open and integrated E-LT

More information

Business is being transformed by three trends

Business is being transformed by three trends Business is being transformed by three trends Big Cloud Intelligence Stay ahead of the curve with Cortana Intelligence Suite Business apps People Custom apps Apps Sensors and devices Cortana Intelligence

More information

Alexander Klein. ETL meets Azure

Alexander Klein. ETL meets Azure Alexander Klein ETL meets Azure Thanks to our sponsors: Who am I? Independent BI Consultant > 15 years experience of SQL Server Focus on Microsoft BI Stack & AI & Azure a.klein@consulting-bi.de @SQL_Alex

More information

Practices of Business Intelligence. (Business Intelligence, Analytics, and Data Science)

Practices of Business Intelligence. (Business Intelligence, Analytics, and Data Science) Tamkang University Practices of Business Intelligence Tamkang University (Business Intelligence, Analytics, and Data Science) 1071BI02 MI4 (M2084) (2888) Wed, 7, 8 (14:10-16:00) (B217) Min-Yuh Day Assistant

More information

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. Course Content Course Description: The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. At Course Completion: After competing this course,

More information

The Information Integration Platform

The Information Integration Platform The Information Integration Platform IIS Product and Technology Vision & Roadmap Bob Zurek Director, Advanced Technologies and Product Strategy Information Integration Solutions IBM Software Group IBM

More information

New Big Data Solutions and Opportunities for DB Workloads

New Big Data Solutions and Opportunities for DB Workloads New Big Data Solutions and Opportunities for DB Workloads Hadoop and Spark Ecosystem for Data Analytics, Experience and Outlook Luca Canali, IT-DB Hadoop and Spark Service WLCG, GDB meeting CERN, September

More information

Exploring Big Data and Data Analytics with Hadoop and IDOL. Brochure. You are experiencing transformational changes in the computing arena.

Exploring Big Data and Data Analytics with Hadoop and IDOL. Brochure. You are experiencing transformational changes in the computing arena. Brochure Software Education Exploring Big Data and Data Analytics with Hadoop and IDOL You are experiencing transformational changes in the computing arena. Brochure Exploring Big Data and Data Analytics

More information

Information Server 11.3 Overview. Kevin D Silva Client Technical Professional, InfoSphere Information Server

Information Server 11.3 Overview. Kevin D Silva Client Technical Professional, InfoSphere Information Server Information Server 11.3 Overview Kevin D Silva Client Technical Professional, InfoSphere Information Server Governance Concerns for Big Customers Integrate & Link Big Big as a Source Big as a Target Transformations

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Duration: 5 days; Instructor-led Implement Spark Streaming Using the DStream API. Develop Big Data Real-Time Processing Solutions with Apache

More information

MapR: Solution for Customer Production Success

MapR: Solution for Customer Production Success 2015 MapR Technologies 2015 MapR Technologies 1 MapR: Solution for Customer Production Success Big Data High Growth 700+ Customers Cloud Leaders Riding the Wave with Hadoop The Big Data Platform of Choice

More information

IBM Db2 Warehouse. Hybrid data warehousing using a software-defined environment in a private cloud. The evolution of the data warehouse

IBM Db2 Warehouse. Hybrid data warehousing using a software-defined environment in a private cloud. The evolution of the data warehouse IBM Db2 Warehouse Hybrid data warehousing using a software-defined environment in a private cloud The evolution of the data warehouse Managing a large-scale, on-premises data warehouse environments to

More information

Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved.

Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved. Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved. 1 SAP Big Data Webinar Series Big Data - Introduction to SAP Big Data Technologies Big Data - Streaming Analytics Big Data - Smarter

More information

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect 2005 Concert de Coldplay 2014 Concert de Coldplay 90% of the world s data has been created over the last two years alone 1 1. Source

More information

Analytics in Action transforming the way we use and consume information

Analytics in Action transforming the way we use and consume information Analytics in Action transforming the way we use and consume information Big Data Ecosystem The Data Traditional Data BIG DATA Repositories MPP Appliances Internet Hadoop Data Streaming Big Data Ecosystem

More information

20775 Performing Data Engineering on Microsoft HD Insight

20775 Performing Data Engineering on Microsoft HD Insight Duración del curso: 5 Días Acerca de este curso The main purpose of the course is to give students the ability plan and implement big data workflows on HD. Perfil de público The primary audience for this

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Course Details Course Code: Duration: Notes: 20775A 5 days This course syllabus should be used to determine whether the course is appropriate

More information

InfoSphere Warehousing 9.5

InfoSphere Warehousing 9.5 IBM Software Group Optimised InfoSphere Warehousing 9.5 Flexible Simple Phil Downey InfoSphere Warehouse Technical Marketing 2007 IBM Corporation Information On Demand End-to-End Capabilities Optimization

More information

Insights to HDInsight

Insights to HDInsight Insights to HDInsight Why Hadoop in the Cloud? No hardware costs Unlimited Scale Pay for What You Need Deployed in minutes Azure HDInsight Big Data made easy Enterprise Ready Easier and more productive

More information

Cask Data Application Platform (CDAP)

Cask Data Application Platform (CDAP) Cask Data Application Platform (CDAP) CDAP is an open source, Apache 2.0 licensed, distributed, application framework for delivering Hadoop solutions. It integrates and abstracts the underlying Hadoop

More information

20775: Performing Data Engineering on Microsoft HD Insight

20775: Performing Data Engineering on Microsoft HD Insight Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

Analyzing Data with Power BI

Analyzing Data with Power BI Course 20778A: Analyzing Data with Power BI Course Outline Module 1: Introduction to Self-Service BI Solutions Introduces business intelligence (BI) and how to self-serve with BI. Introduction to business

More information

Information Builders Enterprise Information Management Solution Transforming data into business value Fateh NAILI Enterprise Solutions Manager

Information Builders Enterprise Information Management Solution Transforming data into business value Fateh NAILI Enterprise Solutions Manager Information Builders Enterprise Information Management Solution Transforming data into business value Fateh NAILI Enterprise Solutions Manager June 20 th, 2017 1 Agenda Introduction Information Builders

More information

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand Paper 2698-2018 Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand ABSTRACT Digital analytics is no longer just about tracking the number

More information

Modernizing Your Data Warehouse with Azure

Modernizing Your Data Warehouse with Azure Modernizing Your Data Warehouse with Azure Big data. Small data. All data. Christian Coté S P O N S O R S The traditional BI Environment The traditional data warehouse data warehousing has reached the

More information

Amsterdam. (technical) Updates & demonstration. Robert Voermans Governance architect

Amsterdam. (technical) Updates & demonstration. Robert Voermans Governance architect (technical) Updates & demonstration Robert Voermans Governance architect Amsterdam Please note IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice

More information

SAP Cloud Platform Big Data Services EXTERNAL. SAP Cloud Platform Big Data Services From Data to Insight

SAP Cloud Platform Big Data Services EXTERNAL. SAP Cloud Platform Big Data Services From Data to Insight EXTERNAL FULL-SERVICE BIG DATA IN THE CLOUD, a fully managed Apache Hadoop and Apache Spark cloud offering, form the cornerstone of many successful Big Data implementations. Enterprises harness the performance

More information

COPYRIGHTED MATERIAL. 1Big Data and the Hadoop Ecosystem

COPYRIGHTED MATERIAL. 1Big Data and the Hadoop Ecosystem 1Big Data and the Hadoop Ecosystem WHAT S IN THIS CHAPTER? Understanding the challenges of Big Data Getting to know the Hadoop ecosystem Getting familiar with Hadoop distributions Using Hadoop-based enterprise

More information

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data?

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data? Glenn Anderson, IBM Lab Services and Training What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data? Winter SHARE March 2014 Session 15126 Today s mainframe is a hybrid system InfoSphere Streams

More information

EXECUTIVE BRIEF. Successful Data Warehouse Approaches to Meet Today s Analytics Demands. In this Paper

EXECUTIVE BRIEF. Successful Data Warehouse Approaches to Meet Today s Analytics Demands. In this Paper Sponsored by Successful Data Warehouse Approaches to Meet Today s Analytics Demands EXECUTIVE BRIEF In this Paper Organizations are adopting increasingly sophisticated analytics methods Analytics usage

More information

The Sysprog s Guide to the Customer Facing Mainframe: Cloud / Mobile / Social / Big Data

The Sysprog s Guide to the Customer Facing Mainframe: Cloud / Mobile / Social / Big Data Glenn Anderson, IBM Lab Services and Training The Sysprog s Guide to the Customer Facing Mainframe: Cloud / Mobile / Social / Big Data Summer SHARE August 2015 Session 17794 2 (c) Copyright 2015 IBM Corporation

More information

Oracle Big Data Discovery The Visual Face of Big Data

Oracle Big Data Discovery The Visual Face of Big Data Oracle Big Data Discovery The Visual Face of Big Data Today's Big Data challenge is not how to store it, but how to make sense of it. Oracle Big Data Discovery is a fundamentally new approach to making

More information

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Pentaho 8.0 and Beyond Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our

More information

Oracle Service Cloud. New Feature Summary

Oracle Service Cloud. New Feature Summary Oracle Service Cloud New Feature Summary May 2017 TABLE OF CONTENTS REVISION HISTORY... 3 ORACLE SERVICE CLOUD MAY RELEASE OVERVIEW... 4 WEB CUSTOMER SERVICE... 4 Community Self-Service Enhancements...

More information

InfoSphere Warehouse. Flexible. Reliable. Simple. IBM Software Group

InfoSphere Warehouse. Flexible. Reliable. Simple. IBM Software Group IBM Software Group Flexible Reliable InfoSphere Warehouse Simple Ser Yean Tan Regional Technical Sales Manager Information Management Software IBM Software Group ASEAN 2007 IBM Corporation Business Intelligence

More information

Realising Value from Data

Realising Value from Data Realising Value from Data Togetherwith Open Source Drives Innovation & Adoption in Big Data BCS Open Source SIG London 1 May 2013 Timings 6:00-6:30pm. Register / Refreshments 6:30-8:00pm, Presentation

More information

Social Analytics. More than Listening Social Media Strategy. Creating relationship. Build advocacy. Improve loyalty

Social Analytics. More than Listening Social Media Strategy. Creating relationship. Build advocacy. Improve loyalty Social Analytics Creating relationship. Build advocacy. Improve loyalty Alok Kumar Jha BI Solution Architect India Software Lab, SWG Bangalore More than Listening Social Media Strategy Social Media Analytics

More information

Copyright 2014, Oracle and/or its affiliates. All rights reserved. 2

Copyright 2014, Oracle and/or its affiliates. All rights reserved. 2 Copyright 2014, Oracle and/or its affiliates. All rights reserved. 2 Oracle Cloud Marketplace: An Innovation Ecosystem for Partners and Customers Neelesh Gurnani Sr. Director Product Development Ajay Seetharam

More information

Spark and Hadoop Perfect Together

Spark and Hadoop Perfect Together Spark and Hadoop Perfect Together Arun Murthy Hortonworks Co-Founder @acmurthy Data Operating System Enable all data and applications TO BE accessible and shared BY any end-users Data Operating System

More information

NICE Customer Engagement Analytics - Architecture Whitepaper

NICE Customer Engagement Analytics - Architecture Whitepaper NICE Customer Engagement Analytics - Architecture Whitepaper Table of Contents Introduction...3 Data Principles...4 Customer Identities and Event Timelines...................... 4 Data Discovery...5 Data

More information

Course 20467C: Designing Self-Service Business Intelligence and Big Data Solutions

Course 20467C: Designing Self-Service Business Intelligence and Big Data Solutions Course 20467C: Designing Self-Service Business Intelligence and Big Data Solutions Length : 5 Days Audience(s) : IT Professionals Level : 300 Technology : Microsoft SQL Server 2014 Delivery Method : Instructor-led

More information

Architecture Optimization for the new Data Warehouse. Cloudera, Inc. All rights reserved.

Architecture Optimization for the new Data Warehouse. Cloudera, Inc. All rights reserved. Architecture Optimization for the new Data Warehouse Guido Oswald - @GuidoOswald 1 Use Cases This image cannot currently be displayed. This image cannot currently be displayed. This image cannot currently

More information

Berkeley Data Analytics Stack (BDAS) Overview

Berkeley Data Analytics Stack (BDAS) Overview Berkeley Analytics Stack (BDAS) Overview Ion Stoica UC Berkeley UC BERKELEY What is Big used For? Reports, e.g., - Track business processes, transactions Diagnosis, e.g., - Why is user engagement dropping?

More information

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop

More information

WHY THE GEOGRAPHIC APPROACH? Everything occurs somewhere. Geography is common

WHY THE GEOGRAPHIC APPROACH? Everything occurs somewhere. Geography is common WHY THE GEOGRAPHIC APPROACH? Everything occurs somewhere Geography is common Across organisational boundaries, operational levels & environment Location data is found everywhere spreadsheets sensor networks

More information

Active Analytics Overview

Active Analytics Overview Active Analytics Overview The Fourth Industrial Revolution is predicated on data. Success depends on recognizing data as the most valuable corporate asset. From smart cities to autonomous vehicles, logistics

More information

Hortonworks Connected Data Platforms

Hortonworks Connected Data Platforms Hortonworks Connected Data Platforms MASTER THE VALUE OF DATA EVERY BUSINESS IS A DATA BUSINESS EMBRACE AN OPEN APPROACH 2 Hortonworks Inc. 2011 2016. All Rights Reserved Data Drives the Connected Car

More information

Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform

Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform Analytics R&D and Product Management Document Version 1 WP-Urika-GX-Big-Data-Analytics-0217 www.cray.com

More information