Analyzing Big Data Live Yourself


1 Analyzing Big Data Live Yourself
Hands-on workshop on IBM InfoSphere BigInsights
Harald Gröger, Wilfried Hoge, Gerhard Wenzel, IBM
2013 IBM Corporation

2 Agenda
- 15:00-15:10 Introduction to the IBM Big Data platform and BigInsights
- 15:15-15:25 Lab 1: Managing your big data environment
- 15:25-16:05 Lab 2: Analyzing big data with BigSheets
- 16:05-16:10 Demo: BigSheets highlights
- 16:10-16:20 Demo: text analytics highlights

3 What is Big Data?
- Volume (data at scale): terabytes to petabytes of data.
- Variety (data in many forms): structured, unstructured, text, multimedia.
- Velocity (data in motion): analysis of streaming data to enable decisions within fractions of a second.
- Veracity (data uncertainty): managing the reliability and predictability of inherently imprecise data types.

4 The IBM Big Data zone architecture
[Diagram. Labels in reading order: Real-time Analytics; Intelligence Analysis; Data in Motion; Ingestion and Integration; Streams; Integrated Exploration; Decision Management; Data at Rest; ETL, Quality, MDM; Landing, Analytics and Archive; Warehouse / Marts; BI and Predictive Analytics; Data in Many Forms; MapReduce; Navigation and Discovery; Hadoop; Information Governance, Security and Business Continuity]

5 What is Hadoop?
Apache Hadoop is an open source software project that enables the distributed processing of large data sets across clusters of commodity servers.
- MapReduce: the framework that understands and assigns work to the nodes in a cluster.
- HDFS: a file system that spans all the nodes in a Hadoop cluster for data storage. It links together the file systems on many local nodes to make them into one big file system. HDFS assumes nodes will fail, so it achieves reliability by replicating data across multiple nodes.

- Scalable: add nodes without changing data formats, how data is loaded, how jobs are written, or the applications on top.
- Cost effective: massively parallel computing on commodity servers with a sizeable decrease in storage cost, which makes it affordable to model all your data.
- Flexible: schema-less, can absorb any type of data; data from multiple sources can be joined and aggregated in arbitrary ways, enabling deep analyses.
- Fault tolerant: when a node is lost, work is redirected to another location of the data and processing continues.
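The MapReduce idea on this slide can be illustrated with a small, single-process sketch. It does not use Hadoop at all: it only mimics the three logical steps (map, shuffle, reduce) in plain Python for a word count. The function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen for this illustration; on a real cluster the framework would distribute these steps across nodes.

```python
from collections import defaultdict
from typing import Iterable, Iterator


def map_phase(line: str) -> Iterator[tuple[str, int]]:
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[tuple[str, int]]) -> dict[str, list[int]]:
    """Group all emitted values by key (done by the framework on a cluster)."""
    grouped: dict[str, list[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key: str, values: list[int]) -> tuple[str, int]:
    """Collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> dict[str, int]:
    """Run map, shuffle and reduce in sequence over the input lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "big insights"]))
```

Because each map call sees only one line and each reduce call sees only one key, both steps can run in parallel on different nodes, which is what makes the model scale.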

6 Scope of the IBM BigInsights Hadoop distribution
[Chart axes: Enterprise class / Breadth of capabilities]
- Apache Hadoop
- Quick Start Edition: new for V2.1; free; non-production only
- Basic Edition: free download; Jaql; integrated install
- Enterprise Edition: sold by number of terabytes managed
- PureData for Hadoop: appliance simplicity

Enterprise ready:
- Integrated web console
- Administrative tools, security
- RDBMS, warehouse connectivity
- Enterprise integration
- Performance optimization
- Pre-built applications

Analytics included:
- Visualization capabilities
- Spreadsheet-style tool
- Big SQL
- Text analytics
- Eclipse development
- Accelerators

PureData for Hadoop brings BigInsights to the market in an appliance form factor.

7 General information
- Name: VM hostname = bivm
- Login: user = biadmin, password = biadmin

8 Tutorial - Managing your Big Data environment
Duration: approx. 10 minutes.
Start the BigInsights Web Console via the desktop icon, then continue with Chapter 2 / Lesson 1 / Step 3 (page 4).

9 Tutorial - Analyzing Big Data with BigSheets
Duration: approx. 40 minutes.
All prerequisites are already met. The data has been downloaded and imported.
Start in the Files tab of the BigInsights Web Console with Lesson 1 / Step 3 (page 14), (hdfs/biginsights/sheets/watson_data_preloaded).
End after Lesson 6 / Step 3 (page 21).

10 Console Demo

11 BigSheets Demo
From unstructured text to formatted spreadsheets and charts
[Diagram: Blog / News -> Spreadsheet format -> Chart]

12 Text Analytics Demo
From unstructured text documents to text analytics result table
[Diagram, in reading order:]
- Unstructured text -> highlight -> Labels / Examples
- Labels / Examples -> generate -> AQL Regex / Dictionary
- AQL Regex / Dictionary -> create -> AQL Candidates (combination of regex and dictionaries plus distance, case, ...)
- AQL Candidates -> AQL Filter (duplicates, irrelevant candidates, ...) -> Result table
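The pipeline on this slide (dictionary and regex matches, combined into candidates by distance, then filtered) can be sketched in plain Python. This is an analogy only, not AQL and not the BigInsights text analytics API: the dictionary contents, the version regex, the MAX_DISTANCE value and all function names are assumptions made for this illustration.

```python
import re

# Hypothetical dictionary of product names (matched case-insensitively).
PRODUCT_DICTIONARY = {"biginsights", "bigsheets", "hadoop"}
# Hypothetical regex for version strings such as "V2.1".
VERSION_PATTERN = re.compile(r"\bV\d+(?:\.\d+)*\b")
WORD_PATTERN = re.compile(r"[A-Za-z]+")
# Assumed maximum gap (in characters) between a product name and its version.
MAX_DISTANCE = 20


def dictionary_matches(text):
    """Return (end_offset, word) for every dictionary hit in the text."""
    return [
        (m.end(), m.group())
        for m in WORD_PATTERN.finditer(text)
        if m.group().lower() in PRODUCT_DICTIONARY
    ]


def candidates(text):
    """Pair each dictionary hit with a version that follows within MAX_DISTANCE."""
    versions = [(m.start(), m.group()) for m in VERSION_PATTERN.finditer(text)]
    found = []
    for end, word in dictionary_matches(text):
        for v_start, version in versions:
            if 0 <= v_start - end <= MAX_DISTANCE:
                found.append((word, version))
    return found


def filter_candidates(found):
    """Drop case-insensitive duplicates, keeping the first occurrence."""
    seen = set()
    result = []
    for word, version in found:
        key = (word.lower(), version)
        if key not in seen:
            seen.add(key)
            result.append((word, version))
    return result


def extract(text):
    """Run the candidate and filter steps and return the result table rows."""
    return filter_candidates(candidates(text))


if __name__ == "__main__":
    sample = "BigInsights V2.1 is new. We installed biginsights V2.1 on Hadoop."
    print(extract(sample))
```

Keeping candidate generation and filtering as separate steps mirrors the slide: the first step is deliberately generous, and the second removes duplicates and irrelevant hits.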

13 Thank You! 2013 IBM Corporation

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN

From Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN From Information to Insight: The Big Value of Big Data Faire Ann Co Marketing Manager, Information Management Software, ASEAN The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT

More information

IBM Big Data Summit 2012

IBM Big Data Summit 2012 IBM Big Data Summit 2012 12.10.2012 InfoSphere BigInsights Introduction Wilfried Hoge Leading Technical Sales Professional hoge@de.ibm.com twitter.com/wilfriedhoge 12.10.1012 IBM Big Data Strategy: Move

More information

The Intersection of Big Data and DB2

The Intersection of Big Data and DB2 The Intersection of Big Data and DB2 May 20, 2014 Mike McCarthy, IBM Big Data Channels Development mmccart1@us.ibm.com Agenda What is Big Data? Concepts Characteristics What is Hadoop Relational vs Hadoop

More information

5th Annual. Cloudera, Inc. All rights reserved.

5th Annual. Cloudera, Inc. All rights reserved. 5th Annual 1 The Essentials of Apache Hadoop The What, Why and How to Meet Agency Objectives Sarah Sproehnle, Vice President, Customer Success 2 Introduction 3 What is Apache Hadoop? Hadoop is a software

More information

Big Data Platform Overview

Big Data Platform Overview Big Data Platform Overview Alex Hay (athay@us.ibm.com), Big Data CTP Meridee Lowry (meridee@us.ibm.com), Big Data CTP April 30 th, 2014 Big Data is a Concept Big Data 2 IBM Big Data and Analytics Offerings

More information

ActualTests.C Q&A C Foundations of IBM Big Data & Analytics Architecture V1

ActualTests.C Q&A C Foundations of IBM Big Data & Analytics Architecture V1 ActualTests.C2030-136.40Q&A Number: C2030-136 Passing Score: 800 Time Limit: 120 min File Version: 4.8 http://www.gratisexam.com/ C2030-136 Foundations of IBM Big Data & Analytics Architecture V1 Hello,

More information

BIG DATA AND HADOOP DEVELOPER

BIG DATA AND HADOOP DEVELOPER BIG DATA AND HADOOP DEVELOPER Approximate Duration - 60 Hrs Classes + 30 hrs Lab work + 20 hrs Assessment = 110 Hrs + 50 hrs Project Total duration of course = 160 hrs Lesson 00 - Course Introduction 0.1

More information

Nouvelle Génération de l infrastructure Data Warehouse et d Analyses

Nouvelle Génération de l infrastructure Data Warehouse et d Analyses Nouvelle Génération de l infrastructure Data Warehouse et d Analyses November 2011 André Münger andre.muenger@emc.com +41 79 708 85 99 1 Agenda BIG Data Challenges Greenplum Overview Use Cases Summary

More information

BigInsights on Cloud. Mike Nobles Executive, BigInsights Solution Specialist WW Technical Sales, Cloud Data Services

BigInsights on Cloud. Mike Nobles Executive, BigInsights Solution Specialist WW Technical Sales, Cloud Data Services BigInsights on Cloud Mike Nobles Executive, BigInsights Solution Specialist WW Technical Sales, Cloud Data Services For questions about this presentation contact Mike Nobles at mnobles@us.ibm.com 2015

More information

Louis Bodine IBM STG WW BAO Tiger Team Leader

Louis Bodine IBM STG WW BAO Tiger Team Leader Louis Bodine IBM STG WW BAO Tiger Team Leader Presentation Objectives Discuss the value of Business Analytics Discuss BAO Ecosystem Discuss Transformational Solutions http://www.youtube.com/watch?v=eiuick5oqdm

More information

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Rohit Bakhshi, Solution Architect, Hortonworks Jim Walker, Director Product Marketing, Talend Page 1 About Us Rohit Bakhshi Solution

More information

Intro to Big Data and Hadoop

Intro to Big Data and Hadoop Intro to Big and Hadoop Portions copyright 2001 SAS Institute Inc., Cary, NC, USA. All Rights Reserved. Reproduced with permission of SAS Institute Inc., Cary, NC, USA. SAS Institute Inc. makes no warranties

More information

Realising Value from Data

Realising Value from Data Realising Value from Data Togetherwith Open Source Drives Innovation & Adoption in Big Data BCS Open Source SIG London 1 May 2013 Timings 6:00-6:30pm. Register / Refreshments 6:30-8:00pm, Presentation

More information

Angat Pinoy. Angat Negosyo. Angat Pilipinas.

Angat Pinoy. Angat Negosyo. Angat Pilipinas. Angat Pinoy. Angat Negosyo. Angat Pilipinas. Four megatrends will dominate the next decade Mobility Social Cloud Big data 91% of organizations expect to spend on mobile devices in 2012 In 2012, mobile

More information

Spark, Hadoop, and Friends

Spark, Hadoop, and Friends Spark, Hadoop, and Friends (and the Zeppelin Notebook) Douglas Eadline Jan 4, 2017 NJIT Presenter Douglas Eadline deadline@basement-supercomputing.com @thedeadline HPC/Hadoop Consultant/Writer http://www.basement-supercomputing.com

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information

Modernizing Your Data Warehouse with Azure

Modernizing Your Data Warehouse with Azure Modernizing Your Data Warehouse with Azure Big data. Small data. All data. Christian Coté S P O N S O R S The traditional BI Environment The traditional data warehouse data warehousing has reached the

More information

IBM BigInsights - Hadoop jako rozwiązanie korporacyjne. Tomasz Zawadzki Dyrektor Zarządzający Atom-tech

IBM BigInsights - Hadoop jako rozwiązanie korporacyjne. Tomasz Zawadzki Dyrektor Zarządzający Atom-tech IBM BigInsights - Hadoop jako rozwiązanie korporacyjne Tomasz Zawadzki Dyrektor Zarządzający Atom-tech IBM BigInsights - Hadoop jako rozwiązanie korporacyjne Tomasz Zawadzki Dyrektor Zarządzający Atom-tech

More information

IBM PureData System for Analytics Overview

IBM PureData System for Analytics Overview IBM PureData System for Analytics Overview Chris Jackson Technical Sales Specialist chrisjackson@us.ibm.com Traditional Data Warehouses are just too complex They do NOT meet the demands of advanced analytics

More information

Big Data at the Speed of Business IBM Innovations for a new era! Rob Thomas Vice President, Big Data Sales IBM Software Group, Information Management

Big Data at the Speed of Business IBM Innovations for a new era! Rob Thomas Vice President, Big Data Sales IBM Software Group, Information Management Big Data at the Speed of Business IBM Innovations for a new era! Rob Thomas Vice President, Big Data Sales IBM Software Group, Information Management Agenda for today 1 IBM s viewpoint on on big big data

More information

Extend the Value of Your Data Warehouse with Big Data

Extend the Value of Your Data Warehouse with Big Data Rick Clements Director, Marketing, Big Data, IBM Software Group, Information Management Extend the Value of Your Data Warehouse with Big Data You are likely familiar with the traditional warehouse infrastructure

More information

Microsoft Big Data. Solution Brief

Microsoft Big Data. Solution Brief Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,

More information

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Roger Ding Cloudera February 3rd, 2018 1 Agenda Hadoop History Introduction to Apache Hadoop

More information

DataAdapt Active Insight

DataAdapt Active Insight Solution Highlights Accelerated time to value Enterprise-ready Apache Hadoop based platform for data processing, warehousing and analytics Advanced analytics for structured, semistructured and unstructured

More information

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

Big Data Introduction

Big Data Introduction Big Data Introduction Who we are Experts At Your Service Over 50 specialists in IT infrastructure Certified, experienced, passionate Based In Switzerland 100% self-financed Swiss company Over CHF8 mio.

More information

Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM

Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM May, 2012 Harnessing the Power of Big Data to Transform Your Business Anjul Bhambhri VP, Big Data, Information Management, IBM 12+ TBs of tweet data every day 30 billion RFID tags today (1.3B in 2005)

More information

Datametica. The Modern Data Platform Enterprise Data Hub Implementations. Why is workload moving to Cloud

Datametica. The Modern Data Platform Enterprise Data Hub Implementations. Why is workload moving to Cloud Datametica The Modern Data Platform Enterprise Data Hub Implementations Why is workload moving to Cloud 1 What we used do Enterprise Data Hub & Analytics What is Changing Why it is Changing Enterprise

More information

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect 2005 Concert de Coldplay 2014 Concert de Coldplay 90% of the world s data has been created over the last two years alone 1 1. Source

More information

Optimizing Outcomes in a Connected World: Turning information into insights

Optimizing Outcomes in a Connected World: Turning information into insights Optimizing Outcomes in a Connected World: Turning information into insights Michael Eden Management Brand Executive Central & Eastern Europe Vilnius 18 October 2011 2011 IBM Corporation IBM celebrates

More information

IBM Software IBM InfoSphere BigInsights

IBM Software IBM InfoSphere BigInsights IBM Software IBM InfoSphere BigInsights Enabling new, cost-effective solutions to turn complex information into business insight 2 IBM InfoSphere BigInsights Executive summary Companies are hyper-connected

More information

Designing Business Intelligence Solutions with Microsoft SQL Server 2014

Designing Business Intelligence Solutions with Microsoft SQL Server 2014 Designing Business Intelligence Solutions with Microsoft SQL Server 2014 20467D; 5 Days, Instructor-led Course Description This five-day instructor-led course teaches students how to implement self-service

More information

MapR Pentaho Business Solutions

MapR Pentaho Business Solutions MapR Pentaho Business Solutions The Benefits of a Converged Platform to Big Data Integration Tom Scurlock Director, WW Alliances and Partners, MapR Key Takeaways 1. We focus on business values and business

More information

IBM InfoSphere BigInsights V2.0 delivering enterprise Hadoop capabilities with easy-to-use analytic tools and visualization

IBM InfoSphere BigInsights V2.0 delivering enterprise Hadoop capabilities with easy-to-use analytic tools and visualization IBM United States Software Announcement 212-442, dated November 13, 2012 IBM InfoSphere BigInsights V2.0 delivering enterprise Hadoop capabilities with easy-to-use analytic tools and visualization Table

More information

Cognitive Data Warehouse and Analytics

Cognitive Data Warehouse and Analytics Cognitive Data Warehouse and Analytics Hemant R. Suri, Sr. Offering Manager, Hybrid Data Warehouses, IBM (twitter @hemantrsuri or feel free to reach out to me via LinkedIN!) Over 90% of the world s data

More information

Designing Business Intelligence Solutions with Microsoft SQL Server 2014 Course Code: 20467D

Designing Business Intelligence Solutions with Microsoft SQL Server 2014 Course Code: 20467D Designing Business Intelligence Solutions with Microsoft SQL Server 2014 Course Code: 20467D Duration: 5 Days Overview About this course This five-day instructor-led course teaches students how to implement

More information

Welcome to this special series of Rational. Talks to You podcasts focusing on Innovate 2013, the IBM

Welcome to this special series of Rational. Talks to You podcasts focusing on Innovate 2013, the IBM IBM Podcast [ MUSIC ] Welcome to this special series of Rational Talks to You podcasts focusing on Innovate 2013, the IBM Technical Summit. I'm Kimberly Gist with IBM. Innovate 2013 is the premier conference

More information

Mobile Application Developer

Mobile Application Developer Mobile Application Developer The Mobile Application Developer career path prepares students to develop, test, debug and deploy hybrid mobile applications. This will require skills in application development

More information

Why Big Data Matters? Speaker: Paras Doshi

Why Big Data Matters? Speaker: Paras Doshi Why Big Data Matters? Speaker: Paras Doshi If you re wondering about what is Big Data and why does it matter to you and your organization, then come to this talk and get introduced to Big Data and learn

More information

SAS and Hadoop Technology: Overview

SAS and Hadoop Technology: Overview SAS and Hadoop Technology: Overview SAS Documentation September 19, 2017 The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2015. SAS and Hadoop Technology: Overview.

More information

Investor Presentation. Fourth Quarter 2015

Investor Presentation. Fourth Quarter 2015 Investor Presentation Fourth Quarter 2015 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences

More information

GPU ACCELERATED BIG DATA ARCHITECTURE

GPU ACCELERATED BIG DATA ARCHITECTURE INNOVATION PLATFORM WHITE PAPER 1 Today s enterprise is producing and consuming more data than ever before. Enterprise data storage and processing architectures have struggled to keep up with this exponentially

More information

Investor Presentation. Second Quarter 2016

Investor Presentation. Second Quarter 2016 Investor Presentation Second Quarter 2016 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences

More information

Augmented Real-time Clinical DataMart. Phani S Srinivasan Ponnapalli, Syneos Health Subrahmanyam Rayaprolu, Syneos Health

Augmented Real-time Clinical DataMart. Phani S Srinivasan Ponnapalli, Syneos Health Subrahmanyam Rayaprolu, Syneos Health Augmented Real-time Clinical DataMart Phani S Srinivasan Ponnapalli, Syneos Health Subrahmanyam Rayaprolu, Syneos Health Agenda Introduction Traditional Clinical Data warehouse vs Digital Data Modern Data

More information

InfoSphere Warehouse. Flexible. Reliable. Simple. IBM Software Group

InfoSphere Warehouse. Flexible. Reliable. Simple. IBM Software Group IBM Software Group Flexible Reliable InfoSphere Warehouse Simple Ser Yean Tan Regional Technical Sales Manager Information Management Software IBM Software Group ASEAN 2007 IBM Corporation Business Intelligence

More information

IBM s InfoSphere BigInsights: Smart Analytics for Big Data

IBM s InfoSphere BigInsights: Smart Analytics for Big Data An IBM Proof of Technology IBM s InfoSphere BigInsights: Smart Analytics for Big Data Meridee Lowry < BigInsights & Streams Technical Specialist meridee@us.ibm.com 2013 IBM Corporation IBM Disclaimer Information

More information

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake White Paper Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake Motivation for Modernization It is now a well-documented realization among Fortune 500 companies

More information

Smarter Analytics for Big Data

Smarter Analytics for Big Data Smarter Analytics for Big Data Anjul Bhambhri IBM Vice President, Big Data February 27, 2011 The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT The resulting explosion of information

More information

Analyzing Data with Power BI

Analyzing Data with Power BI Analyzing Data with Power BI Course 20778B 3 Days Instructor-led, Hands-on Course Description The main purpose of this three-day instructor-led course is to give students a good understanding of data analysis

More information

WELCOME TO. Cloud Data Services: The Art of the Possible

WELCOME TO. Cloud Data Services: The Art of the Possible WELCOME TO Cloud Data Services: The Art of the Possible Goals for Today Share the cloud-based data management and analytics technologies that are enabling rapid development of new mobile applications Discuss

More information

Mid Atlantic Virtual Users Group May 9, 2013 IBM Big Data Announcements What You Should Know

Mid Atlantic Virtual Users Group May 9, 2013 IBM Big Data Announcements What You Should Know Mid Atlantic Virtual Users Group May 9, 2013 IBM Big Data Announcements What You Should Know Agenda: Introduction to the Mid-Atlantic Virtual Users Group Warren Heising Introduction The Big Picture Big

More information

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop

More information

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?

More information

BIG DATA TRANSFORMS BUSINESS. Copyright 2013 EMC Corporation. All rights reserved.

BIG DATA TRANSFORMS BUSINESS. Copyright 2013 EMC Corporation. All rights reserved. BIG DATA TRANSFORMS BUSINESS 1 Big Data = Structured+Unstructured Data Internet Of Things Non-Enterprise Information Structured Information In Relational Databases Managed & Unmanaged Unstructured Information

More information

Big Data und Hadoop. BI/DW Modernisierungs-Szenarien auf System z

Big Data und Hadoop. BI/DW Modernisierungs-Szenarien auf System z Big Data und Hadoop BI/DW Modernisierungs-Szenarien auf System z Eberhard Hechler Executive Architect, Member IBM Academy of Technology IBM Germany R&D Lab Trademarks The following are trademarks of the

More information

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration KnowledgeSTUDIO Advanced Modeling for Better Decisions Companies that compete with analytics are looking for advanced analytical technologies that accelerate decision making and identify opportunities

More information

Analyzing Data with Power BI

Analyzing Data with Power BI Course 20778A: Analyzing Data with Power BI Course Outline Module 1: Introduction to Self-Service BI Solutions Introduces business intelligence (BI) and how to self-serve with BI. Introduction to business

More information

GET MORE VALUE OUT OF BIG DATA

GET MORE VALUE OUT OF BIG DATA GET MORE VALUE OUT OF BIG DATA Enterprise data is increasing at an alarming rate. An International Data Corporation (IDC) study estimates that data is growing at 50 percent a year and will grow by 50 times

More information

Contents at a Glance COPYRIGHTED MATERIAL. Introduction... 1 Part I: Getting Started with Big Data... 7

Contents at a Glance COPYRIGHTED MATERIAL. Introduction... 1 Part I: Getting Started with Big Data... 7 Contents at a Glance Introduction... 1 Part I: Getting Started with Big Data... 7 Chapter 1: Grasping the Fundamentals of Big Data...9 Chapter 2: Examining Big Data Types...25 Chapter 3: Old Meets New:

More information

Analytics in Action transforming the way we use and consume information

Analytics in Action transforming the way we use and consume information Analytics in Action transforming the way we use and consume information Big Data Ecosystem The Data Traditional Data BIG DATA Repositories MPP Appliances Internet Hadoop Data Streaming Big Data Ecosystem

More information

Big Data and Storage Technologies Futures and Trends

Big Data and Storage Technologies Futures and Trends Big Data and Storage Technologies Futures and Trends Christian Bandulet, Oracle EMEA Master Principal Sales Consultant Elite Engineering Exchange 13.06.2014 1 Do you remember... vt100 Internet Copyright

More information

HRS IIOT, Advanced Analytics, & Big Data Forum

HRS IIOT, Advanced Analytics, & Big Data Forum HRS 2016 - IIOT, Advanced Analytics, & Big Data Forum 1976 1979: concept of Teradata grows from research at California Institute of Technology (Caltech) and from the discussions of Citibank's advanced

More information

Advancing Information Management and Analysis with Entity Resolution. Whitepaper ADVANCING INFORMATION MANAGEMENT AND ANALYSIS WITH ENTITY RESOLUTION

Advancing Information Management and Analysis with Entity Resolution. Whitepaper ADVANCING INFORMATION MANAGEMENT AND ANALYSIS WITH ENTITY RESOLUTION Advancing Information Management and Analysis with Entity Resolution Whitepaper February 2016 novetta.com 2016, Novetta ADVANCING INFORMATION MANAGEMENT AND ANALYSIS WITH ENTITY RESOLUTION Advancing Information

More information

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud

More information

Operational Hadoop and the Lambda Architecture for Streaming Data

Operational Hadoop and the Lambda Architecture for Streaming Data Operational Hadoop and the Lambda Architecture for Streaming Data 2015 MapR Technologies 2015 MapR Technologies 1 Topics From Batch to Operational Workloads on Hadoop Streaming Data Environments The Lambda

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 3

E-guide Hadoop Big Data Platforms Buyer s Guide part 3 Big Data Platforms Buyer s Guide part 3 Your expert guide to big platforms enterprise MapReduce cloud-based Abie Reifer, DecisionWorx The Amazon Elastic MapReduce Web service offers a managed framework

More information

Research of the Social Media Data Analyzing Platform Based on Cloud Mining Yi-Tang ZENG, Yu-Feng ZHANG, Sheng CAO, Li LI, Cheng-Wei ZHANG *

Research of the Social Media Data Analyzing Platform Based on Cloud Mining Yi-Tang ZENG, Yu-Feng ZHANG, Sheng CAO, Li LI, Cheng-Wei ZHANG * 2016 3 rd International Conference on Social Science (ICSS 2016) ISBN: 978-1-60595-410-3 Research of the Social Media Data Analyzing Platform Based on Cloud Mining Yi-Tang ZENG, Yu-Feng ZHANG, Sheng CAO,

More information

Six Critical Capabilities for a Big Data Analytics Platform

Six Critical Capabilities for a Big Data Analytics Platform White Paper Analytics & Big Data Six Critical Capabilities for a Big Data Analytics Platform Table of Contents page Executive Summary...1 Key Requirements for a Big Data Analytics Platform...1 Vertica:

More information

Report Studio Fundamentals for eschoolplus Custom Training Guide

Report Studio Fundamentals for eschoolplus Custom Training Guide Report Studio Fundamentals for eschoolplus Custom Training Guide Capitalize Analytics 320 Decker Drive, Suite 100 Irving, TX 75062 214.531.3904 info@capitalizeconsulting.com 1 Table of Contents Course

More information

Ein Rennen der anderen Art: Big-Data Plattformen im Automobilbau

Ein Rennen der anderen Art: Big-Data Plattformen im Automobilbau Ein Rennen der anderen Art: Big-Data Plattformen im Automobilbau Thomas Pagel, Principal Technologist, Data & Analytics Franziska Weng, Consultant, Data & Analytics 2017 Avanade Inc. All Rights Reserved.

More information

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA.

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA. ABOUT THIS TRAINING: The world of Hadoop and Big Data" can be intimidating - hundreds of different technologies with cryptic names form the Hadoop ecosystem. This comprehensive training has been designed

More information

IBM Cognos 10.2 BI Demo

IBM Cognos 10.2 BI Demo IBM Cognos 10.2 BI Demo IBM Cognos Course Overview: In this training, participants acquire skills needed to develop activity, modeling and some admin works. Each and every concept is supported with documents

More information

Oracle BI Applications 7.9: Develop a Data Warehouse

Oracle BI Applications 7.9: Develop a Data Warehouse Oracle University Kontakt: +43 (0)1 33 777 401 Oracle BI Applications 7.9: Develop a Data Warehouse Dauer: 5 Tage Lerninhalte Zielgruppe dieser Schulung sind Personen, die für den ETL-Vorgang (Extract,

More information

Datawarehousing and Analytics Introduction to Assignments

Datawarehousing and Analytics Introduction to Assignments Anwendersoftware a Datawarehousing and Analytics Introduction to Assignments Holger Schwarz Universität Stuttgart Winter Term 2014/2015 Anwendersoftware a Assignment 1 Data Mining Discuss option to apply

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

E-guide Hadoop Big Data Platforms Buyer s Guide part 1 Hadoop Big Data Platforms Buyer s Guide part 1 Your expert guide to Hadoop big data platforms for managing big data David Loshin, Knowledge Integrity Inc. Companies of all sizes can use Hadoop, as vendors

More information

SQL Server Course Analyzing Data with Power BI Length. Audience. What You'll Learn. Course Outline. 3 days

SQL Server Course Analyzing Data with Power BI Length. Audience. What You'll Learn. Course Outline. 3 days SQL Server Course - 20778 Analyzing Data with Power BI 2017 Length 3 days Audience The course will likely be attended by SQL Server report creators who are interested in alternative methods of presenting

More information

EXECUTIVE BRIEF. Successful Data Warehouse Approaches to Meet Today s Analytics Demands. In this Paper

EXECUTIVE BRIEF. Successful Data Warehouse Approaches to Meet Today s Analytics Demands. In this Paper Sponsored by Successful Data Warehouse Approaches to Meet Today s Analytics Demands EXECUTIVE BRIEF In this Paper Organizations are adopting increasingly sophisticated analytics methods Analytics usage

More information

InfoSphere Warehousing 9.5

IBM Software Group. InfoSphere Warehousing 9.5: Optimised, Flexible, Simple. Phil Downey, InfoSphere Warehouse Technical Marketing. 2007 IBM Corporation. Information On Demand End-to-End Capabilities Optimization

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

Apache Spark Drives Business Innovation: Apache Spark is driving new business value that is being harnessed by technology-forward organizations.

Rhonda Stonaker Infosemantics, Inc.

Professional Background: OBIEE Architect at Infosemantics, Inc. Experience with BI solutions for Oracle EBS including R12 since 2002. Experience with Packaged Solutions

CYS14011 - Rithu P Ravi, CYS14012 - Saumya K

Why and What HADOOP?... Apache Hadoop is an open-source software framework A

Table of Contents. Are You Ready for Digital Transformation? Take Advantage of This Big Data Opportunity with Cisco and Hortonworks

Table of Contents: 01 Are You Ready for Digital Transformation? 02 Take Advantage of This Big Data Opportunity with Cisco and Hortonworks 03 Get Open Access to Your Data and Help Ensure

Big Data The Big Story

Jean-Pierre Dijcks, Big Data Product Management. Agenda: What is Big Data? Architecting Big Data. Building Big Data Solutions. Oracle Big Data Appliance and Big Data Connectors. Customer

Simplifying Hadoop. Computing View Point, July 2013

The gap between the potential power of Hadoop and the technical difficulties in its implementation is narrowing, and about time too. Contents

Modernizing Data Integration

To Accommodate New Big Data and New Business Requirements. Philip Russom, Research Director for Data Management, TDWI. December 16, 2015. Sponsor. Speakers: Philip Russom, TDWI Research

POWER NEW POSSIBILITIES

Solutions for your data analytics journey. About this brochure: This brochure explains the capabilities and benefits of the Dell EMC options for starting on and maturing in your data

Datametica DAMA. The Modern Data Platform Enterprise Data Hub Implementations. What is happening with Hadoop Why is workload moving to Cloud

The Modern Data Platform. The Enterprise Data Hub. What do we

Audience Profile The course will likely be attended by SQL Server report creators who are interested in alternative methods of presenting data.

[MS20778]: Analyzing Data with Power BI. Length: 3 Days. Audience(s): Information Workers. Level: 300. Technology: Power BI. Delivery Method: Instructor-led (Classroom). Course Overview: The main purpose

Insights to HDInsight

Why Hadoop in the Cloud? No hardware costs. Unlimited Scale. Pay for What You Need. Deployed in minutes. Azure HDInsight: Big Data made easy. Enterprise Ready. Easier and more productive

IBM Analytics Unleash the power of data with Apache Spark

Agility, speed and simplicity define the analytics operating system of the future. Use Spark to create value from data-driven insights. Lower

New Approach for scheduling tasks and/or jobs in Big Data Cluster

IT College, Chairperson of MS Dept. Agenda: Introduction. What is Big Data? The 4 characteristics of Big Data (V4s). Different Categories of

A Deep Dive Into Self-Service Data Discovery In MicroStrategy. Vijay Anand, Gianthomas Tewksbury Volpe. #mstrworld

Introducing MicroStrategy Analytics. Agenda: Introduction to MicroStrategy Analytics Platform. Product

zdata Solutions BI / Advanced Analytic Platform and Pilot Programs

BI & Analytics Platform. Store: Gather, integrate, load and manage your data in the cloud or on premise. Collaborate: Validate and dimensionalize

Big data for the intelligence community

Contents: Summary. The big data challenge. Big data perspective. Differing big data solutions. Acquire. Apply. Store. IBM Research drives the future of

In search of the Holy Grail?

Our Clients Journey to the Data Lake. André De Locht, Sr Business Consultant Data Lake, Information Integration and Governance. andre.de.locht@be.ibm.com, +32 476 870 354. Data

Next Generation OSS as a key enabler for Modern Telecoms and Enterprise Business Transformation

Strategic Partnership. IBM: ~430.000 Employees, 12 R&D labs globally, ~100 Billion $ Revenue, 5 Nobel prizes

ETL on Hadoop What is Required

Keith Kohl, Director, Product Management. October 2012. Syncsort. Copyright 2012, Syncsort Incorporated. Agenda: Who is Syncsort. Extract, Transform, Load (ETL) Overview and conventional

Got Hadoop? Whitepaper: Hadoop and EXASOL - a perfect combination for processing, storing and analyzing big data volumes

Contents: Introduction. Hadoop's humble beginnings. The benefits of Hadoop

BIG DATA & ADVANCED ANALYTICS ROADSHOW

Copyright 2014, Neudesic. All rights reserved. CO-SPONSORS. UPCOMING ROADSHOW STOPS: Los Angeles: Wednesday, February 10th. Orange County: Thursday, February 11th