20775: Performing Data Engineering on Microsoft HD Insight

Size: px
Start display at page:

Download "20775: Performing Data Engineering on Microsoft HD Insight"

Transcription

1 Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: ; traincert@tdt-tanduc.com Website: : Performing Data Engineering on Microsoft HD Insight Duration: 05 days Level: 300 The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight Authorized Training Silver Learning Authorized Training Authorized Training

2 AUDIENCE The primary audience for this course is data engineers, data architects, data scientists, and data developers who plan to implement big data engineering workflows on HDInsight. AT COURSE COMPLETION After completing this course, students will be able to: Deploy HDInsight Clusters. Authorizing Users to Access Resources. Loading Data into HDInsight. Troubleshooting HDInsight. Implement Batch Solutions. Design Batch ETL Solutions for Big Data with Spark Analyze Data with Spark SQL. Analyze Data with Hive and Phoenix. Describe Stream Analytics. Implement Spark Streaming Using the DStream API. Develop Big Data Real-Time Processing Solutions with Apache Storm. Build Solutions that use Kafka and HBase. COURSE OUTLINE Module 1: Getting Started with HDInsight This module introduces Hadoop, the MapReduce paradigm, and HDInsight. What is Big Data? Introduction to Hadoop Working with MapReduce Function Introducing HDInsight Lab: Working with HDInsight Provision an HDInsight cluster and run MapReduce jobs Describe Hadoop, MapReduce and HDInsight. Use scripts to provision an HDInsight Cluster. Run a word-counting MapReduce program using PowerShell. Module 2: Deploying HDInsight Clusters This module provides an overview of the Microsoft Azure HDInsight cluster types, in addition to the creation and maintenance of the HDInsight clusters. The module also demonstrates how to customize clusters by using script actions through the Azure Portal, Azure PowerShell, and the Azure command-line interface (CLI). This module includes labs that provide the steps to deploy and manage the clusters. [1]

3 Identifying HDInsight cluster types Managing HDInsight clusters by using the Azure portal Managing HDInsight Clusters by using Azure PowerShell Lab: Managing HDInsight clusters with the Azure Portal Create an HDInsight cluster that uses Data Lake Store storage Customize HDInsight by using script actions Delete an HDInsight cluster Identify HDInsight cluster types Manage HDInsight clusters by using the Azure Portal. Manage HDInsight clusters by using Azure PowerShell. Module 3: Authorizing Users to Access Resources This module provides an overview of non-domain and domain-joined Microsoft HDInsight clusters, in addition to the creation and configuration of domain-joined HDInsight clusters. The module also demonstrates how to manage domain-joined clusters using the Ambari management UI and the Ranger Admin UI. This module includes the labs that will provide the steps to create and manage domain-joined clusters. Non-domain Joined clusters Configuring domain-joined HDInsight clusters Manage domain-joined HDInsight clusters Lab: Authorizing Users to Access Resources Prepare the Lab Environment Manage a non-domain joined cluster Identify the characteristics of non-domain and domain-joined HDInsight clusters. Create and configure domain-joined HDInsight clusters through the Azure PowerShell. Manage the domain-joined cluster using the Ambari management UI and the Ranger Admin UI. Create Hive policies and manage user permissions. Module 4: Loading data into HDInsight This module provides an introduction to loading data into Microsoft Azure Blob storage and Microsoft Azure Data Lake storage. At the end of this lesson, you will know how to use multiple tools to transfer data to an HDInsight cluster. You will also learn how to load and transform data to decrease your query run time. Storing data for HDInsight processing Using data loading tools Maximising value from stored data Lab: Loading Data into your Azure account Load data for use with HDInsight [2]

4 Discuss the architecture of key HDInsight storage solutions. Use tools to upload data to HDInsight clusters. Compress and serialize uploaded data for decreased processing time. Module 5: Troubleshooting HDInsight In this module, you will learn how to interpret logs associated with the various services of Microsoft Azure HDInsight cluster to troubleshoot any issues you might have with these services. You will also learn about Operations Management Suite (OMS) and its capabilities. Analyze HDInsight logs YARN logs Heap dumps Operations management suite Lab: Troubleshooting HDInsight Analyze HDInsight logs Analyze YARN logs Monitor resources with Operations Management Suite Locate and analyse HDInsight logs. Use YARN logs for application troubleshooting. Understand and enable heap dumps. Describe how the OMS can be used with Azure resources. Module 6: Implementing Batch Solutions In this module, you will look at implementing batch solutions in Microsoft Azure HDInsight by using Hive and Pig. You will also discuss the approaches for data pipeline operationalization that are available for big data workloads on an HDInsight stack. Apache Hive storage HDInsight data queries using Hive and Pig Operationalize HDInsight Lab: Implement Batch Solutions Deploy HDInsight cluster and data storage Use data transfers with HDInsight clusters Query HDInsight cluster data Understand Apache Hive and the scenarios where it can be used. Run batch jobs using Apache Hive and Apache Pig. Explain the capabilities of the Microsoft Azure Data Factory and Apache Oozie and how they can orchestrate and automate big data workflows. [3]

5 Module 7: Design Batch ETL solutions for big data with Spark This module provides an overview of Apache Spark, describing its main characteristics and key features. Before you start, it s helpful to understand the basic architecture of Apache Spark and the different components that are available. The module also explains how to design batch Extract, Transform, Load (ETL) solutions for big data with Spark on HDInsight. The final lesson includes some guidelines to improve Spark performance. What is Spark? ETL with Spark Spark performance Lab: Design Batch ETL solutions for big data with Spark. Create a HDInsight Cluster with access to Data Lake Store Use HDInsight Spark cluster to analyze data in Data Lake Store Analyzing website logs using a custom library with Apache Spark cluster on HDInsight Managing resources for Apache Spark cluster on Azure HDInsight Describe the architecture of Spark on HDInsight. Describe the different components required for a Spark application on HDInsight. Identify the benefits of using Spark for ETL processes. Create Python and Scala code in a Spark program to ingest or process data. Identify cluster settings for optimal performance. Track and debug jobs running on an Apache Spark cluster in HDInsight. Module 8: Analyze Data with Spark SQL This module describes how to analyze data by using Spark SQL. In it, you will be able to explain the differences between RDD, Datasets and Dataframes, identify the uses cases between Iterative and Interactive queries, and describe best practices for Caching, Partitioning and Persistence. You will also look at how to use Apache Zeppelin and Jupyter notebooks, carry out exploratory data analysis, then submit Spark jobs remotely to a Spark cluster. Implementing iterative and interactive queries Perform exploratory data analysis Lab: Performing exploratory data analysis by using iterative and interactive queries Build a machine learning application Use zeppelin for interactive data analysis View and manage Spark sessions by using Livy Implement interactive queries. Perform exploratory data analysis. [4]

6 Module 9: Analyze Data with Hive and Phoenix In this module, you will learn about running interactive queries using Interactive Hive (also known as Hive LLAP or Live Long and Process) and Apache Phoenix. You will also learn about the various aspects of running interactive queries using Apache Phoenix with HBase as the underlying query engine. Implement interactive queries for big data with interactive hive. Perform exploratory data analysis by using Hive Perform interactive processing by using Apache Phoenix Lab: Analyze data with Hive and Phoenix Implement interactive queries for big data with interactive Hive Perform exploratory data analysis by using Hive Perform interactive processing by using Apache Phoenix Implement interactive queries with interactive Hive. Perform exploratory data analysis using Hive. Perform interactive processing by using Apache Phoenix. Module 10: Stream Analytics The Microsoft Azure Stream Analytics service has some built-in features and capabilities that make it as easy to use as a flexible stream processing service in the cloud. You will see that there are a number of advantages to using Stream Analytics for your streaming solutions, which you will discuss in more detail. You will also compare features of Stream Analytics to other services available within the Microsoft Azure HDInsight stack, such as Apache Storm. You will learn how to deploy a Stream Analytics job, connect it to the Microsoft Azure Event Hub to ingest real-time data, and execute a Stream Analytics query to gain low-latency insights. After that, you will learn how Stream Analytics jobs can be monitored when deployed and used in production settings. Stream analytics Process streaming data from stream analytics Managing stream analytics jobs Lab: Implement Stream Analytics Process streaming data with stream analytics Managing stream analytics jobs Describe stream analytics and its capabilities. Process streaming data with stream analytics. Manage stream analytics jobs. Module 11: Implementing Streaming Solutions with Kafka and HBase In this module, you will learn how to use Kafka to build streaming solutions. You will also see how to use Kafka to persist data to HDFS by using Apache HBase, and then query this data. [5]

7 Building and Deploying a Kafka Cluster Publishing, Consuming, and Processing data using the Kafka Cluster Using HBase to store and Query Data Lab: Implementing Streaming Solutions with Kafka and HBase Create a virtual network and gateway Create a storm cluster for Kafka Create a Kafka producer Create a streaming processor client topology Create a Power BI dashboard and streaming dataset Create an HBase cluster Create a streaming processor to write to HBase Build and deploy a Kafka Cluster. Publish data to a Kafka Cluster, consume data from a Kafka Cluster, and perform stream processing using the Kafka Cluster. Save streamed data to HBase, and perform queries using the HBase API. Module 12: Develop big data real-time processing solutions with Apache Storm This module explains how to develop big data real-time processing solutions with Apache Storm. Persist long term data Stream data with Storm Create Storm topologies Configure Apache Storm Lab: Developing big data real-time processing solutions with Apache Storm Stream data with Storm Create Storm Topologies Persist long term data. Stream data with Storm. Create Storm topologies. Configure Apache Storm. Module 13: Create Spark Streaming Applications This module describes Spark Streaming; explains how to use discretized streams (DStreams); and explains how to apply the concepts to develop Spark Streaming applications. Working with Spark Streaming Creating Spark Structured Streaming Applications Persistence and Visualization Lab: Building a Spark Streaming Application Installing Required Software [6]

8 Building the Azure Infrastructure Building a Spark Streaming Pipeline Describe Spark Streaming and how it works. Use discretized streams (DStreams). Work with sliding window operations. Apply the concepts to develop Spark Streaming applications. Describe Structured Streaming. [7]

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

80318: Reporting in Microsoft Dynamics AX 2012

80318: Reporting in Microsoft Dynamics AX 2012 Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

: 20776A: Performing Big Data Engineering on Microsoft Cloud Services

: 20776A: Performing Big Data Engineering on Microsoft Cloud Services Module Title Duration : 20776A: Performing Big Data Engineering on Microsoft Cloud Services : 5 days About this course This five-day instructor-led course describes how to process Big Data using Azure

More information

Cask Data Application Platform (CDAP)

Cask Data Application Platform (CDAP) Cask Data Application Platform (CDAP) CDAP is an open source, Apache 2.0 licensed, distributed, application framework for delivering Hadoop solutions. It integrates and abstracts the underlying Hadoop

More information

Big Data & Hadoop Advance

Big Data & Hadoop Advance Course Durations: 30 Hours About Company: Course Mode: Online/Offline EduNextgen extended arm of Product Innovation Academy is a growing entity in education and career transformation, specializing in today

More information

Designing Business Intelligence Solutions with Microsoft SQL Server 2014

Designing Business Intelligence Solutions with Microsoft SQL Server 2014 Designing Business Intelligence Solutions with Microsoft SQL Server 2014 20467D; 5 Days, Instructor-led Course Description This five-day instructor-led course teaches students how to implement self-service

More information

Jason Virtue Business Intelligence Technical Professional

Jason Virtue Business Intelligence Technical Professional Jason Virtue Business Intelligence Technical Professional jvirtue@microsoft.com Agenda Microsoft Azure Data Services Azure Cloud Services Azure Machine Learning Azure Service Bus Azure Stream Analytics

More information

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics ADVANCED ANALYTICS & IOT ARCHITECTURES Presented by: Orion Gebremedhin Director of Technology, Data & Analytics Marc Lobree National Architect, Advanced Analytics EDW THE RIGHT TOOL FOR THE RIGHT WORKLOAD

More information

Hadoop and Analytics at CERN IT CERN IT-DB

Hadoop and Analytics at CERN IT CERN IT-DB Hadoop and Analytics at CERN IT CERN IT-DB 1 Hadoop Use cases Parallel processing of large amounts of data Perform analytics on a large scale Dealing with complex data: structured, semi-structured, unstructured

More information

Microsoft SharePoint WorkFlow

Microsoft SharePoint WorkFlow Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved. Apache Spark 2.0 GA The General Engine for Modern Analytic Use Cases 1 Apache Spark Drives Business Innovation Apache Spark is driving new business value that is being harnessed by technology forward organizations.

More information

AVANTUS TRAINING PTE LTD

AVANTUS TRAINING PTE LTD [MS10979]: Microsoft Azure Fundamentals Length : 2 Days Audience(s) : IT Professionals Level : 100 Technology : Azure Delivery Method : Instructor-led (Classroom) Course Overview This course provides the

More information

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB Data Analytics and CERN IT Hadoop Service CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB 1 Data Analytics at Scale The Challenge When you cannot fit your workload in a desktop Data

More information

Operational Hadoop and the Lambda Architecture for Streaming Data

Operational Hadoop and the Lambda Architecture for Streaming Data Operational Hadoop and the Lambda Architecture for Streaming Data 2015 MapR Technologies 2015 MapR Technologies 1 Topics From Batch to Operational Workloads on Hadoop Streaming Data Environments The Lambda

More information

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden

Leveraging Oracle Big Data Discovery to Master CERN s Data. Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden Leveraging Oracle Big Data Discovery to Master CERN s Data Manuel Martín Márquez Oracle Business Analytics Innovation 12 October- Stockholm, Sweden Manuel Martin Marquez Intel IoT Ignition Lab Cloud and

More information

Azure Offerings for Big data. In Kee Paek Cloud Data Solution Architect Microsoft Korea October. 2016

Azure Offerings for Big data. In Kee Paek Cloud Data Solution Architect Microsoft Korea October. 2016 Azure Offerings for Big data In Kee Paek Cloud Data Solution Architect Microsoft Korea October. 2016 Agenda 1. Integrated Big data Platform - Cortana Intelligent Suite 2. Scalable Machine Learning - R

More information

Master Planning in Microsoft Dynamics AX 2012

Master Planning in Microsoft Dynamics AX 2012 CÔNG TY CỔ PHẦN TRƯỜNG CNTT TÂN ĐỨC TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC LEARN MORE WITH LESS! Course 80423: Master Planning in Microsoft Dynamics AX 2012 Length: Audience: 2 Days Level: 300 Information

More information

BIG DATA and DATA SCIENCE

BIG DATA and DATA SCIENCE Integrated Program In BIG DATA and DATA SCIENCE CONTINUING STUDIES Table of Contents About the Course...03 Key Features of Integrated Program in Big Data and Data Science...04 Learning Path...05 Key Learning

More information

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Rohit Bakhshi, Solution Architect, Hortonworks Jim Walker, Director Product Marketing, Talend Page 1 About Us Rohit Bakhshi Solution

More information

Building a Data Lake with Spark and Cassandra Brendon Smith & Mayur Ladwa

Building a Data Lake with Spark and Cassandra Brendon Smith & Mayur Ladwa Building a Data Lake with Spark and Cassandra Brendon Smith & Mayur Ladwa July 2015 BlackRock: Who We Are BLK data as of 31 st March 2015 is the world s largest investment manager Manages over $4.7 trillion

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

E-guide Hadoop Big Data Platforms Buyer s Guide part 1 Hadoop Big Data Platforms Buyer s Guide part 1 Your expert guide to Hadoop big data platforms for managing big data David Loshin, Knowledge Integrity Inc. Companies of all sizes can use Hadoop, as vendors

More information

"Charting the Course... MOC A: Architecting Microsoft Azure Solutions. Course Summary

Charting the Course... MOC A: Architecting Microsoft Azure Solutions. Course Summary MOC 20535 A: Architecting Microsoft Course Summary Description This course is intended for architects who have experience building infrastructure and applications on the Microsoft platform. Students should

More information

MapR: Solution for Customer Production Success

MapR: Solution for Customer Production Success 2015 MapR Technologies 2015 MapR Technologies 1 MapR: Solution for Customer Production Success Big Data High Growth 700+ Customers Cloud Leaders Riding the Wave with Hadoop The Big Data Platform of Choice

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 3

E-guide Hadoop Big Data Platforms Buyer s Guide part 3 Big Data Platforms Buyer s Guide part 3 Your expert guide to big platforms enterprise MapReduce cloud-based Abie Reifer, DecisionWorx The Amazon Elastic MapReduce Web service offers a managed framework

More information

Deloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward

Deloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward Deloitte School of Analytics Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward February 2018 Agenda 7 February 2018 8 February 2018 9 February 2018 8:00 9:00 Networking

More information

Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications

Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications Cask Data Application Platform (CDAP) The Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications Copyright 2015 Cask Data, Inc. All Rights Reserved. February

More information

Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server Implementing a Data Warehouse with Microsoft SQL Server Course 20463D 5 Days Instructor-led, Hands-on Course Description In this five day instructor-led course, you will learn how to implement a data warehouse

More information

Boston Azure Cloud User Group. a journey of a thousand miles begins with a single step

Boston Azure Cloud User Group. a journey of a thousand miles begins with a single step Boston Azure Cloud User Group a journey of a thousand miles begins with a single step 3 Solution Architect at Slalom Boston Business Intelligence User Group Leader I am a bit shy but passionate. BI Architect

More information

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS)

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS) ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS) Hadoop Storage-as-a-Service ABSTRACT This White Paper illustrates how Dell EMC Elastic Cloud Storage (ECS ) can be used to streamline

More information

Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions

Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions Azure PaaS and SaaS Microsoft s two approaches to building IoT solutions Hector Garcia Tellado Program Manager Lead, Azure IoT Suite #IoTinActionMS #IoTinActionMS Agenda Customers using IoT today Microsoft

More information

Analyzing Data with Power BI

Analyzing Data with Power BI Course 20778A: Analyzing Data with Power BI Course Outline Module 1: Introduction to Self-Service BI Solutions Introduces business intelligence (BI) and how to self-serve with BI. Introduction to business

More information

HPE Flexible Capacity with Microsoft Azure & Azure Stack

HPE Flexible Capacity with Microsoft Azure & Azure Stack HPE Flexible Capacity with Microsoft Azure & Azure Stack The vision behind making Hybrid IT consumption a reality Reuben Melville Version 2.0 Compliance Recent outages of Public Cloud solutions, major

More information

Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics

Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics Key takeaways Analytic Insights Module for self-service analytics Automate data ingestion into Isilon Data Lake Three methods

More information

: Integrating MDM and Cloud Services with System Center Configuration Manager

: Integrating MDM and Cloud Services with System Center Configuration Manager 20703-2: Integrating MDM and Cloud Services with System Center Configuration Manager Overview This is a three-day Instructor Led Training (ILT) course that describes mobile device management (MDM) technologies

More information

Integrating MATLAB Analytics into Enterprise Applications

Integrating MATLAB Analytics into Enterprise Applications Integrating MATLAB Analytics into Enterprise Applications David Willingham 2015 The MathWorks, Inc. 1 Run this link. http://bit.ly/matlabapp 2 Key Takeaways 1. What is Enterprise Integration 2. What is

More information

Audience Profile The course will likely be attended by SQL Server report creators who are interested in alternative methods of presenting data.

Audience Profile The course will likely be attended by SQL Server report creators who are interested in alternative methods of presenting data. [MS20778]: Analyzing Data with Power BI Length : 3 Days Audience(s) : Information Workers Level : 300 Technology : Power BI Delivery Method : Instructor-led (Classroom) Course Overview The main purpose

More information

Microsoft Big Data. Solution Brief

Microsoft Big Data. Solution Brief Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,

More information

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect

Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect Aurélie Pericchi SSP APS Laurent Marzouk Data Insight & Cloud Architect 2005 Concert de Coldplay 2014 Concert de Coldplay 90% of the world s data has been created over the last two years alone 1 1. Source

More information

Data Center Operating System (DCOS) IBM Platform Solutions

Data Center Operating System (DCOS) IBM Platform Solutions April 2015 Data Center Operating System (DCOS) IBM Platform Solutions Agenda Market Context DCOS Definitions IBM Platform Overview DCOS Adoption in IBM Spark on EGO EGO-Mesos Integration 2 Market Context

More information

Adobe Deploys Hadoop as a Service on VMware vsphere

Adobe Deploys Hadoop as a Service on VMware vsphere Adobe Deploys Hadoop as a Service A TECHNICAL CASE STUDY APRIL 2015 Table of Contents A Technical Case Study.... 3 Background... 3 Why Virtualize Hadoop on vsphere?.... 3 The Adobe Marketing Cloud and

More information

More information for FREE VS ENTERPRISE LICENCE :

More information for FREE VS ENTERPRISE LICENCE : Source : http://www.splunk.com/ Splunk Enterprise is a fully featured, powerful platform for collecting, searching, monitoring and analyzing machine data. Splunk Enterprise is easy to deploy and use. It

More information

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

ORACLE DATA INTEGRATOR ENTERPRISE EDITION ORACLE DATA INTEGRATOR ENTERPRISE EDITION Oracle Data Integrator Enterprise Edition delivers high-performance data movement and transformation among enterprise platforms with its open and integrated E-LT

More information

Big Data Job Descriptions. Software Engineer - Algorithms

Big Data Job Descriptions. Software Engineer - Algorithms Big Data Job Descriptions Software Engineer - Algorithms This position is responsible for meeting the big data needs of our various products and businesses. Specifically, this position is responsible for

More information

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?

More information

80309: Microsoft Dynamics AX 2012 Process Manufacturing Production and Logistics

80309: Microsoft Dynamics AX 2012 Process Manufacturing Production and Logistics Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud

More information

SharePoint 2013 PerformancePoint Services

SharePoint 2013 PerformancePoint Services SharePoint 2013 PerformancePoint Services Course 55057; 3 Days, Instructor-led Course Description This three-day instructor-led course provides students with the necessary knowledge to work with PerformancePoint

More information

Designing Business Intelligence Solutions with Microsoft SQL Server 2014

Designing Business Intelligence Solutions with Microsoft SQL Server 2014 Designing Business Intelligence Solutions with Microsoft SQL Server 2014 Referencia MOC 20467C Duración (horas) 25 Última actualización 27 mayo 2016 Modalidades Presencial, a medida Examen 70-467 Introducción

More information

Analyzing Data with Power BI

Analyzing Data with Power BI Analyzing Data with Power BI Course 20778B 3 Days Instructor-led, Hands-on Course Description The main purpose of this three-day instructor-led course is to give students a good understanding of data analysis

More information

Oracle Big Data Cloud Service

Oracle Big Data Cloud Service Oracle Big Data Cloud Service Delivering Hadoop, Spark and Data Science with Oracle Security and Cloud Simplicity Oracle Big Data Cloud Service is an automated service that provides a highpowered environment

More information

Sr. Sergio Rodríguez de Guzmán CTO PUE

Sr. Sergio Rodríguez de Guzmán CTO PUE PRODUCT LATEST NEWS Sr. Sergio Rodríguez de Guzmán CTO PUE www.pue.es Hadoop & Why Cloudera Sergio Rodríguez Systems Engineer sergio@pue.es 3 Industry-Leading Consulting and Training PUE is the first Spanish

More information

Cloud Based Analytics for SAP

Cloud Based Analytics for SAP Cloud Based Analytics for SAP Gary Patterson, Global Lead for Big Data About Virtustream A Dell Technologies Business 2,300+ employees 20+ data centers Major operations in 10 countries One of the fastest

More information

Building Your Big Data Team

Building Your Big Data Team Building Your Big Data Team With all the buzz around Big Data, many companies have decided they need some sort of Big Data initiative in place to stay current with modern data management requirements.

More information

Hortonworks Data Platform. Buyer s Guide

Hortonworks Data Platform. Buyer s Guide Hortonworks Data Platform Buyer s Guide Hortonworks Data Platform (HDP Completely Open and Versatile Hadoop Data Platform 2 2014 Hortonworks, Inc. All rights reserved. Hadoop and the Hadoop elephant logo

More information

Transforming Big Data to Business Benefits

Transforming Big Data to Business Benefits Transforming Big Data to Business Benefits Automagical EDW to Big Data Migration BI at the Speed of Thought Stream Processing + Machine Learning Platform Table of Contents Introduction... 3 Case Study:

More information

Embracing Disruption: Financial Services and the Microsoft Cloud

Embracing Disruption: Financial Services and the Microsoft Cloud Embracing Disruption: Financial Services and the Microsoft Cloud Resources Executive Summary Challenges and Opportunities in the Microsoft Cloud Security, Privacy & Data Sovereignty http://endj.in/cloud/trust

More information

Building a Robust Analytics Platform

Building a Robust Analytics Platform akass@ + dmi@ Building a Robust Analytics Platform with an open-source stack What s coming up: 1) DigitalOcean - a company background 2) Data @ DigitalOcean 3) The Big Data Tech Stack @ DO 4) Use-cases

More information

Common Customer Use Cases in FSI

Common Customer Use Cases in FSI Common Customer Use Cases in FSI 1 Marketing Optimization 2014 2014 MapR MapR Technologies Technologies 2 Fortune 100 Financial Services Company 104M CARD MEMBERS 3 Financial Services: Recommendation Engine

More information

THE MAGIC OF DATA INTEGRATION IN THE ENTERPRISE WITH TIPS AND TRICKS

THE MAGIC OF DATA INTEGRATION IN THE ENTERPRISE WITH TIPS AND TRICKS THE MAGIC OF DATA INTEGRATION IN THE ENTERPRISE WITH TIPS AND TRICKS DATA HOLDS ALL THE POTENTIAL TO HELP BUSINESSES WIN CUSTOMERS INCREASE REVENUE GAIN COMPETITIVE ADVANTAGE STREAMLINE OPERATIONS BUT

More information

"Charting the Course... MOC C Administering System Center Configuration Manager and Intune. Course Summary

Charting the Course... MOC C Administering System Center Configuration Manager and Intune. Course Summary Description Course Summary Get expert instruction and hands-on practice configuring and managing clients and devices by using Microsoft System Center v1511, Microsoft Intune, and their associated site

More information

Managing Office 365 Identities and Services 20346C; 5 Days, Instructor-led

Managing Office 365 Identities and Services 20346C; 5 Days, Instructor-led Managing Office 365 Identities and Services 20346C; 5 Days, Instructor-led Course Description Get hands-on instruction and practice implementing Microsoft Azure in this two day Microsoft Official Course.

More information

Fast Start Business Analytics with Power BI

Fast Start Business Analytics with Power BI Fast Start Business Analytics with Power BI Accelerate Through classroom, challenging, training and a quick proof of concept, learn about Power BI and how it can help speed up your decision making and

More information

Incorporating Predictive Models for Operational Intelligence

Incorporating Predictive Models for Operational Intelligence Incorporating Predictive Models for Operational Intelligence Presented by Curt Hertler Partner Solutions Architect, OSIsoft Copyright 2015 OSIsoft, LLC History The only has thing a way new of repeating

More information

Microsoft FastTrack For Azure Service Level Description

Microsoft FastTrack For Azure Service Level Description ef Microsoft FastTrack For Azure Service Level Description 2017 Microsoft. All rights reserved. 1 Contents Microsoft FastTrack for Azure... 3 Eligible Solutions... 3 FastTrack for Azure Process Overview...

More information

Berkeley Data Analytics Stack (BDAS) Overview

Berkeley Data Analytics Stack (BDAS) Overview Berkeley Analytics Stack (BDAS) Overview Ion Stoica UC Berkeley UC BERKELEY What is Big used For? Reports, e.g., - Track business processes, transactions Diagnosis, e.g., - Why is user engagement dropping?

More information

Edge Analytics for IoT Device Intelligence

Edge Analytics for IoT Device Intelligence Edge Analytics for IoT Device Intelligence 1. IoT Trends 2. IoT Analytics 3. Edge Analytics Platform: Kanga 4. Future Direction 2017. 3. 10 IoT Trends - Business/Technology (1/3) Google : IoT Solution

More information

Integrating MDM and Cloud Services with System Center Configuration Manager

Integrating MDM and Cloud Services with System Center Configuration Manager Integrating MDM and Cloud Services with System Center Configuration Manager 20703-2; 3 days, Instructor-led About this course This is a three-day Instructor Led Training (ILT) course that describes mobile

More information

FLINK IN ZALANDO S WORLD OF MICROSERVICES JAVIER LOPEZ MIHAIL VIERU

FLINK IN ZALANDO S WORLD OF MICROSERVICES JAVIER LOPEZ MIHAIL VIERU FLINK IN ZALANDO S WORLD OF MICROSERVICES JAVIER LOPEZ MIHAIL VIERU 12-09-2016 AGENDA Zalando s Microservices Architecture Saiki - Data Integration and Distribution at Scale Flink in a Microservices World

More information

Oracle Enterprise Data Quality Product Roadmap and Statement of Direction. October 2016

Oracle Enterprise Data Quality Product Roadmap and Statement of Direction. October 2016 Oracle Enterprise Data Quality Product Roadmap and Statement of Direction October 2016 Oracle Confidential Internal/Restricted/Highly Restricted 2 Safe Harbor Statement The following is intended to outline

More information

Streaming Analytics, Data Lakes and PI Integrators

Streaming Analytics, Data Lakes and PI Integrators Streaming Analytics, Data Lakes and PI Integrators Presented by Matt Ziegler Aaron Loe Conference Theme and Keywords 2 A journey through history 2010 Machine 2014 Learning and IoT 2016 2012 2014 2015 3

More information

BIG WITH BIG DATA ANALYTICS

BIG WITH BIG DATA ANALYTICS Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics

More information

Setup HSP Ambari Cluster

Setup HSP Ambari Cluster Setup HSP Ambari Cluster Prerequisites 1. An initialized HSP cluster running at least HSP 1.1.1. 2. Downloaded copy of the ISO containing the vm-template for Ambari from HortonWorks. This ISO is available

More information

Microsoft Monitoring and Operating a Private Cloud

Microsoft Monitoring and Operating a Private Cloud 1800 ULEARN (853 276) www.ddls.com.au Microsoft 20246 - Monitoring and Operating a Private Cloud Length 5 days Price $4290.00 (inc GST) Version D Overview Please note: Microsoft have released a replacement

More information

Microsoft BI Product Suite

Microsoft BI Product Suite Microsoft BI Product Suite On Premises and In the Cloud What is Business Intelligence? How is the BI industry evolving? What are the typical components of a BI solution? How can BI be deployed within your

More information

Hadoop Integration Deep Dive

Hadoop Integration Deep Dive Hadoop Integration Deep Dive Piyush Chaudhary Spectrum Scale BD&A Architect 1 Agenda Analytics Market overview Spectrum Scale Analytics strategy Spectrum Scale Hadoop Integration A tale of two connectors

More information

HP Network Automation 7.2 Fundamentals and Administration

HP Network Automation 7.2 Fundamentals and Administration HP Network Automation 7.2 Fundamentals and Administration Instructor-Led Training INTENDED AUDIENCE New users of HP (formerly Opsware) Network Automation software (HP NA) OVERVIEW The HP Network Automation

More information

Hadoop in the Cloud. Ryan Lippert, Cloudera Product Cloudera, Inc. All rights reserved.

Hadoop in the Cloud. Ryan Lippert, Cloudera Product Cloudera, Inc. All rights reserved. Hadoop in the Cloud Ryan Lippert, Cloudera Product Marketing @lippertryan 1 2 Cloudera Confidential 3 Drive Customer Insights Improve Product & Services Efficiency Lower Business Risk 4 The world s largest

More information

Limitless Creativity in the Cloud

Limitless Creativity in the Cloud Limitless Creativity in the Cloud (Secure and on Schedule) Michael Krulik, Principal Solutions Specialist, Avid Joel Sloss, Sr. Program Manager, Microsoft Dec. 6, 2017 Emerging Threats Specific/sequential

More information

Sanoma Big Data Migration Sander Kieft

Sanoma Big Data Migration Sander Kieft Sanoma Big Data Migration Sander Kieft 2 23 June 2016 Budapest Big Data Forum - Sanoma Big Data Migration 3 23 June 2016 Budapest Big Data Forum - Sanoma Big Data Migration About me Manager Core Services

More information

1. Intoduction to Hadoop

1. Intoduction to Hadoop 1. Intoduction to Hadoop Hadoop is a rapidly evolving ecosystem of components for implementing the Google MapReduce algorithms in a scalable fashion on commodity hardware. Hadoop enables users to store

More information

MapR Pentaho Business Solutions

MapR Pentaho Business Solutions MapR Pentaho Business Solutions The Benefits of a Converged Platform to Big Data Integration Tom Scurlock Director, WW Alliances and Partners, MapR Key Takeaways 1. We focus on business values and business

More information

MICROSOFT AI PLATFORM

MICROSOFT AI PLATFORM MICROSOFT AI PLATFORM Build Intelligent Software Artificial Intelligence productivity for every developer and every scenario With the Azure platform and productivity services, you can create the next generation

More information

Hadoop in Production. Charles Zedlewski, VP, Product

Hadoop in Production. Charles Zedlewski, VP, Product Hadoop in Production Charles Zedlewski, VP, Product Cloudera In One Slide Hadoop meets enterprise Investors Product category Business model Jeff Hammerbacher Amr Awadallah Doug Cutting Mike Olson - CEO

More information

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Data Analytics Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Last 15 years IT-centric Traditional Analytics Traditional Applications Rigid Infrastructure Internet Next

More information

Copyright 2015, Oracle and/or its affiliates. All rights reserved.

Copyright 2015, Oracle and/or its affiliates. All rights reserved. Copyright 2015, Oracle and/or its affiliates. All rights reserved. Finding new business potential with Big Data Analytics Carsten Frisch Oracle Business Analytics DOAG 2015 Business Solutions Conference

More information

SharePoint 2013 Business Intelligence

SharePoint 2013 Business Intelligence SharePoint 2013 Business Intelligence 55042; 4 Days, Instructor-led Course Description This 4-day instructor-led course provides students with the necessary knowledge to work with all the associated SharePoint

More information

Free On-Line Microsoft PDF

Free On-Line Microsoft PDF Free On-Line Microsoft 70-534 PDF Microsoft 70-534 Dumps Available Here: microsoft-exam/70-534-dumps.html Enrolling now you will get access to 126 questions with a unique 70-534 dumps. Testlet 1 VanArsdel,

More information

Business Intelligence in Azure Alex Whittles

Business Intelligence in Azure Alex Whittles Business Intelligence in Azure Alex Whittles Alex@PurpleFrogSystems.com PurpleFrogSystems.com PurpleFrogSystems.com/blog @PurpleFrogSys Alex Whittles SQL Relay Committee SQLRelay.co.uk SQL Bits Committee

More information

Implementing Data Models and Reports with Microsoft SQL Server

Implementing Data Models and Reports with Microsoft SQL Server 20466 - Implementing Data Models and Reports with Microsoft SQL Server Duration: 5 Days Course Price: $2,975 Software Assurance Eligible Course Description Note: This course is designed for customers who

More information

Hybrid Data Management

Hybrid Data Management Kelly Schlamb Executive IT Specialist, Worldwide Analytics Platform Enablement and Technical Sales (kschlamb@ca.ibm.com, @KSchlamb) Hybrid Data Management IBM Analytics Summit 2017 November 8, 2017 5 Essential

More information

Exelon Utilities Data Analytics Journey

Exelon Utilities Data Analytics Journey Exelon Utilities Data Analytics Journey Presented by Dean M Hengst PI System uses with-in Exelon Utilities Intelligent Substation Substation Security Historical Playback / Capacity Planning ComEd as implemented

More information

NFLABS SIMPLIFYING BIG DATA. Real &me, interac&ve data analy&cs pla4orm for Hadoop

NFLABS SIMPLIFYING BIG DATA. Real &me, interac&ve data analy&cs pla4orm for Hadoop NFLABS SIMPLIFYING BIG DATA Real &me, interac&ve data analy&cs pla4orm for Hadoop Did you know? Founded in 2011, NFLabs is an enterprise software company working on developing solutions to simplify big

More information

IBM Big Data Summit 2012

IBM Big Data Summit 2012 IBM Big Data Summit 2012 12.10.2012 InfoSphere BigInsights Introduction Wilfried Hoge Leading Technical Sales Professional hoge@de.ibm.com twitter.com/wilfriedhoge 12.10.1012 IBM Big Data Strategy: Move

More information

Exam /Course 20332B Advanced Solutions of Microsoft SharePoint Server 2013

Exam /Course 20332B Advanced Solutions of Microsoft SharePoint Server 2013 Exam 70-332/Course 20332B Advanced Solutions of Microsoft SharePoint Server 2013 Prerequisites Before attending this course, students must have: Completed Course 20331: Core Solutions of Microsoft SharePoint

More information

Azure IoT Suite. Secure device connectivity and management. Data ingestion and command + control. Rich dashboards and visualizations

Azure IoT Suite. Secure device connectivity and management. Data ingestion and command + control. Rich dashboards and visualizations Azure IoT Suite Secure device connectivity and management Data ingestion and command + control Rich dashboards and visualizations Business workflow integration Move beyond building blocks with pre-configured

More information

Turn Data into Business Value

Turn Data into Business Value Turn Data into Business Value Infinite Video Platform Analytics Layne Berg, Product Manager Steve Epstein, Distinguished Engineer June 21, 2017 Applying Big Data Analytics to Video Today, primarily descriptive

More information

2016 Big Data. Mejores pra cticas en AWS

2016 Big Data. Mejores pra cticas en AWS 2016 Big Data. Mejores pra cticas en AWS Javier Ros, Solution Architect Jun, 2016 2016, Web Services, Inc. or its Affiliates. All rights reserved. Agenda Big Data challenges Design Patterns on AWS RavenPack.

More information

Tikuhao ᦤկϧϮ,7䅸䆕㗗䆩乬ᑧ ᙼ䕏ᵒ䗮䖛㗗䆩 ІЗ߃Ҳ ޏ߆ԇ NZZV ]]] ZOQ[NGU IUS

Tikuhao ᦤկϧϮ,7䅸䆕㗗䆩乬ᑧ ᙼ䕏ᵒ䗮䖛㗗䆩 ІЗ߃Ҳ ޏ߆ԇ NZZV ]]] ZOQ[NGU IUS Tikuhao Exam : 70-534 Title : Architecting Microsoft Azure Solutions Version : Demo 1 / 7 1. Topic 1, VanArsdel, Ltd Overview VanArsdel, Ltd. builds skyscrapers, subways, and bridges. VanArsdel is a leader

More information