Big Data with Azure: where to begin?

1 Big Data with Azure: where to begin? Concepts and best practices. October 15th 2016, Sofia. Satya SK Jayanty, Principal Architect & Managing Consultant

2 Sponsors Gold sponsors: Silver sponsors: Bronze sponsors:

3 Speaking Engagements

4 Author d

5 Agenda. What agenda? No agenda! You like: small data, big data, all data! That's why you are here today

6 What differentiates today's thriving organizations? Data. Data in all forms & sizes is being generated faster than ever before. Capture & combine it for new insights & better, faster decisions

7 Strategic opportunity with Big Data. Cloud, Mobile, Social, Big data: how do you use technology innovation to architect business innovation? Increased productivity; customer growth; real-time insights; embrace new models

8 The Azure Platform Strategy. Security & Management; Public Cloud Platform; Hybrid Operations; SaaS (Software as a Service): O365, CRM, VSO etc. + 3rd-party SaaS solutions; Microsoft Azure Stack & Cloud Platform System; Public, Global, Shared Datacenters

10-14 Breaking points of traditional approach

15 What if you could handle big data? [Chart: data volume (megabytes, gigabytes, terabytes, petabytes) against data complexity (variety and velocity). Sources shown: click stream, wikis/blogs, sensors, RFID, devices, social sentiment, audio/video, log files, spatial and GPS coordinates, data market feeds, eGov feeds, weather, text/image.]

16 Introducing Big Data. "Big data is a collection of data sets so large and complex that it becomes awkward to work with using on-hand database management tools. Difficulties include capture, storage, search, sharing, analysis, and visualization." (Wikipedia) Drivers: cheap storage, > 2 billion users, sensor networks, inexpensive computing. Enormous amounts of data: online behavior, social networking users, samples of medical ailments, purchasing habits of grocery shoppers, crime statistics of cities, Internet of Things (IoT), 24/7 out-patient monitors, real-time telemetric devices. 90% of the data in the world has been created in the last 2 years

17 5 Vs

18 Evolving Approaches to Analytics. ETL (Extract, Transform, Load): Original Data, ETL Tool (SSIS, etc.), Transformed Data, EDW (SQL Server, Teradata, etc.), Data Marts, BI Tools. Data Lake(s): Original Data and streaming data, Ingest (EL), Scale-out Storage & Compute (HDFS, Blob Storage, etc.), Transform & Load, Dashboards, Apps
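
The contrast on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sample CSV, the function names and the in-memory "warehouse" and "data lake" are all hypothetical stand-ins, not part of the talk. The point it tries to show is where the transform step sits: before loading (ETL) or after the raw data has already landed (EL, then T).

```python
import csv
import io

# Hypothetical raw input; one row has a bad amount on purpose.
RAW = "user,amount\nann,10\nbob,not-a-number\nann,5\n"


def parse_rows(text):
    """Read CSV text into a list of dicts (the 'extract' step)."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Keep rows whose amount is numeric and convert it to int."""
    clean = []
    for row in rows:
        if row["amount"].isdigit():
            clean.append({"user": row["user"], "amount": int(row["amount"])})
    return clean


def etl(text):
    """ETL: transform first, so the warehouse only ever holds clean rows."""
    return transform(parse_rows(text))


def elt(text):
    """EL then T: land the original data untouched, transform later on read."""
    data_lake = [text]  # the raw text is kept exactly as received
    view = transform(parse_rows(data_lake[0]))
    return data_lake, view


warehouse = etl(RAW)
data_lake, view = elt(RAW)
```

Both paths end with the same clean rows, but only the data-lake path still has the original input available for a different transformation later.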

19 Introducing Apache Hadoop. Hadoop stores files in a distributed file system. Hadoop can store very large amounts of data
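
As a rough illustration of what "stores files in a distributed file system" means, the sketch below splits a file into fixed-size blocks and assigns each block to several nodes. It is a simplification under stated assumptions: the function names and node names are hypothetical, the block size is tiny for readability (HDFS in Hadoop 2.x defaults to 128 MB blocks and a replication factor of 3), and the round-robin placement stands in for the rack-aware policy that HDFS really uses.

```python
def split_into_blocks(data: bytes, block_size: int):
    """Cut a file's bytes into consecutive blocks of at most block_size."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_replicas(num_blocks, nodes, replication=3):
    """Assign each block to `replication` distinct nodes, round-robin.

    Assumption: round-robin is used only to keep the sketch short;
    it is not how HDFS chooses data nodes.
    """
    if replication > len(nodes):
        raise ValueError("need at least as many nodes as replicas")
    placement = {}
    for block_id in range(num_blocks):
        placement[block_id] = [
            nodes[(block_id + r) % len(nodes)] for r in range(replication)
        ]
    return placement


# Hypothetical 10-byte file and four data nodes.
blocks = split_into_blocks(b"x" * 10, block_size=4)
placement = place_replicas(len(blocks), ["dn1", "dn2", "dn3", "dn4"])
```

Losing any single node still leaves two copies of every block, which is the property that lets the cluster run on inexpensive hardware.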

20 Introducing Hadoop: comparison to traditional RDBMS. Traditional RDBMS vs. Hadoop, compared on: data size, access, updates, structure, integrity, scaling, DBA ratio

21 Data variety

22 Data velocity

23 Hadoop is a platform with a portfolio of projects. Hadoop Common: utilities to support the other modules. HDFS (Hadoop Distributed File System): high throughput. YARN: job scheduling and cluster resource management. MapReduce: YARN-based, for parallel processing. Spark: compute engine. Pig: data-flow language & execution framework. Oozie: workflow scheduler. Ambari: provisioning, managing and monitoring clusters. Sqoop: bulk data transfer between Hadoop & relational databases. Batch-processing centric, using the MapReduce processing paradigm
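
The MapReduce paradigm named on this slide can be sketched as a single-process word count. This is a minimal sketch, not Hadoop code: the function names are hypothetical, and the `shuffle` step imitates in memory the grouping that the framework performs between the map and reduce phases across machines.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce in sequence over the input lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


counts = word_count(["big data small data", "all data"])
```

Because each mapper call sees one line and each reducer call sees one key, both phases can be spread over many nodes without coordination beyond the shuffle.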

24 Getting Started with HDInsight. Introducing Azure HDInsight: 100% Apache Hadoop; powered by the cloud; immersive insights

25 HDInsight supports Hive Hadoop 2.0

26 HDInsight supports HBase. [Diagram labels: Coordination, HMaster, Name Node, Job Tracker; four Region Servers, four Data Nodes, four Task Trackers]

27 HDInsight supports Mahout

28 HDInsight supports Storm

29 TCO, Deployment & Geo-Redundancy $

30 Connect cloud Hadoop with on-premises

31 Scenarios for deploying Hadoop as hybrid

32 Bringing Hadoop to a billion people

33 Industry use cases of Hadoop Financial services Retail Telecom Manufacturing Healthcare Utilities, oil and gas Public sector

34 Introducing the zoo: HDInsight/Hadoop ecosystem. Legend: Red = Core Hadoop; Blue = Data processing; Green = Packages; Purple = Microsoft integration points and value adds; Orange = Data movement. Diagram labels: Distributed Processing (MapReduce), Distributed Storage (HDFS)

35 Programming HDInsight. Since HDInsight is a service-based implementation, you get immediate access to the tools you need to program against HDInsight/Hadoop. Existing ecosystem: Hive, Pig, Sqoop, Mahout, Cascading, Scalding, Scoobi, Pegasus, etc. .NET: C#, F# Map/Reduce, LINQ to Hive, .NET management clients, etc. JavaScript: JavaScript Map/Reduce, browser-hosted console, Node.js management clients. DevOps/IT pros: PowerShell, cross-platform CLI tools

36 Challenges with implementing Hadoop

37 Why Hadoop in the cloud?

38 The Microsoft data platform. Applications: reports, dashboards, natural language query, mobile. Data orchestration, information management, complex event processing, modeling, machine learning. Relational, non-relational, NoSQL, streaming; internal & external

39 Cortana Analytics Suite Transform data into intelligent action DATA INTELLIGENCE ACTION

40 Azure Data Factory A managed cloud service for building & operating data pipelines Part of the Cortana Analytics Suite

41 What about Non-Relational and NoSQL? Fully featured RDBMS; rich query; transactional processing; managed as a service; elastic scale; schema-free data model; internet accessible (HTTP/REST); arbitrary data formats. There's a great David Chappell paper for getting up to speed on NoSQL

42 PolyBase unites structured data, unstructured data and business data for a better-together world of analytics

43 PolyBase and queries Provides a scalable, T-SQL-compatible query processing framework for combining data from both universes Access any data

44 So what is PolyBase? Answer: Component of the PDW Region in APS Answer: Unique Innovative Technology Answer: Seamless Integration Answer: Highly parallelised distributed query engine accessing heterogeneous data via SQL

45 Agnostic architecture PolyBase is agnostic = No vendor lock in PolyBase integrates with the cloud PolyBase supports Hadoop on Linux & Windows PolyBase supports HDInsight in APS & external Hadoop clusters

46 PolyBase builds the bridge. Just-in-time data integration: across relational and non-relational data; high-performance parallel architecture; fast, simple data loading. Best of both worlds: uses computational power at source for both relational data & Hadoop; opportunity for new types of analysis. Uses existing analytical skills: familiar SQL semantics & behaviour; query with familiar tools; SSDT; includes Power BI. PolyBase = run-time integration

47 PolyBase User Perspective Systems Perspective External Table External Data Source External File Format PDW Engine PDW Service Bridge

48 Mobile BI apps for SQL Server (Datazen) On-premises implementations are optimized for SQL Server Rich, interactive data visualization on all major mobile platforms View on any major mobile platform Access reports with online/offline support Data visualization and publishing Powerful insights

49 What is R? Extensible via packages Talented community of contributors High accuracy ML classifiers In-memory analytics Open source implementation Big data analytics Top tool for machine learning OOL for statistical computing Industry standard for computational mining Amazing data-visualization capabilities

50 Why is R famous? R plotting: box plot, bar plot, histogram, contour, dot plot, mosaic, scatter, Latticist

51 Revolution R Enterprise and SQL. Big data analytics platform: based on open source R; high-performance, scalable, full-featured. Statistical and machine-learning algorithms are performant, scalable, and distributable. Write once, deploy anywhere: scripts and models can be executed on a variety of platforms, including non-Microsoft (Hadoop, Teradata in-db). Integration with the R ecosystem: analytic algorithms accessed via R functions with similar syntax for R users; arbitrary R functions/packages can be used in conjunction. Advanced analytics

52 SQL Server 2016 R integration scenario. Exploration: use RRE from an R IDE to analyze large datasets and build predictive and embedded models, with the compute happening on the SQL Server machine (SQL Server compute context). Operationalization: developers can operationalize an R script/model over SQL Server data by using T-SQL constructs. DBAs can manage resources, and secure and govern R runtime execution in SQL Server

53 R script library in Microsoft Azure Marketplace Example solutions Fraud detection Sales forecasting Warehouse efficiency Predictive maintenance Extensibility Launch External Process R Integration R New R scripts Microsoft Azure Machine Learning Marketplace Benefits Faster deployment of ML models Faster performance (moves compute close to the data) Analytic library Data Scientist Interacts directly with data Improved scalability Benefits T-SQL interface Relational data Data Developer/DBA Manages data and analytics together Built into SQL Server Advanced analytics

54 Summary: R integration and advanced analytics SQL Server Analytics library Share and collaborate Manage and deploy Analytical engines Full R integration Fully extensible R + Data Scientists Publish algorithms, interact directly with data DBAs Manage storage and analytics together Capability Extensible in-database analytics, integrated with R, exposed through T-SQL Centralize enterprise library for analytic models Benefits Data Management Layer Relational data T-SQL interface Stream data in-memory Business Analysts Analysis through TSQL, tools, and vetted algorithms Advanced analytics

55 Standard approach to learn R. Self-training is the key. Math: statistics, calculus, probability. Machine learning algorithms. Open source R packages. Industrial R with R: Hadoop, RRE. Applied R with Microsoft Azure ML, RevR

56 Machine learning tools Open source R considered best fit Python Monte Carlo Machine Learning Library H2O Weka Octave-Forge Commercial Microsoft Azure Machine Learning SAS Enterprise Miner IBM SPSS Modeler RapidMiner Apache Mahout MATLAB Oracle Data Mining

57 Rich Services Heterogeneity Integrate with on-premises Lower Your Risk

58 Scaling

59 Azure in hawk-eye mode Platform Services Security & Management Portal Cloud Services Service Fabric Web Apps API Apps SQL Database Data Warehouse DocumentDB Hybrid Operations Azure AD Health Monitoring Azure Active Directory Azure AD B2C Batch RemoteApp Mobile Apps Logic Apps Redis Cache Azure Search Storage Tables AD Privileged Identity Management Domain Services Multi-Factor Authentication Automation Storage Queues BizTalk Services API Management Notification Hubs Backup Scheduler Hybrid Connections Service Bus HDInsight Machine Learning Stream Analytics Data Lake Operational Analytics Key Vault Visual Studio Azure SDK Data Factory Event Hubs Data Catalog Import/Export Store/ Marketplace VM Image Gallery & VM Depot Media Services Content Delivery Network (CDN) VS Online App Insights Infrastructure Services IoT Hub Mobile Engagement Azure Site Recovery StorSimple

60 Azure IT Capabilities Platform Services Security & Management Service Creation & Configuration User/Group Directory Store Identity Sign-Up and sign-in Multi-Factor Authentication Scheduled Service Management Task Scheduler Stateless Compute Scheduled Compute Jobs Simple Queuing Hybrid Connections Distributed Compute Virtual App Streaming B2B Integration Pub/Sub Queuing Web Apps Infrastructure Mobile Backends API Management API App Infrastructure Business Process Automation Push Notifications Big Data Analytics Relational SQL Database Distributed In-Memory Cache Predictive Analytics Data Warehouse Search Data Stream Analytics Document Database Service Simple Key/Value Store Big Data Storage Hybrid Operations Directory Health Monitoring Privileged Identity Management Domain Join & Policy Management Server Data Backup Operational Analytics Encryption Key Store Development Tools Software Development Kits Data Pipelines Device Data Collection Data Source Management Bulk Data Import And Export Software/Solution Marketplace Pre-Build VM Images Live & OD Media Streaming Content Delivery Network (CDN) Software Lifecycle Management Application Instrumentation Infrastructure Services IoT Device Management Mobile Analytics Disaster Recovery Hybrid/Intelligent Data Backup

61 Summary. Big Data refers to data sets so large and/or complex that they become awkward to work with in conventional ways. Hadoop and HDInsight = Microsoft's answer to Big Data. Hadoop can store petabytes of data reliably and execute huge distributed computations. However, Big Data query results often involve significant latency. Power BI includes authoring add-ins to query, analyze and visualize data sourced from Azure HDInsight. Preload data in advance of business user queries. Big Data is just another data source!
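
The advice to preload data ahead of business user queries can be sketched as a small precomputed summary. Everything here is an assumption made for illustration: `slow_big_data_query` is a hypothetical stand-in for a long-running batch job over the full data set (for example a Hive query on HDInsight), and the region/amount events are invented sample data.

```python
from collections import Counter


def slow_big_data_query(events):
    """Stand-in for a high-latency batch job: total amount per region."""
    totals = Counter()
    for region, amount in events:
        totals[region] += amount
    return dict(totals)


class PreloadedSummary:
    """Holds results computed ahead of time so user queries stay fast."""

    def __init__(self):
        self._totals = {}

    def refresh(self, events):
        """Run the expensive job once, e.g. on a nightly schedule."""
        self._totals = slow_big_data_query(events)

    def total_for(self, region):
        """Answer a user query from the preloaded result, not the cluster."""
        return self._totals.get(region, 0)


summary = PreloadedSummary()
summary.refresh([("eu", 5), ("us", 3), ("eu", 2)])
eu_total = summary.total_for("eu")
```

The trade-off is freshness: answers are only as current as the last refresh, which is usually acceptable for dashboards and reports.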

62 Resources Microsoft Big Data web site Azure HDInsight web site Hortonworks tutorials Numerous tutorials are available to learn about Big Data by using the Hortonworks Sandbox Follow r


More information

MapR: Solution for Customer Production Success

MapR: Solution for Customer Production Success 2015 MapR Technologies 2015 MapR Technologies 1 MapR: Solution for Customer Production Success Big Data High Growth 700+ Customers Cloud Leaders Riding the Wave with Hadoop The Big Data Platform of Choice

More information

Why Big Data Matters? Speaker: Paras Doshi

Why Big Data Matters? Speaker: Paras Doshi Why Big Data Matters? Speaker: Paras Doshi If you re wondering about what is Big Data and why does it matter to you and your organization, then come to this talk and get introduced to Big Data and learn

More information

IMPLEMENTING MICROSOFT AZURE INFRASTRUCTURE SOLUTIONS

IMPLEMENTING MICROSOFT AZURE INFRASTRUCTURE SOLUTIONS IMPLEMENTING MICROSOFT AZURE INFRASTRUCTURE SOLUTIONS Course Duration: 5 Days About this course This course is aimed at experienced IT professionals who currently administer their on-premise infrastructure.

More information

House Keeping. You are in Listen Only Mode. Azure 101: Azure Overview. Azure 201: How to do a Cost Estimate for Virtual Machines

House Keeping. You are in Listen Only Mode. Azure 101: Azure Overview. Azure 201: How to do a Cost Estimate for Virtual Machines House Keeping You are in Listen Only Mode Use the WebEx Chat window to enter question Azure 101: Azure Overview First Tuesday of each Month Azure 201: How to do a Cost Estimate for Virtual Machines Second

More information

What s new on Azure? Jan Willem Groenenberg

What s new on Azure? Jan Willem Groenenberg What s new on Azure? Jan Willem Groenenberg Why the cloud? Rapidly setup environments to drive business priorities Scale to meet peak demands Increase daily activities, efficiency and reduced cost. Why

More information

Hortonworks Data Platform

Hortonworks Data Platform Hortonworks Data Platform An open-architecture platform to manage data in motion and at rest Highlights Addresses a range of data-at-rest use cases Powers real-time customer applications Delivers robust

More information

Intro to Big Data and Hadoop

Intro to Big Data and Hadoop Intro to Big and Hadoop Portions copyright 2001 SAS Institute Inc., Cary, NC, USA. All Rights Reserved. Reproduced with permission of SAS Institute Inc., Cary, NC, USA. SAS Institute Inc. makes no warranties

More information

Azure Data Factory Hybrid data integration, at global scale. Erika Harris Senior Program Manager AzureCAT

Azure Data Factory Hybrid data integration, at global scale. Erika Harris Senior Program Manager AzureCAT Azure Data Factory Hybrid data integration, at global scale Erika Harris Senior Program Manager AzureCAT Data Cloud AI There are barriers to getting value from data Data silos Incongruent data types Complexity

More information

How In-Memory Computing can Maximize the Performance of Modern Payments

How In-Memory Computing can Maximize the Performance of Modern Payments How In-Memory Computing can Maximize the Performance of Modern Payments 2018 The mobile payments market is expected to grow to over a trillion dollars by 2019 How can in-memory computing maximize the performance

More information

Microsoft Azure Architect Design (AZ301)

Microsoft Azure Architect Design (AZ301) Microsoft Azure Architect Design (AZ301) COURSE OVERVIEW: This four-day course is aligned to Azure Exam:AZ-301, Azure Solutions Architect-Design and contains the following: AZ-301T01: Designing for Identity

More information

Cloud service models

Cloud service models Onur Dogruoz Cloud service models Security & Management Security Center Portal Azure Active Directory Azure AD B2C Multi-Factor Authentication Media Services Logic Apps Media & CDN API Management Media

More information

Analytics in Action transforming the way we use and consume information

Analytics in Action transforming the way we use and consume information Analytics in Action transforming the way we use and consume information Big Data Ecosystem The Data Traditional Data BIG DATA Repositories MPP Appliances Internet Hadoop Data Streaming Big Data Ecosystem

More information

EXECUTIVE BRIEF. Successful Data Warehouse Approaches to Meet Today s Analytics Demands. In this Paper

EXECUTIVE BRIEF. Successful Data Warehouse Approaches to Meet Today s Analytics Demands. In this Paper Sponsored by Successful Data Warehouse Approaches to Meet Today s Analytics Demands EXECUTIVE BRIEF In this Paper Organizations are adopting increasingly sophisticated analytics methods Analytics usage

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information
