Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics

Size: px
Start display at page:

Download "Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics"

Transcription

1 Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics

2 Key takeaways Analytic Insights Module for self-service analytics Automate data ingestion into Isilon Data Lake Three methods to ingest data from external data sources Fine-grained data control and governance 2

3 Agenda Introduction to Analytics Insights Module Personal On-demand Workspace Demo Automate Isilon HDFS Provisioning Data Ingestion Methods Fine-grained Data Access Policies Q&A 3

4 The new digital customer Rising and continuously changing expectations around experiences Always available Real-time updates Intelligent interactions Intelligent applications are the new face of business 4

5 Challenges to realizing business value from your data 80% 60% Time spent discovering and preparing data 1 Data Analytics projects failing to move past exploratory stage 3 25% 41% 90% 71% Of unstructured data is used 2 Of structured data is used 2 Silo d data analytic efforts 5 Employees have access to data they shouldn t Boost Your Business Insights By Converging Big Data And BI, March 25, Corporate Data: A Protected Asset or a Ticking Time Bomb?, Ponemon Institute, Dec Business Technographics, Global Data and Analytics Survey, Forrester Research, Information Innovation Key Overview, April 22, Predicts 2015: Big Data Challenges Move From Technology to the Organization, Gartner, 28 November 2014

6 Analytic Insights Module Increases your speed through the virtuous cycle Analyze via self-service to create new insights Act on insights for monetizing new opportunities 6 Gather the right data with deep awareness

7 ANALYTIC INSIGHTS MODULE Personal On-demand Workspace Access data, create analytics, and collaborate ANALYZE GET TO WORK IN MINUTES 7

8 Analytic Insights Module engineered for speed Platform manager Coming Soon! 8

9 Logical View of Analytic Insights Module Global UI app Pivotal CF Attivio Bedrock Analytic Insights Module Controller Web client WORKSPACE DAC applications User tools & apps Rabbit MQ DAC Services Data scientist SSH Data containers [Hadoop databases etc.] DAC client services ISILON Published data/apps 9

10 Automate Isilon HDFS provisioning for Hadoop ACCESS ZONE: creates an access zone within Isilon IP POOL: creates an IP pool for the new zone HADOOP USERS: creates Hadoop users within the access zone and assigns GID and UID USER MAPPING: maps hdfs user to root DIRECTORY CONFIGURATION: creates required directory structure 10

11 Isilon NameNode registration The Controller deploys Cloudera Manager or Ambari and registers Isilon as a Namenode 11

12 Data ingest methods Analytic Insights Module supports three methods to ingest data from external data sources 1 Workspace 1 Workspace 2... Workspace n 2 3 Data ingest to ODC Ability for data engineers bring commonly used sets to ODC, thus avoiding multiple access requests from sources External data DSD Bedrock ODC Analytic Insights Module 12

13 Data ingestion into a workspace data path External data sources control path DATA USER Analytic Insights Module 1 Access data source definition & build marts Access Attivio DSD from workspace UI using user credential via LDAP integration EDW Enterprise Apps Social media Ingest workflow Bedrock 3 2 Provide metadata 4 Create & execute workflow DSD Register metadata DAC Browse discovered data sources, review sample data, semantic search of data sources Choose data sets and prepare a custom data model Auto-provision custom data model to a data container in workspace Cloud SaaS Data containers No operating knowledge of Bedrock required Devices & sensors Workbench VM Data store Ingest process runs on-demand 13

14 Data source connection and ingestion Direct upload Single.csv,.xml, and.zip files CMS CONNECTION Data Profiling Connectors Small/ unstructured data sources All connection types perform Metadata Collection and Content Ingestion 14 DSD Spiders Large, complex, structured sources (e.g., data warehouse) Ingest only a sample of the content from each detected table

15 Unify Automatically generates data models Correlates all structured data and unstructured content Enables Dynamic Modeling to create a data mart with multiple sources 15

16 Provision dataset to workspace data container SalesData SalesData Attivio DSD triggers Bedrock workflow to provision any dataset to user s workspace User can select destination workspace data container to provision into and Bedrock will execute the ingestion process 16

17 Trigger Bedrock ingestion workflow from DSD Ingestion Driver is available out-of-the-box in Bedrock Workflow is triggered by Attivio provisioner plug-in Identifies the target container Branches to Hive/MongoDB/MySQL ingestion depending on request 17

18 Dataset is automatically available in workspace Container URL provides a way to access the data from within the workbench data container SalesData appears in user workspace automatically User is able to view the URL to access the data using their own credential which is integrated with LDAP 18

19 Dirt road ingestion into workspace BYOD External data sources EDW Enterprise Apps data path control path DATA USER Workspace in Analytic Insights Module 1 2 Register data source(s) in DAC Ingest data from external sources DAC Use ingestion applications from Hadoop clusters to source external data Build ingest workflow pipelines to gather and transform data before loading to workspace containers Social media Support BYOD use cases wrangling in real-time or batch data feeds Cloud SaaS Hadoop Data store Develop machine learning algorithms using Spark platform on Hadoop Work bench VM Devices & sensors Data store 19

20 Data ingestion into the Data Catalog External data sources EDW Enterprise Apps Control Path Data Path Analytic Insights Module Design & run ingest workflow 1 Bedrock DATA ENGINEER DAC Create batch or real-time process in Zaloni and execute to ingest data from external data sources Ability to support complex data transformations in Zaloni workflows Social media 2 Execute workflow 3 Register metadata Stage may contain look ups or master data for enrichment Ingest workflow Ability to add ingestion rules and/or definitions to data elements Cloud SaaS Stage Ingest workflow ODC Monitor and Administer Zaloni workflow runs Devices & sensors Data store 20

21 Define file ingest in Bedrock Test[0-9]+.dat File pattern associated with the source 21

22 Workflow Designer Orchestrate set of actions Supports simple/complex flows Hive, Spark, Spark-SQL, Shell, Java, Mapreduce generic actions Built-in actions for CDC, Watermarking, Tokenization, Avro conversion, Parquet conversion 22

23 Transformation Transformation library Build transformations using drag-and-drop interface Supports Spark for efficient transformation Integrates with workflow module Metadata entity data flows 23

24 Data governor policy engine All data requests are intercepted and forwarded to policy engine. Policy engine evaluates request against policies and returns modified request that gives user policy-compliant results. Full audit trail at the user and data level is built automatically. Business users, data scientists, developers Active directory Applications 2 USER REQUEST COMPLIANT RESULTS 5 Security admins 1 BlueTalon policy console BlueTalon policy engine 3 USER REQUEST 4 MODIFIED, COMPLIANT REQUEST ODC 6 BlueTalon audit engine Security admins 24 BlueTalon Enforcement Points

25 Fine-grained data access policies XXXX Create rule to mask customer s Social Security Number based on user roles 25

26 Analytic Insights Module Increases your speed through the virtuous cycle Analyze via self-service to create new insights Act on insights for monetizing new opportunities 26 Gather the right data with deep awareness

27 Questions? 27

28 Realize your next steps Attend Breakout Sessions Secure IT's Seat At The Table: Deliver The Business Self-Service Data Analytics Wed. (5/10), 1:30 PM - 2:30 PM, San Polo 3405 IoT Analytics: A Modern Manufacturing Surveillance Use Case Wed. (5/10), 12:00 PM - 1:00 PM, Delfino 4003 See the Blueprint solutions in action at the Expo Kiosks and Customer Presentations in the Converged Platforms and Solutions booth #872 Engage with our Dell EMC Big Data and Cloud subject matter experts to learn more Visit dellemc.com/aim 28

29

Your Top 5 Reasons Why You Should Choose SAP Data Hub INTERNAL

Your Top 5 Reasons Why You Should Choose SAP Data Hub INTERNAL Your Top 5 Reasons Why You Should Choose INTERNAL Top 5 reasons for choosing the solution 1 UNIVERSAL 2 INTELLIGENT 3 EFFICIENT 4 SCALABLE 5 COMPLIANT Universal view of the enterprise and Big Data: Get

More information

Architecture Overview for Data Analytics Deployments

Architecture Overview for Data Analytics Deployments Architecture Overview for Data Analytics Deployments Mahmoud Ghanem Sr. Systems Engineer GLOBAL SPONSORS Agenda The Big Picture Top Use Cases for Data Analytics Modern Architecture Concepts for Data Analytics

More information

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

Azure Data Analytics & Machine Learning Seminar. Daire Cunningham: BI Practice Area Manager

Azure Data Analytics & Machine Learning Seminar. Daire Cunningham: BI Practice Area Manager Azure Data Analytics & Machine Learning Seminar Daire Cunningham: BI Practice Area Manager AGENDA 09:00 AM 09:30 AM Registration & Refreshments 09.30AM 10:00 AM 10:00 AM 10:30 AM Welcome & Keynote, Ger

More information

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake White Paper Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake Motivation for Modernization It is now a well-documented realization among Fortune 500 companies

More information

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. Course Content Course Description: The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. At Course Completion: After competing this course,

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Duration: 5 days; Instructor-led Implement Spark Streaming Using the DStream API. Develop Big Data Real-Time Processing Solutions with Apache

More information

REDEFINE BIG DATA. Zvi Brunner CTO. Copyright 2015 EMC Corporation. All rights reserved.

REDEFINE BIG DATA. Zvi Brunner CTO. Copyright 2015 EMC Corporation. All rights reserved. 1 REDEFINE BIG DATA Zvi Brunner CTO 2 2020: A NEW DIGITAL WORLD 30B DEVICES 7B PEOPLE Millions OF NEW BUSINESSES Source: Gartner Group, 2014 DIGITIZATION IS ALREADY BEGINNING PRECISION FARMING DRESS THAT

More information

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara

Pentaho 8.0 and Beyond. Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Pentaho 8.0 and Beyond Matt Howard Pentaho Sr. Director of Product Management, Hitachi Vantara Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our

More information

In search of the Holy Grail?

In search of the Holy Grail? In search of the Holy Grail? Our Clients Journey to the Data Lake André De Locht Sr Business Consultant Data Lake, Information Integration and Governance $ andre.de.locht@be.ibm.com ( +32 476 870 354 Data

More information

5th Annual. Cloudera, Inc. All rights reserved.

5th Annual. Cloudera, Inc. All rights reserved. 5th Annual 1 The Essentials of Apache Hadoop The What, Why and How to Meet Agency Objectives Sarah Sproehnle, Vice President, Customer Success 2 Introduction 3 What is Apache Hadoop? Hadoop is a software

More information

EXAMPLE SOLUTIONS Hadoop in Azure HBase as a columnar NoSQL transactional database running on Azure Blobs Storm as a streaming service for near real time processing Hadoop 2.4 support for 100x query gains

More information

From Data Deluge to Intelligent Data

From Data Deluge to Intelligent Data SAP Data Hub From Data Deluge to Intelligent Data Orchestrate Your Data for an Intelligent Enterprise Data for Intelligence, Speed, and With Today, corporate data landscapes are growing increasingly diverse

More information

20775 Performing Data Engineering on Microsoft HD Insight

20775 Performing Data Engineering on Microsoft HD Insight Duración del curso: 5 Días Acerca de este curso The main purpose of the course is to give students the ability plan and implement big data workflows on HD. Perfil de público The primary audience for this

More information

Cloudera, Inc. All rights reserved.

Cloudera, Inc. All rights reserved. 1 Data Analytics 2018 CDSW Teamplay und Governance in der Data Science Entwicklung Thomas Friebel Partner Sales Engineer tfriebel@cloudera.com 2 We believe data can make what is impossible today, possible

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Course Details Course Code: Duration: Notes: 20775A 5 days This course syllabus should be used to determine whether the course is appropriate

More information

20775: Performing Data Engineering on Microsoft HD Insight

20775: Performing Data Engineering on Microsoft HD Insight Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Rohit Bakhshi, Solution Architect, Hortonworks Jim Walker, Director Product Marketing, Talend Page 1 About Us Rohit Bakhshi Solution

More information

Paul Chang Senior Consultant, Data Scientist, IBM Cloud tw.ibm.com

Paul Chang Senior Consultant, Data Scientist, IBM Cloud tw.ibm.com Paul Chang Senior Consultant, Data Scientist, IBM Cloud paulyc@ tw.ibm.com 2 no AI without IA 3 3 3 3 3 3 AI Machine Learning Analytics Data The AI Ladder 3 Most are here Data Driven Insight Driven Digital

More information

Cloudera Data Science and Machine Learning. Robin Harrison, Account Executive David Kemp, Systems Engineer. Cloudera, Inc. All rights reserved.

Cloudera Data Science and Machine Learning. Robin Harrison, Account Executive David Kemp, Systems Engineer. Cloudera, Inc. All rights reserved. Cloudera Data Science and Machine Learning Robin Harrison, Account Executive David Kemp, Systems Engineer 1 This is the age of machine learning. Data volume NO Machine Learning Machine Learning 1950s 1960s

More information

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop

More information

Building data-driven applications with SAP Data Hub and Amazon Web Services

Building data-driven applications with SAP Data Hub and Amazon Web Services Building data-driven applications with SAP Data Hub and Amazon Web Services Dr. Lars Dannecker, Steffen Geissinger September 18 th, 2018 Cross-department disconnect Cross-department disconnect Cross-department

More information

Hortonworks Connected Data Platforms

Hortonworks Connected Data Platforms Hortonworks Connected Data Platforms MASTER THE VALUE OF DATA EVERY BUSINESS IS A DATA BUSINESS EMBRACE AN OPEN APPROACH 2 Hortonworks Inc. 2011 2016. All Rights Reserved Data Drives the Connected Car

More information

Business is being transformed by three trends

Business is being transformed by three trends Business is being transformed by three trends Big Cloud Intelligence Stay ahead of the curve with Cortana Intelligence Suite Business apps People Custom apps Apps Sensors and devices Cortana Intelligence

More information

BIG DATA AND HADOOP DEVELOPER

BIG DATA AND HADOOP DEVELOPER BIG DATA AND HADOOP DEVELOPER Approximate Duration - 60 Hrs Classes + 30 hrs Lab work + 20 hrs Assessment = 110 Hrs + 50 hrs Project Total duration of course = 160 hrs Lesson 00 - Course Introduction 0.1

More information

Meta-Managed Data Exploration Framework and Architecture

Meta-Managed Data Exploration Framework and Architecture Meta-Managed Data Exploration Framework and Architecture CONTENTS Executive Summary Meta-Managed Data Exploration Framework Meta-Managed Data Exploration Architecture Data Exploration Process: Modules

More information

Alexander Klein. ETL meets Azure

Alexander Klein. ETL meets Azure Alexander Klein ETL meets Azure Thanks to our sponsors: Who am I? Independent BI Consultant > 15 years experience of SQL Server Focus on Microsoft BI Stack & AI & Azure a.klein@consulting-bi.de @SQL_Alex

More information

Trifacta Data Wrangling for Hadoop: Accelerating Business Adoption While Ensuring Security & Governance

Trifacta Data Wrangling for Hadoop: Accelerating Business Adoption While Ensuring Security & Governance 575 Market St, 11th Floor San Francisco, CA 94105 www.trifacta.com 844.332.2821 1 WHITEPAPER Trifacta Data Wrangling for Hadoop: Accelerating Business Adoption While Ensuring Security & Governance 2 Introduction

More information

Spotlight Sessions. Nik Rouda. Director of Product Marketing Cloudera, Inc. All rights reserved. 1

Spotlight Sessions. Nik Rouda. Director of Product Marketing Cloudera, Inc. All rights reserved. 1 Spotlight Sessions Nik Rouda Director of Product Marketing Cloudera @nrouda Cloudera, Inc. All rights reserved. 1 Spotlight: Protecting Your Data Nik Rouda Product Marketing Cloudera, Inc. All rights reserved.

More information

Building a Single Source of Truth across the Enterprise An Integrated Solution

Building a Single Source of Truth across the Enterprise An Integrated Solution SOLUTION BRIEF Building a Single Source of Truth across the Enterprise An Integrated Solution From EDW modernization to self-service BI on big data This solution brief showcases an integrated approach

More information

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand Paper 2698-2018 Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand ABSTRACT Digital analytics is no longer just about tracking the number

More information

Modern Analytics Architecture

Modern Analytics Architecture Modern Analytics Architecture So what is a. Modern analytics architecture? Machine Learning AI Open source Big Data DevOps Cloud In-memory IoT Trends supporting Next-Generation analytics Source: Next-Generation

More information

Embark on Your Data Management Journey with Confidence

Embark on Your Data Management Journey with Confidence SAP Brief SAP Data Hub Embark on Your Data Management Journey with Confidence SAP Brief Managing data operations across your complex IT landscape Proliferation of any kind of data presents a wealth of

More information

AZURE HDINSIGHT. Azure Machine Learning Track Marek Chmel

AZURE HDINSIGHT. Azure Machine Learning Track Marek Chmel AZURE HDINSIGHT Azure Machine Learning Track Marek Chmel SESSION AGENDA Understanding different scenarios of Hadoop Building an end to end pipeline using HDInsight Using in-memory techniques to analyze

More information

Transforming Analytics with Cloudera Data Science WorkBench

Transforming Analytics with Cloudera Data Science WorkBench Transforming Analytics with Cloudera Data Science WorkBench Process data, develop and serve predictive models. 1 Age of Machine Learning Data volume NO Machine Learning Machine Learning 1950s 1960s 1970s

More information

Managing explosion of data. Cloudera, Inc. All rights reserved.

Managing explosion of data. Cloudera, Inc. All rights reserved. Managing explosion of data 1 Customer experience expectations are converging on the brand, not channel Consistent across all channels and lines of business Contextualized to present location and circumstances

More information

Cask Data Application Platform (CDAP) Extensions

Cask Data Application Platform (CDAP) Extensions Cask Data Application Platform (CDAP) Extensions CDAP Extensions provide additional capabilities and user interfaces to CDAP. They are use-case specific applications designed to solve common and critical

More information

ETL challenges on IOT projects. Pedro Martins Head of Implementation

ETL challenges on IOT projects. Pedro Martins Head of Implementation ETL challenges on IOT projects Pedro Martins Head of Implementation Outline What is Pentaho Pentaho Data Integration (PDI) Smartcity Copenhagen Example of Data structure without an OLAP schema Telematics

More information

Who is Databricks? Today, hundreds of organizations around the world use Databricks to build and power their production Spark applications.

Who is Databricks? Today, hundreds of organizations around the world use Databricks to build and power their production Spark applications. Databricks Primer Who is Databricks? Databricks was founded by the team who created Apache Spark, the most active open source project in the big data ecosystem today, and is the largest contributor to

More information

Architecting an Open Data Lake for the Enterprise

Architecting an Open Data Lake for the Enterprise Architecting an Open Data Lake for the Enterprise 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Today s Presenters Daniel Geske, Solutions Architect, Amazon Web Services Armin

More information

The Importance of good data management and Power BI

The Importance of good data management and Power BI The Importance of good data management and Power BI The BI Iceberg Visualising Data is only the tip of the iceberg Data Preparation and provisioning is a complex process Streamlining this process is key

More information

Outline of Hadoop. Background, Core Services, and Components. David Schwab Synchronic Analytics Nov.

Outline of Hadoop. Background, Core Services, and Components. David Schwab Synchronic Analytics   Nov. Outline of Hadoop Background, Core Services, and Components David Schwab Synchronic Analytics https://synchronicanalytics.com Nov. 1, 2018 Hadoop s Purpose and Origin Hadoop s Architecture Minimum Configuration

More information

Machine Learning For Enterprise: Beyond Open Source. April Jean-François Puget

Machine Learning For Enterprise: Beyond Open Source. April Jean-François Puget Machine Learning For Enterprise: Beyond Open Source April 2018 Jean-François Puget Use Cases for Machine/Deep Learning Cyber Defense Drug Discovery Fraud Detection Aeronautics IoT Earth Monitoring Advanced

More information

Taking Advantage of Cloud Elasticity and Flexibility

Taking Advantage of Cloud Elasticity and Flexibility Taking Advantage of Cloud Elasticity and Flexibility Fred Koopmans Sr. Director of Product Management 1 Public cloud adoption is surging 2 Cloudera customers are leading the way 3 Hadoop was born for the

More information

PORTFOLIO AND TECHNOLOGY DIRECTION ARMISTEAD SAPP & RANDY GUARD

PORTFOLIO AND TECHNOLOGY DIRECTION ARMISTEAD SAPP & RANDY GUARD PORTFOLIO AND TECHNOLOGY DIRECTION ARMISTEAD SAPP & RANDY GUARD FOCUS MARKETS SAS Addressable Market Size $US Billions $14.7 2015 2019 $10.6 $9.6 $7.0 $7.9 $5.0 $2.6 $3.7 $5.7 $4.4 $3.0 $4.2 BUSINESS INTELLIGENCE

More information

Copyright 2014, Oracle and/or its affiliates. All rights reserved. 2

Copyright 2014, Oracle and/or its affiliates. All rights reserved. 2 Copyright 2014, Oracle and/or its affiliates. All rights reserved. 2 Oracle Cloud Marketplace: An Innovation Ecosystem for Partners and Customers Neelesh Gurnani Sr. Director Product Development Ajay Seetharam

More information

Make Business Intelligence Work on Big Data

Make Business Intelligence Work on Big Data Make Business Intelligence Work on Big Data Speed. Scale. Simplicity. Put the Power of Big Data in the Hands of Business Users Connect your BI tools directly to your big data without compromising scale,

More information

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?

More information

Analytics for All Data

Analytics for All Data Analytics for All Data How Oracle Analytics Helps Agencies Improve Their Effectiveness FORCES 2017 Jim Penn Sr Manager, Public Sector Oracle Analytics & Big Data Agenda Oracle s Analytics Platform Overview

More information

How In-Memory Computing can Maximize the Performance of Modern Payments

How In-Memory Computing can Maximize the Performance of Modern Payments How In-Memory Computing can Maximize the Performance of Modern Payments 2018 The mobile payments market is expected to grow to over a trillion dollars by 2019 How can in-memory computing maximize the performance

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information

THE CIO GUIDE TO BIG DATA ARCHIVING. How to pick the right product?

THE CIO GUIDE TO BIG DATA ARCHIVING. How to pick the right product? THE CIO GUIDE TO BIG DATA ARCHIVING How to pick the right product? The landscape of enterprise data is changing with the advent of enterprise social data, IoT, logs and click-streams. The data is too big,

More information

EMC IT Big Data Analytics Journey. Mahmoud Ghanem Sr. Systems Engineer

EMC IT Big Data Analytics Journey. Mahmoud Ghanem Sr. Systems Engineer EMC IT Big Data Analytics Journey Mahmoud Ghanem Sr. Systems Engineer Agenda 1 2 3 4 5 Introduction To Big Data EMC IT Big Data Journey Marketing Science Lab Use Case Technical Benefits Lessons Learned

More information

Adobe Deploys Hadoop as a Service on VMware vsphere

Adobe Deploys Hadoop as a Service on VMware vsphere Adobe Deploys Hadoop as a Service A TECHNICAL CASE STUDY APRIL 2015 Table of Contents A Technical Case Study.... 3 Background... 3 Why Virtualize Hadoop on vsphere?.... 3 The Adobe Marketing Cloud and

More information

CREATING A FOUNDATION FOR BUSINESS VALUE

CREATING A FOUNDATION FOR BUSINESS VALUE CREATING A FOUNDATION FOR BUSINESS VALUE Building initial use cases to drive predictive and prescriptive analytics ABSTRACT This white paper highlights three initial big data use cases that can help your

More information

Cask Data Application Platform (CDAP)

Cask Data Application Platform (CDAP) Cask Data Application Platform (CDAP) CDAP is an open source, Apache 2.0 licensed, distributed, application framework for delivering Hadoop solutions. It integrates and abstracts the underlying Hadoop

More information

This tutorial helps you to learn all the fundamentals of Talend tool for data integration and big data with examples.

This tutorial helps you to learn all the fundamentals of Talend tool for data integration and big data with examples. i About the Tutorial Talend is an ETL tool for Data Integration. It provides software solutions for data preparation, data quality, data integration, application integration, data management and big data.

More information

Modernizing Your Data Warehouse with Azure

Modernizing Your Data Warehouse with Azure Modernizing Your Data Warehouse with Azure Big data. Small data. All data. Christian Coté S P O N S O R S The traditional BI Environment The traditional data warehouse data warehousing has reached the

More information

Optimal Infrastructure for Big Data

Optimal Infrastructure for Big Data Optimal Infrastructure for Big Data Big Data 2014 Managing Government Information Kevin Leong January 22, 2014 2014 VMware Inc. All rights reserved. The Right Big Data Tools for the Right Job Real-time

More information

Spark and Hadoop Perfect Together

Spark and Hadoop Perfect Together Spark and Hadoop Perfect Together Arun Murthy Hortonworks Co-Founder @acmurthy Data Operating System Enable all data and applications TO BE accessible and shared BY any end-users Data Operating System

More information

Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer

Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer Redefine Big Data: EMC Data Lake in Action Andrea Prosperi Systems Engineer 1 Agenda Data Analytics Today Big data Hadoop & HDFS Different types of analytics Data lakes EMC Solutions for Data Lakes 2 The

More information

Data Ingestion in. Adobe Experience Platform

Data Ingestion in. Adobe Experience Platform Contents The challenges with data Adobe Experience Platform Data Ingestion in Adobe Experience Platform Data Ingestion Service Data Lake Conclusion Adobe Experience Platform helps customers to centralize

More information

Analytics for All Your Data: Cloud Essentials. Pervasive Insight in the World of Cloud

Analytics for All Your Data: Cloud Essentials. Pervasive Insight in the World of Cloud Analytics for All Your Data: Cloud Essentials Pervasive Insight in the World of Cloud The Opportunity We re living in a world where just about everything we see, do, hear, feel, and experience is captured

More information

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB

Data Analytics and CERN IT Hadoop Service. CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB Data Analytics and CERN IT Hadoop Service CERN openlab Technical Workshop CERN, December 2016 Luca Canali, IT-DB 1 Data Analytics at Scale The Challenge When you cannot fit your workload in a desktop Data

More information

Secure information access is critical & more complex than ever

Secure information access is critical & more complex than ever WHITE PAPER Purpose-built Cloud Platform for Enabling Identity-centric and Internet of Things Solutions Connecting people, systems and things across the extended digital business ecosystem. Secure information

More information

Datameer for Data Preparation: Empowering Your Business Analysts

Datameer for Data Preparation: Empowering Your Business Analysts Datameer for Data Preparation: Empowering Your Business Analysts As businesses strive to be data-driven organizations, self-service data preparation becomes a critical cog in the analytic process. Self-service

More information

Microsoft Big Data. Solution Brief

Microsoft Big Data. Solution Brief Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,

More information

Jason Virtue Business Intelligence Technical Professional

Jason Virtue Business Intelligence Technical Professional Jason Virtue Business Intelligence Technical Professional jvirtue@microsoft.com Agenda Microsoft Azure Data Services Azure Cloud Services Azure Machine Learning Azure Service Bus Azure Stream Analytics

More information

Advanced Analytics in Azure

Advanced Analytics in Azure Explore What s Possible. Advanced Analytics in Azure Amie Mason, Practice Lead Data Science & Analytics amiem@attunix.com The Attunix Difference business technology Attunix delivers results at the intersection

More information

IBM WebSphere Information Integrator Content Edition Version 8.2

IBM WebSphere Information Integrator Content Edition Version 8.2 Introducing content-centric federation IBM Content Edition Version 8.2 Highlights Access a broad range of unstructured information sources as if they were stored and managed in one system Unify multiple

More information

Oracle Infinity TM. Key Components

Oracle Infinity TM. Key Components Oracle Infinity TM Digital Analytics Oracle Infinity TM is an enterprise analytics solution that harnesses big data to provide actionable customer intelligence at scale, in real time and with unlimited

More information

Big and Fast Data: The Path To New Business Value

Big and Fast Data: The Path To New Business Value Big and Fast Data: The Path To New Business Value A Pivotal Overview Umair Riaz vspecialist 2 Gain Business Value with Big and Fast Data Pivotal Provides Agile Platform for Data-Driven Applications Ingest

More information

Actionable Insights with PI Integrators

Actionable Insights with PI Integrators Actionable Insights with PI Integrators Elizabeth Ammarell, Product Manager Joy Wang, Product Manager #OSIsoftUC #PIWorld 28 OSIsoft, LLC Agenda Introduction to PI Integrators Learn about Integrators and

More information

Pentaho 8.0 Overview. Pedro Alves

Pentaho 8.0 Overview. Pedro Alves Pentaho 8.0 Overview Pedro Alves Safe Harbor Statement The forward-looking statements contained in this document represent an outline of our current intended product direction. It is provided for information

More information

Mastering Your Data Power Your Connected Business With Your Master Data. Scott Walz, Sales Engineer June 27, 2018

Mastering Your Data Power Your Connected Business With Your Master Data. Scott Walz, Sales Engineer June 27, 2018 Mastering Your Data Power Your Connected Business With Your Master Data Scott Walz, Sales Engineer June 27, 2018 Agenda Boomi MDH Overview Product Demo Dell Boomi s Unified Platform Master Data Hub B2B/EDI

More information

Two offerings which interoperate really well

Two offerings which interoperate really well Microsoft Two offerings which interoperate really well On-premises Cortana Intelligence Suite SQL Server 2016 Cloud IAAS Enterprise PAAS Cloud Storage Service 9 SQL Server 2016: Everything built-in built-in

More information

Analytics in the Digital Economy data, experience, ideas & people. Juergen Hagedorn, Viktor Kehayov Product Management, SAP Analytics March 2017

Analytics in the Digital Economy data, experience, ideas & people. Juergen Hagedorn, Viktor Kehayov Product Management, SAP Analytics March 2017 Analytics in the Digital Economy data, experience, ideas & people Juergen Hagedorn, Viktor Kehayov Product Management, SAP Analytics March 2017 Our Portfolio Business Intelligence Data Warehousing End-to-end

More information

Hadoop and Analytics at CERN IT CERN IT-DB

Hadoop and Analytics at CERN IT CERN IT-DB Hadoop and Analytics at CERN IT CERN IT-DB 1 Hadoop Use cases Parallel processing of large amounts of data Perform analytics on a large scale Dealing with complex data: structured, semi-structured, unstructured

More information

Cisco Connected Asset Manager for IoT Intelligence

Cisco Connected Asset Manager for IoT Intelligence Cisco Connected Asset Manager for IoT Intelligence Enabling Digital Transformation Across Industries 1 2017 2017 Cisco Cisco and/or and/or its affiliates. its affiliates. All rights All rights reserved.

More information

BIG DATA TRANSFORMS BUSINESS. Copyright 2013 EMC Corporation. All rights reserved.

BIG DATA TRANSFORMS BUSINESS. Copyright 2013 EMC Corporation. All rights reserved. BIG DATA TRANSFORMS BUSINESS 1 Big Data = Structured+Unstructured Data Internet Of Things Non-Enterprise Information Structured Information In Relational Databases Managed & Unmanaged Unstructured Information

More information

MicroStrategy 10. Adam Leno Technical Architect NDM Technologies

MicroStrategy 10. Adam Leno Technical Architect NDM Technologies MicroStrategy 10 Adam Leno Technical Architect NDM Technologies aleno@ndm.net Other analytics solutions Agility or Governance Great for the Business User or Great for IT Ease of Use or Enterprise 10 Agility

More information

Organizations do not need a Big Data Strategy; they need a Business Strategy that incorporates Big Data

Organizations do not need a Big Data Strategy; they need a Business Strategy that incorporates Big Data Organizations do not need a Big Data Strategy; they need a Business Strategy that incorporates Big Data BILL SCHMARZO, CTO, DELL EMC GLOBAL SERVICES UNIVERSITY SAN FRANCISCO, SCHOOL OF MANAGEMENT EXECUTIVE

More information

Pentaho Technical Overview. Max Felber Solution Engineer September 22, 2016

Pentaho Technical Overview. Max Felber Solution Engineer September 22, 2016 Pentaho Technical Overview Max Felber Solution Engineer mfelber@pentaho.com September 22, 2016 Industry Leader in Self-Service Big Data Preparation Gartner recently completed a study on 36 selfservice

More information

Education Course Catalog Accelerate your success with the latest training in enterprise analytics, mobility, and identity intelligence.

Education Course Catalog Accelerate your success with the latest training in enterprise analytics, mobility, and identity intelligence. Education Course Catalog 2018 Accelerate your success with the latest training in enterprise analytics, mobility, and identity intelligence. Table of Contents WELCOME LETTER 3 JUMP START: YOUR MICROSTRATEGY

More information

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved. Apache Spark 2.0 GA The General Engine for Modern Analytic Use Cases 1 Apache Spark Drives Business Innovation Apache Spark is driving new business value that is being harnessed by technology forward organizations.

More information

Adobe and Hadoop Integration

Adobe and Hadoop Integration Predictive Behavioral Analytics Adobe and Hadoop Integration DECEMBER 2016 SYNTASA Copyright 1.0 Introduction For many years large enterprises have relied on the Adobe Marketing Cloud for capturing and

More information

Processing Big Data with Pentaho. Rakesh Saha Pentaho Senior Product Manager, Hitachi Vantara

Processing Big Data with Pentaho. Rakesh Saha Pentaho Senior Product Manager, Hitachi Vantara Processing Big Data with Pentaho Rakesh Saha Pentaho Senior Product Manager, Hitachi Vantara Agenda Pentaho s Latest and Upcoming Features for Processing Big Data Batch or Real-time Process big data visually

More information

TechValidate Survey Report. Converged Data Platform Key to Competitive Advantage

TechValidate Survey Report. Converged Data Platform Key to Competitive Advantage TechValidate Survey Report Converged Data Platform Key to Competitive Advantage TechValidate Survey Report Converged Data Platform Key to Competitive Advantage Executive Summary What Industry Analysts

More information

Boomi Basics: Going Beyond Integration with APIs, Data Management and Workflow Automation

Boomi Basics: Going Beyond Integration with APIs, Data Management and Workflow Automation Boomi Basics: Going Beyond Integration with APIs, Data Management and Workflow Automation Div Manickam, Product Marketing Manager, Dell Boomi Jay Mandl, Senior Sales Engineer, Dell Boomi 1 The Connected

More information

POWER NEW POSSIBILITIES

POWER NEW POSSIBILITIES POWER NEW POSSIBILITIES Solutions for your data analytics journey About this brochure This brochure explains the capabilities and benefits of the Dell EMC options for starting on and maturing in your data

More information

EMC Big Data: Become Data-Driven

EMC Big Data: Become Data-Driven 1 EMC Big Data: Become Data-Driven 2 What Is Big Data Exactly? Enterprise Internet 3 How Much Data Is There? 44 Zettabytes 1 ZB = 1B TBs 44 zettabytes is estimated to be 50 times the amount of all the

More information

Enabling Self-Service Analytics Across The UDA With Teradata AppCenter

Enabling Self-Service Analytics Across The UDA With Teradata AppCenter Enabling Self-Service Analytics Across The UDA With Teradata AppCenter Chaitanya Atreya Director, AppCenter Engineering, Teradata Jeremy Wilken AppCenter Architect, Product Manager, Teradata #TDPARTNERS16

More information

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Data Analytics Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Last 15 years IT-centric Traditional Analytics Traditional Applications Rigid Infrastructure Internet Next

More information

TECHNOLOGY PLATFORM STRATEGY

TECHNOLOGY PLATFORM STRATEGY TECHNOLOGY PLATFORM STRATEGY Dr. Wolfram Jost CTO and Member of the Executive Board UBS Technology One-on-One Conference 2018 March 7, 2018 SAFE-HARBOR-STATEMENT This presentation includes forward-looking

More information

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS)

ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS) ENABLING GLOBAL HADOOP WITH DELL EMC S ELASTIC CLOUD STORAGE (ECS) Hadoop Storage-as-a-Service ABSTRACT This White Paper illustrates how Dell EMC Elastic Cloud Storage (ECS ) can be used to streamline

More information

The Internet of Everything and the Research on Big Data. Angelo E. M. Ciarlini Research Head, Brazil R&D Center

The Internet of Everything and the Research on Big Data. Angelo E. M. Ciarlini Research Head, Brazil R&D Center The Internet of Everything and the Research on Big Data Angelo E. M. Ciarlini Research Head, Brazil R&D Center A New Industrial Revolution Sensors everywhere: 50 billion connected devices by 2020 Industrial

More information

Copyright 2012 EMC Corporation. All rights reserved.

Copyright 2012 EMC Corporation. All rights reserved. 1 BIG DATA TRANSFORMS BUSINESS Hatem ElMohandes EMC Qatar Technology Consultant Team Lead 2 IN 2000 THE WORLD GENERATED TWO EXABYTES OF NEW INFORMATION Sources: How Much Information? Peter Lyman and Hal

More information

Making Data Science Simple

Making Data Science Simple Making Data Science Simple IBM Code Tech Talk Oct 18 th 2017 https://developer.ibm.com/code/videos/tech-talk-replay-making-datascience-simple/ IBM Code Tech Talk Making Data Science Simple David Taieb

More information

By 2020, more than half of major new business processes and systems will incorporate some element of the IoT.

By 2020, more than half of major new business processes and systems will incorporate some element of the IoT. Trends in Analytics By 2020, more than half of major new business processes and systems will incorporate some element of the IoT. Gartner Unexpected Implications Arising From the Internet of Things report

More information

Store. Analyze. Preserve. Big Data Assets

Store. Analyze. Preserve. Big Data Assets Dell EMC Forum Cairo, 19 th April 2017 Ali Hassib Regional Sales Manager, ISD Dell EMC Store. Analyze. Preserve. Big Data Assets UNSTRUCTURED DATA TRENDS 90 % 650 % 80 % 70 % OF ALL DATA WAS CREATED IN

More information