A New Product: Hunk Splunk Analytics for Hadoop (BETA)

1 Copyright 2013 Splunk Inc. A New Product: Hunk Splunk Analytics for Hadoop (BETA)
Clint Sharp, Director of Product Management. #splunkconf

2 Legal Notices
During the course of this presentation, we may make forward-looking statements regarding future events or the expected performance of the company. We caution you that such statements reflect our current expectations and estimates based on factors currently known to us and that actual events or results could differ materially. For important factors that may cause actual results to differ from those contained in our forward-looking statements, please review our filings with the SEC. The forward-looking statements made in this presentation are being made as of the time and date of its live presentation. If reviewed after its live presentation, this presentation may not contain current or accurate information. We do not assume any obligation to update any forward-looking statements we may make. In addition, any information about our roadmap outlines our general product direction and is subject to change at any time without notice. It is for informational purposes only and shall not be incorporated into any contract or other commitment. Splunk undertakes no obligation either to develop the features or functionality described or to include any such feature or functionality in a future release.
Splunk, Splunk>, Splunk Storm, Listen to Your Data, SPL and The Engine for Machine Data are trademarks and registered trademarks of Splunk Inc. in the United States and other countries. All other brand names, product names, or trademarks belong to their respective owners. Splunk Inc. All rights reserved.

3 Announcing Hunk Beta: Splunk Analytics for Hadoop
New product from Splunk delivers interactive data exploration, analysis and visualizations for Hadoop.

4 A Lot of Organizational Data Ends Up in Hadoop
Challenges deploying and leveraging Hadoop:
- 20X services relative to software
- Inadequate skills for big data analytics
- 13+ Hadoop-related projects requiring integration (Chukwa, HBase, Avro, Mahout, Pig, ZooKeeper, Cassandra, Ambari, Hive, YARN)
- Data is too big to move
[Diagram: Hadoop (MapReduce & HDFS)]

5 October 2012: Splunk Hadoop Connect
To address common challenges deploying and running Hadoop.
Splunk Hadoop Connect: reliable bi-directional integration to Hadoop; >1000 downloads.
[Diagram: Splunk (ad hoc search, monitor and alert, report and analyze, custom dashboards; HA indexes and storage; commodity servers) connected by import, browse and export to Hadoop (MapReduce & HDFS)]

6 What About Extracting Value Directly from Hadoop?
How can we leverage the full capabilities of Splunk natively on data in Hadoop?
Data in Hadoop is too big to move.
[Diagram: Splunk (ad hoc search, monitor and alert, report and analyze, custom dashboards; HA indexes and storage; commodity servers) and Hadoop (MapReduce & HDFS)]

7 Hunk: Splunk Analytics for Hadoop
- Full-featured, integrated product: delivers interactive data exploration, analysis and visualization for Hadoop
- Insights for everyone: empowers broader user groups to derive actionable insights from raw data in Hadoop
- Distribution agnostic: works with leading distributions to maximize enterprise technology investments
[Diagram: Explore, Analyze, Visualize, Dashboards, Share on top of Hadoop (MapReduce & HDFS)]

8 Derive Actionable Insights from Raw Data
1. Point Splunk at Hadoop cluster
2. Immediately start exploring, analyzing and visualizing raw data in Hadoop
[Diagram: Explore, Analyze, Visualize, Dashboards, Share on top of Hadoop storage]

9 Explore, Analyze and Visualize Data, On-the-fly
Virtual index:
- Enables seamless use of the entire Splunk technology stack on data wherever it rests
- Hadoop virtual index automatically handles MapReduce
- Technology is patent pending
Schema-on-the-fly:
- Structure applied at search time
- No brittle schema to work around
- Automatically find transactions, patterns and trends
Flexibility and fast time to value:
- Normalization as it's needed
- Faster implementation
- Easy search language
- Multiple views into the same data

10 Interactive Data Exploration
Search and explore data from one place.
- Powerful search processing language (SPL)
- Designed for data exploration across large datasets: preview data, iterate quickly
- No requirement to understand data upfront
[Screenshot callouts: search assistant, search interface, interactive results window]

11 Interactive Data Analysis
Rapidly analyze and interact with data.
- Interactive analytics interface
- Deep analysis, pattern detection and finding anomalies with over 100 statistical commands
- Enrich results with information from external relational databases
[Screenshot callouts: formatting options, interactive analytics interface]

12 Powerful Platform for Enterprise Developers
Add new UI components and integrate into existing systems with known languages and frameworks.
API: JavaScript, Java, Python, PHP, C#, Ruby

13 Technology Overview

14 Hunk Server
Explore, Analyze, Visualize, Dashboards, Share
- splunkweb: web and application server (Python, AJAX, CSS, XSLT, XML)
- splunkd: search head, virtual indexes (C++, web services)
- Interfaces: REST API, command line, ODBC (beta)
- Hadoop interface: Hadoop client libraries (Java)
- 64-bit Linux OS

15 Connect to HDFS and MapReduce
Connect to Apache HDFS and MapReduce or your choice of Hadoop distribution.
[Diagram: the Hunk server from slide 14 (splunkweb, splunkd, REST API, command line, ODBC (beta), Hadoop interface with Hadoop client libraries, 64-bit Linux OS) connected to a Hadoop cluster]

16 Hunk Scales with Your Hadoop Deployments
Connect Hunk to multiple Hadoop clusters.
[Diagram: the Hunk server from slide 14 (splunkweb, splunkd, REST API, command line, ODBC (beta), Hadoop interface with Hadoop client libraries, 64-bit Linux OS) connected to Hadoop cluster 1, Hadoop cluster 2 and a further Hadoop cluster]

17 Prerequisites
- Data in Hadoop to analyze
- Hadoop client libraries
- Hadoop access rights
- Java 1.6+
- HDFS scratch space
- DataNode local temp disk space

18 MapReduce as the Orchestration Framework
1. Copy the splunkd binary (.tgz) from the Hunk search head to HDFS
2. Copy the .tgz to each TaskTracker (TaskTracker 1, TaskTracker 2, TaskTracker 3)
3. Expand in specified location on each TaskTracker
4. Receive binary in subsequent searches

19 Hunk Usage in HDFS
hdfs://<scratch_space_path>/
- bundles: search head bundles; keeps last 5 bundles
- packages: Hunk .tgz packages; no automatic cleanup
- dispatch/<sid>: search scratch space; cleanup when sid is invalid

20 Hunk Uses Virtual Indexes
- Enables seamless use of almost the entire Splunk stack on data in Hadoop
- Automatically handles MapReduce
- Technology is patent pending

21 Examples of Virtual Indexes
[Diagram: Hunk search head connected to external systems 1, 2 and 3, with virtual indexes index = syslog (/home/syslog/ ), index = apache_logs, index = sensor_data and index = twitter]

22 Define Virtual Indexes and Paths
[Diagram: an external resource (e.g. hadoop.prod) containing virtual indexes (e.g. twitter, sensor data, Apache logs)]
Specify virtual index and data paths, and optionally:
- Filter files or directories using a whitelist or blacklist
- Extract metadata or time range from paths
- Use props/transforms.conf to specify search time processing
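
The whitelist/blacklist filtering mentioned on this slide can be pictured with a small Python sketch. This is not Hunk's implementation or configuration syntax: the function `select_paths`, its parameters and the sample paths are all hypothetical, and the only assumption is that both lists are regular expressions matched against each file path.

```python
import re


def select_paths(paths, whitelist=None, blacklist=None):
    """Keep paths that match the whitelist (if any) and do not match the blacklist.

    Hypothetical helper for illustration; both filters are regular expressions.
    """
    allow = re.compile(whitelist) if whitelist else None
    deny = re.compile(blacklist) if blacklist else None
    selected = []
    for path in paths:
        if allow and not allow.search(path):
            continue  # not whitelisted
        if deny and deny.search(path):
            continue  # explicitly blacklisted
        selected.append(path)
    return selected


# Made-up example paths under a virtual index's data path.
paths = [
    "/data/apache/2013/10/01/access.log",
    "/data/apache/2013/10/01/access.log.tmp",
    "/data/apache/2013/10/01/_SUCCESS",
]
print(select_paths(paths, whitelist=r"\.log", blacklist=r"\.tmp$"))
```

Applying the whitelist first and the blacklist second means the blacklist always wins when both match, which is one reasonable convention among several.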

23 Search Data in Hadoop
Run a copy of splunkd to process.
[Diagram: Hunk search head with an external resource (e.g. hadoop.prod); NameNode; JobTracker (MapReduce resource manager in YARN); three DataNode / TaskTracker nodes (Node in YARN), each with a working directory; HDFS. Numbered flow labels include 1 JSON configs, 2 MapReduce jobs, 3 Tasks, 4 and 5.]

24 Data Processing Pipeline
Raw data (HDFS) > Custom processing > stdin > Indexing pipeline > Search pipeline
- Custom processing (MapReduce/Java): you can plug in data preprocessors, e.g. Apache Avro or format readers
- Indexing pipeline (splunkd/C++): event breaking, timestamping
- Search pipeline (splunkd/C++): event typing, lookups, tagging, search processors

25 Hunk Applies Schema on the Fly
- Structure applied at search time
- No brittle schema to work around
- Automatically find patterns and trends
Hunk applies schema for all fields, including transactions, at search time.

26 Search Optimization: Partition Pruning
- Most data types are stored in hierarchical directories, such as /<base_path>/<date>/<hour>/<hostname>/somefile.log
- You can instruct Hunk to extract fields and time ranges from a path
- Searches ignore directories that cannot possibly contain search results, such as time ranges outside of a defined range
Example time-based partition pruning:
Search: index=hunk earliest_time= T01:00:00 latest_time = T02:00:00
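
As a rough illustration of time-based partition pruning, the Python sketch below derives an hour-long time range from each path and drops paths whose range cannot overlap the search window. It is not how Hunk does it internally: the helper names, the `YYYYMMDD/HH` directory encoding and the dates are assumptions made for the example (the slide does not state the date format).

```python
import re
from datetime import datetime, timedelta

# Assumed layout: /<base_path>/<date as YYYYMMDD>/<hour as HH>/<hostname>/somefile.log
PATH_TIME = re.compile(r"/(\d{8})/(\d{2})/")


def partition_range(path):
    """Return the (start, end) hour covered by a path, or None if it has no time info."""
    match = PATH_TIME.search(path)
    if match is None:
        return None
    start = datetime.strptime(f"{match.group(1)} {match.group(2)}", "%Y%m%d %H")
    return start, start + timedelta(hours=1)


def prune(paths, earliest, latest):
    """Keep only paths whose time range overlaps [earliest, latest)."""
    kept = []
    for path in paths:
        time_range = partition_range(path)
        if time_range is None:
            kept.append(path)  # no time in the path, so it cannot be ruled out
            continue
        start, end = time_range
        if start < latest and end > earliest:
            kept.append(path)
    return kept


# Made-up paths and search window.
paths = [
    "/logs/20131001/00/web01/somefile.log",
    "/logs/20131001/01/web01/somefile.log",
    "/logs/20131001/02/web01/somefile.log",
]
window = (datetime(2013, 10, 1, 1), datetime(2013, 10, 1, 2))
print(prune(paths, *window))
```

Paths without recognizable time information are kept rather than dropped, since pruning is only safe for directories that provably fall outside the window.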

27 Search Performance with MapReduce
MapReduce considerations:
- Stats/chart/timechart/top/etc. commands work well in a distributed environment: they MapReduce well
- Time and order commands don't work well in a distributed environment: they don't MapReduce well
Summary indexing:
- Useful for speeding up searches
- Summaries could have different retention policy
- In most cases resides on the search head
- Backfill is a manual (scripted) process
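
Why counting-style commands "MapReduce well" can be seen in a short Python sketch: each split produces a small partial result independently, and partial results merge in any order. The event data and function names below are invented for illustration and say nothing about how Hunk implements its commands.

```python
from collections import Counter
from functools import reduce


def map_split(events):
    """Map phase: count events per status within one split, independently of other splits."""
    return Counter(event["status"] for event in events)


def merge_counts(left, right):
    """Reduce phase: merging partial counts is associative and commutative."""
    return left + right


# Made-up splits, standing in for blocks processed on different nodes.
splits = [
    [{"status": 200}, {"status": 404}],
    [{"status": 200}, {"status": 200}],
    [{"status": 500}],
]

partials = [map_split(split) for split in splits]
total = reduce(merge_counts, partials, Counter())
print(dict(total))
```

Commands that depend on global event order have no such small, order-independent partial result, which is one way to read the slide's remark that they do not distribute as well.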

28 Mixed-mode Search
Streaming: transfers first several blocks from HDFS to the Hunk search head for immediate processing.
Reporting: pushes computation to the DataNodes and TaskTrackers for the complete search.
- Hunk starts the streaming and reporting modes concurrently
- Streaming results show until the reporting results come in
- Allows users to search interactively by pausing and refining queries
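
The idea of starting a quick partial scan and a complete scan at the same time can be sketched in Python with two concurrent tasks. This is a toy model under stated assumptions, not Hunk's mechanism: the blocks are in-memory lists, `first_n` is an arbitrary choice, and all function names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def streaming_preview(blocks, predicate, first_n=2):
    """Scan only the first few blocks for a fast, partial count."""
    return sum(1 for block in blocks[:first_n] for event in block if predicate(event))


def reporting_search(blocks, predicate):
    """Stand-in for the distributed job: scan every block for the complete count."""
    return sum(1 for block in blocks for event in block if predicate(event))


def mixed_mode_search(blocks, predicate):
    """Start both modes concurrently; yield the preview first, then the final result."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        preview = pool.submit(streaming_preview, blocks, predicate)
        final = pool.submit(reporting_search, blocks, predicate)
        yield "preview", preview.result()
        yield "final", final.result()


# Made-up data: three blocks of raw events.
blocks = [["error a", "ok"], ["ok", "error b"], ["error c"]]
for stage, count in mixed_mode_search(blocks, lambda event: "error" in event):
    print(stage, count)
```

A caller would show the preview value until the final value arrives, then replace it.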

29 Flexible, Iterative Workflow for Business Users
Interactive analytics:
- Preview results
- Normalization as it's needed
- Faster implementation and flexibility
- Easy search language + data models & pivot
- Multiple views into the same data
[Diagram: cycle of Explore, Analyze, Model, Pivot, Visualize, Share]

30 Demo

31 Next Steps
- Download the .conf2013 Mobile App. If not iPhone, iPad or Android, use the Web App.
- Take the survey & WIN A PASS FOR .CONF2014, or one of these bags!
- Go to Technical Deep Dive: Hadoop Operations Management. Brera 6, Level 3. Today, 11:30-12:30pm.

32 Thank You


More information

Mobile Application Developer

Mobile Application Developer Mobile Application Developer The Mobile Application Developer career path prepares students to develop, test, debug and deploy hybrid mobile applications. This will require skills in application development

More information

Oracle Enterprise Data Quality Product Roadmap and Statement of Direction. October 2016

Oracle Enterprise Data Quality Product Roadmap and Statement of Direction. October 2016 Oracle Enterprise Data Quality Product Roadmap and Statement of Direction October 2016 Oracle Confidential Internal/Restricted/Highly Restricted 2 Safe Harbor Statement The following is intended to outline

More information

What s New. Bernd Wiswedel KNIME KNIME AG. All Rights Reserved.

What s New. Bernd Wiswedel KNIME KNIME AG. All Rights Reserved. What s New Bernd Wiswedel KNIME 2018 KNIME AG. All Rights Reserved. What this session is about Presenting (and demo ing) enhancements added in the last year By the team Questions? See us at the booth.

More information

Adobe and Hadoop Integration

Adobe and Hadoop Integration Predictive Behavioral Analytics Adobe and Hadoop Integration JANUARY 2016 SYNTASA Copyright 1.0 Introduction For many years large enterprises have relied on the Adobe Marketing Cloud for capturing and

More information

Spark, Hadoop, and Friends

Spark, Hadoop, and Friends Spark, Hadoop, and Friends (and the Zeppelin Notebook) Douglas Eadline Jan 4, 2017 NJIT Presenter Douglas Eadline deadline@basement-supercomputing.com @thedeadline HPC/Hadoop Consultant/Writer http://www.basement-supercomputing.com

More information

Hortonworks Connected Data Platforms

Hortonworks Connected Data Platforms Hortonworks Connected Data Platforms MASTER THE VALUE OF DATA EVERY BUSINESS IS A DATA BUSINESS EMBRACE AN OPEN APPROACH 2 Hortonworks Inc. 2011 2016. All Rights Reserved Data Drives the Connected Car

More information

Big data is hard. Top 3 Challenges To Adopting Big Data

Big data is hard. Top 3 Challenges To Adopting Big Data Big data is hard Top 3 Challenges To Adopting Big Data Traditionally, analytics have been over pre-defined structures Data characteristics: Sales Questions answered with BI and visualizations: Customer

More information

Rapid Start with Big Data Appliance X6-2 Technical & Operational Overview

Rapid Start with Big Data Appliance X6-2 Technical & Operational Overview Rapid Start with Big Data Appliance X6-2 Technical & Operational Overview Dirk Augustin Solution Architect Hardware Presales Germany The Realities of Today s Data Center... Accelerating Customer Expectations

More information

Managing Data in Motion with the Connected Data Architecture

Managing Data in Motion with the Connected Data Architecture Managing in Motion with the Connected Architecture Dmitry Baev Director, Solutions Engineering Doing It Right SYMPOSIUM March 23-24, 2017 1 Hortonworks Inc. 2011 2016. All Rights Reserved 4 th Big & Business

More information

Adobe Deploys Hadoop as a Service on VMware vsphere

Adobe Deploys Hadoop as a Service on VMware vsphere Adobe Deploys Hadoop as a Service A TECHNICAL CASE STUDY APRIL 2015 Table of Contents A Technical Case Study.... 3 Background... 3 Why Virtualize Hadoop on vsphere?.... 3 The Adobe Marketing Cloud and

More information

Operational Intelligence in Industrial Environments

Operational Intelligence in Industrial Environments Copyright 2014 Splunk Inc. Operational Intelligence in Industrial Environments Brian Gilmore, Splunk bgilmore@splunk.com @brianmgilmore Safe Harbor Statement During the course of this presentation, we

More information

Fun with Analytics. Marcello Lino SVP, Security Analytics Engineering James Sullivan VP, Security Analytics Engineering. Sep/2017 Washington, DC

Fun with Analytics. Marcello Lino SVP, Security Analytics Engineering James Sullivan VP, Security Analytics Engineering. Sep/2017 Washington, DC Fun with Analytics Marcello Lino SVP, Security Analytics Engineering James Sullivan VP, Security Analytics Engineering Sep/2017 Washington, DC Forward-Looking Statements During the course of this presentation,

More information

Big Data Analytics met Hadoop

Big Data Analytics met Hadoop Big Data Analytics met Hadoop Jos van Dongen Arno Klijnman What is Distributed storage and processing of (big) data on large clusters of commodity hardware HDFS Map/Reduce HDFS - Distributed storage for

More information

Operational Hadoop and the Lambda Architecture for Streaming Data

Operational Hadoop and the Lambda Architecture for Streaming Data Operational Hadoop and the Lambda Architecture for Streaming Data 2015 MapR Technologies 2015 MapR Technologies 1 Topics From Batch to Operational Workloads on Hadoop Streaming Data Environments The Lambda

More information

Optimal Infrastructure for Big Data

Optimal Infrastructure for Big Data Optimal Infrastructure for Big Data Big Data 2014 Managing Government Information Kevin Leong January 22, 2014 2014 VMware Inc. All rights reserved. The Right Big Data Tools for the Right Job Real-time

More information

Hortonworks Data Platform for Enterprise Data Lakes delivers robust, big data analytics that accelerate decision making and innovation

Hortonworks Data Platform for Enterprise Data Lakes delivers robust, big data analytics that accelerate decision making and innovation IBM United States Software Announcement 218-187, dated March 20, 2018 Hortonworks Data Platform for Enterprise Data Lakes delivers robust, big data analytics that accelerate decision making and innovation

More information

An Introduction to Splunk IT Service Intelligence (ITSI)

An Introduction to Splunk IT Service Intelligence (ITSI) An Introduction to Splunk IT Service Intelligence (ITSI) Brief introduction to ITSI s goals, use cases and a demo Alok Bhide Director of Product Management, ITSI September 26, 2017 Washington, DC Forward-Looking

More information

MicroStrategy CTO newsletter

MicroStrategy CTO newsletter Dear MicroStrategy Customer, At MicroStrategy, we take our responsibility to you very seriously. Every morning my leadership team meets with me to review the state of our products and customers. We discuss

More information

Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer

Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer Redefine Big Data: EMC Data Lake in Action Andrea Prosperi Systems Engineer 1 Agenda Data Analytics Today Big data Hadoop & HDFS Different types of analytics Data lakes EMC Solutions for Data Lakes 2 The

More information

Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved.

Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved. Welcome! 2013 SAP AG or an SAP affiliate company. All rights reserved. 1 SAP Big Data Webinar Series Big Data - Introduction to SAP Big Data Technologies Big Data - Streaming Analytics Big Data - Smarter

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information

E-Business Suite: BI Publisher for Developers Volume I - Student Guide

E-Business Suite: BI Publisher for Developers Volume I - Student Guide E-Business Suite: BI Publisher 5.6.3 for Developers Volume I - Student Guide D59123GC10 Edition 1.0 January 2011 D59936 Disclaimer This document contains proprietary information and is protected by copyright

More information

By: Shrikant Gawande (Cloudera Certified )

By: Shrikant Gawande (Cloudera Certified ) By: Shrikant Gawande (Cloudera Certified ) What is Big Data? For every 30 mins, a airline jet collects 10 terabytes of sensor data (flying time) NYSE generates about one terabyte of new trade data per

More information

Who is Databricks? Today, hundreds of organizations around the world use Databricks to build and power their production Spark applications.

Who is Databricks? Today, hundreds of organizations around the world use Databricks to build and power their production Spark applications. Databricks Primer Who is Databricks? Databricks was founded by the team who created Apache Spark, the most active open source project in the big data ecosystem today, and is the largest contributor to

More information

The Expert Guide To Mobile Loyalty Sign-Up & Engagement

The Expert Guide To Mobile Loyalty Sign-Up & Engagement The Expert Guide To Mobile Loyalty Sign-Up & Engagement Loyalty MarkeBng Challenges Three common themes consistently rise to the top when CodeBroker speaks with loyalty markebng professionals: 1. Engagement

More information

Jason Virtue Business Intelligence Technical Professional

Jason Virtue Business Intelligence Technical Professional Jason Virtue Business Intelligence Technical Professional jvirtue@microsoft.com Agenda Microsoft Azure Data Services Azure Cloud Services Azure Machine Learning Azure Service Bus Azure Stream Analytics

More information

INTRODUCTION AUX APPLICATIONS CLOUD NATIVE AVEC PIVOTAL READY SYSTEM

INTRODUCTION AUX APPLICATIONS CLOUD NATIVE AVEC PIVOTAL READY SYSTEM INTRODUCTION AUX APPLICATIONS CLOUD NATIVE AVEC PIVOTAL READY SYSTEM EMMANUEL BERNARD PRINCIPAL SYSTEM ENGINEER, CLOUD PLATFORM SPECIALIST DELL EMC @_ebernard GLOBAL SPONSORS Every Business is Becoming

More information