Software EMEA Performance Tour 2013 17.-19 Juni, Berlin
Service Health & Ops Analytics Die nächste Generation des IT Monitorings Matthias Precht / Juni, 2012
Agenda Eine kurze Betrachtung der heutigen Herausforderungen im IT Management Service Health Lösungen von HP Was verbirgt sich dahinter? Operations Analytics Die Wiederauferstehung der Logfile Überwachung oder mehr?
Challenges for IT Being able to prevent or resolve issues quickly KNOWN issues Need to be able to monitor, prevent and resolve UNKNOWN issues Don t know what new issues they might encounter Too much DATA Need the right solutions to quickly search and analyze LOGS PREVENT, RESOLVE DETERMINE ANALYZE 4
Reactive Monitoring Proactive Monitoring A unified approach to solving IT Management problems Operations Manager i + TBEC Advanced Correlation Advanced Analytics NNMi Operations Manager Operations Intelligence SiteScope Event Triage BSM End User Monitoring Log Management Known Problems Unknown Problems 5
Next-generation analytics is driving new ways of organizations to make decisions. This trend is about using pattern recognition to optimize, simulate, and predict - PC Magazine By 2016, 20% of Global 2000 enterprises will have an IT operations analytics architecture in place, up from less than 1% today - Gartner Predictive analytics is emerging as a game-changer. It helps answer What's next? and What should we do about it? - Forbes.com
Agenda Eine kurze Betrachtung der heutigen Herausforderungen im IT Management Service Health Lösungen von HP Was verbirgt sich dahinter? Operations Analytics Die Wiederauferstehung der Logfile Überwachung oder mehr?
HP Service Intelligence Analytics that converts data into actionable knowledge Service Health Analyzer (SHA) Anticipate issues before they occur Service Health Optimizer (SHO) Service Health Optimizer (SHO) Optimize your IT engine Service Health Reporter (SHR) Service Health Reporter (SHR) Understand issues from a services point of view Service Level Management (SLM) Align IT to business and gauge total health Predictive analytics Capacity management Cross-domain reporting SLA/KPI dashboard 8
HP Service Health Analyzer (SHA) Predictive Analytics 1. Anticipate problems before the business is impacted and prevent downtime 2. Automatically correlate information from multiple domains 3. Reduce cost of handling events by proactively investigating anomalies 4. Self learning system 9
Predictive analytics how SHA works Aspects the Realtime Anomaly Detection Engine uses to define anomaly Baseline Self learning, automatic threshold creation Topology Determines if metrics and topology info are related Temporal Real anomaly or spike? Anomaly DNA Can match current anomalies with past ones Statistical learning Determines normal state for specific service, suppresses anomaly noise Statistical Learning Algorithm Anomaly DNA Technology Baseline RAD Engine (Core of SHA) Temporal Analysis Topology Analysis 10
Metrics monitored by SHA collects to detect anomalies Creates a dynamic baseline with seasonality based on historical metric data: Business Process Monitor Real User Monitor Diagnostics SiteScope OM/PA NNM 3rd Party 11
SHA discovers issues, generates event Metrics BPM/RUM Performance Agent Diagnostics Identify lead suspects and see available Run-books Assess business impact See affected applications/services SiteScope See locations impacted NNM 3 rd Party Data See similar anomalies from past and their associated tickets 12
SHA Topology View investigate the root cause Use the various tools in the Topology View to assist you in investigating the root causes of the anomaly 13
Agenda Eine kurze Betrachtung der heutigen Herausforderungen im IT Management Service Health Lösungen von HP Was verbirgt sich dahinter? Operations Analytics Die Wiederauferstehung der Logfile Überwachung oder mehr?
A new approach: Operational Analytics Making the most of your IT data With complexity of today s heterogeneous environment sprawl, IT is faced with a new set of questions: How do I know what s important? Collect everything How do I know when I ll need the data? Store everything What am I to make of all this information? Analyze anything 15
Consolidated Ops the Service and Ops Bridge Federate all fault & performance information to one place OMi BSM Platform ArcSight Logger Run-time Service Model OOTB Integrations Open BSM Connector Interfaces HP Integration Adapters NNMi IBM Tivoli Microsoft SCOM NAGIOS 3 rd Party Domain Mgrs APM SiteScope OM/PM 16 Appliance/ Virtualization Application Infrastructure App Applications Storage Mobile Cloud Clients Network Systems
Typical Operations Windows and Linux, web servers, web app servers, databases, all virtualized, all monitored 17
Operational Analytics What the IT operations will get 18
Operating system logs : Linux and Windows Deep insight into the OS of the managed systems Benefit OMi usually only receives events when an error or issue has been detected But: much more information is available on the systems that 19 speeds up troubleshooting Allows you to define your monitoring strategy Examples Processes that were started or stopped on the system Logged in users or denied login request Application startup messages What you will get Quick start guide to connect the Linux Syslog or Windows Event Log to Logger Logger content pack with search queries to build dashboards OMi content pack with cross launch tools to show Logger in the context of an event
HP Operations Analytics in the future Advanced analytics based on COMPLETE Operations data Cloud Network Apps Storage Texts Events Data Topology Metrics Logs Structured & Unstructured data HP & 3 rd party sources IT Search Across logs, events, topology, performance metrics Guided Troubleshooting Anomaly / outlier detection and suggestions Operations Analytics Visual Analytics Intuitive, visual depiction of relationships, impacts 24
Powerful global search, guided troubleshooting & breakthrough visual analytics to provide actionable intelligence IT Search Guided Troubleshooting Visual Analytics Reduce Escalations Reduce Downtime Faster Triage Visibility to non-ops Boost Collaboration Improve SLAs Faster 25
Capitalizing on HP Software assets HP BSM Performance Metrics Events (alerts, informational) Topology content/context Scalable BigData platform Columnar Store Database Close to data analytical functions Extensible algorithms written in R Vertica HP Operations Analytics ArcSight Logger Collects machine data from any log-generating source Indexing, compression, storage and search Internal or Federated Logger 26
Reactive Monitoring Proactive Monitoring A unified approach to solving IT Management problems Operations Manager i + TBEC Advanced Correlation Service Health Analyzer Advanced Analytics NNMi Operations Manager SiteScope Event Triage Operations Intelligence BSM End User Monitoring Operations Analytics Log Management Known Problems Unknown Problems 27
Resources HP Software http://www.hp.com/software Service Health Service Health Analyzer Service Health Reporter Service Health Optimizer Service Level Management http://www.hp.com/go/sha http://www.hp.com/go/shr http://www.hp.com/go/sho http://www.hp.com/go/apm Ops Analytics Operations Analytics http://www.hp.com/go/opsanalytics (Free Version Download!) http://www.youtube.com/user/hewlettpackardvideos Matthias Precht matthias.precht@hp.com 28
Vielen Dank