Monitoring: Know Your Audience

Similar documents
SSL ClearView Reporter Data Sheet

With Aruba Central, you get anywhere-anytime access to ensure that your network is up and performing efficiently.

Brocade SANnav Management Portal and. Global View. Product Brief. Storage Modernization. Highlights. Brocade Fabric Vision Technology

Cisco Tidal Enterprise Reporter

The Leading Low-code Application Platform For Modern Work Management

Service management solutions White paper. Six steps toward assuring service availability and performance.

VideojetConnect TM Remote Service. For select Ethernet-enabled Videojet printers

Bringing Nagios IT Monitoring From Good to Great

Disaster Recovery Service Guide

Introducing a hosted CRM system designed exclusively for telecoms, fixed-line & IT resellers. ordering

Delivering high-integrity accounting with Xero

Configurable Policy Enforcement. Automated Remedy Actions. Granular Reporting - Scheduled and On-Demand

Automated Service Intelligence (ASI)

Brochure. IT Operations Management. Enhance Data Protection with Analytics and Insights. Micro Focus Backup Navigator for Micro Focus Data Protector

Key Benefits. Overview. Field Service empowers companies to improve customer satisfaction, first time fix rates, and resource productivity.

ITSM + ITOM = Outsmart Service Outages

April Microsoft Corp. All rights reserved

Using Micro Focus Chatbots with Microsoft Teams

Ensuring High Service Levels in Enterprise Management

SOLVE PROBLEMS FASTER 4 WAYS TO EMPOWER YOUR IT SERVICE MANAGEMENT

The first call handling software for 9-1-1

Actionable Information Instantly Delivered

Enterprise Modeling to Measure, Analyze, and Optimize Your Business Processes

HYBRID FAX CRITICAL BUSINESS CONSIDERATIONS

1 Vendor Management Module v6.0 User s Guide

Moving to Mobility: The Data Collection Checklist

IBM Cloud Application Performance Management

A Framework Approach to Ensuring Application Recovery Readiness. White Paper

Services. Dell ProSupport TM. Improve productivity and optimize resources with efficient, flexible, and reliable support

HQX HQX. HQXchange Is Your Single Comprehensive EDI Platform

Exception Handling in Service-Oriented Architecture

Optanix Platform The Technical Value: How it Works POSITION PAPER

Modernize your grid: Simplify smart metering with an intelligent partner.

IBM Tivoli Monitoring

AppManager + Operations Center

Accelerate and assure wireless services with intelligent solutions for wireless network and service management.

Decision Maker s Guide

Mastering the operational complexity of IoT Applications

Verint Engagement Management Solution Brief. Overview of the Applications and Benefits of

Cloud Automation a beginner s guide

Better information, better results siemens.com/xhq

ALTIRIS Implementing an Asset and Contract Management System

REVISED 6 NOVEMBER 2018 COMPONENT DESIGN: WORKSPACE ONE INTELLIGENCE

Optimizing Service Assurance with Vitria Operational Intelligence

Brochure. VIAVI Observer Apex. Centralized performance monitoring and troubleshooting

We help companies operate responsibly and sustainably, grow with a clear understanding of strategic risk and

Moving to Service Centric Management with HP OMi

Infor Cloverleaf Integration Suite

Five Stages of IoT. Five Stages of IoT 2016 Bsquare Corp.

Data Sheet. VIAVI Observer Apex. Centralized performance monitoring and troubleshooting

Gain strategic insight into business services to help optimize IT.

NetIQ AppManager Plus NetIQ Operations Center

Carahsoft End-User Computing Solutions Services

Service Manager Simplifying modern ITSM

Opsview Enterprise Architecture Whitepaper

WHITE PAPER. Managing the Intelligence Life Cycle: Title A More Effective Way to Tackle Crime

Enterprise Performance Management Bridging the Gap from Strategy to Operations

Maintenance and Service Management User Guide

Solution White Paper Drive Radical Business Value with a High-Speed IT Organization

Enabling a Comprehensive Platform for BCMP that integrates People, Process and Technology

Accelerate Your Riverbed

Avaya Aura MANAGEMENT. The Complete Solution for Intelligent Monitoring and Management of the Entire Avaya Ecosystem

20246D: Monitoring and Operating a Private Cloud

Knowledge Management Process

What is Castleton P2P?

SysAid. Service Level Agreement Service Level Management (SLA/SLM)

The Sumo Logic Solution: Application Management

TABLE OF CONTENTS DOCUMENT HISTORY

LAVASTORM lavastorm.com. Five Technologies that Transform Auditing to Continuous Business Improvement

Small Business CRM Evaluation Guide

Architecting JIRA for the Enterprise. JIRA is a powerful product, both flexible and highly configurable. Miles Faulkner

Solution White Paper Drive Radical Business Value with a High-Speed IT Organization

White Paper. Managed IT Services as a Business Solution

You can plan and execute tests across multiple concurrent projects and people by sharing and scheduling software/hardware resources.

DevOps Guide: How to Use APM to Enhance Performance Testing

PhaseWare Tracker Beyond

IBM Cloud Application Performance Management

STORYTELLING WITH KPIS

ROOT CAUSE ANALYSIS: YOUR UNTAPPED RESOURCE

Salesforce Knowledge. Overview and Best Practices

Why an Open Architecture Is Vital to Security Operations

Cut Costs and Improve Agility by Automating Common System Administration Tasks

The importance of the right reporting, analytics and information delivery

Developing Your Business s Technology Strategy

Oracle Risk Management Cloud. Release 13 (updates 18A 18C) What s New

Capacity Management - Telling the story

SYNTHETIC ACTIVE MONITORING. Copyright 2015 TestPoint All Rights Reserved

Data Center Infrastructure Scorecard Governance, RAM, and Cost-to-Serve Assessment

Safer Pipeline Operations: Smart Notifications for Faster Incident Response

Get The Best Out Of Oracle Scheduler

SPOK e.notify. Enabling Sophisticated, Efficient Incident Management

Top six performance challenges in managing microservices in a hybrid cloud

SOLUTION BRIEF DOSSIER MANAGEMENT

Overview. About NewVoiceMedia 2 / 8

Driving Radical Customer Service Innovation Move beyond operational demands to deliver proactive strategies that drive business growth

Veritas NetBackup Self Service Release Notes

Protect Your SharePoint Data

PROGNOSIS FOUNDATION FOR BASE24-eps (UNIX)

Service management solutions White paper. Integrate systems management and predictive intelligence with IBM Service Management solutions.

Implementing a Service Management Architecture

Transcription:

Monitoring: Know Your Audience Best Practice for Opsview Business Service Monitoring and Keywords Opsview Technical Overview

Page 2 Monitoring: Know Your Audience Contents Summary...3 Modeling Your Business in Opsview...4 BSM Components...4 Keywords...5 BSM Services...7 Suitable Visibility for Separate Target Audiences...8 Dashboards...8 Notifications...10 Reporting...10 Conclusion...12

Page 3 Summary Monitoring data, like all operations data, is at its most valuable when it leverages a presentation layer that puts the information in the proper context for any audience. When QoS values like uptime and throughput make up strict SLA requirements it becomes essential to make sure that the correct metrics and status information goes to the right audience. Complex IT environments leverage the concept of redundancy as a way of creating resilience and of improving overall performance. While high availability, disaster recovery, and load balancing clusters provide invaluable peace of mind for stakeholders, it manages to significantly complicate SLA reporting at the same time. How can this qualitative peace of mind be translated into a quantitative and reportable SLA value? At the same time, it is important to make sure that isolated outages are still resolved before they can propagate into a true loss of service. The more efficiently that isolated outages can be prioritized and resolved, the better the overall SLA report will be in the end. It is, therefore, important to stay ahead of isolated outages in order to ensure SLA requirements for redundant and resilient IT offerings. It is worth pointing out that these two levels of granularity are valuable to two completely separate target audiences. Real life, end user availability is valuable for service consumers such as customers, executive stakeholders, or compliance departments. SLA reporting against individual IT services running on hosts is best utilized by administrators and team leads. This is the primary focus of this white paper; that building business rules into a monitoring solution and properly reporting to application consumers and application administrators appropriately can improve visibility and communication between the two parties and contribute to the overall success of the business. This guide will go over the use cases for the Opsview BSM feature and for Opsview Keywords and will demonstrate how to get the most out of both features. It will cover the creation of BSM components and matching keywords to be used as a means of drilling into the component for extra information. It will then cover some effective dashboard configurations, notification rules, and reporting practices.

Page 4 Modeling Your Business in Opsview The presentation layer is a vital element to any business intelligence platform. Without being able to decode the underlying data to the business terms that they represent, the information would provide little benefit to anyone. Therefore, like any other business intelligence tool, Opsview must provide a presentation layer so that real architecture rules can be modeled in the tool providing accurate end user availability information. By being thorough and doing this correctly, risk can be identified and resolved before problems effect the service consumer rather than allowing a costly outage to happen and tracing the failure back to a root cause. BSM Components The first essential things to define in Opsview are all BSM Components. BSM components are a functional grouping of hosts and services together with an operating region used to determine the overall health and priority of the grouping. These are commonly used to define clusters, farms or failovers. It is a way to simplify the complicated architecture rules to the application by approaching it in pieces. The way that a component is defined starts with the Opsview Host Template feature. This is the same feature that allowed service checks to be applied to hosts in bulk by function. It is similarly able to group the check results by the function. This will serve as the starting point for creating any component. Selecting the desired Host Template filters the host selection box to only display hosts that are currently using that template. These are all hosts that are being monitored in the same way because functionally they are all nearly the same. It is then the job of the Opsview administrator to determine which of these common hosts are contributing to a shared goal like a cluster of Solaris servers would be. This supports an arbitrary number of hosts to be grouped together with an operational zone applied to the newly created BSM Component. This Operational Zone indicates

Page 5 the total percentage of the component that needs to be healthy for the entire component to be considered effective. This now means that small failures within a cluster are flagged as a potential impact to service rather than immediately being marked as a failure. Keywords Before BSM Components, keywords were the only way to group together services to evaluate the impact of a critical event. The figure below shows the process for creating a complementary keyword to the BSM component that was made in the previous step. It is best practice to create these complementary keyword/component pairs for every reportable component. The reason for this will become clearer in later stages of this exercise For a BSM component complement, it is best to make a Keyword with the same name as the component that is grouped by service. This way each of the functions the cluster needs to provide are accounted for and can be broken down by each node.

Page 6 Configuring this complementary keyword to reflect the same hosts and services as the component is a manual process. The hosts are selected first. These are the same hosts selected to create the component. Next there is a check box to Filter by selected hosts so that choosing the correct service checks is less daunting. By selecting the same checks that would be included in the host template, Opsview will now have a grouping that is effectively the component minus the resiliency rules.

BSM Services Page 7 The next step is to model the consumable service. Examples of these consumables could include VoIP phone systems, email, the company website, collaboration portals, and other various applications. The status of these consumables are of particular interest to anyone who is looking from the outside in. The list of those interested could include customers, executive stakeholders, compliance departments, and auditors. In order to provide an accurate status of these consumables they must first be properly modelled within the monitoring software. By understanding the anatomy of the application, website, or workflow and representing its uptime needs properly it is possible to foster a culture that concentrates on proactive troubleshooting rather than fighting fires. This may appear to be an intimidating task but the difficult part has already been accomplished. These consumable business services are simply a grouping together of BSM components which already have been assigned priority and itemized SLA requirements when the operational zone was defined. When creating a BSM service, there will be a section named the Components Drawer. This can be filtered with the text box adjacent to it to make components easier to find. To create a BSM service, simply click and drag components from the drawer into the BSM service.

Suitable Visibility for Separate Target Audiences Page 8 Monitoring is a three phase effort: data collection, presentation, and response. Now that Opsview has been configured to model consumable business services, both in BSM and keywords, it is time to move on to the second and third phases of the project. First, real-time presentation views will be created, then notifications rules will be put in place keywords, BSM components, and BSM services according to the recipient and finally historical reporting will be defined for both administrators and for external audiences alike. Dashboards Dashboards should be valuable both for tactical and strategic audiences. High-level views are often the most important things for customers or executives to see. This view would have very little detail and will rarely use keywords, if at all. One of the most powerful tools at your disposal for executive dashboards is the filtering feature on the BSM Summary dashlet. By placing a BSM Summary dashlet and configuring the settings correctly it is possible to view a subset of services that may be of importance to you and it is also possible to filter by status. By unchecking the Operational box, the widget is now set to be a traffic light for application level statuses. If any BSM services appear in the widget at all, it means that they are either in scheduled maintenance mode, impacted by an underlying problem, or in a full failure. This is an ideal way to see IT operations at the highest levels possible.

Page 9 The next task is to create dashboards that show details about the BSM service or BSM component for application owners or administrators respectively. A good practice for dashboarding is to create a user for the purpose of holding shared dashboards for other contacts. For the purpose of displaying Business Service views, the contact BSM with local authentication should be created. This user should have VIEWALL access, CONFIGUREBSM access, and DASHBOARDEDIT access at a minimum. In the dashboard tab for this user the following layout can be created per BSM to provide some detailed value for the top level Business Service. For larger environments with many applications and services it may be a good idea to create an additional contact named COMPONENTS to hold component level dashboards to be shared with others. Dashboards for this contact may look like the example below.

Page 10 Each of these contacts should share their dashboards with the roles that might find value in them. This practice now essentially provides saved dashboards that can be pulled up and deleted as needed by other Opsview contacts. This saves dashboard space without having to recreate these views from scratch every time that they are needed. The list of shared dashboards that are available should display BSM: <dashboard name> or Component: <dashboard name> for each choice, making it easy to find. Notifications With the addition of the BSM feature in Opsview it is now possible to set a notification rule for BSM services or components rather than the previous choices of host groups, service groups, and keywords. As with everything else, the audience dictates the alerting requirements. BSM service notifications are ideal for application owners and should then escalate to management levels. BSM component notifications are directed towards team leads of various disciplines such as database, server, and network teams with an escalation to the appropriate architect. This leaves the legacy host group, service group, and keyword based notifications for the front lines of monitoring where every new issue should be investigated as quickly as possible. Reporting The ultimate goal of reporting as it relates to monitoring is to tell a story that is both accurate and puts IT operations in a positive light. Accuracy can actually be flattering when it comes to SLAs in such complicated architectures. High availability, load balancing, site failovers, and other architecture concepts are implemented because they make for a more stable environment. This means that reporting on SLAs as it relates to a BSM service is going to be a real depiction of end user availability. Opsview now provides SLA reporting against the application or business service so that this information can be sent to management or a customer.

Page 11 It is important, however, to make sure that this report will be telling a positive story at the end of the day, week, month, or year. An ideal way to make sure that the final, often automated, report is going to satisfy the SLA requirements is to stay in front of application outages by maintaining the components that make them up. To do this, Daily Service Level Reports and Daily Performance Reports can be run against the keyword that was created to complement the components that make up the BSM service.

Page 12 These can be regularly scheduled to be sent to the appropriate administrators so that individual service checks and host failures can be corrected before they exceed the component s operational zone. By reporting both at the BSM service level and at the component/keyword level, different audiences can be provided the correct level of granularity for their individual needs. Conclusion Monitoring is made up of three factors: data collection, presentation, and action. In order to use this information for high value business intelligence reasons it is important to always keep in mind, Who is my audience? Monitoring data can be used for an immediate response by an administrator that specializes in networks, servers, or hardware as appropriate. This same data can be rolled up for team leads, architects, and application owners. Finally, at the highest level, transparency should be provided for executive stakeholders and the expected consumers of these applications and business services. This way risks can be identified at every stage and prevented instead of root causes being determined after outages occur. This promotes a culture of better communication and better relationships between Information Technology and other business units in the organization leading to better overall operations.