Get started with Service Reliability Management

  • Release version: Washingtondc
  • Updated February 1, 2024
  • 2 minutes to read
  • Summarize
    Summarized using AI
    This content was generated using new OpenAI-powered functionality. Results are provided on an as is basis and are not guaranteed to be accurate or complete.

    Summary of Get Started with Service Reliability Management

    Service Reliability Management (SRM) facilitates a comprehensive understanding of service health in relation to service level objectives and incident resolution. It supports IT Operations Management and DevOps teams in achieving agility, performance, and uptime. This guide outlines the essential components of the SRM interface and basic setup tasks to enhance your experience.

    Show full answer Show less

    Key Features

    • Setup Guide Modules: Learn to configure your Homepage, Services, and Teams landing pages, ensuring a successful SRM setup.
    • Homepage Overview: Access critical alerts, incidents, and tasks assigned to you or your team.
    • SRM Navigation: Familiarize yourself with the interface's sections and elements, crucial for efficient usage.
    • Team and Service Management: Create and administer teams responsible for service issues and define service parameters.
    • On-Call Scheduling: Establish on-call schedules and escalation policies to ensure timely incident responses.
    • Service Relationships: Utilize a map canvas to configure and arrange service dependencies.
    • Third-Party Integrations: Connect SRM with monitoring tools like Datadog for streamlined alert notifications.
    • SLOs, SLIs & Error Budgets: Set operational goals and failure tolerances for your services.
    • Alert Automations: Define alert conditions and rules, available with the Alert Automations application from the ServiceNow Store.
    • Change Management: Monitor change requests relevant to your reliability tasks.

    Key Outcomes

    By getting started with SRM, you can effectively manage service reliability, enhance team responsiveness to incidents, and maintain service performance in alignment with established objectives. This ensures your organization can meet its service commitments and improve overall operational efficiency.

    SRM accelerates your path to viewing service health in context of service level objectives and incident resolution. Helps IT Operations Management and DevOps teams deliver on the promise of agility, performance, and uptime.

    Get started with SRM to understand what are the different sections on the SRM interface.

    For more information on roles, see SRM roles and responsibilities.

    Basic SRM tasks

    Step Description Reference
    Set up guide modules Our Setup guide modules for your Homepage, Services, and Teams landing pages show you how to add a team or a service. It includes all the key milestones to make sure you’re set up for success. Add an SRM team

    View an SRM service

    Your homepage Your home page is where you’ll find the things most important to you. Like the Services with critical alerts and incidents, work assigned to you and your team. SRM (SRM) home page
    Learn the ins-and-outs of SRM navigation. You can get familiar with the different sections and elements of the SRM interface. The sections and elements are used throughout our documentation. SRM interface

    SRM helps you when you need to create and administer teams, services, and integrations.

    Table 1. Optional SRM tasks
    Step Description See this

    Manage an application or technical service in SRM

    Define the basic tasks and parameters that make up your service and how it should behave. Add a service to SRM

    Set up an SRM team

    Set up a team. Teams are responsible for the issues that occur in the associated services. Add an SRM team

    Set up on-call schedule and escalation policies

    Create an on-call schedule for your team to ensure to ensure that dedicated support team members are available to resolve issues as they arise. You can set up an escalation policy for your team so that at least one team member is engaged in incident response. Create your SRM On-call schedule

    Configure service relationships

    Use a map canvas to add, configure, and arrange services. You can add child services that depend on parent services.

    Integrate services with third-party monitoring tools

    Set up a third-party integration, such as Datadog or ServiceNow Cloud Observability, with SRM so that alerts are available to your teams within SRM. Add an integration to SRM.

    Establish SLOs, SLIs & error budgets for services

    Establish goals for how well your service should operate. Also specify the maximum amount of time that a technical system can fail without contractual consequences. Create an SLO, an SLI, and Error budget policies for SRM.
    Set up alert automations
    Note:
    This functionality is only available if you have installed the Alert automations application.
    Alert automations allow you to define alert conditions. Set up alert rules for each APM tool to define the conditions when the APM tool should send notifications to SRM. The Alert automation application is available upon request, from the ServiceNow Store.

    View change

    View change requests related to your reliability tasks. View a change request for SRM service approval

    SRM flow