After the fact: Post Incident Analysis Reporting in OpsGenie

Apr 26, 2018 by Emel Dogrusoz

 

If you work with large-scale, complex, and distributed systems, you must embrace the idea that incidents are inevitable. When an incident occurs, as well as fixing the underlying issues and returning the services back to their normal operating conditions, incident response teams should also have formalized processes of learning from these incidents so that similar incidents do not recur and the people involved can see what worked (or not) to build successful response patterns for the future. For this reason, comprehensive post incident analysis is a critical step in incident management.

Read more »

Leveraging OpsGenie and Slack Integration (Video)

Apr 24, 2018 by Kevin Davis
As Dev and Ops teams continue to scale, time to respond becomes more and more important to get systems back to a healthy state. In this short video, we will show you how to leverage our Slack integration to help incorporate your current chat tool setup into your team’s response process. The ability to immediately take the appropriate action by using Slack buttons or Genie commands will help your team become even more agile in the growing alert landscape.
Read more »

Amp up your Incident Management Ops: Flock + OpsGenie

Apr 19, 2018 by Rahul Menon

Flock is a messaging tool that helps teams communicate, collaborate, and get things done in real-time. With integrations for popular business tools and services such as Google Drive, Calendar, Dropbox, Jira, and GitHub, Flock makes it easy for teams to work together - faster than ever.

Read more »

Creating Actionable Alerts Series #3: Initiating Communication and Collaboration

Apr 17, 2018 by Serhat Can

 

Read more »

On-Call Schedule Patterns for Growing Companies

Apr 12, 2018 by Emel Dogrusoz

This post is an excerpt from our White Paper: Scaling On-Call in a DevOps Organization which you can download to learn more about best on-call practices. 

Download the White Paper

Read more »

5 Phases of a Solid Security Incident Response Plan

Apr 10, 2018 by Serhat Can

steps to improve security

Today’s post is an excerpt from “5 Steps to Improve Security and Incident Response in AWS”, which was co-authored by OpsGenie and Evident.io, a Palo Alto Company. You can download the complete white paper here.

Read more »

Ticket Closed! - OpsGenie & ConnectWise Integration (Video)

Apr 5, 2018 by Darren Henry

More and more ConnectWise users are embracing OpsGenie to help manage their IT incidents and respond to their service desk tickets as quickly as possible. In this short video, we walk through effectively responding to two different ConnectWise tickets; one of high priority, and one of low. You will notice OpsGenie escalates the tickets differently and keeps all systems up to date with changes. It even continuously communicates all actions in Slack. 

Read more »

How to Manage On-Call in a DevOps Organization

Apr 3, 2018 by Emel Dogrusoz
Read more »

Searching Alerts Within OpsGenie (Video)

Mar 29, 2018 by Kate Clavet

The OpsGenie Alerts Page shows all visible alerts and appears by default once you log in to your account. Searching for alerts is easy and can be done two ways, using a form-based search or a google-style search. Watch the below video to become more familiar with searching alerts, exporting the results, and saving searches for future use.

Read more »

Creating Actionable Alerts Series #2: Remedial Actions

Mar 27, 2018 by Serhat Can

As part of the series promoting our newest White Paper: Creating Actionable Alerts to Maximize Resolution Speed, we’d like to introduce another automated task that can be implemented during alert enrichment. Check out our previous post Creating Actionable Alerts Series #1: Gathering Investigative Information to learn more about our first alert enrichment automation practice. 

Download the White Paper now!

Read more »
1 2 3 4 5