Modern applications require monitoring at different levels, consisting of various tools such as Network Monitoring, Logging, Analytics, APM, Metrics collecting, among others. Some of these tools produce alerts that are standalone, but some may produce related alerts during an issue or incident. OpsGenie’s Alert page is a dashboard where all these alerts are collected and displayed, ready for responders to take action from one singular place.
This is an excerpt from our newest Whitepaper: Essential Guide to AWS Cloud Migrations which covers successful traits of cloud-centric IT organizations, tips for collaboration between Dev and Ops teams, the challenges in moving to the cloud and how to mitigate them– a step-by-step guide to cloud migration done right.
Notification rules within OpsGenie empower team members to specify how and when they would like to be alerted. Users can be notified by email, SMS, voice call, or iOS and Android push notifications. With notification rules, team members specify what contact method will be used by alert type and time of day. Watch the video below to become more familiar with notification rules within OpsGenie.
How many servers can be managed by one system administrator? This question is pretty hard to answer since it depends decisively on the tasks that need to be operated. It is clear, however, that the amount of servers one engineer can manage has increased tremendously over the time, and is still growing. Public and private clouds, in combination with automation tools, enables us to automate many daily tasks. In a modern IT infrastructure almost everything can, and should, be automated. Starting from the creation of a new instance up to software deployment. In this whole scenario, automated monitoring is an essential component.
We pride ourselves at OpsGenie for being the most reliable and flexible alert and incident management solution. However, what happens when you simply don’t want notifications? Even with escalations, routing rules, and on-call schedules, you may want extra configuration on when you are notified, and for what types of alerts.
Do you receive your support and/or internal queries through calls? OpsGenie’s Incoming Call Routing allows you to manage your phone numbers and how the calls to these numbers are being routed from one single place. Use your on-call schedules and escalations to determine the right team member to route the incoming call to, and make sure the call is not missed, just like your alerts.
Incident response procedures for IT incidents are similar to the processes required for emergencies in the medical field. In previous posts we’ve compared on-call responders to doctors on-call- called during emergencies and expected to contain and remedy the problem, preventing loss and reducing impact. Using priority is a great method of alert enrichment to accomplish this. Download our white paper to learn more best practices to maximize resolution speed.
Adopting a true DevOps culture is chock-full of challenges such as shifting away from legacy infrastructure to a more microservices-centric approach, integrating tools, managing priorities, environment provisioning, traceability, and more. However, implementing an approach in which Dev teams and Ops teams can work together, using the same assets, can drastically change the way an organization is able to move forward with each new release. For DevOps teams, proactively monitoring “left” of production can be a revolutionary element in achieving these goals.