Alerts are states, and notifications are messages that you receive about alerts. OpsDash supports threshold-based alerts: you can set upper and lower warning and critical limits for any metric. When the value of a metric goes beyond one of these thresholds, the metric is said to be in an alerting state.
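To make the threshold model concrete, here is a minimal sketch in Python. It is not OpsDash code or an OpsDash API: the `Thresholds` class and `alert_state` function are hypothetical names invented for illustration, and treating "beyond" as a strict comparison is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Thresholds:
    """Hypothetical container for the four limits; None means 'not set'."""
    lower_critical: Optional[float] = None
    lower_warning: Optional[float] = None
    upper_warning: Optional[float] = None
    upper_critical: Optional[float] = None


def alert_state(value: float, t: Thresholds) -> str:
    """Return 'critical', 'warning', or 'ok' for a metric value."""
    # Critical limits are checked first so they take precedence over warnings.
    # Assumption: a value exactly equal to a limit is not yet alerting.
    if t.upper_critical is not None and value > t.upper_critical:
        return "critical"
    if t.lower_critical is not None and value < t.lower_critical:
        return "critical"
    if t.upper_warning is not None and value > t.upper_warning:
        return "warning"
    if t.lower_warning is not None and value < t.lower_warning:
        return "warning"
    return "ok"


# Example: a CPU usage metric with only upper limits set.
cpu = Thresholds(upper_warning=80.0, upper_critical=95.0)
print(alert_state(42.0, cpu))  # ok
print(alert_state(85.0, cpu))  # warning
print(alert_state(97.0, cpu))  # critical
```

Leaving a limit unset (`None`) simply skips that check, which mirrors the idea that you only define the limits you care about for a given metric.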
OpsDash can notify you about alerts via:
Slack - post messages to Slack channels
HipChat - post messages to HipChat rooms
Mattermost - post messages to Mattermost channels
OpsGenie - manage alerts with OpsGenie
PagerDuty - raise incidents in PagerDuty
VictorOps - manage incidents in VictorOps
OpsDash alerting is designed to be spam-free. Summary emails are sent only when something changes, and you can define parameters for other notification types to ensure your team isn’t overloaded.
It’s fast and easy to set up alerts and notifications in OpsDash. Here are the key steps to help you get started:
1) CONFIGURE NOTIFICATION CHANNELS
You can set up these integrations on the System Settings page. This is a one-time setup.
You can find detailed instructions on how to configure each integration here:
2) CONFIGURE NOTIFICATION ROUTING RULES
Once the integrations are set up, you can add notification rules. Notification rules allow you to send, for example, all alerts related to database servers to the database team’s Slack channel.
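The routing idea can be sketched as a small matching function. This is an illustration only, not how OpsDash stores or evaluates its rules: the `Rule` and `Alert` classes, the tag-based matching, and the channel strings are all assumptions made for this example. The `enabled` flag stands in for switching a rule on or off.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Alert:
    """Hypothetical alert record."""
    source: str
    metric: str
    tags: List[str]


@dataclass
class Rule:
    """Hypothetical routing rule: alerts carrying match_tag go to channel."""
    name: str
    match_tag: str
    channel: str
    enabled: bool = True


def route(alert: Alert, rules: List[Rule]) -> List[str]:
    """Return the channels of every enabled rule whose tag is on the alert."""
    return [r.channel for r in rules if r.enabled and r.match_tag in alert.tags]


rules = [
    Rule("Database alerts", "database", "slack:#db-team"),
    # A disabled rule is skipped, e.g. during planned maintenance.
    Rule("Web alerts", "web", "slack:#web-team", enabled=False),
]

alert = Alert("db-01", "replication_lag", ["database", "production"])
print(route(alert, rules))  # ['slack:#db-team']
```

An alert that matches no enabled rule yields an empty list, so nothing is sent for it in this sketch.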
Rules are set from the Notifications page. Each rule can be toggled on and off independently, so you can temporarily turn off specific notifications (e.g. during planned maintenance). Here’s how the rules look when set:
3) SET ALERTS
You can easily set alerts for upper and lower limits for any metric. Both critical and warning limits can be set. Just click on the graph in your OpsDash dashboard, and enter the appropriate values under alert settings.
OpsDash alerting has been carefully crafted to prevent notification spam. Email notifications are sent out only at the intervals you specify, summarizing the ops situation at that point and the changes that happened since the last email. For chat-based channels, a summary and the changes since the last notification are posted whenever the ops situation changes.
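The "post only when something changes" behavior described for chat channels can be sketched as a diff between two snapshots of alert states. This is a simplified illustration under stated assumptions, not OpsDash's implementation: the `State` mapping, the function names, and the message format are all hypothetical.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical snapshot: metric name -> "ok" | "warning" | "critical"
State = Dict[str, str]


def diff_states(previous: State, current: State) -> List[Tuple[str, str, str]]:
    """List (metric, old_state, new_state) for each metric whose state changed."""
    changes = []
    for metric in sorted(set(previous) | set(current)):
        # Assumption: a metric missing from a snapshot counts as "ok".
        old = previous.get(metric, "ok")
        new = current.get(metric, "ok")
        if old != new:
            changes.append((metric, old, new))
    return changes


def build_message(previous: State, current: State) -> Optional[str]:
    """Return changes plus a summary, or None when nothing changed."""
    changes = diff_states(previous, current)
    if not changes:
        return None  # nothing changed, so nothing is posted
    lines = [f"{metric}: {old} -> {new}" for metric, old, new in changes]
    alerting = sorted(m for m, s in current.items() if s != "ok")
    lines.append("Currently alerting: " + (", ".join(alerting) or "none"))
    return "\n".join(lines)


before = {"cpu": "ok", "disk": "warning"}
after = {"cpu": "critical", "disk": "warning"}

print(build_message(before, before))  # None
print(build_message(before, after))
```

Returning `None` for an unchanged snapshot is what keeps a channel quiet while the situation is stable; an interval-based email summary would instead call a function like this on a timer and compare against the state at the last email.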
You can review all the thresholds that have been set across all graphs at a glance, using the Thresholds page. No need to hunt around: it’s all there on a single page for easy review.
Try setting an alert and sending a test notification now.