Alerts and Incidents Overview
ELEMENTARY CLOUD
Alerts and incidents in Elementary are designed to shorten your time to response and time to resolution when data issues occur.
- Alert - Notification about an event that indicates a data issue.
- Incident - A data issue that starts with an event, but can include several events grouped to an incident. An incident has a start time, status, severity, assignee and end time.
Alerts provide information and context for recipients to quickly triage, prioritize and resolve issues. For collaboration and promoting ownership, alerts include owners and tags. You can create distribution rules to route alerts to the relevant people and channels, for faster response.
An alert would either open a new incident, or be automatically grouped and added to an ongoing incident. From the alert itself, you can update the status and assignee of an incident. In the incidents page, you will be able to track all open and historical incidents, and get metrics on the quality of your response.
Alerts & incidents core functionality
- Alerts customization - Alerts should include relevant context for quick triage such as owner, tags, *description. In Elementary, alerts can be customized to include this information.
- Alert distribution rules - Alerts should be sent to relevant recipients. By creating Alert Rules, alerts can be distributed to different channels and systems.
- Incidents management - When alerts are distributed to different channels, it can become hard to track what is open. Elementary offers a centralized Incidents page to monitor what is open, and manage incident properties: assignee, status and severity.
- Grouping alerts to incidents - New failures related to already open incidents will not trigger new alerts, and will be automatically added to the opgoing incident. This reduces noise and alerts fatigue.
- Automated resolution - When there is a successful run that means an open incident is resolved, Elementary will automatically resolve the incident. This will help you manage the state of incidents and communicate it to stake holders in real time.
Alert types
Elementary can be configured to send alerts on:
- Model run failures
- Failures and/or warnings of dbt tests (including Elementary dbt package and other packages)
- Failures and/or warnings Elementary Anomaly Detection monitors
- Failures and/or warning of custom SQL tests
- dbt source freshness failures