elementary.dimension_anomalies

The test counts rows grouped by given dimensions (columns/expressions).

This test practically monitors the frequency of values in the configured dimension over time, and alerts on unexpected changes in the distribution. It is best to configure it on low-cardinality fields.

If timestamp_column is configured, the distribution is collected per time_bucket. If not, it counts the total rows per dimension.

Test configuration

Required configuration: dimensions

tests:   — elementary.dimension_anomalies:     dimensions: sql expression     timestamp_column: column name     where_expression: sql expression     anomaly_sensitivity: int     anomaly_direction: [both | spike | drop]     detection_period:       period: [hour | day | week | month]       count: int     training_period:       period: [hour | day | week | month]       count: int     time_bucket:       period: [hour | day | week | month]       count: int     seasonality: day_of_week     detection_delay:       period: [hour | day | week | month]       count: int     ignore_small_changes:       spike_failure_percent_threshold: int       drop_failure_percent_threshold: int     anomaly_exclude_metrics: [SQL expression]     exclude_final_results: [SQL expression]