WebMonitoring

Alert Rules

Alert rules define the conditions that trigger alerts. When a rule's condition is met during a scheduled evaluation cycle, an Alert instance is created and the configured escalation policy begins. Rules can be toggled active or inactive without being deleted.

Rule List

The alert rules list at /monitoring/alert-rules shows all configured rules with:

  • Rule name and description
  • Metric being monitored with current value
  • Active / Inactive toggle — disabling a rule halts evaluation without deleting it
  • Severity badge (info / warning / critical)
  • Last triggered timestamp
  • Associated escalation policy name

Rule Configuration Fields

FieldOptions / FormatDescription
NamestringHuman-readable rule identifier shown in alert notifications
MetricdropdownThe metric to monitor. Populated from a catalog of available FinOps and Redshift metrics.
Condition>, <, ==, !=The comparison operator applied between the metric value and threshold
ThresholdnumberThe value the metric is compared against to determine if the condition is met
Evaluation Window5 / 15 / 30 / 60 minThe time window over which the metric is aggregated before comparison
Severityinfo / warning / criticalDetermines the urgency level used in notification messages and escalation priority
Notification Channeldropdown (multi-select)Channels notified at the first escalation level when the rule fires
Escalation PolicydropdownThe escalation policy that governs notification progression if the alert is not acknowledged

Available Metrics

MetricSourceTypical Threshold
avg_query_duration_secondsstv_wlm_query_state300s (5 minutes)
disk_spill_gbsys_query_detail10 GB
daily_cost_usdCID cost viewsTenant-specific
unsorted_pctsvv_table_info20%
ghost_rows_pctsvv_table_info30%

Evaluation Cycle

Rule evaluation runs on a scheduled background job. Each evaluation cycle:

  1. Fetches the current metric value from the appropriate data source
  2. Compares the value against the threshold using the configured condition
  3. If the condition is met, creates an Alert instance with status: firing
  4. Triggers the first level of the configured escalation policy
  5. If the condition is no longer met in a subsequent cycle, the alert is auto-resolved

Duplicate Prevention

If an alert rule fires and an active Alert instance already exists for that rule (status: firing or acknowledged), no duplicate alert is created. A new alert is only created after the previous one is resolved.