Alert policies
An alert policy allows you to configure a degradation threshold for your metric. If the feature flag causes the metric to go below or above that threshold you are alerted by your configured notification channel.
Create a metric alert policy
Alert policies can only be created for metrics that are measured per traffic type, rather than those that are measured across all traffic type keys.
To create an alert policy:
-
Create a new metric or navigate to the metric that you want to be alerted of a change. For more information about creating a metric, refer to the Creating a metric guide.
-
Once you select the metric to create a new alert policy for, click the Alert policy tab by the Metric definition tab. If there are no policies available, click the Create alert policy button.
-
Fill in the fields as follows:
-
In the Name field, give your alert policy a human recognizable name. We recommend also including the metric name your alert policy is associated with in your policy name.
-
In the Description field, optionally give your alert policy a description. This can include anything that would be useful to you and your team if an alert is fired. Include runbooks, alerting protocols, and key information about the alert.
-
1st alert condition area An alert policy can have multiple alert conditions. Each alert condition relates to a particular environment.
noteYou are limited to one alert policy per environment.
-
In the Choose your environment field, select the environment you want to apply the alert condition to. If you already have an alert condition for a particular environment, this is not available in the environment menu list when you create a second alert condition.
-
In the Set alert threshold field, add a degradation threshold in the form of a relative percentage threshold or absolute value threshold. The degradation direction is assumed to be the opposite of the desired direction of the metric. Learn more about setting degradation thresholds in the Choosing your degradation threshold for alerting guide.
-
In the Define alert notification channel field, configure who to notify if an alert is fired. As default, the metric owner is selected. You can also select the emails of any group as well as add emails using the freeform section.
-
From here, you can add additional alert conditions.
-
After you finish entering your conditions, click the Create alert policy button. You now have a new alert policy.
When a metric alert is triggered
When an alert is triggered for a metric, the people you've selected in this section receive an email with the following format:
Subject: "Good News!/Alert: Feature Flag Name
had a positive/negative impact on Metric Name
Body: "Good News!/Alert: Feature Flag Name
had a positive/negative impact on Metric Name
Split has detected a(n) DESIRED/UNDESIRED improvement/degradation of X
% on your Metric Name
metric.
Feature flag: (name and link to feature flag)
Metric: (name and link to metric)
Environment:
Baseline treatment:
Baseline metric value:
Comparison treatment:
Comparison metric value:
Relative Impact: X
% in the DESIRED/UNDESIRED direction
-The Split Team
Visit the Split application to view the feature flag's metric impact (or kill the feature flag)."
Alert policy sensitivity
There are two configurable parameters for feature monitoring - the length of the monitoring window and the degradation threshold. These can be changed in the Monitor and Experiment Settings section of your admin panel. By adjusting these parameters you can tailor the coverage and sensitivity of feature monitoring alerts in the way that best suits your organization and metrics. See this article for a more in depth discussion on factors to consider when choosing these settings.
Sensitivity calculators
As with any statistical test in experimentation, we can't detect everything - each metric will only have the power to detect degradations larger than a given size. Smaller degradations will not be distinguishable from natural noise and variations in your data.
The calculators below can be used to help you calculate what range of degradations you can expect to be able to detect for a given sample size and set of metric characteristics. If your metric is a count, sum, average or ratio metric, use the first calculator for means metrics. Otherwise, if your metric is a percent of unique users metric, use the second calculator for proportions.
These calculators assume your statistical settings are set at a significance threshold of 0.05 and a power threshold of 80%.
For example, imagine you have a Percentage of Unique Users metric which has a value of 60% in the baseline treatment, and you use a relative degradation threshold of 10%. If the desired direction of the metric is a decrease, then we would be testing for evidence that the Percentage of Unique Users in the comparison group is more than 66% (more than 10% higher than the baseline value).
Assuming a 50/50 percentage rollout of users between baseline and comparison treatments, an Org wide significance level of 0.05, and a monitoring window of 24 hours, with 10,000 unique users you would only see an alert if the observed percentage for the comparison group increased by more than 16.2% and hence had a value higher than 69.7%. If instead you had 1000 or 100,000 unique users, the comparison group value would need to be higher than 77% and 67%, respectively, for an alert to be raised.
Using the calculators
If you have the experiment pack and you are unsure of any of the data required for the calculator, we recommended looking at the metric results for a similar feature flag you have already ran, or running a 100% off flag with your intended targeting rules. You can then find the sample size, metric value and standard deviation from the Metric details and trends view reached by clicking into the metric card.
Expected sample size during the monitoring window
This is the total sample size expected to enter your experimental rule during the monitoring window, it will be the total across both treatments rather than per treatment.
The Sample size column is shown under the Sample population section of the data table. You may need to adjust this value if the feature flag you are referencing ran for a different length of time than your chosen monitoring window length.
For example, imagine you see the below table for a feature flag which ran for 48 hours, and you intend to run a similar flag with a monitoring window length of 24 hours. To get the estimated sample size during the 24 hour monitoring window, first sum the sample sizes across the two treatments, to get 2000, then divide by 2 to go from 48 hours to 24 hours - this will give you an estimated sample size during the monitoring window of 1000.
Baseline metric value
This is the expected value of the metric in your control group, or the value you expect to see for the treatment set as the alerting baseline. If you are using a reference feature flag, you can also find an estimate for this value in the Metric Details and Trends view, you will need the number under the Mean column in the Metric Dispersion section of the data table.
Baseline standard deviation
The standard deviation characterizes how much variation there is in your metric. It is needed for the means calculator but not for the percent-unique calculator.
You can find this value under the Stdev column in the Metric Dispersion section of the data table.