What is Chauvenet’s Criterion and can you provide an example?

Chauvenet’s Criterion is a statistical method used to identify and eliminate outliers in a dataset. It is based on the assumption that a data point that deviates significantly from the majority of the data is likely due to error or measurement bias. The criterion calculates the probability of a data point being considered an outlier and compares it to a predetermined threshold. If the probability is lower than the threshold, the data point is considered an outlier and can be removed from the dataset.

For example, in a study measuring the heights of individuals, a data point that is significantly lower or higher than the average height of the population may be identified as an outlier using Chauvenet’s Criterion. This could be due to measurement error or misreporting of height. By removing this outlier, the dataset becomes more accurate and representative of the population.

Chauvenet’s Criterion: Definition & Example


An outlier is an that lies abnormally far away from other values in a dataset. Outliers can be problematic because they can affect the results of an analysis.

One way to identify outliers in a dataset is to use Chauvenet’s Criterion, which uses the following process:

1. For each individual value xi in the dataset, calculate the deviation from the mean as:

Deviation = |xix| / s

where x is the sample mean and s is the sample standard deviation.

2. Compare the deviations of each individual value to the critical values of Chauvenet’s Criterion Table below. For individual data values with deviations greater than those found in the table, declare those data values to be outliers.

Chauvenet's Criterion table

Chauvenet’s Criterion: An Example

Suppose we have the following dataset of 15 values:

The sample mean for this dataset is x = 17.067 and the sample standard deviation is s =10.096. For each individual data value, we can calculate calculate its deviation as:

Deviation = |xix| / s

For example:

  • The first data value would have a deviation of |4 – 17.067| / 10.096 = 1.294.
  • The first data value would have a deviation of |6 – 17.067| / 10.096 = 1.096.

And so on.

We can use the same formula to calculate the deviation of each individual data value:

Chauvenet's Criterion example

It turns out that the value 42 has a deviation greater than 2.128:

Example of Chauvenet's Criterion for outlier detection

Thus, the value 42 is the only outlier in this dataset.

Cautions on Using Chauvenet’s Criterion

Chauvenet’s Criterion makes the assumption that the values in a dataset are normally distributed. If this assumption is not met, then using Chauvenet’s Criterion to identify outliers is likely not valid.

If you do use this method and find that a value is an outlier, you should first verify that the value is not a result of a data entry error. Sometimes data is simply entered incorrectly. 

If the value is a true outlier, you may choose to remove it if it will have a significant impact on your overall analysis. Just be sure to mention that you removed an outlier when you report your results.

Also, this method should only be used on a given dataset once. For example, suppose we use this criterion to identify the value 42 as an outlier in the previous example and remove this value from the dataset.

We then shouldn’t recalculate the and sample standard deviation and calculate the deviations once again to find more outliers.

x