Kendall’s Tau

Kendall’s Tau is a statistical measure used to assess the strength and direction of the relationship between two variables. It is a non-parametric measure, meaning it does not rely on any assumptions about the underlying distribution of the data. Kendall’s Tau is particularly useful for analyzing ordinal or ranked data, as it takes into account the order of the values rather than just the magnitude. This measure ranges from -1 to 1, with 0 indicating no correlation and values closer to -1 or 1 representing a stronger correlation. It is commonly used in various fields, such as social sciences, economics, and biology, to understand the relationship between variables and make informed decisions.


What is Kendall’s Tau?

Kendall’s Tau is used to understandthe strength of the relationship between two variables. Your variables of interest can be continuous or ordinal and should have a monotonic relationship. See more below.

Kendall's Tau measures the relationship between two variables when one or more of the variables is ordinal, non-linear, skewed, or has outliers.

Kendall’s Tau is also called Kendall rank correlation coefficient, and Kendall’s tau-b.


Assumptions for Kendall’s Tau

Every statistical method has assumptions. Assumptions mean that your data must satisfy certain properties in order for statistical method results to be accurate.

The assumptions for Kendall’s Tau include:

  1. Continuous or ordinal
  2. Monotonicity

Let’s dive in to each one of these individually.

Continuous or Ordinal

The variables that you care about must be continuous or ordinal. Continuous means that the variable can take on any reasonable value. Some good examples of continuous variables include age, weight, height, test scores, survey scores, yearly salary, etc. Kendall’s Tau is often used for correlation on continuous data if there are outliers in the data.

Ordinal variables are categories that have an inherent order. For instance, education level (GDE/Bachelors/Masters/PhD), income level (if grouped into high/medium/low) etc.

Monotonicity

Your two variables should have a monotonic relationship. This means that the direction of the relationship between the variables is consistent. For instance, when one variable goes up, the other goes up (in general). In this case, a plot of the two variables would move consistently in the up-right direction. The relationship would also be monotonic if when one variable goes up, the other goes down (in general). In this case, the plot of the two variables would move consistently in the down-right direction.

Monotonicity means that as one variable increases or decreases on average, so does the other.

When to use Kendall’s Tau

You should use Kendall’s Tau in the following scenario:

  1. You want to know the relationship between two variables
  2. Your variables of interest are continuous with outliers or ordinal
  3. You have only two variables

Let’s clarify these to help you know when to use Kendall’s Taup

Relationship

You are looking for a statistical test to look at how two variables are related. Other types of analyses include testing for a difference between two variables or predicting one variable using another variable (prediction).

Continuous with Outliers or Ordinal

Your variables of interest must be either continuous or ordinal. Continuous means that your variables of interest can basically take on any value, such as heart rate, height, weight, number of ice cream bars you can eat in 1 minute, etc. Kendall’s Tau is often used on continuous data when the data have outliers.

Ordinal variables are categories that have an inherent order. For instance, education level (GDE/Bachelors/Masters/PhD), income level (if grouped into high/medium/low) etc.

If your data are continuous and do not have outliers, you should probably use Pearson Correlation instead. If one of your variables is continuous and the other is binary, you should use Point Biserial Correlation. And if your variables are categorical, you should use the Phi Coefficient or Cramer’s V.

Two Variables

Kendall’s Tau can only be used to compare two variables.


Kendall’s Tau Example

Variable 1: Hours worked per week.
Variable 2: Income.

In this example, we are interested in investigating the relationship between a person’s average hours worked per week and income. To begin, we collect these data from a group of people.

Depending on the population, one or both of these variables is likely skewed, or does not fit a bell curve. For this reason, we use Kendall’s Tau instead of Pearson Correlation. We double check that the other assumptions of Kendall’s Tau are met.

The analysis will result in a correlation coefficient (called “Tau”) and a p-value. Tau values range from -1 to 1. A negative value of Tau indicates that the variables are inversely related, or when one variable increases, the other decreases. On the other hand, positive values indicate that when one variable increases, so does the other.

The p-value represents the chance of seeing our results if there was no actual relationship between our variables. A p-value less than or equal to 0.05 means that our result is statistically significant and we can trust that the difference is not due to chance alone.

x