What is Paired Data?


When two datasets are of equal length and each observation in one dataset can be “paired” with an observation in another dataset, we call this paired data.

Paired data

In order for two datasets to be paired, it’s important that each in one dataset can only be paired with one observation in the other dataset. 

Examples of Paired Data

Here are a few examples of paired data:

Example 1: Duplicate Measurements.

Suppose researchers want to know if a scale is consistent in weighing boxes at all hours of the day in a certain warehouse. To test this, researchers use the scale to weigh 30 different boxes in the morning and then again in the evening.

The end result is two datasets in which the morning and evening weights of each box can be “paired” with themselves.

Example of paired data on duplicate measurements

Example 2: Pre-Post Measurements.

A doctor wants to know if a new drug is capable of reducing blood pressure in patients. To test this, he measures the blood pressure of 20 different patients before and after using the drug for one week.

The end result is two datasets, in which the before and after blood pressure of each individual can be “paired” with themselves.

Example of paired data

How to Analyze Paired Data

There are two common ways to analyze paired data:

1. Perform a paired t-test.

One way to analyze paired data is to perform a , which compares the means of two when each observation in one sample can be paired with an observation in the other sample.

2. Calculate the correlation between the two datasets.

Another way to analyze paired data is to calculate the between the two datasets.

This gives us an idea of the direction and the strength of the relationship between the values in the two datasets.

Paired Data vs. Unpaired Data

Unlike paired data, unpaired data occurs when the observations of one dataset cannot be uniquely paired with an observation in another dataset.

For example, suppose researchers want to know whether or not a certain training program increases the average vertical jump of basketball players.

One way to test this using paired data would be to measure the max vertical jump of the same 20 players before and after using the training program:

Paired vs. unpaired data

To test this using unpaired data, the researchers could measure the max vertical jump of 20 players who did not use the training program and then measure the max vertical jump of 20 different players who did use the training program:

Unpaired data example

When we’re working with paired data, we use a to determine if the difference between the sample means is different.

And when we’re working with unpaired data, we use an to determine if the difference between the sample means is different.

x