Lesson 15Quartiles and Interquartile Range
Let's look at other measures for describing distributions.
Learning Targets:
- I can use IQR to describe the spread of data.
- I know what quartiles and interquartile range (IQR) measure and what they tell us about the data.
- When given a list of data values or a dot plot, I can find the quartiles and interquartile range (IQR) for data.
15.1 Notice and Wonder: Two Parties
Here are two dot plots including the mean marked with a triangle. Each shows the ages of partygoers at a party.
15.2 The Five-Number Summary
Here are the ages of a group of the 20 partygoers you saw earlier, shown in order from least to greatest.
7 | 8 | 9 | 10 | 10 | 11 | 12 | 15 | 16 | 20 | 20 | 22 | 23 | 24 | 28 | 30 | 33 | 35 | 38 | 42 |
-
-
Find and mark the median on the table, and label it “50th percentile.” The data is now partitioned into an upper half and a lower half.
-
Find and mark the middle value of the lower half of the data, excluding the median. If there is an even number of values, find and write down the average of the middle two. Label this value “25th percentile.”
-
Find and mark the middle value of the upper half of the data, excluding the median. If there is an even number of values, find and write down the average of the middle two. Label the value “75th percentile.”
-
You have now partitioned the data set into four pieces. Each of the three values that “cut” the data is called a quartile.
- The first (or lower) quartile is the 25th percentile mark. Write “Q1” next to “25th percentile.”
- The second quartile is the median. Write “Q2” next to that label.
- The third (or upper) quartile is the 75th percentile mark. Write “Q3” next to that label.
-
Label the least value in the set “minimum” and the greatest value “maximum.”
-
-
Record the five values that you have just identified. They are the five-number summary of the data.
Minimum: _____ Q1: _____ Q2: _____ Q3: _____ Maximum: _____
-
The median (or Q2) value of this data set is 20. This tells us that half of the partygoers are 20 or younger, and that the other half are 20 or older. What does each of the following values tell us about the ages of the partygoers?
- Q3
- Minimum
- Maximum
Are you ready for more?
Here is the five-number summary of the age distribution at another party of 21 people.
Minimum: 5 years Q1: 6 years Q2: 27 years Q3: 32 years Maximum: 60 years
- Do you think this party has more or fewer children than the other one in this activity? Explain your reasoning.
- Are there more children or adults at this party? Explain your reasoning.
15.3 Range and Interquartile Range
-
Here is a dot plot you saw in an earlier task. It shows how long Elena’s bus rides to school took, in minutes, over 12 days.
Write the five-number summary for this data set by finding the minimum, Q1, Q2, Q3, and the maximum. Show your reasoning.
-
The range of a data set is one way to describe the spread of values in a data set. It is the difference between the greatest and least data values. What is the range of Elena’s data?
-
Another number that is commonly used to describe the spread of values in a data set is the interquartile range (IQR), which is the difference between Q1, the lower quartile, and Q3, the upper quartile.
-
What is the interquartile range (IQR) of Elena’s data?
-
What fraction of the data values are between the lower and upper quartiles? Use your answer to complete the following statement:
The interquartile range (IQR) is the length that contains the middle ______ of the values in a data set.
-
-
Here are two dot plots that represent two data sets.
Without doing any calculations, predict:
a. Which data set has the smaller IQR? Explain your reasoning.
b. Which data set has the smaller range? Explain your reasoning.
- Check your predictions by calculating the IQR and range for the data in each dot plot.
Lesson 15 Summary
Earlier we learned that the mean is a measure of the center of a distribution and the MAD is a measure of the variability (or spread) that goes with the mean. There is also a measure of spread that goes with the median called the interquartile range (IQR).
Finding the IQR involves partitioning a data set into fourths. Each of the three values that cut the data into fourths is called a quartile.
- The median, which cuts the data into a lower half and an upper half, is the second quartile (Q2).
- The first quartile (Q1) is the middle value of the lower half of the data.
- The third quartile (Q3) is the middle value of the upper half of the data.
Here is a set of data with 11 values.
12 | 19 | 20 | 21 | 22 | 33 | 34 | 35 | 40 | 40 | 49 |
Q1 | Q2 | Q3 |
- The median (Q2) is 33.
- The first quartile (Q1) is 20, the median of the numbers less than 33.
- The third quartile (Q3) is 40, the median of the numbers greater than 33.
The difference between the minimum and maximum values of a data set is the range.
The difference between Q1 and Q3 is the interquartile range (IQR). Because the distance between Q1 and Q3 includes the middle two-fourths of the distribution, the values between those two quartiles are sometimes called the middle half of the data.
The bigger the IQR, the more spread out the middle half of the data are. The smaller the IQR, the closer the middle half of the data are. We consider the IQR a measure of spread for this reason.
The five numbers in this example are 12, 20, 33, 40, and 49. Their locations are marked with diamonds in the following dot plot.
Different data sets could have the same five-number summary. For instance, the following data has the same maximum, minimum, and quartiles as the one above.
Glossary Terms
The interquartile range is one way to measure how spread out a data set is. We sometimes call this the IQR. To find the interquartile range we subtract the first quartile from the third quartile.
22 | 29 | 30 | 31 | 32 | 43 | 44 | 45 | 50 | 50 | 59 |
Q1 | Q2 | Q3 |
For example, the IQR of this data set is 20 because .
Quartiles are the numbers that divide a data set into four sections that each have the same number of values.
For example, in this data set the first quartile is 20. The second quartile is the same thing as the median, which is 33. The third quartile is 40.
12 | 19 | 20 | 21 | 22 | 33 | 34 | 35 | 40 | 40 | 49 |
Q1 | Q2 | Q3 |
The range is the distance between the smallest and largest values in a data set. For example, for the data set 3, 5, 6, 8, 11, 12, the range is 9, because .
Lesson 15 Practice Problems
Suppose that there are 20 numbers in a data set and that they are all different.
- How many of the values in this data set are between the first quartile and the third quartile?
- How many of the values in this data set are between the first quartile and the median?
In a word game, 1 letter is worth 1 point. This dot plot shows the scores for 20 common words.
- What is the median score?
- What is the first quartile (Q1)?
- What is the third quartile (Q3)?
- What is the interquartile range (IQR)?
Here are five dot plots that show the amounts of time that ten sixth-grade students in five countries took to get to school. Match each dot plot with the appropriate median and IQR.
- Median: 17.5, IQR: 11
- Median: 15, IQR: 30
- Median: 8, IQR: 4
- Median: 7, IQR: 10
- Median: 12.5, IQR: 8
Mai and Priya each played 10 games of bowling and recorded the scores. Mai’s median score was 120, and her IQR was 5. Priya’s median score was 118, and her IQR was 15. Whose scores probably had less variability? Explain how you know.
Draw and label an appropriate pair of axes and plot the points. , , ,
There are 20 pennies in a jar. If 16% of the coins in the jar are pennies, how many coins are there in the jar?