Lesson 9: Looking for Associations
Let’s look for associations in data.
Learning Targets:
- I can identify the same data represented in a bar graph, a segmented bar graph, and a two-way table.
- I can use a two-way frequency table or relative frequency table to find associations among variables.
9.1 Notice and Wonder: Bar Association
What do you notice? What do you wonder?
9.2 Matching Representations Card Sort
Your teacher will hand out some cards.
Some cards show two-way tables like this:
| | has cell phone | does not have cell phone | total |
---|---|---|---|
10 to 12 years old | 25 | 35 | 60 |
13 to 15 years old | 40 | 10 | 50 |
16 to 18 years old | 50 | 10 | 60 |
total | 115 | 55 | 170 |
Some cards show bar graphs like this:
Some cards show segmented bar graphs like this:
The bar graphs and segmented bar graphs have their labels removed.
- Put all the cards that describe the same situation in the same group.
- One of the groups does not have a two-way table. Make a two-way table for the situation described by the graphs in the group.
- Label the bar graphs and segmented bar graphs so that the categories represented by each bar are indicated.
- Describe in your own words the kind of information shown by a segmented bar graph.
9.3 Building Another Type of Two-Way Table
Here is a two-way table that shows data about cell phone usage among children aged 10 to 18.
| | has cell phone | does not have cell phone | total |
---|---|---|---|
10 to 12 years old | 25 | 35 | 60 |
13 to 15 years old | 40 | 10 | 50 |
16 to 18 years old | 50 | 10 | 60 |
total | 115 | 55 | 170 |
- Complete the table. In each row, the entries for “has cell phone” and “does not have cell phone” should add up to 100%. Round entries to the nearest percentage point.
| | has cell phone | does not have cell phone | total |
---|---|---|---|
10 to 12 years old | 42% | | |
13 to 15 years old | | | 100% |
16 to 18 years old | | 17% | |
This is still a two-way table. Instead of showing frequency, this table shows relative frequency.
- Two-way tables that show relative frequencies often don’t include a “total” row at the bottom. Why?
- Is there an association between age and cell phone use? How does the two-way table of relative frequencies help to illustrate this?
Are you ready for more?
A pollster attends a rally and surveys many of the participants about whether they associate with political Party A or political Party B and whether they are for or against Proposition 3.14, which is going up for a vote soon. The results are sorted into the table shown.
| | for | against |
---|---|---|
party A | 832 | 165 |
party B | 80 | 160 |
- A news station reports these results by saying, “A poll shows that about the same number of people from both parties are voting against Proposition 3.14.”
- A second news station shows this graphic.
- Are any of the news reports misleading? Explain your reasoning.
- Create a headline, graphic, and short description that more accurately represents the data in the table.
Lesson 9 Summary
When we collect data by counting things in various categories, like red, blue, or yellow, we call the data categorical data, and we say that color is a categorical variable.
We can use two-way tables to investigate possible connections between two categorical variables. For example, this two-way table of frequencies shows the results of a study of meditation and state of mind of athletes before a track meet.
| | meditated | did not meditate | total |
---|---|---|---|
calm | 45 | 8 | 53 |
agitated | 23 | 21 | 44 |
total | 68 | 29 | 97 |
If we are interested in the question of whether there is an association between meditating and being calm, we might present the frequencies in a bar graph, grouping data about meditators and grouping data about non-meditators, so we can compare the numbers of calm and agitated athletes in each group.
If we want to know the proportions of calm meditators and calm non-meditators, we can make a two-way table of relative frequencies and present the relative frequencies in a segmented bar graph.
| | meditated | did not meditate |
---|---|---|
calm | 66% | 28% |
agitated | 34% | 72% |
total | 100% | 100% |
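The percentages in this table can be checked with a short calculation. The following is a minimal Python sketch, not part of the original lesson: the dictionary layout and the function name `column_relative_frequencies` are illustrative assumptions. It divides each frequency by its column total (68 meditators, 29 non-meditators) and rounds to the nearest percentage point.

```python
# Frequencies from the meditation study, grouped by column of the two-way table.
counts = {
    "meditated": {"calm": 45, "agitated": 23},
    "did not meditate": {"calm": 8, "agitated": 21},
}


def column_relative_frequencies(table):
    """Return each entry as a whole-number percentage of its column total."""
    result = {}
    for group, states in table.items():
        total = sum(states.values())  # column total, e.g. 45 + 23 = 68
        result[group] = {
            state: round(100 * n / total) for state, n in states.items()
        }
    return result


percentages = column_relative_frequencies(counts)
for group, states in percentages.items():
    print(group, states)
```

Because each column is divided by its own total, the two groups can be compared fairly even though there are more than twice as many meditators as non-meditators.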
Glossary Terms
The relative frequency of a category tells us the proportion at which the category occurs in the data set. It is displayed as a fraction or a percentage of the total number.
There were 21 dogs in the park, some white, some brown, some black, and some multi-color. The table shows the frequency and the relative frequency of each color. The relative frequency can also be expressed as a decimal or a percentage.
color | frequency | relative frequency |
---|---|---|
white | 5 | 5/21 = 24% |
brown | 7 | 7/21 = 33% |
black | 3 | 3/21 = 14% |
multi-color | 6 | 6/21 = 29% |
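As a quick check of the table, here is a small Python sketch (an illustration only; the variable names are assumptions, not part of the glossary). It divides each color's frequency by the total number of dogs and shows the result as a fraction, a decimal, and a rounded percentage.

```python
# Frequency of each dog color observed in the park.
dog_counts = {"white": 5, "brown": 7, "black": 3, "multi-color": 6}

total = sum(dog_counts.values())  # 21 dogs in all

# Relative frequency as a whole-number percentage of the total.
percents = {color: round(100 * n / total) for color, n in dog_counts.items()}

for color, n in dog_counts.items():
    # Show the same relative frequency as a fraction, a decimal, and a percentage.
    print(f"{color}: {n}/{total} = {n / total:.2f} = {percents[color]}%")
```

The percentages are rounded, so values such as 24% for white dogs are approximations of the exact fraction 5/21.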
A segmented bar graph compares two categories within a data set. The whole bar represents all the data within one category. Then, each bar is separated into parts (segments) that show the percentage of each part in the second category.
This segmented bar graph shows the percentage of people in different age groups that do and do not have a cell phone. For example, among people ages 10 to 12, about 40% have a cell phone and 60% do not have a cell phone.
A two-way table provides a way to compare two categorical variables. It shows one of the variables across the top and the other down one side. Each entry in the table is the frequency or relative frequency of the category shown by the column and row headings.
A study investigates the connection between meditation and the state of mind of athletes before a track meet. This two-way table shows the results of the study.
| | meditated | did not meditate | total |
---|---|---|---|
calm | 45 | 8 | 53 |
agitated | 23 | 21 | 44 |
total | 68 | 29 | 97 |
Lesson 9 Practice Problems
A scientist wants to know if the color of the water affects how much animals drink. The average amount of water each animal drinks was recorded in milliliters for a week and then graphed. Is there evidence to suggest an association between water color and animal?
| | cat intake (ml) | dog intake (ml) | total (ml) |
---|---|---|---|
blue water | 210 | 1200 | 1410 |
green water | 200 | 1100 | 1300 |
total | 410 | 2300 | 2710 |
A farmer brings his produce to the farmer’s market and records whether people buy lettuce, apples, both, or something else.
| | bought apples | did not buy apples |
---|---|---|
bought lettuce | 14 | 58 |
did not buy lettuce | 8 | 29 |
Make a table that shows the relative frequencies for each row. Use this table to decide if there is an association between buying lettuce and buying apples.
Researchers at a media company want to study news-reading habits among different age groups. They tracked print and online subscription data and made a 2-way table.
| | internet articles | print articles |
---|---|---|
18–25 year olds | 151 | 28 |
26–45 year olds | 132 | 72 |
45–65 year olds | 48 | 165 |
- Create a segmented bar graph using one bar for each row of the table.
- Is there an association between age groups and the method they use to read articles? Explain your reasoning.
Using the data in the scatter plot, what is a reasonable slope of a model that fits this data?
- -2.5
- -1
- 1
- 2.5