Enjoy any 5 free lessons!
You can pick. No account needed.
Watch VideoBecome a member to get full access to our entire library of learning videos, reading material, quiz games, simple DIY activities & more.
Become a member to get full access to our entire library of learning videos, quiz games, & more.
Plans & Pricingto watch this full video.
Access All Videos
and Lessons, No Limits.
Access All Videos
No credit card required,
takes 7 sec to signup.
No card required
Ready-to-go lessons
that save you time.
Ready-to-go lessons
If you are on a school computer or network, ask your tech person to whitelist these URLs:
*.wistia.com, fast.wistia.com, fast.wistia.net, embedwistia-a.akamaihd.net
Sometimes a simple refresh solves this issue. If you need further help, contact us.
Statistics: Random Samples & Comparing Data Sets
- Show lesson plan & teacher guide
- Show answers to discussion questions
- Show video only
- Allow visiting of other pages
- Hide assessments
- We will learn what a statistical question is.
- We’ll also learn how to answer them by collecting random samples and analyzing data.
- And we'll see how this knowledge can help us plan events, learn about animals, and even make water rockets!
-
Discussion Questions
-
Before VideoWhat is categorical data?ANSWER
-
Categorical data sorts objects based on a qualitative attribute that can be described using distinct categories, like color, tree type, or drink sizes.
-
A frequency is the number of times something happens. In data, a frequency is the number of individuals or objects that fit in a particular category or that produce measurements within a bin interval.
-
I use data collection to answer questions. Particularly if the answer could be different for different individuals or objects. I can collect data to get information about a population.
-
Continuous data is quantitative, and it is measured using a tool that allows for a continuous set of possible measurements, like length, mass, or volume.
-
Data can be difficult to visualize and compare when it is written in a table. By making graphs, I can communicate a lot of intuitive information about the data easily and quickly.
-
-
After VideoWhat is a population?ANSWER
-
A population is the group of individuals I am interested in. Populations can be people, pets, animals, plants, objects, etc. Populations need to be well defined, for example, if I am interested in studying middle school students in Arizona, I need to be specific. Just saying “people” implies everyone in the world, “people from Arizona” implies all Arizonans, while “people who are students in an Arizona middle school” adequately describes the population.
-
A sample is a subset of the population that a researcher selects to study. A sample should be representative of the population of interest, so if I am interested in “people who are students in a Michigan middle school,” I wouldn’t sample middle school students from outside Michigan, or sample elementary or high school students. Samples are important because they give information about the population without having to measure every member of the population. Sampling saves time and money.
-
A sampling method is the procedure I use to select a sample. There are many sampling methods, but the most important thing is to choose a sample that is likely to be representative of the population. Random sampling and a large enough sample size are the best ways to ensure a representative sample.
-
If a sample is different from the population, it is biased. Biased sampling means I can’t interpret the data properly or make conclusions about the population based on the results of the sample. I take samples so I can infer things about the population, but if the sampling method I choose doesn’t give a representative sample, then I can’t make inferences from that. Convenience sampling and low sample size can introduce bias to a sample.
-
A statistical question is one where the answer is uncertain, or where the answer may vary between individuals. For example, “how tall are Oklahoma middle school students?” is a statistical question because the height of a student changes from student to student. I can answer this question by taking a random sample of Oklahoma middle school students and measuring their heights. I can then use a histogram or line graph to summarize and display that data.
-
-
-
Vocabulary
-
Statistical question
DEFINE
A question where the answer varies, so we need to collect data to find the answer.
-
Population
DEFINE
The entire group we want to study; this could be all lizards on an island, all students in a school, all trees in a forest, etc.
-
Sample
DEFINE
A subset of the population selected to be measured.
-
Random sample
DEFINE
A sample where every individual in the sample is selected at random from the population.
-
Sample size
DEFINE
The number of observations/individuals in the sample. Larger sample sizes make better data.
-
Data
DEFINE
The values generated when someone takes measurements for a group of objects or individuals; usually, the group is a sample.
-
Bias
DEFINE
A characteristic of bad samples where the sample is very different from the population. For example, a sample meant to represent all students at the school is biased if the only students sampled are on the soccer team because the soccer team does not accurately reflect all students in the school.
-
Mean
DEFINE
The average observation in a data set, calculated by adding together all the observations and dividing the sum by the number of observations in the data set.
-
Median
DEFINE
The middle value in the data set when the observations are sorted from smallest to largest.
-
Mode
DEFINE
The most frequent observation in a dataset.
-
Range
DEFINE
The spread of the data, calculated by identifying the smallest and largest observations in the data set, and taking their difference.
-
Histogram
DEFINE
A graphical display of data where the bar height reflects the count or frequency of observations within the interval (bin) or set of values the bar represents.
-
Statistical question
DEFINE
-
Reading Material
Download as PDF Download PDF View as Separate PageWHAT ARE STATISTICS: RANDOM SAMPLES & COMPARING DATA SETS?A population is a group that you are interested in knowing something about. A sample of a population is a selection of some members of that population. Random samples are samples made by randomly selecting members of a population.
To better understand Statistics: Random samples & comparing data sets…
WHAT ARE STATISTICS: RANDOM SAMPLES & COMPARING DATA SETS?. A population is a group that you are interested in knowing something about. A sample of a population is a selection of some members of that population. Random samples are samples made by randomly selecting members of a population. To better understand Statistics: Random samples & comparing data sets…LET’S BREAK IT DOWN!
Statistical questions yield various answers.
Statistical questions are questions where the answers vary. For example, if I ask, “How much does this dog weigh?” that is not a statistical question. There is only one possible answer: the correct weight. But if you ask many people “What kind of pet do you have?” you get a variety of answers. Emily and Amari are planning a field day for their school. They need to choose the sport to play. Her statistical question is “What sport do students want to play?” Try this yourself: Choose a statistical question. Ask some classmates your question.
Statistical questions yield various answers. Statistical questions are questions where the answers vary. For example, if I ask, “How much does this dog weigh?” that is not a statistical question. There is only one possible answer: the correct weight. But if you ask many people “What kind of pet do you have?” you get a variety of answers. Emily and Amari are planning a field day for their school. They need to choose the sport to play. Her statistical question is “What sport do students want to play?” Try this yourself: Choose a statistical question. Ask some classmates your question.Samples can be biased.
Emily and Amari need to decide how to get answers to their question. They could ask everyone in the school, but there are over 1,000 students! That would take a long time. They should only ask a sample of the school population. What if they ask everyone on the soccer team? That sample is biased; the soccer players will likely vote for soccer. Instead, Emily and Amari could sample people at lunchtime. Amari samples 10 people, and all of them selected baseball. That doesn’t make sense, because they know there are lots of people at school who like other sports. Their sample size was too small, which can also make it biased. A good sample is random and has a large enough sample size. Try this yourself: A musician wants to know if people like his new song. He asks 5 people at his most recent concert. Is this a good sample?
Samples can be biased. Emily and Amari need to decide how to get answers to their question. They could ask everyone in the school, but there are over 1,000 students! That would take a long time. They should only ask a sample of the school population. What if they ask everyone on the soccer team? That sample is biased; the soccer players will likely vote for soccer. Instead, Emily and Amari could sample people at lunchtime. Amari samples 10 people, and all of them selected baseball. That doesn’t make sense, because they know there are lots of people at school who like other sports. Their sample size was too small, which can also make it biased. A good sample is random and has a large enough sample size. Try this yourself: A musician wants to know if people like his new song. He asks 5 people at his most recent concert. Is this a good sample?You can compare datasets using line plots and means.
A team of scientists study lizards on a small island and want to know if there is a difference in lengths of male and female lizards. Since there are only about 100 lizards on the island, they sample 20 female and 20 male lizards. The scientists summarize their data in a line plot. You can describe a data set using measures of center, like mean, median, and mode. Amari calculates the mean of the female lizard lengths by adding all the measurements and dividing by the sample size. The mean length for females is 16 cm. The mean length for the male lizards is 20 cm. On the island, the male lizards are about 4 cm longer than the female lizards. You can also compare the data sets based on their range or median. The female lizard lengths are between 6 cm and 21 cm. The male lizard lengths are between 10 cm and 24 cm. What measure you use depends on what question you want to answer. Try this yourself: If another sample from the island has 4 lizards that measure 21 cm, 24 cm, 30 cm, and 10 cm, what is the mean?
You can compare datasets using line plots and means. A team of scientists study lizards on a small island and want to know if there is a difference in lengths of male and female lizards. Since there are only about 100 lizards on the island, they sample 20 female and 20 male lizards. The scientists summarize their data in a line plot. You can describe a data set using measures of center, like mean, median, and mode. Amari calculates the mean of the female lizard lengths by adding all the measurements and dividing by the sample size. The mean length for females is 16 cm. The mean length for the male lizards is 20 cm. On the island, the male lizards are about 4 cm longer than the female lizards. You can also compare the data sets based on their range or median. The female lizard lengths are between 6 cm and 21 cm. The male lizard lengths are between 10 cm and 24 cm. What measure you use depends on what question you want to answer. Try this yourself: If another sample from the island has 4 lizards that measure 21 cm, 24 cm, 30 cm, and 10 cm, what is the mean?A histogram shows the center and spread of data.
Two 7th grade classes made water rockets and measured how far the rockets went. Adesina uses histograms to compare the distances for each class's water rockets. A histogram is like a bar graph for continuous data. Along the bottom she makes bins for the distances traveled by the rockets, and the side has how many rockets went each distance. A histogram doesn't show the actual measures of center, like mean, median, and mode, but it does provide a useful visual of the data. Since graphs are meant to communicate information, the histogram does this well. Try this one yourself: Choose a statistical question whose answer is a distance or length. Gather data and display it in a histogram.
A histogram shows the center and spread of data. Two 7th grade classes made water rockets and measured how far the rockets went. Adesina uses histograms to compare the distances for each class's water rockets. A histogram is like a bar graph for continuous data. Along the bottom she makes bins for the distances traveled by the rockets, and the side has how many rockets went each distance. A histogram doesn't show the actual measures of center, like mean, median, and mode, but it does provide a useful visual of the data. Since graphs are meant to communicate information, the histogram does this well. Try this one yourself: Choose a statistical question whose answer is a distance or length. Gather data and display it in a histogram.You can compare datasets using a histogram.
Looking at the rocket histogram for Mrs. Lin's class, Emily notices some small bars at very short and very long distances. The taller bars are near the middle. The short distances mean the rockets didn't fire right. The long distances mean the rocket did very well. The histogram for Mr. Doyle's class looks different. Most of the rockets went 15-20 meters and very few went longer or shorter. This histogram shows that all the rockets worked in Mr. Doyle's class, but none went extremely far. The data are more consistent than in Mrs. Lin's class. Try this yourself: Imagine a scenario where you sample two populations and want to compare them. Sketch two possible histograms that would show that the groups have about the same center, but one is more spread out than the other.
You can compare datasets using a histogram. Looking at the rocket histogram for Mrs. Lin's class, Emily notices some small bars at very short and very long distances. The taller bars are near the middle. The short distances mean the rockets didn't fire right. The long distances mean the rocket did very well. The histogram for Mr. Doyle's class looks different. Most of the rockets went 15-20 meters and very few went longer or shorter. This histogram shows that all the rockets worked in Mr. Doyle's class, but none went extremely far. The data are more consistent than in Mrs. Lin's class. Try this yourself: Imagine a scenario where you sample two populations and want to compare them. Sketch two possible histograms that would show that the groups have about the same center, but one is more spread out than the other.Good data comes from good sampling practices.
The data displayed in a histogram is only valid if it comes from a good sample. Since every water rocket in both classes was measured, the data set is as good as it can be. But if the population is larger, say 20,000 people, measuring everyone is impractical. Then a sample must be taken. Random samples give the best data and need to be large enough to capture the information you need without making the sampling impractical. Try this one yourself: Choose a statistical question. Design a sampling method that would give biased samples.
Good data comes from good sampling practices. The data displayed in a histogram is only valid if it comes from a good sample. Since every water rocket in both classes was measured, the data set is as good as it can be. But if the population is larger, say 20,000 people, measuring everyone is impractical. Then a sample must be taken. Random samples give the best data and need to be large enough to capture the information you need without making the sampling impractical. Try this one yourself: Choose a statistical question. Design a sampling method that would give biased samples. -
Practice Word Problems
-
Practice Number Problems
-
Teacher Resources
These downloadable teacher resources can help you create a full lesson around the video. These PDFs incorporate using class discussion questions, vocabulary lists, printable math worksheets, quizzes, games, and more.
Select a Google Form
Choose a way to play this quiz game
-
Questions appear on the teacher's screen. Students answer on their own devices.
Start a Free Trial Today. Get a $5 Amazon Gift Card!
Teachers! Start a free trial & we'll send your gift card within 1 day. Only cards left. Try it now.
This email is associated with a Science Kit subscription. Kit subscriptions are managed on this separate page: Manage Subscription
-
Science & Math$/yr
-
Science Only$/yr
What is a random sample?
Identify a statistical question you would like to answer.
Describe how the sample size impacts the quality of the data you collect.
access all lessons
• No credit card required •
"My students loved the videos. I started the video subscription in May and used them as a review before the state test, which I know contributed to 100% of my class passing the state test."
Rhonda Fox 4th Grade Teacher, Ocala, Florida• No credit card required •
"My students loved the videos. I started the video subscription in May and used them as a review before the state test, which I know contributed to 100% of my class passing the state test."
Rhonda Fox 4th Grade Teacher, Ocala, Florida• No credit card required •
Already a member? Sign In
* no credit card required *
* no credit card required *
* no credit card required *
no credit card required
Skip, I will use a 3 day free trial
Enjoy your free 30 days trial
-
Unlimited access to our full library
of videos & lessons for grades K-5. -
You won’t be billed unless you keep your
account open past your 14-day free trial. -
You can cancel anytime in 1 click on the
manage account page or by emailing us.
-
Unlimited access to our full library of videos & lessons for grades K-5.
-
You won't be billed unless you keep your account open past 14 days.
-
You can cancel anytime in 1-click on the manage account page.
Cancel anytime in 1-click on the manage account page before the trial ends and you won't be charged.
Otherwise you will pay just $10 CAD/month for the service as long as your account is open.
Cancel anytime on the manage account page in 1-click and you won't be charged.
Otherwise you will pay $10 CAD/month for the service as long as your account is open.
We just sent you a confirmation email. Enjoy!
DonePlease login or join.