If a fair coin is tossed 10 times, what will we see? 24.61% 20.51% 20.51% 11.72% 11.72% 4.39% 4.39% 0.98% 0.98% 0.098% 0.098%

Similar documents
CHAPTER 2 Modeling Distributions of Data

(b) Express the event of getting a sum of 12 when you add up the two numbers in the tosses.

STA 103: Midterm I. Print clearly on this exam. Only correct solutions that can be read will be given credit.

Confidence Interval Notes Calculating Confidence Intervals

AP Statistics Midterm Exam 2 hours

Smoothing the histogram: The Normal Curve (Chapter 8)

5.1A Introduction, The Idea of Probability, Myths about Randomness

Skills Practice Skills Practice for Lesson 17.1

Bouncing Ball A C T I V I T Y 8. Objectives. You ll Need. Name Date

2016 Sports Performance Presidents Day Challenge

Lesson 20: Estimating a Population Proportion

MATH 118 Chapter 5 Sample Exam By: Maan Omran

Chapter 5 - Probability Section 1: Randomness, Probability, and Simulation

Common Plots Conditional Verification. Dimitra Boucouvala & WG5

MATH 114 QUANTITATIVE REASONING PRACTICE TEST 2

Exploring Measures of Central Tendency (mean, median and mode) Exploring range as a measure of dispersion

% per year Age (years)

How to evaluate pressure measurement specifications

Experimental & Theoretical Probability

Practice Test Unit 6B/11A/11B: Probability and Logic

Practice Test Unit 06B 11A: Probability, Permutations and Combinations. Practice Test Unit 11B: Data Analysis

Two Machine Learning Approaches to Understand the NBA Data

Expected Value 3.1. On April 14, 1993, during halftime of a basketball game between the. One-and-One Free-Throws

CHAPTER 5 Probability: What Are the Chances?

Building an NFL performance metric

Section 5.1 Randomness, Probability, and Simulation

Confidence Intervals with proportions

Project Title: Overtime Rules in Soccer and their Effect on Winning Percentages

101 Ways to. Way #27 Understanding the Odds.

Year 10 Term 2 Homework

The term event is also often used in conjunction with sports. For example, the gymnastics event will be held at the Worthington Arena.

First Server Advantage in Tennis. Michelle Okereke

Calculation of Trail Usage from Counter Data

Probability: Bernoulli Trials, Expected Value, and More About Human Beings!

Lesson 20: Estimating a Population Proportion

The Normal Distribution, Margin of Error, and Hypothesis Testing. Additional Resources

living with the lab control of salinity 2012 David Hall

Denise L Seman City of Youngstown

Math 1070 Sample Final Exam Spring 2016

Beamex. Calibration White Paper. Weighing scale calibration - How to calibrate weighing instruments

SAP Predictive Analysis and the MLB Post Season

#1 Accurately Rate and Rank each FBS team, and

SPORTS STARS WARS: WHERE TO BUILD THE NEW ARENA(S) MEADOWLANDS PREFERRED

b) (2 pts.) Does the study show that drinking 4 or more cups of coffee a day caused the higher death rate?

Statistician s Report on Bonus Points 2016

Using New Iterative Methods and Fine Grain Data to Rank College Football Teams. Maggie Wigness Michael Rowell & Chadd Williams Pacific University

Student Exploration: Ripple Tank

PLEASE MARK YOUR ANSWERS WITH AN X, not a circle! 1. (a) (b) (c) (d) (e) 2. (a) (b) (c) (d) (e) (a) (b) (c) (d) (e) 4. (a) (b) (c) (d) (e)...

Chapter 4 Displaying Quantitative Data

Highway & Transportation (I) ECIV 4333 Chapter (4): Traffic Engineering Studies. Spot Speed

Chapter 5: Methods and Philosophy of Statistical Process Control

Improve pipetting results in pharmaceutical formulation by using MICROMAN E

PURE SUBSTANCE. Nitrogen and gaseous air are pure substances.

Determining Occurrence in FMEA Using Hazard Function

Opleiding Informatica

Statistics Class 3. Jan 30, 2012

TOPIC 10: BASIC PROBABILITY AND THE HOT HAND

Individual Behavior and Beliefs in Parimutuel Betting Markets

1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data.

NAME: Math 403 Final Exam 12/10/08 15 questions, 150 points. You may use calculators and tables of z, t values on this exam.

Should bonus points be included in the Six Nations Championship?

CHARACTERISTICS OF LEAK DETECTION BASED ON DIFERENTIAL PRESSURE MEASUREMENT

Guide for Reading. Vocabulary compressibility

GameTheoryNotes.notebook March 01, Game Theory. First, for each possible outcome, determine it's probability and it's "value"

GCSE style questions arranged by topic

March Madness Basketball Tournament

Progressive Practice Drills

a) When measuring the hallway or class, did you have any systematic errors? b) Estimate the size of your random errors using each technique.

Lab 2: Probability. Hot Hands. Template for lab report. Saving your code

STT 315 Section /19/2014

Chapter 2: Modeling Distributions of Data

PLEASE MARK YOUR ANSWERS WITH AN X, not a circle! 1. (a) (b) (c) (d) (e) 2. (a) (b) (c) (d) (e) (a) (b) (c) (d) (e) 4. (a) (b) (c) (d) (e)...

Just How Big are the Pockets, Anyway? Part II

Solutionbank S1 Edexcel AS and A Level Modular Mathematics

Quantitative Literacy: Thinking Between the Lines

Gas Laws. Introduction

Wildlife Ad Awareness & Attitudes Survey 2015

Marking Guidelines 2009 examination June series. Physics Investigative Skills Assignment (ISA) Q. General Certificate of Education

Traffic Parameter Methods for Surrogate Safety Comparative Study of Three Non-Intrusive Sensor Technologies

Money Lost or Won -$5 +$3 +$7

March Madness Basketball Tournament

6-16. Most students will find that Myriah does the dishes between 8 and 17 of the 30 days, or about 2 to 4 days per week.

Agood tennis player knows instinctively how hard to hit a ball and at what angle to get the ball over the. Ball Trajectories

NCSS Statistical Software

Mathematics 43601H. Probability. In the style of General Certificate of Secondary Education Higher Tier. Past Paper Questions by Topic TOTAL

Golden Valley Little League Evaluation Stations Outline

Determining the Outcome of a Match

Advanced driver assistance systems status and future potential

Examples of Dependent and Independent Events

Honest Mirror: Quantitative Assessment of Player Performances in an ODI Cricket Match

Policy Management: How data and information impacts the ability to make policy decisions:

Freak Waves: A Suggested Definition and Possible Consequences for Marine Structures

DETERMINING WHAT NERF GUN IS THE MOST ACCURATE. Finn K. Cary Academy ABSTRACT

Algebra 1 Unit 6 Study Guide

Contingent Valuation Methods

ABOUT THE REPORT. This is a sample report. Report should be accurate but is not

Equine Injury Database Update and Call for More Data

ADVANCED TACTICS: CONCEPTS, FEATURES AND 5 GAME SITUATIONS

Mathematics of Pari-Mutuel Wagering

Drilling Efficiency Utilizing Coriolis Flow Technology

Transcription:

Coin tosses If a fair coin is tossed 10 times, what will we see? 30% 25% 24.61% 20% 15% 10% Probability 20.51% 20.51% 11.72% 11.72% 5% 4.39% 4.39% 0.98% 0.98% 0.098% 0.098% 0 1 2 3 4 5 6 7 8 9 10 Number of H in ten tosses Probability histogram for the number of H in 10 tosses of a fair coin. 1

Observations: The probability of seeing exactly 5 H in 10 tosses is just below 25%. The probability that the number of H is between 4 and 6 is close to 66%. The probability that the number of H is between 3 and 7 is about 89%. We could summarize these numbers by saying that we will probably see about 5 H in 10 tosses of a fair coin. 2

More coin tosses If a fair coin is tossed 100 times, what will we see? 9% 8% 7% Probability 6% 5% 4% 3% 2% 1% 0 35 40 45 50 55 60 65 70 Number of H in 100 tosses of a fair coin Probability histogram for the number of H in 100 tosses of a fair coin. 3

Observations: The probability of seeing exactly 50 H in 100 tosses is 7.96%. The probability that the number of H in 100 tosses is between 49 and 51 is 23.56%. The probability that the number of H in 100 tosses is between 48 and 52 is 38.26%. The probability that the number of H in 100 tosses is between 47 and 53 is 51.58%. The probability that the number of H in 100 tosses is between 46 and 54 is 63.18%. 4

9% 8% 7% Probability 6% 5% 4% 3% 2% 72.88% 1% 0 35 40 45 50 55 60 65 70 Number of H in 100 tosses of a fair coin The probability that the number of H in 100 tosses is between 45 and 55 is 72.88%. Analogously to the 10-toss scenario, we can say that in 100 tosses of a fair coin, we will probably see about 50 H. 5

Question: In which of the two scenarios 10 tosses or 100 tosses is our prediction more accurate? Answer: It depends on how we are measuring the accuracy. In terms of the number of H, the prediction for 10 tosses gives a narrower range of possible values with a higher probability. In terms of the proportion of H, the prediction for 100 tosses gives a narrower range of possible percentages with higher probability. The probability is 66% that percentage of H in 10 tosses will be between 40% and 60%. The probability is 63.18% that the percentage of H in 100 tosses will be between 46% and 54%. The probability is 72% that percentage of H in 100 tosses will be between 45% and 55%. 6

(*) When we toss a coin 10 times, the expected number of H is 5. (*) When we toss a coin 100 times, the expected number of H is 50. Question. Why are the observed numbers of H different from the expected numbers? Answer: Chance error. (*) As the number of tosses grows the size of the chance error increases (on average)... (*)... but the relative size of the chance error decreases (on average). relative size of chance error = size of chance error number of tosses 7

(*) The expected number of H in n tosses is n 2. (*) The average size of the chance error in n tosses is This increases with the number of tosses. n 2. (*) The relative size of the chance error in n tosses is n/2 n = 1 2 n. This decreases with the number of tosses. 8

These phenomena are not limited to coin tosses... Ten tickets are drawn at random with replacement from a box that contains three red tickets and seven blue tickets... ( The 3R7B box ) Questions: 1. How many red tickets do you expect to see when you draw 10 tickets from the 3R7B box? 2. Why do we expect this number? 3. How accurate is the answer to the first question likely to be? I.e., how close is the observed number of red tickets likely to be to the expected number of red tickets? 9

Answers: 1. We expect (about) 3 red tickets in 10 draws... 2....because it seems reasonable that the distribution of the observed tickets (red or blue) will match the distribution of tickets in the box. 3. To answer question 3., we need to find the probabilities of all of the possible outcomes. (*) The possible numbers of red tickets are 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, and for 0 k 10, ( ) 10 P (k red tickets in 10 draws) = (0.3) k (0.7) 10 k k E.g., P (2 red tickets in 10 draws) = ( 10 2 ) (0.3) 2 (0.7) 8 0.2335 10

We can collect all the probabilities in a histogram... 30% 26.68% 25% 23.35% 20% 15% 10% Probability 12.11% 20.01% 10.29% 5% 2.82% 3.68% 0 1 2 3 4 5 6 7 8 9 10 0.9% 0.14% Number of red tickets in ten draws from box with 3 red and 7 blue tickets 0.01% 0.0006% 11

(*) The probability that the number of red tickets in 10 draws is between 2 and 4 ( about 3 ) is almost 70%. (*) The small number of draws is a little misleading (as it was in the case of coin tosses). (*) As the number of draws gets bigger, the observed number of red tickets is more and more likely to be farther and farther from the expected number of reds tickets. (*) As the number of draws increases, the chance error also increases (on average). chance error = observed number expected number. 12

Probability histogram for the number of red tickets observed in 20 random draws (with replacement) from the 3R7B box. 13

Probability histogram for the number of red tickets observed in 100 random draws (with replacement) from the 3R7B box. 14

Observations: The most likely number of red tickets in all three examples is P (red ticket in one draw) (number of draws). This is the expected number of red tickets in each case. The probability that we see precisely the expected number of red tickets decreases as the number of draws increases. From about 27% in 10 draws, to about 19% in 20 draws, to about 8.5% in 100 draws, to about 2.75% in 1000 draws. The probability that the number of red tickets is close (e.g., within 2) of the expected number also decreases: P (between 1 and 5 red tickets in 10 draws) = 92.45% P (between 4 and 8 red tickets in 20 draws) = 77.96% P (between 28 and 32 red tickets in 100 draws) = 41.43% P (between 298 and 302 red tickets in 1000 draws) = 13.69% 15

As the number of draws from the box increases, the chance increases that the observed number of red tickets will deviate significantly from the expected number of red tickets: In 10,000 draws from the 3R7B box, the probability that the number of red tickets is more than 30 away from 3000 is about 50.57% In 1,000,000 draws from the 3R7B box, the probability that the number of red tickets is more than 300 away from 300,000 is about 51.2%. 16

Proportions, not numbers It is more useful to compare the observed percentage of red tickets drawn to the expected percentage. The expected percentage of red tickets is the same as the percentage of red tickets in the box, namely 30%. In 10 draws from the 3R7B box, the chance is about 0.70 that 20% <(observed % of red tickets)< 40%. In 20 draws from the 3R7B box, the chance is about 0.78 that 20% <(observed % of red tickets)< 40%. In 100 draws from the 3R7B box, the chance is about 0.98 that 20% <(observed % of red tickets)< 40%. In 1000 draws from the 3R7B box, the chance is more than 0.99 that 20% <(observed % of red tickets)< 40%. 17

In 1000 draws from the 3R7B box, the chance is about 0.85 that 28% <(observed % of red tickets)< 32%. In 10000 draws from the 3R7B box, the chance is more than 0.99 that 28% <(observed % of red tickets)< 32%. In 10000 draws from the 3R7B box, the chance is about 0.97 that 29% <(observed % of red tickets)< 31%. In 10000 draws from the 3R7B box, the chance is about 0.73 that 29.5% <(observed % of red tickets)< 30.5%. In 1,000,000 draws from the 3R7B box, the chance is more than 0.99 that 29.5% <(observed % of red tickets)< 30.5%. Summarizing: As the number of draws from the 3R7B box increases, the chance approaches 100% that the observed percentage of red tickets is very close to the expected percentage. 18

In other words, If many tickets are drawn at random with replacement from the 3R7B box, then it is very likely that about 30% of the tickets will be red. This is called the Law of Averages (for the 3R7B box). The law of averages does not say that......we will definitely see exactly 30% red tickets or... we will probably see exactly 30% red tickets. or... we will definitely see about 30% red tickets. There is always a chance that the observed percentage of red tickets will be far from the expected percentage, even for an extremely large number of draws. 19

The law of averages, more generally: If tickets are drawn from a box containing red tickets and blue tickets, then as the number of draws increases, the probability approaches 100% that the observed percentage of red tickets is very close to the expected percentage of red tickets (= the percentage of red tickets in the box). Comments: The law is true for draws with replacement and for draws without replacement. The results are even sharper when the draws are done without replacement (because the chance error is smaller in this case). The difference between the observed number of red tickets and the expected number of red tickets is likely to get bigger as the number of draws (with replacement) grows. The law of averages does not say anything about what will happen on the next draw. 20