Today's plan: Section 4.2: Normal Distribution

Transcription:

1 Today's plan: Section 4.2: Normal Distribution

2 Characteristics of a data set: mean, median, standard deviation, five-number summary. These don't always tell you everything, though.

3 Bar graph for scores of Math 109 quiz:

5 [Figure: histogram of the Math 109 quiz scores; x-axis: Score (0 to 25), y-axis: Frequency (0 to 15)] It's somewhat close to a bell-shaped curve.

6 Definition We call that bell-shaped curve a normal curve. A data set whose distribution is a normal curve has a normal distribution.

7 The quiz scores in Math 109 have a near-normal distribution: it's close, but not super close, to the normal curve. Note the outliers, the spike at 15, and that the whole picture is a little skewed to the right.

8 Non-normal distributions:

9 The first one has an inverted bell shape. The second one is called a bimodal distribution. Lots of data sets do have normal or near-normal distributions, though, so we're interested in them.

10 Example (SAT scores)
Score         Freq.
[200, 250)     11,393
[250, 300)     20,874
[300, 350)     46,583
[350, 400)     94,037
[400, 450)    143,630
[450, 500)    193,470
[500, 550)    187,773
[550, 600)    158,138
[600, 650)    115,401
[650, 700)     66,066
[700, 750)     30,503
[750, 800]     16,857
[Figure: histogram of the score frequencies above; x-axis: score, 200 to 800; y-axis ticks: 0 to 200]
This is a normal distribution.

11 Section 4.2.2: Properties of Normal Distributions

12 Normal distributions approximate real-life distributions. Knowing the properties of normal curves, we can approximate the properties of near-normal data sets.

13 Normal curves can vary, but they all have common properties.

14 Symmetry: A normal curve is symmetric about the vertical line at the mean µ. This line is called its axis of symmetry.

15 Concavity: A normal curve has a concave up region on the left, a concave down region in the middle, and a concave up region on the right.

16 Inflection points: The points where the curve changes concavity are called inflection points. There are two inflection points, each one standard deviation from the center.

17 68-95-99.7 Principle: Tells where the data points should be. 68% of the data set is located under the concave down part of the curve, that is, within σ of µ. 95% of the data points are within 2σ of µ. 99.7% of the data points are within 3σ of µ.
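
As a rough check on where the numbers 68, 95, and 99.7 come from, the sketch below computes the share of an ideal normal distribution that lies within k standard deviations of the mean. It relies on Python's standard-library `statistics.NormalDist`; the helper name `share_within` is made up for this illustration and is not part of the lecture.

```python
from statistics import NormalDist

def share_within(k: float) -> float:
    """Fraction of a normal distribution within k standard deviations of the mean."""
    z = NormalDist()  # standard normal curve: mu = 0, sigma = 1
    return z.cdf(k) - z.cdf(-k)

for k in (1, 2, 3):
    print(f"within {k} sigma: {share_within(k):.1%}")
# within 1 sigma: 68.3%
# within 2 sigma: 95.4%
# within 3 sigma: 99.7%
```

The principle rounds the first two values to 68% and 95%.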

18 Quartiles: Q1 = µ − 0.675σ and Q3 = µ + 0.675σ. So 50% of data points are within 0.675σ of µ.
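
The constant 0.675 can be checked numerically. This is a minimal sketch, assuming an ideal normal curve and using the standard-library `statistics.NormalDist`; the variable name `z_q3` is only illustrative.

```python
from statistics import NormalDist

# Number of standard deviations above the mean below which 75% of the curve lies.
z_q3 = NormalDist().inv_cdf(0.75)
print(round(z_q3, 3))  # 0.674, which the slides quote as 0.675
```

By symmetry, the first quartile sits the same distance below the mean.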

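19 [Figure: normal curve with the axis of symmetry at µ, the two inflection points at µ − σ and µ + σ, and tick marks at µ − 3σ, µ − 2σ, µ − σ, µ, µ + σ, µ + 2σ, µ + 3σ; the regions within 1, 2, and 3 standard deviations of µ are labeled 68%, 95%, and 99.7%]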

20 For near-normal distributions, the above properties should be taken as approximations.

21 Example: Recall the Math 109 quiz. [Figure: histogram of the Math 109 quiz scores; x-axis: Score (0 to 25), y-axis: Frequency (0 to 15)]

22
score      4   5   8   9  10  11  12  13  14  15  16  17  18  19  20  21  22  25
freq.      1   1   2   2   3   5   9  12  11  13   9   8   7   5   3   2   1   1
cum fr.    1   2   4   6   9  14  23  35  46  59  68  76  83  88  91  93  94  95
We found earlier: µ = 14.64 and σ = 3.35. Since µ − σ = 11.29 and µ + σ = 17.99, the scores within one σ of µ are 12 through 17. That is 62 out of 95 scores, or (62/95) × 100% = 65.3%, which is close to the ideal 68%.

23 Number of scores within 2σ of µ: µ − 2σ = 7.94 and µ + 2σ = 21.34, so 91 out of 95 (i.e., 95.8%), close to the ideal 95%. Finally, µ − 3σ = 4.59 and µ + 3σ = 24.69, so 93 out of 95 (i.e., 97.9%) points are within 3σ of µ. All of this is close to the ideal 68-95-99.7 principle.
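
The counts above can be reproduced directly from the frequency table. The following is a small sketch, not the lecture's own method: it takes µ = 14.64 and σ = 3.35 as quoted on the slides (they are not recomputed here), and the names `freq` and `count_within` are chosen only for this illustration.

```python
# Frequency table for the Math 109 quiz: score -> number of students.
freq = {4: 1, 5: 1, 8: 2, 9: 2, 10: 3, 11: 5, 12: 9, 13: 12, 14: 11,
        15: 13, 16: 9, 17: 8, 18: 7, 19: 5, 20: 3, 21: 2, 22: 1, 25: 1}

MU, SIGMA = 14.64, 3.35  # values quoted in the lecture (assumed, not recomputed)

def count_within(k: int) -> int:
    """Number of scores within k standard deviations of the mean."""
    lo, hi = MU - k * SIGMA, MU + k * SIGMA
    return sum(n for score, n in freq.items() if lo <= score <= hi)

total = sum(freq.values())
for k in (1, 2, 3):
    c = count_within(k)
    print(f"within {k} sigma: {c} of {total} = {c / total:.1%}")
# within 1 sigma: 62 of 95 = 65.3%
# within 2 sigma: 91 of 95 = 95.8%
# within 3 sigma: 93 of 95 = 97.9%
```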

24 We can also estimate the quartiles for the Math 109 quizzes: Q1 ≈ µ − 0.675σ = 12.38 and Q3 ≈ µ + 0.675σ = 16.90. (Faster than actually calculating them.)
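
To see how good these estimates are, one can compare them with quartiles computed from the raw scores. This is only a sketch: it uses Python's `statistics.quantiles` with its default "exclusive" convention, which may differ from the quartile convention used in the course, and the variable names are illustrative.

```python
from statistics import quantiles

# Frequency table for the Math 109 quiz: score -> number of students.
freq = {4: 1, 5: 1, 8: 2, 9: 2, 10: 3, 11: 5, 12: 9, 13: 12, 14: 11,
        15: 13, 16: 9, 17: 8, 18: 7, 19: 5, 20: 3, 21: 2, 22: 1, 25: 1}

MU, SIGMA = 14.64, 3.35  # values quoted in the lecture (assumed, not recomputed)

# Normal-curve estimates of the quartiles.
q1_est = MU - 0.675 * SIGMA
q3_est = MU + 0.675 * SIGMA

# Expand the table into the list of all 95 scores and compute quartiles from it.
scores = [score for score, n in freq.items() for _ in range(n)]
q1, median, q3 = quantiles(scores, n=4)

print(f"estimated: Q1 = {q1_est:.2f}, Q3 = {q3_est:.2f}")
print(f"from data: Q1 = {q1}, Q3 = {q3}")
```

Under that convention the data quartiles come out as 13 and 17, so the estimates 12.38 and 16.90 are within one point of each.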

25 Conclusion: If you know that the distribution of data points is normal, it is enough to know just 2 numbers to describe the data set completely: the mean µ and the standard deviation σ.

26 Section 4.2.3: Using the 68-95-99.7 Principle

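27 Each tail outside one σ contains (100% − 68%)/2 = 16%. [Figure: two normal curves; in the first, 16% of the area lies below µ − σ and 84% above it; in the second, 84% lies below µ + σ and 16% above it]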

28 This means that a random point has a 16% chance of being below µ − σ and an 84% chance of being above µ − σ: pr(x < µ − σ) = 0.16 and pr(x ≥ µ − σ) = 0.84.

29 Similarly, each tail outside 2σ contains (100% − 95%)/2 = 2.5%. [Figure: normal curve with 2.5% of the area below µ − 2σ and 97.5% above it]
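
The tail arithmetic can be written as one small function. The sketch below also prints the exact one-tail areas of an ideal normal curve for comparison, using the standard-library `statistics.NormalDist`; the function name `rule_tail` is hypothetical.

```python
from statistics import NormalDist

def rule_tail(percent_within: float) -> float:
    """Percent in ONE tail, given the percent within k sigma of the mean."""
    return (100 - percent_within) / 2

print(rule_tail(68))  # 16.0
print(rule_tail(95))  # 2.5

# Exact one-tail areas below mu - sigma and below mu - 2*sigma.
z = NormalDist()
print(f"{z.cdf(-1):.1%}, {z.cdf(-2):.1%}")  # 15.9%, 2.3%
```

The rule-of-thumb values 16% and 2.5% are slightly rounded versions of the exact areas.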

30 Example: The weights of 6-month-old baby boys in the U.S. have a near-normal distribution with µ = 17.25 lb and σ = 2 lb.

31 Example Questions: (a) What can we say about babies whose weight is 19.25 lb? (b) If we pick a random baby, what's the probability he weighs less than 13.25 lb?

32 Solution (a): The baby whose weight is 19.25 lb is certainly above the average. Moreover, his weight 19.25 = 17.25 + 2 = µ + σ is precisely one standard deviation above the average.

33 Solution (a), continued: Thus 84% of babies weigh less than him and 16% of babies weigh more than him. We say that his weight is in the 84th percentile.

34 Solution (b): A weight of 13.25 lb or less places a baby at least 2 standard deviations below the average: 13.25 = 17.25 − 2 · 2 = µ − 2σ. The probability of that is 2.5%.
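
Both parts of the example follow the same two steps: express the weight as a number of standard deviations from the mean, then look up the matching share from the 68-95-99.7 principle. A minimal sketch of that reasoning follows; the names `z_score` and `RULE_BELOW` are made up here, and the lookup table covers only whole numbers of standard deviations from −2 to 2.

```python
MU, SIGMA = 17.25, 2.0  # mean and standard deviation in lb, from the example

# Percent of data BELOW mu + k*sigma according to the 68-95-99.7 principle.
RULE_BELOW = {-2: 2.5, -1: 16.0, 0: 50.0, 1: 84.0, 2: 97.5}

def z_score(weight: float) -> float:
    """How many standard deviations a weight lies above (+) or below (-) the mean."""
    return (weight - MU) / SIGMA

for w in (19.25, 13.25):
    z = z_score(w)
    print(f"{w} lb: z = {z:+.0f}, about {RULE_BELOW[round(z)]}% of babies weigh less")
# 19.25 lb: z = +1, about 84.0% of babies weigh less
# 13.25 lb: z = -2, about 2.5% of babies weigh less
```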

35 Next time: Section 4.3: Data Collection; Section 4.3.1: Population v. Sample