STANDARD SCORES AND THE NORMAL DISTRIBUTION

Similar documents
Today s plan: Section 4.2: Normal Distribution

PSY201: Chapter 5: The Normal Curve and Standard Scores

% per year Age (years)

Frequency Distributions

Descriptive Statistics. Dr. Tom Pierce Department of Psychology Radford University

The pth percentile of a distribution is the value with p percent of the observations less than it.

Chapter 2: Modeling Distributions of Data

Warm-up. Make a bar graph to display these data. What additional information do you need to make a pie chart?

Mrs. Daniel- AP Stats Ch. 2 MC Practice

Internet Technology Fundamentals. To use a passing score at the percentiles listed below:

CHAPTER 2 Modeling Distributions of Data

Name Date Period. E) Lowest score: 67, mean: 104, median: 112, range: 83, IQR: 102, Q1: 46, SD: 17

Unit 3 - Data. Grab a new packet from the chrome book cart. Unit 3 Day 1 PLUS Box and Whisker Plots.notebook September 28, /28 9/29 9/30?

Chapter 6 The Standard Deviation as a Ruler and the Normal Model

Full file at

Aim: Normal Distribution and Bell Curve

Quantitative Literacy: Thinking Between the Lines

How are the values related to each other? Are there values that are General Education Statistics

3.3 - Measures of Position

STAT 101 Assignment 1

Running head: DATA ANALYSIS AND INTERPRETATION 1

March Madness Basketball Tournament

Week 7 One-way ANOVA

March Madness Basketball Tournament

1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data.

Exploring Measures of Central Tendency (mean, median and mode) Exploring range as a measure of dispersion

Solutionbank S1 Edexcel AS and A Level Modular Mathematics

Histogram. Collection

IHS AP Statistics Chapter 2 Modeling Distributions of Data MP1

(c) The hospital decided to collect the data from the first 50 patients admitted on July 4, 2010.

DESCRIBE the effect of adding, subtracting, multiplying by, or dividing by a constant on the shape, center, and spread of a distribution of data.

Unit 3 ~ Data about us

STT 315 Section /19/2014

Organizing Quantitative Data

CHAPTER 2 Modeling Distributions of Data

Bivariate Data. Frequency Table Line Plot Box and Whisker Plot

Psychology - Mr. Callaway/Mundy s Mill HS Unit Research Methods - Statistics

CHAPTER 2 Modeling Distributions of Data

Fundamentals of Machine Learning for Predictive Data Analytics

DS5 The Normal Distribution. Write down all you can remember about the mean, median, mode, and standard deviation.

4-3 Rate of Change and Slope. Warm Up. 1. Find the x- and y-intercepts of 2x 5y = 20. Describe the correlation shown by the scatter plot. 2.

Reminders. Homework scores will be up by tomorrow morning. Please me and the TAs with any grading questions by tomorrow at 5pm

Math 243 Section 4.1 The Normal Distribution

SAMPLE RH = P 1. where. P 1 = the partial pressure of the water vapor at the dew point temperature of the mixture of dry air and water vapor

Measuring Relative Achievements: Percentile rank and Percentile point

Data Set 7: Bioerosion by Parrotfish Background volume of bites The question:

Age of Fans

Descriptive Statistics

Averages. October 19, Discussion item: When we talk about an average, what exactly do we mean? When are they useful?

Algebra 1 Unit 7 Day 2 DP Box and Whisker Plots.notebook April 10, Algebra I 04/10/18 Aim: How Do We Create Box and Whisker Plots?

AP STATISTICS Name Chapter 6 Applications Period: Use summary statistics to answer the question. Solve the problem.

Math 1040 Exam 2 - Spring Instructor: Ruth Trygstad Time Limit: 90 minutes

Descriptive Stats. Review

How Fast Can You Throw?

THE NORMAL DISTRIBUTION COMMON CORE ALGEBRA II

Statistics. Wednesday, March 28, 2012

b) (2 pts.) Does the study show that drinking 4 or more cups of coffee a day caused the higher death rate?

Stat 139 Homework 3 Solutions, Spring 2015

Effective Use of Box Charts

Diameter in cm. Bubble Number. Bubble Number Diameter in cm

In the actual exam, you will be given more space to work each problem, so work these problems on separate sheets.

MATH 118 Chapter 5 Sample Exam By: Maan Omran

WHAT IS THE ESSENTIAL QUESTION?

STAT 155 Introductory Statistics. Lecture 2-2: Displaying Distributions with Graphs

Unit 6 Day 2 Notes Central Tendency from a Histogram; Box Plots

In my left hand I hold 15 Argentine pesos. In my right, I hold 100 Chilean

Practice Test Unit 6B/11A/11B: Probability and Logic

Section 3.2: Measures of Variability

9.3 Histograms and Box Plots

A Hare-Lynx Simulation Model

Practice Test Unit 06B 11A: Probability, Permutations and Combinations. Practice Test Unit 11B: Data Analysis

WorkSHEET 13.3 Univariate data II Name:

Math 227 Test 1 (Ch2 and 3) Name

Handicap Differential = (Adjusted Gross Score - USGA Course Rating) x 113 / USGA Slope Rating

CHAPTER 1 ORGANIZATION OF DATA SETS

Homework Exercises Problem Set 1 (chapter 2)

THE USGA HANDICAP SYSTEM. Reference Guide

Calculation of Trail Usage from Counter Data

5.1. Data Displays Batter Up. My Notes ACTIVITY

4-3 Rate of Change and Slope. Warm Up Lesson Presentation. Lesson Quiz

Chapter 3.4. Measures of position and outliers. Julian Chan. September 11, Department of Mathematics Weber State University

save percentages? (Name) (University)

Box-and-Whisker Plots

MEANS, MEDIANS and OUTLIERS

MEANS, MEDIANS and OUTLIERS

Scaled vs. Original Socre Mean = 77 Median = 77.1

Year 10 Term 2 Homework

MATH 114 QUANTITATIVE REASONING PRACTICE TEST 2

North Point - Advance Placement Statistics Summer Assignment

Understanding Winter Road Conditions in Yellowstone National Park Using Cumulative Sum Control Charts

Average Runs per inning,

UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS General Certificate of Education Ordinary Level

Calculating Probabilities with the Normal distribution. David Gerard

Lab 5: Descriptive Statistics

Study Guide and Intervention

Lab 1. Adiabatic and reversible compression of a gas

SCIENTIFIC COMMITTEE SEVENTH REGULAR SESSION August 2011 Pohnpei, Federated States of Micronesia

46 Chapter 8 Statistics: An Introduction

CONCEPTUAL PHYSICS LAB

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 AUDIT TRAIL

Transcription:

STANDARD SCORES AND THE NORMAL DISTRIBUTION

REVIEW 1.MEASURES OF CENTRAL TENDENCY A.MEAN B.MEDIAN C.MODE 2.MEASURES OF DISPERSIONS OR VARIABILITY A.RANGE B.DEVIATION FROM THE MEAN C.VARIANCE D.STANDARD DEVIATION

Test Scores Suppose you (together with many other students) take tests in three subjects. On each test, the range of possible scores runs from 0 to 100 points. The table below shows your score in each of the three subjects In which subject did you do best? In which subject did you do best relative to other students? The answer to the second question obviously depends on the overall frequency distribution of score. Subject Score ENG 75 MATH 65 SCIE 72

Test Scores (cont.) Indeed, the picture looks different when we look at your scores relative to overall distribution of scores and, in particular, to its summary statistics. Let s compare each of your scores to the mean score in each subject. Now what seems to be your strongest subject? SUBJECT SCORE MEAN SCORE ENG 75 76 MATH 65 55 SCI 72 57

Your Deviations from the Mean SUBJECT SCORE MEAN SCORE DEVIATION ENG 75 76-1 MATH 65 55 +10 SCI 72 57 +15 While you got the highest score in ENGL, this score was actually slightly below (by 1 point) the mean. In fact, it is likely (but not certain) that you scored in the bottom half of all students taking the test. On the other hand, you scored well above average in both of the other subjects (10 points in Math and 15 in SCI). Note that the magnitudes that we have just referred to here are your deviations from mean in each subject. Since your deviation from the mean is greatest with respect to SCI, this may appear to be your strongest subject. But this may not be the case.

Your Deviations from the Mean Compared with Other Deviations from the Mean While you have a deviation from the mean in each subject, so does every other student who took the test. Consider the distribution of SCI scores. Almost certainly quite a few students scored close the mean, but probably quite a few others scored well above the mean (like you) and others well below. On the one hand, but if most students scored very close to the mean (so the dispersion in test scores is small), your score of 72 would make you an outlier, scoring higher than almost all other students. On the other hand, if many students scored well above the mean (and since we know that the sum of all deviations from the mean must sum to zero many other students scored well below the mean, so the dispersion of test scores is large), your score of 72, while certainly good, would be less outstanding.

Your Deviations Compared with Other Deviations (cont.) Thus whether your score is outstanding or merely good depends not just on your score compared with the mean score but also on your deviation from the mean compared with other deviations from the mean, i.e., the dispersion of scores. Recall that the standard measure of dispersion the standard deviation itself is directly based on the deviations from the mean. Recall also that the SD of scores (though precisely defined as the square root of the average of all squared deviations) is approximately the same as (though usually somewhat greater than) the average of the absolute deviations from the mean.

Your Deviations from the Mean Compared with the Standard Deviation from the Mean SUBJECT SCORE MEAN SCORE DEVIATION SD ENG 75 76-1 8 MATH 65 55 +10 5 SCI 72 57 +15 15 Thus, to get a sense of how outstanding your SCI and MATH scores are, we should look at how big your deviation from the mean is compared with the standard ( average ) deviation from the mean, by calculating the ratio of your deviation to the standard deviation. The result of this calculation is called your standard score.

Standard Scores SUBJEC T SCORE MEAN SCORE DEVIATION SD Z- SCORE ENG 75 76-1 8-0.125 MATH 65 55 +10 5 +2.0 SCI 72 57 +15 15 +1.0 So in terms of your standard score, i.e., how your deviation from the mean compares with the standard deviation from the mean, it is evident that your best performance was actually in MATH (where you scored two standard deviations above the mean), compared with POLI (where you scored only one standard deviation above the mean). In ENGL you scored 1/8 of a standard deviation below the mean.

Computing a z-score X µ z= σ or z= X X SD

Examples of computing z-scores X X X z = X SD SD 5 3 2 2 1 6 3 3 2 1.5 5 10-5 4-1.25 6 3 3 4.75 4 8-4 2-2 X X

Computing raw scores from z scores X = zσ + µ or X = zsd + X z = X X SD SD zsd X X 1 2 2 3 5-2 2-4 2-2.5 4 2 10 12-1 5-5 10 5

Other Variants of Standard Scores Approximately half of the people who take any test necessarily get negative standard scores. This unavoidable arithmetical fact apparently is regarded as demoralizing, so standard scores are commonly converted into so-called T-scores, which are all positive. By convention, T-scores are calculated by multiplying standard scores by 10 and then adding 50. IQ scores are also derived from standard scores, calculated by multiplying standard scores by 15 and then adding 100. The table below shows how you performed in the three subjects in terms of each of these scoring systems (where, as is conventional, all derived scores have been rounded to the nearest whole point). SUBJECT SCORE MEAN SCORE DEVIATION SD Z-SCORE T- SCORE IQ SCORE ENG 75 76-1 8-0.125 49 98 MATH 65 55 +10 5 +2.0 70 130 SCI 72 57 +15 15 +1.0 60 115

Review 1. Interpret a z score of 1 2. M = 10, SD = 2, X = 8. Z =? 3. M = 8, SD = 1, z = 3. X =? 4. What is the IQ score for a z score of 1? IQ scores are also derived from standard scores, calculated by multiplying standard scores by 15 and then adding 100.

REMEMBER To move from a raw score to a z score, what must we know about the raw score distribution? 1 mean and standard deviation 2 maximum and minimum 3 median and variance 4 mode and range

Application If Judy got a z score of 1.5 on an in-class exam, what can we say about her score relative to others who took the exam? 1 it is above average 2 it is average 3 it is below average 4 it is a B

Test your mastery of z If a raw score is 8, the mean is 10 and the standard deviation is 4, what is the z-score? 1: -1.0 2: -0.5 3: 0.5 4: 2.0

Test your mastery of z and the normal curve If a distribution is normally distributed, about what percent of the scores fall below +1 SD? 1: 15 2: 50 3: 85 4: 99

Your Percentile Rank in Each Subject? While it is extremely likely that your percentile rank among all students taking each test is highest in MATH and lowest in ENGL, we do not know this for sure in the absence of knowing the full frequency distribution of scores (as opposed to knowing only the two summary statistics: the mean and the SD). Much data particularly including tests scores, many other interval measures, and many types of sample statistics is (at least approximately) normally distributed. However, a lot of other data (especially ratio measures), such as weight, income (as we have seen), wealth, house prices, and many other ratio variables, is skewed with longer thin tails in the direction of (much) higher values while there is a zero-limit on the minimum value.

The Normal Distribution A normal distribution is a continuous frequency density that is a particular type of symmetric bell-shaped curve. Because the curve has a single peak and is symmetric about this peak, its mode, median, and mean values coincide at this peak. Most observed values lie relatively close (in way that is made more specific below) to the center of distribution, and their density falls off on either side of peak.

A Normal Curve

Normal Curve The shape of the distribution changes with only two parameters, σ and µ, so if we know these, we can determine everything else. Normal Curve 20 16 Frequency 12 8 4 0-4 -2 0 Score (X) 2 4

Standard Normal Curve Standard normal curve has a mean of zero and an SD of 1. Probability (Relative Frequency) 0.4 0.3 0.2 0.1 0.0 Standard Normal Curve 34.13 % 13.59% 2.15% 50 Percent -3-2 -1 0 1 2 3 Scores in standard deviations from mu

Normal Curve and the z-score If X is normally distributed, there will be a correspondence between the standard normal curve and the z-score. Probability (Relative Frequency) 0.4 0.3 0.2 0.1 0.0-3 Standard Normal Curve -1 1 3 5 7 Scores in raw score units -3-2 -1 0 1 2 3 Scores in standard deviations from mu 9

Normal curve and z-scores We can use the information from the normal curve to estimate percentages from z-scores. Probability (Relative Frequency) 0.4 0.3 0.2 0.1 0.0 Standard Normal Curve 34.13 % 13.59% 2.15% 50 Percent -3-2 -1 0 1 2 3 Scores in standard deviations from mu

The Mean and SD of the Normal Distribution The mean of a normal distribution determines its location on the horizontal scale. The mean value of the distribution (here equal to the mode) is simply the value (point on the horizontal scale) of the variable that lies under the highest point on the curve. For example, if a constant amount is added to (or subtracted from) every value of the variable, the normal curve slides upwards (or downwards) by that constant amount. The standard deviation of a normal distribution determines how spread out the distribution is. Once the horizontal scale is fixed, if the SD is small, the curve has a high peak with sharp slopes on either side; if the SD is large, the curve it has a low peak with gentle slopes on either side.

Finding the SD of a Normal Curve There is a precise connection between the shape of a normal curve and its SD. The two points of maximum steepness on either side of the peak are called the inflection points of the (normal) curve. It turns out (as a mathematical theorem) that horizontal distance from the mean to each inflection point is identical to the standard deviation of the normal curve.

Finding the SD of a Normal Curve (cont.) Here is another method for LOOKING AT the magnitude of the SD of a normal distribution. Put two vertical lines on either side of, and equidistant from, the peak and then draw them apart or bring them closer together (keeping them equidistant from the peak) until it appears that just about two-thirds of the areas under the curve lies in the interval between the two vertical lines. The horizontal distance from the mean to either line is equal to (a very good approximation of) the standard deviation of the distribution.

The 68%-95%-99.7% Rule More generally, we can state what is called the [approximate] 68%-95%-99.7% rule of the normal distribution. The rule is this: about 68% of all observed values lie within one SD of the mean, about 95% lie within two SDs of the mean, and about 99.7% (that is, virtually all) lie within three SDs of the mean. This is why no SAT scores below 200 [3 standard deviations below the mean] or above 800 [3 standard deviations above the mean] are reported. And here is another useful rule: in a normal distribution, half the cases have observed values that lie within about 2/3 of the SD of the mean, i.e., the first and third quartiles lie at just about 2/3 of a SD below and above the mean respectively, so In a normal distribution, the interquartile range is equal to about 1.33 SDs. All this is illustrated in the following chart, which shows a standardized normal curve, i.e., a normal curve in which the mean is set at 0 and the SD is set at 1. Put otherwise, the units on the horizontal scale shows standard scores.

Your Percentile Ranks (if Test Scores are Normally Distributed) Subject Stan. Score Percentile ENGL -0.125 45 MATH +2.0 97.5 SCI +1.0 84 SCI