MEANS, MEDIANS and OUTLIERS

ESSENTIAL MATHEMATICS 2
WEEK 3 NOTES AND EXERCISES

MEANS, MEDIANS and OUTLIERS

An outlier is a score much larger (or smaller) than the other scores in the data set. Outliers can have a dramatic effect on the mean. The median and mode are usually unaffected.

Example:
Set A: 3, 4, 5, 5, 6, 7     mean (x̄) = 5, mode = 5, median = 5
Set B: 3, 4, 5, 5, 6, 20    mean (x̄) ≈ 7.2, mode = 5, median = 5

In the second set the median and mode are more useful than the mean, because the single outlier (20) pulls the mean above every other score in the set.

Exercise Set 1

Q1. This data shows the ages of the Binns and Thompson families.
a) Calculate the mean age of each family. Answer correct to 1 decimal place.
b) What is the main difference between these two sets of data?
c) What effect does the difference identified in b) have on the mean?

Q2. Eleven houses were sold in Keswick Street over the last two years. The selling prices are listed below.
a) Find the median sale price for the houses.
b) Find the mean sale price.

c) Which measure best describes the price of houses in Keswick Street? Justify your answer.
d) Which price is the outlier in this data?
e) Calculate the mean of the remaining prices when this outlier is removed. Is this mean closer to the median found in part a)?

Q3. Mark and Steve's batting scores for six innings of cricket are shown below.
a) Calculate the mean score for each player (to 1 decimal place).
b) Which player is better if you use the mean?
c) Find the median score for each player.
d) Which player is better if you use the median?
e) Which player would you rather have in your team? Justify your answer.

Q4. A property developer has 40 new apartments for sale. The 20 apartments on the first 5 floors are priced at $330 000 each. The next 8 apartments on floors 6 and 7 are priced at $380 000 each, and the next 8 apartments on floors 8 and 9 are priced at $425 000 each. The three apartments on the tenth floor are $835 000 each and the penthouse apartment on the top floor is priced at $1.7 million.
a) Determine the median price for the apartments.
b) Calculate the mean price for the apartments.
c) When advertising the apartments for sale, which average would the developer use? Explain.
d) The developer will be speaking to potential investors in his company. What average might he use to make his company look profitable? Explain.
e) Which price is the outlier?
f) Calculate the mean after removing the outlier.
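The Set A and Set B example at the start of this section can be checked with a few lines of Python. This is only a sketch using the standard library's statistics module; the helper name summarise is made up for illustration and is not part of the worksheet.

```python
from statistics import mean, median, mode

# The two data sets from the worked example.
set_a = [3, 4, 5, 5, 6, 7]
set_b = [3, 4, 5, 5, 6, 20]  # same as Set A, but 7 is replaced by the outlier 20


def summarise(scores):
    """Hypothetical helper: return (mean to 1 d.p., median, mode)."""
    return round(mean(scores), 1), median(scores), mode(scores)


summary_a = summarise(set_a)
summary_b = summarise(set_b)

print("Set A (mean, median, mode):", summary_a)
print("Set B (mean, median, mode):", summary_b)
```

Only the first number changes between the two sets: the outlier moves the mean, while the median and mode stay at 5.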

QUARTILES, RANGE and INTERQUARTILE RANGE

Remember, the range is found by: highest score − lowest score. The range can be affected by extreme values (outliers).

Example: 3, 4, 5, 6, 7, 20
Range = 20 − 3 = 17
Most scores, however, lie between 3 and 7, so a more appropriate range is 7 − 3 = 4.

Interquartile Range

One way to overcome the problem of extreme values is to exclude the top and bottom quarter of scores and consider the range of the remaining scores.

[Diagram: the ordered scores from lowest to highest, marked at 1/4 (Q1), the median (Q2) and 3/4 (Q3); the interquartile range runs from Q1 to Q3.]

To find the interquartile range the following steps are used.

Data: 12, 9, 4, 6, 5, 8, 9, 4, 10, 2

1. Arrange in order: 2, 4, 4, 5, 6, 8, 9, 9, 10, 12
2. Find the median (middle score). As there are 10 scores the median is between the 5th and 6th scores. The median is 7, i.e. halfway between 6 and 8.
3. Find the middle of the bottom half of the scores: 2, 4, 4, 5, 6. So Q1 = 4.
4. Find the middle of the top half: 8, 9, 9, 10, 12. So Q3 = 9.
5. Interquartile range is Q3 − Q1, i.e. 9 − 4 = 5.

The final diagram becomes:

2  4  [4]  5  6  |  8  9  [9]  10  12
Q1 = 4, Q2 (median) = 7, Q3 = 9

We can interpret this as follows.
25% of the scores are between 2 and 4.
50% of the scores are between 4 and 9, i.e. 50% of scores lie in the interquartile range.
25% of the scores are between 9 and 12.

Example

So, 50% of the ages lie between 18 and 33.

Exercise Set 2

Q1. Sue and Jason work in a fast food shop. The number of hamburgers they sold each day between 12 noon and 2 pm in the month of July is recorded below.
For this data, find:
a) the range (first arrange the data in ascending order)
b) each of the quartiles
c) the interquartile range.

Q2. The following data shows the daily maximum temperatures (in °C) for 15 days in Cairns in July.
32, 30, 31, 32, 31, 30, 31, 31, 31, 31, 29, 25, 28, 27, 29
For this data, find:
a) the range
b) each of the quartiles
c) the interquartile range.

Q3. For the data shown in the stem and leaf plot find the range, median and interquartile range.
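The five steps for finding the interquartile range, as used on the data 12, 9, 4, 6, 5, 8, 9, 4, 10, 2 earlier in this section, can be sketched in Python as follows. The function name quartiles is made up for illustration. One assumption goes beyond the notes: when there is an odd number of scores, this sketch leaves the median out of both halves. Other conventions exist and can give slightly different quartiles.

```python
from statistics import median


def quartiles(scores):
    """Hypothetical helper: return (Q1, Q2, Q3) using the halving method."""
    ordered = sorted(scores)          # Step 1: arrange in order
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]            # bottom half of the scores
    # Assumption: for an odd number of scores the middle score is
    # excluded from both halves.
    upper = ordered[half:] if n % 2 == 0 else ordered[half + 1:]
    q2 = median(ordered)              # Step 2: median of all scores
    q1 = median(lower)                # Step 3: middle of the bottom half
    q3 = median(upper)                # Step 4: middle of the top half
    return q1, q2, q3


data = [12, 9, 4, 6, 5, 8, 9, 4, 10, 2]
q1, q2, q3 = quartiles(data)
iqr = q3 - q1                         # Step 5: interquartile range
data_range = max(data) - min(data)    # highest score minus lowest score

print("Q1 =", q1, " Q2 =", q2, " Q3 =", q3)
print("Interquartile range =", iqr)
print("Range =", data_range)
```

For this data the sketch agrees with the worked steps: Q1 = 4, median = 7, Q3 = 9 and an interquartile range of 5.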

DECILES AND PERCENTILES

Another way of dividing data into groups is to divide it into deciles or percentiles. The data must still be arranged in ascending order. Deciles and percentiles are usually only used with large sets of data.

Deciles divide the data into 10 equal groups.
Percentiles divide the data into 100 equal groups.

D1 cuts off the lowest 10% of scores.
D4 cuts off the lowest 40% of scores.
D9 cuts off the lowest 90% of scores, or the top 10% of scores.

Example
The lengths (in centimetres) of 20 new-born infants at a hospital were recorded. Place the values in order first.

a) What is the 3rd decile for this data?
With 20 values, the lowest 30% is the lowest 6 values, so D3 lies between the 6th and 7th values. As D3 lies between 48 and 49 cm, the value of D3 is 48.5 cm. Thus 30% of the babies are less than 48.5 cm long.

b) What is the 5th decile for this data?
As D5 is between 50 and 50, its value is 50 cm. Thus 50% of the infants are shorter than 50 cm, or 50% of the infants are longer than 50 cm.

c) What is another name for the 5th decile?
As the 5th decile is right in the middle, it is also the median.

d) Find the value that separates the bottom 70% from the top 30% of the infants (in terms of length).
As D7 is between 51 and 52 cm, 51.5 cm separates the bottom 70% from the top 30% of infants.

e) If the length of new-born baby James is in the top 10% of infant lengths, what value must it be greater than?
James's length must be greater than D9, i.e. 54 cm.
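The method used in the example (count off the lowest 10%, 20%, ... of the ordered scores and take the value halfway between the last score counted and the next one) can be sketched in Python. The function names percentile and decile are made up for illustration, and the sample data is simply the whole numbers 1 to 20, not the infant lengths. The sketch assumes the cut falls exactly between two scores, as it does for every decile of a 20-value data set.

```python
from statistics import median


def percentile(scores, p):
    """Hypothetical helper: value that cuts off the lowest p% of the scores."""
    if not 0 < p < 100:
        raise ValueError("p must be between 0 and 100")
    ordered = sorted(scores)          # the data must be in ascending order
    n = len(ordered)
    if (n * p) % 100 != 0:
        # Assumption: this sketch only handles cuts between two scores.
        raise ValueError("the cut does not fall between two scores")
    count_below = n * p // 100        # how many scores lie below the cut
    # Halfway between the last score below the cut and the first above it.
    return (ordered[count_below - 1] + ordered[count_below]) / 2


def decile(scores, k):
    """Hypothetical helper: the k-th decile is the (10 * k)-th percentile."""
    return percentile(scores, 10 * k)


sample = list(range(1, 21))           # made-up data: 1, 2, ..., 20
d3 = decile(sample, 3)                # between the 6th and 7th values
d5 = decile(sample, 5)                # between the 10th and 11th values
d9 = decile(sample, 9)                # between the 18th and 19th values

print("D3 =", d3, " D5 =", d5, " D9 =", d9)
print("D5 equals the median:", d5 == median(sample))
```

Because decile is written in terms of percentile, the sketch also shows why a decile and the matching percentile (for example D3 and the 30th percentile) are the same value.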

Percentiles

P24 cuts off the lowest 24% of scores.
P60 cuts off the lowest 60% of scores. Note that P60 is the same as D6.
P87 cuts off the lowest 87% of scores, or the top 13% of scores.

Exercise Set 3

Q1. The percentage scores of a class of 30 students in a science test are shown.
a) What is the 8th decile?
b) What is the 3rd decile?
c) What is the 40th percentile?
d) Find the value that cuts off the lower 20% of scores from the upper 80%.
e) What percentage of students scored higher than 79?

Q2. The information below is based on weather records kept by the Bureau of Meteorology. They are the maximum daily temperatures in November for Newcastle, NSW.
The mean is 23.5 °C.
The highest temperature on record was 32 °C (on 30th November 1968).
The lowest temperature on record was 15.6 °C (on 1st November 1986).
The 1st decile D1 = 18.9 °C.
The 9th decile D9 = 28.6 °C.
a) What is the range of temperatures?
b) What percentage of temperatures was higher than 28.6 °C?
c) What value would you expect the median to be close to (but not necessarily equal to)? Hint: mode, mean, range.
d) What is the size of the 9th decile band (the difference between the highest temperature and the 9th decile)?

Q3. The table shows the percentiles for the heights (in cm) of girls aged 2 to 5 years, according to the child growth standards of the World Health Organisation (WHO).
a) What is the median height of a 4-year-old girl?
b) Libby is aged 2.5 and is 88.3 cm tall. Is she tall for her age? What percentage of girls her age are shorter than her?
c) What is Libby's expected height when she turns 5 years old?
d) Only 15% of girls of Renee's age are taller than her. How tall is she if she is 3.5 years old?
e) Mikayla is 2 years old and 93.2 cm tall. Is she short for her age? What percentage of girls her age are taller than her?