Select Boxplot -> Multiple Y's (simple) and select all variable names.

Similar documents
Statistical Analysis of PGA Tour Skill Rankings USGA Research and Test Center June 1, 2007

One-way ANOVA: round, narrow, wide

Analysis of Variance. Copyright 2014 Pearson Education, Inc.

ANOVA - Implementation.

Example 1: One Way ANOVA in MINITAB

Unit4: Inferencefornumericaldata 4. ANOVA. Sta Spring Duke University, Department of Statistical Science

Week 7 One-way ANOVA

Unit 4: Inference for numerical variables Lecture 3: ANOVA

Name May 3, 2007 Math Probability and Statistics

Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA

Stat 139 Homework 3 Solutions, Spring 2015

Chapter 7. Comparing Two Population Means. Comparing two population means. T-tests: Independent samples and paired variables.

Descriptive Statistics Project Is there a home field advantage in major league baseball?

Quantitative Literacy: Thinking Between the Lines

MGB 203B Homework # LSD = 1 1

Stats 2002: Probabilities for Wins and Losses of Online Gambling

A few things to remember about ANOVA

Biostatistics & SAS programming

PLANNED ORTHOGONAL CONTRASTS

A Statistical Analysis of the Factors that Potentially Affect the Price of A Horse

Announcements. Lecture 19: Inference for SLR & Transformations. Online quiz 7 - commonly missed questions

Factorial ANOVA Problems

Class 23: Chapter 14 & Nested ANOVA NOTES: NOTES: NOTES:

MTB 02 Intermediate Minitab

Standard Errors in the U.S. Regional Price Parities (RPPs)

Running head: DATA ANALYSIS AND INTERPRETATION 1

Guide to Computing Minitab commands used in labs (mtbcode.out)

1. In a hypothesis test involving two-samples, the hypothesized difference in means must be 0. True. False

One-factor ANOVA by example

Political Science 30: Political Inquiry Section 5

Driv e accu racy. Green s in regul ation

Warm-up. Make a bar graph to display these data. What additional information do you need to make a pie chart?

Lesson 5 Post-Visit Do Big League Salaries Equal Big Wins?

Lesson 2 Pre-Visit Big Business of the Big Leagues

Data Set 7: Bioerosion by Parrotfish Background volume of bites The question:

NCSS Statistical Software

ISDS 4141 Sample Data Mining Work. Tool Used: SAS Enterprise Guide

Sample Final Exam MAT 128/SOC 251, Spring 2018

Safety at Intersections in Oregon A Preliminary Update of Statewide Intersection Crash Rates

Major League Baseball Offensive Production in the Designated Hitter Era (1973 Present)

Factorial Analysis of Variance

Lesson 3 Pre-Visit Teams & Players by the Numbers

Using GIS and CTPP Data for Transit Ridership Forecasting in Central Florida

Math 230 Exam 1 Name October 2, 2002

NBA TEAM SYNERGY RESEARCH REPORT 1

Section I: Multiple Choice Select the best answer for each problem.

Probability & Statistics - Solutions

Announcements. % College graduate vs. % Hispanic in LA. % College educated vs. % Hispanic in LA. Problem Set 10 Due Wednesday.

1wsSMAM 319 Some Examples of Graphical Display of Data

Setting up group models Part 1 NITP, 2011

Preview. Second midterm Tables in your paper Mass Transit as alternative to auto California s problems in urban transportation

EXST7015: Salaries of all American league baseball players (1994) Salaries in thousands of dollars RAW DATA LISTING

Experimental Design and Data Analysis Part 2

Laboratory Activity Measurement and Density. Average deviation = Sum of absolute values of all deviations Number of trials

Chapter 12 Practice Test

Unit 6 Day 2 Notes Central Tendency from a Histogram; Box Plots

the 54th Annual Conference of the Association of Collegiate School of Planning (ACSP) in Philadelphia, Pennsylvania November 2 nd, 2014

NUMB3RS Activity: Choosing Contenders. Episode: Contenders

Preview. Tables in your paper Mass Transit as alternative to auto California s problems in urban transportation

CHAPTER 1 ORGANIZATION OF DATA SETS

Design of Experiments Example: A Two-Way Split-Plot Experiment

Advanced Metrics Matchup Guide

DOCUMENT RESUME. A Comparison of Type I Error Rates of Alpha-Max with Established Multiple Comparison Procedures. PUB DATE NOTE

Year 10 Term 2 Homework

Reminders. Homework scores will be up by tomorrow morning. Please me and the TAs with any grading questions by tomorrow at 5pm

STT 315 Section /19/2014

POTENTIAL ENERGY BOUNCE BALL LAB

Does the LA Galaxy Have a Home Field Advantage?

Stats in Algebra, Oh My!

Bivariate Data. Frequency Table Line Plot Box and Whisker Plot

STANDARDIZED AGE SPECIFIC CATCH RATES FOR ALBACORE, Thunnus alalunga, FROM THE SPANISH TROLL FISHERY IN THE NORTHEAST ATLANTIC,

CHAPTER 1 Exploring Data

STA 103: Midterm I. Print clearly on this exam. Only correct solutions that can be read will be given credit.

Factors that affect the motion of a vehicle along a surface

March Madness Basketball Tournament

y ) s x x )(y i (x i r = 1 n 1 s y Statistics Lecture 7 Exploring Data , y 2 ,y n (x 1 ),,(x n ),(x 2 ,y 1 How two variables vary together

In my left hand I hold 15 Argentine pesos. In my right, I hold 100 Chilean

(c) The hospital decided to collect the data from the first 50 patients admitted on July 4, 2010.

May 11, 2005 (A) Name: SSN: Section # Instructors : A. Jain, H. Khan, K. Rappaport

SFMTA Annual Parking Rates & Policies Survey

Empirical Example II of Chapter 7

Case Processing Summary. Cases Valid Missing Total N Percent N Percent N Percent % 0 0.0% % % 0 0.0%

Differentiated Instruction & Understanding By Design Lesson Plan Format

CAL Guard Fuel Running Estimate

Name Date Period. E) Lowest score: 67, mean: 104, median: 112, range: 83, IQR: 102, Q1: 46, SD: 17

Full file at

b) (2 pts.) Does the study show that drinking 4 or more cups of coffee a day caused the higher death rate?

Math 146 Statistics for the Health Sciences Additional Exercises on Chapter 2

Legendre et al Appendices and Supplements, p. 1

An Empirical Comparison of Regression Analysis Strategies with Discrete Ordinal Variables

C R I TFC. Columbia River Inter-Tribal Fish Commission

Chapter 13. Factorial ANOVA. Patrick Mair 2015 Psych Factorial ANOVA 0 / 19

Lesson 14: Modeling Relationships with a Line

Fundamental Certainty

BIOL 101L: Principles of Biology Laboratory

Assignment. To New Heights! Variance in Subjective and Random Samples. Use the table to answer Questions 2 through 7.

United States Commercial Vertical Line Vessel Standardized Catch Rates of Red Grouper in the US South Atlantic,

March Madness Basketball Tournament

Distancei = BrandAi + 2 BrandBi + 3 BrandCi + i

Addendum to SEDAR16-DW-22

Transcription:

One Factor ANOVA in Minitab As an example, we will use the data below. A study looked at the days spent in the hospital for different regions of the United States. Can the company reject the claim the mean number of days patients spend in the hospital is the same for all hour regions? Assume a = 0.05. Data can be found on the blog in anova.mtw. Data Entry Data should be in columns with level labels in each column. Multiple Box Plots Select Boxplot -> Multiple Y's (simple) and select all variable names.

Null and Alternative Hypothesis Ho: ne= mw= s= w Ha: At least one pair not equal ANOVA Table Stat -> ANOVA -> One-Way (unstacked) Conclusion There is enough evidence (F=4.98, num df = 3, den df = 29, p=0.007) to suggest that there is a difference among the regions in terms of the average number of days spent in the hospital.

Tukey Tests If the ANOVA is significant (p<.05), then return to the ANOVA (unstacked) dialog box and select Comparisons... Note, this value must equal (no decimals). Tukey Summary Northeast 7.444 Midwest 5.778 West 5.000 South 4.714

STAT 200 ANOVA Homework/Lab: One Factor Analysis of Variance Due Friday, November 20th You may work in groups, but each person is to hand in a homework assignment. Please hand in your own work, as identical homeworks will have the grade split between those working on it. For problems 1-4, state the null and alternative hypotheses. Also fill in the blanks in the ANOVA tables and state your conclusion. 1. The prices (in dollars) for 16 randomly selected automobile batteries were determined. The prices were split into three groups based on battery type. At = 0.05, can you conclude that a difference exists among battery types? Df Sum Sq Mean Sq F value Pr(>F) Size 513.42 0.1564 Error Total 2067.75 2. The following table represents the ANOVA analysis for the price per gallon for three types of exterior deck treatments. At = 0.01, can you determine if there is a difference in price based on treatment type? Df Sum Sq Mean Sq F value Pr(>F) Type 1307.92 0.001 Error 496.08 Total 14

3. From four regions across the United States, 27 school districts were sampled to determine the annual amount spent on reading in grades K-6. With = 0.05, is there a difference among spending in the four regions? Df Sum Sq Mean Sq F value Pr(>F) Region 0.215 Error 95857.14 Total 115942.23 4. The following ANOVA table shows the results for five age groups to see if there are differences in credit card balances. At = 0.05, can you determine if any age groups have different credit card balances? Df Sum Sq Mean Sq F value Pr(>F) Age 0.006 Error 23 245079.67 Total 1767327.86

5. The following is the output from a Tukey test after it was determined that the mean fuel mileage for five types of vehicles are not equal. Create a line plot/means plot to summarize the results. Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev ---------+---------+---------+---------+ Small Sedan 5 43.600 5.128 (----*-----) Medium Sedan 6 54.000 7.127 (----*----) Large Sedan 6 69.500 6.656 (----*----) 4WD SUV 4 73.500 8.660 (-----*------) Minivan 5 61.400 9.711 (-----*-----) ---------+---------+---------+---------+ 48 60 72 84 Tukey 95% Simultaneous Confidence Intervals All Pairwise Comparisons Small Sedan subtracted from: Lower Center Upper -------+---------+---------+---------+-- Medium Sedan -3.129 10.400 23.929 (------*------) Large Sedan 12.371 25.900 39.429 (------*------) 4WD SUV 14.912 29.900 44.888 (-------*------) Minivan 3.669 17.800 31.931 (------*------) -------+---------+---------+---------+-- -20 0 20 40 Medium Sedan subtracted from: Lower Center Upper -------+---------+---------+---------+-- Large Sedan 2.601 15.500 28.399 (------*-----) 4WD SUV 5.078 19.500 33.922 (------*------) Minivan -6.129 7.400 20.929 (------*-----) -------+---------+---------+---------+-- -20 0 20 40 Large Sedan subtracted from: Lower Center Upper -------+---------+---------+---------+-- 4WD SUV -10.422 4.000 18.422 (------*------) Minivan -21.629-8.100 5.429 (------*------) -------+---------+---------+---------+-- -20 0 20 40 4WD SUV subtracted from: Lower Center Upper -------+---------+---------+---------+-- Minivan -27.088-12.100 2.888 (-------*------) -------+---------+---------+---------+-- -20 0 20 40

6. The following is the output for an ANOVA in which it has been determined that there is a difference in the mean income between the six cities listed below. Create a line plot/means plot to summarize the results. Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev ---+---------+---------+---------+------ Chicago 6 44345 6037 (----*-----) Dallas 7 44017 9223 (----*----) Miami 7 42386 4571 (----*----) Denver 5 46331 4320 (-----*-----) San Diego 5 56470 6880 (-----*------) Seattle 6 66318 6724 (----*-----) ---+---------+---------+---------+------ 40000 50000 60000 70000 Tukey 95% Simultaneous Confidence Intervals All Pairwise Comparisons Chicago subtracted from: Dallas -11498-328 10842 (-----*----) Miami -13129-1959 9211 (-----*-----) Denver -10171 1986 14143 (-----*-----) San Diego -32 12125 24282 (-----*-----) Seattle 10382 21973 33565 (-----*-----) Dallas subtracted from: Miami -12363-1631 9100 (----*-----) Denver -9442 2314 14070 (-----*-----) San Diego 697 12453 24209 (-----*-----) Seattle 11131 22301 33471 (----*-----) Miami subtracted from: Denver -7811 3945 15701 (-----*-----) San Diego 2328 14084 25840 (-----*-----) Seattle 12763 23933 35102 (-----*-----) Denver subtracted from: San Diego -2559 10139 22837 (-----*-----) Seattle 7830 19987 32145 (-----*-----) San Diego subtracted from: Seattle -2309 9848 22006 (-----*-----)

For each of the following questions, answer them as demonstrated in lab. You will need to complete the following steps for each question: a. Create multiple box plots of the data. Make sure you include units. b. State the null and alternative hypothesis in the context of the problem. c. The ANOVA table. d. The conclusion statement in terms of the problem. Use the level of alpha given in the book. e. If the ANOVA shows significant differences, conduct a Tukey test and summarize the results using the line plot method described in class and found in the Minitab Output. f. Using the results of the analysis, answer the extra question written for each problem. Make sure your answer uses all the available information, not just the means. From Chapter 10.4 starting on page 565. 7. Question 6 Which battery size is cheaper, group size 35 or group size 65? 8. Question 10 Which type or types of cars have the lowest cost per mile, if any? 9. Question 11 Do people in the Northeast have a lower well-being index than people in the West? 10. Question 14 Which city has higher housing prices, Tampa or Orlando?