Unit4: Inferencefornumericaldata 4. ANOVA. Sta Spring Duke University, Department of Statistical Science

Similar documents
Unit 4: Inference for numerical variables Lecture 3: ANOVA

A few things to remember about ANOVA

Analysis of Variance. Copyright 2014 Pearson Education, Inc.

Week 7 One-way ANOVA

Select Boxplot -> Multiple Y's (simple) and select all variable names.

PLANNED ORTHOGONAL CONTRASTS

MGB 203B Homework # LSD = 1 1

Announcements. Lecture 19: Inference for SLR & Transformations. Online quiz 7 - commonly missed questions

One-way ANOVA: round, narrow, wide

Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA

Factorial Analysis of Variance

Stat 139 Homework 3 Solutions, Spring 2015

Name May 3, 2007 Math Probability and Statistics

Statistical Analysis of PGA Tour Skill Rankings USGA Research and Test Center June 1, 2007

ANOVA - Implementation.

1. In a hypothesis test involving two-samples, the hypothesized difference in means must be 0. True. False

Chapter 12 Practice Test

Safety at Intersections in Oregon A Preliminary Update of Statewide Intersection Crash Rates

Section I: Multiple Choice Select the best answer for each problem.

Factorial ANOVA Problems

Legendre et al Appendices and Supplements, p. 1

Example 1: One Way ANOVA in MINITAB

One-factor ANOVA by example

Running head: DATA ANALYSIS AND INTERPRETATION 1

1 Hypothesis Testing for Comparing Population Parameters

Pressured Applied by the Emergency/Israeli Bandage

THE INTEGRATION OF THE SEA BREAM AND SEA BASS MARKET: EVIDENCE FROM GREECE AND SPAIN

One Way ANOVA (Analysis of Variance)

Chapter 7. Comparing Two Population Means. Comparing two population means. T-tests: Independent samples and paired variables.

Empirical Example II of Chapter 7

CHAPTER ANALYSIS AND INTERPRETATION Average total number of collisions for a try to be scored

Stats 2002: Probabilities for Wins and Losses of Online Gambling

Driv e accu racy. Green s in regul ation

IDENTIFYING SUBJECTIVE VALUE IN WOMEN S COLLEGE GOLF RECRUITING REGARDLESS OF SOCIO-ECONOMIC CLASS. Victoria Allred

5.1 Introduction. Learning Objectives

DOCUMENT RESUME. A Comparison of Type I Error Rates of Alpha-Max with Established Multiple Comparison Procedures. PUB DATE NOTE

Major League Baseball Offensive Production in the Designated Hitter Era (1973 Present)

Setting up group models Part 1 NITP, 2011

Chapter 9: Hypothesis Testing for Comparing Population Parameters

Universitas Sumatera Utara

Distancei = BrandAi + 2 BrandBi + 3 BrandCi + i

Therapeutic Exercise Cage for Muscle Development

1. Answer this student s question: Is a random sample of 5% of the students at my school large enough, or should I use 10%?

Class 23: Chapter 14 & Nested ANOVA NOTES: NOTES: NOTES:

Data Set 7: Bioerosion by Parrotfish Background volume of bites The question:

A Statistical Analysis of the Factors that Potentially Affect the Price of A Horse

Announcements. % College graduate vs. % Hispanic in LA. % College educated vs. % Hispanic in LA. Problem Set 10 Due Wednesday.

WATER OIL RELATIVE PERMEABILITY COMPARATIVE STUDY: STEADY VERSUS UNSTEADY STATE

Fruit Fly Exercise 1- Level 1

Guide to Computing Minitab commands used in labs (mtbcode.out)

N. Abid 2 and M. Idrissi 1 ABSTRACT

Chapter 2: ANOVA and regression. Caroline Verhoeven

arxiv:math/ v4 [math.st] 7 Mar 2006

Pairwise Comparison Models: A Two-Tiered Approach to Predicting Wins and Losses for NBA Games

Taking Your Class for a Walk, Randomly

Standardized CPUE of Indian Albacore caught by Taiwanese longliners from 1980 to 2014 with simultaneous nominal CPUE portion from observer data

Competitive Performance of Elite Olympic-Distance Triathletes: Reliability and Smallest Worthwhile Enhancement

Math SL Internal Assessment What is the relationship between free throw shooting percentage and 3 point shooting percentages?

EXPLORING MOTIVATION AND TOURIST TYPOLOGY: THE CASE OF KOREAN GOLF TOURISTS TRAVELLING IN THE ASIA PACIFIC. Jae Hak Kim

Experimental Design and Data Analysis Part 2

Navigate to the golf data folder and make it your working directory. Load the data by typing

HPW Biomechanics

Sample Final Exam MAT 128/SOC 251, Spring 2018

Results of a Suspended Solids Survey at the Whites Point Quarry, Little River, Digby County, Nova Scotia

Planning. In the following: Figure 3.1 and Table 3.1: a. Table 3.1 completed 2. Tasks Previous task Start date End date Duration Resources (humans)

A Combined Recruitment Index for Demersal Juvenile Cod in NAFO Divisions 3K and 3L

STANDARDIZED AGE SPECIFIC CATCH RATES FOR ALBACORE, Thunnus alalunga, FROM THE SPANISH TROLL FISHERY IN THE NORTHEAST ATLANTIC,

Psychology - Mr. Callaway/Mundy s Mill HS Unit Research Methods - Statistics

Statistics for Process Validation Training Master Handbook

STANDARD SCORES AND THE NORMAL DISTRIBUTION

Confidence Interval Notes Calculating Confidence Intervals

STANDARDIZED CATCH RATES OF BLUEFIN TUNA, THUNNUS THYNNUS, FROM THE ROD AND REEL/HANDLINE FISHERY OFF THE NORTHEAST UNITED STATES DURING

NAME: Math 403 Final Exam 12/10/08 15 questions, 150 points. You may use calculators and tables of z, t values on this exam.

How Effective is Change of Pace Bowling in Cricket?

COMPLETING THE RESULTS OF THE 2013 BOSTON MARATHON

INVESTIGATION OF FACTORS AFFECTING CAPACITY AT RURAL FREEWAY WORK ZONES

AP 11.1 Notes WEB.notebook March 25, 2014

Measuring Batting Performance

ASTERISK OR EXCLAMATION POINT?: Power Hitting in Major League Baseball from 1950 Through the Steroid Era. Gary Evans Stat 201B Winter, 2010

Chapter 1: Why is my evil lecturer forcing me to learn statistics?

Chapter 1: Why is my evil lecturer forcing me to learn statistics?

Analysis of recent swim performances at the 2013 FINA World Championship: Counsilman Center, Dept. Kinesiology, Indiana University

Chapter 13. Factorial ANOVA. Patrick Mair 2015 Psych Factorial ANOVA 0 / 19

Analysis of AGFC Historical Crappie Trap-Netting Data. Aaron Kern and Andy Yung Arkansas Game and Fish Commission District 6 Fisheries Camden, AR

Cambridge International Examinations Cambridge International Advanced Subsidiary and Advanced Level

UPDATED STANDARDIZED CPUE FOR ALBACORE CAUGHT BY JAPANESE LONGLINE FISHERY IN THE ATLANTIC OCEAN,

Biostatistics & SAS programming

Does the LA Galaxy Have a Home Field Advantage?

Impact of the Friesian POLLED Mutation on Milk Production Traits in Holstein Friesian

TECHNICAL REPORT STANDARD TITLE PAGE

May 11, 2005 (A) Name: SSN: Section # Instructors : A. Jain, H. Khan, K. Rappaport

Name: Statistics February 25, 2013

The Art and Science of Debt Sustainability Analysis Ugo Panizza

A Comparative Study of Running Agility, Jumping Ability and Throwing Ability among Cricket Players

THE DEVELOPMENTOF A PREDICTION MODEL OF THE PASSENGER CAR EQUIVALENT VALUES AT DIFFERENT LOCATIONS

PGA Tour Scores as a Gaussian Random Variable

MJA Rev 10/17/2011 1:53:00 PM

Effect of homegrown players on professional sports teams

Journal of Emerging Trends in Computing and Information Sciences

Modelling residential prices with cointegration techniques and automatic selection algorithms

Transcription:

Unit4: Inferencefornumericaldata 4. ANOVA Sta 101 - Spring 2016 Duke University, Department of Statistical Science Dr. Çetinkaya-Rundel Slides posted at http://bit.ly/sta101_s16

Outline 1. Housekeeping 2. Main ideas 1. Comparing many means requires care 2. ANOVA tests for some difference in means of many different groups 3. ANOVA compares between group variation to within group variation 4. To identify which means are different, use t-tests and the Bonferroni correction 3. Summary

Announcements PS 4 due Friday PA 4 due Sunday RA 5 Monday after Spring Break 1

Outline 1. Housekeeping 2. Main ideas 1. Comparing many means requires care 2. ANOVA tests for some difference in means of many different groups 3. ANOVA compares between group variation to within group variation 4. To identify which means are different, use t-tests and the Bonferroni correction 3. Summary

Outline 1. Housekeeping 2. Main ideas 1. Comparing many means requires care 2. ANOVA tests for some difference in means of many different groups 3. ANOVA compares between group variation to within group variation 4. To identify which means are different, use t-tests and the Bonferroni correction 3. Summary

NEWS FLASH! Jelly beans rumored to cause acne!!! 2

NEWS FLASH! Jelly beans rumored to cause acne!!! How would you check this rumor? Imagine that doctors can assign an acne score to patients on a 0-100 scale. What would your research question be? How would you conduct your study? What statistical test would you use? 2

NEWS FLASH! Jelly beans rumored to cause acne!!! How would you check this rumor? Imagine that doctors can assign an acne score to patients on a 0-100 scale. What would your research question be? How would you conduct your study? What statistical test would you use? 2

http://imgs.xkcd.com/comics/significant.png 3

Clicker question Suppose α = 0.05. What is the probability of making a Type 1 error and rejecting a null hypothesis like when it is actually true? H 0 : µ purple jelly bean µ placebo = 0 (a) 1% (b) 5% (c) 36% (d) 64% (e) 95% 4

Clicker question Suppose we want to test 20 different colors of jelly beans versus a placebo with hypotheses like H 0 : µ purple jelly bean µ placebo = 0 H 0 : µ brown jelly bean µ placebo = 0 H 0 : µ peach jelly bean µ placebo = 0... and we use α = 0.05 for each of these tests. What is the probability of making at least one Type 1 error in these 20 independent tests? (a) 1% (b) 5% (c) 36% (d) 64% (e) 95% 5

Clicker question Suppose we want to test 20 different colors of jelly beans versus a placebo with hypotheses like H 0 : µ purple jelly bean µ placebo = 0 H 0 : µ brown jelly bean µ placebo = 0 H 0 : µ peach jelly bean µ placebo = 0... and we use α = 0.05 for each of these tests. What is the probability of making at least one Type 1 error in these 20 independent tests? (a) 1% (b) 5% (c) 36% (d) 64% (e) 95% 5

Outline 1. Housekeeping 2. Main ideas 1. Comparing many means requires care 2. ANOVA tests for some difference in means of many different groups 3. ANOVA compares between group variation to within group variation 4. To identify which means are different, use t-tests and the Bonferroni correction 3. Summary

ANOVA tests for some difference in means of many different groups Null hypothesis: H 0 : µ placebo = µ purple = µ brown =... = µ peach = µ orange. Clicker question Which of the following is a correct statement of the alternative hypothesis? (a) For any two groups, including the placebo group, no two group means are the same. (b) For any two groups, not including the placebo group, no two group means are the same. (c) Amongst the jelly bean groups, there are at least two groups that have different group means from each other. (d) Amongst all groups, there are at least two groups that have different group means from each other. 6

Outline 1. Housekeeping 2. Main ideas 1. Comparing many means requires care 2. ANOVA tests for some difference in means of many different groups 3. ANOVA compares between group variation to within group variation 4. To identify which means are different, use t-tests and the Bonferroni correction 3. Summary

ANOVA compares between group variation to within group variation 2/ 2 7

Relatively large WITHIN group variation: little apparent difference 2/ 2 8

Relatively large BETWEEN group variation: there may be a difference 2/ 2 9

For historical reasons, we use a modification of this ratio called the F-statistic: 2 / (k 1) F = = MSG 2 / (n k) MSE k: # of groups; n: # of obs. 10

For historical reasons, we use a modification of this ratio called the F-statistic: 2 / (k 1) F = = MSG 2 / (n k) MSE k: # of groups; n: # of obs. Df Sum Sq Mean Sq F value Pr(>F) Between groups k-1 2 MSG F obs p obs Within groups n-k 2 MSE Total n-1 ( + ) 2 10

Outline 1. Housekeeping 2. Main ideas 1. Comparing many means requires care 2. ANOVA tests for some difference in means of many different groups 3. ANOVA compares between group variation to within group variation 4. To identify which means are different, use t-tests and the Bonferroni correction 3. Summary

To identify which means are different, use t-tests and the Bonferroni correction If the ANOVA yields a significant results, next natural question is: Which means are different? Use t-tests comparing each pair of means to each other, with a common variance (MSE from the ANOVA table) instead of each group s variances in the calculation of the standard error, and with a common degrees of freedom (df E from the ANOVA table) Compare resulting p-values to a modified significance level α = α K where K is the total number of pairwise tests 11

Application exercise: 4.4 ANOVA See the course webpage for details. 12

Outline 1. Housekeeping 2. Main ideas 1. Comparing many means requires care 2. ANOVA tests for some difference in means of many different groups 3. ANOVA compares between group variation to within group variation 4. To identify which means are different, use t-tests and the Bonferroni correction 3. Summary

Summary of main ideas 1. Comparing many means requires care 2. ANOVA tests for some difference in means of many different groups 3. ANOVA compares between group variation to within group variation 4. To identify which means are different, use t-tests and the Bonferroni correction 13