Name May 3, 2007 Math Probability and Statistics

Similar documents
Analysis of Variance. Copyright 2014 Pearson Education, Inc.

Stat 139 Homework 3 Solutions, Spring 2015

Section I: Multiple Choice Select the best answer for each problem.

Chapter 12 Practice Test

Statistical Analysis of PGA Tour Skill Rankings USGA Research and Test Center June 1, 2007

Distancei = BrandAi + 2 BrandBi + 3 BrandCi + i

Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA

Select Boxplot -> Multiple Y's (simple) and select all variable names.

Announcements. Lecture 19: Inference for SLR & Transformations. Online quiz 7 - commonly missed questions

Unit 4: Inference for numerical variables Lecture 3: ANOVA

Unit4: Inferencefornumericaldata 4. ANOVA. Sta Spring Duke University, Department of Statistical Science

Week 7 One-way ANOVA

Legendre et al Appendices and Supplements, p. 1

Confidence Interval Notes Calculating Confidence Intervals

Running head: DATA ANALYSIS AND INTERPRETATION 1

A few things to remember about ANOVA

Factorial Analysis of Variance

5.1 Introduction. Learning Objectives

One-factor ANOVA by example

1. In a hypothesis test involving two-samples, the hypothesized difference in means must be 0. True. False

Probability & Statistics - Solutions

NAME: Math 403 Final Exam 12/10/08 15 questions, 150 points. You may use calculators and tables of z, t values on this exam.

May 11, 2005 (A) Name: SSN: Section # Instructors : A. Jain, H. Khan, K. Rappaport

Announcements. % College graduate vs. % Hispanic in LA. % College educated vs. % Hispanic in LA. Problem Set 10 Due Wednesday.

ISyE 6414 Regression Analysis

Math 146 Statistics for the Health Sciences Additional Exercises on Chapter 2

The Animated Eyes Symbol as Part of the WALK Signal: An Examination of the Generality of its Effectiveness Across a Variety of

An Empirical Comparison of Regression Analysis Strategies with Discrete Ordinal Variables

The Reliability of Intrinsic Batted Ball Statistics Appendix

Taking Your Class for a Walk, Randomly

Midterm Exam 1, section 2. Thursday, September hour, 15 minutes

Is lung capacity affected by smoking, sport, height or gender. Table of contents

1. Answer this student s question: Is a random sample of 5% of the students at my school large enough, or should I use 10%?

Data Set 7: Bioerosion by Parrotfish Background volume of bites The question:

Chapter 2: ANOVA and regression. Caroline Verhoeven

y ) s x x )(y i (x i r = 1 n 1 s y Statistics Lecture 7 Exploring Data , y 2 ,y n (x 1 ),,(x n ),(x 2 ,y 1 How two variables vary together

Math 121 Test Questions Spring 2010 Chapters 13 and 14

Quantitative Methods for Economics Tutorial 6. Katherine Eyal

One-way ANOVA: round, narrow, wide

Instrumental Variables

(JUN10SS0501) General Certificate of Education Advanced Level Examination June Unit Statistics TOTAL.

Setting up group models Part 1 NITP, 2011

Driv e accu racy. Green s in regul ation

Journal of Human Sport and Exercise E-ISSN: Universidad de Alicante España

1 Hypothesis Testing for Comparing Population Parameters

Navigate to the golf data folder and make it your working directory. Load the data by typing

COMPLETING THE RESULTS OF THE 2013 BOSTON MARATHON

Real-Time Electricity Pricing

ANOVA - Implementation.

Lab 11: Introduction to Linear Regression

Political Science 30: Political Inquiry Section 5

Sample Final Exam MAT 128/SOC 251, Spring 2018

Exploring Measures of Central Tendency (mean, median and mode) Exploring range as a measure of dispersion

AP Statistics Midterm Exam 2 hours

The Simple Linear Regression Model ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD

CHAPTER ANALYSIS AND INTERPRETATION Average total number of collisions for a try to be scored

Using Actual Betting Percentages to Analyze Sportsbook Behavior: The Canadian and Arena Football Leagues

PLEASE MARK YOUR ANSWERS WITH AN X, not a circle! 2. (a) (b) (c) (d) (e) (a) (b) (c) (d) (e) (a) (b) (c) (d) (e)...

1. What function relating the variables best describes this situation? 3. How high was the balloon 5 minutes before it was sighted?

Math SL Internal Assessment What is the relationship between free throw shooting percentage and 3 point shooting percentages?

a) List and define all assumptions for multiple OLS regression. These are all listed in section 6.5

The Normal Distribution, Margin of Error, and Hypothesis Testing. Additional Resources

INTERSECTION SAFETY RESEARCH IN CRACOW UNIVERSITY OF TECHNOLOGY

The table below shows how the thinking distance and braking distance vary with speed. Thinking distance in m

Biostatistics & SAS programming

Economic Value of Celebrity Endorsements:

Policy Management: How data and information impacts the ability to make policy decisions:

Title: AJAE Appendix for Measuring Benefits from a Marketing Cooperative in the Copper

Field and Analytical Investigation of Accidents Data on the Egyptian Road Network

The Powerball Regressivity: An Evidence from the World s Largest Lottery Prize

PGA Tour Scores as a Gaussian Random Variable

PLANNED ORTHOGONAL CONTRASTS

Empirical Example II of Chapter 7

Lesson 14: Modeling Relationships with a Line

1 Introduction. 2 EAD and Derived Factors

Why so blue? The determinants of color pattern in killifish, Part II Featured scientist: Becky Fuller from The University of Illinois

Minimum Mean-Square Error (MMSE) and Linear MMSE (LMMSE) Estimation

PLEASE MARK YOUR ANSWERS WITH AN X, not a circle! 1. (a) (b) (c) (d) (e) 2. (a) (b) (c) (d) (e) (a) (b) (c) (d) (e) 4. (a) (b) (c) (d) (e)...

An investigation of the variability of start-up lost times and departure headways at signalized intersections in urban areas

Exploring Factors Affecting Metrorail Ridership in Washington D.C.

MGB 203B Homework # LSD = 1 1

Statistics Unit Statistics 1A

ESP 178 Applied Research Methods. 2/26/16 Class Exercise: Quantitative Analysis

On the association of inrun velocity and jumping width in ski. jumping

Chapter 9: Hypothesis Testing for Comparing Population Parameters

The Effect of Newspaper Entry and Exit on Electoral Politics Matthew Gentzkow, Jesse M. Shapiro, and Michael Sinkinson Web Appendix

Title: Modeling Crossing Behavior of Drivers and Pedestrians at Uncontrolled Intersections and Mid-block Crossings

STT 315 Section /19/2014

Robust specification testing in regression: the FRESET test and autocorrelated disturbances

Discussion: Illusions of Sparsity by Giorgio Primiceri

Safety at Intersections in Oregon A Preliminary Update of Statewide Intersection Crash Rates

Development of Professional Driver Adjustment Factors for the Capacity Analysis of Signalized Intersections

Traffic Safety Barriers to Walking and Bicycling Analysis of CA Add-On Responses to the 2009 NHTS

the 54th Annual Conference of the Association of Collegiate School of Planning (ACSP) in Philadelphia, Pennsylvania November 2 nd, 2014

Lecture 16: Chapter 7, Section 2 Binomial Random Variables

What Causes the Favorite-Longshot Bias? Further Evidence from Tennis

See if you can determine what the following magnified photos are. Number your paper to 5.

Contingent Valuation Methods

Major League Baseball Offensive Production in the Designated Hitter Era (1973 Present)

ANALYSIS OF THE REGIONAL DISTRIBUTION, PATTERNS AND FORECAST OF ROAD TRAFFIC FATALITIES IN GHANA

Transcription:

Name May 3, 2007 Math 341 - Probability and Statistics Long Exam IV Instructions: Please include all relevant work to get full credit. Encircle your final answers. 1. An article in Professional Geographer (Feb. 2000) investigated whether there has been a significant suburban-to-urban shift in the location of major sport facilities. In 1985, 40% of all major sport facilities were located downtown, 30% in central city, and 30% in suburban areas. To check if these proportions remain at this levels, 113 major sports franchises that existed in 1997 were randomly selected and of the 113, 58 were built downtown, 26 in a central city, and 29 in a suburban area. a. Formulate the appropriate null and alternative hypotheses for the test to determine whether the proportions of major sports facilities in downtown, central city, and suburban areas in 1997 are the same as in 1985. [2] b. Using α = 0.05, define your rejection rule. [3] c. If the null hypothesis, in part (a), is true, how many of the 113 sports facilities in 1997 would you expected to be located in downtown, central city, and suburban areas, respectively? [3] d. Compute the observed value of the appropriate test statistic. [4] e. Do you reject the null hypothesis? Write a practical conclusion. [3] 2. Clinical Test of Lipitor. The cholesterol-reducing drug Lipitor consists of atorvastatin calcium, and results summarizing headaches as an adverse reaction in clinical tests are given in the table (based on data from Parke-Davis). Using a 0.01 level of significance, test the claim that getting a headache is independent of the amount of atorvastatin used as a treatment.

Placebo 10 mg 20/40 mg 80 mg Total Headache 19 47 8 6 No Headache 251 816 107 88 Headache Total a. Formulate the appropriate null and alternative hypotheses for this problem. [2] b. Using α = 0.01, define your rejection rule. [3] c. Compute the observed value of the appropriate test statistic. [8] d. Do you reject the null hypothesis? Write a practical conclusion. [3] 3. The USGA wants to compare the mean distances associated with four different brands of golf balls when struck with a driver. A completely randomized design is employed, with Iron Byron, the USGA s robotic golfer, using a driver to hit a random sample of 10 balls of each brand in a random sequence. The distances were recorded and a summary of the results is given below. Brand n Average ( x) Sample Variance (s 2 ) A 10 250.78 22.42 B 10 261.03 14.95 C 10 269.95 20.26 D 10 249.32 27.07 a. Formulate the most appropriate null and alternative hypotheses for this problem. Define clearly the parameters you used in the null hypothesis. [3]

b. Construct the ANOVA Table. [10] c. Using α = 0.01, define your rejection rule. [2] d. Write a practical conclusion in the context of this problem. [3] e. Use the Tukey procedure to construct all relevant confidence intervals in order to determine which brands of golf balls are statistically different. Use a family (or overall) error rate of α = 0.05. Draw the appropriate diagram. [7] f. What are the assumptions for the ANOVA procedure? [3]

4. In the state of Florida, elementary school performance is based on the average score obtained by students on a standardized exam, called the Florida Comprehensive Assessment Test (FCAT). An analysis of the link between FCAT scores and socio-demographic factors was published in the Journal of Educational and Behavioral Statistics (Spring 2004). Part of the data on average math and reading FCAT scores of third-graders, as well as the percentage of students below the poverty level, for a sample of 22 Florida elementary schools are listed in the table. Elementary School 1 2 3 21 22 FCAT-Math (y) 166.4 159.6 159.1 182.8 186.1 FCAT-Reading 165.0 157.2 164.4 181.6 183.8 % Below Poverty (x) 91.7 90.2 86.0 26.5 13.8 x = 1292.7 x 2 = 88, 668.43 s x = 24.6 y = 3781.1 y 2 = 651, 612.45 s y = 9.16 n = 22 xy = 218, 291.63 a. Compute the value of S xx, S yy, S xy, and r. [8] b. Would you recommend using simple linear regression to model the relationship FCAT-Math and the percentage of students below the poverty level? Explain your answer. [2] c. Determine the regression line. [5] d. If another Florida elementary school has 50% of its students below the poverty level, predict the average FCAT-math score of this school. [2] e. Determine the coefficient of determination r 2. Explain the meaning of this quantity in the context of this problem. [2]

f. Using α = 0.01, test H 0 : b = 0 vs. H 1 : b 0 [8] g. Construct and interpret a 99% confidence interval for b. [4] h. Find a 95% confidence interval for the mean response µ Y x =50% [6] 5. In the simple linear model, Y i = β 0 + β 1 x i + ɛ i, where ɛ i are independent and identically distributed with mean E(ɛ i ) = 0 and variance V (ɛ i ) = σ 2, show that ˆβ 1 = SS xy SS xx is an unbiased estimator of β 1. [Note: In this setup, x i is considered constant and so E(Y i ) = β 0 + β 1 x i.] [6]