Quantitative Methods for Economics Tutorial 6. Katherine Eyal

Similar documents
1. Answer this student s question: Is a random sample of 5% of the students at my school large enough, or should I use 10%?

a) List and define all assumptions for multiple OLS regression. These are all listed in section 6.5

College/high school median annual earnings gap,

Chapter 12 Practice Test

Section I: Multiple Choice Select the best answer for each problem.

Business Cycles. Chris Edmond NYU Stern. Spring 2007

Introduction to Topics in Macroeconomics 2

ECO 745: Theory of International Economics. Jack Rossbach Fall Lecture 6

Efficiency Wages in Major League Baseball Starting. Pitchers Greg Madonia

Figure 1a. Top 1% income share: China vs USA vs France

Running head: DATA ANALYSIS AND INTERPRETATION 1

Lesson 14: Modeling Relationships with a Line

Chapter 10 Aggregate Demand I: Building the IS LM Model

The Simple Linear Regression Model ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD

Contingent Valuation Methods

Distancei = BrandAi + 2 BrandBi + 3 BrandCi + i

The Powerball Regressivity: An Evidence from the World s Largest Lottery Prize

Navigate to the golf data folder and make it your working directory. Load the data by typing

Empirical Example II of Chapter 7

Statistical Modeling of Consumers Participation in Gambling Markets and Frequency of Gambling

Announcements. % College graduate vs. % Hispanic in LA. % College educated vs. % Hispanic in LA. Problem Set 10 Due Wednesday.

Algebra I: A Fresh Approach. By Christy Walters

THE FUTURE OF SALES TAX REVENUE

Consumer behaviour and the welfare eects of gambling in

Name May 3, 2007 Math Probability and Statistics

Midterm Exam 1, section 2. Thursday, September hour, 15 minutes

Using Actual Betting Percentages to Analyze Sportsbook Behavior: The Canadian and Arena Football Leagues

Equation 1: F spring = kx. Where F is the force of the spring, k is the spring constant and x is the displacement of the spring. Equation 2: F = mg

O F C E N M JANUARY 1998

STANDARD SCORES AND THE NORMAL DISTRIBUTION

4. A student estimated a regression model using annual data for 1990 through 2015, C = β 0. Y + β 2

By making use of SAFRIM (South African Inter-Industry Macro-Economic Model) By Jeaunes Viljoen, Conningarth Economists, 1

y ) s x x )(y i (x i r = 1 n 1 s y Statistics Lecture 7 Exploring Data , y 2 ,y n (x 1 ),,(x n ),(x 2 ,y 1 How two variables vary together

Algebra I: A Fresh Approach. By Christy Walters

The XVIIIth Annual EAFE Conference

A SURVEY OF 1997 COLORADO ANGLERS AND THEIR WILLINGNESS TO PAY INCREASED LICENSE FEES

5.1 Introduction. Learning Objectives

Lecture 22: Multiple Regression (Ordinary Least Squares -- OLS)

Department of Economics Working Paper

Journal of Sports Economics 2000; 1; 299

An Analysis of the Effects of Long-Term Contracts on Performance in Major League Baseball

Real-Time Electricity Pricing

The University of Georgia

Gamblers Favor Skewness, Not Risk: Further Evidence from United States Lottery Games

University of Nevada, Reno. The Effects of Changes in Major League Baseball Playoff Format: End of Season Attendance

1. What function relating the variables best describes this situation? 3. How high was the balloon 5 minutes before it was sighted?

Announcements. Lecture 19: Inference for SLR & Transformations. Online quiz 7 - commonly missed questions

Lab 11: Introduction to Linear Regression

Big Changes, Unknown Impacts

The Ricardian Continuum Model

LOW PRESSURE EFFUSION OF GASES revised by Igor Bolotin 03/05/12

Temporal reliability of willingness to pay from the National Survey of Fishing, Hunting and Wildlife- Associated Recreation

Name Date Period. E) Lowest score: 67, mean: 104, median: 112, range: 83, IQR: 102, Q1: 46, SD: 17

15, 2015 EXECUTIVE SUMMARY

The Miracle on Thin Ice: How A Nation's GDP Affects its Olympic Performance Jennifer Boudreau, James Kepner, and Joseph Rondone

EFFECTS OF EXTENDING AND EXPANDING ENERGY-EFFICIENCY TAX DEDUCTION FOR COMMERCIAL BUILDINGS

The Impact of Spending Cuts on Road Quality

An early warning system to predict house price bubbles

Very Persistent Current Accounts July Horag Choi, University of Auckland Nelson C. Mark, University of Notre Dame and NBER

The Reliability of Intrinsic Batted Ball Statistics Appendix

Author s Name Name of the Paper Session. Positioning Committee. Marine Technology Society. DYNAMIC POSITIONING CONFERENCE September 18-19, 2001

Factors Affecting the Probability of Arrests at an NFL Game

LOW PRESSURE EFFUSION OF GASES adapted by Luke Hanley and Mike Trenary

An Empirical Analysis of the Impact of Renewable Portfolio Standards and Feed-in-Tariffs on International Markets.

Temporal Reliability of Willingness to Pay from the National Survey of Fishing, Hunting, and Wildlife-Associated Recreation 1. John C.

QSPS Conference. May, 2013 Utah State University

Staking plans in sports betting under unknown true probabilities of the event

What does it take to produce an Olympic champion? A nation naturally

Chapter 10 Gases. Characteristics of Gases. Pressure. The Gas Laws. The Ideal-Gas Equation. Applications of the Ideal-Gas Equation

Is lung capacity affected by smoking, sport, height or gender. Table of contents

Indiana Electricity Projections: The 2018 Forecast Update

1 Introduction. 2 EAD and Derived Factors

B. Single Taxpayers (122,401 obs.) A. Married Taxpayers Filing jointly (266,272 obs.) Density Distribution. Density Distribution

Estuary Management and Recreational Fishing Benefits John C. Bergstrom Jeffrey H. Dorfman John B. Loomis

More of the Same; Or now for Something Completely Different?

NOTES: STANDARD DEVIATION DAY 4 Textbook Chapter 11.1, 11.3

Labor Markets. Chris Edmond NYU Stern. Spring 2007

1. Five more than three times a number x. 3. Seventeen percent of a number x.

The final set in a tennis match: four years at Wimbledon 1

Estimating Ridership of Rural Demand-Response Transit Services for the General Public

Title: 4-Way-Stop Wait-Time Prediction Group members (1): David Held

Economic Growth in the Trump Economy

The pth percentile of a distribution is the value with p percent of the observations less than it.

100-Meter Dash Olympic Winning Times: Will Women Be As Fast As Men?

For an Olive Wreath? Olympic Games and Anticipation Effects in Macroeconomics

THUNDER BAY INTERNATIONAL AIRPORT AUTHORITY BOARD OF DIRECTORS

THE YANKEES EFFECT: THE IMPACT OF INTERLEAGUE PLAY AND THE UNBALANCED SCHEDULE ON MAJOR LEAGUE BASEBALL ATTENDANCE

A Hedonic Price Analysis of Risk Preferences in Yearling Thoroughbred Buyers. Xiurui Iris Cui

Calculation of Trail Usage from Counter Data

Effects of Incentives: Evidence from Major League Baseball. Guy Stevens April 27, 2013

Top incomes in historical and international perspective: Recent developments

Independent Economic Analysis Board. Review of the Estimated Economic Impacts of Salmon Fishing in Idaho. Task Number 99

Graphing Stories Writing Equations

Emergence of a professional sports league and human capital formation for sports: The Japanese Professional Football League.

Deficit Reduction and Economic Growth: Are They Mutually Exclusive Goals? Tuesday, May 1, 2012; 2:30 PM - 3:45 PM

March Madness Basketball Tournament

Reference- Dependence in Dutch Football

Effect of homegrown players on professional sports teams

An American Profile: The United States and Its People

SPORT-FISHING USE AND VALUE: LOWER SNAKE RIVER RESERVOIRS

Transcription:

Quantitative Methods for Economics Tutorial 6 Katherine Eyal

TUTORIAL 6 13 September 2010 ECO3021S Part A: Problems 1. (a) In 1857, the German statistician Ernst Engel formulated his famous law: Households with higher incomes spend a smaller fraction of their income on food. Suppose you have household budget data for a sample of South African households. Specify the econometric model you would use to test Engel s law with this data. What are the appropriate hypotheses on the parameters of the model? (b) An extension of Engel s law states that as income rises, increases in food expenditure grow at a smaller rate. Specify the econometric model you would use to test this extension of Engel s law. What are the appropriate hypotheses on the parameters of the model? 2. Suppose you are asked to find the relationship between weekly hours spent studying (study) and weekly hours spent working (work) for UCT students. Does it make sense to characterise the problem as inferring whether study causes work or work causes study? Explain. 3. (a) Suppose an econometric analysis finds that the relationship between wickets taken and average annual salary for bowlers in the domestic cricket league is expected salary = R230, 000 + R50, 000 (wickets). How much do expected salaries rise for each wicket taken? (b) Suppose, instead, that expected salaries prove to be quadratic in wickets taken: E (salary wickets) = R230, 000 + [R65, 000 (wickets)] [R600 (wickets) 2] i. What is the derivative of expected salary with respect to wickets taken for a bowler who takes 10 wickets? 100 wickets? Interpret these derivatives. ii. Compute the expected salary of a bowler who takes 10 wickets. iii. Compute the expected salary of a bowler who takes 11 wickets. iv. Compute the expected salary of a bowler who takes 20 wickets. (c) Why might a bowler who takes 10 wickets have a different salary from the expected salary you compute in (bii)? 1

Part B: Computer Exercises 1. The data set BWGHT.DTA contains data on births to women in the United States. (a) What type of data structure is contained in this data set? (b) How many women are in the sample, and how many report smoking during pregnancy? The following commands might help you (they do not have to be executed in this specific order, and you don t have to execute every command to answer the question): describe tab cigs count if cigs >0 (c) What is the average number of cigarettes smoked per day? (Use the command: sum cigs) Is the average a good measure of the typical woman in this case? Explain. (d) Among women who smoked during pregnancy, what is the average number of cigarettes smoked per day? (Use the command: sum cigs if cigs> 0) How does this compare with your answer from part (c), and why? (e) Find the average of fatheduc in the sample. Why are only 1,192 observations used to compute this average? (f) Report the average family income and its standard deviation in dollars. 2. The data set in CEOSAL2.DTA contains information on chief executive officers for U.S. corporations. The variable salary is annual compensation, in thousands of dollars, and ceoten is prior number of years as company CEO. (a) Find the average salary and the average tenure in the sample. (b) How many CEOs are in their first year as CEO (i.e., ceoten = 0)? What is the longest tenure as a CEO? (c) Estimate the simple regression model log(salary) = β 0 + β 1 ceoten + u by using the command: reg lsalary ceoten. What is the (approximate) predicted percentage increase in salary given one more year as a CEO? (In the tutorials, as in the textbook, log always refers to the natural logarithm.) 3. Trygve Haavelmo used consumption and income data from 1922 to 1941 to estimate the marginal propensity to consume. This data is in HAAVELMO.DTA. The measure of consumption in this data set real cons and the measure of income is real GDP. 2

Haavelmo reported a marginal propensity to consume of 0.73 when he estimated a consumption function of the form consumption = α + β(income) + u where β is the marginal propensity to consume. Milton Friedman later argued that the consumption function passes through the origin, as in consumption = β(income) + u (a) Find the averages of consumption and income in this sample. (b) Confirm Haavelmo s estimate of the marginal propensity to consume. does the intercept in the consumption function represent? What (c) Estimate the marginal propensity to consume using the model specified by Milton Friedman. How does this compare to Haavelmo s estimate? (d) As you know, in Keynesian macroeconomic theory an independent change in government spending, G increases income by G/ (1 β). This is called the multiplier effect, and the multiplier is 1/ (1 β). What are the estimated multipliers for the two estimates of the marginal propensity to consume? 3

TUTORIAL 6 SOLUTIONS 13 September 2010 ECO3021S Part A: Problems 1. (a) In 1857, the German statistician Ernst Engel formulated his famous law: Households with higher incomes spend a smaller fraction of their income on food. Suppose you have household budget data for a sample of South African households. Specify the econometric model you would use to test Engel s law with this data. What are the appropriate hypotheses on the parameters of the model? (b) An extension of Engel s law states that as income rises, increases in food expenditure grow at a smaller rate. Specify the econometric model you would use to test this extension of Engel s law. What are the appropriate hypotheses on the parameters of the model? (a) One way to examine whether the fraction of income spent on food falls with income is to estimate the relationship food expenditure = β 0 + β 1 income + u In this formulation, the average food consumption for a given income level is β 0 +β 1 income, so the fraction of income spent on food, (food expenditure/income), is (β 0 /income) + β 1 which falls as income rises if β 0 is positive, and rises as income rises if β 0 is negative. Another way to examine Engel s law is to use a double logarithmic relationship: log (food expenditure) = α 0 + α 1 log (income) + v In this formulation, α 1 is the elasticity of food expenditure with respect to income. If α 1 < 1, then average food expenditure grows more slowly than does income, because food demand is income inelastic, and the fraction of income spent on food expenditure falls as income rises, as Engel s law posits. 1

(b) A quadratic specification in which both income and the square of income affect food expenditure allows a test of the hypothesis that as income rises, increases in food expenditure grow at a smaller rate. food expenditure = β 0 + β 1 income + β 2 income 2 + ε In this formulation, the rate of change of food expenditure with respect to d (food expenditure) income is given by the first derivative, = β 1 + β 2 income. d (income) The extension of Engel s law states that this first derivative is a decreasing function of income, i.e. the second derivative d2 (food expenditure) d (income) 2 = β 2 is negative. Thus, if the estimated β 2 is negative, the data support this extension of Engel s law. 2. Suppose you are asked to find the relationship between weekly hours spent studying (study) and weekly hours spent working (work) for UCT students. Does it make sense to characterise the problem as inferring whether study causes work or work causes study? Explain. It does not make sense to pose the question in terms of causality. Economists would assume that students choose a mix of studying and working (and other activities, such as attending class, leisure, and sleeping) based on rational behavior, such as maximizing utility subject to the constraint that there are only 168 hours in a week. We can then use statistical methods to measure the association between studying and working, including regression analysis that we cover starting in Chapter 2. But we would not be claiming that one variable causes the other. They are both choice variables of the student. 3. (a) Suppose an econometric analysis finds that the relationship between wickets taken and average annual salary for bowlers in the domestic cricket league is expected salary = R230, 000 + R50, 000 (wickets). How much do expected salaries rise for each wicket taken? (b) Suppose, instead, that expected salaries prove to be quadratic in wickets taken: E (salary wickets) = R230, 000 + [R65, 000 (wickets)] [R600 (wickets) 2] 2

i. What is the derivative of expected salary with respect to wickets taken for a bowler who takes 10 wickets? 100 wickets? Interpret these derivatives. ii. Compute the expected salary of a bowler who takes 10 wickets. iii. Compute the expected salary of a bowler who takes 11 wickets. iv. Compute the expected salary of a bowler who takes 20 wickets. (c) Why might a bowler who takes 10 wickets have a different salary from the expected salary you compute in (bii)? d expected salary (a) The expected increase in the salaries for each wicket taken is = d wickets R50, 000. (b) i. d E (salary wickets) = 65, 000 1, 200 (wickets). d wickets d E (salary 10) Thus, when wickets = 10 : = 65, 000 1, 200(10) = 53, 000. d wickets d E (salary 100) When wickets = 100 : = 65, 000 1, 200(100) = 55, 000. d wickets Interpretation: rate of change of the expected salary with respect to wickets taken. If a bowler who has already taken 10 wickets takes an additional wicket, then his expected salary increases by R53 000. If a bowler who has already taken 100 wickets takes an additional wicket, then his expected salary increases by R55 000. ii. E(salary wickets = 10) = 230, 000 + 65, 000(10) 600(100) = 820, 000. iii. E(salary wickets = 11) = 230, 000 + 65, 000(11) 600(121) = 872, 400. (The difference in expected salary is 872, 400 820, 000 = 52, 400, which is close to our estimate of 53, 000 from part (bi). iv. E(salary wickets = 20) = 230, 000 + 65, 000(20) 600(400) = 1, 290, 000. (c) There are many other factors that could affect a bowler s salary that are not considered in the econometric analysis. Some of these factors are observable, age, number of years with current team, etc, and some of these factors are unobservable, such as popularity and ability. These factors (as well as any measurement error) would be captured in the error term. Part B: Computer Exercises 1. The data set BWGHT.DTA contains data on births to women in the United States. (a) What type of data structure is contained in this data set? 3

(b) How many women are in the sample, and how many report smoking during pregnancy? The following commands might help you (they do not have to be executed in this specific order, and you don t have to execute every command to answer the question): describe tab cigs count if cigs >0 (c) What is the average number of cigarettes smoked per day? (Use the command: sum cigs) Is the average a good measure of the typical woman in this case? Explain. (d) Among women who smoked during pregnancy, what is the average number of cigarettes smoked per day? (Use the command: sum cigs if cigs>0) How does this compare with your answer from part (c), and why? (e) Find the average of fatheduc in the sample. Why are only 1,192 observations used to compute this average? (f) Report the average family income and its standard deviation in dollars. (a) Cross-sectional (b) There are 1,388 observations in the sample. Tabulating the variable cigs shows that 212 women have cigs > 0. (c) The average of cigs is about 2.09, but this includes the 1,176 women who did not smoke. Reporting just the average masks the fact that almost 85 percent of the women did not smoke. It makes more sense to say that the typical woman does not smoke during pregnancy; indeed, the median number of cigarettes smoked is zero. (d) The average of cigs over the women with cigs > 0 is about 13.7. Of course this is much higher than the average over the entire sample because we are excluding 1,176 zeros. (e) The average of f atheduc is about 13.2. There are 196 observations with a missing value for f atheduc, and those observations are necessarily excluded in computing the average. (f) The average and standard deviation of f aminc are about 29.027 and 18.739, respectively, but f aminc is measured in thousands of dollars. So, in dollars, the average and standard deviation are $29,027 and $18,739. 4

2. The data set in CEOSAL2.DTA contains information on chief executive officers for U.S. corporations. The variable salary is annual compensation, in thousands of dollars, and ceoten is prior number of years as company CEO. (a) Find the average salary and the average tenure in the sample. (b) How many CEOs are in their first year as CEO (i.e., ceoten = 0)? What is the longest tenure as a CEO? (c) Estimate the simple regression model log(salary) = β 0 + β 1 ceoten + u by using the command: reg lsalary ceoten. What is the (approximate) predicted percentage increase in salary given one more year as a CEO? (In the tutorials, as in the textbook, log always refers to the natural logarithm.) (a) Average salary is about 865.864, which means $865,864 because salary is in thousands of dollars. Average ceoten is about 7.95. (b) There are five CEOs with ceoten = 0. The longest tenure is 37 years. (c) The estimated equation is log(salary) = 6.51 +.0097 ceoten n = 177, R 2 =.013 We obtain the approximate percentage change in salary given ceoten = 1 by multiplying the coefficient on ceoten by 100, 100(.0097) =.97%. Therefore, one more year as CEO is predicted to increase salary by almost 1%. 3. Trygve Haavelmo used consumption and income data from 1922 to 1941 to estimate the marginal propensity to consume. This data is in HAAVELMO.DTA. The measure of consumption in this data set real cons and the measure of income is real GDP. Haavelmo reported a marginal propensity to consume of 0.73 when he estimated a consumption function of the form consumption = α + β(income) + u where β is the marginal propensity to consume. Milton Friedman later argued that the consumption function passes through the origin, as in consumption = β(income) + u 5

(a) Find the averages of consumption and income in this sample. (b) Confirm Haavelmo s estimate of the marginal propensity to consume. does the intercept in the consumption function represent? What (c) Estimate the marginal propensity to consume using the model specified by Milton Friedman. How does this compare to Haavelmo s estimate? (d) As you know, in Keynesian macroeconomic theory an independent change in government spending, G increases income by G/ (1 β). This is called the multiplier effect, and the multiplier is 1/ (1 β). What are the estimated multipliers for the two estimates of the marginal propensity to consume? (a) Stata command: sum real cons real GDP. The average of consumption and income in the sample are 437.6 and 482.95 respectively. (b) Stata command: regress real cons real GDP and the MPC is 0.73, same as Haavelmo. The estimated constant or intercept is 84 which is the level of consumption when income is zero. (c) Stata command: regress real cons real GDP, noconstant. The estimated slope is now 0.903, higher than the estimated using the Haavelmo model. (d) Multiplier effect in Haavelmo: 1/(1 0.73) = 3.7 and in Friedman: 1/(1 0.903) = 10.3 (much higher). 6