Stats 2002: Probabilities for Wins and Losses of Online Gambling

Similar documents
Universitas Sumatera Utara

Running head: DATA ANALYSIS AND INTERPRETATION 1

One-factor ANOVA by example

Statistical Analysis of PGA Tour Skill Rankings USGA Research and Test Center June 1, 2007

One-way ANOVA: round, narrow, wide

Driv e accu racy. Green s in regul ation

Week 7 One-way ANOVA

Case Processing Summary. Cases Valid Missing Total N Percent N Percent N Percent % 0 0.0% % % 0 0.0%

Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA

CAPACITY ESTIMATION OF URBAN ROAD IN BAGHDAD CITY: A CASE STUDY OF PALESTINE ARTERIAL ROAD

Stat 139 Homework 3 Solutions, Spring 2015

Analysis of Variance. Copyright 2014 Pearson Education, Inc.

Data Set 7: Bioerosion by Parrotfish Background volume of bites The question:

Unit 4: Inference for numerical variables Lecture 3: ANOVA

Chapter 7. Comparing Two Population Means. Comparing two population means. T-tests: Independent samples and paired variables.

Practice Test Unit 6B/11A/11B: Probability and Logic

Unit 6 Day 2 Notes Central Tendency from a Histogram; Box Plots

Practice Test Unit 06B 11A: Probability, Permutations and Combinations. Practice Test Unit 11B: Data Analysis

ANOVA - Implementation.

Chapter 5: Methods and Philosophy of Statistical Process Control

Analyzing Categorical Data & Displaying Quantitative Data Section 1.1 & 1.2

Descriptive Statistics Project Is there a home field advantage in major league baseball?

Prokopios Chatzakis, National and Kapodistrian University of Athens, Faculty of Physical Education and Sport Science 1

CHAPTER ANALYSIS AND INTERPRETATION Average total number of collisions for a try to be scored

Bivariate Data. Frequency Table Line Plot Box and Whisker Plot

Announcements. Lecture 19: Inference for SLR & Transformations. Online quiz 7 - commonly missed questions

Warm-up. Make a bar graph to display these data. What additional information do you need to make a pie chart?

NCSS Statistical Software

STAT 625: 2000 Olympic Diving Exploration

Confidence Interval Notes Calculating Confidence Intervals

0-13 Representing Data

STAT 155 Introductory Statistics. Lecture 2-2: Displaying Distributions with Graphs

Math SL Internal Assessment What is the relationship between free throw shooting percentage and 3 point shooting percentages?

Select Boxplot -> Multiple Y's (simple) and select all variable names.

1. In a hypothesis test involving two-samples, the hypothesized difference in means must be 0. True. False

1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data.

STAT 101 Assignment 1

Lab Report Outline the Bones of the Story

Lesson 3 Pre-Visit Teams & Players by the Numbers

STT 315 Section /19/2014

Internet Technology Fundamentals. To use a passing score at the percentiles listed below:

NBA TEAM SYNERGY RESEARCH REPORT 1

Using Actual Betting Percentages to Analyze Sportsbook Behavior: The Canadian and Arena Football Leagues

Box-and-Whisker Plots

! Problem Solving Students will use past Olympic statistics and mathematics to predict the most recent Olympic statistics.

BASEBALL SALARIES: DO YOU GET WHAT YOU PAY FOR? Comparing two or more distributions by parallel box plots

Was John Adams more consistent his Junior or Senior year of High School Wrestling?

March Madness Basketball Tournament

Lab 5: Descriptive Statistics

Safety at Intersections in Oregon A Preliminary Update of Statewide Intersection Crash Rates

Example 1: One Way ANOVA in MINITAB

Unit 3 ~ Data about us

Is lung capacity affected by smoking, sport, height or gender. Table of contents

Chapter 12 Practice Test

PLANNED ORTHOGONAL CONTRASTS

The Rise in Infield Hits

Legendre et al Appendices and Supplements, p. 1

PGA Tour Scores as a Gaussian Random Variable

Descriptive Statistics

Economic Value of Celebrity Endorsements:

Major League Baseball Offensive Production in the Designated Hitter Era (1973 Present)

March Madness Basketball Tournament

UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS General Certificate of Education Ordinary Level

Chapter 2 - Frequency Distributions and Graphs

Fruit Fly Exercise 1- Level 1

Solutionbank S1 Edexcel AS and A Level Modular Mathematics

Quantitative Literacy: Thinking Between the Lines

12. School travel Introduction. Part III Chapter 12. School travel

CHAPTER 1 ORGANIZATION OF DATA SETS

Acknowledgement: Author is indebted to Dr. Jennifer Kaplan, Dr. Parthanil Roy and Dr Ashoke Sinha for allowing him to use/edit many of their slides.

Political Science 30: Political Inquiry Section 5

Quality Assurance Charting for QC Data

Denise L Seman City of Youngstown

AP Statistics Midterm Exam 2 hours

How are the values related to each other? Are there values that are General Education Statistics

Diameter in cm. Bubble Number. Bubble Number Diameter in cm

Analysis of recent swim performances at the 2013 FINA World Championship: Counsilman Center, Dept. Kinesiology, Indiana University

Chapter 2 Displaying and Describing Categorical Data

Statistical Method Certification, Coal Combustion Residuals Landfill, Reid Gardner Generating Station

5.1. Data Displays Batter Up. My Notes ACTIVITY

How Fast Can You Throw?

a) List and define all assumptions for multiple OLS regression. These are all listed in section 6.5

Year 10 Term 2 Homework

ASTERISK OR EXCLAMATION POINT?: Power Hitting in Major League Baseball from 1950 Through the Steroid Era. Gary Evans Stat 201B Winter, 2010

Using Markov Chains to Analyze a Volleyball Rally

Section I: Multiple Choice Select the best answer for each problem.

Sportsbook pricing and the behavioral biases of bettors in the NHL

save percentages? (Name) (University)

Deciding When to Quit: Reference-Dependence over Slot Machine Outcomes

Total Morphological Comparison Between Anolis oculatus and Anolis cristatellus

(c) The hospital decided to collect the data from the first 50 patients admitted on July 4, 2010.

Inquiry Investigation: Factors Affecting Photosynthesis

EXST7015: Salaries of all American league baseball players (1994) Salaries in thousands of dollars RAW DATA LISTING

ANALYSIS OF THE DOMINATING POWER OF SERVICE RECEPTION IN VOLLEYBALL IN DIFFERENT LEVELS OF COMPETITIONS

Palythoa Abundance and Coverage in Relation to Depth

USING A PROBABILITY APPROACH TO RANK IN-LINE INSPECTION ANOMALIES FOR EXCAVATION AND FOR SETTING REINSPECTION INTERVALS

Math 146 Statistics for the Health Sciences Additional Exercises on Chapter 2

That pesky golf game and the dreaded stats class

Box-and-Whisker Plots

Algebra 1 Unit 6 Study Guide

Transcription:

Abstract: Jennifer Mateja Andrea Scisinger Lindsay Lacher Stats 2002: Probabilities for Wins and Losses of Online Gambling The objective of this experiment is to determine whether online gambling is a profitable hobby. In order to achieve this goal, data was collected from nine individuals who participate in this past time. The data was then analyzed by using SPSS to create various tables and graphs that allowed us to interpret the data. We were able to conclude from the results that online gambling is not a profitable hobby, and that there was no difference in the net gains/losses between soccer, hockey, and baseball. Background Information: Gambling can either be an individual s rise to riches or fall to rags. With technology exponentially increasing, a new form of gambling that exists over the Internet has been developed. Individuals now have the luxury to win or lose money with the click of a mouse. In order to become involved in this new craze, individuals must register for a user name and a password at one of a few selected websites such as <www.sportsbookbetting.com>. Following the registration process, the individual may immediately begin placing bets on his or her choice of sport. The bets range anywhere from $2 to $50 per game. However, the question now becomes, do the odds of winning exceed those of losing for each individual sport? In order to answer this question, we decided to perform an experiment examining the probabilities of wins and losses for the three selected sports. Inferences were then made about the data in order to determine if online gambling is a profitable hobby.

2 Methods: Nine individuals living on campus at YSU who are all involved with online gambling volunteered to participate in this experiment. Over a one-week period, each individual made approximately ten to fifteen online bets for various sports and activities. Three specific sports were chosen for this particular test: soccer, hockey, and baseball. A list was compiled for a oneweek period containing the amount of money gambled for each bet made and also the amount of money won or lost per bet. The data for each sport was then inputted into SPSS in order to make several comparisons between the probabilities of wins and losses. In order to make such comparisons, several charts and graphs were formatted containing the various probabilities for each sport. Comparisons were then made between the three sports. Following this, the total wins and losses were merged and new probabilities were calculated. These new probabilities allowed us to examine online gambling as a whole rather than examining the individual sports alone. H o = There is no significant difference between the net gain/loss of the three sports. H a = There is significant difference between the net gain/loss of each of the three sports. Results: In order to analyze the data, variables were assigned to each sport when inputting the information into SPSS. Soccer was marked as 1 ; hockey was marked as 2 ; baseball was marked as 3.

3 Table 1. Descriptives TRUERATE GAME 1.00 Mean 95% Confidence Interval for Mean Lower Bound Upper Bound Statistic Std. Error 7.667E-02.3465 -.6320.7853 2.00 5% Trimmed Mean Median Variance Std. Deviation Minimum Maximum Range Interquartile Range Skewness Kurtosis Mean 95% Confidence Interval for Mean Lower Bound Upper Bound -.1062-1.0000 3.602 1.8978-1.00 5.00 6.00 2.3250 1.390.427.344.833.6944.4526 -.2313 1.6201 3.00 5% Trimmed Mean Median Variance Std. Deviation Minimum Maximum Range Interquartile Range Skewness Kurtosis Mean 95% Confidence Interval for Mean Lower Bound Upper Bound.5494-1.0000 6.146 2.4791-1.00 5.00 6.00 4.3333.858.427-1.187.833.1333.3569 -.5965.8632 5% Trimmed Mean Median Variance Std. Deviation Minimum Maximum Range Interquartile Range Skewness Kurtosis -4.32E-02-1.0000 3.821 1.9547-1.00 5.00 6.00 3.2500 1.280.427 -.064.833 The above chart displays the mean rate of net gain or loss for each sport per bet made. Also, the upper and lower bounds illustrate the 95% confidence interval for the gains and losses.

4 Chart 1. 70 Frequency of Wins and Losses 60 50 Frequency 40 30 20 10 0 loss win Wins / Losses The above chart displays the overall frequency distribution of wins and losses, excluding the monetary values of each bet. Chart 2. Wins / Losses win 28.9% loss 71.1%

5 The above pie chart is another way to visually depict the ratio of wins to losses. It also provides percentages. Chart 3. 60 Frequency of Value per Bet 50 40 Frequency 30 20 10 0 3.00 5.00 10.00 20.00 25.00 50.00 Amount of Money Bet The above chart displays the frequencies for the amount of money gambled per bet. The data distribution is skewed to the right meaning that most people bet smaller amounts of money. Table 2. TRUERATE Stem-and-Leaf Plot for GAME= 1.00 Frequency Stem & Leaf 22.00-1. 0000000000000000000000.00-0..00 0. 1.00 1. 0 1.00 2. 3 5.00 3. 00333 1.00 Extremes (>=5.0) Stem width: 1.00 Each leaf: 1 case(s)

6 Table 3. TRUERATE Stem-and-Leaf Plot for GAME= 2.00 Frequency Stem & Leaf 20.00-1. 00000000000000000000.00-0..00 0..00 1..00 2. 5.00 3. 33335 1.00 4. 0 4.00 5. 0000 Stem width: 1.00 Each leaf: 1 case(s) Table 4. TRUERATE Stem-and-Leaf Plot for GAME= 3.00 Frequency Stem & Leaf 22.00-1. 0000000000000000000000.00-0..00 0..00 1. 1.00 2. 0 6.00 3. 000333.00 4. 1.00 5. 0 Stem width: 1.00 Each leaf: 1 case(s) Tables 2-4 display the frequency of the rate of gains and losses for every dollar gambled per bet. The data distribution is skewed to the right.

7 Graph 1. 2.0 1.5 1.0.5 95% CI TRUERATE 0.0 -.5-1.0 N = 30 1.00 30 2.00 30 3.00 GAME The above graph displays the mean rate of gains or losses. It also shows error bars for each of the individual sports. Table 5. ANOVA NETGAIN Sum of Squares df Mean Square F Sig. Between Groups 249.489 2 124.744.348.707 Within Groups 31170.167 87 358.278 Total 31419.656 89

8 The above table displays an analysis performed on SPSS that reveals if the net gain/losses between the three sports are different from one another. Table 6. TRUERATE GAME 1.00 2.00 3.00 Tests of Normality Kolmogorov-Smirnov a Shapiro-Wilk Statistic df Sig. Statistic df Sig..448 30.000.605 30.010** **. This is an upper bound of the true significance. a. Lilliefors Significance Correction.420 30.000.646 30.010**.452 30.000.602 30.010** Because the stem and leaf plots were drastically skewed, we decided to perform a normality to test. The above table displays the whether or not the true rate for each sport follows a normal distribution. Table 7. Kruskal-Wallis Test Test Statistics a,b TRUERATE Chi-Square 1.440 df 2 Asymp. Sig..487 a. Kruskal Wallis Test b. Grouping Variable: GAME The above table illustrates the results of a Kruskal-Wallis test, which is used to show comparisons for data that is not normally distributed.

9 Analyses: Figure 1 shows the mean rate value for each of the three sports. For sport one, soccer, there is an average gain of 7.667 cents per bet. For sport two, baseball, an average of 69.4 cents is gained per bet. Lastly, for sport three, hockey, there is a net gain of about 13.3 cents per bet. Based on this data alone, none of the three sports analyzed shows a large margin of profit. Chart 2 and 3 depict the frequency of the total number of wins and losses. By examining this data, we noticed that the difference in percentages between the two was quite large. Overall, 71.1% of the total bets made were wins, and 28.9% of the bets were losses, leading us to conclude that online gambling is unprofitable. Chart 4 displays the frequency of the average amount of money gambled per bet. The data is skewed to the right meaning that the majority of the bets were for a smaller amount. As the value of the bets increased, the number of bets made at that value decreased. Figures 2-4 are stem and leaf plots that display the frequency of the gains and losses made per dollar bet. The distribution pattern for each of the three plots is skewed to the right. This means that there is a much higher frequency of losses than gains. These results also drew us to the conclusion that online gambling is not a profitable hobby. Graph 1 displays the mean rate of gain or loss per bet for each sport. Because the three means are very close to one another and they are all close to zero, we can conclude that none of the three sports analyzed produce a large net profit per bet. Table 5 illustrates a comparison of the relationship between the three sports. Because the p- value was.707 which is greater than.05, we fail to reject the null hypothesis because there is not significant evidence supporting the alternative hypothesis.

10 Because the stem and leaf plots were drastically skewed, we decided to perform a normality test. Table 6 displays the results of this test. Because the p-values were all very small, the true rate values for the three sports are not normally distributed. Therefore, a nonparametric test was used as a second method to test the null hypothesis. Table 7 illustrates the results to a Kruskal-Wallis test, which is a non-parametric test. This test is used to test the relationship between data that is not normally distributed. Because the p- value was.487, which is greater than.05, we failed to reject the null hypothesis again because there was not significant evidence to support the alternative. Therefore, there is no significant difference between the true rates for each of the three sports. Discussion: By performing this experiment, we were able to conclude that online gambling is an unprofitable hobby. By examining the various charts and graphs produced on SPSS, an obvious trend was revealed illustrating that online gambling is not profitable. By looking at the frequency distribution of wins and losses, the amount of losses exceeded that of wins by nearly 1/3. The Kruskal-Wallis test, which is used to test data that is not normally distributed, provides very important results for this experiment. Because the null hypothesis was.487, we failed to reject the null hypothesis because there was not significant evidence to support the alternative hypothesis. Therefore, there is no significant difference between the average net gain of three sports. In our experiment, possible biases may have affected the outcome. Since the data was received from college students only, the results may not be a true representation of online gambling. Also the sample size was fairly small because we only collected data from nine

11 individuals. However, there was still a large quantity of values to work with. If more individuals had been used, the results may have been more accurate. In such an experiment as this, one way to avoid bias is by using a large sample size. Overall, the results indicate that an individual s safest bet is to not bet at all.