Deconstructing Data Science

Size: px
Start display at page:

Download "Deconstructing Data Science"

Transcription

1 Deconstructing Data Science David Bamman, UC Berkele Info 29 Lecture 4: Regression overview Feb 1, 216

2 Regression A mapping from input data (drawn from instance space ) to a point in R (R = the set of real numbers) = the empire state building =

3 task Y predicting bo office revenue movie R

4 Eperiment design training development testing size 8% 1% 1% purpose training models model selection evaluation; never look at it until the ver end

5 Metrics Measure difference between the prediction ŷ and the true Mean squared error 1 N (ŷ i i ) 2 (MSE) N i=1 Mean absolute error 1 N ŷ i i (MAE) N i=1

6 Linear regression F ŷ = i β i i= β R F (F-dimensional vector of real numbers)

7 Polnomial regression F F ŷ = i β a,i + 2 i β b,i i=1 i=1 4 3 ^2 2 1 βa, βb R F (F-dimensional vector of real numbers)

8 Polnomial regression F F F ŷ = i β a,i + 2 i β b,i + 3 i β c,i i=1 i=1 i=1 5 ^3-5 βa, βb, βc R F (F-dimensional vector of real numbers)

9 Nonlinear regression Deep learning Decision trees Probabilistic graphical models Random forests Support vector machines (regression) Networks Neural networks

10 Number of Parameters order 1 (linear reg.) ŷ = F i=1 i β a,i F F order 2 ŷ = i β a,i + 2 i β b,i i=1 i=1 F F F order 3 ŷ = i β a,i + 2 i β b,i + 3 i β c,i i=1 i=1 i=1

11

12 instance space labeled data labeled data labeled data

13 degree 1, training MSE = 73.4

14 degree 2, training MSE = 71.9

15 degree 3, training MSE = 6.9

16 degree 4, training MSE = 6.6

17 degree 5, training MSE = 59.1

18 degree 6, training MSE = 5.2

19 degree 7, training MSE = 49.6

20 degree 8, training MSE = 46.8

21 degree 9, training MSE = 41.2

22 degree 1, training MSE = 35.8

23 degree 11, training MSE = 21.1

24 degree 12, training MSE = 18.4

25

26

27

28

29

30 Overfitting Memorizing the nuances (and noise) of the training data that prevents generalizing to unseen data

31 Sources of error Bias: Error due to mis-specifing the relationship between input and the output. [too few parameters, or the wrong kinds] Variance: Error due to sensitivit to random fluctuations in the training data. If ou train on different data, do ou get radicall different predictions? [too man parameters]

32 Low variance High variance Low bias High bias Image from Flach 212

33 Eample: High bias, low variance: Alwas predict Berkele geolocation on Twitter High bias, high variance: Predict most frequent cit in training data Low bias, high variance: man features, some of which capture true signal but capture random noise Low bias, low variance: enough features to capture the true signal

34 Ordinal regression In between classification and regression Y is categorical (e.g.,,, ) Elements of Y are ordered < < <

35 Ordinal regression task Y predicting star ratings movie {,, }

36 Computational Journalism Sarah Cohen, James T. Hamilton, and Fred Turner, Computational Journalism, Communications of the ACM (211) Slvain Parasie, Data-Driven Revelation? Epistemological tensions in investigative journalism in the age of big data, Digital Journalism (215)

37 Computational Journalism Changing how stories are discovered, presented, aggregated, monetized and archived (Cohen et al. 212) Draws on earlier tradition of computer-assisted reporting and precision journalism (Meer 1972)

38 Computational Journalism Database linking, e.g.: voting records to the deceased press releases from different members of congress indictments/settlements from U.S. attornes documents from SEC, Pentagon, defense contractors to note movement to industr (Cohen 212) DSA database of safet status of CA public schools + US seismic zones + school list from CA Dept of (Parasie 215)

39 Computational Journalism Information etraction: need to pull out people, places, organizations and their relationship from large (often sudden) dumps of documents. Analzing the relationship between entities

40 Computational Journalism Data-driven stories about large-scale trends Relationship between birth ear and political views NY Times (Jul 7, 214) Change in insured Americans under the ACA, NY Times (Oct 29, 214) 4

41 Computational Journalism Data-driven lead generation; the outliers in analsis that point to a stor

Deconstructing Data Science

Deconstructing Data Science Deconstructing Data Science David Bamman, UC Berkele Info 29 Lecture 4: Regression overview Jan 26, 217 Regression A mapping from input data (drawn from instance space ) to a point in R (R = the set of

More information

Building an NFL performance metric

Building an NFL performance metric Building an NFL performance metric Seonghyun Paik (spaik1@stanford.edu) December 16, 2016 I. Introduction In current pro sports, many statistical methods are applied to evaluate player s performance and

More information

Projecting Three-Point Percentages for the NBA Draft

Projecting Three-Point Percentages for the NBA Draft Projecting Three-Point Percentages for the NBA Draft Hilary Sun hsun3@stanford.edu Jerold Yu jeroldyu@stanford.edu December 16, 2017 Roland Centeno rcenteno@stanford.edu 1 Introduction As NBA teams have

More information

Estimating the Probability of Winning an NFL Game Using Random Forests

Estimating the Probability of Winning an NFL Game Using Random Forests Estimating the Probability of Winning an NFL Game Using Random Forests Dale Zimmerman February 17, 2017 2 Brian Burke s NFL win probability metric May be found at www.advancednflstats.com, but the site

More information

BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG

BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG GOAL OF PROJECT The goal is to predict the winners between college men s basketball teams competing in the 2018 (NCAA) s March

More information

PREDICTING THE NCAA BASKETBALL TOURNAMENT WITH MACHINE LEARNING. The Ringer/Getty Images

PREDICTING THE NCAA BASKETBALL TOURNAMENT WITH MACHINE LEARNING. The Ringer/Getty Images PREDICTING THE NCAA BASKETBALL TOURNAMENT WITH MACHINE LEARNING A N D R E W L E V A N D O S K I A N D J O N A T H A N L O B O The Ringer/Getty Images THE TOURNAMENT MARCH MADNESS 68 teams (4 play-in games)

More information

Fun Neural Net Demo Site. CS 188: Artificial Intelligence. N-Layer Neural Network. Multi-class Softmax Σ >0? Deep Learning II

Fun Neural Net Demo Site. CS 188: Artificial Intelligence. N-Layer Neural Network. Multi-class Softmax Σ >0? Deep Learning II Fun Neural Net Demo Site CS 188: Artificial Intelligence Demo-site: http://playground.tensorflow.org/ Deep Learning II Instructors: Pieter Abbeel & Anca Dragan --- University of California, Berkeley [These

More information

Predicting Horse Racing Results with Machine Learning

Predicting Horse Racing Results with Machine Learning Predicting Horse Racing Results with Machine Learning LYU 1703 LIU YIDE 1155062194 Supervisor: Professor Michael R. Lyu Outline Recap of last semester Object of this semester Data Preparation Set to sequence

More information

Introduction to Machine Learning NPFL 054

Introduction to Machine Learning NPFL 054 Introduction to Machine Learning NPFL 054 http://ufal.mff.cuni.cz/course/npfl054 Barbora Hladká hladka@ufal.mff.cuni.cz Martin Holub holub@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and

More information

Predicting NBA Shots

Predicting NBA Shots Predicting NBA Shots Brett Meehan Stanford University https://github.com/brettmeehan/cs229 Final Project bmeehan2@stanford.edu Abstract This paper examines the application of various machine learning algorithms

More information

Unit 4: Inference for numerical variables Lecture 3: ANOVA

Unit 4: Inference for numerical variables Lecture 3: ANOVA Unit 4: Inference for numerical variables Lecture 3: ANOVA Statistics 101 Thomas Leininger June 10, 2013 Announcements Announcements Proposals due tomorrow. Will be returned to you by Wednesday. You MUST

More information

knn & Naïve Bayes Hongning Wang

knn & Naïve Bayes Hongning Wang knn & Naïve Bayes Hongning Wang CS@UVa Today s lecture Instance-based classifiers k nearest neighbors Non-parametric learning algorithm Model-based classifiers Naïve Bayes classifier A generative model

More information

A Novel Approach to Predicting the Results of NBA Matches

A Novel Approach to Predicting the Results of NBA Matches A Novel Approach to Predicting the Results of NBA Matches Omid Aryan Stanford University aryano@stanford.edu Ali Reza Sharafat Stanford University sharafat@stanford.edu Abstract The current paper presents

More information

Lecture 5. Optimisation. Regularisation

Lecture 5. Optimisation. Regularisation Lecture 5. Optimisation. Regularisation COMP90051 Statistical Machine Learning Semester 2, 2017 Lecturer: Andrey Kan Copyright: University of Melbourne Iterative optimisation Loss functions Coordinate

More information

CS 221 PROJECT FINAL

CS 221 PROJECT FINAL CS 221 PROJECT FINAL STUART SY AND YUSHI HOMMA 1. INTRODUCTION OF TASK ESPN fantasy baseball is a common pastime for many Americans, which, coincidentally, defines a problem whose solution could potentially

More information

Introduction to Pattern Recognition

Introduction to Pattern Recognition Introduction to Pattern Recognition Jason Corso SUNY at Buffalo 12 January 2009 J. Corso (SUNY at Buffalo) Introduction to Pattern Recognition 12 January 2009 1 / 28 Pattern Recognition By Example Example:

More information

Title: 4-Way-Stop Wait-Time Prediction Group members (1): David Held

Title: 4-Way-Stop Wait-Time Prediction Group members (1): David Held Title: 4-Way-Stop Wait-Time Prediction Group members (1): David Held As part of my research in Sebastian Thrun's autonomous driving team, my goal is to predict the wait-time for a car at a 4-way intersection.

More information

PREDICTING the outcomes of sporting events

PREDICTING the outcomes of sporting events CS 229 FINAL PROJECT, AUTUMN 2014 1 Predicting National Basketball Association Winners Jasper Lin, Logan Short, and Vishnu Sundaresan Abstract We used National Basketball Associations box scores from 1991-1998

More information

CS 7641 A (Machine Learning) Sethuraman K, Parameswaran Raman, Vijay Ramakrishnan

CS 7641 A (Machine Learning) Sethuraman K, Parameswaran Raman, Vijay Ramakrishnan CS 7641 A (Machine Learning) Sethuraman K, Parameswaran Raman, Vijay Ramakrishnan Scenario 1: Team 1 scored 200 runs from their 50 overs, and then Team 2 reaches 146 for the loss of two wickets from their

More information

A) The linear correlation is weak, and the two variables vary in the same direction.

A) The linear correlation is weak, and the two variables vary in the same direction. 1 Which of the following is NOT affected b outliers in a data set? A) Mean C) Range B) Mode D) Standard deviation 2 The following scatter plot represents a two-variable statistical distribution. Which

More information

A computer program that improves its performance at some task through experience.

A computer program that improves its performance at some task through experience. 1 A computer program that improves its performance at some task through experience. 2 Example: Learn to Diagnose Patients T: Diagnose tumors from images P: Percent of patients correctly diagnosed E: Pre

More information

Environmental Science: An Indian Journal

Environmental Science: An Indian Journal Environmental Science: An Indian Journal Research Vol 14 Iss 1 Flow Pattern and Liquid Holdup Prediction in Multiphase Flow by Machine Learning Approach Chandrasekaran S *, Kumar S Petroleum Engineering

More information

Name May 3, 2007 Math Probability and Statistics

Name May 3, 2007 Math Probability and Statistics Name May 3, 2007 Math 341 - Probability and Statistics Long Exam IV Instructions: Please include all relevant work to get full credit. Encircle your final answers. 1. An article in Professional Geographer

More information

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Wenbing Zhao. Department of Electrical and Computer Engineering

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Wenbing Zhao. Department of Electrical and Computer Engineering EEC 686/785 Modeling & Performance Evaluation of Computer Systems Lecture 6 Department of Electrical and Computer Engineering Cleveland State University wenbing@ieee.org Outline 2 Review of lecture 5 The

More information

Outline. Terminology. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Steps in Capacity Planning and Management

Outline. Terminology. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Steps in Capacity Planning and Management EEC 686/785 Modeling & Performance Evaluation of Computer Systems Lecture 6 Department of Electrical and Computer Engineering Cleveland State University wenbing@ieee.org Outline Review of lecture 5 The

More information

Chapter 12 Practice Test

Chapter 12 Practice Test Chapter 12 Practice Test 1. Which of the following is not one of the conditions that must be satisfied in order to perform inference about the slope of a least-squares regression line? (a) For each value

More information

Decision Trees. Nicholas Ruozzi University of Texas at Dallas. Based on the slides of Vibhav Gogate and David Sontag

Decision Trees. Nicholas Ruozzi University of Texas at Dallas. Based on the slides of Vibhav Gogate and David Sontag Decision Trees Nicholas Ruozzi University of Texas at Dallas Based on the slides of Vibhav Gogate and David Sontag Announcements Course TA: Hao Xiong Office hours: Friday 2pm-4pm in ECSS2.104A1 First homework

More information

Evaluating and Classifying NBA Free Agents

Evaluating and Classifying NBA Free Agents Evaluating and Classifying NBA Free Agents Shanwei Yan In this project, I applied machine learning techniques to perform multiclass classification on free agents by using game statistics, which is useful

More information

Announcements. % College graduate vs. % Hispanic in LA. % College educated vs. % Hispanic in LA. Problem Set 10 Due Wednesday.

Announcements. % College graduate vs. % Hispanic in LA. % College educated vs. % Hispanic in LA. Problem Set 10 Due Wednesday. Announcements Announcements UNIT 7: MULTIPLE LINEAR REGRESSION LECTURE 1: INTRODUCTION TO MLR STATISTICS 101 Problem Set 10 Due Wednesday Nicole Dalzell June 15, 2015 Statistics 101 (Nicole Dalzell) U7

More information

Universal Style Transfer via Feature Transforms

Universal Style Transfer via Feature Transforms Universal Style Transfer via Feature Transforms Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang UC Merced, Adobe Research, NVIDIA Research Presented: Dong Wang (Refer to slides by

More information

intended velocity ( u k arm movements

intended velocity ( u k arm movements Fig. A Complete Brain-Machine Interface B Human Subjects Closed-Loop Simulator ensemble action potentials (n k ) ensemble action potentials (n k ) primary motor cortex simulated primary motor cortex neuroprosthetic

More information

Pairwise Comparison Models: A Two-Tiered Approach to Predicting Wins and Losses for NBA Games

Pairwise Comparison Models: A Two-Tiered Approach to Predicting Wins and Losses for NBA Games Pairwise Comparison Models: A Two-Tiered Approach to Predicting Wins and Losses for NBA Games Tony Liu Introduction The broad aim of this project is to use the Bradley Terry pairwise comparison model as

More information

Influence of Forecasting Factors and Methods or Bullwhip Effect and Order Rate Variance Ratio in the Two Stage Supply Chain-A Case Study

Influence of Forecasting Factors and Methods or Bullwhip Effect and Order Rate Variance Ratio in the Two Stage Supply Chain-A Case Study International Journal of Engineering and Technical Research (IJETR) ISSN: 31-0869 (O) 454-4698 (P), Volume-4, Issue-1, January 016 Influence of Forecasting Factors and Methods or Bullwhip Effect and Order

More information

y ) s x x )(y i (x i r = 1 n 1 s y Statistics Lecture 7 Exploring Data , y 2 ,y n (x 1 ),,(x n ),(x 2 ,y 1 How two variables vary together

y ) s x x )(y i (x i r = 1 n 1 s y Statistics Lecture 7 Exploring Data , y 2 ,y n (x 1 ),,(x n ),(x 2 ,y 1 How two variables vary together Statistics 111 - Lecture 7 Exploring Data Numerical Summaries for Relationships between Variables Administrative Notes Homework 1 due in recitation: Friday, Feb. 5 Homework 2 now posted on course website:

More information

Predicting Season-Long Baseball Statistics. By: Brandon Liu and Bryan McLellan

Predicting Season-Long Baseball Statistics. By: Brandon Liu and Bryan McLellan Stanford CS 221 Predicting Season-Long Baseball Statistics By: Brandon Liu and Bryan McLellan Task Definition Though handwritten baseball scorecards have become obsolete, baseball is at its core a statistical

More information

Efficiency Wages in Major League Baseball Starting. Pitchers Greg Madonia

Efficiency Wages in Major League Baseball Starting. Pitchers Greg Madonia Efficiency Wages in Major League Baseball Starting Pitchers 1998-2001 Greg Madonia Statement of Problem Free agency has existed in Major League Baseball (MLB) since 1974. This is a mechanism that allows

More information

Announcements. Lecture 19: Inference for SLR & Transformations. Online quiz 7 - commonly missed questions

Announcements. Lecture 19: Inference for SLR & Transformations. Online quiz 7 - commonly missed questions Announcements Announcements Lecture 19: Inference for SLR & Statistics 101 Mine Çetinkaya-Rundel April 3, 2012 HW 7 due Thursday. Correlation guessing game - ends on April 12 at noon. Winner will be announced

More information

Neural Networks II. Chen Gao. Virginia Tech Spring 2019 ECE-5424G / CS-5824

Neural Networks II. Chen Gao. Virginia Tech Spring 2019 ECE-5424G / CS-5824 Neural Networks II Chen Gao ECE-5424G / CS-5824 Virginia Tech Spring 2019 Neural Networks Origins: Algorithms that try to mimic the brain. What is this? A single neuron in the brain Input Output Slide

More information

Predicting the Total Number of Points Scored in NFL Games

Predicting the Total Number of Points Scored in NFL Games Predicting the Total Number of Points Scored in NFL Games Max Flores (mflores7@stanford.edu), Ajay Sohmshetty (ajay14@stanford.edu) CS 229 Fall 2014 1 Introduction Predicting the outcome of National Football

More information

B. AA228/CS238 Component

B. AA228/CS238 Component Abstract Two supervised learning methods, one employing logistic classification and another employing an artificial neural network, are used to predict the outcome of baseball postseason series, given

More information

Navigate to the golf data folder and make it your working directory. Load the data by typing

Navigate to the golf data folder and make it your working directory. Load the data by typing Golf Analysis 1.1 Introduction In a round, golfers have a number of choices to make. For a particular shot, is it better to use the longest club available to try to reach the green, or would it be better

More information

How Do Injuries in the NFL Affect the Outcome of the Game

How Do Injuries in the NFL Affect the Outcome of the Game How Do Injuries in the NFL Affect the Outcome of the Game Andy Sun STATS 50 2014 Motivation NFL injury surveillance shows upward trend in both the number and severity of injuries Packers won 2010 Super

More information

CSC242: Intro to AI. Lecture 21

CSC242: Intro to AI. Lecture 21 CSC242: Intro to AI Lecture 21 Quiz Stop Time: 2:15 Learning (from Examples) Learning Learning gives computers the ability to learn without being explicitly programmed (Samuel, 1959)... agents that can

More information

Matrix-analog measure-cerrelatepredict

Matrix-analog measure-cerrelatepredict Matrix-analog measure-cerrelatepredict approach ICEM 2015 22-26 June 2015, Boulder David Hanslian Institute of Atmospheric Physics AS CR "Measure-correlate-predict" (MCP) = methods to estimate long-term

More information

Basketball field goal percentage prediction model research and application based on BP neural network

Basketball field goal percentage prediction model research and application based on BP neural network ISSN : 0974-7435 Volume 10 Issue 4 BTAIJ, 10(4), 2014 [819-823] Basketball field goal percentage prediction model research and application based on BP neural network Jijun Guo Department of Physical Education,

More information

Inferring land use from mobile phone activity

Inferring land use from mobile phone activity Inferring land use from mobile phone activity Jameson L. Toole (MIT) Michael Ulm (AIT) Dietmar Bauer (AIT) Marta C. Gonzalez (MIT) UrbComp 2012 Beijing, China 1 The Big Questions Can land use be predicted

More information

Legendre et al Appendices and Supplements, p. 1

Legendre et al Appendices and Supplements, p. 1 Legendre et al. 2010 Appendices and Supplements, p. 1 Appendices and Supplement to: Legendre, P., M. De Cáceres, and D. Borcard. 2010. Community surveys through space and time: testing the space-time interaction

More information

Introduction to Pattern Recognition

Introduction to Pattern Recognition Introduction to Pattern Recognition Jason Corso SUNY at Buffalo 19 January 2011 J. Corso (SUNY at Buffalo) Introduction to Pattern Recognition 19 January 2011 1 / 32 Examples of Pattern Recognition in

More information

Machine Learning Methods for Climbing Route Classification

Machine Learning Methods for Climbing Route Classification Machine Learning Methods for Climbing Route Classification Alejandro Dobles Mathematics adobles@stanford.edu Juan Carlos Sarmiento Management Science & Engineering jcs10@stanford.edu Abstract Peter Satterthwaite

More information

Section I: Multiple Choice Select the best answer for each problem.

Section I: Multiple Choice Select the best answer for each problem. Inference for Linear Regression Review Section I: Multiple Choice Select the best answer for each problem. 1. Which of the following is NOT one of the conditions that must be satisfied in order to perform

More information

Naïve Bayes. Robot Image Credit: Viktoriya Sukhanova 123RF.com

Naïve Bayes. Robot Image Credit: Viktoriya Sukhanova 123RF.com Naïve Bayes These slides were assembled by Eric Eaton, with grateful acknowledgement of the many others who made their course materials freely available online. Feel free to reuse or adapt these slides

More information

Two Machine Learning Approaches to Understand the NBA Data

Two Machine Learning Approaches to Understand the NBA Data Two Machine Learning Approaches to Understand the NBA Data Panagiotis Lolas December 14, 2017 1 Introduction In this project, I consider applications of machine learning in the analysis of nba data. To

More information

GALLUP NEWS SERVICE 2018 MIDTERM ELECTION

GALLUP NEWS SERVICE 2018 MIDTERM ELECTION GALLUP NEWS SERVICE 2018 MIDTERM ELECTION Results are based on telephone interviews with a random sample of 1,508 -- national adults, aged 18+, living in all 50 states and the District of Columbia, conducted

More information

E STIMATING KILN SCHEDULES FOR TROPICAL AND TEMPERATE HARDWOODS USING SPECIFIC GRAVITY

E STIMATING KILN SCHEDULES FOR TROPICAL AND TEMPERATE HARDWOODS USING SPECIFIC GRAVITY P ROCESSES E STIMATING KILN SCHEDULES FOR TROPICAL AND TEMPERATE HARDWOODS USING SPECIFIC GRAVITY W ILLIAM T. SIMPSON S TEVE P. VERRILL A BSTRACT Dry-kiln schedules have been developed for many hardwood

More information

Acquisition and prediction of wave surface by marine radar for the safety of small ships

Acquisition and prediction of wave surface by marine radar for the safety of small ships Proceedings of the 16 th International Ship Stabilit Workshop, 5-7 June 2017, Belgrade, Serbia 1 Acquisition and prediction of wave surface b marine radar for the safet of small ships Hironori Susaki,

More information

JPEG-Compatibility Steganalysis Using Block-Histogram of Recompression Artifacts

JPEG-Compatibility Steganalysis Using Block-Histogram of Recompression Artifacts JPEG-Compatibility Steganalysis Using Block-Histogram of Recompression Artifacts Jan Kodovský, Jessica Fridrich May 16, 2012 / IH Conference 1 / 19 What is JPEG-compatibility steganalysis? Detects embedding

More information

The Economic Factors Analysis in Olympic Game

The Economic Factors Analysis in Olympic Game ISSN 0- (print) International Journal of Sports Science and Engineering Vol. 0 (0) No. 0, pp. - The Economic Factors Analsis in Olmpic Game Yong Jiang, Tingting Ma, Zhe Huang Facult of Mathematics and

More information

Naïve Bayes. Robot Image Credit: Viktoriya Sukhanova 123RF.com

Naïve Bayes. Robot Image Credit: Viktoriya Sukhanova 123RF.com Naïve Bayes These slides were assembled by Byron Boots, with only minor modifications from Eric Eaton s slides and grateful acknowledgement to the many others who made their course materials freely available

More information

Lecture 39: Training Neural Networks (Cont d)

Lecture 39: Training Neural Networks (Cont d) Lecture 39: Training Neural Networks (Cont d) CS 4670/5670 Sean Bell Strawberry Goblet Throne (Side Note for PA5) AlexNet: 1 vs 2 parts Caffe represents caffe like the above image, but computes as if it

More information

Performance of Fully Automated 3D Cracking Survey with Pixel Accuracy based on Deep Learning

Performance of Fully Automated 3D Cracking Survey with Pixel Accuracy based on Deep Learning Performance of Fully Automated 3D Cracking Survey with Pixel Accuracy based on Deep Learning Kelvin C.P. Wang Oklahoma State University and WayLink Systems Corp. 2017-10-19, Copenhagen, Denmark European

More information

CAAD CTF 2018 Rules June 21, 2018 Version 1.1

CAAD CTF 2018 Rules June 21, 2018 Version 1.1 CAAD CTF 2018 Rules June 21, 2018 Version 1.1 The organizer will invite 5 teams to participate CAAD CTF 2018. We will have it in Las Vegas on Aug. 10 th, 2018. The rules details are below: 1. Each team

More information

A Machine Learning Approach to Predicting Winning Patterns in Track Cycling Omnium

A Machine Learning Approach to Predicting Winning Patterns in Track Cycling Omnium A Machine Learning Approach to Predicting Winning Patterns in Track Cycling Omnium Bahadorreza Ofoghi 1,2, John Zeleznikow 1, Clare MacMahon 1,andDanDwyer 2 1 Victoria University, Melbourne VIC 3000, Australia

More information

Figure 1 Location of the ANDRILL SMS 2006 mooring site labeled ADCP1 above.

Figure 1 Location of the ANDRILL SMS 2006 mooring site labeled ADCP1 above. ANDRILL McMurdo Sound Tidal Current Analysis Richard Limeburner, Robert Beardsley and Sean Whelan Department of Physical Oceanography Woods Hole Oceanographic Institution Woods Hole, MA 02543 rlimeburner@whoi.edu

More information

Predicting Horse Racing Results with TensorFlow

Predicting Horse Racing Results with TensorFlow Predicting Horse Racing Results with TensorFlow LYU 1703 LIU YIDE WANG ZUOYANG News CUHK Professor, Gu Mingao, wins 50 MILLIONS dividend using his sure-win statistical strategy. News AlphaGO defeats human

More information

CSE 190a Project Report: Golf Club Head Tracking

CSE 190a Project Report: Golf Club Head Tracking CSE 190a Project Report: Golf Club Head Tracking Ravi Chugh rchugh@cs.ucsd.edu Krystle de Mesa kdemesa@cs.ucsd.edu Abstract Computer vision and graphics technologies have been used extensively in developing

More information

Analysis of Variance. Copyright 2014 Pearson Education, Inc.

Analysis of Variance. Copyright 2014 Pearson Education, Inc. Analysis of Variance 12-1 Learning Outcomes Outcome 1. Understand the basic logic of analysis of variance. Outcome 2. Perform a hypothesis test for a single-factor design using analysis of variance manually

More information

Machine Learning Application in Aviation Safety

Machine Learning Application in Aviation Safety Machine Learning Application in Aviation Safety Surface Safety Metric MOR Classification Presented to: By: Date: ART Firdu Bati, PhD, FAA September, 2018 Agenda Surface Safety Metric (SSM) development

More information

FREEWAY WORK ZONE SPEED MODEL DOCUMENTATION

FREEWAY WORK ZONE SPEED MODEL DOCUMENTATION APPENDIX B FREEWAY WORK ZONE SPEED MODEL DOCUMENTATION B-1 APPENDIX B FREEWAY WORK ZONE SPEED MODEL DOCUMENTATION B.1 INTRODUCTION This software can be used for predicting the speed of vehicles traveling

More information

An Empirical Comparison of Regression Analysis Strategies with Discrete Ordinal Variables

An Empirical Comparison of Regression Analysis Strategies with Discrete Ordinal Variables Kromrey & Rendina-Gobioff An Empirical Comparison of Regression Analysis Strategies with Discrete Ordinal Variables Jeffrey D. Kromrey Gianna Rendina-Gobioff University of South Florida The Type I error

More information

Predicting the NCAA Men s Basketball Tournament with Machine Learning

Predicting the NCAA Men s Basketball Tournament with Machine Learning Predicting the NCAA Men s Basketball Tournament with Machine Learning Andrew Levandoski and Jonathan Lobo CS 2750: Machine Learning Dr. Kovashka 25 April 2017 Abstract As the popularity of the NCAA Men

More information

July 2015 Sept Cork City Pedestrian Counter Report

July 2015 Sept Cork City Pedestrian Counter Report July 2015 Sept 2015 Cork City Pedestrian Counter Report Disclaimer Cork City Council has produced this report based on information extracted from our Pedestrian Counter Monitoring System. Although care

More information

An Investigation of Freeway Capacity Before and During Incidents

An Investigation of Freeway Capacity Before and During Incidents An Investigation of Freeway Capacity Before and During Incidents Cuie Lu and Lily Elefteriadou Department of Civil and Coastal Engineering University of Florida March 4, 2011 Outline Database and Analysis

More information

Dynamic validation of Globwave SAR wave spectra data using an observation-based swell model. R. Husson and F. Collard

Dynamic validation of Globwave SAR wave spectra data using an observation-based swell model. R. Husson and F. Collard Dynamic validation of Globwave SAR wave spectra data using an observation-based swell model. R. Husson and F. Collard Context 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992

More information

One Way ANOVA (Analysis of Variance)

One Way ANOVA (Analysis of Variance) One Wa ANOVA (Analsis of Variance) The one-wa analsis of variance (ANOVA) is used to determine whether there are an significant differences between the means of two or more independent (unrelated) groups

More information

COMPLETING THE RESULTS OF THE 2013 BOSTON MARATHON

COMPLETING THE RESULTS OF THE 2013 BOSTON MARATHON COMPLETING THE RESULTS OF THE 2013 BOSTON MARATHON Dorit Hammerling 1, Matthew Cefalu 2, Jessi Cisewski 3, Francesca Dominici 2, Giovanni Parmigiani 2,4, Charles Paulson 5, Richard Smith 1,6 1 Statistical

More information

Running head: DATA ANALYSIS AND INTERPRETATION 1

Running head: DATA ANALYSIS AND INTERPRETATION 1 Running head: DATA ANALYSIS AND INTERPRETATION 1 Data Analysis and Interpretation Final Project Vernon Tilly Jr. University of Central Oklahoma DATA ANALYSIS AND INTERPRETATION 2 Owners of the various

More information

GALLUP NEWS SERVICE GALLUP POLL SOCIAL SERIES: WORLD AFFAIRS

GALLUP NEWS SERVICE GALLUP POLL SOCIAL SERIES: WORLD AFFAIRS GALLUP NEWS SERVICE GALLUP POLL SOCIAL SERIES: WORLD AFFAIRS -- FINAL TOPLINE -- Timberline: 937008 IS: 954 Princeton Job #: 17-02-002 Jeff Jones, Lydia Saad February 1-5, 2017 Results are based on telephone

More information

Computer Practical: Gaussian Plume Model Paul Connolly, October 2017

Computer Practical: Gaussian Plume Model Paul Connolly, October 2017 1 Overview Computer Practical: Gaussian Plume Model Paul Connoll, October 2017 In this handout we look at the problem of advection and turbulent diffusion of material from a point source, such as a industrial

More information

Player Availability Rating (PAR) - A Tool for Quantifying Skater Performance for NHL General Managers

Player Availability Rating (PAR) - A Tool for Quantifying Skater Performance for NHL General Managers Player Availability Rating (PAR) - A Tool for Quantifying Skater Performance for NHL General Managers Shuja Khalid 1 1 Department of Computer Science, University of Toronto arxiv:1811.02885v1 [cs.cy] 15

More information

Lab 11: Introduction to Linear Regression

Lab 11: Introduction to Linear Regression Lab 11: Introduction to Linear Regression Batter up The movie Moneyball focuses on the quest for the secret of success in baseball. It follows a low-budget team, the Oakland Athletics, who believed that

More information

Guide to Computing Minitab commands used in labs (mtbcode.out)

Guide to Computing Minitab commands used in labs (mtbcode.out) Guide to Computing Minitab commands used in labs (mtbcode.out) A full listing of Minitab commands can be found by invoking the HELP command while running Minitab. A reference card, with listing of available

More information

ISDS 4141 Sample Data Mining Work. Tool Used: SAS Enterprise Guide

ISDS 4141 Sample Data Mining Work. Tool Used: SAS Enterprise Guide ISDS 4141 Sample Data Mining Work Taylor C. Veillon Tool Used: SAS Enterprise Guide You may have seen the movie, Moneyball, about the Oakland A s baseball team and general manager, Billy Beane, who focused

More information

Anabela Brandão and Doug S. Butterworth

Anabela Brandão and Doug S. Butterworth Obtaining a standardised CPUE series for toothfish (Dissostichus eleginoides) in the Prince Edward Islands EEZ calibrated to incorporate both longline and trotline data over the period 1997-2013 Anabela

More information

What Causes the Favorite-Longshot Bias? Further Evidence from Tennis

What Causes the Favorite-Longshot Bias? Further Evidence from Tennis MPRA Munich Personal RePEc Archive What Causes the Favorite-Longshot Bias? Further Evidence from Tennis Jiri Lahvicka 30. June 2013 Online at http://mpra.ub.uni-muenchen.de/47905/ MPRA Paper No. 47905,

More information

GALLUP NEWS SERVICE GALLUP POLL SOCIAL SERIES: WORLD AFFAIRS

GALLUP NEWS SERVICE GALLUP POLL SOCIAL SERIES: WORLD AFFAIRS GALLUP NEWS SERVICE GALLUP POLL SOCIAL SERIES: WORLD AFFAIRS -- FINAL TOPLINE -- Timberline: 937008 JT: 165 Princeton Job #: 18-02-002 Jeff Jones, Lydia Saad February 1-10, 2018 Results are based on telephone

More information

Single-step genomic BLUP for national beef cattle evaluation in US:

Single-step genomic BLUP for national beef cattle evaluation in US: Single-step genomic BLUP for national beef cattle evaluation in US: from initial developments to final implementation Daniela Lourenco S. Tsuruta, B.O. Fragomeni, Y. Masuda, I. Aguilar A. Legarra, S. Miller,

More information

Visual Traffic Jam Analysis Based on Trajectory Data

Visual Traffic Jam Analysis Based on Trajectory Data Visual Traffic Jam Analysis Based on Trajectory Data Zuchao Wang, Min Lu, Xiaoru Yuan, Peking University Junping Zhang, Fudan University Huub van de Wetering, Technische Universiteit Eindhoven Introduction

More information

ARTIFICIAL NEURAL NETWORK BASED DESIGN FOR DUAL LATERAL WELL APPLICATIONS

ARTIFICIAL NEURAL NETWORK BASED DESIGN FOR DUAL LATERAL WELL APPLICATIONS The Pennsylvania State University the Graduate School Department of Energy and Mineral Engineering ARTIFICIAL NEURAL NETWORK BASED DESIGN FOR DUAL LATERAL WELL APPLICATIONS Thesis in Energy and Mineral

More information

GALLUP NEWS SERVICE GALLUP POLL SOCIAL SERIES: WORK AND EDUCATION

GALLUP NEWS SERVICE GALLUP POLL SOCIAL SERIES: WORK AND EDUCATION GALLUP NEWS SERVICE GALLUP POLL SOCIAL SERIES: WORK AND EDUCATION -- FINAL TOPLINE -- Timberline: 937008 IS: 786 Princeton Job #: 16-08-012 Jeff Jones, Lydia Saad August 3-7, 2016 Results are based on

More information

Neural Network in Computer Vision for RoboCup Middle Size League

Neural Network in Computer Vision for RoboCup Middle Size League Journal of Software Engineering and Applications, 2016, *,** Neural Network in Computer Vision for RoboCup Middle Size League Paulo Rogério de Almeida Ribeiro 1, Gil Lopes 1, Fernando Ribeiro 1 1 Department

More information

Habit Formation in Voting: Evidence from Rainy Elections Thomas Fujiwara, Kyle Meng, and Tom Vogl ONLINE APPENDIX

Habit Formation in Voting: Evidence from Rainy Elections Thomas Fujiwara, Kyle Meng, and Tom Vogl ONLINE APPENDIX Habit Formation in Voting: Evidence from Rainy Elections Thomas Fujiwara, Kyle Meng, and Tom Vogl ONLINE APPENDIX Figure A1: Share of Counties with Election-Day Rainfall by Year Share of counties with

More information

Machine Learning an American Pastime

Machine Learning an American Pastime Nikhil Bhargava, Andy Fang, Peter Tseng CS 229 Paper Machine Learning an American Pastime I. Introduction Baseball has been a popular American sport that has steadily gained worldwide appreciation in the

More information

Grade 6 Math Circles Fall October 7/8 Statistics

Grade 6 Math Circles Fall October 7/8 Statistics Faculty of Mathematics Waterloo, Ontario Centre for Education in Mathematics and Computing Grade 6 Math Circles Fall 2014 - October 7/8 Statistics Statistics (or Stats) is a branch of math that deals with

More information

RELATIONSHIP BETWEEN CONGESTION AND TRAFFIC ACCIDENTS ON EXPRESSWAYS AN INVESTIGATION WITH BAYESIAN BELIEF NETWORKS

RELATIONSHIP BETWEEN CONGESTION AND TRAFFIC ACCIDENTS ON EXPRESSWAYS AN INVESTIGATION WITH BAYESIAN BELIEF NETWORKS RELATIONSHIP BETWEEN CONGESTION AND TRAIC ACCIDENTS ON EXPRESSWAYS AN INESTIGATION WITH BAYESIAN BELIEF NETWORKS By Charitha Dias**, Marc Miska***, Masao Kuwahara****, and Hiroshi Warita***** 1. Introduction

More information

Application of Bayesian Networks to Shopping Assistance

Application of Bayesian Networks to Shopping Assistance Application of Bayesian Networks to Shopping Assistance Yang Xiang, Chenwen Ye, and Deborah Ann Stacey University of Guelph, CANADA Abstract. We develop an on-line shopping assistant that can help a e-shopper

More information

Modeling Salmon Behavior on the Umpqua River. By Scott Jordan 6/2/2015

Modeling Salmon Behavior on the Umpqua River. By Scott Jordan 6/2/2015 Modeling Salmon Behavior on the Umpqua River By Scott Jordan 6/2/2015 1 Importance of Salmon Delicious Recreation 631,000 people in Oregon went fishing in 2008 spent $264.6 Million on fishing trips Commercial

More information

Staking plans in sports betting under unknown true probabilities of the event

Staking plans in sports betting under unknown true probabilities of the event Staking plans in sports betting under unknown true probabilities of the event Andrés Barge-Gil 1 1 Department of Economic Analysis, Universidad Complutense de Madrid, Spain June 15, 2018 Abstract Kelly

More information

An Assessment of Quality in Underwater Archaeological Surveys Using Tape Measurements

An Assessment of Quality in Underwater Archaeological Surveys Using Tape Measurements An Assessment of Quality in Underwater Archaeological Surveys Using Tape Measurements Peter Holt 3H Consulting Ltd, 6 Honcray, Oreston, Plymouth, Devon, PL9 9XB The quality of a typical underwater archaeological

More information

Cycling Volume Estimation Methods for Safety Analysis

Cycling Volume Estimation Methods for Safety Analysis Cycling Volume Estimation Methods for Safety Analysis XI ICTCT extra Workshop in Vancouver, Canada Session: Methods and Simulation Date: March, 01 The Highway Safety Manual (HSM) documents many safety

More information

a) List and define all assumptions for multiple OLS regression. These are all listed in section 6.5

a) List and define all assumptions for multiple OLS regression. These are all listed in section 6.5 Prof. C. M. Dalton ECN 209A Spring 2015 Practice Problems (After HW1, HW2, before HW3) CORRECTED VERSION Question 1. Draw and describe a relationship with heteroskedastic errors. Support your claim with

More information