Naïve Bayes. Robot Image Credit: Viktoriya Sukhanova 123RF.com

Size: px
Start display at page:

Download "Naïve Bayes. Robot Image Credit: Viktoriya Sukhanova 123RF.com"

Transcription

1 Naïve Bayes These slides were assembled by Eric Eaton, with grateful acknowledgement of the many others who made their course materials freely available online. Feel free to reuse or adapt these slides for your own academic purposes, provided that you include proper attribution. Please send comments and corrections to Eric. Robot Image Credit: Viktoriya Sukhanova 123RF.com

2 Essential Probability Concepts Marginalization: P (B) = X v2values(a) P (B ^ A = v) Conditional Probability: Bayes Rule: Independence: P (A B) = P (A B) = P (A ^ B) P (B) ^ P (B A) P (A) P (B) A? B $ P (A ^ B) =P (A) P (B) $ P (A B) =P (A) A? B C $ P (A ^ B C) =P (A C) P (B C) 2

3 Density Estimation 3

4 Recall the Joint Distribution... alarm alarm earthquake earthquake earthquake earthquake burglary burglary

5 How Can We Obtain a Joint Distribution? Option 1: Elicit it from an expert human Option 2: Build it up from simpler probabilistic facts e.g, if we knew P(a) = 0.7 P(b a) = 0.2 P(b a) = 0.1 then, we could compute P(a Ù b) Option 3: Learn it from data... Based on slide by Andrew Moore 5

6 Learning a Joint Distribution Step 1: Step 2: Build a JD table for your Then, fill in each row with: attributes in which the recordsmatching row probabilities are unspecified Pˆ (row) = total number of records A B C Prob 0 0 0? 0 0 1? 0 1 0? 0 1 1? 1 0 0? 1 0 1? 1 1 0? 1 1 1? A B C Prob Slide Andrew Moore Fraction of all records in which A and B are true but C is false

7 Example of Learning a Joint PD This Joint PD was obtained by learning from three attributes in the UCI Adult Census Database [Kohavi 1995] Slide Andrew Moore

8 Density Estimation Our joint distribution learner is an example of something called Density Estimation A Density Estimator learns a mapping from a set of attributes to a probability Input Attributes Density Estimator Probability Slide Andrew Moore

9 Density Estimation Compare it against the two other major kinds of models: Input Attributes Input Attributes Input Attributes Classifier Density Estimator Regressor Prediction of categorical output Probability Prediction of real- valued output Slide Andrew Moore

10 Evaluating Density Estimation Test- set criterion for estimating performance on future data Input Attributes Classifier Prediction of categorical output Test set Accuracy Input Attributes Density Estimator Probability? Input Attributes Regressor Prediction of real- valued output Test set Accuracy Slide Andrew Moore

11 Evaluating a Density Estimator Given a record x, a density estimator M can tell you how likely the record is: ˆP (x M) The density estimator can also tell you how likely the dataset is: Under the assumption that all records were independently generated from the Density Estimator s JD (that is, i.i.d.) ˆP (x 1 ^ x 2 ^...^ x n M) = ny ˆP (x i M) dataset i=1 Slide by Andrew Moore

12 Example Small Dataset: Miles Per Gallon From the UCI repository (thanks to Ross Quinlan) 192 records in the training set mpg modelyear maker Slide by Andrew Moore good 75to78 asia bad 70to74 america bad 75to78 europe bad 70to74 america bad 70to74 america bad 70to74 asia bad 70to74 asia bad 75to78 america : : : : : : : : : bad 70to74 america good 79to83 america bad 75to78 america good 79to83 america bad 75to78 america good 79to83 america good 79to83 america bad 70to74 america good 75to78 europe bad 75to78 europe

13 Example Small Dataset: Miles Per Gallon From the UCI repository (thanks to Ross Quinlan) 192 records in the training set mpg modelyear maker Slide by Andrew Moore good 75to78 asia bad 70to74 america bad 75to78 europe bad 70to74 america bad 70to74 america bad 70to74 asia bad 70to74 asia bad 75to78 america : : : : : : : : : bad 70to74 america good 79to83 america bad 75to78 america good 79to83 america bad 75to78 america good 79to83 america good 79to83 america bad 70to74 america good 75to78 europe bad 75to78 europe ˆP (dataset M) = ny i=1 ˆP (x i M) = (in this case)

14 Log Probabilities For decent sized data sets, this product will underflow ny ˆP (dataset M) = ˆP (x i M) i=1 Therefore, since probabilities of datasets get so small, we usually use log probabilities log ˆP ny nx (dataset M) = log ˆP (x i M) = log ˆP (x i M) i=1 i=1 Based on slide by Andrew Moore

15 Example Small Dataset: Miles Per Gallon From the UCI repository (thanks to Ross Quinlan) 192 records in the training set mpg modelyear maker Slide by Andrew Moore good 75to78 asia bad 70to74 america bad 75to78 europe bad 70to74 america bad 70to74 america bad 70to74 asia bad 70to74 asia bad 75to78 america : : : : : : : : : bad 70to74 america good 79to83 america bad 75to78 america good 79to83 america bad 75to78 america good 79to83 america good 79to83 america bad 70to74 america good 75to78 europe bad 75to78 europe log ˆP (dataset M) = nx i=1 log ˆP (x i M) = (in this case)

16 Pros/Cons of the Joint Density Estimator The Good News: We can learn a Density Estimator from data. Density estimators can do many good things Can sort the records by probability, and thus spot weird records (anomaly detection) Can do inference Ingredient for Bayes Classifiers (coming very soon...) The Bad News: Density estimation by directly learning the joint is impractical, may result in adverse behavior Slide by Andrew Moore

17 Slide by Christopher Bishop Curse of Dimensionality

18 The Joint Density Estimator on a Test Set An independent test set with 196 cars has a much worse log- likelihood Actually it s a billion quintillion quintillion quintillion quintillion times less likely Density estimators can overfit......and the full joint density estimator is the overfittiest of them all! Slide by Andrew Moore

19 Overfitting Density Estimators If this ever happens, the joint PDE learns there are certain combinations that are impossible log ˆP nx (dataset M) = log ˆP (x i M) i=1 = 1 if for any i, ˆP (xi M) =0 Copyright Andrew W. Moore Slide by Andrew Moore

20 The Joint Density Estimator on a Test Set The only reason that the test set didn t score - is that the code was hard- wired to always predict a probability of at least 1/10 20 Slide by Andrew Moore

21 The Naïve Bayes Classifier 21

22 Recall Baye s Rule: P (hypothesis evidence) = Bayes Rule Equivalently, we can write: where X is a random variable representing the evidence and Y is a random variable for the label This is actually short for: P (evidence hypothesis) P (hypothesis) P (evidence) P (Y = y k X = x i )= P (Y = y k)p (X = x i Y = y k ) P (X = x i ) P (Y = y k X = x i )= P (Y = y k)p (X 1 = x i,1 ^...^ X d = x i,d Y = y k ) P (X 1 = x i,1 ^...^ X d = x i,d ) where X j denotes the random variable for the j th feature 22

23 Naïve Bayes Classifier Idea: Use the training data to estimate P (X Y ) and P (Y ). Then, use Bayes rule to infer P (Y X new ) for new data Easy to estimate from data Impractical, but necessary P (Y = y k X = x i )= P (Y = y k)p (X 1 = x i,1 ^...^ X d = x i,d Y = y k ) P (X 1 = x i,1 ^...^ X d = x i,d ) Unnecessary, as it turns out Estimating the joint probability distribution P (X 1,X 2,...,X d Y ) is not practical 23

24 Naïve Bayes Classifier Problem: estimating the joint PD or CPD isn t practical Severely overfits, as we saw before However, if we make the assumption that the attributes are independent given the class label, estimation is easy! dy P (X 1,X 2,...,X d Y )= P (X j Y ) j=1 In other words, we assume all attributes are conditionally independent given Y Often this assumption is violated in practice, but more on that later 24

25 Training Naïve Bayes Estimate P (X j Y ) and P (Y ) directly from the training data by counting! Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) =? P( play) =? P(Sky = sunny play) =? P(Sky = sunny play) =? P(Humid = high play) =? P(Humid = high play) =?

26 Training Naïve Bayes Estimate P (X j Y ) and P (Y ) directly from the training data by counting! Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) =? P( play) =? P(Sky = sunny play) =? P(Sky = sunny play) =? P(Humid = high play) =? P(Humid = high play) =?

27 Training Naïve Bayes Estimate P (X j Y ) and P (Y ) directly from the training data by counting! Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) =? P(Sky = sunny play) =? P(Humid = high play) =? P(Humid = high play) =?

28 Training Naïve Bayes Estimate P (X j Y ) and P (Y ) directly from the training data by counting! Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) =? P(Sky = sunny play) =? P(Humid = high play) =? P(Humid = high play) =?

29 Training Naïve Bayes Estimate P (X j Y ) and P (Y ) directly from the training data by counting! Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) = 1 P(Sky = sunny play) =? P(Humid = high play) =? P(Humid = high play) =?

30 Training Naïve Bayes Estimate P (X j Y ) and P (Y ) directly from the training data by counting! Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) = 1 P(Sky = sunny play) =? P(Humid = high play) =? P(Humid = high play) =?

31 Training Naïve Bayes Estimate P (X j Y ) and P (Y ) directly from the training data by counting! Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) = 1 P(Sky = sunny play) = 0 P(Humid = high play) =? P(Humid = high play) =?

32 Training Naïve Bayes Estimate P (X j Y ) and P (Y ) directly from the training data by counting! Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) = 1 P(Sky = sunny play) = 0 P(Humid = high play) =? P(Humid = high play) =?

33 Training Naïve Bayes Estimate P (X j Y ) and P (Y ) directly from the training data by counting! Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) = 1 P(Sky = sunny play) = 0 P(Humid = high play) = 2/3 P(Humid = high play) =?

34 Training Naïve Bayes Estimate P (X j Y ) and P (Y ) directly from the training data by counting! Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) = 1 P(Sky = sunny play) = 0 P(Humid = high play) = 2/3 P(Humid = high play) =?

35 Training Naïve Bayes Estimate P (X j Y ) and P (Y ) directly from the training data by counting! Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) = 1 P(Sky = sunny play) = 0 P(Humid = high play) = 2/3 P(Humid = high play) =

36 Training Naïve Bayes Estimate P (X j Y ) and P (Y ) directly from the training data by counting! Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) = 1 P(Sky = sunny play) = 0 P(Humid = high play) = 2/3 P(Humid = high play) =

37 Laplace Smoothing Notice that some probabilities estimated by counting might be zero Possible overfitting! Fix by using Laplace smoothing: Adds 1 to each count P (X j = v Y = y k )= X c v +1 c v 0 + values(x j ) where v 0 2values(X j ) c v is the count of training instances with a value of v for attribute j and class label y k values(x j ) is the number of values X j can take on 37

38 Training Naïve Bayes with Laplace Smoothing Estimate P (X j Y ) and P (Y ) directly from the training data by counting with Laplace smoothing: Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) = 4/5 P(Sky = sunny play) =? P(Humid = high play) =? P(Humid = high play) =?

39 Training Naïve Bayes with Laplace Smoothing Estimate P (X j Y ) and P (Y ) directly from the training data by counting with Laplace smoothing: Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) = 4/5 P(Sky = sunny play) = 1/3 P(Humid = high play) =? P(Humid = high play) =?

40 Training Naïve Bayes with Laplace Smoothing Estimate P (X j Y ) and P (Y ) directly from the training data by counting with Laplace smoothing: Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) = 4/5 P(Sky = sunny play) = 1/3 P(Humid = high play) = 3/5 P(Humid = high play) =?

41 Training Naïve Bayes with Laplace Smoothing Estimate P (X j Y ) and P (Y ) directly from the training data by counting with Laplace smoothing: Sky Temp Humid Wind Water Forecast Play? sunny warm normal strong warm same yes sunny warm high strong warm same yes rainy cold high strong warm change no sunny warm high strong cool change yes P(play) = 3/4 P( play) = 1/4 P(Sky = sunny play) = 4/5 P(Sky = sunny play) = 1/3 P(Humid = high play) = 3/5 P(Humid = high play) = 2/

42 Using the Naïve Bayes Classifier Now, we have P (Y = y k X = x i )= P (Y = y k) Q d j=1 P (X j = x i,j Y = y k ) P (X = x i ) This is constant for a given instance, and so irrelevant to our prediction In practice, we use log- probabilities to prevent underflow To classify a new point x, h(x) = arg max y k P (Y = y k ) YdY P (X j = x j Y = y k ) j=1 = arg max y k log P (Y = y k )+ X j th attribute value of x dx log P (X j = x j Y = y k ) j=1 42

43 The Naïve Bayes Classifier Algorithm For each class label y k Estimate P(Y = y k ) from the data For each value x i,j of each attribute X i Estimate P(X i = x i,j Y = y k ) Classify a new point via: h(x) = arg max y k log P (Y = y k )+ dx log P (X j = x j Y = y k ) j=1 In practice, the independence assumption doesn t often hold true, but Naïve Bayes performs very well despite this 43

44 Computing Probabilities (Not Just Predicting Labels) NB classifier gives predictions, not probabilities, because we ignore P(X) (the denominator in Bayes rule) Can produce probabilities by: For each possible class label y k, compute dy P (Y = y k X = x) =P (Y = y k ) P (X j = x j Y = y k ) This is the numerator of Bayes rule, and is therefore off the true probability by a factor of α that makes probabilities sum to 1 α is given by = P #classes k=1 Class probability is given by j=1 1 P (Y = y k X = x) P (Y = y k X = x) = P (Y = y k X = x) 44

45 Naïve Bayes Applications Text classification Which e- mails are spam? Which e- mails are meeting notices? Which author wrote a document? Classifying mental states Learning P(BrainActivity WordCategory) Pairwise Classification Accuracy: 85% People Words Animal Words

46 The Naïve Bayes Graphical Model Labels (hypotheses) Y P(Y) Attributes (evidence) X 1 X i X d P(X 1 Y) P(X i Y) P(X d Y) Nodes denote random variables Edges denote dependency Each node has an associated conditional probability table (CPT), conditioned upon its parents 46

47 Example NB Graphical Model Data: Sky Temp Humid Play? sunny warm normal yes sunny warm high yes rainy cold high no sunny warm high yes Play Sky Temp Humid 47

48 Example NB Graphical Model Data: Sky Temp Humid Play? sunny warm normal yes sunny warm high yes rainy cold high no sunny warm high yes Play Play? yes no P(Play) Sky Temp Humid 48

49 Example NB Graphical Model Data: Sky Temp Humid Play? sunny warm normal yes sunny warm high yes rainy cold high no sunny warm high yes Play Play? P(Play) yes 3/4 no 1/4 Sky Temp Humid 49

50 Example NB Graphical Model Data: Sky Temp Humid Play? sunny warm normal yes sunny warm high yes rainy cold high no sunny warm high yes Play Play? P(Play) yes 3/4 no 1/4 Sky Temp Humid Sky sunny rainy sunny rainy Play? P(Sky Play) yes yes no no 50

51 Example NB Graphical Model Data: Sky Temp Humid Play? sunny warm normal yes sunny warm high yes rainy cold high no sunny warm high yes Play Play? P(Play) yes 3/4 no 1/4 Sky Temp Humid Sky Play? P(Sky Play) sunny yes 4/5 rainy yes 1/5 sunny no 1/3 rainy no 2/3 51

52 Example NB Graphical Model Data: Sky Temp Humid Play? sunny warm normal yes sunny warm high yes rainy cold high no sunny warm high yes Play Play? P(Play) yes 3/4 no 1/4 Sky Temp Humid Sky Play? P(Sky Play) sunny yes 4/5 rainy yes 1/5 sunny no 1/3 rainy no 2/3 Temp Play? P(Temp Play) warm yes cold yes warm no cold no 52

53 Example NB Graphical Model Data: Sky Temp Humid Play? sunny warm normal yes sunny warm high yes rainy cold high no sunny warm high yes Play Play? P(Play) yes 3/4 no 1/4 Sky Temp Humid Sky Play? P(Sky Play) sunny yes 4/5 rainy yes 1/5 sunny no 1/3 rainy no 2/3 Temp Play? P(Temp Play) warm yes 4/5 cold yes 1/5 warm no 1/3 cold no 2/3 53

54 Example NB Graphical Model Data: Sky Temp Humid Play? sunny warm normal yes sunny warm high yes rainy cold high no sunny warm high yes Play Play? P(Play) yes 3/4 no 1/4 Sky Temp Humid Sky Play? P(Sky Play) sunny yes 4/5 rainy yes 1/5 sunny no 1/3 rainy no 2/3 Temp Play? P(Temp Play) warm yes 4/5 cold yes 1/5 warm no 1/3 cold no 2/3 Humid Play? P(Humid Play) high yes norm yes high no norm no 54

55 Example NB Graphical Model Data: Sky Temp Humid Play? sunny warm normal yes sunny warm high yes rainy cold high no sunny warm high yes Play Play? P(Play) yes 3/4 no 1/4 Sky Temp Humid Sky Play? P(Sky Play) sunny yes 4/5 rainy yes 1/5 sunny no 1/3 rainy no 2/3 Temp Play? P(Temp Play) warm yes 4/5 cold yes 1/5 warm no 1/3 cold no 2/3 Humid Play? P(Humid Play) high yes 3/5 norm yes 2/5 high no 2/3 norm no 1/3 55

56 Example Using NB for Classification Sky Play Temp Humi d Play? P(Play) yes 3/4 no 1/4 Sky Play? P(Sky Play) sunny yes 4/5 rainy yes 1/5 sunny no 1/3 rainy no 2/3 Temp Play? P(Temp Play) warm yes 4/5 cold yes 1/5 warm no 1/3 cold no 2/3 Humid Play? P(Humid Play) high yes 3/5 norm yes 2/5 high no 2/3 norm no 1/3 h(x) = arg max y k log P (Y = y k )+ dx log P (X j = x j Y = y k ) j=1 Goal: Predict label for x = (rainy, warm, normal) 56

57 Example Using NB for Classification Sky Play Temp Predict label for: Humi d x = (rainy, warm, normal) Play? P(Play) yes 3/4 no 1/4 Sky Play? P(Sky Play) sunny yes 4/5 rainy yes 1/5 sunny no 1/3 rainy no 2/3 Temp Play? P(Temp Play) warm yes 4/5 cold yes 1/5 warm no 1/3 cold no 2/3 Humid Play? P(Humid Play) high yes 3/5 norm yes 2/5 high no 2/3 norm no 1/3 predict PLAY 57

58 Naïve Bayes Summary Advantages: Fast to train (single scan through data) Fast to classify Not sensitive to irrelevant features Handles real and discrete data Handles streaming data well Disadvantages: Assumes independence of features Slide by Eamonn Keogh 58

Naïve Bayes. Robot Image Credit: Viktoriya Sukhanova 123RF.com

Naïve Bayes. Robot Image Credit: Viktoriya Sukhanova 123RF.com Naïve Bayes These slides were assembled by Byron Boots, with only minor modifications from Eric Eaton s slides and grateful acknowledgement to the many others who made their course materials freely available

More information

Essen,al Probability Concepts

Essen,al Probability Concepts Naïve Bayes Essen,al Probability Concepts Marginaliza,on: Condi,onal Probability: Bayes Rule: P (B) = P (A B) = X v2values(a) P (A B) = P (B ^ A = v) P (A ^ B) P (B) ^ P (B A) P (A) P (B) Independence:

More information

Bayesian Methods: Naïve Bayes

Bayesian Methods: Naïve Bayes Bayesian Methods: Naïve Bayes Nicholas Ruozzi University of Texas at Dallas based on the slides of Vibhav Gogate Last Time Parameter learning Learning the parameter of a simple coin flipping model Prior

More information

2 When Some or All Labels are Missing: The EM Algorithm

2 When Some or All Labels are Missing: The EM Algorithm CS769 Spring Advanced Natural Language Processing The EM Algorithm Lecturer: Xiaojin Zhu jerryzhu@cs.wisc.edu Given labeled examples (x, y ),..., (x l, y l ), one can build a classifier. If in addition

More information

Decision Trees. Nicholas Ruozzi University of Texas at Dallas. Based on the slides of Vibhav Gogate and David Sontag

Decision Trees. Nicholas Ruozzi University of Texas at Dallas. Based on the slides of Vibhav Gogate and David Sontag Decision Trees Nicholas Ruozzi University of Texas at Dallas Based on the slides of Vibhav Gogate and David Sontag Announcements Course TA: Hao Xiong Office hours: Friday 2pm-4pm in ECSS2.104A1 First homework

More information

Decision Trees. an Introduction

Decision Trees. an Introduction Decision Trees an Introduction Outline Top-Down Decision Tree Construction Choosing the Splitting Attribute Information Gain and Gain Ratio Decision Tree An internal node is a test on an attribute A branch

More information

knn & Naïve Bayes Hongning Wang

knn & Naïve Bayes Hongning Wang knn & Naïve Bayes Hongning Wang CS@UVa Today s lecture Instance-based classifiers k nearest neighbors Non-parametric learning algorithm Model-based classifiers Naïve Bayes classifier A generative model

More information

Predicting NBA Shots

Predicting NBA Shots Predicting NBA Shots Brett Meehan Stanford University https://github.com/brettmeehan/cs229 Final Project bmeehan2@stanford.edu Abstract This paper examines the application of various machine learning algorithms

More information

FEATURES. Features. UCI Machine Learning Repository. Admin 9/23/13

FEATURES. Features. UCI Machine Learning Repository. Admin 9/23/13 Admin Assignment 2 This class will make you a better programmer! How did it go? How much time did you spend? FEATURES David Kauchak CS 451 Fall 2013 Assignment 3 out Implement perceptron variants See how

More information

PREDICTING THE NCAA BASKETBALL TOURNAMENT WITH MACHINE LEARNING. The Ringer/Getty Images

PREDICTING THE NCAA BASKETBALL TOURNAMENT WITH MACHINE LEARNING. The Ringer/Getty Images PREDICTING THE NCAA BASKETBALL TOURNAMENT WITH MACHINE LEARNING A N D R E W L E V A N D O S K I A N D J O N A T H A N L O B O The Ringer/Getty Images THE TOURNAMENT MARCH MADNESS 68 teams (4 play-in games)

More information

PREDICTING the outcomes of sporting events

PREDICTING the outcomes of sporting events CS 229 FINAL PROJECT, AUTUMN 2014 1 Predicting National Basketball Association Winners Jasper Lin, Logan Short, and Vishnu Sundaresan Abstract We used National Basketball Associations box scores from 1991-1998

More information

Logistic Regression. Hongning Wang

Logistic Regression. Hongning Wang Logistic Regression Hongning Wang CS@UVa Today s lecture Logistic regression model A discriminative classification model Two different perspectives to derive the model Parameter estimation CS@UVa CS 6501:

More information

DATA MINING ON CRICKET DATA SET FOR PREDICTING THE RESULTS. Sushant Murdeshwar

DATA MINING ON CRICKET DATA SET FOR PREDICTING THE RESULTS. Sushant Murdeshwar DATA MINING ON CRICKET DATA SET FOR PREDICTING THE RESULTS by Sushant Murdeshwar A Project Report Submitted in Partial Fulfillment of the Requirements for the Degree of Master of Science in Computer Science

More information

Evaluating and Classifying NBA Free Agents

Evaluating and Classifying NBA Free Agents Evaluating and Classifying NBA Free Agents Shanwei Yan In this project, I applied machine learning techniques to perform multiclass classification on free agents by using game statistics, which is useful

More information

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Wenbing Zhao. Department of Electrical and Computer Engineering

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Wenbing Zhao. Department of Electrical and Computer Engineering EEC 686/785 Modeling & Performance Evaluation of Computer Systems Lecture 6 Department of Electrical and Computer Engineering Cleveland State University wenbing@ieee.org Outline 2 Review of lecture 5 The

More information

Unit 4: Inference for numerical variables Lecture 3: ANOVA

Unit 4: Inference for numerical variables Lecture 3: ANOVA Unit 4: Inference for numerical variables Lecture 3: ANOVA Statistics 101 Thomas Leininger June 10, 2013 Announcements Announcements Proposals due tomorrow. Will be returned to you by Wednesday. You MUST

More information

Outline. Terminology. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Steps in Capacity Planning and Management

Outline. Terminology. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Steps in Capacity Planning and Management EEC 686/785 Modeling & Performance Evaluation of Computer Systems Lecture 6 Department of Electrical and Computer Engineering Cleveland State University wenbing@ieee.org Outline Review of lecture 5 The

More information

Mixture Models & EM. Nicholas Ruozzi University of Texas at Dallas. based on the slides of Vibhav Gogate

Mixture Models & EM. Nicholas Ruozzi University of Texas at Dallas. based on the slides of Vibhav Gogate Mixture Models & EM Nicholas Ruozzi University of Texas at Dallas based on the slides of Vibhav Gogate Previously We looked at -means and hierarchical clustering as mechanisms for unsupervised learning

More information

CS145: INTRODUCTION TO DATA MINING

CS145: INTRODUCTION TO DATA MINING CS145: INTRODUCTION TO DATA MINING 3: Vector Data: Logistic Regression Instructor: Yizhou Sun yzsun@cs.ucla.edu October 9, 2017 Methods to Learn Vector Data Set Data Sequence Data Text Data Classification

More information

A computer program that improves its performance at some task through experience.

A computer program that improves its performance at some task through experience. 1 A computer program that improves its performance at some task through experience. 2 Example: Learn to Diagnose Patients T: Diagnose tumors from images P: Percent of patients correctly diagnosed E: Pre

More information

Chapter 5: Methods and Philosophy of Statistical Process Control

Chapter 5: Methods and Philosophy of Statistical Process Control Chapter 5: Methods and Philosophy of Statistical Process Control Learning Outcomes After careful study of this chapter You should be able to: Understand chance and assignable causes of variation, Explain

More information

Predicting Horse Racing Results with Machine Learning

Predicting Horse Racing Results with Machine Learning Predicting Horse Racing Results with Machine Learning LYU 1703 LIU YIDE 1155062194 Supervisor: Professor Michael R. Lyu Outline Recap of last semester Object of this semester Data Preparation Set to sequence

More information

Ivan Suarez Robles, Joseph Wu 1

Ivan Suarez Robles, Joseph Wu 1 Ivan Suarez Robles, Joseph Wu 1 Section 1: Introduction Mixed Martial Arts (MMA) is the fastest growing competitive sport in the world. Because the fighters engage in distinct martial art disciplines (boxing,

More information

Simulating Major League Baseball Games

Simulating Major League Baseball Games ABSTRACT Paper 2875-2018 Simulating Major League Baseball Games Justin Long, Slippery Rock University; Brad Schweitzer, Slippery Rock University; Christy Crute Ph.D, Slippery Rock University The game of

More information

CS 528 Mobile and Ubiquitous Computing Lecture 7a: Applications of Activity Recognition + Machine Learning for Ubiquitous Computing.

CS 528 Mobile and Ubiquitous Computing Lecture 7a: Applications of Activity Recognition + Machine Learning for Ubiquitous Computing. CS 528 Mobile and Ubiquitous Computing Lecture 7a: Applications of Activity Recognition + Machine Learning for Ubiquitous Computing Emmanuel Agu Applications of Activity Recognition Recall: Activity Recognition

More information

Looking at Spacings to Assess Streakiness

Looking at Spacings to Assess Streakiness Looking at Spacings to Assess Streakiness Jim Albert Department of Mathematics and Statistics Bowling Green State University September 2013 September 2013 1 / 43 The Problem Collect hitting data for all

More information

5.1 Introduction. Learning Objectives

5.1 Introduction. Learning Objectives Learning Objectives 5.1 Introduction Statistical Process Control (SPC): SPC is a powerful collection of problem-solving tools useful in achieving process stability and improving capability through the

More information

Predicting the Total Number of Points Scored in NFL Games

Predicting the Total Number of Points Scored in NFL Games Predicting the Total Number of Points Scored in NFL Games Max Flores (mflores7@stanford.edu), Ajay Sohmshetty (ajay14@stanford.edu) CS 229 Fall 2014 1 Introduction Predicting the outcome of National Football

More information

#1 Accurately Rate and Rank each FBS team, and

#1 Accurately Rate and Rank each FBS team, and The goal of the playoffpredictor website is to use statistical analysis in the absolute simplest terms to: #1 Accurately Rate and Rank each FBS team, and #2 Predict what the playoff committee will list

More information

CSC242: Intro to AI. Lecture 21

CSC242: Intro to AI. Lecture 21 CSC242: Intro to AI Lecture 21 Quiz Stop Time: 2:15 Learning (from Examples) Learning Learning gives computers the ability to learn without being explicitly programmed (Samuel, 1959)... agents that can

More information

Navigate to the golf data folder and make it your working directory. Load the data by typing

Navigate to the golf data folder and make it your working directory. Load the data by typing Golf Analysis 1.1 Introduction In a round, golfers have a number of choices to make. For a particular shot, is it better to use the longest club available to try to reach the green, or would it be better

More information

Machine Learning an American Pastime

Machine Learning an American Pastime Nikhil Bhargava, Andy Fang, Peter Tseng CS 229 Paper Machine Learning an American Pastime I. Introduction Baseball has been a popular American sport that has steadily gained worldwide appreciation in the

More information

Predicting the NCAA Men s Basketball Tournament with Machine Learning

Predicting the NCAA Men s Basketball Tournament with Machine Learning Predicting the NCAA Men s Basketball Tournament with Machine Learning Andrew Levandoski and Jonathan Lobo CS 2750: Machine Learning Dr. Kovashka 25 April 2017 Abstract As the popularity of the NCAA Men

More information

A few things to remember about ANOVA

A few things to remember about ANOVA A few things to remember about ANOVA 1) The F-test that is performed is always 1-tailed. This is because your alternative hypothesis is always that the between group variation is greater than the within

More information

Which On-Base Percentage Shows. the Highest True Ability of a. Baseball Player?

Which On-Base Percentage Shows. the Highest True Ability of a. Baseball Player? Which On-Base Percentage Shows the Highest True Ability of a Baseball Player? January 31, 2018 Abstract This paper looks at the true on-base ability of a baseball player given their on-base percentage.

More information

Calculation of Trail Usage from Counter Data

Calculation of Trail Usage from Counter Data 1. Introduction 1 Calculation of Trail Usage from Counter Data 1/17/17 Stephen Martin, Ph.D. Automatic counters are used on trails to measure how many people are using the trail. A fundamental question

More information

Attacking and defending neural networks. HU Xiaolin ( 胡晓林 ) Department of Computer Science and Technology Tsinghua University, Beijing, China

Attacking and defending neural networks. HU Xiaolin ( 胡晓林 ) Department of Computer Science and Technology Tsinghua University, Beijing, China Attacking and defending neural networks HU Xiaolin ( 胡晓林 ) Department of Computer Science and Technology Tsinghua University, Beijing, China Outline Background Attacking methods Defending methods 2 AI

More information

Application of Bayesian Networks to Shopping Assistance

Application of Bayesian Networks to Shopping Assistance Application of Bayesian Networks to Shopping Assistance Yang Xiang, Chenwen Ye, and Deborah Ann Stacey University of Guelph, CANADA Abstract. We develop an on-line shopping assistant that can help a e-shopper

More information

Minimum Mean-Square Error (MMSE) and Linear MMSE (LMMSE) Estimation

Minimum Mean-Square Error (MMSE) and Linear MMSE (LMMSE) Estimation Minimum Mean-Square Error (MMSE) and Linear MMSE (LMMSE) Estimation Outline: MMSE estimation, Linear MMSE (LMMSE) estimation, Geometric formulation of LMMSE estimation and orthogonality principle. Reading:

More information

An Artificial Neural Network-based Prediction Model for Underdog Teams in NBA Matches

An Artificial Neural Network-based Prediction Model for Underdog Teams in NBA Matches An Artificial Neural Network-based Prediction Model for Underdog Teams in NBA Matches Paolo Giuliodori University of Camerino, School of Science and Technology, Computer Science Division, Italy paolo.giuliodori@unicam.it

More information

CS249: ADVANCED DATA MINING

CS249: ADVANCED DATA MINING CS249: ADVANCED DATA MINING Linear Regression, Logistic Regression, and GLMs Instructor: Yizhou Sun yzsun@cs.ucla.edu April 24, 2017 About WWW2017 Conference 2 Turing Award Winner Sir Tim Berners-Lee 3

More information

Remote Towers: Videopanorama Framerate Requirements Derived from Visual Discrimination of Deceleration During Simulated Aircraft Landing

Remote Towers: Videopanorama Framerate Requirements Derived from Visual Discrimination of Deceleration During Simulated Aircraft Landing www.dlr.de Chart 1 > SESARInno > Fürstenau RTOFramerate> 2012-11-30 Remote Towers: Videopanorama Framerate Requirements Derived from Visual Discrimination of Deceleration During Simulated Aircraft Landing

More information

Estimating Paratransit Demand Forecasting Models Using ACS Disability and Income Data

Estimating Paratransit Demand Forecasting Models Using ACS Disability and Income Data Estimating Paratransit Demand Forecasting Models Using ACS Disability and Income Data Presenter: Daniel Rodríguez Román University of Puerto Rico, Mayagüez Co-author: Sarah V. Hernandez University of Arkansas,

More information

Deconstructing Data Science

Deconstructing Data Science Deconstructing Data Science David Bamman, UC Berkele Info 29 Lecture 4: Regression overview Feb 1, 216 Regression A mapping from input data (drawn from instance space ) to a point in R (R = the set of

More information

/435 Artificial Intelligence Fall 2015

/435 Artificial Intelligence Fall 2015 Final Exam 600.335/435 Artificial Intelligence Fall 2015 Name: Section (335/435): Instructions Please be sure to write both your name and section in the space above! Some questions will be exclusive to

More information

A history-based estimation for LHCb job requirements

A history-based estimation for LHCb job requirements A history-based estimation for LHCb job requirements Nathalie Rauschmayr on behalf of LHCb Computing 13th April 2015 Introduction N. Rauschmayr 2 How long will a job run and how much memory might it need?

More information

Smart-Walk: An Intelligent Physiological Monitoring System for Smart Families

Smart-Walk: An Intelligent Physiological Monitoring System for Smart Families Smart-Walk: An Intelligent Physiological Monitoring System for Smart Families P. Sundaravadivel 1, S. P. Mohanty 2, E. Kougianos 3, V. P. Yanambaka 4, and M. K. Ganapathiraju 5 University of North Texas,

More information

Home Team Advantage in English Premier League

Home Team Advantage in English Premier League Patrice Marek* and František Vávra** *European Centre of Excellence NTIS New Technologies for the Information Society, Faculty of Applied Sciences, University of West Bohemia, Czech Republic: patrke@kma.zcu.cz

More information

Measuring range Δp (span = 100%) Pa

Measuring range Δp (span = 100%) Pa 4.4/ RLE 5: Volume-flow controller, continuous How energy efficiency is improved Enables demand-led volume flow control for the optimisation of energy consumption in ventilation systems. Areas of application

More information

Lecture 5. Optimisation. Regularisation

Lecture 5. Optimisation. Regularisation Lecture 5. Optimisation. Regularisation COMP90051 Statistical Machine Learning Semester 2, 2017 Lecturer: Andrey Kan Copyright: University of Melbourne Iterative optimisation Loss functions Coordinate

More information

University of Cincinnati

University of Cincinnati Mapping the Design Space of a Recuperated, Recompression, Precompression Supercritical Carbon Dioxide Power Cycle with Intercooling, Improved Regeneration, and Reheat Andrew Schroder Mark Turner University

More information

Interpretable Discovery in Large Image Data Sets

Interpretable Discovery in Large Image Data Sets Interpretable Discovery in Large Image Data Sets Kiri L. Wagstaff and Jake Lee Jet Propulsion Laboratory, California Institute of Technology December 7, 2017 NIPS Interpretable Machine Learning Symposium

More information

Modeling Salmon Behavior on the Umpqua River. By Scott Jordan 6/2/2015

Modeling Salmon Behavior on the Umpqua River. By Scott Jordan 6/2/2015 Modeling Salmon Behavior on the Umpqua River By Scott Jordan 6/2/2015 1 Importance of Salmon Delicious Recreation 631,000 people in Oregon went fishing in 2008 spent $264.6 Million on fishing trips Commercial

More information

Basketball data science

Basketball data science Basketball data science University of Brescia, Italy Vienna, April 13, 2018 paola.zuccolotto@unibs.it marica.manisera@unibs.it BDSports, a network of people interested in Sports Analytics http://bodai.unibs.it/bdsports/

More information

Note that all proportions are between 0 and 1. at risk. How to construct a sentence describing a. proportion:

Note that all proportions are between 0 and 1. at risk. How to construct a sentence describing a. proportion: Biostatistics and Research Design in Dentistry Categorical Data Reading assignment Chapter 3 Summarizing data in Dawson-Trapp starting with Summarizing nominal and ordinal data with numbers on p 40 thru

More information

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 03, 2016 ISSN (online):

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 03, 2016 ISSN (online): IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 03, 2016 ISSN (online): 2321-0613 A Survey on Different Classification technique in Data-Mining Miss.Mayuri Jadav 1 Prof.Risha

More information

Mathematics of Pari-Mutuel Wagering

Mathematics of Pari-Mutuel Wagering Millersville University of Pennsylvania April 17, 2014 Project Objectives Model the horse racing process to predict the outcome of a race. Use the win and exacta betting pools to estimate probabilities

More information

Transmitter CS 21 Operation Manual

Transmitter CS 21 Operation Manual Transmitter CS 21 Operation Manual Content Page For your Safety 3 General Description 3 Detection Principle 4 Operational Notes 4 Design 4 Mounting Position of CS21 5 Mounting 6 Installation of Electrical

More information

#19 MONITORING AND PREDICTING PEDESTRIAN BEHAVIOR USING TRAFFIC CAMERAS

#19 MONITORING AND PREDICTING PEDESTRIAN BEHAVIOR USING TRAFFIC CAMERAS #19 MONITORING AND PREDICTING PEDESTRIAN BEHAVIOR USING TRAFFIC CAMERAS Final Research Report Luis E. Navarro-Serment, Ph.D. The Robotics Institute Carnegie Mellon University November 25, 2018. Disclaimer

More information

Strategic Research PROGRAM FOLLOW-UP ASSESSMENT OF THE MOBILEYE SHIELD+ COLLISION AVOIDANCE SYSTEM

Strategic Research PROGRAM FOLLOW-UP ASSESSMENT OF THE MOBILEYE SHIELD+ COLLISION AVOIDANCE SYSTEM 165610-1 Strategic Research PROGRAM FOLLOW-UP ASSESSMENT OF THE MOBILEYE SHIELD+ COLLISION AVOIDANCE SYSTEM March 2017 1. Report No. TTI/SRP/17/165610-1 2. Government Accession No. 3. Recipient's Catalog

More information

Lab 11: Introduction to Linear Regression

Lab 11: Introduction to Linear Regression Lab 11: Introduction to Linear Regression Batter up The movie Moneyball focuses on the quest for the secret of success in baseball. It follows a low-budget team, the Oakland Athletics, who believed that

More information

POWER Quantifying Correction Curve Uncertainty Through Empirical Methods

POWER Quantifying Correction Curve Uncertainty Through Empirical Methods Proceedings of the ASME 2014 Power Conference POWER2014 July 28-31, 2014, Baltimore, Maryland, USA POWER2014-32187 Quantifying Correction Curve Uncertainty Through Empirical Methods ABSTRACT Christopher

More information

Introduction to Pattern Recognition

Introduction to Pattern Recognition Introduction to Pattern Recognition Jason Corso SUNY at Buffalo 12 January 2009 J. Corso (SUNY at Buffalo) Introduction to Pattern Recognition 12 January 2009 1 / 28 Pattern Recognition By Example Example:

More information

Transformer fault diagnosis using Dissolved Gas Analysis technology and Bayesian networks

Transformer fault diagnosis using Dissolved Gas Analysis technology and Bayesian networks Proceedings of the 4th International Conference on Systems and Control, Sousse, Tunisia, April 28-30, 2015 TuCA.2 Transformer fault diagnosis using Dissolved Gas Analysis technology and Bayesian networks

More information

Deconstructing Data Science

Deconstructing Data Science Deconstructing Data Science David Bamman, UC Berkele Info 29 Lecture 4: Regression overview Jan 26, 217 Regression A mapping from input data (drawn from instance space ) to a point in R (R = the set of

More information

Real Time Early Warning Indicators for Costly Asset Price Boom/Bust Cycles: A Role for Global Liquidity

Real Time Early Warning Indicators for Costly Asset Price Boom/Bust Cycles: A Role for Global Liquidity Indicators Asset Price : A Role for Global Liquidity Lucia Alessi European Central Bank Carsten Detken Milan, 24 June 2010 The need for Models The set up of an System is one of the key tasks of the European

More information

ORF 201 Computer Methods in Problem Solving. Final Project: Dynamic Programming Optimal Sailing Strategies

ORF 201 Computer Methods in Problem Solving. Final Project: Dynamic Programming Optimal Sailing Strategies Princeton University Department of Operations Research and Financial Engineering ORF 201 Computer Methods in Problem Solving Final Project: Dynamic Programming Optimal Sailing Strategies Due 11:59 pm,

More information

Determination of the Design Load for Structural Safety Assessment against Gas Explosion in Offshore Topside

Determination of the Design Load for Structural Safety Assessment against Gas Explosion in Offshore Topside Determination of the Design Load for Structural Safety Assessment against Gas Explosion in Offshore Topside Migyeong Kim a, Gyusung Kim a, *, Jongjin Jung a and Wooseung Sim a a Advanced Technology Institute,

More information

Projecting Three-Point Percentages for the NBA Draft

Projecting Three-Point Percentages for the NBA Draft Projecting Three-Point Percentages for the NBA Draft Hilary Sun hsun3@stanford.edu Jerold Yu jeroldyu@stanford.edu December 16, 2017 Roland Centeno rcenteno@stanford.edu 1 Introduction As NBA teams have

More information

Open Research Online The Open University s repository of research publications and other research outputs

Open Research Online The Open University s repository of research publications and other research outputs Open Research Online The Open University s repository of research publications and other research outputs Developing an intelligent table tennis umpiring system Conference or Workshop Item How to cite:

More information

A Behavioral Theory. Teck H. Ho. & National University of Singapore. Joint Work with Noah Lim, Houston Juanjuan Zhang, MIT. July 2008 Teck H.

A Behavioral Theory. Teck H. Ho. & National University of Singapore. Joint Work with Noah Lim, Houston Juanjuan Zhang, MIT. July 2008 Teck H. Designing Pricing Contracts for Homo Sapiens: A Behavioral Theory Teck H. Ho University of California, Berkeley & National University of Singapore Joint Work with Noah Lim, Houston Juanjuan Zhang, MIT

More information

Transmitter CS 21 Operation Manual

Transmitter CS 21 Operation Manual Transmitter CS 21 Operation Manual 1194 Oak Valley Drive, Suite 20, Ann Arbor, MI 48108 800-959-0573 734-769-1888 Content Page For your Safety 3 General Description 3 Detection Principle 4 Operation 4

More information

Chapter 4 (sort of): How Universal Are Turing Machines? CS105: Great Insights in Computer Science

Chapter 4 (sort of): How Universal Are Turing Machines? CS105: Great Insights in Computer Science Chapter 4 (sort of): How Universal Are Turing Machines? CS105: Great Insights in Computer Science Some Probability Theory If event has probability p, 1/p tries before it happens (on average). If n distinct

More information

Real-Time Electricity Pricing

Real-Time Electricity Pricing Real-Time Electricity Pricing Xi Chen, Jonathan Hosking and Soumyadip Ghosh IBM Watson Research Center / Northwestern University Yorktown Heights, NY, USA X. Chen, J. Hosking & S. Ghosh (IBM) Real-Time

More information

Spatio-temporal analysis of team sports Joachim Gudmundsson

Spatio-temporal analysis of team sports Joachim Gudmundsson Spatio-temporal analysis of team sports Joachim Gudmundsson The University of Sydney Page 1 Team sport analysis Talk is partly based on: Joachim Gudmundsson and Michael Horton Spatio-Temporal Analysis

More information

Biostatistics & SAS programming

Biostatistics & SAS programming Biostatistics & SAS programming Kevin Zhang March 6, 2017 ANOVA 1 Two groups only Independent groups T test Comparison One subject belongs to only one groups and observed only once Thus the observations

More information

PEDESTRIAN behavior modeling and analysis is

PEDESTRIAN behavior modeling and analysis is 4354 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 25, NO. 9, SEPTEMBER 2016 Pedestrian Behavior Modeling From Stationary Crowds With Applications to Intelligent Surveillance Shuai Yi, Hongsheng Li, and

More information

Lecture 04 ( ) Hazard Analysis. Systeme hoher Qualität und Sicherheit Universität Bremen WS 2015/2016

Lecture 04 ( ) Hazard Analysis. Systeme hoher Qualität und Sicherheit Universität Bremen WS 2015/2016 Systeme hoher Qualität und Sicherheit Universität Bremen WS 2015/2016 Lecture 04 (02.11.2015) Hazard Analysis Christoph Lüth Jan Peleska Dieter Hutter Where are we? 01: Concepts of Quality 02: Legal Requirements:

More information

Lecture 39: Training Neural Networks (Cont d)

Lecture 39: Training Neural Networks (Cont d) Lecture 39: Training Neural Networks (Cont d) CS 4670/5670 Sean Bell Strawberry Goblet Throne (Side Note for PA5) AlexNet: 1 vs 2 parts Caffe represents caffe like the above image, but computes as if it

More information

THe rip currents are very fast moving narrow channels,

THe rip currents are very fast moving narrow channels, 1 Rip Current Detection using Optical Flow Shweta Philip sphilip@ucsc.edu Abstract Rip currents are narrow currents of fast moving water that are strongest near the beach. These type of currents are dangerous

More information

Beamex. Calibration White Paper. Weighing scale calibration - How to calibrate weighing instruments

Beamex. Calibration White Paper.  Weighing scale calibration - How to calibrate weighing instruments Beamex Calibration White Paper info@beamex.com Weighing scale calibration - How to calibrate weighing instruments Weighing scale calibration - How to calibrate weighing instruments Weighing scales, weighing

More information

Determining bicycle infrastructure preferences A case study of Dublin

Determining bicycle infrastructure preferences A case study of Dublin *Manuscript Click here to view linked References 1 Determining bicycle infrastructure preferences A case study of Dublin Brian Caulfield 1, Elaine Brick 2, Orla Thérèse McCarthy 1 1 Department of Civil,

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 256 Introduction This procedure computes summary statistics and common non-parametric, single-sample runs tests for a series of n numeric, binary, or categorical data values. For numeric data,

More information

Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA

Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

Reliable Real-Time Recognition of Motion Related Human Activities using MEMS Inertial Sensors

Reliable Real-Time Recognition of Motion Related Human Activities using MEMS Inertial Sensors Reliable Real-Time Recognition of Motion Related Human Activities using MEMS Inertial Sensors K. Frank,, y; M.J. Vera-Nadales, University of Malaga, Spain; P. Robertson, M. Angermann, v36 Motivation 2

More information

Special Topics: Data Science

Special Topics: Data Science Special Topics: Data Science L Linear Methods for Prediction Dr. Vidhyasaharan Sethu School of Electrical Engineering & Telecommunications University of New South Wales Sydney, Australia V. Sethu 1 Topics

More information

University of Cincinnati

University of Cincinnati Mapping the Design Space of a Recuperated, Recompression, Precompression Supercritical Carbon Dioxide Power Cycle with Intercooling, Improved Regeneration, and Reheat Andrew Schroder Mark Turner University

More information

Fail Operational Controls for an Independent Metering Valve

Fail Operational Controls for an Independent Metering Valve Group 14 - System Intergration and Safety Paper 14-3 465 Fail Operational Controls for an Independent Metering Valve Michael Rannow Eaton Corporation, 7945 Wallace Rd., Eden Prairie, MN, 55347, email:

More information

CS 112 Introduction to Programming

CS 112 Introduction to Programming CS 112 Introduction to Programming (Spring 2012) Lecture #15: Random Web Surfer Zhong Shao Department of Computer Science Yale University Office: 314 Watson http://flint.cs.yale.edu/cs112 Acknowledgements:

More information

Chem 110 General Principles of Chemistry

Chem 110 General Principles of Chemistry CHEM110 Worksheet - Gases Chem 110 General Principles of Chemistry Chapter 9 Gases (pages 337-373) In this chapter we - first contrast gases with liquids and solids and then discuss gas pressure. - review

More information

Analysis of Variance. Copyright 2014 Pearson Education, Inc.

Analysis of Variance. Copyright 2014 Pearson Education, Inc. Analysis of Variance 12-1 Learning Outcomes Outcome 1. Understand the basic logic of analysis of variance. Outcome 2. Perform a hypothesis test for a single-factor design using analysis of variance manually

More information

Inferring land use from mobile phone activity

Inferring land use from mobile phone activity Inferring land use from mobile phone activity Jameson L. Toole (MIT) Michael Ulm (AIT) Dietmar Bauer (AIT) Marta C. Gonzalez (MIT) UrbComp 2012 Beijing, China 1 The Big Questions Can land use be predicted

More information

21st ECMI Modelling Week Final Report

21st ECMI Modelling Week Final Report 21st ECMI Modelling Week Final Report 2007.08.26 2007.09.02 Team 9 Stategies for darts Borset Kari Norwegian University of Science and Technology, Trondheim Hemmelmeir Claudia Johannes Kepler Universität,

More information

POKEMON HACKS. Jodie Ashford Josh Baggott Chloe Barnes Jordan bird

POKEMON HACKS. Jodie Ashford Josh Baggott Chloe Barnes Jordan bird POKEMON HACKS Jodie Ashford Josh Baggott Chloe Barnes Jordan bird Why pokemon? 1997 The Pokemon Problem an episode of the anime caused 685 children to have seizures Professor Graham Harding from Aston

More information

Lecture 10. Support Vector Machines (cont.)

Lecture 10. Support Vector Machines (cont.) Lecture 10. Support Vector Machines (cont.) COMP90051 Statistical Machine Learning Semester 2, 2017 Lecturer: Andrey Kan Copyright: University of Melbourne This lecture Soft margin SVM Intuition and problem

More information

Using Poisson Distribution to predict a Soccer Betting Winner

Using Poisson Distribution to predict a Soccer Betting Winner Using Poisson Distribution to predict a Soccer Betting Winner By SYED AHMER RIZVI 1511060 Section A Quantitative Methods - I APPLICATION OF DESCRIPTIVE STATISTICS AND PROBABILITY IN SOCCER Concept This

More information

Finding your feet: modelling the batting abilities of cricketers using Gaussian processes

Finding your feet: modelling the batting abilities of cricketers using Gaussian processes Finding your feet: modelling the batting abilities of cricketers using Gaussian processes Oliver Stevenson & Brendon Brewer PhD candidate, Department of Statistics, University of Auckland o.stevenson@auckland.ac.nz

More information

Attribute Adaptation for Personalized Image Search Adriana Kovashka and Kristen Grauman (Supplementary Materials)

Attribute Adaptation for Personalized Image Search Adriana Kovashka and Kristen Grauman (Supplementary Materials) Attribute Adaptation for Personalized Image Search Adriana Kovashka and Kristen Grauman (Supplementary Materials) 1 Additional Accuracy Results In Figure 1, we show more results of the methods performance

More information

A decision maker subjectively assigned the following probabilities to the four outcomes of an experiment: P(E 1) =.10, P(E 2) =.

A decision maker subjectively assigned the following probabilities to the four outcomes of an experiment: P(E 1) =.10, P(E 2) =. Name: _ Class: Date: _ (First Page) Name: _ Class: Date: _ (Subsequent Pages) 1. {Exercise 4.03} How many permutations of three items can be selected from a group of six? 2. {Exercise 4.07} A decision

More information

Epidemics and zombies

Epidemics and zombies Epidemics and zombies (Sethna, "Entropy, Order Parameters, and Complexity", ex. 6.21) 2018, James Sethna, all rights reserved. This exercise is based on Alemi and Bierbaum's class project, published in

More information