Mixture Models & EM. Nicholas Ruozzi University of Texas at Dallas. based on the slides of Vibhav Gogate

Size: px
Start display at page:

Download "Mixture Models & EM. Nicholas Ruozzi University of Texas at Dallas. based on the slides of Vibhav Gogate"

Transcription

1 Mixture Models & EM Nicholas Ruozzi University of Texas at Dallas based on the slides of Vibhav Gogate

2 Previously We looked at -means and hierarchical clustering as mechanisms for unsupervised learning -means was simply a block coordinate descent scheme for a specific objective function Today: how to learn probabilistic models for unsupervised learning problems 2

3 EM: Soft Clustering Clustering (e.g., k-means) typically assumes that each instance is given a hard assignment to exactly one cluster Does not allow uncertainty in class membership or for an instance to belong to more than one cluster Problematic because data points that lie roughly midway between cluster centers are assigned to one cluster Soft clustering gives probabilities that an instance belongs to each of a set of clusters 3

4 Probabilistic Clustering Try a probabilistic model! Allows overlaps, clusters of different size, etc. Can tell a generative story for data pp(xx yy) pp(yy) Challenge: we need to estimate model parameters without labeled yy s (i.e., in the unsupervised setting) Z X 1 X 2?? ?? ?? ?? ??

5 Probabilistic Clustering Clusters of different shapes and sizes Clusters can overlap! (-means doesn t allow this) 5

6 Finite Mixture Models Given a dataset: xx (1),, xx () Mixture model: Θ = {λλ 1,, λλ, θθ 1,, θθ } pp xx Θ = λλ yy pp yy (xx θθ yy ) yy=1 pp yy (xx θθ yy ) is a mixture component from some family of probability distributions parameterized by θθ yy and λλ 0 such that yy λλ yy = 1 are the mixture weights We can think of λλ yy = pp YY = yy Θ for some random variable YY that takes values in {1,, } 6

7 Finite Mixture Models Uniform mixture of 3 Gaussians 7

8 Multivariate Gaussian A dd-dimensional multivariate Gaussian distribution is defined by a dd dd covariance matrix Σ and a mean vector μμ pp xx μμ, Σ = 1 2ππ d det Σ exp 1 2 xx μμ TT Σ 1 (xx μμ) The covariance matrix describes the degree to which pairs of variables vary together The diagonal elements correspond to variances of the individual variables 8

9 Multivariate Gaussian A dd-dimensional multivariate Gaussian distribution is defined by a dd dd covariance matrix Σ and a mean vector μμ pp xx μμ, Σ = 1 2ππ d det Σ exp 1 2 xx μμ TT Σ 1 (xx μμ) The covariance matrix must be a symmetric positive definite matrix in order for the above to make sense Positive definite: all eigenvalues are positive & matrix is invertible Ensures that the quadratic form is concave 9

10 Gaussian Mixture Models (GMMs) We can define a GMM by choosing the ttt component of the mixture to be a Gaussian density with parameters θθ = μμ, Σ pp xx μμ, Σ k = 1 2ππ d det Σ k exp 1 2 xx μμ TT Σ 1 (xx μμ ) We could cluster by fitting a mixture of Gaussians to our data How do we learn these kinds of models? 10

11 Learning Gaussian Parameters MLE for supervised univariate Gaussian μμ MMMMMM = 1 xx ii 2 σσ MMMMMM = 1 1 xx ii μμ MMMMMM 2 MLE for supervised multivariate Gaussian μμ MMMMMM = 1 xx ii ΣΣ MLE = 1 1 xx ii μμ MMMMMM xx ii μμ MMMMMM TT 11

12 Learning Gaussian Parameters MLE for supervised multivariate mixture of Gaussian distributions μμ MMMMMM = 1 MM xx ii ii MM Σ MMMMMM = 1 MM xx ii μμ MMMMMM xx ii μμ MMMMMM ii MM Sums are over the observations that were generated by the ttt mixture component (this requires that we know which points were generated by which distribution!) TT 12

13 The Unsupervised Case What if our observations do not include information about which of the mixture components generated them? Consider a joint probability distribution over data points, xx (ii), and mixture assignments, yy {1,, } arg max Θ pp(xx ii Θ) = arg max Θ = arg max Θ pp(xx ii, YY = yy Θ) yy=1 pp xx ii YY = yy, Θ pp(yy = yy Θ) yy=1 13

14 The Unsupervised Case What if our observations do not include information about which of the mixture components generated them? Consider a joint probability distribution over data points, xx (ii), and mixture assignments, yy {1,, } arg max Θ pp(xx ii Θ) = arg max Θ = arg max Θ pp(xx ii, YY = yy Θ) yy=1 pp xx ii YY = yy, Θ pp(yy = yy Θ) yy=1 We only know how to compute the probabilities for each mixture component 14

15 The Unsupervised Case In the case of a Gaussian mixture model pp xx ii YY = yy, Θ = (xx (ii) μμ yy, Σ yy ) pp YY = yy Θ = λλ yy Differentiating the MLE objective yields a system of equations that is difficult to solve in general The solution: modify the objective to make the optimization easier 15

16 Expectation Maximization 16

17 Jensen s Inequality For a convex function ff: R nn R, any aa 1,, aa [0,1] such that aa aa = 1, and any xx (1),, xx R nn, aa 1 ff xx aa ff xx ff aa 1 xx aa xx Inequality is reversed for concave functions 17

18 EM Algorithm log l(θ) = log pp xx ii, YY = yy Θ yy=1 = log yy=1 FF(Θ, qq) yy=1 qqii yy qq ii yy pp xx ii, YY = yy Θ qq ii yy log pp xx ii, YY = yy Θ qq ii yy 18

19 EM Algorithm log l(θ) = log pp xx ii, YY = yy Θ yy=1 = log yy=1 FF(Θ, qq) yy=1 qqii yy qq ii yy pp xx ii, YY = yy Θ qq ii yy log pp xx ii, YY = yy Θ qq ii yy qq ii (yy) is an arbitrary positive probability distribution 19

20 EM Algorithm log l(θ) = log pp xx ii, YY = yy Θ yy=1 = log yy=1 FF(Θ, qq) yy=1 qqii yy qq ii yy pp xx ii, YY = yy Θ qq ii yy log pp xx ii, YY = yy Θ qq ii yy Jensen s ineq. 20

21 EM Algorithm arg max Θ,qq 1,..,qq yy=1 qq ii yy log pp xx ii, YY = yy Θ qq ii yy This objective is not jointly concave in Θ and qq 1,, qq Best we can hope for is a local maxima (and there could be A LOT of them) The EM algorithm is a block coordinate ascent scheme that finds a local optimum of this objective Start from an initialization Θ 0 and qq 1 0,, qq 0 21

22 EM Algorithm E step: with the θθ s fixed, maximize the objective over qq qq tt+1 arg max qq 1,..,qq yy=1 qq ii yy log pp xx ii, YY = yy Θ tt qq ii yy Using the method of Lagrange multipliers for the constraint that yy qq ii yy = 1 gives qq ii tt+1 yy = pp YY = yy XX = xx ii, Θ tt 22

23 EM Algorithm M step: with the qq s fixed, maximize the objective over Θ θθ tt+1 arg max Θ yy=1 qq ii tt+1 yy log pp xx ii, YY = yy θθ qq ii tt+1 yy For the case of GMM, we can compute this update in closed form This is not necessarily the case for every model May require gradient ascent 23

24 EM Algorithm Start with random parameters E-step maximizes a lower bound on the log-sum for fixed parameters M-step solves a MLE estimation problem for fixed probabilities Iterate between the E-step and M-step until convergence 24

25 EM for Gaussian Mixtures E-step: M-step: qq tt ii yy = λλtt yy pp xx ii μμ tt tt yy, Σ yy tt pp xx ii μμ tt tt yy, Σ yy yy λλ yy μμ tt+1 yy = qq ii tt yy xx ii qq ii tt yy Probability of xx (ii) under the appropriate multivariate normal distribution Σ tt+1 yy = qq tt ii yy xx ii tt+1 μμ yy xx ii tt+1 μμ TT yy qq tt ii yy λλ yy tt+1 = 1 qq ii tt yy 25

26 EM for Gaussian Mixtures E-step: M-step: qq tt ii yy = λλtt yy pp xx ii μμ tt tt yy, Σ yy tt pp xx ii μμ tt tt yy, Σ yy yy λλ yy μμ tt+1 yy = qq ii tt yy xx ii qq ii tt yy Probability of xx (ii) under the mixture model Σ tt+1 yy = qq tt ii yy xx ii tt+1 μμ yy xx ii tt+1 μμ TT yy qq tt ii yy λλ yy tt+1 = 1 qq ii tt yy 26

27 Gaussian Mixture Example: Start 27

28 After first iteration 28

29 After 2nd iteration 29

30 After 3rd iteration 30

31 After 4th iteration 31

32 After 5th iteration 32

33 After 6th iteration 33

34 After 20th iteration 34

35 Properties of EM EM converges to a local optimum This is because each iteration improves the log-likelihood Proof same as -means (just block coordinate ascent) E-step can never decrease likelihood M-step can never decrease likelihood If we make hard assignments instead of soft ones, algorithm is equivalent to -means! 35

Bayesian Methods: Naïve Bayes

Bayesian Methods: Naïve Bayes Bayesian Methods: Naïve Bayes Nicholas Ruozzi University of Texas at Dallas based on the slides of Vibhav Gogate Last Time Parameter learning Learning the parameter of a simple coin flipping model Prior

More information

2 When Some or All Labels are Missing: The EM Algorithm

2 When Some or All Labels are Missing: The EM Algorithm CS769 Spring Advanced Natural Language Processing The EM Algorithm Lecturer: Xiaojin Zhu jerryzhu@cs.wisc.edu Given labeled examples (x, y ),..., (x l, y l ), one can build a classifier. If in addition

More information

Logistic Regression. Hongning Wang

Logistic Regression. Hongning Wang Logistic Regression Hongning Wang CS@UVa Today s lecture Logistic regression model A discriminative classification model Two different perspectives to derive the model Parameter estimation CS@UVa CS 6501:

More information

Decision Trees. Nicholas Ruozzi University of Texas at Dallas. Based on the slides of Vibhav Gogate and David Sontag

Decision Trees. Nicholas Ruozzi University of Texas at Dallas. Based on the slides of Vibhav Gogate and David Sontag Decision Trees Nicholas Ruozzi University of Texas at Dallas Based on the slides of Vibhav Gogate and David Sontag Announcements Course TA: Hao Xiong Office hours: Friday 2pm-4pm in ECSS2.104A1 First homework

More information

Lecture 5. Optimisation. Regularisation

Lecture 5. Optimisation. Regularisation Lecture 5. Optimisation. Regularisation COMP90051 Statistical Machine Learning Semester 2, 2017 Lecturer: Andrey Kan Copyright: University of Melbourne Iterative optimisation Loss functions Coordinate

More information

Special Topics: Data Science

Special Topics: Data Science Special Topics: Data Science L Linear Methods for Prediction Dr. Vidhyasaharan Sethu School of Electrical Engineering & Telecommunications University of New South Wales Sydney, Australia V. Sethu 1 Topics

More information

CS249: ADVANCED DATA MINING

CS249: ADVANCED DATA MINING CS249: ADVANCED DATA MINING Linear Regression, Logistic Regression, and GLMs Instructor: Yizhou Sun yzsun@cs.ucla.edu April 24, 2017 About WWW2017 Conference 2 Turing Award Winner Sir Tim Berners-Lee 3

More information

Course 495: Advanced Statistical Machine Learning/Pattern Recognition

Course 495: Advanced Statistical Machine Learning/Pattern Recognition Course 495: Advanced Statistical Machine Learning/Pattern Recognition Lectures: Stefanos Zafeiriou Goal (Lectures): To present modern statistical machine learning/pattern recognition algorithms. The course

More information

CS145: INTRODUCTION TO DATA MINING

CS145: INTRODUCTION TO DATA MINING CS145: INTRODUCTION TO DATA MINING 3: Vector Data: Logistic Regression Instructor: Yizhou Sun yzsun@cs.ucla.edu October 9, 2017 Methods to Learn Vector Data Set Data Sequence Data Text Data Classification

More information

knn & Naïve Bayes Hongning Wang

knn & Naïve Bayes Hongning Wang knn & Naïve Bayes Hongning Wang CS@UVa Today s lecture Instance-based classifiers k nearest neighbors Non-parametric learning algorithm Model-based classifiers Naïve Bayes classifier A generative model

More information

Lecture 10. Support Vector Machines (cont.)

Lecture 10. Support Vector Machines (cont.) Lecture 10. Support Vector Machines (cont.) COMP90051 Statistical Machine Learning Semester 2, 2017 Lecturer: Andrey Kan Copyright: University of Melbourne This lecture Soft margin SVM Intuition and problem

More information

Operational Risk Management: Preventive vs. Corrective Control

Operational Risk Management: Preventive vs. Corrective Control Operational Risk Management: Preventive vs. Corrective Control Yuqian Xu (UIUC) July 2018 Joint Work with Lingjiong Zhu and Michael Pinedo 1 Research Questions How to manage operational risk? How does

More information

Communication Amid Uncertainty

Communication Amid Uncertainty Communication Amid Uncertainty Madhu Sudan Harvard University Based on joint works with Brendan Juba, Oded Goldreich, Adam Kalai, Sanjeev Khanna, Elad Haramaty, Jacob Leshno, Clement Canonne, Venkatesan

More information

Minimum Mean-Square Error (MMSE) and Linear MMSE (LMMSE) Estimation

Minimum Mean-Square Error (MMSE) and Linear MMSE (LMMSE) Estimation Minimum Mean-Square Error (MMSE) and Linear MMSE (LMMSE) Estimation Outline: MMSE estimation, Linear MMSE (LMMSE) estimation, Geometric formulation of LMMSE estimation and orthogonality principle. Reading:

More information

Product Decomposition in Supply Chain Planning

Product Decomposition in Supply Chain Planning Mario R. Eden, Marianthi Ierapetritou and Gavin P. Towler (Editors) Proceedings of the 13 th International Symposium on Process Systems Engineering PSE 2018 July 1-5, 2018, San Diego, California, USA 2018

More information

Communication Amid Uncertainty

Communication Amid Uncertainty Communication Amid Uncertainty Madhu Sudan Harvard University Based on joint works with Brendan Juba, Oded Goldreich, Adam Kalai, Sanjeev Khanna, Elad Haramaty, Jacob Leshno, Clement Canonne, Venkatesan

More information

EE 364B: Wind Farm Layout Optimization via Sequential Convex Programming

EE 364B: Wind Farm Layout Optimization via Sequential Convex Programming EE 364B: Wind Farm Layout Optimization via Sequential Convex Programming Jinkyoo Park 1 Introduction In a wind farm, the wakes formed by upstream wind turbines decrease the power outputs of downstream

More information

Imperfectly Shared Randomness in Communication

Imperfectly Shared Randomness in Communication Imperfectly Shared Randomness in Communication Madhu Sudan Harvard Joint work with Clément Canonne (Columbia), Venkatesan Guruswami (CMU) and Raghu Meka (UCLA). 11/16/2016 UofT: ISR in Communication 1

More information

Jasmin Smajic 1, Christian Hafner 2, Jürg Leuthold 2, March 16, 2015 Introduction to Finite Element Method (FEM) Part 1 (2-D FEM)

Jasmin Smajic 1, Christian Hafner 2, Jürg Leuthold 2, March 16, 2015 Introduction to Finite Element Method (FEM) Part 1 (2-D FEM) Jasmin Smajic 1, Christian Hafner 2, Jürg Leuthold 2, March 16, 2015 Introduction to Finite Element Method (FEM) Part 1 (2-D FEM) 1 HSR - University of Applied Sciences of Eastern Switzerland Institute

More information

Support Vector Machines: Optimization of Decision Making. Christopher Katinas March 10, 2016

Support Vector Machines: Optimization of Decision Making. Christopher Katinas March 10, 2016 Support Vector Machines: Optimization of Decision Making Christopher Katinas March 10, 2016 Overview Background of Support Vector Machines Segregation Functions/Problem Statement Methodology Training/Testing

More information

ECO 745: Theory of International Economics. Jack Rossbach Fall Lecture 6

ECO 745: Theory of International Economics. Jack Rossbach Fall Lecture 6 ECO 745: Theory of International Economics Jack Rossbach Fall 2015 - Lecture 6 Review We ve covered several models of trade, but the empirics have been mixed Difficulties identifying goods with a technological

More information

The Intrinsic Value of a Batted Ball Technical Details

The Intrinsic Value of a Batted Ball Technical Details The Intrinsic Value of a Batted Ball Technical Details Glenn Healey, EECS Department University of California, Irvine, CA 9617 Given a set of observed batted balls and their outcomes, we develop a method

More information

Optimizing Cyclist Parking in a Closed System

Optimizing Cyclist Parking in a Closed System Optimizing Cyclist Parking in a Closed System Letu Qingge, Killian Smith Gianforte School of Computing, Montana State University, Bozeman, MT 59717, USA Abstract. In this paper, we consider the two different

More information

The next criteria will apply to partial tournaments. Consider the following example:

The next criteria will apply to partial tournaments. Consider the following example: Criteria for Assessing a Ranking Method Final Report: Undergraduate Research Assistantship Summer 2003 Gordon Davis: dagojr@email.arizona.edu Advisor: Dr. Russel Carlson One of the many questions that

More information

Trouble With The Curve: Improving MLB Pitch Classification

Trouble With The Curve: Improving MLB Pitch Classification Trouble With The Curve: Improving MLB Pitch Classification Michael A. Pane Samuel L. Ventura Rebecca C. Steorts A.C. Thomas arxiv:134.1756v1 [stat.ap] 5 Apr 213 April 8, 213 Abstract The PITCHf/x database

More information

The Simple Linear Regression Model ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD

The Simple Linear Regression Model ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD The Simple Linear Regression Model ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD Outline Definition. Deriving the Estimates. Properties of the Estimates. Units of Measurement and Functional Form. Expected

More information

Operations on Radical Expressions; Rationalization of Denominators

Operations on Radical Expressions; Rationalization of Denominators 0 RD. 1 2 2 2 2 2 2 2 Operations on Radical Expressions; Rationalization of Denominators Unlike operations on fractions or decimals, sums and differences of many radicals cannot be simplified. For instance,

More information

On the convergence of fitting algorithms in computer vision

On the convergence of fitting algorithms in computer vision On the convergence of fitting algorithms in computer vision N. Chernov Department of Mathematics University of Alabama at Birmingham Birmingham, AL 35294 chernov@math.uab.edu Abstract We investigate several

More information

Prediction Market and Parimutuel Mechanism

Prediction Market and Parimutuel Mechanism Prediction Market and Parimutuel Mechanism Yinyu Ye MS&E and ICME Stanford University Joint work with Agrawal, Peters, So and Wang Math. of Ranking, AIM, 2 Outline World-Cup Betting Example Market for

More information

Two Machine Learning Approaches to Understand the NBA Data

Two Machine Learning Approaches to Understand the NBA Data Two Machine Learning Approaches to Understand the NBA Data Panagiotis Lolas December 14, 2017 1 Introduction In this project, I consider applications of machine learning in the analysis of nba data. To

More information

San Francisco State University ECON 560 Summer Midterm Exam 2. Monday, July hour 15 minutes

San Francisco State University ECON 560 Summer Midterm Exam 2. Monday, July hour 15 minutes San Francisco State University Michael Bar ECON 560 Summer 2018 Midterm Exam 2 Monday, July 30 1 hour 15 minutes Name: Instructions 1. This is closed book, closed notes exam. 2. No calculators or electronic

More information

Evaluating and Classifying NBA Free Agents

Evaluating and Classifying NBA Free Agents Evaluating and Classifying NBA Free Agents Shanwei Yan In this project, I applied machine learning techniques to perform multiclass classification on free agents by using game statistics, which is useful

More information

SNARKs with Preprocessing. Eran Tromer

SNARKs with Preprocessing. Eran Tromer SNARKs with Preprocessing Eran Tromer BIU Winter School on Verifiable Computation and Special Encryption 4-7 Jan 206 G: Someone generates and publishes a common reference string P: Prover picks NP statement

More information

New Class of Almost Unbiased Modified Ratio Cum Product Estimators with Knownparameters of Auxiliary Variables

New Class of Almost Unbiased Modified Ratio Cum Product Estimators with Knownparameters of Auxiliary Variables Journal of Mathematics and System Science 7 (017) 48-60 doi: 10.1765/159-591/017.09.00 D DAVID PUBLISHING New Class of Almost Unbiased Modified Ratio Cum Product Estimators with Knownparameters of Auxiliary

More information

Machine Learning Application in Aviation Safety

Machine Learning Application in Aviation Safety Machine Learning Application in Aviation Safety Surface Safety Metric MOR Classification Presented to: By: Date: ART Firdu Bati, PhD, FAA September, 2018 Agenda Surface Safety Metric (SSM) development

More information

Finding your feet: modelling the batting abilities of cricketers using Gaussian processes

Finding your feet: modelling the batting abilities of cricketers using Gaussian processes Finding your feet: modelling the batting abilities of cricketers using Gaussian processes Oliver Stevenson & Brendon Brewer PhD candidate, Department of Statistics, University of Auckland o.stevenson@auckland.ac.nz

More information

Below are the graphing equivalents of the above constraints.

Below are the graphing equivalents of the above constraints. Detailed Step- by- Step description of the Wilson Problem Part A. LP Formulation Step 1. Figure out the problem Although it may seem obvious, the first step in solving the Wilson Problem is to understand

More information

Design Project 2 Sizing of a Bicycle Chain Ring Bolt Set ENGR 0135 Sangyeop Lee November 16, 2016 Jordan Gittleman Noah Sargent Seth Strayer Desmond

Design Project 2 Sizing of a Bicycle Chain Ring Bolt Set ENGR 0135 Sangyeop Lee November 16, 2016 Jordan Gittleman Noah Sargent Seth Strayer Desmond 1 Design Project 2 Sizing of a Bicycle Chain Ring Bolt Set ENGR 0135 Sangyeop Lee November 16, 2016 Jordan Gittleman Noah Sargent Seth Strayer Desmond Zheng 2 Abstract This report will analyze our calculations,

More information

ISyE 6414: Regression Analysis

ISyE 6414: Regression Analysis ISyE 6414: Regression Analysis Lectures: MWF 8:00-10:30, MRDC #2404 Early five-week session; May 14- June 15 (8:00-9:10; 10-min break; 9:20-10:30) Instructor: Dr. Yajun Mei ( YA_JUNE MAY ) Email: ymei@isye.gatech.edu;

More information

Optimization and Search. Jim Tørresen Optimization and Search

Optimization and Search. Jim Tørresen Optimization and Search Optimization and Search INF3490 - Biologically inspired computing Lecture 1: Marsland chapter 9.1, 9.4-9.6 2017 Optimization and Search Jim Tørresen 2 Optimization and Search Methods (selection) Optimization

More information

Attacking and defending neural networks. HU Xiaolin ( 胡晓林 ) Department of Computer Science and Technology Tsinghua University, Beijing, China

Attacking and defending neural networks. HU Xiaolin ( 胡晓林 ) Department of Computer Science and Technology Tsinghua University, Beijing, China Attacking and defending neural networks HU Xiaolin ( 胡晓林 ) Department of Computer Science and Technology Tsinghua University, Beijing, China Outline Background Attacking methods Defending methods 2 AI

More information

Polynomial DC decompositions

Polynomial DC decompositions Polynomial DC decompositions Georgina Hall Princeton, ORFE Joint work with Amir Ali Ahmadi Princeton, ORFE 7/31/16 DIMACS Distance geometry workshop 1 Difference of convex (dc) programming Problems of

More information

Combining Experimental and Non-Experimental Design in Causal Inference

Combining Experimental and Non-Experimental Design in Causal Inference Combining Experimental and Non-Experimental Design in Causal Inference Kari Lock Morgan Department of Statistics Penn State University Rao Prize Conference May 12 th, 2017 A Tribute to Don Design trumps

More information

ISyE 6414 Regression Analysis

ISyE 6414 Regression Analysis ISyE 6414 Regression Analysis Lecture 2: More Simple linear Regression: R-squared (coefficient of variation/determination) Correlation analysis: Pearson s correlation Spearman s rank correlation Variable

More information

Parsimonious Linear Fingerprinting for Time Series

Parsimonious Linear Fingerprinting for Time Series Parsimonious Linear Fingerprinting for Time Series Lei Li, B. Aditya Prakash, Christos Faloutsos School of Computer Science Carnegie Mellon University VLDB 2010 1 L. Li, 2010 VLDB2010, 36 th International

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 256 Introduction This procedure computes summary statistics and common non-parametric, single-sample runs tests for a series of n numeric, binary, or categorical data values. For numeric data,

More information

A Chiller Control Algorithm for Multiple Variablespeed Centrifugal Compressors

A Chiller Control Algorithm for Multiple Variablespeed Centrifugal Compressors Purdue University Purdue e-pubs International Compressor Engineering Conference School of Mechanical Engineering 2014 A Chiller Control Algorithm for Multiple Variablespeed Centrifugal Compressors Piero

More information

Analysis of Shear Lag in Steel Angle Connectors

Analysis of Shear Lag in Steel Angle Connectors University of New Hampshire University of New Hampshire Scholars' Repository Honors Theses and Capstones Student Scholarship Spring 2013 Analysis of Shear Lag in Steel Angle Connectors Benjamin Sawyer

More information

COMP Intro to Logic for Computer Scientists. Lecture 13

COMP Intro to Logic for Computer Scientists. Lecture 13 COMP 1002 Intro to Logic for Computer Scientists Lecture 13 B 5 2 J Admin stuff Assignments schedule? Split a2 and a3 in two (A2,3,4,5), 5% each. A2 due Feb 17 th. Midterm date? March 2 nd. No office hour

More information

Modelling the distribution of first innings runs in T20 Cricket

Modelling the distribution of first innings runs in T20 Cricket Modelling the distribution of first innings runs in T20 Cricket James Kirkby The joy of smoothing James Kirkby Modelling the distribution of first innings runs in T20 Cricket The joy of smoothing 1 / 22

More information

Mathematics of Pari-Mutuel Wagering

Mathematics of Pari-Mutuel Wagering Millersville University of Pennsylvania April 17, 2014 Project Objectives Model the horse racing process to predict the outcome of a race. Use the win and exacta betting pools to estimate probabilities

More information

HIGH RESOLUTION DEPTH IMAGE RECOVERY ALGORITHM USING GRAYSCALE IMAGE.

HIGH RESOLUTION DEPTH IMAGE RECOVERY ALGORITHM USING GRAYSCALE IMAGE. HIGH RESOLUTION DEPTH IMAGE RECOVERY ALGORITHM USING GRAYSCALE IMAGE Kazunori Uruma 1, Katsumi Konishi 2, Tomohiro Takahashi 1 and Toshihiro Furukawa 1 1 Graduate School of Engineering, Tokyo University

More information

ORF 201 Computer Methods in Problem Solving. Final Project: Dynamic Programming Optimal Sailing Strategies

ORF 201 Computer Methods in Problem Solving. Final Project: Dynamic Programming Optimal Sailing Strategies Princeton University Department of Operations Research and Financial Engineering ORF 201 Computer Methods in Problem Solving Final Project: Dynamic Programming Optimal Sailing Strategies Due 11:59 pm,

More information

Write these equations in your notes if they re not already there. You will want them for Exam 1 & the Final.

Write these equations in your notes if they re not already there. You will want them for Exam 1 & the Final. Tuesday January 30 Assignment 3: Due Friday, 11:59pm.like every Friday Pre-Class Assignment: 15min before class like every class Office Hours: Wed. 10-11am, 204 EAL Help Room: Wed. & Thurs. 6-9pm, here

More information

Simplifying Radical Expressions and the Distance Formula

Simplifying Radical Expressions and the Distance Formula 1 RD. Simplifying Radical Expressions and the Distance Formula In the previous section, we simplified some radical expressions by replacing radical signs with rational exponents, applying the rules of

More information

Abstract In this paper, the author deals with the properties of circumscribed ellipses of convex quadrilaterals, using tools of parallel projective tr

Abstract In this paper, the author deals with the properties of circumscribed ellipses of convex quadrilaterals, using tools of parallel projective tr Study on the Properties of Circumscribed Ellipses of Convex Quadrilaterals Author: Yixi Shen Mentors: Zhongyuan Dai; Yijun Yao No. High School of East China Normal University Shanghai, China December,

More information

Ivan Suarez Robles, Joseph Wu 1

Ivan Suarez Robles, Joseph Wu 1 Ivan Suarez Robles, Joseph Wu 1 Section 1: Introduction Mixed Martial Arts (MMA) is the fastest growing competitive sport in the world. Because the fighters engage in distinct martial art disciplines (boxing,

More information

Bayesian Optimized Random Forest for Movement Classification with Smartphones

Bayesian Optimized Random Forest for Movement Classification with Smartphones Bayesian Optimized Random Forest for Movement Classification with Smartphones 1 2 3 4 Anonymous Author(s) Affiliation Address email 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29

More information

Naïve Bayes. Robot Image Credit: Viktoriya Sukhanova 123RF.com

Naïve Bayes. Robot Image Credit: Viktoriya Sukhanova 123RF.com Naïve Bayes These slides were assembled by Eric Eaton, with grateful acknowledgement of the many others who made their course materials freely available online. Feel free to reuse or adapt these slides

More information

A HYBRID METHOD FOR CALIBRATION OF UNKNOWN PARTIALLY/FULLY CLOSED VALVES IN WATER DISTRIBUTION SYSTEMS ABSTRACT

A HYBRID METHOD FOR CALIBRATION OF UNKNOWN PARTIALLY/FULLY CLOSED VALVES IN WATER DISTRIBUTION SYSTEMS ABSTRACT A HYBRID METHOD FOR CALIBRATION OF UNKNOWN PARTIALLY/FULLY CLOSED VALVES IN WATER DISTRIBUTION SYSTEMS Nhu Cuong Do 1,2, Angus Simpson 3, Jochen Deuerlein 4, Olivier Piller 5 1 University of Saskatchewan,

More information

CS 4649/7649 Robot Intelligence: Planning

CS 4649/7649 Robot Intelligence: Planning CS 4649/7649 Robot Intelligence: Planning Differential Kinematics, Probabilistic Roadmaps Sungmoon Joo School of Interactive Computing College of Computing Georgia Institute of Technology S. Joo (sungmoon.joo@cc.gatech.edu)

More information

PREDICTING the outcomes of sporting events

PREDICTING the outcomes of sporting events CS 229 FINAL PROJECT, AUTUMN 2014 1 Predicting National Basketball Association Winners Jasper Lin, Logan Short, and Vishnu Sundaresan Abstract We used National Basketball Associations box scores from 1991-1998

More information

What is Restrained and Unrestrained Pipes and what is the Strength Criteria

What is Restrained and Unrestrained Pipes and what is the Strength Criteria What is Restrained and Unrestrained Pipes and what is the Strength Criteria Alex Matveev, September 11, 2018 About author: Alex Matveev is one of the authors of pipe stress analysis codes GOST 32388-2013

More information

Naïve Bayes. Robot Image Credit: Viktoriya Sukhanova 123RF.com

Naïve Bayes. Robot Image Credit: Viktoriya Sukhanova 123RF.com Naïve Bayes These slides were assembled by Byron Boots, with only minor modifications from Eric Eaton s slides and grateful acknowledgement to the many others who made their course materials freely available

More information

Modeling the Relationship Between MLB Ballparks and Home Team Performance. Using Shape Analysis. Sara Biesiadny, B.S. A Thesis

Modeling the Relationship Between MLB Ballparks and Home Team Performance. Using Shape Analysis. Sara Biesiadny, B.S. A Thesis Modeling the Relationship Between MLB Ballparks and Home Team Performance Using Shape Analysis by Sara Biesiadny, B.S. A Thesis In Mathematics and Statistics Submitted to the Graduate Faculty of Texas

More information

Functions of Random Variables & Expectation, Mean and Variance

Functions of Random Variables & Expectation, Mean and Variance Functions of Random Variables & Expectation, Mean and Variance Kuan-Yu Chen ( 陳冠宇 ) @ TR-409, NTUST Functions of Random Variables 1 Given a random variables XX, one may generate other random variables

More information

Sample Solution for Problem 1.a

Sample Solution for Problem 1.a Sample Solution for Problem 1.a 1 Inverted Pendulum Model (IPM) 1.1 Equations of Motion and Ground Reaction Forces Figure 1: Scheme of the Inverted Pendulum Model (IPM). The equations of motion of this

More information

A COURSE OUTLINE (September 2001)

A COURSE OUTLINE (September 2001) 189-265A COURSE OUTLINE (September 2001) 1 Topic I. Line integrals: 2 1 2 weeks 1.1 Parametric curves Review of parametrization for lines and circles. Paths and curves. Differentiation and integration

More information

3D Inversion in GM-SYS 3D Modelling

3D Inversion in GM-SYS 3D Modelling 3D Inversion in GM-SYS 3D Modelling GM-SYS 3D provides a wide range of inversion options including inversion for both layer structure and physical properties for gravity and magnetic data. There is an

More information

/435 Artificial Intelligence Fall 2015

/435 Artificial Intelligence Fall 2015 Final Exam 600.335/435 Artificial Intelligence Fall 2015 Name: Section (335/435): Instructions Please be sure to write both your name and section in the space above! Some questions will be exclusive to

More information

Midterm Exam 1, section 2. Thursday, September hour, 15 minutes

Midterm Exam 1, section 2. Thursday, September hour, 15 minutes San Francisco State University Michael Bar ECON 312 Fall 2018 Midterm Exam 1, section 2 Thursday, September 27 1 hour, 15 minutes Name: Instructions 1. This is closed book, closed notes exam. 2. You can

More information

Analysis of Pumping Characteristics of a Multistage Roots Pump

Analysis of Pumping Characteristics of a Multistage Roots Pump Research Paper Applied Science and Convergence Technology Vol.24 No.1, January 2015, pp.9~15 http://dx.doi.org/10.5757/asct.2015.24.1.9 Analysis of Pumping Characteristics of a Multistage Roots Pump S.

More information

College Football Rankings: An Industrial Engineering Approach. An Undergraduate Honors College Thesis. in the

College Football Rankings: An Industrial Engineering Approach. An Undergraduate Honors College Thesis. in the College Football Rankings: An Industrial Engineering Approach An Undergraduate Honors College Thesis in the Department of Industrial Engineering College of Engineering University of Arkansas Fayetteville,

More information

Pre-Kindergarten 2017 Summer Packet. Robert F Woodall Elementary

Pre-Kindergarten 2017 Summer Packet. Robert F Woodall Elementary Pre-Kindergarten 2017 Summer Packet Robert F Woodall Elementary In the fall, on your child s testing day, please bring this packet back for a special reward that will be awarded to your child for completion

More information

Analysis of Gini s Mean Difference for Randomized Block Design

Analysis of Gini s Mean Difference for Randomized Block Design American Journal of Mathematics and Statistics 2015, 5(3): 111-122 DOI: 10.5923/j.ajms.20150503.02 Analysis of Gini s Mean Difference for Randomized Block Design Elsayed A. H. Elamir Department of Statistics

More information

COMPLEX NUMBERS. Powers of j. Consider the quadratic equation; It has no solutions in the real number system since. j 1 ie. j 2 1

COMPLEX NUMBERS. Powers of j. Consider the quadratic equation; It has no solutions in the real number system since. j 1 ie. j 2 1 COMPLEX NUMBERS Consider the quadratic equation; x 0 It has no solutions in the real number system since x or x j j ie. j Similarly x 6 0 gives x 6 or x 6 j Powers of j j j j j. j j j j.j jj j j 5 j j

More information

STANDARD SCORES AND THE NORMAL DISTRIBUTION

STANDARD SCORES AND THE NORMAL DISTRIBUTION STANDARD SCORES AND THE NORMAL DISTRIBUTION REVIEW 1.MEASURES OF CENTRAL TENDENCY A.MEAN B.MEDIAN C.MODE 2.MEASURES OF DISPERSIONS OR VARIABILITY A.RANGE B.DEVIATION FROM THE MEAN C.VARIANCE D.STANDARD

More information

Determination of the Design Load for Structural Safety Assessment against Gas Explosion in Offshore Topside

Determination of the Design Load for Structural Safety Assessment against Gas Explosion in Offshore Topside Determination of the Design Load for Structural Safety Assessment against Gas Explosion in Offshore Topside Migyeong Kim a, Gyusung Kim a, *, Jongjin Jung a and Wooseung Sim a a Advanced Technology Institute,

More information

Use of Auxiliary Variables and Asymptotically Optimum Estimators in Double Sampling

Use of Auxiliary Variables and Asymptotically Optimum Estimators in Double Sampling International Journal of Statistics and Probability; Vol. 5, No. 3; May 2016 ISSN 1927-7032 E-ISSN 1927-7040 Published by Canadian Center of Science and Education Use of Auxiliary Variables and Asymptotically

More information

Massey Method. Introduction. The Process

Massey Method. Introduction. The Process Massey Method Introduction Massey s Method, also referred to as the Point Spread Method, is a rating method created by mathematics professor Kenneth Massey. It is currently used to determine which teams

More information

Practical Approach to Evacuation Planning Via Network Flow and Deep Learning

Practical Approach to Evacuation Planning Via Network Flow and Deep Learning Practical Approach to Evacuation Planning Via Network Flow and Deep Learning Akira Tanaka Nozomi Hata Nariaki Tateiwa Katsuki Fujisawa Graduate School of Mathematics, Kyushu University Institute of Mathematics

More information

On the Impact of Garbage Collection on flash-based SSD endurance

On the Impact of Garbage Collection on flash-based SSD endurance On the Impact of Garbage Collection on flash-based SSD endurance Robin Verschoren and Benny Van Houdt Dept. Mathematics and Computer Science University of Antwerp Antwerp, Belgium INFLOW 16 Robin Verschoren

More information

Anabela Brandão and Doug S. Butterworth

Anabela Brandão and Doug S. Butterworth Obtaining a standardised CPUE series for toothfish (Dissostichus eleginoides) in the Prince Edward Islands EEZ calibrated to incorporate both longline and trotline data over the period 1997-2013 Anabela

More information

Probabilistic Modelling in Multi-Competitor Games. Christopher J. Farmer

Probabilistic Modelling in Multi-Competitor Games. Christopher J. Farmer Probabilistic Modelling in Multi-Competitor Games Christopher J. Farmer Master of Science Division of Informatics University of Edinburgh 2003 Abstract Previous studies in to predictive modelling of sports

More information

STAT/MATH 395 PROBABILITY II

STAT/MATH 395 PROBABILITY II STAT/MATH 395 PROBABILITY II Quick review on Discrete Random Variables Néhémy Lim University of Washington Winter 2017 Example Pick 5 toppings from a total of 15. Give the sample space Ω of the experiment

More information

Modelling and Simulation of Environmental Disturbances

Modelling and Simulation of Environmental Disturbances Modelling and Simulation of Environmental Disturbances (Module 5) Dr Tristan Perez Centre for Complex Dynamic Systems and Control (CDSC) Prof. Thor I Fossen Department of Engineering Cybernetics 18/09/2007

More information

A definition of depth for functional observations

A definition of depth for functional observations A definition of depth for functional observations Sara López-Pintado and Juan Romo Departamento de Estadística y Econometría Universidad Carlos III de Madrid Madrid, Spain A definition of depth for functional

More information

1.1 The size of the search space Modeling the problem Change over time Constraints... 21

1.1 The size of the search space Modeling the problem Change over time Constraints... 21 Introduction : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 1 I What Are the Ages of My Three Sons? : : : : : : : : : : : : : : : : : 9 1 Why Are Some Problems Dicult to Solve? : : :

More information

WHEN TECHNOLOGICAL COEFFICIENTS CHANGES NEED TO BE ENDOGENOUS

WHEN TECHNOLOGICAL COEFFICIENTS CHANGES NEED TO BE ENDOGENOUS WHEN TECHNOLOGICAL COEFFICIENTS CHANGES NEED TO BE ENDOGENOUS Maurizio Grassini 24 st Inforum World Conference Osnabrueck 29 August 2 September 2016 Import shares in an Inforum Model When the IO table

More information

Chapter 20. Planning Accelerated Life Tests. William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University

Chapter 20. Planning Accelerated Life Tests. William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University Chapter 20 Planning Accelerated Life Tests William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University Copyright 1998-2008 W. Q. Meeker and L. A. Escobar. Based on the authors

More information

Copyright 2017 by Glenn Daniel Sidle. All Rights Reserved

Copyright 2017 by Glenn Daniel Sidle. All Rights Reserved ABSTRACT SIDLE, GLENN DANIEL. Using Multi-Class Machine Learning Methods to Predict Major League Baseball Pitches. (Under the direction of Hien Tran.) As the field of machine learning and its applications

More information

DNS Study on Three Vortex Identification Methods

DNS Study on Three Vortex Identification Methods Γ DNS Study on Three Vortex Identification Methods Yinlin Dong Yong Yang Chaoqun Liu Technical Report 2016-07 http://www.uta.edu/math/preprint/ DNS Study on Three Vortex Identification Methods Yinlin Dong

More information

Chapter 2 Ventilation Network Analysis

Chapter 2 Ventilation Network Analysis Chapter 2 Ventilation Network Analysis Abstract Ventilation network analysis deals with complex working procedures to calculate the air currents flowing in the various meshes or branches of a mine network.

More information

Bézier Curves and Splines

Bézier Curves and Splines CS-C3100 Computer Graphics Bézier Curves and Splines Majority of slides from Frédo Durand vectorportal.com CS-C3100 Fall 2016 Lehtinen Before We Begin Anything on your mind concerning Assignment 1? Linux

More information

Physical Design of CMOS Integrated Circuits

Physical Design of CMOS Integrated Circuits Physical Design of CMOS Integrated Circuits Dae Hyun Kim EECS Washington State University References John P. Uyemura, Introduction to VLSI Circuits and Systems, 2002. Chapter 5 Goal Understand how to physically

More information

Gait Based Personal Identification System Using Rotation Sensor

Gait Based Personal Identification System Using Rotation Sensor Gait Based Personal Identification System Using Rotation Sensor Soumik Mondal, Anup Nandy, Pavan Chakraborty, G. C. Nandi Robotics & AI Lab, Indian Institute of Information Technology Allahabad, India

More information

Ocean Fishing Fleet Scheduling Path Optimization Model Research. Based On Improved Ant Colony Algorithm

Ocean Fishing Fleet Scheduling Path Optimization Model Research. Based On Improved Ant Colony Algorithm 4th International Conference on Sensors, Measurement and Intelligent Materials (ICSMIM 205) Ocean Fishing Fleet Scheduling Path Optimization Model Research Based On Improved Ant Colony Algorithm Li Jia-lin2,

More information

Ranking teams in partially-disjoint tournaments

Ranking teams in partially-disjoint tournaments Ranking teams in partially-disjoint tournaments Alex Choy Mentor: Chris Jones September 16, 2013 1 Introduction Throughout sports, whether it is professional or collegiate sports, teams are ranked. In

More information

Modeling the NCAA Tournament Through Bayesian Logistic Regression

Modeling the NCAA Tournament Through Bayesian Logistic Regression Duquesne University Duquesne Scholarship Collection Electronic Theses and Dissertations 0 Modeling the NCAA Tournament Through Bayesian Logistic Regression Bryan Nelson Follow this and additional works

More information

OPTIMAL TRAJECTORY GENERATION OF COMPASS-GAIT BIPED BASED ON PASSIVE DYNAMIC WALKING

OPTIMAL TRAJECTORY GENERATION OF COMPASS-GAIT BIPED BASED ON PASSIVE DYNAMIC WALKING OPTIMAL TRAJECTORY GENERATION OF COMPASS-GAIT BIPED BASED ON PASSIVE DYNAMIC WALKING Minseung Kim Dept. of Computer Science Illinois Institute of Technology 3201 S. State St. Box 2082 Chicago IL 60616

More information