Discriminative Feature Selection for Uncertain Graph Classification

Size: px
Start display at page:

Download "Discriminative Feature Selection for Uncertain Graph Classification"

Transcription

1 Discriminative Feature Selection for Uncertain Graph Classification Xiangnan Kong University of Illinois at Chicago joint work with Philip S. Yu (Univ. Illinois at Chicago) Xue Wang & Ann B. Ragin (Northwestern Univ.)

2 Brain: A Complex Machine How it works? When something is wrong... Alzheimer s Disease ADHD

3 Neuroimaging fmri: A Video of brain activities But, Can you tell which brain is normal?

4 Brain as a Network Brain Activities Functional Connections

5 Brain Region Connectivity healthy brain Good Family

6 Brain Region Connectivity ADHD brain Connectivity problem

7 Brain Activities as an Uncertain Graph Not sure how exactly the network looks like probability that the connection existing in practice

8 Uncertain Graph uncertain graph A B C possible worlds

9 Uncertain Graph uncertain graph A B C possible worlds

10 Uncertain Graph Classification Problem label! label! label! ADHD? +/- Uncertain Graphs Discriminative Subgraph Features x! label! x! label! x! label! Feature Vectors

11 How to tell if a subgraph is Discriminative Uncertain Graphs B A 0.9 C B + A A C B C B A 0.1 C G 1 G 2 G3 G4 Subgraph Features A A B C C B g 1 g 2 g 3 frequent in uncertain graphs discriminative in certain graphs C discriminative in uncertain graphs

12 Discriminative Scores of a Subgraph Certain graphs Utility Score G-test Score label! label! label! Frequency Ratio Confidence HSIC subgraph feature a certain value

13 Discriminative Scores of a Subgraph Uncertain graphs Utility Scores Probability subgraph label! label! label! a distribution

14 How to get the distribution? Confidence It depends on the utility function... Frequency Ratio G-test Score HSIC Generalized Utility Function for Certain Graphs Table 2: Summary of Discrimination Score Functions. Name f(n g +,ng,n +,n ) confidence frequency ratio G-test HSIC(linear) n g + n g + +ng g + n log n n g n + 2n g + ln ng + n n g n + (n g + n ng n + )2 (n + +n 1) 2 (n + +n ) 2 +2(n + n g + ) ln n (n + ng + ) n + (n n g )

15 Dynamic Programming n + i 1 0 [ Pr n g + = i, D ] + (k) 0 1 k n + Pr [n g + = n +, D ] + Pr [n g + =1, D ] + Pr [n g + =0, D ] + [ Pr i, D(k) ] = [ ] ( 1 Pr[g G ) k ] Pr[i, D(k 1)] + Pr[g G(k)] Pr[i 1, D(k 1)] if i k 1 if i = k =0 0 if i>kor i<0

16 How to Measure? MedianMode Mean Subgraph A Probability phi-prob 0 + Mean Mode Subgraph B Probability 0 Median phi-prob + Frequency Ratio

17 Subgraph Statistical Measures ) Mean: Mode: Median: phi-probability: ( Exp F (g, D) ) ( ) Median F (g, D) = ( Mode F (g, D) ) + s= =argmax S ( ϕ-pr F (g, D) ) s Pr[F (g, D) =s] = =argmax s s=ϕ [ Pr F (g, D) ] =s S [ ] Pr F (g, D) =s s= + Pr[F (g, D) =s] 1 2 More Details in the Paper 0-edges 1-edge Pattern Search Tree 2-edges

18 Data Sets Graphs: Brain Images (fmri) Class Label: Brain Diseases Table 3: Summary of experimental datasets. D D + D V avg. E avg. edge prob ADHD ADNI HIV Alzheimer s Disease ADHD

19 Compared Methods Certain Graph Methods Frequent Subgraphs in Uncertain Graphs Utility Functions Statistical Measures Confidence Frequency Ratio HSIC G-test Score Mean Mode Median Phi - probability

20 Uncertain Graph Helps Error Rate Methods t =100 t =200 t =300 t =400 t =500 Uncertain Graph Methods Certain Graph Methods Exp-HSIC (9) (8) (10) (4) (9) Med-HSIC (14) (5) (6) (8) (7) Mod-HSIC (6) (3) (1)* (4) (2) ϕpr-hsic (1)* (1)* (6) (7) (2) HSIC (16) (19) (17) (18) (18) Exp-Ratio (14) (10) (4) (2) (2) Med-Ratio (16) (15) (16) (11) (11) Mod-Ratio (3) (5) (15) (13) (15) ϕpr-ratio (9) (2) (1)* (2) (1)* Ratio (19) (20) (22) (22) (20) Exp-Gtest (2) (8) (4) (8) (11) Med-Gtest (21) (18) (11) (18) (17) Mod-Gtest (21) (22) (21) (18) (19) ϕpr-gtest (16) (15) (13) (11) (2) Gtest (19) (21) (17) (14) (21) Exp-Conf (7) (3) (1)* (1)* (2) Med-Conf (4) (5) (8) (8) (7) Mod-Conf (12) (10) (8) (4) (9) ϕpr-conf (9) (15) (17) (16) (13) Conf (9) (13) (13) (15) (15) Exp-Freq (8) (10) (11) (16) (13) Freq (5) (13) (20) (21) (21)

21 Discriminative Function Helps Error Rate Methods t =100 t =200 t =300 t =400 t =500 Discrimin ative Exp-HSIC (9) (8) (10) (4) (9) Med-HSIC (14) (5) (6) (8) (7) Mod-HSIC (6) (3) (1)* (4) (2) ϕpr-hsic (1)* (1)* (6) (7) (2) HSIC (16) (19) (17) (18) (18) Exp-Ratio (14) (10) (4) (2) (2) Med-Ratio (16) (15) (16) (11) (11) Mod-Ratio (3) (5) (15) (13) (15) ϕpr-ratio (9) (2) (1)* (2) (1)* Ratio (19) (20) (22) (22) (20) Exp-Gtest (2) (8) (4) (8) (11) Med-Gtest (21) (18) (11) (18) (17) Mod-Gtest (21) (22) (21) (18) (19) ϕpr-gtest (16) (15) (13) (11) (2) Gtest (19) (21) (17) (14) (21) Frequent Exp-Conf (7) (3) (1)* (1)* (2) Med-Conf (4) (5) (8) (8) (7) Mod-Conf (12) (10) (8) (4) (9) ϕpr-conf (9) (15) (17) (16) (13) Conf (9) (13) (13) (15) (15) Exp-Freq (8) (10) (11) (16) (13) Freq (5) (13) (20) (21) (21)

22 Statistical Measure Helps Error Rate mean median mode phi-prob Methods t =100 t =200 t =300 t =400 t =500 Exp-HSIC (9) (8) (10) (4) (9) Med-HSIC (14) (5) (6) (8) (7) Mod-HSIC (6) (3) (1)* (4) (2) ϕpr-hsic (1)* (1)* (6) (7) (2) HSIC (16) (19) (17) (18) (18) Exp-Ratio (14) (10) (4) (2) (2) Med-Ratio (16) (15) (16) (11) (11) Mod-Ratio (3) (5) (15) (13) (15) ϕpr-ratio (9) (2) (1)* (2) (1)* Ratio (19) (20) (22) (22) (20) Exp-Gtest (2) (8) (4) (8) (11) Med-Gtest (21) (18) (11) (18) (17) Mod-Gtest (21) (22) (21) (18) (19) ϕpr-gtest (16) (15) (13) (11) (2) Gtest (19) (21) (17) (14) (21) Exp-Conf (7) (3) (1)* (1)* (2) Med-Conf (4) (5) (8) (8) (7) Mod-Conf (12) (10) (8) (4) (9) ϕpr-conf (9) (15) (17) (16) (13) Conf (9) (13) (13) (15) (15) Exp-Freq (8) (10) (11) (16) (13) Freq (5) (13) (20) (21) (21)

23 Summary Mining discriminative subgraph features for uncertain graph classification model brains as uncertain graphs mining discriminative subgraph features using different statistical measures

24 Q&A

Unit 6 Day 2 Notes Central Tendency from a Histogram; Box Plots

Unit 6 Day 2 Notes Central Tendency from a Histogram; Box Plots AFM Unit 6 Day 2 Notes Central Tendency from a Histogram; Box Plots Name Date To find the mean, median and mode from a histogram, you first need to know how many data points were used. Use the frequency

More information

BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG

BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG BASKETBALL PREDICTION ANALYSIS OF MARCH MADNESS GAMES CHRIS TSENG YIBO WANG GOAL OF PROJECT The goal is to predict the winners between college men s basketball teams competing in the 2018 (NCAA) s March

More information

Sensing and Modeling of Terrain Features using Crawling Robots

Sensing and Modeling of Terrain Features using Crawling Robots Czech Technical University in Prague Sensing and Modeling of Terrain Features using Crawling Robots Jakub Mrva 1 Faculty of Electrical Engineering Agent Technology Center Computational Robotics Laboratory

More information

Inferring land use from mobile phone activity

Inferring land use from mobile phone activity Inferring land use from mobile phone activity Jameson L. Toole (MIT) Michael Ulm (AIT) Dietmar Bauer (AIT) Marta C. Gonzalez (MIT) UrbComp 2012 Beijing, China 1 The Big Questions Can land use be predicted

More information

Predicting Horse Racing Results with TensorFlow

Predicting Horse Racing Results with TensorFlow Predicting Horse Racing Results with TensorFlow LYU 1703 LIU YIDE WANG ZUOYANG News CUHK Professor, Gu Mingao, wins 50 MILLIONS dividend using his sure-win statistical strategy. News AlphaGO defeats human

More information

How Do Injuries in the NFL Affect the Outcome of the Game

How Do Injuries in the NFL Affect the Outcome of the Game How Do Injuries in the NFL Affect the Outcome of the Game Andy Sun STATS 50 2014 Motivation NFL injury surveillance shows upward trend in both the number and severity of injuries Packers won 2010 Super

More information

Current Signature Trend 12C

Current Signature Trend 12C Current Signature Trend 12C DB 20 log (Amps60/AmpsSb) < 51DB Alert 2004, 2010 and 2013 Data points (identified With green dot) have poor/suspect frequency match between frequency of highest sideband and

More information

F For this paper you must have:

F For this paper you must have: Centre Number Surname Candidate Number For Examiner s Use Other Names Candidate Signature Examiner s Initials Mathematics Unit 1 General Certificate of Secondary Education Foundation Tier March 2011 43601F

More information

NAME: A graph contains five major parts: a. Title b. The independent variable c. The dependent variable d. The scales for each variable e.

NAME: A graph contains five major parts: a. Title b. The independent variable c. The dependent variable d. The scales for each variable e. NAME: Graphing is an important procedure used by scientists to display the data that is collected during a controlled experiment. Line graphs demonstrate change over time and must be constructed correctly

More information

INTRODUCTION TO PATTERN RECOGNITION

INTRODUCTION TO PATTERN RECOGNITION INTRODUCTION TO PATTERN RECOGNITION 3 Introduction Our ability to recognize a face, to understand spoken words, to read handwritten characters all these abilities belong to the complex processes of pattern

More information

Fundamentals of Machine Learning for Predictive Data Analytics

Fundamentals of Machine Learning for Predictive Data Analytics Fundamentals of Machine Learning for Predictive Data Analytics Appendix A Descriptive Statistics and Data Visualization for Machine learning John Kelleher and Brian Mac Namee and Aoife D Arcy john.d.kelleher@dit.ie

More information

CS145: INTRODUCTION TO DATA MINING

CS145: INTRODUCTION TO DATA MINING CS145: INTRODUCTION TO DATA MINING 3: Vector Data: Logistic Regression Instructor: Yizhou Sun yzsun@cs.ucla.edu October 9, 2017 Methods to Learn Vector Data Set Data Sequence Data Text Data Classification

More information

DATA HANDLING EXAM QUESTIONS

DATA HANDLING EXAM QUESTIONS DATA HANDLING EXAM QUESTIONS MARK SCHEME 1. (a) Reason 1 B1 Makes some mention of bias either directly or making reference to an insufficient or biased range of responses (b) Reason 1 B1 (a) an insufficient

More information

POKEMON HACKS. Jodie Ashford Josh Baggott Chloe Barnes Jordan bird

POKEMON HACKS. Jodie Ashford Josh Baggott Chloe Barnes Jordan bird POKEMON HACKS Jodie Ashford Josh Baggott Chloe Barnes Jordan bird Why pokemon? 1997 The Pokemon Problem an episode of the anime caused 685 children to have seizures Professor Graham Harding from Aston

More information

knn & Naïve Bayes Hongning Wang

knn & Naïve Bayes Hongning Wang knn & Naïve Bayes Hongning Wang CS@UVa Today s lecture Instance-based classifiers k nearest neighbors Non-parametric learning algorithm Model-based classifiers Naïve Bayes classifier A generative model

More information

Running head: DATA ANALYSIS AND INTERPRETATION 1

Running head: DATA ANALYSIS AND INTERPRETATION 1 Running head: DATA ANALYSIS AND INTERPRETATION 1 Data Analysis and Interpretation Final Project Vernon Tilly Jr. University of Central Oklahoma DATA ANALYSIS AND INTERPRETATION 2 Owners of the various

More information

Fish Biorobotics. Fishes as model systems for understanding aquatic propulsion. George V. Lauder Harvard University

Fish Biorobotics. Fishes as model systems for understanding aquatic propulsion. George V. Lauder Harvard University Fish Biorobotics Fishes as model systems for understanding aquatic propulsion 28,000 species of fishes Half of all vertebrates are fishes Fishes are 550 million years old Tremendous opportunity for selection

More information

Modelling Exposure at Default Without Conversion Factors for Revolving Facilities

Modelling Exposure at Default Without Conversion Factors for Revolving Facilities Modelling Exposure at Default Without Conversion Factors for Revolving Facilities Mark Thackham Credit Scoring and Credit Control XV, Edinburgh, August 2017 1 / 27 Objective The objective of this presentation

More information

Example: sequence data set wit two loci [simula

Example: sequence data set wit two loci [simula Example: sequence data set wit two loci [simulated data] -- 1 Example: sequence data set wit two loci [simula MIGRATION RATE AND POPULATION SIZE ESTIMATION using the coalescent and maximum likelihood or

More information

Exploring Measures of Central Tendency (mean, median and mode) Exploring range as a measure of dispersion

Exploring Measures of Central Tendency (mean, median and mode) Exploring range as a measure of dispersion Unit 5 Statistical Reasoning 1 5.1 Exploring Data Goals: Exploring Measures of Central Tendency (mean, median and mode) Exploring range as a measure of dispersion Data: A set of values. A set of data can

More information

VI Classifications of Weldments

VI Classifications of Weldments VI Classifications of Weldments Good Bad AM 11/03 1 Outline TWI Classification system AISC classification system An alternative classification system AM 11/03 2 B - Longitudinal butt AM 11/03 3 C - longit.

More information

A computer program that improves its performance at some task through experience.

A computer program that improves its performance at some task through experience. 1 A computer program that improves its performance at some task through experience. 2 Example: Learn to Diagnose Patients T: Diagnose tumors from images P: Percent of patients correctly diagnosed E: Pre

More information

Mobility Detection Using Everyday GSM Traces

Mobility Detection Using Everyday GSM Traces Mobility Detection Using Everyday GSM Traces Timothy Sohn et al Philip Cootey pcootey@wpi.edu (03/22/2011) Mobility Detection High level activity discerned from course Grained GSM Data provides immediate

More information

1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data.

1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data. 1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data. Green Blue Brown Blue Blue Brown Blue Blue Blue Green Blue Brown Blue Brown Brown Blue

More information

Looking at Statistical Graphics Rigorously

Looking at Statistical Graphics Rigorously Looking at Statistical Graphics Rigorously Jamie Monogan Washington University in St. Louis September 13, 2010 Jamie Monogan (WUStL) Looking at Statistical Graphics Rigorously September 13, 2010 1 / 20

More information

Data: Central Tendency, Box & Whisker Plot Long-Term Memory Review Review 1

Data: Central Tendency, Box & Whisker Plot Long-Term Memory Review Review 1 Review 1 2. The mean, median and mode define the measures of tendencies. Tina kept a record of her points from playing basketball for the last two weeks. Points Scored for each Game GAME 1 2 3 POINTS 13

More information

Reliable Real-Time Recognition of Motion Related Human Activities using MEMS Inertial Sensors

Reliable Real-Time Recognition of Motion Related Human Activities using MEMS Inertial Sensors Reliable Real-Time Recognition of Motion Related Human Activities using MEMS Inertial Sensors K. Frank,, y; M.J. Vera-Nadales, University of Malaga, Spain; P. Robertson, M. Angermann, v36 Motivation 2

More information

Internet Technology Fundamentals. To use a passing score at the percentiles listed below:

Internet Technology Fundamentals. To use a passing score at the percentiles listed below: Internet Technology Fundamentals To use a passing score at the percentiles listed below: PASS candidates with this score or HIGHER: 2.90 High Scores Medium Scores Low Scores Percentile Rank Proficiency

More information

Understanding Rider Differences in Mileage and Riding Frequency through the MSF100 Motorcyclists Naturalistic Study.

Understanding Rider Differences in Mileage and Riding Frequency through the MSF100 Motorcyclists Naturalistic Study. 2/18/216 Understanding Rider Differences in Mileage and Riding through the MSF1 Motorcyclists Naturalistic Study January 13, 214 Mac McCall Shane McLaughlin Motorcycle Research Group 1 Overview Objective

More information

Chapter 2: Modeling Distributions of Data

Chapter 2: Modeling Distributions of Data Chapter 2: Modeling Distributions of Data Section 2.1 The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 2 Modeling Distributions of Data 2.1 2.2 Normal Distributions Section

More information

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Wenbing Zhao. Department of Electrical and Computer Engineering

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Wenbing Zhao. Department of Electrical and Computer Engineering EEC 686/785 Modeling & Performance Evaluation of Computer Systems Lecture 6 Department of Electrical and Computer Engineering Cleveland State University wenbing@ieee.org Outline 2 Review of lecture 5 The

More information

Marine Renewables Industry Association. Marine Renewables Industry: Requirements for Oceanographic Measurements, Data Processing and Modelling

Marine Renewables Industry Association. Marine Renewables Industry: Requirements for Oceanographic Measurements, Data Processing and Modelling Marine Renewables Industry Association Marine Renewables Industry: Requirements for Oceanographic Measurements, Data Processing and Modelling October 2009 Table of Contents 1. Introduction... 1 2. Measurements

More information

Empirical Example II of Chapter 7

Empirical Example II of Chapter 7 Empirical Example II of Chapter 7 1. We use NBA data. The description of variables is --- --- --- storage display value variable name type format label variable label marr byte %9.2f =1 if married wage

More information

Outline. Terminology. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Steps in Capacity Planning and Management

Outline. Terminology. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Steps in Capacity Planning and Management EEC 686/785 Modeling & Performance Evaluation of Computer Systems Lecture 6 Department of Electrical and Computer Engineering Cleveland State University wenbing@ieee.org Outline Review of lecture 5 The

More information

Parsimonious Linear Fingerprinting for Time Series

Parsimonious Linear Fingerprinting for Time Series Parsimonious Linear Fingerprinting for Time Series Lei Li, B. Aditya Prakash, Christos Faloutsos School of Computer Science Carnegie Mellon University VLDB 2010 1 L. Li, 2010 VLDB2010, 36 th International

More information

Evaluating and Classifying NBA Free Agents

Evaluating and Classifying NBA Free Agents Evaluating and Classifying NBA Free Agents Shanwei Yan In this project, I applied machine learning techniques to perform multiclass classification on free agents by using game statistics, which is useful

More information

MWGen: A Mini World Generator

MWGen: A Mini World Generator MWGen: A Mini World Generator Jianqiu Xu and Ralf Hartmut Güting Database Systems for New Applications, Mathematics and Computer Science FernUniversität in Hagen, Germany July, 2012 Outline 1 Problem Description

More information

Hazard and Risk Assessment Guide

Hazard and Risk Assessment Guide Page 1 Safety legislation in Canada (Occupational Health & Safety Act, Sec. 25) entails employers to do everything that is reasonable to ensure their workers have a healthy and safe workplace. The first

More information

ASMFC Stock Assessment Overview: Red Drum

ASMFC Stock Assessment Overview: Red Drum Purpose The purpose of this document is to improve the understanding and transparency of the Commission s stock assessment process and results. It is the first of several that will be developed throughout

More information

A Bag-of-Gait Model for Gait Recognition

A Bag-of-Gait Model for Gait Recognition A Bag-of-Gait Model for Gait Recognition Jianzhao Qin, T. Luo, W. Shao, R. H. Y. Chung and K. P. Chow The Department of Computer Science, The University of Hong Kong, Hong Kong, China Abstract In this

More information

PREDICTING the outcomes of sporting events

PREDICTING the outcomes of sporting events CS 229 FINAL PROJECT, AUTUMN 2014 1 Predicting National Basketball Association Winners Jasper Lin, Logan Short, and Vishnu Sundaresan Abstract We used National Basketball Associations box scores from 1991-1998

More information

5.1. Data Displays Batter Up. My Notes ACTIVITY

5.1. Data Displays Batter Up. My Notes ACTIVITY SUGGESTED LEARNING STRATEGIES: Activating Prior Knowledge, Marking the Text, Group Presentation, Interactive Word Wall Henry Hank Aaron and Harmon Killebrew are among the alltime leaders in home runs in

More information

Mrs. Daniel- AP Stats Ch. 2 MC Practice

Mrs. Daniel- AP Stats Ch. 2 MC Practice Mrs. Daniel- AP Stats Ch. 2 MC Practice Name: 1. Jorge s score on Exam 1 in his statistics class was at the 64th percentile of the scores for all students. His score falls (a) between the minimum and the

More information

Level 1 Mathematics and Statistics, 2013

Level 1 Mathematics and Statistics, 2013 91037 910370 1SUPERVISOR S Level 1 Mathematics and Statistics, 2013 91037 Demonstrate understanding of chance and data 9.30 am Wednesday 13 November 2013 Credits: Four Achievement Achievement with Merit

More information

Complete Streets Basics and Benefits

Complete Streets Basics and Benefits Complete Streets Basics and Benefits November 14, 2017 Complete Streets Workshop Ann Ogoreuc, AICP, Allegheny County Economic Development Hannah E. Hardy, Allegheny County Health Department Benefits of

More information

Statistics Class 3. Jan 30, 2012

Statistics Class 3. Jan 30, 2012 Statistics Class 3 Jan 30, 2012 Group Quiz 2 1. The Statistical Abstract of the United States includes the average per capita income for each of the 50 states. When those 50 values are added, then divided

More information

Simultaneous observations of the H2O and SiO masers toward the late-type stars using KVN

Simultaneous observations of the H2O and SiO masers toward the late-type stars using KVN Simultaneous observations of the H2O and SiO masers toward the late-type stars using KVN Youngjoo Yun, Se-Hyung Cho, Richard Dodson, María J. Rioja, and KVN KSP members 2016 EVN Symposium 2016.9.22 Late-type

More information

Statistical Studies: Analyzing Data III.B Student Activity Sheet 6: Analyzing Graphical Displays

Statistical Studies: Analyzing Data III.B Student Activity Sheet 6: Analyzing Graphical Displays The Phoenix Mercury of the Women s National Basketball League had 14 players on the roster for the 2008 season. The players and their average points per game (PPG) are shown below. Player Diana Taurasi

More information

Statistical Studies: Analyzing Data III.B Student Activity Sheet 6: Analyzing Graphical Displays

Statistical Studies: Analyzing Data III.B Student Activity Sheet 6: Analyzing Graphical Displays The Phoenix Mercury of the Women s National Basketball League had 14 players on the roster for the 2008 season. The players and their average points per game (PPG) are shown below. Player Diana Taurasi

More information

STAT 155 Introductory Statistics. Lecture 2-2: Displaying Distributions with Graphs

STAT 155 Introductory Statistics. Lecture 2-2: Displaying Distributions with Graphs The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL STAT 155 Introductory Statistics Lecture 2-2: Displaying Distributions with Graphs 8/31/06 Lecture 2-2 1 Recall Data: Individuals Variables Categorical variables

More information

The Virginia Economy: Labor Markets and Workforce

The Virginia Economy: Labor Markets and Workforce The Virginia Economy: Labor Markets and Workforce April 4, 2017 Sonya Ravindranath Waddell Regional Economist Research Department The views and opinions expressed herein are those of the author. They do

More information

Matrix-analog measure-cerrelatepredict

Matrix-analog measure-cerrelatepredict Matrix-analog measure-cerrelatepredict approach ICEM 2015 22-26 June 2015, Boulder David Hanslian Institute of Atmospheric Physics AS CR "Measure-correlate-predict" (MCP) = methods to estimate long-term

More information

CHAPTER 1 Exploring Data

CHAPTER 1 Exploring Data CHAPTER 1 Exploring Data 1.2 Displaying Quantitative Data with Graphs The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Displaying Quantitative Data

More information

Lab 8 - Continuation of Simulation Let us investigate one of the problems from last week regarding a possible discrimination suit.

Lab 8 - Continuation of Simulation Let us investigate one of the problems from last week regarding a possible discrimination suit. Lab 8 - Continuation of Simulation Let us investigate one of the problems from last week regarding a possible discrimination suit. A company with a large sales staff announces openings for 3 positions

More information

NOTES: STANDARD DEVIATION DAY 4 Textbook Chapter 11.1, 11.3

NOTES: STANDARD DEVIATION DAY 4 Textbook Chapter 11.1, 11.3 NOTES: STANDARD DEVIATION DAY 4 Textbook Chapter 11.1, 11.3 OBJECTIVE: Today you will learn about standard deviation and the normal curve! There are three ways that we can measure how spread out the data

More information

Research Article. Fault diagnosis of power transformer based on chemical properties of insulation oil

Research Article. Fault diagnosis of power transformer based on chemical properties of insulation oil Available online www.jocpr.com Journal of Chemical and Pharmaceutical Research, 015, 7(3):83-88 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 Fault diagnosis of power transformer based on chemical

More information

AN ASSESSMENT OF NEW JERSEY RESIDENT HUNTER OPINION ON CROSSBOW USE

AN ASSESSMENT OF NEW JERSEY RESIDENT HUNTER OPINION ON CROSSBOW USE AN ASSESSMENT OF NEW JERSEY RESIDENT HUNTER OPINION ON CROSSBOW USE Survey mailed: December 2007 Data analyzed: January 2008 Division of Fish and Wildlife David Chanda, Director Larry Herrighty, Assistant

More information

Human Performance Evaluation

Human Performance Evaluation Human Performance Evaluation Minh Nguyen, Liyue Fan, Luciano Nocera, Cyrus Shahabi minhnngu@usc.edu --O-- Integrated Media Systems Center University of Southern California 1 2 Motivating Application 8.2

More information

Biostatistics & SAS programming

Biostatistics & SAS programming Biostatistics & SAS programming Kevin Zhang March 6, 2017 ANOVA 1 Two groups only Independent groups T test Comparison One subject belongs to only one groups and observed only once Thus the observations

More information

A Novel Approach to Predicting the Results of NBA Matches

A Novel Approach to Predicting the Results of NBA Matches A Novel Approach to Predicting the Results of NBA Matches Omid Aryan Stanford University aryano@stanford.edu Ali Reza Sharafat Stanford University sharafat@stanford.edu Abstract The current paper presents

More information

Bending Vibration Analysis of Pipes and Shafts Arranged in Fluid Filled Tubular Spaces Using FEM

Bending Vibration Analysis of Pipes and Shafts Arranged in Fluid Filled Tubular Spaces Using FEM Bending Vibration Analysis of Pipes and Shafts Arranged in Fluid Filled Tubular Spaces Using FEM By Desta Milkessa Under the guidance of : Prof. Dr.Eng. Patrick Kaeding Dipl.-Ing. Michael Holtmann Developed

More information

Computational Analysis Task Casio ClassPad

Computational Analysis Task Casio ClassPad Computational Analysis Task Casio ClassPad The organisers of a surf contest examine two Victorian beaches as proposed venues for the contest. Each day, for two weeks, the organisers record the height of

More information

This file is part of the following reference:

This file is part of the following reference: This file is part of the following reference: Hancock, Timothy Peter (2006) Multivariate consensus trees: tree-based clustering and profiling for mixed data types. PhD thesis, James Cook University. Access

More information

How to use the Turnip Graph Engine in STATA

How to use the Turnip Graph Engine in STATA How to use the Turnip Graph Engine in STATA WHAT YOU HAVE TO DO: 1. Download turnip.ado and put it in an ado file folder 2. Now whenever you open STATA you can run turnip as you would any.ado file: for

More information

That pesky golf game and the dreaded stats class

That pesky golf game and the dreaded stats class That pesky golf game and the dreaded stats class Marsha Jance Indiana University East A case study that involves golf and statistics is presented. This case study focuses on descriptive statistics and

More information

The Application of Data Mining in Sports Events

The Application of Data Mining in Sports Events 2nd International Conference on Electronics, Network and Computer Engineering (ICENCE 2016) The Application of Mining in Sports Events Jiao Yu-ping1,a, Li Xia2,b 1 Physical Education Department Guangdong

More information

The Impact of Narrow Lane on Safety of the Arterial Roads. Hyeonsup Lim

The Impact of Narrow Lane on Safety of the Arterial Roads. Hyeonsup Lim The Impact of Narrow Lane on Safety of the Arterial Roads Hyeonsup Lim What do we know about Narrow Lane? AASHTO Green book, lane widths may vary from 10 to 12 feet for rural and urban arterials. NCHRP

More information

Organizing Quantitative Data

Organizing Quantitative Data Organizing Quantitative Data MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2018 Objectives At the end of this lesson we will be able to: organize discrete data in

More information

Energy Output. Outline. Characterizing Wind Variability. Characterizing Wind Variability 3/7/2015. for Wind Power Management

Energy Output. Outline. Characterizing Wind Variability. Characterizing Wind Variability 3/7/2015. for Wind Power Management Energy Output for Wind Power Management Spring 215 Variability in wind Distribution plotting Mean power of the wind Betz' law Power density Power curves The power coefficient Calculator guide The power

More information

Dangerously bold Featured scientist: Melissa Kjelvik from Michigan State University

Dangerously bold Featured scientist: Melissa Kjelvik from Michigan State University Dangerously bold Featured scientist: Melissa Kjelvik from Michigan State University Research Background: Just as each person has her or his own personality, animals of the same species can behave very

More information

Evaluation of NEA haddock Harvest Control Rules

Evaluation of NEA haddock Harvest Control Rules Evaluation of NEA haddock Harvest Control Rules Alexey Russkikh, Anatolii Chetyrkin, Yuri Kovalev, Bjarte Bogstad, and Gjert Dingsør Background Northeast arctic haddock has been managed by harvest control

More information

Sistrunk Corridor DOTMOCRACY SUMMARY

Sistrunk Corridor DOTMOCRACY SUMMARY DOTMOCRACY SUMMARY The TOUCH initiative is a true collaborative working with multi- sector community organizations and agencies to reduce tobacco use and exposure to secondhand smoke, improve access to

More information

WHAT CAN WE LEARN FROM MUSIC POLLS?

WHAT CAN WE LEARN FROM MUSIC POLLS? Jordi McKenzie 1 1 Department of Economics Macquarie University Vienna Music Business Research Days September 2017 PLAN OF PRESENTATION This presentation.. a little different.. Present two (related) analyses:

More information

EFFECT OF SELECTED EXERCISES ON EXPLOSIVE STRENGTH, SPEED, ENDURANCE AND AGILITY OF MEDIUM FAST BOWLERS IN CRICKET

EFFECT OF SELECTED EXERCISES ON EXPLOSIVE STRENGTH, SPEED, ENDURANCE AND AGILITY OF MEDIUM FAST BOWLERS IN CRICKET EFFECT OF SELECTED EXERCISES ON EXPLOSIVE STRENGTH, SPEED, ENDURANCE AND AGILITY OF MEDIUM FAST BOWLERS IN CRICKET Kumar Praveen* Navnit Ansi** *Assistant Prof. Department of Physical Education, CCS University,

More information

Appendix A Bridge Deck Section Images

Appendix A Bridge Deck Section Images Appendix A Bridge Deck Section Images 127 128 Figure A1. Bridge deck R13, fabricated at the FHWA NDE Validation Center. Figure A2. Bridge deck R12, fabricated at the FHWA NDE Validation Center. 129 Figure

More information

How long would it take to escape from the pendulum?

How long would it take to escape from the pendulum? Week 2, Lesson 1 1. Warm up 2. ICA The Pit & the Pendulum 3. ICA Discussion?s How long would it take to escape from the pendulum? Essential Question Essential Question Essential Question Essential Question

More information

Today s plan: Section 4.2: Normal Distribution

Today s plan: Section 4.2: Normal Distribution 1 Today s plan: Section 4.2: Normal Distribution 2 Characteristics of a data set: mean median standard deviation five-number summary 2 Characteristics of a data set: mean median standard deviation five-number

More information

Case Processing Summary. Cases Valid Missing Total N Percent N Percent N Percent % 0 0.0% % % 0 0.0%

Case Processing Summary. Cases Valid Missing Total N Percent N Percent N Percent % 0 0.0% % % 0 0.0% GET FILE='C:\Users\acantrell\Desktop\demo5.sav'. DATASET NAME DataSet1 WINDOW=FRONT. EXAMINE VARIABLES=PASSYDSPG RUSHYDSPG /PLOT BOXPLOT HISTOGRAM /COMPARE GROUPS /STATISTICS DESCRIPTIVES /CINTERVAL 95

More information

Advantages of Using Combined Bathymetry and Side Scan Data in Survey Processing T.M. Hiller, L.N. Brisson

Advantages of Using Combined Bathymetry and Side Scan Data in Survey Processing T.M. Hiller, L.N. Brisson Advantages of Using Combined Bathymetry and Side Scan Data in Survey Processing T.M. Hiller, L.N. Brisson EdgeTech, West WareHam MA, USA EdgeTech 6205 Combined Bathymetric and Side Scan Sonar EdgeTech

More information

CHAPTER 2 Modeling Distributions of Data

CHAPTER 2 Modeling Distributions of Data CHAPTER 2 Modeling Distributions of Data 2.2 Density Curves and Normal Distributions The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Density Curves

More information

box and whisker plot 3880C798CA037B A83B07E6C4 Box And Whisker Plot 1 / 6

box and whisker plot 3880C798CA037B A83B07E6C4 Box And Whisker Plot 1 / 6 Box And Whisker Plot 1 / 6 2 / 6 3 / 6 Box And Whisker Plot In descriptive statistics, a box plot or boxplot is a method for graphically depicting groups of numerical data through their quartiles.box plots

More information

Name: Statistics February 25, 2013

Name: Statistics February 25, 2013 1. Shooter s basketball team played in ten games. The scores are charted as shown. Which one is greater, the mean or the mode? By how much is it greater? 2. Randy opens up a bag of jelly beans after mixing

More information

Uncovering Anomalous Usage of Medical Records via Social Network Analysis

Uncovering Anomalous Usage of Medical Records via Social Network Analysis TRUST Autumn 2011 Conference Uncovering of Medical Records via Social Network Analysis You Chen, Ph.D. Biomedical Informatics Dept., School of Medicine EECS Dept., School of Engineering November 2, 2011

More information

Travel Behavior of Baby Boomers in Suburban Age Restricted Communities

Travel Behavior of Baby Boomers in Suburban Age Restricted Communities Travel Behavior of Baby Boomers in Suburban Age Restricted Communities TRB Conference Impact of Changing Demographics on the Transportation System 27 October 2008 P. Christopher Zegras Frank Hebbert Eran

More information

CAAD CTF 2018 Rules June 21, 2018 Version 1.1

CAAD CTF 2018 Rules June 21, 2018 Version 1.1 CAAD CTF 2018 Rules June 21, 2018 Version 1.1 The organizer will invite 5 teams to participate CAAD CTF 2018. We will have it in Las Vegas on Aug. 10 th, 2018. The rules details are below: 1. Each team

More information

An exploration of the origins of the favourite-longshot bias in horserace betting markets

An exploration of the origins of the favourite-longshot bias in horserace betting markets An exploration of the origins of the favourite-longshot bias in horserace betting markets Alistair Bruce, Johnnie Johnson & Jiejun Yu Centre for Risk Research University of Southampton 1 University of

More information

Imperceptibly off-center goalkeepers influence penalty-kick direction in soccer

Imperceptibly off-center goalkeepers influence penalty-kick direction in soccer Loughborough University Institutional Repository Imperceptibly off-center goalkeepers influence penalty-kick direction in soccer This item was submitted to Loughborough University's Institutional Repository

More information

AP Statistics Midterm Exam 2 hours

AP Statistics Midterm Exam 2 hours AP Statistics Midterm Exam 2 hours Name Directions: Work on these sheets only. Read each question carefully and answer completely but concisely (point values are from 1 to 3 points so no written answer

More information

Fun with M&M s. By: Cassandra Gucciardo. Sorting

Fun with M&M s. By: Cassandra Gucciardo. Sorting Fun with M&M s Sorting Fractions Objectives: The students will be able to review the measures of central tendency by determining mean, median, mode and range. They will review their understanding of estimation,

More information

Highway & Transportation (I) ECIV 4333 Chapter (4): Traffic Engineering Studies. Spot Speed

Highway & Transportation (I) ECIV 4333 Chapter (4): Traffic Engineering Studies. Spot Speed Introduction Spot Speed Speed is an important measure of the quality of travel and safety of road network. Speed by definition is the rate of movement of vehicle in distance per unit time. A typical unit

More information

CSC242: Intro to AI. Lecture 21

CSC242: Intro to AI. Lecture 21 CSC242: Intro to AI Lecture 21 Quiz Stop Time: 2:15 Learning (from Examples) Learning Learning gives computers the ability to learn without being explicitly programmed (Samuel, 1959)... agents that can

More information

Scaled vs. Original Socre Mean = 77 Median = 77.1

Scaled vs. Original Socre Mean = 77 Median = 77.1 Have out... - notebook - colors - calculator (phone calc will work fine) Tests back as you come in! vocab example Tests Scaled vs. Original Socre Mean = 77 Median = 77.1 + Δ vocab example 1 2.1 Describing

More information

Please, after a carefully reading of given references, answer the following questions:

Please, after a carefully reading of given references, answer the following questions: Report Session 1: Introduction of Respiratory System Student 1 name: Student 2 name: Questionnaire Please, after a carefully reading of given references, answer the following questions: (1) How gas exchange

More information

Real Time Early Warning Indicators for Costly Asset Price Boom/Bust Cycles: A Role for Global Liquidity

Real Time Early Warning Indicators for Costly Asset Price Boom/Bust Cycles: A Role for Global Liquidity Indicators Asset Price : A Role for Global Liquidity Lucia Alessi European Central Bank Carsten Detken Milan, 24 June 2010 The need for Models The set up of an System is one of the key tasks of the European

More information

New Data for Vermont and our Communities: A Focus on Programs of the U.S. Census Bureau Town Officer Education Seminars uvm.

New Data for Vermont and our Communities: A Focus on Programs of the U.S. Census Bureau Town Officer Education Seminars uvm. New Data for Vermont and our Communities: A Focus on Programs of the U.S. Census Bureau 2011 Town Officer Education Seminars Our Focus Today Accessing the Data Target to serve 2010 Census and ACS 5 year

More information

Figure 1: A hockey puck travels to the right in three different cases.

Figure 1: A hockey puck travels to the right in three different cases. Question (12) Average velocity and position (update) for three hockey pucks You view the motion of a hockey puck in a video and mark its location every. The resulting image for three different hockey pucks

More information

Level 1 Mathematics and Statistics, 2014

Level 1 Mathematics and Statistics, 2014 91037 910370 1SUPERVISOR S Level 1 Mathematics and Statistics, 2014 91037 Demonstrate understanding of chance and data 9.30 am Tuesday 18 November 2014 Credits: Four Achievement Achievement with Merit

More information

Persistence racial difference in socioeconomic outcomes. Are Emily and Greg More Employable than Lakisha and Jamal?

Persistence racial difference in socioeconomic outcomes. Are Emily and Greg More Employable than Lakisha and Jamal? Are Emily and Greg More Employable than Lakisha and Jamal? Bertrand and Mullainathan Persistence racial difference in socioeconomic outcomes Large difference in outcomes between similarly defined blacks

More information

Computer diagnostics for the analysis of table tennis matches *

Computer diagnostics for the analysis of table tennis matches * ISSN 1750-9823 (print) International Journal of Sports Science and Engineering Vol. 02 (2008) No. 03, pp. 144-153 Computer diagnostics for the analysis of table tennis matches * Li Juan Yu +, Hui Zhang

More information

DESIGN AND ANALYSIS OF ALGORITHMS (DAA 2017)

DESIGN AND ANALYSIS OF ALGORITHMS (DAA 2017) DESIGN AND ANALYSIS OF ALGORITHMS (DAA 2017) Veli Mäkinen 12/05/2017 1 COURSE STRUCTURE 7 weeks: video lecture -> demo lecture -> study group -> exercise Video lecture: Overview, main concepts, algorithm

More information