Reproducible Research: Peer Assessment 1

Size: px
Start display at page:

Download "Reproducible Research: Peer Assessment 1"

Transcription

1 Introduction Reproducible Research: Peer Assessment 1 It is now possible to collect a large amount of data about personal movement using activity monitoring devices such as a Fitbit, Nike Fuelband, or Jawbone Up. These type of devices are part of the quantified self movement a group of enthusiasts who take measurements about themselves regularly to improve their health, to find patterns in their behavior, or because they are tech geeks. But these data remain under-utilized both because the raw data are hard to obtain and there is a lack of statistical methods and software for processing and interpreting the data. This assignment makes use of data from a personal activity monitoring device. This device collects data at 5 minute intervals through out the day. The data consists of two months of data from an anonymous individual collected during the months of October and November, 2012 and include the number of steps taken in 5 minute intervals each day. Data he data for this assignment was downloaded from the course web site: Dataset: Activity monitoring data The variables included in this dataset are: 1. steps: Number of steps taking in a 5-minute interval (missing values are coded as NA) 2. date: The date on which the measurement was taken in YYYY-MM-DD format 3. interval: Identifier for the 5-minute interval in which measurement was taken The dataset is stored in a comma-separated-value (CSV) file and there are a total of 17,568 observations in this dataset. Assignment Loading and preprocessing the data The following code checks for the data file in the working directory. If it does not find, it downloads the zipfile of the data, unzips it and saves the dataset under the name of activity if(!file.exists("getdata-projectfiles-uci HAR Dataset.zip")){ temp <- tempfile() download.file(" unzip(temp) unlink(temp) } activity <- read.csv("activity.csv") What is mean total number of steps taken per day? Total number of steps taken per day can be calculated with the following code. 1

2 #Converting the date variable into class "Date" activity$date <- as.date(activity$date) #Finding the total number of steps each day total.steps <- with(activity, tapply(steps, date, sum)) Total number of steps taken per day can be plotted in the histogram with the following code #Plotting the histogram using the above dataset hist(total.steps, ylim = c(0,35), xlab = "Total Number of Steps Each Day", ylab = "Frequency", main = "Histogram of the Total Number of Steps Taken Each Day", col = "Magenta") Histogram of the Total Number of Steps Taken Each Day Frequency Total Number of Steps Each Day The mean and median of the total number of steps can be found. mean.steps <- mean(total.steps, na.rm = TRUE) median.steps <- median(total.steps, na.rm = TRUE) mean.steps ## [1] median.steps ## [1]

3 What is the average daily activity pattern? #Averaging the steps based on time of the day over the period of two months interval.mean <- aggregate(steps~interval, activity, mean) #Using ggplot2 package to plot the results library(ggplot2) g <- ggplot(interval.mean, aes(interval, steps)) p <- g + geom_line() + xlab("time") + ylab("total Steps") + ggtitle("average number of steps taken throu print(p) Average number of steps taken throughout the day Total Steps Time The 5-minute interval, on average across all the days in the dataset, which contains the maximum number of step can be found by the following code. max.interval <- interval.mean[which.max(interval.mean$steps),1] max.interval ## [1] 835 max.interval ## [1] :35 is the time interval at which max steps were taken. 3

4 Imputing missing values There are 2304 missing values in the dataset which was found by the following. missing.values <- sum(is.na(activity)) The missing values were replaced by the mean value of the steps taken in that interval, averaged across all days. The new dataset is stored under imputed.activity. ##interval.mean, caluculated in the previous section, contains the mean values needed to replace the mis imputed.activity <- transform(activity, steps = ifelse(is.na(activity$steps), interval.mean$steps[match( Since no data was collected for the first day, all the NA of have to replaced by 0 to prevent any unccessary deviations from the result. imputed.activity[as.character(imputed.activity$date) == " ", 1] <- 0 The dataset with the missing values(activity) and imputed one (imputed.activity) are superimposed on each other imp.total.steps <- with(imputed.activity, tapply(steps, date, sum)) # The imputed.activity dataset is plotted hist(imp.total.steps, ylim = c(0,35), xlab = "Total Number of Steps Each Day", ylab = "Frequency", main # The actvity dataset (with the missing values) is plotted over the imputed.activity hist(total.steps, ylim = c(0,35), xlab = "Total Number of Steps Each Day", ylab = "Frequency", col = "Ma legend("topright", c("imputed", "Non-imputed"), col=c("green", "Magenta"), lwd=6) Histogram of the Total Number of Steps Taken Each Day Frequency Imputed Non imputed Total Number of Steps Each Day 4

5 The mean and the median for the imputed dataset is as follows. i.mean.steps <- mean(imp.total.steps) i.median.steps <- median(imp.total.steps) i.mean.steps ## [1] i.median.steps ## [1] Calculating the difference between the activity and imputed.activity datasets. ##Difference between the means mean.diff <- i.mean.steps - mean.steps ##Difference between the medians median.diff <- i.median.steps - median.steps ##Difference between the total number of steps total.diff <- sum(imputed.activity$steps) - sum(activity$steps, na.rm = T) mean.diff ## [1] median.diff ## [1] total.diff ## [1] It is observed that the mean decreases by steps while the median is increased by 1.2. Also, the total steps have increased by Are there differences in activity patterns between weekdays and weekends? Creating a new factor variable in the dataset with two levels weekday and weekend indicating whether a given date is a weekday or weekend day. #Using the the weekdaysfunction to convert all the dates into Weekdays and storing the dataset under wee week.activity = transform(imputed.activity, date = weekdays(activity$date, abbreviate = FALSE)) #Replacing all the weekdays with factor "Weekday" and weekends with "Weekend" week.activity$date <- gsub("monday Tuesday Wednesday Thursday Friday","Weekday", week.activity$date) week.activity$date <- gsub("saturday Sunday", "Weekend", week.activity$date) week.activity$date <- as.factor(week.activity$date) 5

6 Making a panel plot with ggplot 2 package containing a time series plot of the 5-minute interval and the average number of steps taken, averaged across all weekday days or weekend days. #Filtering the mean steps over 5 min interval, separated by Weekday and Weekend factor variables week.mean.steps <- aggregate(steps~interval+date, week.activity, mean) p1 <- ggplot(week.mean.steps, aes(interval, steps)) + facet_grid(date~.) + geom_line() + xlab("time") + print(p1) 200 Total Steps Weekday Weekend Time 6

STAT 625: 2000 Olympic Diving Exploration

STAT 625: 2000 Olympic Diving Exploration Corey S Brier, Department of Statistics, Yale University 1 STAT 625: 2000 Olympic Diving Exploration Corey S Brier Yale University Abstract This document contains a preliminary investigation of data from

More information

Sample Final Exam MAT 128/SOC 251, Spring 2018

Sample Final Exam MAT 128/SOC 251, Spring 2018 Sample Final Exam MAT 128/SOC 251, Spring 2018 Name: Each question is worth 10 points. You are allowed one 8 1/2 x 11 sheet of paper with hand-written notes on both sides. 1. The CSV file citieshistpop.csv

More information

1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data.

1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data. 1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data. Green Blue Brown Blue Blue Brown Blue Blue Blue Green Blue Brown Blue Brown Brown Blue

More information

Full file at

Full file at Chapter 2 1. Describe the distribution. survival times of persons diagnosed with terminal lymphoma A) approximately normal B) skewed left C) skewed right D) roughly uniform Ans: C Difficulty: low 2. Without

More information

1wsSMAM 319 Some Examples of Graphical Display of Data

1wsSMAM 319 Some Examples of Graphical Display of Data 1wsSMAM 319 Some Examples of Graphical Display of Data 1. Lands End employs numerous persons to take phone orders. Computers on which orders are entered also automatically collect data on phone activity.

More information

Case Studies Homework 3

Case Studies Homework 3 Case Studies Homework 3 Breanne Chryst September 11, 2013 1 In this assignment I did some exploratory analysis on a data set containing diving information from the 2000 Olympics. My code and output is

More information

STAT 625: 2000 Olympic Diving Exploration

STAT 625: 2000 Olympic Diving Exploration Corey S Brier, Department of Statistics, Yale University 1 STAT 625: 2000 Olympic Diving Exploration Corey S Brier Yale University Abstract This document contains an investigation of bias using data from

More information

Chapter 2: Modeling Distributions of Data

Chapter 2: Modeling Distributions of Data Chapter 2: Modeling Distributions of Data Section 2.1 The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 2 Modeling Distributions of Data 2.1 2.2 Normal Distributions Section

More information

Unit 3 - Data. Grab a new packet from the chrome book cart. Unit 3 Day 1 PLUS Box and Whisker Plots.notebook September 28, /28 9/29 9/30?

Unit 3 - Data. Grab a new packet from the chrome book cart. Unit 3 Day 1 PLUS Box and Whisker Plots.notebook September 28, /28 9/29 9/30? Unit 3 - Data Grab a new packet from the chrome book cart 9/28 9/29 9/30? 10/3 10/4 10/5 10/6 10/7-10/10 10/11 10/12 10/13 Practice ACT #1 Lesson 1: Box and Whisker Plots I can find the 5 number summary

More information

North Point - Advance Placement Statistics Summer Assignment

North Point - Advance Placement Statistics Summer Assignment North Point - Advance Placement Statistics This assignment is due during the first week of class. It is considered an exam grade, which that category is worth approximately 60% of your total grade. All

More information

Unit 3 ~ Data about us

Unit 3 ~ Data about us Unit 3 ~ Data about us Investigation 3: Data Sets & Displays I can construct, interpret, and compare data sets and displays. I can find, interpret, and compare measures of center and variation for data

More information

Activity: the race to beat Ruth

Activity: the race to beat Ruth Activity: the race to beat Ruth Background In baseball, a pitcher from the fielding team throws a ball at a batter from the batting team, who attempts to hit the ball with a stick. If she hits the ball,

More information

A Shallow Dive into Deep Sea Data Sarah Solie and Arielle Fogel 7/18/2018

A Shallow Dive into Deep Sea Data Sarah Solie and Arielle Fogel 7/18/2018 A Shallow Dive into Deep Sea Data Sarah Solie and Arielle Fogel 7/18/2018 Introduction The datasets This data expedition will utilize the World Ocean Atlas (WOA) database to explore two deep sea physical

More information

Statistical Analysis Project - How do you decide Who s the Best?

Statistical Analysis Project - How do you decide Who s the Best? Statistical Analysis Project - How do you decide Who s the Best? In order to choose the best shot put thrower to go to IASAS, the three candidates were asked to throw the shot put for a total of times

More information

Chapter 5: Methods and Philosophy of Statistical Process Control

Chapter 5: Methods and Philosophy of Statistical Process Control Chapter 5: Methods and Philosophy of Statistical Process Control Learning Outcomes After careful study of this chapter You should be able to: Understand chance and assignable causes of variation, Explain

More information

5.1 Introduction. Learning Objectives

5.1 Introduction. Learning Objectives Learning Objectives 5.1 Introduction Statistical Process Control (SPC): SPC is a powerful collection of problem-solving tools useful in achieving process stability and improving capability through the

More information

The pth percentile of a distribution is the value with p percent of the observations less than it.

The pth percentile of a distribution is the value with p percent of the observations less than it. Describing Location in a Distribution (2.1) Measuring Position: Percentiles One way to describe the location of a value in a distribution is to tell what percent of observations are less than it. De#inition:

More information

Fundamentals of Machine Learning for Predictive Data Analytics

Fundamentals of Machine Learning for Predictive Data Analytics Fundamentals of Machine Learning for Predictive Data Analytics Appendix A Descriptive Statistics and Data Visualization for Machine learning John Kelleher and Brian Mac Namee and Aoife D Arcy john.d.kelleher@dit.ie

More information

Bivariate Data. Frequency Table Line Plot Box and Whisker Plot

Bivariate Data. Frequency Table Line Plot Box and Whisker Plot U04 D02 Univariate Data Frequency Table Line Plot Box and Whisker Plot Univariate Data Bivariate Data involving a single variable does not deal with causes or relationships the major purpose of univariate

More information

World Ocean Atlas (WOA) Product Documentation

World Ocean Atlas (WOA) Product Documentation World Ocean Atlas (WOA) Product Documentation This document describes WOA statistical and objectively analyzed field files. This description includes the types of statistical fields available, for which

More information

Navigate to the golf data folder and make it your working directory. Load the data by typing

Navigate to the golf data folder and make it your working directory. Load the data by typing Golf Analysis 1.1 Introduction In a round, golfers have a number of choices to make. For a particular shot, is it better to use the longest club available to try to reach the green, or would it be better

More information

save percentages? (Name) (University)

save percentages? (Name) (University) 1 IB Maths Essay: What is the correlation between the height of football players and their save percentages? (Name) (University) Table of Contents Raw Data for Analysis...3 Table 1: Raw Data...3 Rationale

More information

% per year Age (years)

% per year Age (years) Stat 1001 Winter 1998 Geyer Homework 2 Problem 3.1 66 inches and 72 inches. Problem 3.2 % per year 0.0 0.5 1.0 1.5 0 20 40 60 80 Age (years) (a) Age 1. (b) More 31-year olds (c) More people age 35{44,

More information

September Population analysis of the Finnish Spitz breed

September Population analysis of the Finnish Spitz breed Population analysis of the Finnish Spitz breed Genetic analysis of the Kennel Club pedigree records of the UK Finnish Spitz population has been carried out with the aim of estimating the rate of loss of

More information

September Population analysis of the Portuguese Water Dog breed

September Population analysis of the Portuguese Water Dog breed Population analysis of the Portuguese Water Dog breed Genetic analysis of the Kennel Club pedigree records of the UK Portuguese Water Dog population has been carried out with the aim of estimating the

More information

MoLE Gas Laws Activities

MoLE Gas Laws Activities MoLE Gas Laws Activities To begin this assignment you must be able to log on to the Internet using Internet Explorer (Microsoft) 4.5 or higher. If you do not have the current version of the browser, go

More information

That pesky golf game and the dreaded stats class

That pesky golf game and the dreaded stats class That pesky golf game and the dreaded stats class Marsha Jance Indiana University East A case study that involves golf and statistics is presented. This case study focuses on descriptive statistics and

More information

WIND DATA REPORT. Vinalhaven

WIND DATA REPORT. Vinalhaven WIND DATA REPORT Vinalhaven January - December, 2003 Prepared for Fox Islands Electric Cooperative by Anthony L. Rogers May 12, 2004 Report template version 1.1 Renewable Energy Research Laboratory 160

More information

September Population analysis of the Dogue De Bordeaux breed

September Population analysis of the Dogue De Bordeaux breed Population analysis of the Dogue De Bordeaux breed Genetic analysis of the Kennel Club pedigree records of the UK Dogue De Bordeaux population has been carried out with the aim of estimating the rate of

More information

Model Selection Erwan Le Pennec Fall 2015

Model Selection Erwan Le Pennec Fall 2015 Model Selection Erwan Le Pennec Fall 2015 library("dplyr") library("ggplot2") library("ggfortify") library("reshape2") Model Selection We will now use another classical dataset birthwt which corresponds

More information

Lower Columbia River Dam Fish Ladder Passage Times, Eric Johnson and Christopher Peery University of Idaho

Lower Columbia River Dam Fish Ladder Passage Times, Eric Johnson and Christopher Peery University of Idaho Lower Columbia River Dam Fish Ladder Passage Times, 3 Eric Johnson and Christopher Peery University of Idaho As per your request, we have assembled passage times at Lower Columbia River fish ladders. Ladder

More information

Bayesian Optimized Random Forest for Movement Classification with Smartphones

Bayesian Optimized Random Forest for Movement Classification with Smartphones Bayesian Optimized Random Forest for Movement Classification with Smartphones 1 2 3 4 Anonymous Author(s) Affiliation Address email 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29

More information

Tutorial for the. Total Vertical Uncertainty Analysis Tool in NaviModel3

Tutorial for the. Total Vertical Uncertainty Analysis Tool in NaviModel3 Tutorial for the Total Vertical Uncertainty Analysis Tool in NaviModel3 May, 2011 1. Introduction The Total Vertical Uncertainty Analysis Tool in NaviModel3 has been designed to facilitate a determination

More information

World Leading Traffic Analysis

World Leading Traffic Analysis World Leading Traffic Analysis Over the past 25 years, has worked closely with road authorities and traffic managers around the world to deliver leading traffic monitoring equipment. With products now

More information

BEFORE YOU OPEN ANY FILES:

BEFORE YOU OPEN ANY FILES: Dive Analysis Lab * Make sure to download all the data files for the lab onto your computer. * Bring your computer to lab. * Bring a blank disk or memory stick to class to save your work and files. The

More information

Running head: DATA ANALYSIS AND INTERPRETATION 1

Running head: DATA ANALYSIS AND INTERPRETATION 1 Running head: DATA ANALYSIS AND INTERPRETATION 1 Data Analysis and Interpretation Final Project Vernon Tilly Jr. University of Central Oklahoma DATA ANALYSIS AND INTERPRETATION 2 Owners of the various

More information

Stats 2002: Probabilities for Wins and Losses of Online Gambling

Stats 2002: Probabilities for Wins and Losses of Online Gambling Abstract: Jennifer Mateja Andrea Scisinger Lindsay Lacher Stats 2002: Probabilities for Wins and Losses of Online Gambling The objective of this experiment is to determine whether online gambling is a

More information

Chapter 6 The Standard Deviation as a Ruler and the Normal Model

Chapter 6 The Standard Deviation as a Ruler and the Normal Model Chapter 6 The Standard Deviation as a Ruler and the Normal Model Standardizing with z-scores Expressing data in terms of We standardize to Standardized values can be even if the original variables had

More information

User s Guide for inext Online: Software for Interpolation and

User s Guide for inext Online: Software for Interpolation and Original version (September 2016) User s Guide for inext Online: Software for Interpolation and Extrapolation of Species Diversity Anne Chao, K. H. Ma and T. C. Hsieh Institute of Statistics, National

More information

Chapter 2 - Frequency Distributions and Graphs

Chapter 2 - Frequency Distributions and Graphs - Frequency Distributions and Graphs 1. Which of the following does not need to be done when constructing a frequency distribution? A) select the number of classes desired B) find the range C) make the

More information

Effective Use of Box Charts

Effective Use of Box Charts Effective Use of Box Charts Purpose This tool provides guidelines and tips on how to effectively use box charts to communicate research findings. Format This tool provides guidance on box charts and their

More information

Reshaping data in R. Duncan Golicher. December 9, 2008

Reshaping data in R. Duncan Golicher. December 9, 2008 Reshaping data in R Duncan Golicher December 9, 2008 One of the most frustrating and time consuming parts of statistical analysis is shuffling data into a format for analysis. No one enjoys changing data

More information

Player, Coach, Manager, and Team Account Creation

Player, Coach, Manager, and Team Account Creation Player, Coach, Manager, and Team Account Creation for GotSoccer The below instructions will help walk you though the process of creating player, coach, team official and team accounts (profiles) in GotSoccer.

More information

Using an Adaptive Thresholding Algorithm to Detect CA1 Hippocampal Sharp Wave Ripples. Jay Patel. Michigan State University

Using an Adaptive Thresholding Algorithm to Detect CA1 Hippocampal Sharp Wave Ripples. Jay Patel. Michigan State University Using an Adaptive Thresholding Algorithm to Detect CA1 Hippocampal Sharp Wave Ripples Jay Patel Michigan State University Department of Physics and Astronomy, University of California, Los Angeles 2013

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 256 Introduction This procedure computes summary statistics and common non-parametric, single-sample runs tests for a series of n numeric, binary, or categorical data values. For numeric data,

More information

Statistics. Wednesday, March 28, 2012

Statistics. Wednesday, March 28, 2012 Statistics Wednesday, March 28, 2012 Today's Agenda: 1. Collect HW #2: Activities 9 7 & 9 12 2. Go over HW #1: Activities 9 20 a f & 9 17 3. Practice calculating & s x 4. Activities 9 4, 9 5, 9 11, 9 13

More information

Descriptive Statistics Project Is there a home field advantage in major league baseball?

Descriptive Statistics Project Is there a home field advantage in major league baseball? Descriptive Statistics Project Is there a home field advantage in major league baseball? DUE at the start of class on date posted on website (in the first 5 minutes of class) There may be other due dates

More information

Lesson 3 Pre-Visit Teams & Players by the Numbers

Lesson 3 Pre-Visit Teams & Players by the Numbers Lesson 3 Pre-Visit Teams & Players by the Numbers Objective: Students will be able to: Review how to find the mean, median and mode of a data set. Calculate the standard deviation of a data set. Evaluate

More information

Analysis of Highland Lakes Inflows Using Process Behavior Charts Dr. William McNeese, Ph.D. Revised: Sept. 4,

Analysis of Highland Lakes Inflows Using Process Behavior Charts Dr. William McNeese, Ph.D. Revised: Sept. 4, Analysis of Highland Lakes Inflows Using Process Behavior Charts Dr. William McNeese, Ph.D. Revised: Sept. 4, 2018 www.spcforexcel.com Author s Note: This document has been revised to include the latest

More information

SUMMARIZING FROG AND TOAD COUNT DATA

SUMMARIZING FROG AND TOAD COUNT DATA SUMMARIZING FROG AND TOAD COUNT DATA This set of protocols will take you through all the steps necessary for summarizing the frog and toad data for each NAAMP route that was been assigned to you. BEFORE

More information

Codebook for Using Cross-game Behavioral Markers for Early Identification of High-risk Internet Gamblers

Codebook for Using Cross-game Behavioral Markers for Early Identification of High-risk Internet Gamblers Codebook for Using Cross-game Behavioral Markers for Early Identification of High-risk Internet Gamblers This codebook provides information for both raw and analytic datasets used to generate an algorithm

More information

WIND DATA REPORT. Ragged Mt Maine

WIND DATA REPORT. Ragged Mt Maine WIND DATA REPORT Ragged Mt Maine December 1 st 2007 to February 29 th 2007 by James R. Browning James F. Manwell Utama Abdulwahid Anthony F. Ellis April 10, 2008 Report template version 3.1 Renewable Energy

More information

Diameter in cm. Bubble Number. Bubble Number Diameter in cm

Diameter in cm. Bubble Number. Bubble Number Diameter in cm Bubble lab Data Sheet Blow bubbles and measure the diameter to the nearest whole centimeter. Record in the tables below. Try to blow different sized bubbles. Name: Bubble Number Diameter in cm Bubble Number

More information

IHS AP Statistics Chapter 2 Modeling Distributions of Data MP1

IHS AP Statistics Chapter 2 Modeling Distributions of Data MP1 IHS AP Statistics Chapter 2 Modeling Distributions of Data MP1 Monday Tuesday Wednesday Thursday Friday August 22 A Day 23 B Day 24 A Day 25 B Day 26 A Day Ch1 Exploring Data Class Introduction Getting

More information

Looking at Spacings to Assess Streakiness

Looking at Spacings to Assess Streakiness Looking at Spacings to Assess Streakiness Jim Albert Department of Mathematics and Statistics Bowling Green State University September 2013 September 2013 1 / 43 The Problem Collect hitting data for all

More information

CDR File Information. User Case Number EDR Data Imaging Date Crash Date

CDR File Information. User Case Number EDR Data Imaging Date Crash Date CDR File Information User Entered VIN 1ZVFT82H665****** User Case Number EDR Data Imaging Date Crash Date Filename SAMPLE-PCM-ND.CDR Saved on Monday, May 18 2009 at 04:38:07 PM Collected with CDR version

More information

Package STI. August 19, Index 7. Compute the Standardized Temperature Index

Package STI. August 19, Index 7. Compute the Standardized Temperature Index Package STI August 19, 2015 Type Package Title Calculation of the Standardized Temperature Index Version 0.1 Date 2015-08-18 Author Marc Fasel [aut, cre] Maintainer Marc Fasel A set

More information

Cambridge International Examinations Cambridge Ordinary Level

Cambridge International Examinations Cambridge Ordinary Level Cambridge International Examinations Cambridge Ordinary Level *9399919087* STATISTICS 4040/12 Paper 1 October/November 2015 Candidates answer on the Question Paper. Additional Materials: Pair of compasses

More information

Lesson 2 Pre-Visit Slugging Percentage

Lesson 2 Pre-Visit Slugging Percentage Lesson 2 Pre-Visit Slugging Percentage Objective: Students will be able to: Set up and solve equations for batting average and slugging percentage. Review prior knowledge of conversion between fractions,

More information

Unit 6 Day 2 Notes Central Tendency from a Histogram; Box Plots

Unit 6 Day 2 Notes Central Tendency from a Histogram; Box Plots AFM Unit 6 Day 2 Notes Central Tendency from a Histogram; Box Plots Name Date To find the mean, median and mode from a histogram, you first need to know how many data points were used. Use the frequency

More information

box and whisker plot 3880C798CA037B A83B07E6C4 Box And Whisker Plot 1 / 6

box and whisker plot 3880C798CA037B A83B07E6C4 Box And Whisker Plot 1 / 6 Box And Whisker Plot 1 / 6 2 / 6 3 / 6 Box And Whisker Plot In descriptive statistics, a box plot or boxplot is a method for graphically depicting groups of numerical data through their quartiles.box plots

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Descriptive Statistics vs Inferential Statistics Describing a sample Making inferences to a larger population Data = Information but too much information. How do we summarize data?

More information

FIG: 27.1 Tool String

FIG: 27.1 Tool String Bring up Radioactive Tracer service. Click Acquisition Box - Edit - Tool String Edit the tool string as necessary to reflect the tool string being run. This is important to insure proper offsets, filters,

More information

LONG TERM SITE WIND DATA QUARTERLY REPORT. Bishop and Clerks

LONG TERM SITE WIND DATA QUARTERLY REPORT. Bishop and Clerks LONG TERM SITE WIND DATA QUARTERLY REPORT Bishop and Clerks January 1, 2012 March 31, 2012 Prepared for Massachusetts Clean Energy Center 55 Summer Street, 9th Floor Boston, MA 02110 by Frederick Letson

More information

WIND DATA REPORT. Swan s Island, ME

WIND DATA REPORT. Swan s Island, ME WIND DATA REPORT Swan s Island, ME June 1, 2009 August 31, 2009 Prepared for US Department of Energy by Daniel T. Grip Utama Abdulwahid James F. Manwell Anthony F. Ellis September 17, 2009 Report template

More information

DEVELOPMENT OF TRAFFIC CALMING POLICY & WARRANT TASK 3 DELIVERABLE: TRAFFIC CALMING WARRANT

DEVELOPMENT OF TRAFFIC CALMING POLICY & WARRANT TASK 3 DELIVERABLE: TRAFFIC CALMING WARRANT FINAL TECHNICAL MEMORANDUM MAY 2011 DOCUMENT CONTROL Client: Project Name: Development of Traffic Calming Policy & Warrant Report Title: IBI Reference: 27794 Version: 5.0 Development of Traffic Calming

More information

LONG TERM SITE WIND DATA QUARTERLY REPORT. Bishop and Clerks

LONG TERM SITE WIND DATA QUARTERLY REPORT. Bishop and Clerks LONG TERM SITE WIND DATA QUARTERLY REPORT Bishop and Clerks April 1, 2012 June 30, 2012 Prepared for Massachusetts Clean Energy Center 55 Summer Street, 9th Floor Boston, MA 02110 by Frederick Letson James

More information

Internet Technology Fundamentals. To use a passing score at the percentiles listed below:

Internet Technology Fundamentals. To use a passing score at the percentiles listed below: Internet Technology Fundamentals To use a passing score at the percentiles listed below: PASS candidates with this score or HIGHER: 2.90 High Scores Medium Scores Low Scores Percentile Rank Proficiency

More information

DATA SCIENCE SUMMER UNI VIENNA

DATA SCIENCE SUMMER UNI VIENNA Prerequisites - You have installed Tableau Desktop on your computer. Available here: http://www.tableau.com/academic/students - You have downloaded the data (athlete_events.csv) available here: https://www.kaggle.com/heesoo37/120-years-of-olympic-historyathletes-and-results

More information

Data Set 7: Bioerosion by Parrotfish Background volume of bites The question:

Data Set 7: Bioerosion by Parrotfish Background volume of bites The question: Data Set 7: Bioerosion by Parrotfish Background Bioerosion of coral reefs results from animals taking bites out of the calcium-carbonate skeleton of the reef. Parrotfishes are major bioerosion agents,

More information

El Cerrito Sporting Goods Ira Sharenow January 7, 2019

El Cerrito Sporting Goods Ira Sharenow January 7, 2019 El Cerrito Sporting Goods Ira Sharenow January 7, 2019 R Markdown The goal of the analysis is to determine if any of the salespersons are performing exceptionally well or exceptionally poorly. In particular,

More information

Understanding Winter Road Conditions in Yellowstone National Park Using Cumulative Sum Control Charts

Understanding Winter Road Conditions in Yellowstone National Park Using Cumulative Sum Control Charts 1 Understanding Winter Road Conditions in Yellowstone National Park Using Cumulative Sum Control Charts Amber Nuxoll April 1st, 2014 Contents 1 Introduction 2 2 Data Collection and Background 2 3 Exploratory

More information

BEFORE YOU OPEN ANY FILES:

BEFORE YOU OPEN ANY FILES: Dive Analysis Lab *If you are using a school computer bring a USB drive to class to save your work and the files for the lab. *If you are using your own computer, make sure to download the data and files

More information

Overview. 2 Module 13: Advanced Data Processing

Overview. 2 Module 13: Advanced Data Processing 2 Module 13: Advanced Data Processing Overview This section of the course covers advanced data processing when profiling. We will discuss the removal of the fairly gross effects of ship heave and talk

More information

Evaluating The Best. Exploring the Relationship between Tom Brady s True and Observed Talent

Evaluating The Best. Exploring the Relationship between Tom Brady s True and Observed Talent Evaluating The Best Exploring the Relationship between Tom Brady s True and Observed Talent Heather Glenny, Emily Clancy, and Alex Monahan MCS 100: Mathematics of Sports Spring 2016 Tom Brady s recently

More information

STAT 155 Introductory Statistics. Lecture 2-2: Displaying Distributions with Graphs

STAT 155 Introductory Statistics. Lecture 2-2: Displaying Distributions with Graphs The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL STAT 155 Introductory Statistics Lecture 2-2: Displaying Distributions with Graphs 8/31/06 Lecture 2-2 1 Recall Data: Individuals Variables Categorical variables

More information

Cycling Volume Estimation Methods for Safety Analysis

Cycling Volume Estimation Methods for Safety Analysis Cycling Volume Estimation Methods for Safety Analysis XI ICTCT extra Workshop in Vancouver, Canada Session: Methods and Simulation Date: March, 01 The Highway Safety Manual (HSM) documents many safety

More information

9.3 Histograms and Box Plots

9.3 Histograms and Box Plots Name Class Date 9.3 Histograms and Box Plots Essential Question: How can you interpret and compare data sets using data displays? Explore Understanding Histograms Resource Locker A histogram is a bar graph

More information

(c) The hospital decided to collect the data from the first 50 patients admitted on July 4, 2010.

(c) The hospital decided to collect the data from the first 50 patients admitted on July 4, 2010. Math 155, Test 1, 18 October 2011 Name: Instructions. This is a closed-book test. You may use a calculator (but not a cell phone). Make sure all cell-phones are put away and that the ringer is off. Show

More information

Summary of NWA Trail Usage Report November 2, 2015

Summary of NWA Trail Usage Report November 2, 2015 Summary of NWA Trail Usage Report November 2, 2015 Summary Findings: The study showed that Northwest Arkansas (NWA) had relatively high cyclist user counts per capita aggregated across the top three usage

More information

LONG TERM SITE WIND DATA ANNUAL REPORT. Bishop & Clerks

LONG TERM SITE WIND DATA ANNUAL REPORT. Bishop & Clerks LONG TERM SITE WIND DATA ANNUAL REPORT Bishop & Clerks July 1, 2012 June 30, 2013 Prepared for Massachusetts Clean Energy Center 55 Summer Street, 9th Floor Boston, MA 02110 by Frederick Letson James F.

More information

NATIONAL TRANSPORTATION SAFETY BOARD Vehicle Recorder Division Washington, D.C August 26, Event Recorder

NATIONAL TRANSPORTATION SAFETY BOARD Vehicle Recorder Division Washington, D.C August 26, Event Recorder 1. EVENT SUMMARY NATIONAL TRANSPORTATION SAFETY BOARD Vehicle Recorder Division Washington, D.C. 20594 August 26, 2013 Event Recorder Specialist s Factual Report By Bill Tuccio Location: New York, New

More information

Organizing Quantitative Data

Organizing Quantitative Data Organizing Quantitative Data MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2018 Objectives At the end of this lesson we will be able to: organize discrete data in

More information

Lab 1. Adiabatic and reversible compression of a gas

Lab 1. Adiabatic and reversible compression of a gas Lab 1. Adiabatic and reversible compression of a gas Introduction The initial and final states of an adiabatic and reversible volume change of an ideal gas can be determined by the First Law of Thermodynamics

More information

Background Information. Project Instructions. Problem Statement. EXAM REVIEW PROJECT Microsoft Excel Review Baseball Hall of Fame Problem

Background Information. Project Instructions. Problem Statement. EXAM REVIEW PROJECT Microsoft Excel Review Baseball Hall of Fame Problem Background Information Every year, the National Baseball Hall of Fame conducts an election to select new inductees from candidates nationally recognized for their talent or association with the sport of

More information

MoLE Gas Laws Activities

MoLE Gas Laws Activities MoLE Gas Laws Activities To begin this assignment you must be able to log on to the Internet using Internet Explorer (Microsoft) 4.5 or higher. If you do not have the current version of the browser, go

More information

March Madness Basketball Tournament

March Madness Basketball Tournament March Madness Basketball Tournament Math Project COMMON Core Aligned Decimals, Fractions, Percents, Probability, Rates, Algebra, Word Problems, and more! To Use: -Print out all the worksheets. -Introduce

More information

Basic Emergency Planning Awareness for Acadia National Park Rock Climbing Enthusiasts Otter Cliffs Climbing Area. Lina Dailide-Custodio

Basic Emergency Planning Awareness for Acadia National Park Rock Climbing Enthusiasts Otter Cliffs Climbing Area. Lina Dailide-Custodio Basic Emergency Planning Awareness for Acadia National Park Rock Climbing Enthusiasts Otter Cliffs Climbing Area Lina Dailide-Custodio GEOINT Student, Penn State University 1. Introduction Since the 1880s,

More information

Diagnosis of Fuel Evaporative System

Diagnosis of Fuel Evaporative System T S F S 0 6 L A B E X E R C I S E 2 Diagnosis of Fuel Evaporative System April 5, 2017 1 objective The objective with this laboratory exercise is to read, understand, and implement an algorithm described

More information

Statistical Analyses on Roger Federer s Performances in 2013 and 2014 James Kong,

Statistical Analyses on Roger Federer s Performances in 2013 and 2014 James Kong, Statistical Analyses on Roger Federer s Performances in 2013 and 2014 James Kong, kong.james@berkeley.edu Introduction Tennis has become a global sport and I am a big fan of tennis. Since I played college

More information

Report from the Kennel Club/ British Small Animal Veterinary Association Scientific Committee

Report from the Kennel Club/ British Small Animal Veterinary Association Scientific Committee Report from the Kennel Club/ British Small Animal Veterinary Association Scientific Committee Summary results of the Purebred Dog Health Survey for the Grand Bleu de Gascoigne breed Warning: The results

More information

Well Integrity Log Interpretation (Magnetic Thickness Detector and 56-Arm Multi-Finger Caliper)

Well Integrity Log Interpretation (Magnetic Thickness Detector and 56-Arm Multi-Finger Caliper) Well Integrity Log Interpretation (Magnetic Thickness Detector and 56-Arm Multi-Finger Caliper) Company: Well: Field: Province: Alberta Country: Canada License: UWI: Logging Date: 21-Mar-2015 Report Date:

More information

TFRRS CSV Results Format

TFRRS CSV Results Format TFRRS CSV Results Format March 17 th, 2015 Version 2.0 DirectAthletics 2015 Table of Contents I. Introduction... 2 The TFRRS Website...2 II. The Track & Field CSV File Format... 2 Appendix 1: Track & Field

More information

Prerequisites: Layout:

Prerequisites: Layout: Create a.csv File to Import Competitors. Excel, (and most other spread sheets), databases and some word processors allow the export of data ("save as..") a 'flat' text file in Comma Separated Variables

More information

CHAPTER 2 Modeling Distributions of Data

CHAPTER 2 Modeling Distributions of Data CHAPTER 2 Modeling Distributions of Data 2.1 Describing Location in a Distribution The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers 2.1 Reading Quiz

More information

5.1. Data Displays Batter Up. My Notes ACTIVITY

5.1. Data Displays Batter Up. My Notes ACTIVITY SUGGESTED LEARNING STRATEGIES: Activating Prior Knowledge, Marking the Text, Group Presentation, Interactive Word Wall Henry Hank Aaron and Harmon Killebrew are among the alltime leaders in home runs in

More information

WIND DATA REPORT. Quincy DPW, MA

WIND DATA REPORT. Quincy DPW, MA WIND DATA REPORT Quincy DPW, MA March 1 st 2007 to May 31 st 2007 Prepared for Massachusetts Technology Collaborative 75 North Drive Westborough, MA 01581 by James R. Browning James F. Manwell Anthony

More information

Announcements. Unit 7: Multiple Linear Regression Lecture 3: Case Study. From last lab. Predicting income

Announcements. Unit 7: Multiple Linear Regression Lecture 3: Case Study. From last lab. Predicting income Announcements Announcements Unit 7: Multiple Linear Regression Lecture 3: Case Study Statistics 101 Mine Çetinkaya-Rundel April 18, 2013 OH: Sunday: Virtual OH, 3-4pm - you ll receive an email invitation

More information

IBIS File : Problems & Software Solution

IBIS File : Problems & Software Solution UAq EMC Laboratory IBIS File : Problems & Software Solution F. de Paulis, A. Orlandi, D. Di Febo UAq EMC Laboratory, University of L Aquila, L Aquila Italy European IBIS Summit Naples, May 11, 2011 How

More information

QT2 - Coach s Reference

QT2 - Coach s Reference QT2 - Coach s Reference Histograms and Long Term Planning (supra-macro) Buildup Rate The first week of each block should not exceed the highest week of the previous block during the base phase. During

More information