Outline. Terminology. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Steps in Capacity Planning and Management

Similar documents
EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 6. Wenbing Zhao. Department of Electrical and Computer Engineering

1. The data below gives the eye colors of 20 students in a Statistics class. Make a frequency table for the data.

Organizing Quantitative Data

Opleiding Informatica

Chapter 5: Methods and Philosophy of Statistical Process Control

Training Fees 3,400 US$ per participant for Public Training includes Materials/Handouts, tea/coffee breaks, refreshments & Buffet Lunch.

Internet Technology Fundamentals. To use a passing score at the percentiles listed below:

SIDRA INTERSECTION 6.1 UPDATE HISTORY

Queue analysis for the toll station of the Öresund fixed link. Pontus Matstoms *

Note that all proportions are between 0 and 1. at risk. How to construct a sentence describing a. proportion:

Application of Bayesian Networks to Shopping Assistance

Mark Scheme (Results) Summer 2009

March Madness Basketball Tournament

Chapter 2 - Frequency Distributions and Graphs

March Madness Basketball Tournament

8th Grade. Data.

STT 315 Section /19/2014

Time Pressure Dispensing

(Lab Interface BLM) Acceleration

Lab 1. Adiabatic and reversible compression of a gas

7 th International Conference on Wind Turbine Noise Rotterdam 2 nd to 5 th May 2017

Exercise 3. Power Versus Wind Speed EXERCISE OBJECTIVE DISCUSSION OUTLINE. Air density DISCUSSION

Fractions Print Activity. Use the Explore It mode to answer the following questions:

Walk - Run Activity --An S and P Wave Travel Time Simulation ( S minus P Earthquake Location Method)

Acknowledgement: Author is indebted to Dr. Jennifer Kaplan, Dr. Parthanil Roy and Dr Ashoke Sinha for allowing him to use/edit many of their slides.

POWER Quantifying Correction Curve Uncertainty Through Empirical Methods

Title: Modeling Crossing Behavior of Drivers and Pedestrians at Uncontrolled Intersections and Mid-block Crossings

Tutorial for the. Total Vertical Uncertainty Analysis Tool in NaviModel3

A Nomogram Of Performances In Endurance Running Based On Logarithmic Model Of Péronnet-Thibault

Advanced Hydraulics Prof. Dr. Suresh A. Kartha Department of Civil Engineering Indian Institute of Technology, Guwahati

Chapter 2: Visual Description of Data

Chapter 11 Waves. Waves transport energy without transporting matter. The intensity is the average power per unit area. It is measured in W/m 2.

Broadly speaking, there are four different types of structures, each with its own particular function:

Chapter 2 Displaying and Describing Categorical Data

5.1 Introduction. Learning Objectives

FISH 415 LIMNOLOGY UI Moscow

Manual for continuous distillation

2600T Series Pressure Transmitters Plugged Impulse Line Detection Diagnostic. Pressure Measurement Engineered solutions for all applications

Calculation of Trail Usage from Counter Data

Golf Genius Software

Meter Data Distribution Market Trials

Reversible lane: Exploring Lane Management Alternatives. BY Vijay R Ezapa Arvind Emelinda M Parentela

FISH 415 LIMNOLOGY UI Moscow

Friday 13 June 2014 Morning

STAT 155 Introductory Statistics. Lecture 2: Displaying Distributions with Graphs

Maximum CPU utilization is obtained by multiprogramming. CPU I/O Burst Cycle Process execution consists of a cycle of CPU execution and I/O wait

Safety Performance of Two-Way Stop-Controlled Expressway Intersections

FUNCTIONAL SKILLS MATHEMATICS (level 1)

PGA Tour Scores as a Gaussian Random Variable

by Robert Gifford and Jorge Aranda University of Victoria, British Columbia, Canada

Safety and Design Alternatives for Two-Way Stop-Controlled Expressway Intersections

Handicapping Process Series

PSY201: Chapter 5: The Normal Curve and Standard Scores

How Many Iterations Are Enough?

Operating Instructions METTLER TOLEDO Pipette Check Application for AX and MX/UMX Balances Version 1.xx

STANDARD SCORES AND THE NORMAL DISTRIBUTION

Turn Lane Warrants: Concepts, Standards, Application in Review

WIND DATA REPORT. Paxton, MA

S-CURVE DOCUMENTATION AND TUTORIAL FOR THE S-CURVE ON THE COOL MUSCLE

Recommended Roadway Plan Section 2 - Land Development and Roadway Access

CHAPTER 1 ORGANIZATION OF DATA SETS

Digital Level Control One and Two Loops Proportional and Integral Control Single-Loop and Cascade Control

Wind Regimes 1. 1 Wind Regimes

Analysis of Variance. Copyright 2014 Pearson Education, Inc.

WIND DATA REPORT. Paxton, MA

RELATIVE PLACEMENT SCORING SYSTEM

Chapter 2: Modeling Distributions of Data

Practice Test Unit 06B 11A: Probability, Permutations and Combinations. Practice Test Unit 11B: Data Analysis

Characterization of Boxed Beef Value in Angus Sires

Age of Fans

8-1. The Pythagorean Theorem and Its Converse. Vocabulary. Review. Vocabulary Builder. Use Your Vocabulary

CHAPTER 2 Modeling Distributions of Data

Answers Written Exam Public Transport

LOW PRESSURE EFFUSION OF GASES revised by Igor Bolotin 03/05/12

SHIMADZU LC-10/20 PUMP

Ozobot Bit Classroom Application: Boyle s Law Simulation

APPROVED FACILITY SCHOOLS CURRICULUM DOCUMENT SUBJECT: Mathematics GRADE: 6. TIMELINE: Quarter 1. Student Friendly Learning Objective

Chapter 11 Waves. Waves transport energy without transporting matter. The intensity is the average power per unit area. It is measured in W/m 2.

Database of Winds and Waves. -internet version-

Solutionbank S1 Edexcel AS and A Level Modular Mathematics

Title: 4-Way-Stop Wait-Time Prediction Group members (1): David Held

Year 10 Term 2 Homework

Lab # 03: Visualization of Shock Waves by using Schlieren Technique

Preliminary design of a high-altitude kite. A flexible membrane kite section at various wind speeds

Edexcel GCSE. Mathematics A 1387 Paper 5523/03. November Mark Scheme. Mathematics A 1387

Describing a journey made by an object is very boring if you just use words. As with much of science, graphs are more revealing.

Marking Guidelines 2009 examination June series. Physics Investigative Skills Assignment (ISA) Q. General Certificate of Education

C1960. Multi-recipe profile recorder/controller. Measurement made easy

Title: Standard Operating Procedure for Elemental and Organic Carbon (EC and OC) using Non-Dispersive Infrared Detection (NDIR)

Besides the reported poor performance of the candidates there were a number of mistakes observed on the assessment tool itself outlined as follows:

MARK SCHEME for the October/November 2012 series 0610 BIOLOGY. 0610/51 Paper 5 (Practical Test), maximum raw mark 40

Cambridge International Examinations Cambridge Ordinary Level

Preparation for Salinity Control ME 121

Quick Guide. SGX-CA500 Cycle Computer SGY-PM Series Power Meters

5.1. Data Displays Batter Up. My Notes ACTIVITY

End of Chapter Exercises

BEFORE YOU OPEN ANY FILES:

A statistical model of Boy Scout disc golf skills by Steve West December 17, 2006

Analyzing Categorical Data & Displaying Quantitative Data Section 1.1 & 1.2

Running head: DATA ANALYSIS AND INTERPRETATION 1

Transcription:

EEC 686/785 Modeling & Performance Evaluation of Computer Systems Lecture 6 Department of Electrical and Computer Engineering Cleveland State University wenbing@ieee.org Outline Review of lecture 5 The art of data presentation Gnuplot Histogram Gantt charts Kiviat graphs Schumacher charts Ratio games 2 Terminology 3 Steps in Capacity Planning and Management Capacity planning: ensuring that adequate computer resources will be available to meet the future workload demands in a cost-effective manner while meeting the performance objectives Capacity management: ensuring that the currently available computing resources are used to provide the highest performance Capacity management is concerned with the present while capacity planning is concerned with the future Benchmarking: to compare the performance of two competing systems in an objective manner, benchmarks are run on these systems using automatic load drivers For planning as well as management, the steps are basically the same: Instrument the system Monitor system usage Characterize workload Predict performance under different alternatives Select the lowest cost, highest performance alternative Instrument the system Change system parameters No Monitor usage System model Are cost and performance acceptable? Characterize workload Forecast workload Done Yes 1

Load Drivers 5 Types of Variables 6 Internal-driver method: loading programs directly into the memory and executing Problem 1: the effect of the terminal communication overhead is not visible Problem 2: loading overhead may affect the system performance Remote-terminal emulators (RTEs): the computers that are used to put the load on the system System under test Remote terminal emulator Variables Qualitative Quantitative Ordered Unordered Discrete Continuous Guidelines for Preparing Good Graphic Charts Require minimum effort from the reader Maximize information: there should be sufficient information on the graph to make it self-sufficient Minimize ink: present as much information as possible with as little ink as possible. Too much unnecessary info on a chart makes it cluttered and uninteresting Use commonly accepted practices: present what people expect Avoid ambiguity: show coordinate axes, scale division, and origin. Identify individual curves and bars. Do not plot multiple variables in the same chart 7 Gnuplot Gnuplot is a portable command-line driven interactive data and function plotting utility for many different operating systems, including MS Windows and Linux The software is copyrighted but freely distributed Gnuplot supports many types of plots in either 2D and 3D It can draw using lines, points, boxes, contours, vector fields, surfaces, and various associated text It also supports various specialized plot types Gnuplot Website: http://www.gnuplot.info/ 8 2

Histogram 9 Histogram 10 A histogram is defined as a bar graph that shows frequency data Differences between histograms and bar graphs Think of histograms as sorting bins Given one variable, sort data by this variable by placing them into "bins" Then count how many pieces of data are in each bin. The height of the rectangle on top of each bin is proportional to the number of pieces in that bin Bar graphs show the comparison of several measurements of different items The main question a histogram answers is: "How many measurements are there in each of the classes of measurements?" The main question a bar graph answers is: "What is the measurement for each item?" Gantt Charts Gantt chart can be used to show the relative duration of any number of Boolean conditions, i.e., conditions that are either true or false A resource being used or being idle is an example of a Boolean condition Each condition is shown to be a set of horizontal line segments The total length of the line segments represents the relative duration of the condition The position of various segments is arranged such that the overlap between different lines represents the overlap between the conditions 11 Gantt Charts Gantt chart is particularly useful to show the utilization profiles of the overlap among different resources Any resource whose utilization is too high is a bottleneck and may degrade the performance A resource with very low utilization represents inefficiency in the system The overlap of utilization of different resources is a good thing 12 3

Gantt Charts 13 Procedure to Develop a Gantt Chart from Raw Data 1 CPU utilization: 60% I/O channel utilization: 0% Overlap between the CPU and I/O channel is 20% Network link utilization: 60% During 30% of the time, the CPU and network but not the I/O channel are used During 10% of the time all three resources are used For 5% of the time, I/O and network are used but the CPU is idle Network alone is used for 15% of the time Example 10.1: four resources: CPU (A) Two I/O channels (B and C) A network link (D) => 16 possible states Procedure to Develop a Gantt Chart from Raw Data 15 Procedure to Develop a Gantt Chart from Raw Data 16 First level is divided into two parts representing A and Ā A dotted vertical line is drawn to divide all subsequent levels in two parts as well The sum of 8 entries with A=1 is 55% In the final chart, the division corresponding to A and Ā will have length proportional to 55 and 5, respectively. This is indicated by the labels below A and Ā Final Gantt chart for example 10.1

Kiviat Graphs 17 Example 10.2 18 Kiviat graph: a circular graph in which several different performance metrics are plotted along radial lines In the most popular version of the graph, an even number of metrics are used. Half of these metrics are HB metrics so that a higher value of the metrics is considered better. The other half of the metrics measure are LB metrics, and a lower value is considered better Kiviat graph for an ideal system is star Eight different metrics CPU busy percentage of time CPU busy, i.e., CPU utilization, a HB metric (the resource is being used and not wasted) CPU only busy percentage of time CPU only is busy, measures the fraction of time when only the CPU is used and none of the I/O channels are used, an LB metric CPU/channel overlap measures the percentage of time the CPU is busy along with at least one I/O channel, a HB metric Channel only busy measures the time when the channel is being used but the CPU is idle. The CPU must be waiting for the I/O, an LB metric Example 10.2 19 Example 10.2 20 Eight different metrics Any channel busy a HB metric CPU wait CPU waiting for I/O completion, resource is wasted, an LB metric CPU in problem state indicates time used executing the user s program, a HB metric CPU in supervisor state indicates the time spent executing the operating system code, an LB metric Kiviat graph for a balanced system Problem with this set of metrics: only independent metrics CPU busy = problem state + supervisor state CPU wait = 100 CPU busy Channel only = any busy CPU/channel overlap CPU only = CPU busy CPU/channel overlap 5

Shapes of Kiviat Graphs 21 Shapes of Kiviat Graphs 22 CPU keelboat Kiviat graph for a CPU-bound system I/O wedge Kiviatgraph for an I/O-bound system I/O arrow Kiviatgraph for a CPU- and I/O-bound system Kiviat Graphs Which system is more balanced? 23 Figure of Merit The square root of the area of the reversed Kiviat graph can be used to compare the overall goodness of systems Consider a Kiviat graph with 2n axes. Let the percentage of performance values for a system be {x 1, x 2,, x 2n }, where odd values x 1, x 3,, x 2n-1 represent good metrics and the even values represent bad metrics All x i s are percentages and lie between 0 and 100 Figure of Merit (FOM) is computed as follows: n 1 FOM = ( x2i 1+ x2i+ 1)(100 x2i) 2n i= 1 1/2 2 6

Example 10.3 25 Application of Kiviat Graphs 26 System x 1 x 2 x 3 x x 5 x 6 x 7 x 8 A 100 60 0 0 0 0 0 60 B 70 30 0 30 70 30 0 30 1 FOM A = {(100 + 0)(100 60) + (0 + 0)(100 0) + (0 + 0)(100 0) + (0 + 100)(100 60)} 8 [ ] = (5600 + 8000 + 8000 + 5600) / 8 = 28200 / 8 = 58 1/2 1/2 Most of the literature on Kiviat graphs is on data processing systems Kiviat graph can be extended to networks, databases, and other types of computer systems 1 FOM B = {(70 + 0)(100 30) + (0 + 70)(100 30) + (70 + 0)(100 30) + (0 + 70)(100 30)} 8 [ ] = (7700 + 7700 + 7700 + 7700) / 8 = 30800 / 8 = 62 1/2 1/2 Application of Kiviat Graphs 27 Schumacher Charts 28 Schumacher Chart: another reporting format to show daily, weekly, monthly, quarterly, or yearly resource usage Any number of performance metrics can be plotted in tabular form The values are normalized with respect to the long-term mean and standard deviation Any observations that are beyond the mean plus or minus one standard deviation need to be explained 7

29 30 Decision Maker s Games Ratio Games Even if the performance analysis is correctly done and presented, it may not be enough to persuade your audience the decision makers to follow your recommendations Reasons for rejection: The problem needs more analysis Workload/metrics/configuration/deta is always based on past measurements, their applicability to the current or future environment can always be questioned Ratios provide good opportunities for playing performance games with competitors Ratios have a numerator and denominator (base) Two ratios with different bases are not comparable However, many examples in published literature where computer scientists have knowingly or unknowingly compared ratios with different bases Ratio game: the technique of using ratios with incomparable bases and combining them to one s advantage 31 32 Ratio Games Example of Ratio Games Take an average of each individual system s performance and then take a ratio Normalize each system s performance on each workload by that of system A and then take an average of ratios Normalize each system s performance on each workload by that of system B and then take an average of ratios Ratio of totals: 6502 is worse. It takes.7% more time With 6502 as a base: 6502 is better (1% more time) With 8080 as a base: 6502 is worse (6% more time) 8

Example of Ratio Games 33 Example of Ratio Games 3 Sum: Z8002 requires 9% less code than RISC-I RISC-I as base: Z8002 has 18% more code than RISC-I 35 36 Using an Appropriate Ratio Metric Using Relative Performance Enhancement Another form in which ratio games are used to show better performance is by choosing a suitable performance metric which is a ratio of two different metrics Network A B Network A B Throughput 10 Throughput 10 Response 2 1 Response 2 1 Power 5 If the performance metric is already specified, it is possible to show that relative increase in the performance is better using one type of enhancement than another provided the two are tried on different machines (correct way to compare is to try on the same machine) Floating-point accelerator A on X Floating-point accelerator B on Y Alternative A on X B on Y Alternative Without 2 3 Without With 5 2 3 With 5 Ratio 2.00 1.66 9

Ratio Games with Percentages Percentages are basically ratios. They allow playing ratio games in ways that do not look like ratios Example 11.3. => In fact, neither comparison used compatible bases: (a) number of tests used in each test as base (b) number of tests combined as base 37 Ratio Games with Percentages Other ways to misuse percentages They can be used to have a psychological impact on the reader. For example, 1000% improvement in performance sounds more impressive than an 11-time improvement This is particularly useful when the performance before the improvement and the absolute increase in the performance are not impressive 38 Ratio Games with Percentages 39 Strategies for Winning a Ratio Game 0 Other ways to misuse percentages Small sample sizes can be neatly disguised by specifying the percentage. Saying that 83.3% of the universities in town use system X is more impressive than saying that 5 out of 6 universities use the system The base in the percentage should be the initial value The memory prices have gone done 00% as if they are now paying you to buy memory it should be the current price is 1/5 of the original If one system is better on all benchmarks, contradicting conclusion cannot be drawn by any ratio game technique 10

Strategies for Winning a Ratio Game 1 Strategies for Winning a Ratio Game 2 Even if one system is better than the other on all benchmarks, a better relative performance can be shown by selecting the appropriate base In table 11.9, A is 0% better than B using raw data, 3% better using A as base, 2% better using B as base A designer would prefer to use the raw-data averages If one system is better on some benchmarks and worse on others, contradicting conclusions can be drawn in some cases The easiest way to verify whether a contradictory conclusion can be drawn for a particular data set is to try all possible bases The next several rules may help select the base Strategies for Winning a Ratio Game 3 Strategies for Winning a Ratio Game If the performance metric is an LB metric, it is better to use your system as the base The execution time is an LB metric, and as shown in table 11.9, system A designers would be better off using system A as the base. B designers would prefer to use B as the base If the performance metric is an HB metric, it is better to use your opponent as the base The throughputs, efficiency, MPS, and MFLOPS are examples of HB metrics A higher average ratio would be obtained for A if B is used as a base 11

Strategies for Winning a Ratio Game Those benchmarks that perform better on your system should be elongated and those that perform worse should be shortened The time duration of the benchmarks is often adjustable For example, in Sieve benchmark, the number of prime numbers to be generated can be set by the experimenter Elongating a benchmark in effect increases the weight on the favorable benchmark 5 Derivation of the Rules Assume the performance metric is an HB metric. Consider the case of two systems A and B on two benchmarks I and J etc. as shown in table 11.10 6 Derivation of the Rules 7 Derivation of the Rules 8 Using raw data, A is better if and only if: a a+ b y< x+ b b Using A as base, A is considered better if and only if: y< 2 x Using B as base, A is considered better if and only if: x y < 2x 1 This explains why it is better to use your opponent s system as base for HB metric 12

Correct Analysis 9 The main reason why the analysis of ratios results in contradicting results is that the approach of taking a mean of the ratio is wrong The approach completely ignores the fact that the performance is affected by several factors and by experimental errors A statement about one factor can be made only if the effects of different factors and errors are first isolated The isolation requires developing a model of the way these factors and experimental errors interact with each other Techniques for this require knowledge of several probabilistic, statistical, and experimental design concepts 13