Fundamental Algorithms for System Modeling, Analysis, and Optimization

Similar documents
Efficient Algorithms for finding a Trunk on a Tree Network and its Applications

Design Engineering Challenge: The Big Dig Contest Platter Strategies: Ball Liberation

Red-Black Trees Goodrich, Tamassia Red-Black Trees 1

CS3350B Computer Architecture. Lecture 6.2: Instructional Level Parallelism: Hazards and Resolutions

Noncrossing Trees and Noncrossing Graphs

Lecture Topics. Overview ECE 486/586. Computer Architecture. Lecture # 9. Processor Organization. Basic Processor Hardware Pipelining

Multi-Robot Forest Coverage

Depth-first search and strong connectivity in Coq

Design and Simulation Model for Compensated and Optimized T-junctions in Microstrip Line

Trees & Routing (1) BFS

arxiv:cs/ v1 [cs.ds] 8 Dec 1998

Lecture 22. Humming Noise in Audio Amplifier. Supply Ripple Rejection. Ripple Free Differential Output. Common Inputs to Differential Amp.

Faster Nearest Neighbors: Voronoi Diagrams and k-d Trees

Morrison Drive tel. Ottawa, ON, Canada K2H 8S fax. com

Data Sheet. Linear bearings

Incorporating Location, Routing and Inventory Decisions in Dual Sales Channel - A Hybrid Genetic Approach

CORESTA RECOMMENDED METHOD N 68

Torque. Physics 2. Prepared by Vince Zaccone For Campus Learning Assistance Services at UCSB

Multi-Robot Flooding Algorithm for the Exploration of Unknown Indoor Environments

Electrical Equipment of Machine Tools

10 Torque. Lab. What You Need To Know: Physics 211 Lab

OPTIMAL SCHEDULING MODELS FOR FERRY COMPANIES UNDER ALLIANCES

Lecture 18. ANNOUNCEMENTS HW#10 will be posted tonight OUTLINE. Basic MOSFET amplifier MOSFET biasingi MOSFET current sources Common source amplifier

Lecture 8. OUTLINE BJT Amplifiers (cont d) Common emitter topology CE stage with emitter degeneration Impact of Early effect (r o )

Matlab Simulink Implementation of Switched Reluctance Motor with Direct Torque Control Technique

Cyclostrophic Balance in Surface Gravity Waves: Essay on Coriolis Effects

An Auction Based Mechanism for On-Demand Transport Services

Numerical study of super-critical carbon dioxide flow in steppedstaggered

A Deceleration Control Method of Automobile for Collision Avoidance based on Driver's Perceptual Risk

Experiment #10 Bio-Physics Pre-lab Comments, Thoughts and Suggestions

High Axial Load Capacity, High speed, Long life. Spherical Roller Bearings

DETC A NEW MODEL FOR WIND FARM LAYOUT OPTIMIZATION WITH LANDOWNER DECISIONS

Experiment #10 Bio-Physics Pre-lab Questions

A Three-Axis Magnetic Sensor Array System for Permanent Magnet Tracking*

ECE Introduction to Comm/SP, Fall 2011 Homework #5 Due: 11/04/11

MODELLING THE INTERACTION EFFECTS OF THE HIGH-SPEED TRAIN TRACK BRIDGE SYSTEM USING ADINA

An integrated supply chain design model with random disruptions consideration

Three-axis Attitude Control with Two Reaction Wheels and Magnetic Torquer Bars

Complexity of Data Tree Patterns over XML Documents

ABriefIntroductiontotheBasicsof Game Theory

The Study About Stopping Distance of Vehicles

Lesson 33: Horizontal & Vertical Circular Problems

The Solution to the Bühlmann - Straub Model in the case of a Homogeneous Credibility Estimators

CPE/EE 427, CPE 527 VLSI Design I L21: Sequential Circuits. Review: The Regenerative Property

VLSI Design I; A. Milenkovic 1

Cheat-Proof Playout for Centralized and Distributed Online Games

Angle-restricted Steiner arborescences for flow map layout Buchin, K.A.; Speckmann, B.; Verbeek, K.A.B.

Carnegie Mellon University Forbes Ave., Pittsburgh, PA command as a point on the road and pans the camera in

AIRPLANE PAVEMENT MARKINGS

Phase Behavior Introduction to Phase Behavior F.E. Londono M.S. Thesis (2001)

POSSIBLE AND REAL POWERFLOWS IN CONNECTED DIFFERENTIAL GEAR DRIVES WITH η 0 <i pq <1/η 0 INNER RATIO

A Study on Brushless DC Motor for High Torque Density

A CONCEPTUAL WHEELED ROBOT FOR IN-PIPE INSPECTION Ioan Doroftei, Mihaita Horodinca, Emmanuel Mignon

ELECTRICAL INSTALLATION CERTIFICATE [BS 7671:2008 as amended] Details of of the the Installation. For Design. For Construction

f i r e - p a r t s. c o m

LOADS OF SEWER MANHOLES WITHIN MINING AREA

EcoMobility World Festival 2013 Suwon: an analysis of changes in citizens awareness and satisfaction

Color Encodings: srgb and Beyond

THE GREAT CARDBOARD BOAT RACE INTRODUCTION, BOAT BUILDING & RULES by United Way of Elkhart County

A Machine Vision based Gestural Interface for People with Upper Extremity Physical Impairments

Interior Rule of the Quebec Open 2017

Experimental and Numerical Studies on Fire Whirls

The Properties of. Model Rocket Body Tube Transitions

Physics 1 HW #8: Chapter 3

Asteroid body-fixed hovering using nonideal solar sails

EC-FRM: An Erasure Coding Framework to Speed up Reads for Erasure Coded Cloud Storage Systems

Pennsylvania Congestion Management System: PA 100 Corridor. June 2002

TLV493D-A1B6 3D Magnetic Sensor

A Collision Risk-Based Ship Domain Method Approach to Model the Virtual Force Field

Motivation. Prize-Collecting Steiner Tree Problem (PCSTP) Kosten und Profite. Das Fraktionale Prize-Collecting Steiner Tree Problem auf Baumgraphen

Rearranging trees for robust consensus

Watford Half Marathon. Watford Half Marathon. Sunday February 4th Starting at 10.30am. Enjoy Your Run!!! Notice to all Entrants.

Tree. Tree. Siblings Grand Parent Grand Child H I J I J K. Tree Definitions

Fire-Safety Analysis Timber. FRILO Software GmbH As of 29/06/2016

Multiple Vehicle Driving Control for Traffic Flow Efficiency

Target Allocation Study for Formation Ship-ToAir Missile System Based on the Missile Fire. Zone Division

Physical Design of CMOS Integrated Circuits

DAM SITE 2017 STAGE 1

tr0 TREES Hanan Samet

Range Extension Control System for Electric Vehicles Based on Front and Rear Driving Force Distribution Considering Load Transfer

Watford Half Marathon. Watford Half Marathon. Sunday February 5th Starting at 10.30am. Enjoy Your Run!!! Notice to all Entrants.

Installation and Operation Instructions

Performance Characteristics of Parabolic Trough Solar Collector System for Hot Water Generation

THE IMPACTS OF CONGESTION ON COMMERCIAL VEHICLE TOUR CHARACTERISTICS AND COSTS

British Prime Minister Benjamin Disraeli once remarked that

SHRiMP: Accurate Mapping of Short Color-space Reads

DECO THEORY - BUBBLE MODELS

A Method of Hand Contour Tracking based on GVF, Corner and Optical flow Ke Du1, a, Ying Shi1,b, Jie Chen2,c, MingJun Sun1, Jie Chen1, ShuHai Quan1

Project Proposal: Characterization of Tree Crown Attributes with High Resolution Fixed-Base Aerial Photography. by Rich Grotefendt and Rob Harrison

FALL PROTECTION PROGRAM

Provably Secure Camouflaging Strategy for IC Protection

PREDICTION OF THIRD PARTY DAMAGE FAILURE FREQUENCY FOR PIPELINES TRANSPORTING MIXTURES OF NATURAL GAS AND HYDROGEN Zhang, L. 1, Adey, R.A.

Optimizing Cyclist Parking in a Closed System

Forschungszentrum Karlsruhe Technik und Umwelt. LINEFIT concept. Problem: Transition from measured transmission spectrum to ILS.

College Hill Carthage. Winton Hills. Mill Creek. Spring Grove Village Paddock Hills. North Avondale. Avondale. Evanston. CUF Walnut Hills.

tr0 TREES Hanan Samet

Tracking of High-speed, Non-smooth and Microscale-amplitude Wave Trajectories

THE performance disparity between processor speed and the

I. FORMULATION. Here, p i is the pressure in the bubble, assumed spatially uniform,

OUTDOOR FOOTBALL FIELD

Transcription:

Fundamental Algoithms fo System Modeling, Analysis, and Optimization Edwad A. Lee, Jaijeet Roychowdhuy, Sanjit A. Seshia UC Bekeley EECS 44/44 Fall With thanks to R. K. Bayton, K. Keutze, N. Shenoy, and A. Kuehlmann Copyight -, E. A. Lee, J. Roydhowdhuy, S. A. Seshia, All ights eseed Lectue N: Retiming Retiming Tadeoffs a" b" " " " " " " clock peiod =" # egistes =" 6" 4" [Shenoy, 997] EECS 44/44, UC Bekeley: l

Retiming Tadeoffs a" b" " " " " " " clock peiod =" 5" # egistes =" 4" [Shenoy, 997] EECS 44/44, UC Bekeley: Retiming Tadeoffs a" b" " " " " " " clock peiod =" # egistes =" 4" " [Shenoy, 997] EECS 44/44, UC Bekeley: 4 l

Retiming Tadeoffs a" b" " " " " " " clock peiod =" # egistes =" " 4" [Shenoy, 997] EECS 44/44, UC Bekeley: 5 Goals of Retiming Possible goals: Minimize clock peiod min-peiod etiming Minimize numbe of egistes min-aea etiming Minimize numbe of egistes fo a taget clock peiod constained min-aea etiming a b clock peiod = 6 5 4 # egistes = 4 4 4 [Shenoy, 997] EECS 44/44, UC Bekeley: 6 l

Abstaction a" b" " " " " " " clock peiod =" # egistes =" 6" 4" EECS 44/44, UC Bekeley: 7 Abstaction - Gaph Nodes: cicuit elements Node weights: delay Dummy node fo fanout EECS 44/44, UC Bekeley: 8 l 4

Abstaction - Registes Nodes: cicuit elements Node weights: delay Dummy node fo fanout EECS 44/44, UC Bekeley: 9 Abstaction - Registes Ac weights indicate egistes EECS 44/44, UC Bekeley: l 5

Abstaction - Enionment EECS 44/44, UC Bekeley: Cutset Diides the Gaph in Two EECS 44/44, UC Bekeley: l 6

Retiming: Add a egiste on all acs cossing the cutset in one diection, and subtact a egiste fom all acs cossing the cutset in the othe diection. += += -= -= EECS 44/44, UC Bekeley: Recall: Retiming Tadeoffs a" b" " " " " " " clock peiod =" # egistes =" 6" 4" [Shenoy, 997] EECS 44/44, UC Bekeley: 4 l 7

Recall: Retiming Tadeoffs a" b" " " " " " " clock peiod =" 5" # egistes =" 4" [Shenoy, 997] EECS 44/44, UC Bekeley: 5 Simplest cutset suounds one node -= -= += EECS 44/44, UC Bekeley: 6 l 8

Recall: Retiming Tadeoffs a" b" " " " " " " clock peiod =" 5" # egistes =" 4" [Shenoy, 997] EECS 44/44, UC Bekeley: 7 Recall: Retiming Tadeoffs a" b" " " " " " " clock peiod =" # egistes =" 4" " [Shenoy, 997] EECS 44/44, UC Bekeley: 8 l 9

Fo each node, define = # of egistes moed fom the outputs to the inputs. -= = - -= += A etiming is now an assignment fo eey node such that the weight of eey ac is non-negatie. EECS 44/44, UC Bekeley: 9 Poblem Setup Fo a path p: k, e i = i, i+ d p = w p = Clock cycle c = k i= k i= d i w e i max { d p} p : w p = Fo example on ight: c =? includes endpoints 7 EECS 44/44, UC Bekeley: l

Poblem Setup Fo a path p: k, e i = i, i+ d p = w p = Clock cycle c = k i= k i= d i w e i max { d p} p : w p = delay of longest path without egistes Fo example on ight: c = includes endpoints 7 Path with wp= EECS 44/44, UC Bekeley: Basic Opeation Thus in the example, u = -, = - esults in 7 u 7 u Fo a path p: s t, W p = wp + t - s Retiming : V Z, an intege etex labeling w e = we + - u fo edge e = u, A etiming is legal if w e, e E EECS 44/44, UC Bekeley: l

Retiming - Assumptions Each loop in cicuit contains at least one egiste Cicuit uses single clock and edge-tiggeed elements identical skew Gate delay is constant and non-negatie Registes ae ideal set-up, die independent of load Any powe-up state of the design can be safely handled by the enionment initial state assumption EECS 44/44, UC Bekeley: Retiming - Fomulation Come up with alues: Assign integes to each etex so that objectie is met Valid etiming constaints a" b" p" a" b" e" w " e = we" + b" - a" > " a" b" w p = wp + b - a" " EECS 44/44, UC Bekeley: 4 l

Retiming fo Minimum Clock Cycle Poblem Statement: Minimum cycle time Gien GV, E, d, w, find a Legal etiming so that c = is minimized impotant matices Registe weight matix Delay matix max { d p} p: W p = A p W u, = min{ w p : u } p D u, = max{ d p : u, w p = W u, } D u, > c W u, B EECS 44/44, UC Bekeley: 5 Retiming fo Minimum Clock Peiod V 7 V W egiste path weight matix, min # of egistes on all paths between u and D path delay matix, max delay among all paths between u and with Wu, minimum egistes W V V V V D V V V V V V V V 6 6 7 7 V V V V Note that the D matix indicates that the least possible clock peiod is 7, and a peiod of will obiously wok, so the minimum clock peiod is between 7 and, inclusie. Binay seach of these possibilities will wok. EECS 44/44, UC Bekeley: 6 l

Conditions fo Retiming Suppose we need to check if a etiming exists fo a clock cycle α Legal etiming: w e fo all e. Hence w e = we + - u o u - w e Fo all paths p: u such that dp α, we equie w p Thus k w p = w e i k i= = [w e i + i+ i ] i= = w p + k = w p + u O take the least wp tightest constaint u- Wu,- i.e. thee ae many paths p, choose the p that gies tightest constaint Note: we just need to apply it to u, such that Du, > α EECS 44/44, UC Bekeley: 7 Soling the Constaints All constaints in diffeence of aiables fom How to sole? Conside ou example fo α = 7 Legal: u-we Notice that these constaints ae unaffected by adding o subtacting any constant to/ fom all i. Why? D>7: u-wu,- V V V V W V V V V V 7 D V V V V 6 6 7 7 V EECS 44/44, UC Bekeley: 8 V V V V l 4

l 5 EECS 44/44, UC Bekeley: 9 Constaint gaph has one ac fo each diffeence-of-two-aiables inequality. Soling the Constaints: Constuct a Constaint Gaph Legal: u-we D>7: u-wu,- -,-,- - EECS 44/44, UC Bekeley: A solution to the constaints, if it exists: i is the minimum path weight fom the dummy node to node i. No solution exists if thee ae cycles with negatie weight. Why? Soling the Constaints: Add a dummy stat node to the constaint gaph Legal: u-we D>7: u-wu,- -,-,- -

Soling the Constaints: Use the Bellman-Fod Algoithm: O V Cannot use Dijksta s algoithm, which woks to find minimum path weights only if the weights all hae the same sign. Bellman-Fod can detect whee thee ae cycles with negatie weight. Legal: u-we Notice that this algoithm will only yield nonpositie alues of i. Why is this OK? D>7: u-wu,- - - -,- -,- A solution is = =, = = - EECS 44/44, UC Bekeley: Retiming Solution To find the minimum cycle time, do a binay seach among the enties of the D matix V log V Clock peiod = ++7= 7 V V Retime V V V V W V V V V D V V V V 6 6 7 7 V V V V 7 V Clock peiod= 7 V EECS 44/44, UC Bekeley: l 6

Dawbacks of this Algoithm l Requies W/D matix computation l O V clock peiod constaints, most of which ae edundant l Aeage case is wost case Fotunately, thee s anothe algoithm we can use: l elaxation-based algoithm called FEAS l See: [Shenoy, 997] EECS 44/44, UC Bekeley: Applications beyond cicuits In a dataflow gaph, nodes epesent actos, which fie when input tokens ae aailable. Fiing pefoms a computation that takes time. Weights epesent initial tokens. Retiming can be intepeted as a peamble to a peiodic schedule, and may hae the goal of maximizing paallelism so that the dataflow gaph executes fast on a multicoe machine. EECS 44/44, UC Bekeley: 4 l 7

Applications beyond cicuits In a dataflow gaph, nodes epesent actos, which fie when input tokens ae aailable. Fiing pefoms a computation that takes time. Weights epesent initial tokens. Retiming can be intepeted as a peamble to a peiodic schedule, and may hae the goal of maximizing paallelism so that the dataflow gaph executes fast on a multicoe machine. Fie! EECS 44/44, UC Bekeley: 5 Applications beyond cicuits In a dataflow gaph, nodes epesent actos, which fie when input tokens ae aailable. Fiing pefoms a computation that takes time. Weights epesent initial tokens. Retiming can be intepeted as a peamble to a peiodic schedule, and may hae the goal of maximizing paallelism so that the dataflow gaph executes fast on a multicoe machine. EECS 44/44, UC Bekeley: 6 l 8

Applications beyond cicuits In a dataflow gaph, nodes epesent actos, which fie when input tokens ae aailable. Fiing pefoms a computation that takes time. Weights epesent initial tokens. Retiming can be intepeted as a peamble to a peiodic schedule, and may hae the goal of maximizing paallelism so that the dataflow gaph executes fast on a multicoe machine. Fie! EECS 44/44, UC Bekeley: 7 Applications beyond cicuits In a dataflow gaph, nodes epesent actos, which fie when input tokens ae aailable. Fiing pefoms a computation that takes time. Weights epesent initial tokens. Retiming can be intepeted as a peamble to a peiodic schedule, and may hae the goal of maximizing paallelism so that the dataflow gaph executes fast on a multicoe machine. EECS 44/44, UC Bekeley: 8 l 9

Applications beyond cicuits In a dataflow gaph, nodes epesent actos, which fie when input tokens ae aailable. Fiing pefoms a computation that takes time. Weights epesent initial tokens. Retiming can be intepeted as a peamble to a peiodic schedule, and may hae the goal of maximizing paallelism so that the dataflow gaph executes fast on a multicoe machine. Fie! EECS 44/44, UC Bekeley: 9 Afte etiming Afte etiming the gaph, may be able to constuct a bette faste paallel schedule that will be executed peiodically. EECS 44/44, UC Bekeley: 4 l

Refeences. Leiseson, C. E. and J. B. Saxe 98. "Optimizing synchonous systems." Jounal of VLSI and Compute Systems: pp. 4-67.. Leiseson, C. E. and J. B. Saxe 99. "Retiming synchonous cicuity." Algoithmica 6: pp. 5-5.. Shenoy, N. 997. "Retiming: Theoy and pactice." Integation, the VLSI Jounal : pp. -. EECS 44/44, UC Bekeley: 4 l