Traveling Salesperson Problem and. its Applications for the Optimum Scheduling

Similar documents
Lesson 5 Post-Visit Do Big League Salaries Equal Big Wins?

Double Play System 1.0

(56.3%) AL (60%) (62%) (69%) (+4149) 7* 9-5 (64%) +450 (400% ROI

Stats in Algebra, Oh My!

Lesson 2 Pre-Visit Big Business of the Big Leagues

Paul M. Sommers. March 2010 MIDDLEBURY COLLEGE ECONOMICS DISCUSSION PAPER NO

GEOGRAPHY LESSON 1: PRE-VISIT - SAFE AT HOME LOCATION, PLACE AND BASEBALL BASEBALL COAST TO COAST HOUSTON ASTROS IN PARTNER WITH THE NBHOF

2014 MAJOR LEAGUE LEAGUE BASEBALL ATTENDANCE NOTES

American League Ballpark

One Chrome Autographed Card Per Hobby Box!

2016 MAJOR LEAGUE BASEBALL ATTENDANCE HIGHLIGHTS

2014 Baltimore Orioles

2013 Baltimore Orioles

THE BIRD ON S.T.E.M.

2018 MAJOR LEAGUE AND MINOR LEAGUE BASEBALL ATTENDANCE HIGHLIGHTS

2012 Baltimore Orioles

2018 MAJOR LEAGUE AND MINOR LEAGUE BASEBALL ATTENDANCE HIGHLIGHTS

2015 Baltimore Orioles

Arizona Diamondbacks Turner Field Atlanta Braves Chase Field Turner Field - Advanced Ballpark Factors

2017 MAJOR LEAGUE AND MINOR LEAGUE BASEBALL ATTENDANCE HIGHLIGHTS

MAJOR LEAGUE BASEBALL 2014 ATTENDANCE ANALYSIS. Compiled and Written by David P. Kronheim.

76112F22 BALL COLOR: PMS 802 LOGO COLOR CHART LACE COLOR: PMS 380. White C-0 M-0 Y-0 K-0. Process Black C C-0 M-0 Y-0 K-100 BALL COLOR: PMS 380

Scotty s Spring Training

76112F15 BALL COLOR: PMS 802 LOGO COLOR CHART LACE COLOR: PMS 380. White C-0 M-0 Y-0 K-0. Process Black C C-0 M-0 Y-0 K-100 BALL COLOR: PMS 380

A Competitive Edge? The Impact of State Income Taxes on the Acquisition of Free Agents by Major League Baseball Franchises

Sports. Baseball. PERSONALIZE your cake by adding your own message, photo & icing colors Includes three baseball player figurines!

Descriptive Statistics Project Is there a home field advantage in major league baseball?

76112F06 BALL COLOR: PMS 802 LOGO COLOR CHART LACE COLOR: PMS 380. White C-0 M-0 Y-0 K-0. Process Black C C-0 M-0 Y-0 K-100 BALL COLOR: PMS 380

2004 Baltimore Orioles

MAJOR LEAGUE BASEBALL 2013 ATTENDANCE ANALYSIS. Compiled and Written by David P. Kronheim.

Baseball Basics for Brits

1982 Atlanta Braves. Record: st Place National League West Manager: Joe Torre

PROMOS / CONCEPT=MLB ON FS1

ISDS 4141 Sample Data Mining Work. Tool Used: SAS Enterprise Guide

1977 Boston Red Sox. Record: t-2nd Place American League East Manager: Don Zimmer

Lesson 3 Pre-Visit Teams & Players by the Numbers

1973 Boston Red Sox. Record: nd Place American League East Manager: Eddie Kasko, Eddie Popowski (9/30/73)

1991 Boston Red Sox. Record: t-2nd Place American League East Manager: Joe Morgan

BASEBALL AND THE AMERICAN CITY: An examination of public financing and stadium construction in American professional sports.

1969 Boston Red Sox. Record: rd Place American League East Manager: Dick Williams, Eddie Popowski (9/23/69)

MAJOR LEAGUE BASEBALL 2016 ATTENDANCE ANALYSIS

OYO Baseball Hall of Fame Collector s Checklist

SOUTHERN BASEBALL DIGEST

Take Me Out to the Ball Game. By: Sarah Gates

August 2010 MIDDLEBURY COLLEGE ECONOMICS DISCUSSION PAPER NO DEPARTMENT OF ECONOMICS MIDDLEBURY COLLEGE MIDDLEBURY, VERMONT 05753

New York Sports - New York Yankees (MLB)

Years with Multiple players drafted - 11 Under Jordano - Nine. Players Drafted - 41 Under Jordano - 28

Liberty s MLB Draft Picks Free Agents Clay Elliot, SS Atlanta Braves Drafted 10 th Round

MAJOR LEAGUE BASEBALL 2009 ATTENDANCE ANALYSIS. Written by David P. Kronheim

The streak ended Aug. 17, 2017, with a loss at home to Cincinnati.

2018 MLB SPRING TRAINING ECONOMIC IMPACT STUDY

Iowa's Major Leaguers

Teaching Math Through Sports

4. Fortune magazine publishes the list of the world s billionaires annually. The 1992 list (Fortune,

MAJOR LEAGUE BASEBALL 2010 ATTENDANCE ANALYSIS. Compiled and Written by David P. Kronheim

Part A: Changes in Distance between Major League Baseball Franchises and their Triple-A Affiliates

MEMORABLE DRINKWARE PRODUCT FEATURES LICENSED COLLEGIATE DESIGNS 24 OUNCE TUMBLER

BASE ROOKIES PROSPECTS

Predicting the use of the Sacrifice Bunt in Major League Baseball. Charlie Gallagher Brian Gilbert Neelay Mehta Chao Rao

GUIDE to the KARL ORTH SCORECARD COLLECTION. National Baseball Hall of Fame Library Manuscript Archives

DR. JAY S INFALLIBLE BASEBALL SYSTEM!!!

1957 Boston Red Sox. Record: rd Place American League Manager: Pinky Higgins

Unit 7: Home Practice

Sports & Outdoors. Sports & Outdoors DecoPac

JAY BUCKLEY BASEBALL TOURS 2014 SCHEDULE

1954 Boston Red Sox. Record: th Place American League Manager: Lou Boudreau

MINOR LEAGUE ATHLETIC TRAINERS OF THE YEAR

Iowa's Major Leaguers

Texas Tech Red Raiders 2011 MLB Draft Notes

A Markov Model for Baseball with Applications

Reminders. Homework scores will be up by tomorrow morning. Please me and the TAs with any grading questions by tomorrow at 5pm

SAP Predictive Analysis and the MLB Post Season

Past Major League Players. Top Gun Sports and Hartsell Athletics hosted events.

2012 Bowman Sterling Baseball Checklist

Spirit Cups. $20.00 Beverage 4-pack. 18 ounces of your favorite cold beverage goes here. Heavy-gauge plastic make these cups victorious for years

1960 Washington Senators Ticket Stub vs Baltimore Orioles Harmon Killebrew HR #78 Chuck Stobbs Win #103 - September 5, 1960 PSA 7

Each Price $25 $21 $15 $8 $20 NFL

MEDIA INFORMATION. FOR IMMEDIATE RELEASE February 29, 2016

Q1A. Did you personally attend any Major League Baseball games LAST year, or not?

Future Expectations for Over-Performing Teams

GUIDE to the Chicago Cubs and Chicago White Sox Scorecard Collection. National Baseball Hall of Fame Library Manuscript Archives

DBU BASEBALL PRO SIGNEES

Padres Press Clips Wednesday, January 11, 2017

MEMORABLE DRINKWARE PRODUCT FEATURES 24 OUNCE TUMBLER

Lesson Plan: Bats and Stats

Iowa's Major Leaguers

2018 MLB Draft / Played USSSA Round Pick Name Team Position State 1 1 Casey Mize Detroit Tigers RHP FL 1 2 Joseph Bart San Francisco Giants C GA 1 3

Jeffrey Ohlmann, Ph.D. Michael Fry, Ph.D. Matthew Gibson

NFL SCHEDULE SAMPLE. Green Bay

JAY BUCKLEY BASEBALL TOURS 2013 SCHEDULE

2018 SCHEDULE. For reservations contact: Box 213 La Crosse, Wisconsin

WEEKLY NOTES MAJOR LEAGUE BASEBALL THURSDAY, MAY 17, 2018 CLUB 2,500

Case 1:12-cv SAS Document 329 Filed 03/12/15 Page 1 of 11

Club Attendees Arizona Diamondbacks Arizona Diamondbacks Arizona Diamondbacks Arizona Diamondbacks Arizona Diamondbacks Atlanta Braves

FUNDRAISING CATALOG ALL PRODUCTS OFFICIALLY LICENSED

OVER 350 DESIGNS AVAILABLE ALL FEATURING EXCLUSIVE 3D ANIMATION

Show your support with. $20 FULL-IMAGE 3D Tumblers purchase featuring designs in everyone s favorite teams!

Case 1:12-cv SAS Document 323 Filed 02/18/15 Page 1 of 10

1941 Boston Red Sox. Record: nd Place American League Manager: Joe Cronin

Volume XV, Advanced Edition 9 n2y.com WORLD SERIES. St. Louis Cardinals won the 2011 World Series.

Transcription:

Traveling Salesperson Problem and its Applications for the Optimum Scheduling Introduction When you think of the sports, what word do you imagine first? Competition? Winner? Soccer? Those words pretty well describe the sports. For me, however, fairness is the first word. For the competition, fairness is a necessary and sufficient factor. This is why there are many rules that restrict and prohibit any unfair behaviors. However, some rules are not overwhelming enough to reach the shaded area of fairness which can greatly influence the results of the sports: scheduling. After one season is over, all sports leagues associations announce the next year s schedule. It is highly likely that teams are not fully satisfied with their schedule for many reasons including more traveling distances than other teams during the season. In this project, I will present the current problems of Major League Baseball (MLB) schedule and how to improve in various criteria: distance, the number of home/away games, and circadian advantage. The structure of the M L B There are now teams in MLB that can be divided into American League (AL) and National League (NL). Each league has 3 regional sub-leagues, East, Central, and West. Teams are classified into regional sub-leagues according to the locations of their home stadiums. Table shows the good description of how teams are distributed into each league.

American League Major League Baseball National League East Central West East Central West - Baltimore Orioles -Boston Red Sox -New York Yankees - Tampa Bay Rays -Toronto Blue Jays - Chicago White Sox - Cleveland Indians - Detroit Tigers - Kansas City Royals - Minnesota Twins - Houston Astros - LA Angels - Oakland Athletics - Seattle Mariners - Texas Rangers - Atlanta Braves - Miami Marlines - New York Mets - Philadelphia Phillies - Washington Nationals Table describes the classification of MLB teams into each league. - Chicago Cubs - Cincinnati Reds - Milwaukee Brewers - Pittsburgh Pirates - St. Louis Cardinals - Arizona Diamondbacks - Colorado Rockies - LA Dodgers - San Diego Padres - San Francisco Giants How schedules are made and their problems According to 203 Schedule, on which I will focus, all teams will have 52 rounds. Teams will play 25 rounds against each side within their sub-regional division and 22 rounds against league opponents not in their sub-regional division. The rest rounds will be done with teams in the other league. The problem, here, is that not every teams has a match with all the teams. Each team has its own opponent pools while the other team has other opponent pools; for example, St. Louis Cardinals does not have any match with Baltimore Orioles while Toronto Blue Jays has several matches with Orioles. This complicates laying out the schedule perfectly fair to all teams. Now, let s see the schedule of the year 203. As you can see figure in the next page, the total distances that teams should travel during the season are unevenly distributed. Teams are noted by the alphabetical orders from Arizona Diamondbacks () to Washington Nationals (). The longest distance is 83448 km and the team is Seattle Mariners while St. Louis Cardinals needs to travel only 34068 km during the season, which means that the Mariners should travel twice more than the Cardinals. The distribution of each team total traveling distance is well expressed by variance and standard deviation: 8.7636 0 and 4.328 0 respectively.

00000 80000 60000 40000 20000 0 2 3 4 5 6 7 8 9 0 2 3 4 5 6 7 8 9 20 2 22 23 24 25 26 27 28 29 Traveling Salesman Problem and Formulation Traveling Salesman Problem (TSP) focuses on finding the itinerary with the minimized total distance traveled under the constraint of given locations. Optimizing the schedule is very similar to the TSP in the big picture; however, it necessitates some small changes in detail. First of all, while the TSP does not consider the existences of other salesmen, creating one s schedule needs to take the itineraries of other teams into consideration. Furthermore, the main goal of the TSP is to minimize the total distance, but the main goal of this project is to decrease the unfairness in several criteria: total distance, the number of home/away games, and circadian advantage; therefore, I will aim at reducing the variance. As I am not trying to diminish the total amount of the three categories, the optimized schedule might produce the increased total amount but with smaller variances which represent the fairness. As I am attempting to minimize the variance, I will call it the energy E of the system. In terms of mathematical equation, our E will be expressed as 2 E ( d i m),( d i is the total distance traveled by team i). Also, I will update the local state according to the

Metropolis Hastings algorithm. By combining the simulated annealing technique and the Metropolis Hastings algorithm, local trials will be accepted with the probability min ( e ( E ( ) E ( ')),),( E( ) is the previous energy and E( ' ) is the present energy). The remaining issue for the formulation is the rules of local updates. In common TSPs, the rule of transition to an adjacent permutation is picking two different locations and reversing the interval between the two. In my project, I will take two ways of random local updates for this Monte Carlo simulation. First update is SwapRounds. It involves picking two random rounds among 52 rounds and exchanging the two chosen rounds. For example, after choosing round 2 and round 6, I will swap the two rounds entirely. The next update is SwapHomes. The process of SwapHomes is also simple. I will randomly choose two matches of which home team and away team are converted. For example, I will randomly pick a match in which team A is a home team and team B is an away team. Then, I will pick another match in which team A is an away team and team B is a home team and swap the two matches entirely. Therefore, SwapRounds changes the whole MLB teams schedule and SwapHomes modifies two team s schedule. Fair distance optimization As you can see in the left picture, MLB teams are gathered around northeast and southwest parts of America. The uneven distribution naturally incurs the difficulties of scheduling. To solve this natural problem, I manipulated the 203 schedule into matrices that can be easily interpreted into Matlab program. The execution of Monte Carlo Method followed the formulation that I described

above and I made 00 local updates to reduce the energy function of the system, the variance of the total traveling distances. Left figure depicts how the energy function (variance of total distances) has changed for 00 trials and right figure shows how the sum of all the team travel distance has changed for the same number of steps. As you can see in the left graph, the simulation succeeded in decreasing the variances of MLB teams total distances. It first started from 8.76 0 and reached 6 8.4 0 by diminishing to 2 of the original variance. However, there were also some side effects. One side effect was that some teams have seven consecutive home games while other teams have six consecutive away games. In the original MLB schedule, the maximum number of consecutive home/away game was three. Another side effect was the increased total distance traveled. The right graph above traces how the sum of teams total distances has changed. Its initial value was 574356 km and increased to 767457 km. I would like to explain this outcome as the pie has grown bigger but is divided more evenly. Table 2 in the next page shows the distribution of total distance of MLB teams. Compared to Table based on the original schedule announced by MLB association, all bins in the Table 2 have similar heights, showing the smaller variances. Furthermore, as a measurement to the increased total distance, I used dual energy

2 functions for the variance E ( d i m) and for the total distances E 2 d i. 70000 60000 50000 40000 000 20000 0000 0 2 3 4 5 6 7 8 9 0 2 3 4 5 6 7 8 9 20 2 22 23 24 25 26 27 28 29 Table 2 s bins are higher than those of Table but they are more analogous to each other. Because additional energy function is added to the simulation, it took a lot more time than the previous simulation. This is why the number of local updates is limited to. As you can see in the two left graphs (above is variance and the below is the total distance), the simulation achieved to produce distance optimized schedule. However, posing the dual energy functions could not solve the problem of successive home/away games, which is not the goal of distance-optimization. The problem of home/away games will be treated in the next part of the project.

Home/away games optimization In the fair distance optimization, the biggest problem was that some teams have the relatively large number of consecutive home games while other teams move around the continent. In this part, the simulation will focus on the fair distribution of consecutive home/away games. Having a match in the home stadium is a great advantage to the home team. This is why teams with a higher winning rate have more home games during the post season playoffs. Furthermore, having several matches in the home stadium in a row is a bigger problem. For the optimization, energy function will again be the variance, especially the variance of the number of consecutive home games. The structure of the simulation is exactly same with that of previous simulation. Its local update is also done through SwapHomes and SwapRounds. On interesting find in the left graph is that, contrary to the previous simulation, the energy gets higher than the initial state. From this, I could conclude that the MLB association focused on fair distribution of the number of home and away games when they first created the schedule. The second graph represents the total number of consecutive home games during the whole season. As the local state updates, the total number is decreasing, which means that teams have the least number of consecutive home games. This consequently

implies that the total distance traveled is increased because teams are not likely to stay in the home stadium in a row. With two simulations, I could expect that the total distance traveled by all MLB teams is inversely proportional to the total number of consecutive home games. Circadian advantage optimization According to the research, Measuring circadian advantage in Major League Baseball: a 0-year retrospective study, travels across different time zones would affect the athletic performances and influence the result of the game in that teams with circadian advantage would be more likely to win. In addition to this research paper, there are so many academic publications supporting the effects of the circadian advantage that MLB even invested its funds to investigate the real influences. In the third simulation, I will replace the distances between each stadium in the previous simulations with time differences among them. The big picture would be same but the energy function should be changed to E ( c m) that team i experiences). 2 i,( c i is the total time differences 00 80 60 40 20 0 2 3 4 5 6 7 8 9 0 2 3 4 5 6 7 8 9 20 2 22 23 24 25 26 27 28 29 The above figure describes the distribution of all team circadian advantages. The higher the bins, the more circadian disadvantage the team has. Teams in the central and east

part of the U.S. are likely to have lower circadian disadvantages while teams in the west seemed to have relatively higher circadian disadvantages. The left above graph depicts the sum of total circadian disadvantages and the bottom graph illustrates the variance of circadian disadvantages of all MLB teams for 00 local MC steps. Unlike the first and second simulations, the third simulation was found to decrease the total circadian disadvantage as byproduct of annealing. I could ideally optimize the schedule in terms of the circadian advantage. In this simulation, the size of the pie got smaller and, at the same time, it was divided evenly. Summary and suggestions I approached to the 203 MLB regular season schedule in three ways: distanceoptimization, home/away game-optimization, and circadian advantage-optimization. Each optimization method has its own characteristics and results. How you define the fairness will determine which optimization would best work case by case. Surely, there would be other ways of optimizations such as airfare, which is proportional to the distance but negligible to the distance that can be covered by bus. I also admit that there could be some implicit constraints that the simulations could not reflect but necessary and important for making the

schedule. REFERENCES - Beom Jun Kim, Traveling Baseball Players Problem in Korea, Journal of the Korean Physical Society, Vol 6, No. 3 August 202 - Winter WC, Hammond WR, Green NH, Zhang Z, Bliwise DL, Measuring circadian advantage in Major League Baseball: a 0-year retrospective study, Int J Sports Physical Perform. 2009 Sep; 4(3):394-40 - Ronald W. Shonkwiler and Franklin Mendivil, Explorations in Monte Carlo Methods, Springer (2009) - R. Feres, Math 350 Fall 202 Homework