Basketball data science

University of Brescia, Italy
Vienna, April 13, 2018

paola.zuccolotto@unibs.it marica.manisera@unibs.it

BDSports, a network of people interested in Sports Analytics http://bodai.unibs.it/bdsports/

Agenda:
  Basketball analytics: state of the art
  Case studies:
    CS1: new positions in basketball
    CS2: scoring probability when shooting under high-pressure conditions
    CS3: performance variability and teamwork assessment
    CS4: sensor data analysis
  Concluding remarks

Basketball Analytics: Official Statistics, Sport Analytics Services, Scientific Research

Our analyses often integrate machine learning tools and experts' suggestions

Scientific Literature: Scientific Journals, Special Issues

Data / Big Data:
  Stats (CS1): www.espn.com/nba, stats.nba.com, www.fiba.com, Leagues...
  play-by-play (CS2, CS3)
  Sensor Data (CS4)

CS1: new positions in basketball
EJASA - Special issue on Statistics in Sport
Motivation: The existing positions, often defined a long time ago, tend to reflect traditional points of view about the game, and sometimes they are no longer well suited to the new concepts that have arisen as the way of playing has evolved.
Aim: To describe new roles of players during the game by analysing players' performance statistics with data mining and machine learning tools.

«Key-players» training set
7-dimensional SOM
Clustering of the SOM output layer into a suitable number of groups by means of a fuzzy clustering algorithm
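The two steps named on this slide (a self-organizing map, then fuzzy clustering of its output layer) can be sketched as follows. This is a minimal illustration and not the authors' implementation: the grid size, learning-rate schedule, fuzzy c-means as the fuzzy algorithm, and the reading of "7-dimensional" as seven input statistics per player are all assumptions, and the function names are hypothetical.

```python
import math
import random


def train_som(data, rows, cols, epochs=50, lr0=0.5, seed=0):
    """Train a small rectangular SOM; returns one weight vector per grid node."""
    rng = random.Random(seed)
    grid = [(r, c) for r in range(rows) for c in range(cols)]
    weights = [list(rng.choice(data)) for _ in grid]  # init from data points
    sigma0 = max(rows, cols) / 2.0                    # assumed neighbourhood width
    total, step = epochs * len(data), 0
    for _ in range(epochs):
        order = data[:]
        rng.shuffle(order)
        for x in order:
            frac = step / total
            lr = lr0 * (1.0 - frac)                   # linearly decaying rate
            sigma = sigma0 * (1.0 - frac) + 1e-3      # shrinking neighbourhood
            # best matching unit: node whose weights are closest to x
            bmu = min(range(len(grid)), key=lambda k: math.dist(weights[k], x))
            for k, (r, c) in enumerate(grid):
                d2 = (r - grid[bmu][0]) ** 2 + (c - grid[bmu][1]) ** 2
                h = math.exp(-d2 / (2.0 * sigma * sigma))
                weights[k] = [w + lr * h * (v - w) for w, v in zip(weights[k], x)]
            step += 1
    return weights


def memberships(points, centers, m):
    """Fuzzy c-means membership degrees; each row sums to 1."""
    rows = []
    for p in points:
        d = [max(math.dist(p, c), 1e-12) for c in centers]
        rows.append([1.0 / sum((di / dk) ** (2.0 / (m - 1.0)) for dk in d)
                     for di in d])
    return rows


def fuzzy_cmeans(points, n_clusters, m=2.0, iters=100, seed=0):
    """Fuzzy c-means on the SOM weight vectors (fuzzifier m is an assumption)."""
    rng = random.Random(seed)
    dim = len(points[0])
    centers = [list(p) for p in rng.sample(points, n_clusters)]
    for _ in range(iters):
        u = memberships(points, centers, m)
        for i in range(n_clusters):
            w = [u[j][i] ** m for j in range(len(points))]
            tot = sum(w)
            centers[i] = [sum(wj * p[t] for wj, p in zip(w, points)) / tot
                          for t in range(dim)]
    return centers, memberships(points, centers, m)
```

Each player would then be assigned to the SOM node closest to their statistics and inherit that node's membership degrees, so a player can belong partly to several of the new positions.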

CS2: scoring probability when shooting under high-pressure conditions
International Journal of Sports Science & Coaching
Data: play-by-play
Motivation: Basketball players often have to face high-pressure game conditions. Being aware of the overall and personal reactions to these situations is of primary importance to coaches.
Aim: To develop a model describing the impact of some high-pressure game situations on the probability of scoring and to assess players' personal reactions.

High-Pressure Game Situations:
  when the shot clock is going to expire (SHOT.CLOCK)
  when the score difference with respect to the opponent is small (SC.DIFF)
  when the team, for some reason, has globally performed badly during the match, up to the considered moment (MISS.T)
  when the player missed the previous shot (MISS.PL)
  the time to the end of quarter (TIME)
  type of action (POSS.TYPE, 24 or 14 extratime)

69688 6470

Data Mining Tools:
  univariate nonparametric regressions via kernel smoothing on the dependent variable MADE (taking the value 1 if the attempted shot scored a basket and 0 otherwise)
  1000 bootstrap samples of size nboot = 5000 and nboot = 1000 for the datasets A2ITA and RIO16, respectively
  few univariate relationships detected: just SHOT.CLOCK and MISS.PL
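A kernel-smoothed regression of MADE on a single covariate, with a bootstrap band, might look like the sketch below. It assumes a Nadaraya-Watson estimator with a Gaussian kernel and a percentile band; the slide does not say which smoother, kernel, or bandwidth was used, and the function names are hypothetical.

```python
import math
import random


def nw_smooth(x, y, grid, bandwidth):
    """Nadaraya-Watson estimate of P(MADE = 1 | x) at each grid point."""
    estimates = []
    for g in grid:
        w = [math.exp(-0.5 * ((xi - g) / bandwidth) ** 2) for xi in x]
        estimates.append(sum(wi * yi for wi, yi in zip(w, y)) / sum(w))
    return estimates


def bootstrap_band(x, y, grid, bandwidth, n_boot=1000, n_size=None, seed=0):
    """Pointwise 95% percentile band from n_boot resamples of size n_size."""
    rng = random.Random(seed)
    n_size = n_size or len(x)
    curves = []
    for _ in range(n_boot):
        idx = [rng.randrange(len(x)) for _ in range(n_size)]
        curves.append(nw_smooth([x[i] for i in idx], [y[i] for i in idx],
                                grid, bandwidth))
    lower, upper = [], []
    for k in range(len(grid)):
        vals = sorted(curve[k] for curve in curves)
        lower.append(vals[int(0.025 * (n_boot - 1))])
        upper.append(vals[int(0.975 * (n_boot - 1))])
    return lower, upper
```

Here `x` would hold, for example, the seconds left on the shot clock and `y` the 0/1 MADE outcomes. A covariate whose band stays flat across the grid shows no detectable univariate relationship with the scoring probability.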

Data Mining Tools:
  CART (Classification And Regression Trees), an algorithm able to deal with complex multivariate relationships, also detecting interactions among predictors
  numerical covariates are transformed into categorical ones in order to improve interpretability
  combination of the results of a machine learning procedure and experts' suggestions
  pruning
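To make the CART step concrete, here is a toy classification tree on categorical covariates, grown by Gini impurity with a depth limit standing in for pruning. It is a sketch only: the category labels, the depth and size limits, and the toy data are invented for illustration and are not taken from the study.

```python
def gini(labels):
    """Gini impurity of a list of 0/1 outcomes."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)


def best_split(rows, labels):
    """Best 'feature == category' split as (weighted impurity, feature, category)."""
    best = None
    for feature in rows[0]:
        for cat in {r[feature] for r in rows}:
            yes = [y for r, y in zip(rows, labels) if r[feature] == cat]
            no = [y for r, y in zip(rows, labels) if r[feature] != cat]
            if not yes or not no:
                continue
            score = (len(yes) * gini(yes) + len(no) * gini(no)) / len(rows)
            if best is None or score < best[0]:
                best = (score, feature, cat)
    return best


def grow(rows, labels, depth=0, max_depth=2, min_split=20):
    """Grow the tree; each leaf stores the observed scoring frequency."""
    leaf = {"prob": sum(labels) / len(labels), "n": len(rows)}
    if depth >= max_depth or len(rows) < min_split:
        return leaf
    split = best_split(rows, labels)
    if split is None or split[0] >= gini(labels):
        return leaf  # no split reduces impurity
    _, feature, cat = split
    yes = [i for i, r in enumerate(rows) if r[feature] == cat]
    no = [i for i, r in enumerate(rows) if r[feature] != cat]
    return {
        "feature": feature,
        "category": cat,
        "yes": grow([rows[i] for i in yes], [labels[i] for i in yes],
                    depth + 1, max_depth, min_split),
        "no": grow([rows[i] for i in no], [labels[i] for i in no],
                   depth + 1, max_depth, min_split),
    }


def predict(tree, row):
    """Scoring probability of the leaf that the shot falls into."""
    while "prob" not in tree:
        branch = "yes" if row[tree["feature"]] == tree["category"] else "no"
        tree = tree[branch]
    return tree["prob"]


# Toy data (invented): 60% of early-clock shots made, 30% of late-clock shots.
rows, labels = [], []
for i in range(100):
    miss = "yes" if i % 2 else "no"
    rows.append({"SHOT.CLOCK": "early", "MISS.PL": miss})
    labels.append(1 if i % 10 < 6 else 0)
    rows.append({"SHOT.CLOCK": "late", "MISS.PL": miss})
    labels.append(1 if i % 10 < 3 else 0)
tree = grow(rows, labels)
```

Because the covariates are categorical, each leaf reads directly as a game situation, which is the interpretability benefit the slide mentions. An interaction shows up when a covariate is used for splitting in one branch but not in another.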

focus on: (1) the last 1-2 seconds of possession, very close to the shot clock buzzer sounding, (2) games where the score difference is low, for example, between -4 and 4, (3) the last 1-2 minutes of each quarter (especially the final quarter)
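The three situations listed above translate into simple flags on play-by-play records. The sketch below assumes cut-offs at the upper end of the stated ranges (2 seconds, a margin of 4 points, 2 minutes); the argument names are hypothetical and the thresholds are parameters, not values fixed by the study.

```python
def pressure_flags(shot_clock_left, score_diff, quarter_time_left,
                   clock_cut=2.0, diff_cut=4, time_cut=120.0):
    """Flag the high-pressure situations for one shot.

    shot_clock_left and quarter_time_left are in seconds; score_diff is the
    shooting team's score minus the opponent's score. All cut-offs are
    illustrative assumptions.
    """
    return {
        "SHOT.CLOCK": shot_clock_left <= clock_cut,   # close to the buzzer
        "SC.DIFF": abs(score_diff) <= diff_cut,       # tight game
        "TIME": quarter_time_left <= time_cut,        # end of the quarter
    }
```

For example, `pressure_flags(1.5, -3, 45.0)` flags all three situations, while a shot taken early in the possession of a lopsided game flags none.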

Very similar results with Rio 2016 data

2-point shots

3-point shots

free throws

New Shooting Performance Measure:
  Takes into account that shots attempted at different moments have different scoring probabilities
  Performance of Player i for shot type T (2P, 3P, FT), defined in terms of:
    whether the j-th shot was made (1) or missed (0)
    the scoring probability of the j-th shot according to CART
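The formula itself did not survive on this slide, only the description of its terms. One form consistent with that description is the average, over a player's shots of a given type, of the outcome minus the CART scoring probability. The sketch below uses that form as an assumption, not as the authors' confirmed definition.

```python
def shooting_performance(made, cart_prob):
    """Average of (outcome - expected scoring probability) over a player's shots.

    made: list of 0/1 outcomes for shots of one type (2P, 3P or FT).
    cart_prob: scoring probability assigned to each shot by the tree.
    Positive values mean the player scored more often than the game
    situations of the shots would suggest (assumed form of the index).
    """
    if len(made) != len(cart_prob) or not made:
        raise ValueError("need two non-empty lists of equal length")
    return sum(x - p for x, p in zip(made, cart_prob)) / len(made)
```

Under this form, a player who makes a shot with a low scoring probability gains more than one who makes an easy shot, so the index does not penalise players who take the difficult attempts.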

Further Research:
  According to psychological studies, some athletes view competitive situations as challenging, while others perceive the same situations as stressful and anxiety-provoking. For this reason, it may be difficult to statistically detect stressful situations from large datasets including several players: the overall average performance may remain unchanged when some players improve their performance and others get worse.
  Analysis of single players' reactions to stressful game situations (propensity to shoot and variation in scoring probability)
  Integration with psychological studies

CS3: performance variability and teamwork (in progress)
Data: play-by-play
Motivation: Psychological studies have pointed out that typical performance is only one attribute of performance and that other aspects should be taken into account, in particular performance variability.
Aim: Assessment of players' shooting performance variability and investigation of its relationships with the team composition.

Performance Variability: Definition of a performance index based on the % of attempted shots that scored a basket and on the shooting intensity
(Figure labels: shooting intensity, shooting performance)

Performance Variability: Fit Markov Switching models to the shooting performance index, in order to detect the (significant) presence of periods of good and bad performance
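The regime-detection idea can be illustrated with the filtering step of a two-regime Markov switching model with Gaussian observations. This is a simplified sketch: the regime means, standard deviations, and transition matrix are taken as given, whereas fitting the model means estimating them from the data (typically by maximum likelihood). The function names are hypothetical.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution."""
    return math.exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))


def hamilton_filter(series, means, sds, trans, init=(0.5, 0.5)):
    """Filtered probabilities P(regime at t | observations up to t).

    means, sds: one value per regime (index 0 = bad, 1 = good; an assumption).
    trans[i][k]: probability of moving from regime i to regime k.
    """
    n_regimes = len(means)
    prev = list(init)
    filtered = []
    for y in series:
        # prediction step: propagate yesterday's probabilities through trans
        pred = [sum(prev[i] * trans[i][k] for i in range(n_regimes))
                for k in range(n_regimes)]
        # update step: weight by how well each regime explains the observation
        joint = [pred[k] * normal_pdf(y, means[k], sds[k])
                 for k in range(n_regimes)]
        total = sum(joint)
        prev = [j / total for j in joint]
        filtered.append(prev)
    return filtered
```

Applied to a player's shooting performance index over time, the second column of the output tracks the probability of being in the good-performance regime at each point in the game.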

Teamwork Assessment:
  determine the influence of each teammate on the regimes of good and bad performance
  display the significant relationships by means of graphical network analysis tools
  predict the best substitution at a given time

CS4: sensor data analysis
MathSport International 2017, SIS2017
Aim: A first approach to sensor data analysis in basketball (visualization tools, cluster analysis, future challenges)
In collaboration with MYagonism

CS4: sensor data analysis
SIS 2018, COMPSTAT 2018
Aim: to study
  structural changes in the surface area
  associations between regimes and game variables
  the relation between the regime probabilities and the scored points

Visualization Tools A tool to display data recorded by tracking systems producing spatio-temporal traces of player trajectories with high definition and frequency https://www.youtube.com/watch?v=aejyrdnqyvy

Visualization Tools
James P. Curley, Curley Social Neurobiology Lab website (Psychology Department and Center for Integrative Animal Behavior, Columbia University, New York City)

Convex Hulls Analysis
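A convex hull analysis of tracking data starts from the area enclosed by the players of one team at a given instant. The sketch below computes it with Andrew's monotone chain algorithm and the shoelace formula. Court coordinates in metres and the function names are assumptions for illustration.

```python
def cross(o, a, b):
    """Z-component of the cross product of vectors OA and OB."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points):
    """Hull vertices in counter-clockwise order (Andrew's monotone chain)."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower = []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    upper = []
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    # the last point of each chain is the first point of the other
    return lower[:-1] + upper[:-1]


def polygon_area(vertices):
    """Area of a simple polygon via the shoelace formula."""
    total = 0.0
    for i in range(len(vertices)):
        x1, y1 = vertices[i]
        x2, y2 = vertices[(i + 1) % len(vertices)]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def team_surface_area(positions):
    """Area of the convex hull of the players' (x, y) positions."""
    return polygon_area(convex_hull(positions))
```

For five players at `(0, 0)`, `(4, 0)`, `(4, 3)`, `(0, 3)` and `(2, 1)`, the hull is the 4 by 3 rectangle, so `team_surface_area` returns 12.0. Evaluating it at every time stamp gives a surface-area series in which structural changes can be studied.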

Cluster Analysis

Cluster Analysis + MultiDimensional Scaling


Future Challenges:
  Integration with play-by-play data
  Integration with video and match analysis
  Integration with body metrics (body physiology tracking via smart clothing and/or body measurements)
  Integration with qualitative assessments
  Network analysis tools
  Spatio-temporal statistical models
  Addition of the other team's data
  Addition of the ball's position

Concluding remarks
True:
  If people keep thinking that Statistics is merely PPG, AST, REB, ...
  If people don't learn how Stats have to be interpreted ("Do not put your faith in what statistics say until you have carefully considered what they do not say." W. W. Watt)
False:
  If modern approaches to basketball analytics are used
  If we are able to integrate analytics and technical experience
  If we are able to spread the culture of Statistics

References
Download a (regularly updated) list of references at http://bodai.unibs.it/bdsports/basketball.htm
Thank You