Journal of Sports Analytics - Volume 7, issue 3

Cricket mix optimization using heuristic framework after ensuring Markovian equilibrium

Authors: Ray, Subhasis | Roychowdhury, Soma

Article Type: Research Article

Abstract: The International Cricket Council, in consultation with its member boards, prepares the Future Tours and Programme (FTP), an eight-year itinerary covering world championships in three formats of cricket, bilateral series and other tournaments. However, the FTP (2015–2023) was criticized for its asymmetric itinerary and for the point system of the World Test Championship, and the FTP (2023–2031) is being criticized for including eight championships in limited formats and for the increased workload on players. Cricket mix standardization, like marketing and product mix standardization, can work in homogeneous markets. This study derives three homogeneous markets of four teams each using hierarchical cluster analysis. For each market, it finds the Markovian equilibrium by analyzing cricket mix transitions over past years. While the equilibrium can be used to derive the number of games per format per country, the study proposes a heuristic approach for fine-tuning these numbers that takes into account the aspirations of major stakeholders (e.g. administrators, players and spectators). Despite scores of criticisms and articles on the issue, there is hardly any scholarly contribution on game scheduling in the extant literature. This study is thus a pioneering effort to help policy makers balance cricket formats within each homogeneous market.

Keywords: Data visualization, hierarchical cluster analysis, markov chain, steady state equilibrium, value based heuristics

DOI: 10.3233/JSA-200479

Citation: Journal of Sports Analytics, vol. 7, no. 3, pp. 155-168, 2021
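
The steady-state equilibrium mentioned in the abstract above can be illustrated with a minimal sketch. It assumes a small row-stochastic transition matrix over three formats (Test, ODI, T20I); the matrix values, the function name `steady_state` and the use of power iteration are illustrative assumptions, not the authors' data or method.

```python
def steady_state(P, tol=1e-12, max_iter=10_000):
    """Approximate the stationary distribution pi with pi = pi * P.

    P is a row-stochastic matrix given as a list of lists. Power iteration
    is used here for simplicity; it is an assumed technique, not taken
    from the paper.
    """
    n = len(P)
    pi = [1.0 / n] * n  # start from the uniform distribution
    for _ in range(max_iter):
        nxt = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(nxt, pi)) < tol:
            return nxt
        pi = nxt
    return pi


# Hypothetical transition probabilities between formats (Test, ODI, T20I).
# Each row sums to 1; the numbers are made up for illustration only.
P = [
    [0.6, 0.3, 0.1],
    [0.2, 0.5, 0.3],
    [0.1, 0.3, 0.6],
]

pi = steady_state(P)
print([round(p, 4) for p in pi])
```

Under these assumptions, the entries of `pi` could be read as long-run shares of each format, which might then be scaled to a number of games per format.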

Predicting the winning percentage of limited-overs cricket using the Pythagorean formula

Authors: Senevirathne, Hasika K.W. | Manage, Ananda B. W.

Article Type: Research Article

Abstract: The Pythagorean Win-Loss formula can be effectively used to estimate winning percentages for sporting events. This formula was initially developed by baseball statistician Bill James and was later extended by other researchers to sports such as football, basketball, and ice hockey. Although one can calculate actual winning percentages based on the outcomes of played games, that approach does not take into account the margin of victory. The key benefit of the Pythagorean formula is its use of actual average runs scored and actual average runs allowed. This article presents the application of the Pythagorean Win-Loss formula to two limited-overs cricket formats, namely One Day International cricket (ODI) and Twenty20 cricket. The data were taken from matches played by the top 10 International Cricket Council (ICC) members who participated in the 2019 ICC Cricket World Cup. For matches won by the team batting second, runs scored were estimated by considering the remaining amount of resources, based on the Duckworth–Lewis method.

Keywords: Pythagorean formula, winning percentage, runs allowed, runs scored, maximum likelihood, least squares

DOI: 10.3233/JSA-200480

Citation: Journal of Sports Analytics, vol. 7, no. 3, pp. 169-183, 2021
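
As a rough illustration of the Pythagorean Win-Loss formula described above, the sketch below computes an expected winning percentage from runs scored and runs allowed. The function name `pythagorean_win_pct` and the default exponent of 2 (the value in Bill James's original baseball formula) are assumptions for this example; the exponent fitted for cricket in the article is not given in the abstract.

```python
def pythagorean_win_pct(runs_scored, runs_allowed, gamma=2.0):
    """Expected winning fraction: RS^gamma / (RS^gamma + RA^gamma).

    gamma=2.0 is only a placeholder (the classic baseball exponent);
    it is not the value estimated in the article.
    """
    if runs_scored <= 0 or runs_allowed <= 0:
        raise ValueError("runs_scored and runs_allowed must be positive")
    rs = runs_scored ** gamma
    ra = runs_allowed ** gamma
    return rs / (rs + ra)


# Hypothetical averages: a team scoring 300 and conceding 150 per match.
example = pythagorean_win_pct(300, 150)
print(round(example, 3))
```

A team that scores exactly as many runs as it allows gets 0.5 regardless of the exponent, which is a quick sanity check on any fitted value of `gamma`.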

Dynamic cricket match outcome prediction

Article Type: Research Article

Abstract: We propose a model in which the match outcome is predicted ball by ball from the start of the second innings. Our methodology not only incorporates the game context, which is updated dynamically as the game progresses, but also includes the relative strength of the two teams playing the match. We used 692 matches from all seasons (2008–2018) to train our model, and all 59 matches from the current season (2019) to test its performance. Here we have engineered 11 players and 10 bowlers, and all their metrics are tracked as a function of each ball of each over throughout the second innings, with the dynamically changing target score also included as one of the attributes. Initially, we tried Logistic Regression, Naive Bayes, K-Nearest Neighbour (KNN), Support Vector Machine, Decision Tree, Random Forest, Boosting, Bagging, and Gradient Boosting, with an accuracy of 76.47% (±3.77%). With deep learning, we tried various flavours of LSTM and GRU, such as vanilla, bidirectional and stacked, to train our models, and obtained an accuracy of 76.13% (±2.59%). All of these flavours were tested using various approaches such as one-to-one, one-to-many, many-to-one, and many-to-many sequencing, which are discussed in this paper. An accurate prediction of how many runs a batsman is likely to score and how many wickets a bowler is likely to take in a match will help team management select the best players for each match.

Keywords: Long short-term memory, gated recurrent unit, machine learning, deep learning, IPL, prediction

DOI: 10.3233/JSA-200510

Citation: Journal of Sports Analytics, vol. 7, no. 3, pp. 185-196, 2021

Modeling T20I cricket bowling effectiveness: A quantile regression approach with a Bayesian extension

Authors: Bowala, Sulalitha M.B. | Manage, Ananda B.W. | Scariano, Stephen M.

Article Type: Research Article

Abstract: Bowling effectiveness is a key factor in winning cricket matches. The team captain should decide when to use the right bowler at the right moment so that the team can optimize the outcome of the game. In this study, we investigate the effectiveness of different types of bowlers at different stages of the game, based on the percentage of the innings total conceded in each over. Bowlers are generally categorized into three types: fast bowlers, medium-fast bowlers, and spinners. In this article, the authors divide the twenty-over spell of a T20I match into four stages, namely Stage 1: overs 1–6 (PowerPlay), Stage 2: overs 7–10, Stage 3: overs 11–15, and Stage 4: overs 16–20. To understand the broad spectrum of the behavior of game variables, a Quantile Regression methodology is used for statistical analysis. Following that, a Bayesian approach to Quantile Regression is undertaken, and it confirms the initial results.

Keywords: Batsman, Bayesian, bowling, cricket, sports, T20I, quantile regression

DOI: 10.3233/JSA-200556

Citation: Journal of Sports Analytics, vol. 7, no. 3, pp. 197-221, 2021
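
The four-stage split and the per-over response variable described in the abstract above can be sketched as follows. The function names `stage_of_over` and `conceded_percentages`, and the sample innings, are hypothetical; only the stage boundaries and the idea of "percentage of the innings total conceded per over" come from the abstract. The quantile regression itself is not shown.

```python
def stage_of_over(over):
    """Map an over number (1-20) to the stage defined in the abstract."""
    if not 1 <= over <= 20:
        raise ValueError("over must be between 1 and 20")
    if over <= 6:
        return 1  # overs 1-6 (PowerPlay)
    if over <= 10:
        return 2  # overs 7-10
    if over <= 15:
        return 3  # overs 11-15
    return 4      # overs 16-20


def conceded_percentages(runs_per_over):
    """Runs conceded in each over as a percentage of the innings total."""
    total = sum(runs_per_over)
    if total == 0:
        return [0.0] * len(runs_per_over)
    return [100.0 * r / total for r in runs_per_over]


# Made-up innings of 20 overs, for illustration only.
runs = [8, 6, 10, 7, 9, 5, 6, 4, 7, 5, 8, 6, 9, 7, 10, 12, 11, 14, 9, 13]
pct = conceded_percentages(runs)
rows = [(over, stage_of_over(over), round(p, 2))
        for over, p in enumerate(pct, start=1)]
print(rows[:3])
```

Rows of this shape (over, stage, percentage conceded), joined with the bowler type, are one plausible input layout for a quantile regression of the kind the abstract describes.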
