Behind the Numbers: Where analytics and scouts get the draft wrong

July 7, 2022 | Garret Hohl | Behind the Numbers, Player Evaluation, Prospects and Draft, Theory | Tags: Draft, NHL, Prospects

Every once in a while I will rant on the concepts and ideas behind what the numbers suggest in a series called Behind the Numbers, as a tip of the hat to the website that brought me into hockey analytics: Behind the Net. My ramblings will look at the theory and philosophy behind analytics and their applications given what is already publicly known.

Hello everyone; I am back! I was in the process of writing an article on NHL prospect development for after the draft (teaser!) when a Twitter thread sparked my interest and made me want to do a bit of a ranty, very pseudo-Editorial or Literature Review on analytics and the draft while combing over that thread.

As a scout who is a huge proponent of the use of analytics in pro and amateur player evaluation, i find it very interesting that many of the same people who dive into NHL advanced stats and create/support models are huge supporters of NHLe in prospect evaluation 🧵(1/10)
— Sebastian High (@high_sebastian) June 9, 2022

Continue reading →

Revisiting NWHL Game Score

August 21, 2019 | Mike Murphy | NWHL, Player Evaluation

In March 2018, Shawn Ferris of Hockey Graphs introduced his NWHL Game Score, which was based on Dom Luszczyszyn’s NHL Game Score. It was groundbreaking work in women’s hockey analytics, which is still very much in its infancy — especially at the professional level.

Game score is a valuable tool that can give us a better understanding of a player’s performance than points for skaters or save percentage and goals against average for goaltenders. It provides us with a single value that incorporates relevant points of data which we can use to compare the performances of two or more players in a single game or over the course of many games, including seasons and careers.

As Shawn noted in his work, game score is particularly valuable for analyzing performance in the NWHL because of the brevity of the regular season. Through the league’s first four seasons, the average length of a season was under 18 games. The 2019-20 season promises a schedule of 24 games, which is still less than a third of the length of the NHL season. That brief schedule creates an opportunity for shooting percentage swings to influence both a player’s production and our perception of their performance.
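To put a rough number on how much a short schedule lets shooting percentage wander, here is a minimal sketch that treats each shot as an independent coin flip (a simple binomial model). The 10% true shooting percentage and the three shots per game are made-up inputs for illustration, not NWHL figures.

```python
import math


def shooting_pct_std_error(true_pct: float, shots: int) -> float:
    """Standard error of an observed shooting percentage under a binomial model."""
    return math.sqrt(true_pct * (1.0 - true_pct) / shots)


# Hypothetical inputs (assumptions, not league data).
TRUE_PCT = 0.10
SHOTS_PER_GAME = 3

# Compare an 18-game season, a 24-game season, and an 82-game NHL season.
for games in (18, 24, 82):
    shots = games * SHOTS_PER_GAME
    se = shooting_pct_std_error(TRUE_PCT, shots)
    print(f"{games:>2} games, {shots:>3} shots: one standard error = {se:.1%}")
```

Under these assumptions, one standard error is about 3.5 percentage points over 24 games and about 1.9 over 82, so the same shooter can look far hotter or colder across a short season.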

Continue reading →

Expected Goals Model with Pre-Shot Movement, Part 3: 2018-2019 Data

August 14, 2019 (updated August 16, 2019) | Alex Novet | Data Analysis, Neutral Zone Analysis, NHL League-Wide Analysis, Passing Data, Player Evaluation, Playing Styles, Resources, Shot Quality, Transition Project

Yesterday we looked at the team and skater results from the 2016 – 2018 data that was used to train the xG model. That’s a pretty robust dataset, but it’s unfortunately a bit out of date. People care about this season, and past years are old news. So let’s take a look at the data that Corey Sznajder has tracked for 2018 – 2019 so far.

Continue reading →

Expected Goals Model with Pre-Shot Movement, Part 2: Historic Team and Player Results

August 13, 2019 (updated August 16, 2019) | Alex Novet | Data Analysis, NHL League-Wide Analysis, Passing Data, Player Evaluation, Transition Project

Intro

In the last post, we introduced a new expected goals (xG) model. It incorporates pre-shot movement, which made it more accurate than existing public xG models when predicting which shots would be goals. However, we use xG models for far more than looking at individual shots. By aggregating expected goals at the player and team level, we can get a better sense of how each of them performs.

Continue reading →

Wins Above Replacement: Replacement Level, Decisions, Results, and Final Remarks (Part 3)

January 18, 2019 | EvolvingWild | Player Evaluation, Theory, WAR

In part 1 of this series we covered the history of WAR, discussed our philosophy, and laid out the goals of our WAR model. In part 2 we explained our entire modeling process. In part 3, we’re going to cover the theory of replacement level and the win conversion calculation and discuss decisions we made while constructing the model. Finally, we’ll explore some of the results and cover potential additions/improvements.

Continue reading →

Wins Above Replacement: The Process (Part 2)

January 17, 2019 (updated November 6, 2019) | EvolvingWild | Data Analysis, Player Evaluation, Theory, WAR

In part 1, we covered WAR in hockey and baseball, discussed each field’s prior philosophies, and cemented the goals for our own WAR model. This part will be devoted to the process – how we assign value to players over multiple components to sum to a total value for any given player. We’ll cover the two main modeling aspects and how we adjust for overall team performance. Given our affinity for baseball’s philosophy and the overall influence it’s had on us, let’s first go back to baseball and look at how they do it, briefly.

Continue reading →

Wins Above Replacement: History, Philosophy, and Objectives (Part 1)

January 16, 2019 (updated November 6, 2019) | EvolvingWild | History, Player Evaluation, Theory, WAR

Wins Above Replacement (WAR) is a metric created and developed by the sabermetric community in baseball over the last 30 years – there’s even room to date it back as far as 1982, when a system that resembled the method first appeared in Bill James’ Abstract from that year (per Baseball Prospectus and Tom Tango). The four major public models/systems in baseball define WAR as such:

“Wins Above Replacement (WAR) is an attempt by the sabermetric baseball community to summarize a player’s total contributions to their team in one statistic.” FanGraphs
“Wins Above Replacement Player [WARP] is Prospectus’ attempt at capturing a players’ total value.” Baseball Prospectus
“The idea behind the WAR framework is that we want to know how much better a player is than a player that would typically be available to replace that player.” Baseball-Reference
“Wins Above Replacement (WAR) … aggregates the contributions of a player in each facet of the game: hitting, pitching, baserunning, and fielding.” openWAR

Continue reading →

Penalty Goals: An Expanded Approach to Measuring Penalties in the NHL

January 15, 2019 (updated January 18, 2019) | EvolvingWild | Penalties, Player Evaluation, WAR

Intro

Penalty differential figures are a rather ambiguous concept in hockey. It seems only recently that the majority of analysts and fans have stopped touting a player’s total penalty minutes as a positive aspect of a player’s game. Let’s get one thing clear: taking penalties is a bad thing and drawing penalties is a good thing. When a penalty is taken or drawn, the change in strength state (5v5 to 5v4 for instance) directly impacts the rate of goal scoring for a given player’s team (goals for and goals against). We can measure this change by determining league average scoring rates at each strength state and can then determine the net goals that are lost/gained from a penalty that was taken/drawn. This was first shown in the penalty component of the WAR model from WAR-On-Ice (WOI) here. A.C. Thomas explains it:

Continue reading →
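The strength-state idea in the excerpt above can be sketched in a few lines. This is only an illustration under stated assumptions: the scoring rates are invented placeholders rather than league figures, `penalty_value` is a hypothetical helper name, and the sketch ignores details such as a power play ending early when a goal is scored.

```python
# Hypothetical league-average scoring rates in goals per 60 minutes.
# These numbers are placeholders for illustration, not measured values.
RATES_PER_60 = {
    "5v5": {"gf": 2.5, "ga": 2.5},
    "5v4": {"gf": 6.5, "ga": 0.8},  # on the power play
    "4v5": {"gf": 0.8, "ga": 6.5},  # on the penalty kill
}


def net_goals(state: str, minutes: float) -> float:
    """Expected goal differential for time spent in a strength state."""
    rates = RATES_PER_60[state]
    return (rates["gf"] - rates["ga"]) * minutes / 60.0


def penalty_value(new_state: str, minutes: float = 2.0, baseline: str = "5v5") -> float:
    """Net goals gained or lost by moving from the baseline state to new_state."""
    return net_goals(new_state, minutes) - net_goals(baseline, minutes)


taken = penalty_value("4v5")  # a minor penalty taken at 5v5
drawn = penalty_value("5v4")  # a minor penalty drawn at 5v5
print(f"penalty taken: {taken:+.2f} goals, penalty drawn: {drawn:+.2f} goals")
```

With these placeholder rates a minor penalty taken costs 0.19 goals and one drawn gains the same amount. Real values depend on the measured rates at each strength state.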

Reviving Regularized Adjusted Plus-Minus for Hockey

January 14, 2019 (updated January 18, 2019) | EvolvingWild | Data Analysis, Player Evaluation, Resources

Introduction

In this piece we will cover Adjusted Plus-Minus (APM) / Regularized Adjusted Plus-Minus (RAPM) as a method for evaluating skaters in the NHL. Some of you may be familiar with this process – both of these methods were developed for evaluating players in the NBA and have since been modified to do the same for skaters in the NHL. We first need to acknowledge the work of Brian Macdonald. He proposed how the NBA RAPM models could be applied for skater evaluation in hockey in three papers on the subject: paper 1, paper 2, and paper 3. We highly encourage you to read these papers as they were instrumental in our own development of the RAPM method.

While the APM/RAPM method is established in the NBA and to a much lesser extent the NHL, we feel (especially for hockey) revisiting the history, process, and implementation of the RAPM technique is overdue. This method has become the go-to public framework for evaluating a given player’s value within the NBA. There are multiple versions of the framework, which we can collectively call “regression analysis”, but APM was the original method developed. The goal of this type of analysis (APM/RAPM) is to isolate a given player’s contribution while on the ice independent of all factors that we can account for. Put simply, this allows us to better measure the individual performance of a given player in an environment where many factors can impact their raw results. We will start with the history of the technique, move on to a demonstration of how linear regression works for this purpose, and finally cover how we apply this to measuring skater performance in the NHL.
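As a toy illustration of the regression idea described above, the sketch below fits a ridge (regularized) regression to a handful of made-up shifts. Each column marks whether a hypothetical skater was on the ice and the target is an invented shot-differential figure for that shift. The data, the penalty value `lam`, and the helper `ridge_fit` are all assumptions for illustration and are not the authors' model, which accounts for far more factors.

```python
def ridge_fit(X, y, lam):
    """Solve (X'X + lam*I) beta = X'y with Gauss-Jordan elimination (lam > 0)."""
    n = len(X[0])
    # Build the augmented matrix [X'X + lam*I | X'y].
    A = []
    for i in range(n):
        row = [sum(r[i] * r[j] for r in X) + (lam if i == j else 0.0) for j in range(n)]
        row.append(sum(r[i] * t for r, t in zip(X, y)))
        A.append(row)
    for col in range(n):
        # Partial pivoting for numerical stability.
        pivot = max(range(col, n), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        p = A[col][col]
        A[col] = [v / p for v in A[col]]
        for r in range(n):
            if r != col:
                f = A[r][col]
                A[r] = [a - f * b for a, b in zip(A[r], A[col])]
    return [A[i][n] for i in range(n)]


# Made-up shifts: columns are skaters A, B, C (1 = on the ice for that shift).
# A and B always play together, so plain least squares cannot separate them.
X = [
    [1, 1, 0],
    [1, 1, 0],
    [0, 0, 1],
    [1, 1, 1],
]
y = [2.0, 1.0, -1.0, 0.5]  # invented shot differential per shift

beta = ridge_fit(X, y, lam=1.0)
for name, coef in zip("ABC", beta):
    print(f"skater {name}: {coef:+.3f}")
```

Because skaters A and B never appear apart in this toy data, the penalty term is what makes the system solvable, and it splits their shared credit evenly. Raising `lam` shrinks every estimate toward zero.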

Continue reading →

Revisiting Relative Shot Metrics – Part 2

February 22, 2018 (updated April 10, 2018) | EvolvingWild | Player Evaluation, Resources, Theory

In part 1, I described three “pen and paper” methods for evaluating players based on performance relative to their teammates. As I mentioned, there is some confusion around what differentiates the relative to team (Rel Team) and relative to teammate (Rel TM) methods (it also doesn’t help that we’re dealing with two metrics that have the same name save four letters). I thought it would be worthwhile to compare them in various ways. The following comparisons will help us explore how each one works, what each tells us, and how we can use them (or which we should use). Additionally, I’ll attempt to tie it all together as we look into some of the adjustments I covered at the end of part 1.

A quick note: WOWY is a unique approach, which limits its comparative potential in this regard. As a result, I won’t be evaluating/comparing the WOWY method further. However, we’ll dive into some WOWYs to explore the Rel TM metric a bit later.

Rel Team vs. Rel TM

Note: For the rest of the article, the “low TOI” adjustment will be included in the Rel TM calculation. Additionally, “unadjusted” and “adjusted” will indicate if the team adjustment is implemented. All data used from here on is from the past ten seasons (’07-08 through ’16-17), is even-strength, and includes only qualified skaters (minimum of 336 minutes for Forwards and 429 minutes for Defensemen per season as estimated by the top 390 F and 210 D per season over this timeframe).

Below, I plotted Rel Team against both the adjusted and unadjusted Rel TM numbers. I have shaded the points based on each skater’s team’s EV Corsi differential in the games that skater played in:

[Figure: Rel Team plotted against adjusted and unadjusted Rel TM, with points shaded by team EV Corsi differential]

Continue reading →