The first significant breakthrough in hockey analytics occurred in the mid-2000’s when analysts discovered the importance of Corsi in describing and predicting future success. Since that time, we’ve seen the creation of expected goals, WAR models, and more. Many have cited that the next big breakthrough in hockey analytics will come once the NHL is able to provide tracking data. We’ve already seen some of the incredible applications of the MLB’s Statcast data and the NBA’s SportVu data. Unfortunately, the NHL has no immediate plans to publicly provide this data and as such, many analysts have decided to manually obtain the data.
Hockey fans and analysts have always appreciated the importance of passing. But until the passing project led by Ryan Stimson, we couldn’t quantify that importance. His work supported by a team of volunteers and other analysts has established that the passing sequence prior to a shot is a significant predictor of the likelihood of the shot becoming a goal. His work also showed that measuring shots and shot assists combined as shot contributions is a better predictor of future performance for both players and teams than shots alone.
Knowing that, the logical next step is to use passing data in analysis whenever possible. Unfortunately, the NHL does not provide passing data so it must be manually tracked by people like Corey Sznajder. Corey’s work is invaluable and I encourage you to support him but he’s only one person.
This article attempts to estimate a player’s quantity of shot assists in a given sample using publicly available data to help fill in gaps where tracked data doesn’t exist.
On Monday, I introduced some work on quantifying and identifying team playing styles, which built upon my earlier work on identifying individual playing styles. Today we’re going to discuss how to make this data actionable.
What are the quantifiable traits of successful teams? What plays are they executing that makes them successful? How can we use data to then build a style of play that is more successful than what we’re currently doing? The way we bridge the gap between front office and behind the bench is by providing data to improve their matchup preparation, lineup optimization, and enhance tactical decisions.
This is what I mean by actionable: applying data-driven analysis and decision-making inside the coach’s room and on the ice. All data is from 5v5 situations and is either from the Passing Project or from Corsica.
One of the aspects of player performance that is discussed ad nauseam is chemistry. How well do certain players elevate their performance with one player or another due to some inherent ability to find the other on the ice? To know what a teammate is going to do? However, very little has been done to analyze this phenomenon. In this piece, I posit that by identifying playing styles, something that’s been done in the NBA, we can quantify how well certain players will complement one another.
All data is from 5v5 situations from the 2015 – 2016 and current season, totaling almost 900 games from the Passing Project volunteers and Corey Sznajder. Special thanks to Asmae for her guidance throughout this piece.
I want to stress that this is a first foray into this type of analysis and simply because a player has a different style than what I’ve named (which are relatively arbitrary) it doesn’t mean they are necessarily better than another player. Players may have similar styles, but some will simply be more effective due to their ability. Finally, given that each day we accumulate more data, a player with a smaller sample size could find themselves in a different cluster in future analysis.
Recently, I showed how passing data is a better predictor of future player scoring than existing public metrics. In this piece, I’m going to show that by accounting for shot quality via passing metrics we can more accurately predict a team and player’s on-ice goal-scoring rates. I’m going to do this by quantifying the pre-shot movement that occurs when a player is on the ice. Finally, I’ll spend some time discussing certain forwards/teams that caught my eye. All data is from 5v5 situations and special thanks to Dr. McCurdy for pulling the on-ice player data for me. All non-passing project data is from Corsica.
Last time, I showed how passing data is a better predictor of future player scoring than existing public metrics. In this piece, I’m going to spend some time talking about how we can more reliably evaluate offensive and defensive contributions from defensemen, which has been difficult due to a lack of data. Not only due to a lack of data, but from a lack of flexibility regarding the identity of the position. Traditionally thought of as existing to defend and “make a good first pass,” I feel this limits the scope of both how we evaluate the position and its responsibilities.
In order to better evaluate defensemen, we need to identify specific metrics that we can tie to future goals. In looking at entry assists (a pass occurring in the neutral or defensive zones that precedes a shot), both for and against, we can quantify how effective that defensemen is at generating offense in transition, as well as suppressing those chances. The importance of those things at the team level is something I’ve previously discussed (transition here and defensive work here with Matt Cane). Once we identify these metrics as having a strong impact on future scoring and goal-suppression, we naturally then reevaluate what the proper roles are for a defensemen, which in turn forces us to reevaluate how we evaluate them.
Personally, I’d like to see us think of them more as fullbacks or midfielders in soccer (this is part of a larger concept of redefining positions and responsibilities, which will be posted in the next month or so, I hope). There are still going to be various types of players based on their individual skill set and team tactics, but supporting play, overlapping on the attack, and distribution are all pillars of what teams should look for. Let’s get to it.
While I have spent a lot of time over the last several months digging into how we can quantify passages of play and inform better tactical decisions, it’s time to revisit how passing impacts scoring at the player level. We have only been using half of the picture in terms of individual shots and goals for player evaluation. Sure, we have primary and total points, but primary assists aren’t a very useful metric. The rate at which players create shot assists also appeared to have significantly more value than a player’s own shots in some analysis I did last year.
This piece will release individual passing data for the 2014 – 2015, 2015 – 2016, and 2016 – 2017 seasons, the latter of which tracked by Corey Sznajder, the former tracked by myself and many others. However, it is important to provide context and meaning to the numbers rather than simply inundate you with data.
At the 2nd Annual Hockey Analytics Conference at the Rochester Institute of Technology, Matt Cane and I presented on new ways of measuring and evaluating defensive play. We used the passing data from my project to take a look. I believe Matt will write up his half of the presentation, so this post will focus on my half of the presentation. You can access our slides and view our presentation here. I will go into greater detail in this post since I am without time constraints. All of the data we used can be accessed here. Everything in this post deals with 5v5 play.
The Passing Project headed by fellow Hockey Graphs contributor Ryan Stimson (@RK_Stimp) is one of the most exciting things happening in hockey analytics. The project consists of dozens of volunteers manually tracking the passes that lead to shot attempts in games across the NHL. Thus far, the project has compiled data for nearly five hundred games. Ryan laid the groundwork for analyzing the project’s data in his piece here. That pieces discusses primary shot contributions (PSCs), which is a counting stat comprised of shots and shot assists. Shots in this article is synonymous with corsi meaning all shots and not just shots on goal. Ryan has built on that original work with a piece on offensive zone play here and a piece on neutral zone play here. And following his line of thought, I wrote a piece for NHL Numbers that expanded on his approach to offensive zone analysis.
Last time, I showed how using data and video evidence can be combined to inform tactical offensive zone decisions. Today, I’m going to do the same thing in the neutral zone. Neutral zone play is something that has been a hot topic among analysts for many years, going back to this paper written by Eric Tulsky, Geoffrey Detweiler, Robert Spencer, and Corey Sznajder. Our own garik16 wrote a great piece covering neutral zone tracking. Jen Lute Costella’s work shows that scoring occurs sooner with a controlled entry than an uncontrolled entry.
However, for all the work that goes into zone entries, there have been few efforts to account for how predictable these metrics are. At the end of the day, what matters is how we can better predict future goal-scoring. Also, in looking at our passing data, what can we also learn about how actions are linked when entering the zone? Does simply getting into the offensive zone matter? Does it matter whether it’s controlled or not? Or, does what happen after you enter the zone matter exponentially more? Lastly, what decisions can we make to improve the team’s process using this data?