Lateral Puck Movement in the NZ

Research shows that lateral (“east-west”) puck movement in the offensive zone improves a team’s odds of scoring. But I have now heard from people in various positions within the hockey industry on why it might also be useful to generate east-west puck movement in the neutral zone. The theories – focused on lateral passing, lane changes and stretch passes, respectively – all boiled down to one point: When you rush the puck up ice, the defending team will focus on that side, leaving the other side of the ice somewhat more open, so there might be open ice to exploit.

Continue reading

Passing clusters: A Framework to Evaluate a Team’s Breakout

Quick breakouts – trying to move the puck out of your zone right after gaining possession – make up roughly 38% of possessions and account for 22% of all shots and 22.4% of Expected Goals (at least according to my possession and xG definitions). Therefore, understanding what does and does not work when breaking out the puck with forecheckers present is important. There is evidence that passes by wingers from the defensive half boards to the inside produce more offense than those straight up ice. But the puck is more often recovered elsewhere, so these passes by wingers aren’t the first pass in a possession and are therefore presumably influenced by the previous play. It should be interesting to find out how including the pass(es) that came before affects this conclusion.

Continue reading

A crowdfunding initiative to promote diversity at the Columbus Analytics Conference

I’ve been fortunate enough to be able to attend the last three years of the RIT Sports Analytics Conference. The first year I went, I was nervous to meet people whose work I admired. I was afraid that nobody would want to talk to this new person that few people knew and who was just starting to learn about the field. 

I could not have been more wrong. 

Continue reading

Exploratory Data Analysis Using Tidyverse

This post assumes beginner knowledge of R.

Welcome to the second article in our series on basic data cleaning and data manipulation! In this article, we’re going to use play-by-play data from two NHL games and answer two questions:

  • which power play unit generated the best shot rate in each game?
  • which defenseman played the most 5v5 minutes in each game?

In the process of doing so, we’ll cover several topics of basic data manipulation in the tidyverse, including using functions, creating joins, grouping and summarizing data, and working with string data.

Continue reading

Combining Manually-Tracked Data with Play-by-Play Data

This post assumes beginner knowledge of R.

If you’ve ever analyzed hockey data, then you’re probably familiar with the NHL’s Real Time Scoring System, which produces what’s more commonly known as play-by-play data. These data are publicly available and allow us to see every event recorded by the NHL in a given game. Shown below are selected details about the first 10 events from two games on February 18, 2019: Tampa Bay at Columbus and Vegas at Colorado.

Continue reading

The Importance of Pressure for a Successful Forecheck

Most of my posts so far have talked about zone exits from the perspective of the team trying to break out of their defensive zone. Now, let’s flip the script and discuss the team on the forecheck. This team does not have possession of the puck, but they are in their offensive zone, which is an advantage. So, how can they regain control?

Continue reading

Team Level Zone Exits

From past posts, we have a general sense of the basics of zone exits: zone exits are important because they get you out of your zone and towards an opportunity to score. The key to a successful zone exit is maintaining possession, ideally by avoiding the temptation to dump the puck out.

But so far, we have only looked at zone exits league-wide. Most fans care about one particular team more than the rest, but we haven’t looked at team-level results at all. So today, let’s see how each team has performed at zone exits over the past three seasons.

Continue reading

Visualizing and Quantifying Passing on the Power Play

Visualizing passes isn’t easy in hockey. In any given KHL game, there are between 700 and 900 passes. Somewhere between 65% and 85% are successful*. If you wanted to focus on just the successful ones, you’d have to find a way to meaningfully and concisely represent 500-700 events. Let’s start with something simpler: the power play. If we further restrict our target to passes by single teams during 5v4 power plays in the OZ, we still get between 40 and 50 passes per game per team. Looking at two random KHL games, you can see that this is still quite a lot of passes:

There are some trends to be picked up on, but it’s not very clean. And any semi-serious opposition scouting (especially of special teams) will take into account multiple games, which then leads to an unidentifiable mess when plotted.

Continue reading

So You Got Accepted To Present at a Sports Analytics Conference

First of all, congratulations if you got accepted. Kudos to you if you got accepted to a conference like RITSAC, a very well-run and well-curated conference. This is a wonderful accomplishment, and you should be proud. Tell your friends and family. Celebrate. Bask in the adoration.

Well, maybe not that last part. But you get my point. Your work clearly has some perceived value and is based on solid reasoning and data analysis.

So now what?

Continue reading