clock menu more-arrow no yes mobile

Filed under:

Shot Differential Part 2: Data Mining the Stretch Run

New, 289 comments

(Author's note: This post took me way too long to construct, contains zero cartoons, isn't very well written, and doesn't come to any real conclusions.  You may waste hours trying to decipher all the data tables and may fully hate me for it, but at least we'll all be a few hours closer to puck drop.  Ah, I kid -- you'll probably love it.)

A month ago, I took a statsy look at the Ducks' regular season in terms of shot differential.  One of the striking findings was that while the Ducks were outshot by a whopping 314 shots last season, they were somehow able to generate a +14 shot differential over their last 30 games.  Nice turnaround, right?

So I decided to dig deeper into the shot data from those last 30 games of the regular season, using NHL.com's Play By Play logs to make a huge spreadsheet (here's the one from the season finale, for example).  It's an incredibly effort-heavy process because while it contains useful data about who is on the ice for every event in a game, it pastes rather horribly into Excel and has to be fixed one game at a time, always noting whether Anaheim is the home or road team.  But I persevered through 30 games because I was curious: How did the Ducks generate that slightly-positive shot differential, and do those results feel repeatable?  Before I go into any findings, though, I've got several caveats to go through about this whole exercise:

  • The 30 games-to-go mark is a kind of important line in the sand; it's worth mentioning that if I did this exercise over the final 31 games, the Ducks would have been outshot over the sample.  Even so, keeping shots close is an achievement for last year's Ducks -- in no other 30- or 31-game segment did the Ducks come anywhere near their opponents in shots.
  • The Ducks were obviously a very charmed in this stretch, as it's the 30 games that saw them climb all the way up to 4th in the west without much use from their #1 netminder.  7 of these 30 games went into overtime, and none of them went to a shootout.  Ducks won 6 -- they were crazily good in 4-on-4 down the stretch.  So the Ducks were lucky or clutch, take your pick, but close games were going Anaheim's way.
  • I'm not really a strong proponent of shot-counting analysis, even though that's exactly what I'm about to do.  Reason one: Shots are not goals; they do not win games.  And while over time counting shots does more or less approximate goals, it does so by diluting the incentives -- hockey isn't about taking the most shots on net to win a game, it's about taking the right shots on net to win a game, and I still don't think we're capturing the right shot data to measure that (How many shots-on-goal were shot through screens last season?  How many came on odd-man rushes?  Somebody upstairs should start recording those goal-enhancing factors on every shot attempt.).  Raw shot counting certainly has value, no doubt, but in a game theory sense, it's not an expression of the true intent of the game.

    Also, I'm a little wary of attaching things like Corsi metrics to individuals -- at any point in time, one player doesn't affect the flow of play so much as his four on-ice teammates do; he will obviously make some impact but he's also a bit bound by where he plays.  So it's not always a "talent" stat that emerges (that guy sure is good at Corsi!) so much as a situational stat (that guy plays in a good Corsi environment, plus he might be good at Corsi), and I'm not sure we always treat individual rankings in the right way.  So don't read this post too damningly; some findings will appear striking, but I really don't mind shrugging them off as "well, shot analysis is sometimes bullshit".  Because yeah, it sometimes is.
  • I'm probably including too many columns in my tables, for whose benefit, I'm not sure.  This has made the tables tougher to read, but in my defense, SBN's table formatter kind of sucks.  For each table I'll point out what takeaway observations I note, but generally the columns to focus on are the 3rd and 6th -- Shots on Goal and Attempted Shots.  I've included goal differential and some breakdown of shot attempts (shots missed vs. shots blocked) for context and reference, but this post's main focus is how the Ducks got their slightly positive shot differential.
  • This exercise would have been way, way more useful had this spreadsheet contained all 82 games -- how do these 30-game results compare against the 52 games that came before it?  But that kind of effort isn't imminent -- this spreadsheet is a real bitch to make.  A fuckload of copying, a fuckload of pasting, a fuckload of formatting, a fuckload of parsing.  But it works now!  Fuckload yeah!
  • Also, because I don't know every goalie's number on every opposing team, empty net goals are mixed in with the regular goals.  5-on-5 really just means there were six players on the ice per each side -- most likely one of them was a goalie, but not always.

Caveats noted?  Onwards!

Okay, the following tables are dense with data, and there's quite a few of them.  But let's go over the basics, looking at the second column, "Goals", as our walk-through example -- if you get the idea of this column, the rest should hopefully make sense.  At the bottom of the Goals column we can see that over the 30 game sample, the Ducks scored 99 goals and allowed 89, for a +10 goal differential.  The rows, then, are splits about how that happened.  Anaheim had a +2 goal differential at even strength, a +22 goal differential on the power play, a -16 differential on the penalty kill, and a +2 differential in penalty shots.  The subsequent columns show total shots on goal, shots fired wide, shots that were blocked, and total shot attempts.

Last 30 Games -- Goals, Shots, and Shot Attempts by Manpower Situation

Manpower
Situation
Goals
GF - GA = GD
Shots on Goal
SF - SA = SD
Missed Shots
MSF - MSA = MSD
Blocked Shots
BSF - BSA = BSD
Attempted Shots
ASF - ASA = ASD
EVEN STRENGTH +69 - 67 = +2 +671 - 704 = -33 +233 - 324 = -91 +266 - 367 = -101 +1,170 - 1,395 = -225
     5 on 5 +60 - 63 = -3 +635 - 678 = -43 +226 - 314 = -88 +250 - 361 = -111 +1,111 - 1,353 = -242
     4 on 4 +9 - 4 = +5 +36 - 26 = +10 +7 - 10 = -3 +16 - 6 = +10 +59 - 42 = +17
     3 on 3 +0 - 0 = 0 +0 - 0 = 0 +0 - 0 = 0 +0 - 0 = 0 +0 - 0 = 0
POWER PLAY +25 - 3 = +22 +177 - 24 = +153 +74 - 7 = +67 +59 - 8 = +51 +310 - 39 = +271
     5 on 4 +21 - 3 = +18 +166 - 24 = +142 +69 - 7 = +62 +52 - 7 = +45 +287 - 38 = +249
     4 on 3 +1 - 0 = +1 +5 - 0 = +5 +0 - 0 = 0 +2 - 0 = +2 +7 - 0 = +7
     5 on 3 +3 - 0 = +3 +6 - 0 = +6 +5 - 0 = +5 +5 - 1 = +4 +16 - 1 = +15
PENALTY KILL +3 - 19 = -16 +20 - 129 = -109 +4 - 62 = -58 +0 - 74 = -74 +24 - 265 = -241
     4 on 5 +3 - 18 = -15 +20 - 121 = -101 +4 - 58 = -54 +0 - 71 = -71 +24 - 250 = -226
     3 on 4 +0 - 0 = 0 +0 - 2 = -2 +0 - 2 = -2 +0 - 1 = -1 +0 - 5 = -5
     3 on 5 +0 - 1 = -1 +0 - 6 = -6 +0 - 2 = -2 +0 - 2 = -2 +0 - 10 = -10
PENALTY SHOTS +2 - 0 = +2 +3 - 0 = +3 +0 - 1 = -1 +0 - 0 = 0 +3 - 1 = +2
30-GAME TOTAL +99 - 89 = +10 +871 - 857 = +14 +311 - 394 = -83 +325 - 449 = -124 +1,507 - 1,700 = -193

Takeaway #1: The Ducks did have a +14 shot differential over the final 30 games, but it did not come as a result of taking more shot attempts than their opponents, in fact that margin wasn't close at all.  Opponents took 193 more shot attempts than Anaheim did, but because so many went wide or were blocked, the Ducks did manage to come away outshooters. 

Now a smart blogger would take a moment here and talk about team styles, or coaching philosophies, or how the Ducks play in such a way that encourages opponents to take wild and hurried shot attempts or something.  But I'm not really playing the role of explainer today; I have no idea really why the Ducks were forcing so many shots wide or weren't making more shot attempts themselves.  Maybe they were lucky, maybe Ray Emery has a death stare, maybe it is systematic -- tough to say.

Takeaway #2: The Ducks did not outshoot their opponents at 5-on-5 over this stretch, but rather kept shots close enough that they could make up for it by excelling on special teams and 4-on-4.  The special teams discrepancy is pretty significant, and at first I had to go look to make sure that power play time wasn't severely weighted in the sample.  Power play time did favor the Ducks, but not to the same degree as shots produced -- 172 power play minutes vs. 160 shorthanded minutes.  The Ducks' power play did generate shots better than the opponent's.

So the initial answer(s) to how the Ducks got a positive shot differential over the final 30 games?  Forcing opponent shot attempts wide, blocking shots, and working the power play and 4-on-4 situations to their full advantage.  Also, they hit the net on all three of their penalty shot attempts, unlike Olli Jokinen.  :)

Now here's another fun way to slice the data -- how the Ducks' shot metrics split based on the scoreboard:

Last 30 Games -- Goals, Shots, and Shot Attempts by Scoreboard Situation

Scoreboard
Situation
Goals
GF - GA = GD
Shots on Goal
SF - SA = SD
Missed Shots
MSF - MSA = MSD
Blocked Shots
BSF - BSA = BSD
Attempted Shots
ASF - ASA = ASD
Down 3+ goals +5 - 5 = 0 +56 - 28 = +28 +13 - 10 = +3 +28 - 8 = +20 +97 - 46 = +51
Down 2 goals +5 - 8 = -3 +70 - 58 = +12 +19 - 23 = -4 +33 - 19 = +14 +122 - 100 = +22
Down 1 goal +23 - 10 = +13 +203 - 140 = +63 +55 - 62 = -7 +60 - 62 = -2 +318 - 264 = +54
Tie Game +37 - 34 = +3 +295 - 285 = +10 +111 - 131 = -20 +100 - 157 = -57 +506 - 573 = -67
Up 1 goal +15 - 18 = -3 +140 - 180 = -40 +59 - 81 = -22 +64 - 134 = -70 +263 - 395 = -132
Up 2 goals +10 - 8 = +2 +71 - 117 = -46 +33 - 63 = -30 +27 - 51 = -24 +131 - 231 = -100
Up 3+ goals +4 - 6 = -2 +36 - 49 = -13 +21 - 24 = -3 +13 - 18 = -5 +70 - 91 = -21
30-GAME TOTAL +99 - 89 = +10 +871 - 857 = +14 +311 - 394 = -83 +325 - 449 = -124 +1,507 - 1,700 = -193

Takeaway #3: Man, the Ducks were a very, very good team down the stretch when down by a single goal.  Now I should note, I think the general pattern of this table holds true for most teams: when a team is trailing in a game, its shot ratios look better than when it is leading -- it reflects not only shifted priorities (trying to tie vs. trying to hold a lead), but also there's some make-up refereeing and double-shifting and desperation and whatnot as well.

But wow -- that "Down 1 goal" row -- pretty exceptional stuff; the Ducks were "clutch" as fuck.  Now I'm not really sold on the idea of long-term repeatability of this, but it is good to know that the Ducks' desperation mode appears to be a very good weapon -- it was a key reason for Anaheim's late-season success and part of how they got the positive shot differential.

* * *

Now that we've had a taste of "how" the Ducks outshot their opponents, let's dig deeper into one of the components -- keeping things reasonably close shot-wise at 5-on-5.  Even though the Ducks were outshot by 43 shots 5-on-5 over the 30 games, that did mark an improvement from earlier stretches in the season.  What individuals enjoyed that relative success?

Here's a list of the forwards, ranked by descending shot differential.  Don't pay too much attention to the specific order, as many of these guys end up pretty close and I probably should have used some sort of shot ratio to rank, but do notice who's near the top and bottom of the list.

Forwards
5-on-5 only
Goals
GF - GA = GD
Shots on Goal
SF - SA = SD
Missed Shots
MSF - MSA = MSD
Blocked Shots
BSF - BSA = BSD
Attempted Shots
ASF - ASA = ASD
Jason Blake +16 - 15 = +1 +185 - 170 = +15 +51 - 80 = -29 +79 - 80 = -1 +315 - 330 = -15
Ryan Getzlaf +33 - 23 = +10 +239 - 227 = +12 +106 - 96 = +10 +90 - 120 = -30 +435 - 443 = -8
Jar Jar Ruutu +2 - 2 = 0 +71 - 63 = +8 +20 - 30 = -10 +17 - 36 = -19 +108 - 129 = -21
Corey Perry +33 - 28 = +5 +257 - 255 = +2 +108 - 117 = -9 +98 - 119 = -21 +463 - 491 = -28
Bobby Ryan +34 - 25 = +9 +248 - 247 = +1 +112 - 104 = +8 +98 - 118 = -20 +458 - 469 = -11
George Parros +0 - 1 = -1 +51 - 55 = -4 +16 - 36 = -20 +20 - 43 = -23 +87 - 134 = -47
Teemu Selanne +23 - 20 = +3 +196 - 203 = -7 +55 - 88 = -33 +90 - 97 = -7 +341 - 388 = -47
Dan Sexton +7 - 9 = -2 +95 - 104 = -9 +31 - 59 = -28 +30 - 64 = -34 +156 - 227 = -71
Todd Marchant +3 -10 = -7 +103 - 117 = -14 +27 - 62 = -35 +32 - 70 = -38 +162 - 249 = -87
Saku Koivu +19 - 20 = -1 +169 - 185 = -16 +46 - 77 = -31 +76 - 92 = -16 +291 - 354 = -63
Matt Beleskey +0 - 6 = -6 +26 - 43 = -17 +12 - 19 = -7 +13 - 28 = -15 +51 - 90 = -39
Brad Winchester +3 - 11 = -8 +77 - 95 = -18 +24 - 51 = -27 +23 - 52 = -29 +124 - 198 = -74
Brandon McMillan +10 - 19 = -9 +116 - 165 = -49 +45 - 82 = -37 +59 - 111 = -52 +220 - 358 = -138

Takeaway #4: Wow, Bacon McMuffin did a lot of shot-bleeding down the stretch!  Now here's where I want to be cautionary -- McMuffin was probably the most important bottom-six forward for Anaheim down the stretch, but he does get murdered in these numbers almost because of that.  He had to play with Corsi-bleeders pretty exclusively once Getzlaf returned from injury, and he couldn't fix the problem himself.  But the lesson here isn't so much that McMuffin is bad, it's that even as the Ducks were winning close games, there was still lots of room for improvement particularly from Anaheim's third line.

But hey, Jar Jar Ruutu did pretty well on these metrics during his short stint.  As time passes, he'll probably get lumped in with Brad Winchester as "deadline deals that didn't pan out", but they were much different players in Anaheim in terms of on-ice reliability.  I would have no problem if the Ducks had retained Ruutu.

Similarly, here's results for individual defensemen:

Defensemen
5-on-5 only
Goals
GF - GA = GD
Shots on Goal
SF - SA = SD
Missed Shots
MSF - MSA = MSD
Blocked Shots
BSF - BSA = BSD
Attempted Shots
ASF - ASA = ASD
Andy Sutton +9 - 2 = +7 +55 - 38 = +17 +24 - 25 = -1 +22 - 39 = -17 +101 - 102 = -1
Lubomir Visnovsky +38 - 27 = +11 +269 - 256 = +13 +105 - 130 = -25 +110 - 128 = -18 +484 - 514 = -30
Andreas Lilja +2 - 9 = -7 +91 - 93 = -2 +24 - 45 = -21 +35 - 53 = -18 +150 - 191 = -41
Toni Lydman +28 - 22 = +6 +225 - 230 = -5 +91 - 113 = -22 +96 - 111 = -15 +412 - 454 = -42
Cam Fowler +11 - 24 = -13 +227 - 232 = -5 +59 - 106 = -47 +69 - 135 = -66 +355 - 473 = -118
Sheldon Brookbank +1 - 3 = -2 +74 - 87 = -13 +11 - 34 = -23 +21 - 41 = -20 +106 - 162 = -56
Francois Beauchemin +14 - 16 = -2 +162 - 208 = -46 +67 - 85 = -18 +66 - 92 = -26 +295 - 385 = -90
Luca Sbisa +15 - 18 = -3 +170 - 222 = -52 +70 - 91 = -21 +78 - 119 = -41 +318 - 432 = -114

Takeaway #5: Wait a goddamn second -- Andy Sutton was useful at 5-on-5 down the stretch?!  So says the data, at least.  Obviously this was cushy minutes, but maybe it did take the guy most of the season to find a groove he could play in.  I was as happy as anybody that the Oilers traded for Sutton, and still sort of am -- he was a healthy scratch too often for that money -- but in the right situations, the guy can be effective.

Takeaway #6: It's sort of interesting to look at Cam Fowler vs. Beauchemin and Sbisa.  Fowler is the Ducks' d-man who suffers the most in plus-minus, but his 5-on-5 shot differential wasn't bad at all, the same as Toni Lydman.  Then again, his shot-attempt differential is the worst, and goals-against are going in -- there is some problem outside of shot totals, I'd think.  Meanwhile Sbisa and Beauchemin have pretty reasonable plus-minus numbers, but their shot-differential numbers are McMuffin bad.  Is that a function of actually playing too much with McMuffin, or are they the cause of it?  Again, I'm not explaining today, but they too appear to be lightning rods for 5-on-5 shot bleeding.  But it didn't really kill them on the scoreboard.

Anyways, here's another fun feature about the spreadsheet I've put together -- I can even look at how defensemen fared together!

Defense Pairs
5-on-5 only
Goals
GF - GA = GD
Shots on Goal
SF - SA = SD
Missed Shots
MSF - MSA = MSD
Blocked Shots
BSF - BSA = BSD
Attempted Shots
ASF - ASA = ASD
Sbisa - Sutton +6 - 2 = +4 +38 - 27 = +11 +19 - 14 = +5 +17 - 27 = -10 +74 - 68 = +6
Lydman - Visnovsky +28 - 18 = +10 +204 - 197 = +7 +83 - 97 = -14 +87 - 96 = -9 +374 - 390 = -16
Brookbank - Fowler +1 - 1 = 0 +58 - 55 = +3 +9 - 20 = -11 +14 - 30 = -16 +81 - 105 = -24
Fowler - Visnovsky +4 - 5 = -1 +38 - 37 = +1 +11 - 20 = -9 +8 - 24 = -16 +57 - 81 = -24
Beauchemin - Fowler +3 - 3 = 0 +26 - 26 = 0 +11 - 10 = +1 +6 - 14 = -8 +43 - 50 = -7
Fowler - Lilja +2 - 8 = -6 +79 - 81 = -2 +21 - 39 = -18 +31 - 48 = -17 +131 - 168 = -37
Beauchemin - Sbisa +9 - 11 = -2 +108 - 159 = -51 +43 - 59 = -16 +47 - 70 = -23 +198 - 288 = -90

Here are the pairings that had some 50+ shots for/against at 5-on-5 over the stretch, again sorted by shot differential.  Again, Sutton comes out strong, and Beauchemin-Sbisa are solidly behind the pack shot-wise; incredibly pretty much all the 5-on-5 discrepancy can be put on them.  And yet most of the goal-bleeding is being done by Fowler-Lilja for some reason.  Luck?

Takeaway #7: Now that the effort to construct this spreadsheet is completed, playing around with it is kind of cool!

Again, none of this is meant to be a slight against anybody's play -- this is just a particularly-successful 30-game sample that is being parsed to death in an imperfect manner, and while it does reveal quite a bit about how this sequence of results transpired, there's always some bullshit mixed in.

* * *

So now back to the main question: is the manner Ducks' success on these 30-games repeatable or reliable going into the coming season?  Tough to say, obviously -- the Ducks had many things go right over that stretch, with many dramatic comebacks and late wins; that probably doesn't carry over very well.  And it's obviously going to be important that whatever the Ducks were doing on their special teams carry over.  Todd Marchant has retired, and it's possible Teemu does as well -- they won't be doing it with exactly the same personnel.

But at the same time, there's definitely areas that can improve, especially on the 3rd line.  And that's really where Andrew Cogliano fits in, hopefully.  His game has flaws for sure, but the bar for improving the Ducks appears to be pretty damn low -- if he and some of the kids can forge a semi-competent third line, that's better than what the Ducks had over that 30-game stretch.  Plus there are some players where we could expect some improvement anyways -- McMuffin, Fowler, and Sbisa are all going to be a year wiser, and Beauchemin is no longer settling in. 

Bottom line: I gleefully don't know.  I was talking to my buddy the other day and telling him that this season feels pretty unique from a blogging sense -- I can see many reasons why the Ducks will be good, and I can see just as many reasons why the Ducks will be bad; the spectrum of potential outcomes seems much wider than any other season I can remember.  So many questions to be answered, and I'm getting pretty excited to see how it pans out.

Go Ducks.