/cdn.vox-cdn.com/uploads/chorus_image/image/32322503/483083047.0.jpg)
Once again, let's look at Penalty Kill Save Percentage. I am going to look at 2009-10 through 2013-14. I have shown repeatedly that different goalies are different at Even Strength. As always, in order to do regression analysis I constructed logits (ln odds of a save). I threw out the goalies whose logits were zero or undefined. All the regressions are weighted by number of shots faced. Lastfirst is the name of the goalie variable.
There is no year effect in this data. > LinearModel.2 = lm(PKLogit ~ Year, data=PKSP5, weights=PKSA) > anova(LinearModel.2) Analysis of Variance Table Response: PKLogit Df Sum Sq Mean Sq F value Pr(>F) Year 1 1.1 1.0716 0.0932 0.7603 Residuals 414 4759.6 11.4967 Goalies are all the same, teams are all the same. > LinearModel.1 = lm(PKLogit ~ lastfirst + Team, data=PKSP5, weights=PKSA) > anova(LinearModel.1) Analysis of Variance Table Response: PKLogit Df Sum Sq Mean Sq F value Pr(>F) lastfirst 132 1265.94 9.5904 0.7756 0.9486 Team 30 366.37 12.2123 0.9876 0.4894 Residuals 253 3128.39 12.3652 There is a small correlation between ESSP and PKSP. ESSP explains about 8.5% of the variability seen in PKSP (i.e., not much). > cor.test(PKSP5$ESVPCT, PKSP5$PKSVPCT, alternative="two.sided", + method="pearson") Pearson's product-moment correlation data: PKSP5$ESVPCT and PKSP5$PKSVPCT t = 6.2456, df = 414, p-value = 1.048e-09 alternative hypothesis: true correlation is not equal to 0 95 percent confidence interval: 0.2030235 0.3788969 sample estimates: cor 0.2934412 Graphically, the relationship looks like: Some of the apparent correlation is probably being driven by the 4-5 goalies who are in the left lower corner. In the regression analysis, weighting by shots faced effectively removes these data points. Here they are getting the same weight as everybody else. The true correlation is likely closer to zero. Conclusions There is no year effect. There is no team effect. There is no goalie effect. "Absence of proof is not proof of absence." From the ES data, we know that it takes several thousand shots to tell goalies apart. The busiest goalies in this data faced less than 1500 shots over the 5 seasons. There may well be a goaltender effect. However, if it is too small to see in 5 years of action, it certainly will not be visible in 1 year. If a 0.874 goaltender sees 250 PK shots in a season, the 95% Confidence Interval for observed PK Save Percentage is 0.827 to 0.912. Looking through the data, there is no goaltender who consistently lies outside that range. Over the 5 years, goaltenders faced a total of 54898 shots and made 48001 saves for an overall PK Save Percentage of 0.87437.