John Heyman made a comment during the MLB Network Hall of Fame show that cut both ways for me. Rich Lederer, long time advocate for Bert Blyleven's Hall of Fame induction transcribes Heyman's remarks:
"I never thought [Bert Blyleven] was a Hall of Famer when he was playing, and I saw him play his entire career."
"[His popularity] is based on a lot of younger people on the Internet who never saw him play."
"It's not about stats...it's about impact."
- Jon Heyman on MLB Network, 1/12/09
Patrick Sullivan, another writer at Baseball Analysts, looks into poor award voting and its effects on Hall of Fame ballotting. Bert didn't do well in the Cy Young voting, and that's held against him as well.
The question for me is, why is there such a huge difference between the stats and the perception? This query is especially important to me, because to a certain extent I sympathize with Heyman on this one. Unlike Heyman, I can be convinced otherwise.
The Minnesota factor doesn't ring true to me. Yes, it was a small market, but growing up during the 1970s I was well aware of players like Tony Oliva and Rod Carew, both who more than got their due winning batting titles. With 12 teams in the AL at the time, the Twins would play the big markets in the east four times during the season, giving someone like Blyleven the possibility of three or four starts. Sports writers in the east saw him pitch more than enough. They just weren't impressed with what they saw.
The biggest reason stems from the disconnect between Blyleven's ERA and his winning percentage. Were his stats deceiving? Was there something about the distribution of his runs allowed that is fooling the "younger people on the Internet who never saw him play?"
I decided to look at games scores in a probabilistic fashion. Given a particular game score, or range of game scores, what is the probability of winning that start? The probability of losing that start? If Blyleven's expected wins and losses are in line with his actual wins and losses, then the people unimpressed when they saw him play have a point. If his game scores indicate Bert should have won a lot more, or lost a lot less, then his Hall candidacy looks much better. I took all games in the Day by Day Database, currently from 1957 through 2008, and built the following model, using Game Score divided by 10 to smooth things out.
| GameScore/10 | GS | W | L | WPct | LPct |
|---|---|---|---|---|---|
| 11 | 7 | 3 | 0 | 0.429 | 0.000 |
| 10 | 33 | 16 | 3 | 0.485 | 0.091 |
| 9 | 894 | 794 | 16 | 0.888 | 0.018 |
| 8 | 8383 | 7637 | 192 | 0.911 | 0.023 |
| 7 | 21795 | 16957 | 1898 | 0.778 | 0.087 |
| 6 | 36189 | 20852 | 6748 | 0.576 | 0.186 |
| 5 | 43254 | 16691 | 12548 | 0.386 | 0.290 |
| 4 | 37996 | 8322 | 16215 | 0.219 | 0.427 |
| 3 | 31137 | 2399 | 18185 | 0.077 | 0.584 |
| 2 | 19781 | 380 | 14386 | 0.019 | 0.727 |
| 1 | 5943 | 25 | 5016 | 0.004 | 0.844 |
| 0 | 1045 | 0 | 952 | 0.000 | 0.911 |
| -1 | 13 | 0 | 12 | 0.000 | 0.923 |
| -2 | 1 | 0 | 1 | 0.000 | 1.000 |
The two percentages are per start, not per decision. Now this is not a perfect model. It should probably be adjusted for era, but with over 50 years of data, I suspect those things even out. With this data, we can take each individual start, add up the probabilities for each game score, and get expected wins and expected losses. I did this for every pitcher who collected 200 starts between 1957 and 2008. I also calculated the ratio 100.0*Wins/PredWins and 100.0*Losses/PredLosses. A value over 100 for wins indicates the pitcher won more game than expected. A value over 100 for losses indicates the pitcher lost more game than expected.
Note that at very high game scores, pitchers don't win that often. That seems counter intuitive until you realize that really high game scores often happen in extra-inning games. Once the scores get into the 80s, there's a nice downward progression of winning percentage, and a nice upward progression of losing percentage. A score of 50 is neutral, and you can see that above 50, pitchers win more often than they lose, and below 50 pitchers lose more often than they win.
Three hundred fifty five pitchers qualified, and here they are the top 20 ranked by Win Ratio. For the full list, click here.
| Pitcher | GS | W | L | PredWin | PredLoss | WRatio | LRatio |
|---|---|---|---|---|---|---|---|
| Mark Mulder | 203 | 103 | 60 | 75.9 | 75.1 | 135.75 | 79.85 |
| Andy Pettitte | 426 | 214 | 126 | 159.7 | 152.2 | 133.96 | 82.79 |
| Darren Oliver | 228 | 82 | 77 | 62.7 | 99.7 | 130.84 | 77.26 |
| Bill Lee | 225 | 97 | 79 | 74.6 | 86.5 | 130.05 | 91.31 |
| Aaron Sele | 352 | 144 | 109 | 111.7 | 141.3 | 128.90 | 77.11 |
| Ramon Ortiz | 210 | 81 | 78 | 63.4 | 87.3 | 127.75 | 89.38 |
| Charles Nagy | 297 | 128 | 102 | 101.0 | 114.9 | 126.73 | 88.73 |
| Kenny Rogers | 474 | 198 | 134 | 156.6 | 184.2 | 126.43 | 72.76 |
| Russ Ortiz | 253 | 109 | 82 | 86.8 | 96.5 | 125.56 | 85.01 |
| Bartolo Colon | 313 | 149 | 97 | 118.9 | 110.7 | 125.29 | 87.60 |
| Kirk Rueter | 336 | 129 | 92 | 103.5 | 136.1 | 124.60 | 67.60 |
| Scott McGregor | 309 | 136 | 104 | 109.2 | 116.3 | 124.54 | 89.41 |
| Jon Garland | 255 | 104 | 88 | 83.6 | 101.3 | 124.47 | 86.90 |
| Paul Byrd | 250 | 101 | 89 | 82.1 | 98.9 | 123.09 | 90.02 |
| James Baldwin | 202 | 79 | 69 | 64.2 | 82.7 | 123.09 | 83.48 |
| David Wells | 489 | 221 | 144 | 180.0 | 177.8 | 122.77 | 81.01 |
| Scott Erickson | 364 | 142 | 136 | 117.2 | 149.3 | 121.20 | 91.10 |
| Warren Spahn | 296 | 156 | 102 | 128.9 | 93.1 | 121.07 | 109.56 |
| Lew Burdette | 251 | 112 | 87 | 92.5 | 94.0 | 121.06 | 92.51 |
| Pat Hentgen | 306 | 126 | 110 | 105.0 | 115.7 | 120.00 | 95.06 |
Blyleven ranked 309th in Win Ratio, 92.46. He started 685 games, and compiled a record of 286-248 in those games. His expected won-loss record was 309-205. The loss number is somewhat more interesting. Not only did Blyleven fall short on wins, but his ability to go deep in games while his team didn't score saddled him with an extra 43 losses, a Loss Ratio of 121.03. If he puts up those 286 wins with 220 losses, my guess is that he would have been in the Hall a long time ago.
The person just ahead of Bert in Win Ratio is Don Sutton. Don compiled a 321--253 record in his starts, when he should have been 347-224. By reaching the magic 300 level, Don made the Hall, but there were quite a few people who felt the same way about Sutton that they did about Bert.
Of course, next to Nolan Ryan, Bert can't complain too much. Ryan ranked 348 out of 355, and his predicted record was 384-209! (He compiled a 318-291 record in his starts.) That's a Win Ratio of 82.91 and a Loss Ratio of 138.97. The only person in the study with a worse Loss Ratio than Ryan was Bob Gibson at 139.71. My guess is if we adjust for era and parks, Gibson's prediction might be more in line with his record.
Of the 300 game winners in the study, only one pitcher is shown as being undeserving, Tom Glavine. His record of 305-203 in starts should have been closer to 267-232. Glavine will go down as the anti-Blyleven, someone good but not great who makes the Hall because he played on very good teams during his career.
In conclusion, it's fairly obvious that Blyleven pitched well enough to win 300 games, and other factors not only kept his win total low, but greatly inflated his loss total. Seen in this light, there's no doubt Bert should get the Hall call.
See an interesting graph based on the data here.