Forecast 2003The resultsBy Tangotiger
This article will present the results of the Forecast experiment. New readers are invited to read the concept behind the Forecast series before continuing.
A special thanks to Sylvain Cognet for offering to compile, and compiling, the 2003 OPS and ERA for the 32 players in this study. (I love Primer readers! I sometimes get these surprises, where a dedicated reader will just compile useful data, and send it along. This was certainly a time-saver for me.)
This table presents each player's:
The last 2 line needs a little explanation. The results under the "absolute" columns are the standard deviations of the values in those columns. The results under the "relative" columns are the average of the values in those columns.
Interpreting the Absolute differences
Jumping straight to the standard deviations, we see that the Baseline forecast did the best job with the hitters and worst with the pitchers. The Primer readers did the worst with the hitters. The Systematic Forecasters just beat out the Primer readers for pitchers.
You can go through the individual picks in these 3 columns, but I find it more interesting to look at...
Interpreting the Relative differences (Pitchers)
We see here that the Primer readers just nailed the pitcher forecasts. Of the 11 pitchers, the Primer readers had the closest pick in 7 of them. In 2 others, they were within .10 runs from the closest forecast pick. Primer readers blew it on Javy Vazquez and Jamey Wright.
How did the Systematic Forecasters do with their vaunted research and engine? They picked 2 pitchers better than the other forecasters. Another 6 were within .20 runs. They blew it on 3 pitchers, including Javy Vazquez.
And the Baseline, the thought-free system? They were closest on the 2 pitchers that the Primer readers misread the most (Vazquez and Wright). They had another 2 within .20 runs. And the rest were just bad. The 3 worst picks out of the whole group were all from the Baseline: Freddy Garcia, Ryan Dempster, and Kenny Rogers.
There is a certain amount of information about a pitcher that is not captured in the stats, especially ERA. The Readers and Forecaster knew that there was something a bit more special about Clement that wasn't captured, for example, and they made their picks accordingly. Javy Vazquez, a pitcher near and dear to me, for some reason did not elicit the same faith from the Readers or Forecasters.
All in all, Primer readers did a great job in forecasting the pitchers.
On to the Hitters
It seems that the amount of intuition required with pitchers is not at all required with hitters. The lesson here is: trust the numbers.
For best picks, the Baseline nailed 9 of the 21, the Systematic Forecasters got 8 of them, and the Primer readers just 4. If we look at the blown picks (difference of greater than .050), the Primer readers blew it on 6 hitters (Gonzalez, Burrell, Hidalgo, Giambi, Hunter, and Hernandez), while the Baseline blew it on 2 (Glaus, Grissom), and the Forecasters on 3 (Hernandez, Thome, and Hunter).
Overall, the Baseline did the best job with the hitters, both in terms of the absolute and relative differences, and the Primer readers the worst.
Recap
The 32 players were chosen because they had at least 3 years of performance to analyze, and that their year-to-year performance was very inconsistent (whether by luck, design, or injury). The test in this experiment was to see if the Systematic Forecasters would be able to interpret the numbers in a special way, or if the Primer reader's gut feelings would see something extra. Among the hitters, both of these groups failed to see anything beyond what the numbers said. In fact, they saw things that just weren't there. Among pitchers, the peripherals found by the Forecasters, or the "stuff" established by the Primer readers was enough to thoroughly beat the Baseline.
Conclusion? Trust the numbers you see (which is enough for hitters), and fill-in the information missing (which is the case with pitchers, be it health or mechanics). Any extra nuance that you find just doesn't have the impact you'd hope. None of the 3 groups dominated the others. This was about as close to a draw as you'd expect.
Next week, I'll look at the picks of 165 Primer readers, and crown a champion. I'll also break down the picks of the 6 Systematic Forecasters.
|