Forum Home

Master Index of Archived Threads


Game Simulations - Answering Baseball’s What-Ifs

metirish
Apr 07 2009 01:59 PM

A few here play game simulators like Diamond Mind. Alan Schwarz in the New York Times has an interesting article on the subject and with a few mets angles. Pay attention batmag.



] By ALAN SCHWARZ Published: April 6, 2009 You can learn a lot during a major league baseball game. Like Ukrainian, if it is a particularly slow nine innings. As for the science of baseball strategy, one game teaches precious little. A well-timed sacrifice bunt can backfire and lose the game; a foolish steal can appear brilliant. The vagaries of randomness — the way Sandy Koufax got battered occasionally and a pipsqueak named Bucky Dent hit one of the most famous home runs ever — camouflage the game’s inner forces, which for 150 years have operated somewhere between fact and fable. One game has little meaning. A thousand seasons can take a while. Thank goodness for quad-core processors. “Computer simulations work pretty well in baseball for two reasons,” said Carl Morris, a professor of statistics at Harvard University who has written several papers that commingled baseball and formal statistical theory. “In general, they allow you to study fairly complicated processes that you can’t really get at with pure mathematics. But also, sports are great for simulations — you can play 10,000 seasons overnight.” No one can afford to wait less than major league teams, which crave every extra run or victory they can wring from their $100 million rosters. John Abbamondi, the assistant general manager for the St. Louis Cardinals, says his team and about 10 others use simulations to evaluate potential trades and how they might affect the pennant race. “It’s all part of the statistical analysis that complements the more traditional scouting we do,” he said. Using computer simulations to explore in-game and other baseball strategies is by no means new. As early as 1958, a professor at the Massachusetts Institute of Technology programmed a behemoth I.B.M. 704 mainframe to investigate whether the sacrifice bunt was a smart play. (More on that later.) Simulators have since grown so complex that the most sophisticated one available to the public, called Diamond Mind, not only runs lickety-split on laptops but even considers minutiae like the effects of wind in individual ballparks. Under what conditions is bunting advantageous? When does trying to steal make sense, and when does it decrease the chances of scoring? Questions like these turn out to be ideally suited to computer programs through which millions of iterations can smooth out the peaks and valleys of randomness, and converge toward a reliable approximation. Known among formal statisticians as the Monte Carlo method, this approach takes spectacularly complex phenomena like weather patterns and stock performance and allows their behavior to be approximated, if not determined. What are the chances of winning a game of solitaire? Rather than writing an equation that tries to take into account the trillions of trillions of possible hands and moves, a statistician can run a computer program that simply plays the game a few million times in minutes to see how often it wins. Dr. Morris says he has seen the Monte Carlo method used to improve computer graphics and explore gene sequences. Like such competitors as Strat-O-Matic — which made its debut in 1961 with at-bats determined by cards and dice, and remains popular on the personal computer — Diamond Mind is designed to allow fans to play fictional games and seasons, exploring what-if scenarios that real life would be too slow and controversial to allow. Take the age-old question of how much difference a team’s lineup order makes. This issue so vexed the former manager Billy Martin that he once literally picked his Detroit Tigers batting order out of a hat. Luke Kraemer of Imagine Sports, which owns Diamond Mind, programmed the simulator to force the 2008 Yankees to bat their best hitter and cleanup man, Alex Rodriguez, ninth — to see how scoring was affected. Mr. Kraemer got the run total not for just one season, which can fluctuate as much as 80 runs in each direction from simple randomness, but for 100 seasons — more than 16,000 Yankees games in all. The result? The Yankees scored 747 runs per season, 40 fewer than their real-life 787. (Diamond Mind was so accurate that 100 seasons with A-Rod batting fourth averaged 789, almost dead-on.) Most research suggests that those 40 runs would mean only about four fewer victories, for a strategy no manager would ever consider; so the difference with Rodriguez batting third or fifth would be insignificant, and nowhere near worth the forests of trees that would give their lives to the ensuing sports-page debate. Diamond Mind took its cuts at several other baseball knucklers, running 100 full seasons of games for each: • The intentional walk. This frequently used defensive strategy avoids dangerous hitters and can set up a double play, but it also awards a free base, and even the best hitters usually make an out. So is it smart in the long run? Diamond Mind found that it was not, though the difference was only about five runs per team per season. • The stolen base. Advancing from first to second puts the runner in scoring position, but he — and the rest of your hitters — will have a hard time scoring if he gets thrown out. Mr. Kraemer looked at a recent team that ran wild (the 2008 Tampa Bay Rays) and one that barely stole at all (the 2005 Oakland A’s) and switched their mind-sets to see what happened. The A’s scored 20 runs fewer, which probably says more about their players’ inability to run in the first place. But when the speedy Rays stole sparingly, they increased their scoring by 47 runs per season — suggesting that perhaps the Rays were running too often in real life. • The sacrifice bunt. Is it worth making an out intentionally to move a runner from first to second? Forcing a team that hated that maneuver (the 2005 Boston Red Sox) to do it a lot cost them 19 runs per season. But making a bunting team (the 2008 New York Mets) avoid it also cost them — by 15 runs on average — suggesting that the Mets’ managers, Willie Randolph and Jerry Manuel, used it quite intelligently. (The 1958 M.I.T. statisticians found that the sacrifice was rarely a good move; major league managers paid little attention.) One problem with computer simulations is that no matter how realistically they might be programmed, they can say more about the programmer than baseball itself. A computer, after all, cannot feel human emotions like pressure or the will to hit in the clutch. “We can run the experiment in the simulation environment and think we’re measuring the effect of a great defense on a pitching staff, but it might tell us more about how we modeled defense,” said Tom Tippett, who wrote the original Diamond Mind code in the early 1980s. “The simulation is real close to real-life baseball, but in the end it isn’t real-life baseball.” After developing Diamond Mind into the industry standard, Mr. Tippett was hired a few years ago by the Boston Red Sox — a sign of how much some teams have come to value simulation research. While none will discuss exactly what they model and how, Mr. Abbamondi, of the Cardinals, said they could provide objective insight into how an offense might be affected by trading for a hitter in midseason; how many games that might improve the team; and how that hitter might improve or deteriorate as he ages. Many of these measurements come in the form of scenarios of increasing uncertainty, not unlike the projection of hurricane paths. As Mr. Tippett suggested, however, simulations have inherent limits, and probably will not ever model baseball’s vicissitudes of fate — how scrubs morph into all-stars and some teams just collapse. (Indeed, fans of the recent New York Mets would be relieved that some things defy re-creation.) Tony La Russa, the Cardinals’ manager, who is a sure bet for the Hall of Fame, said the value of computer simulations in baseball tended to stop at the dugout entrance. “There’s way too much importance given to what you can produce from a machine,” he said. “These are human beings, and I don’t think any computer is going to model that close to what we deal with at this level.” That can be as true now as it was 25 years ago, when a Tank McNamara cartoon captured it best. A downtrodden manager peered over his computer. He asked plaintively, “But will it take the blame?”

batmagadanleadoff
Apr 07 2009 08:12 PM
Re: Game Simulations - Answering Baseball’s What-Ifs

="metirish"] Pay attention batmag.
Thanks, Irish. The author confirms the by now, recognized principle that moving a player one or two slots from what would be assumed to be his ideal batting order position has a negligible overall effect on runs scored. But a manager who, for example, shifts a top or middle of the order guy down to the bottom of the lineup will impact run production to the point where it is no longer negligible, but significant. The reverse is also true -- bumping a weaker hitter to the top of the lineup. The author's example of batting A-Rod ninth illustrates this. I never liked Randolph's lineups, especially his 2005 lineups where Wright was buried in the seven hole for a little more than half the season, even though he had already established himself as the Mets best or second best hitter. That same season, Piazza took too many at bats from the clean up spot even though he was clearly no longer an impact player. And don't get me started on Reyes leading off all season long while the Mets were within striking distance of the wild card because we went over that one many times already.
]The sacrifice bunt. Is it worth making an out intentionally to move a runner from first to second? Forcing a team that hated that maneuver (the 2005 Boston Red Sox) to do it a lot cost them 19 runs per season. But making a bunting team (the 2008 New York Mets) avoid it also cost them — by 15 runs on average — suggesting that the Mets’ managers, Willie Randolph and Jerry Manuel, used it quite intelligently. (The 1958 M.I.T. statisticians found that the sacrifice was rarely a good move; major league managers paid little attention.)


I can't figure out these results. Also, I don't know how the program was written: under what conditions were the Mets programmed to sac bunt? Technically, a team could sac bunt whenever there is at least one runner on base with less than two outs. When did the computer Mets bunt? Did the computer have Delgado, or David Wright bunt? This information would be more helpful. I'll stick to my guns though, about sac bunting with your number two guy in the first inning of a ball game. I think it's a horrible strategy, and I would hope that any skipper that has the inclination to do this would be managing any NL East team other than the Mets.

Edgy DC
Apr 07 2009 08:19 PM
Re: Game Simulations - Answering Baseball’s What-Ifs

="batmagadanleadoff":30jxnl3g]I never liked Randolph's lineups, especially his 2005 lineups where Wright was buried in the seven hole for a little more than half the season, even though he had already established himself as the Mets best or second best hitter.[/quote:30jxnl3g]

He batted seventh in exactly 40 games.

1-0
2-3
3-4
4-3
5-76
6-34
7-40
8-0
9-0

batmagadanleadoff
Apr 07 2009 08:27 PM

I know. But Wright should have been a three or four batter all season. But it wasn't simply Wright that was wrong with the lineup. I could've tolerated Wright batting seventh or sixth for half a season if that's all it was. But it was the entire lineup, in total, that annoyed the living shit out of me as I watched that season. And Reyes flailing away at 55 foot one hop pitches all season long. From the leadoff spot. Every single day. Over 700 times. And then throw in the sac bunt thing from your number two guy. In the first inning.

I believe we've covered this turf already. I like Jerry.

Edgy DC
Apr 07 2009 08:37 PM

You know?

I like Jerry too. But let's deal with real data.

batmagadanleadoff
Apr 07 2009 08:38 PM

I thought I did. Why don't you make a case for Reyes batting leadoff in 2005?

Edgy DC
Apr 07 2009 09:23 PM

No thanks.

soupcan
Apr 08 2009 08:25 AM

Thanks for the article Irish, I missed it in yesterday's Times.

MFS62
Apr 08 2009 08:57 AM

It used to drive me nuts when Davey Johnson would make multiple substitutions at one time, and leave the replacements in the same spots in the batting order as the players they replaced in the field. This led to things like light hitting defensive players batting third through fifth.
Later

Edgy DC
Apr 08 2009 09:39 AM

Who are we talking about?

Davey Johnson's third hitter was typically Keith Hernandez, who would retire if he was ever lifted for a defensive substitute. His fourth and fifth hitters were Gary Carter, Darryl Strawberry, Kevin McReynolds, and Howard Johnson. Of those, I can only remember Johnson ever being lifted for a defensive advantage. More typically, he was moved from shortstop to third.

MFS62
Apr 08 2009 10:00 AM
Edited 2 time(s), most recently on Apr 08 2009 10:06 AM

Oops, I was thinking about Valentine , substituting for Mo Vaughan et al.

Later

John Cougar Lunchbucket
Apr 08 2009 10:03 AM

Who didn't hit 3, 4 or 5 regularly until 1989.

MFS62
Apr 08 2009 10:05 AM

="John Cougar Lunchbucket":2nhr535m]Who didn't hit 3, 4 or 5 regularly until 1989.[/quote:2nhr535m]
Yer right. I edited my post above.
Brain fart.
Later

smg58
Apr 08 2009 12:16 PM
Re: Game Simulations - Answering Baseball’s What-Ifs

="batmagadanleadoff":27sc5mu8]I can't figure out these results. Also, I don't know how the program was written: under what conditions were the Mets programmed to sac bunt? Technically, a team could sac bunt whenever there is at least one runner on base with less than two outs. When did the computer Mets bunt? Did the computer have Delgado, or David Wright bunt? This information would be more helpful. I'll stick to my guns though, about sac bunting with your number two guy in the first inning of a ball game. I think it's a horrible strategy, and I would hope that any skipper that has the inclination to do this would be managing any NL East team other than the Mets.[/quote:27sc5mu8]

It's also misleading to compare an American League team with a National League one. The situation where the benefits of sacrificing are least ambiguous is when a particularly weak hitter (like, say, most pitchers) comes up with a runner on first. If the computer Mets let the pitcher swing away, I could see that having a negative effect. But in a lineup with a DH, situations where a bunt would clearly help more than hurt would be very few and far between.

Frayed Knot
Apr 09 2009 01:13 PM

btw, Hanley Ramirez - maybe 2nd only to Albert in NL offensive prowess - celebrated his first AB this season as a newly minted #3 hitter by sac bunting w/a man on 2nd and no outs.

seawolf17
Apr 09 2009 01:35 PM

="Frayed Knot":3f36140k]btw, Hanley Ramirez - maybe 2nd only to Albert in NL offensive prowess - celebrated his first AB this season as a newly minted #3 hitter by sac bunting w/a man on 2nd and no outs.[/quote:3f36140k]
I thought Willie was bench-coaching in Milwaukee, not Florida?