Saturday, October 31, 2009
Tuesday, October 27, 2009
Comment on MGL on Bunts
I planned on posting his on The Book Blog, but sadly it doesn't seem to want to take it. It is a comment on this, from a slightly more mathematical perspective. I'm not sure how much reader('s) of this basically dormant blog will enjoy it, but it seems like a reasonable place to put it. What follows is the comment.
This is very good, although I am not sure I agree with some of the details. I've been grading a lot of freshman calculus exams lately, so apologies in advance for this little model, which is obviously heavily inspired by MGL's discussion.
We suppose both managers behave optimally. Let t ranging from 0 to 1 be the possible positions the infielders can play, so t=0 means that the infielders are playing as far in as possible, and t=1 means that they are playing as deep as possible (or as deep as realistically possible, since I guess they could play as deep as the fences!) We assume for simplicity that there is only one free parameter in terms of how "deep" the defense can play, while of course there are many in reality.
For a given hitter, in a given game state, with a given defense, we have the functions b(t) and h(t), where b(t) is the expected value of the bunt with the defense playing position t and h(t) the expected value of hitting away. We assume that b(t) is increasing, h(t) is decreasing, h(0) > b(0) (so hitting is preferred to bunting with the defense playing in) and b(1) > h(1). At the Nash equilibrium, the defense will play some position t_0 between zero and 1 and the offense will bunt some fraction 0 < c < 1.
t_0 is easy to find, as MGL indicates. Namely, our assumptions about h(t) and b(t) guarantee that there is a unique point x between 0 and 1 where h(x) = b(x). MGL's argument in the link shows that if the defense is playing any position t' not equal to x, it cannot be in a Nash equilibrium (the offense would have to bunt or hit 100% of the time, and the defense could then improve by changing their position.) Of course, if the hitter is a good bunter then b(t) is bigger, so x is closer to 0, while if he is a poor bunter or a good hitter x is closer to 1, all of which obviously makes intuitive sense.
Now, what should the batter do? He bunts some fraction c of the time, and no matter what of c he picks the total value of his PA is c*b(x)+(1-c)*h(x)=(c+1-c)*b(x)=b(x) since b(x)=h(x). However, there is still a constraint: for the optimal value c, the function f(t) = c*b(t) + (1-c)*h(t) (which is the value of the PA as a function of t if the batter bunts with probability c) must have a local maximum at t=x. Otherwise, by either moving in or out, the defense can decrease the value of the PA, which contradicts the Nash equilibrium assumption.
So we must have c*b(t) + (1-c)*h(t) with a local max at t=x. From freshman calculus, we know that a necessary condition for a local max is that the derivative vanishes at that point. So
0 = f'(x) = c*b'(x)+(1-c)*h'(x)
Which gives
c = - h'(x)/(b'(x)-h'(x))
Our assumptions about b(t) and h(t) guarantee that this is always strictly between 0 and 1, which is obviously good.
Note, though, and this is the key point, that the optimal ratio is independent of how good a bunter the batter is! All that the matter is the LOCAL behavior of the two functions at the indifference point.
For instance, suppose ARod is a bad bunter and a good hitter, so the defense plays him very far back, say at .9. But at .9, it may well be the case that b'(.9) >> h'(.9), or in other words, that the marginal improvement in ARod's bunting value when the defense steps back is much bigger than the corresponding marginal decrease in his hitting ability. In this case, ARod should be bunting most of the time. Similarly, you can be a very good bunter and poor hitter, but still be advised to swing away when the defense is playing you optimally. So I don't think I agree with MGL's recommendations about how much different players should bunt
This explains why the offensive manager's job is much harder the the defensive manager's. The defense only has to know when the two functions are roughly equal, which is something that can be discovered implicitly through trial and error. It is very hard, however, to get a sense of the derivatives of these functions through trial and error. So it makes a lot of sense that offensive manager's make more bunting mistakes than defensive ones.
Does this make sense? Is there some other parameter that people think should be included in this ridiculously simple model?
This is very good, although I am not sure I agree with some of the details. I've been grading a lot of freshman calculus exams lately, so apologies in advance for this little model, which is obviously heavily inspired by MGL's discussion.
We suppose both managers behave optimally. Let t ranging from 0 to 1 be the possible positions the infielders can play, so t=0 means that the infielders are playing as far in as possible, and t=1 means that they are playing as deep as possible (or as deep as realistically possible, since I guess they could play as deep as the fences!) We assume for simplicity that there is only one free parameter in terms of how "deep" the defense can play, while of course there are many in reality.
For a given hitter, in a given game state, with a given defense, we have the functions b(t) and h(t), where b(t) is the expected value of the bunt with the defense playing position t and h(t) the expected value of hitting away. We assume that b(t) is increasing, h(t) is decreasing, h(0) > b(0) (so hitting is preferred to bunting with the defense playing in) and b(1) > h(1). At the Nash equilibrium, the defense will play some position t_0 between zero and 1 and the offense will bunt some fraction 0 < c < 1.
t_0 is easy to find, as MGL indicates. Namely, our assumptions about h(t) and b(t) guarantee that there is a unique point x between 0 and 1 where h(x) = b(x). MGL's argument in the link shows that if the defense is playing any position t' not equal to x, it cannot be in a Nash equilibrium (the offense would have to bunt or hit 100% of the time, and the defense could then improve by changing their position.) Of course, if the hitter is a good bunter then b(t) is bigger, so x is closer to 0, while if he is a poor bunter or a good hitter x is closer to 1, all of which obviously makes intuitive sense.
Now, what should the batter do? He bunts some fraction c of the time, and no matter what of c he picks the total value of his PA is c*b(x)+(1-c)*h(x)=(c+1-c)*b(x)=b(x) since b(x)=h(x). However, there is still a constraint: for the optimal value c, the function f(t) = c*b(t) + (1-c)*h(t) (which is the value of the PA as a function of t if the batter bunts with probability c) must have a local maximum at t=x. Otherwise, by either moving in or out, the defense can decrease the value of the PA, which contradicts the Nash equilibrium assumption.
So we must have c*b(t) + (1-c)*h(t) with a local max at t=x. From freshman calculus, we know that a necessary condition for a local max is that the derivative vanishes at that point. So
0 = f'(x) = c*b'(x)+(1-c)*h'(x)
Which gives
c = - h'(x)/(b'(x)-h'(x))
Our assumptions about b(t) and h(t) guarantee that this is always strictly between 0 and 1, which is obviously good.
Note, though, and this is the key point, that the optimal ratio is independent of how good a bunter the batter is! All that the matter is the LOCAL behavior of the two functions at the indifference point.
For instance, suppose ARod is a bad bunter and a good hitter, so the defense plays him very far back, say at .9. But at .9, it may well be the case that b'(.9) >> h'(.9), or in other words, that the marginal improvement in ARod's bunting value when the defense steps back is much bigger than the corresponding marginal decrease in his hitting ability. In this case, ARod should be bunting most of the time. Similarly, you can be a very good bunter and poor hitter, but still be advised to swing away when the defense is playing you optimally. So I don't think I agree with MGL's recommendations about how much different players should bunt
This explains why the offensive manager's job is much harder the the defensive manager's. The defense only has to know when the two functions are roughly equal, which is something that can be discovered implicitly through trial and error. It is very hard, however, to get a sense of the derivatives of these functions through trial and error. So it makes a lot of sense that offensive manager's make more bunting mistakes than defensive ones.
Does this make sense? Is there some other parameter that people think should be included in this ridiculously simple model?
Subscribe to:
Posts (Atom)