Resident Fantasy Genius: The Infield Fly Rules

September 8, 2011

Yesterday, Jason Collette penned an article about infield flies that ties in with a discussion I’ve been having over the past few weeks with one of my former writers at The Hardball Times Fantasy, Jeff Gross, and one of his readers, Alex Hambrick. Additionally, BP readers JoshC77 and kcshankd wondered in the comments section of Jason’s article whether the ability to induce infield flies was a repeatable skill for pitchers. Today, I thought I’d try to answer that question and present some of the research I’ve conducted in my conversations with Jeff and Alex.

Earlier this year, I wrote an article entitled “When Pitchers' Stats Stabilize,” in which I looked at how “stable” (or how “repeatable,” in terms of being a “skill”) a number of stats were—infield flies among them. In the article, I found that infield fly balls, as a rate of total batted balls, took roughly 0.6 years to “stabilize.” In other words, this initial research suggested that infield fly-ball rate was indeed a very repeatable skill. But in my discussions with Alex, I’ve come to suspect this may not necessarily be true—or at least not in the way my previous research suggests.

In one e-mail, Alex wrote:

I have known for quite some time that IFFB is a strong function of FB, much like HR is a strong function of FB for pitchers. (The league average IFFB/FB hovers around 10%). In other words, you can, with reasonable accuracy, predict IFFB by simply multiplying FB by .10. (.10*FB correlates to IFFB with an r^2 of .70).

While we’ve certainly known that there is a correlation between total fly balls and infield fly balls (due in large part to the vertical location and movement of a pitcher’s pitches, forcing batters to put the ball in the air), this presented an interesting question: “Is infield fly rate a skill above and beyond a pitcher’s skill in simply allowing fly balls in general?”

To help answer this question, I decided to pull out my old friend, the split-half correlation (if you’re interested, you can read all about the methodology in the “When Pitchers' Stats Stabilize” article). What I’ve done is run two separate analyses. In the first, I’ve run a split-half correlation using infield flies per contacted balls (henceforth referred to as IF FB%) in both halves. In the second, I put IF FB% in one half and put all flies multiplied by .07 (roughly seven percent of flies are of the infield variety) per contacted balls (henceforth referred to as FB*.07) in the other half. So essentially, I’m first correlating IF FB% with itself and then correlating it with fly-ball rate. This gives us the following results:

Stat	Denominator	Correlated With	Stabilizes	Years
IF FB	GB+OF+IF+LD	IF FB%	288	0.6
FB*.07	GB+OF+IF+LD	IF FB%	216	0.5

That’s very interesting. While the differences are small, IF FB% actually seems to be less stable than simply using a league-average percentage of infield flies per total flies. Since multiplying by .07 doesn’t affect the correlation (I’ve just included it to better illustrated the point that this is an estimate of infield flies), what this essentially tells us is that fly-ball percentage better predicts infield-fly percentage than it can predict itself.

Let’s check out one more way of looking at infield flies. We know that total flies stabilize very quickly, so perhaps infield flies per total flies (henceforth referred to as IF/FB) will prove useful.

Stat	Denominator	Stabilizes	Years
OF+IF	GB+OF+IF+LD	109	0.2
IF FB	OF+IF	414	2.5

Nope. While the percentage of total flies (FB%) stabilizes extremely quickly (as quickly as groundballs do, for what it’s worth), it takes the average pitcher two and a half years for his IF/FB rate to stabilize. That means we can throw IF/FB rate out the window entirely—we’d be far better off using either IF% or FB*.07.

We could stop here and have learned something useful, but we can do a little better yet. What these split-half correlation tests give us is the point at which the stat produces an R of 0.50. Using this data, we can create a regression to the mean equation and estimate a player’s true talent level. While the easiest mean to regress to is always the league average, it’s rarely the best. As Alex posited, and as has been known for a while now, fly-ball rate and infield-fly rate are correlated. As such, we can reexamine this relationship and then use our results as the mean we regress to.

(The above graph includes all pitcher seasons with at least 100 innings pitched in a year from 2005 to 2010)

As you can see, the relationship is very strong (r-squared of 0.68). Pitchers who give up a lot of total flies also manage to induce a lot of infield flies. The most extreme fly-ball pitchers actually manage to convert nearly 10 percent of all contacted balls into popups. So instead of regressing each pitcher to league-average fly-ball rate, we can regress each pitcher to his own unique rate based upon this relationship.

Now we get to the fun part: applying all of this to actual players. Ideally, we’d use multiple years, incorporate aging, weighting, etc., but I’m just going to do it simply. What I’ve done is used a pitcher’s actual, unregressed 2011 fly-ball rate to create his personal mean based on the formula above. I’ve then regressed his 2011 infield-fly rate onto this mean based upon the split-half correlation tests we ran at the beginning of the article. When I do this and focus on all pitchers who made at least 10 starts this season, here are the pitchers with the best infield fly ball “true talent” levels (rIF%):

Pitcher	FB%	rIF%
Guillermo Moscoso	58%	15.0%
Jered Weaver	50%	14.8%
Jeremy Hellickson	47%	13.2%
Josh Collmenter	48%	12.5%
Brian Matusz	54%	12.4%
Alexi Ogando	44%	12.1%
Rich Harden	49%	11.8%
Brandon Beachy	48%	11.6%
Ted Lilly	46%	11.3%
Colby Lewis	50%	11.2%
Scott Baker	46%	11.0%
Michael Pineda	44%	11.0%
Chris Tillman	43%	10.9%
Shaun Marcum	44%	10.6%
Brett Cecil	46%	10.5%
Clayton Kershaw	39%	10.3%
Phil Hughes	43%	10.2%

That’s an interesting name at the top of the list. When we think about top pitchers, Guillermo Moscoso isn’t the first guy that comes to mind. While his strikeout and walk rates are underwhelming, he’s an extreme fly-ball pitcher, which will allow him to induce a lot of popups. There are some other not-so-terrific players on this list (Collmenter, Matusz, Tillman, Cecil, Hughes), but since they all have posted high fly rates, they’ll at least be useful in terms of inducing popups and keeping a lower-than-normal BABIP. Jered Weaver—sort of the poster boy for using infield flies to beat his FIP—ranks second on the list, and long-time pop-up artist Clayton Kershaw also ranks highly.

Now let’s take a look at our trailers:

Pitcher	FB%	rIF%
Derek Lowe	22%	2.7%
Charlie Morton	19%	3.2%
Chris Volstad	26%	3.5%
Jake Westbrook	22%	3.6%
Trevor Cahill	25%	3.7%
Jason Marquis	25%	3.9%
Zach Britton	28%	4.1%
Ivan Nova	28%	4.1%
John Lannan	24%	4.2%
Dontrelle Willis	24%	4.2%
Ricky Romero	30%	4.3%
Zack Greinke	29%	4.3%
Fausto Carmona	24%	4.3%
Nick Blackburn	27%	4.3%
Edinson Volquez	29%	4.5%
Jaime Garcia	26%	4.6%
Aaron Cook	24%	4.7%
Joel Pineiro	32%	4.7%

Extreme ground-ball pitchers rule this list. Notorious sinkerballer Derek Lowe trails everyone in terms of regressed infield-fly rate, followed by 2011 breakout pitcher Charlie Morton. Romero and Greinke have been excellent this season in terms of strikeouts and walks, but as ground-ball pitchers, they shouldn’t be expected to induce many popups. They remain elite pitchers, though, since these kinds of pitchers can afford to give up a few more hits as they allow fewer homers.

Thank you for reading

This is a free article. If you enjoyed it, consider subscribing to Baseball Prospectus. Subscriptions support ongoing public baseball research and analysis in an increasingly proprietary environment.

Subscribe now

Derek Carty

Latest Articles

You need to be logged in to comment. Login or Subscribe

nicholj

9/08

I think this series is great but also misses the point. Are we not trying to identify those pitchers that xFIP will routinely underestimate/overestimate due to a unique skill - ability to minimize flyball distance? Or, pitchers whose skill is not accurately reflected in the amount of flyballs they cede but the distance of those flyballs. Infield Fly balls is an aid in measuring the distance of those flyballs and therefore what is really relevant is the IFF/FB (just looking at the list, the better pitchers have better IFF/FB rates than just IFF rates).

But as you showed, that alone does not have much predictable value. Would it not be more effective if you combined it with HR/FB? A pitcher with a low HR/FB and high IFF/FB should indicate a pitcher with a skill for minimizing flyball distance therefore to be underestimated by xFIP. Whereas a pitcher a with a high HR/FB and low IFF/FB should indicate a pitcher with the reverse skill and therefore likely to be overestimated by xFIP. And for the pitchers in between FB% is good enough.

Reply to nicholj

derekcarty

9/08

I'm not sure the ability to induce popups and the "ability to minimize flyball distance" are the same thing. Popups are valuable in and of themselves because they become outs something like 98-99% of the time. Minimizing flyball distance (if it indeed is a skill pitcher's can possess to any significant degree) would help in preventing home runs.

"just looking at the list, the better pitchers have better IFF/FB rates than just IFF rates." I can check this. Since 2003, pitchers with xFIPs better than 3.5 have an aggregate 10.3% IF/FB. Everyone else is 10.4%.

"A pitcher with a low HR/FB and high IFF/FB should indicate a pitcher with a skill for minimizing flyball distance therefore to be underestimated by xFIP." I agree in principle that minimizing the distance of fly balls would be an important skill for a pitcher (if it's something they can control to any reasonable degree), but the problem here is that HR/FB is highly unstable. Over a long period of time it will be able to serve as a fair proxy for fly ball distance, but for the kind of samples we usually deal with, it just isn't going to be useful for that purpose. My previous article showed that it takes 10 years worth of data before we can account for just half of the variation in the stat. And that ignores issues of weighting and aging which are huge over a period that long.

I can, however, see your reasoning that a pitcher who induces a lot of infield flies may also induce shorter outfield flies and, therefore, allow fewer homers. As one way of checking this, if I look at pitchers since 2003 with an IF/FB greater than 15%, they post a HR/OF of 11.7 percent. League average over that time has been 11.5 percent. A good thought, but a cursory glance shows there to be no relationship.

Reply to derekcarty

studes

9/08

Apologies for not knowing the answer to this, but how is infield fly defined? I believe you use the MLBAM version, which is the same as Pitchf/x? Is that right? How do they define it?