Advanced Football Analytics (formerly Advanced NFL Stats): Using Probabilistic Distributions to Quantify NFL Combine Performance


Casan Scott continues his guest series on evaluating NFL prospects through Principal Component Analysis. By day, Casan is a PhD candidate researching aquatic eco-toxicology at Baylor University.

Jadeveon Clowney is considered by many to be a “once-in-a-decade” or even “once-in-a-generation” pass-rushing talent. Once the top-rated high school prospect in the country, Clowney retained that distinction through three years in college football’s most dominant conference. Super-talents like Clowney have traditionally been gambled on in the NFL draft with little idea of what future production can statistically be expected. For all of the concerns over his work ethic, dedication, and professionalism, Clowney’s athleticism and potential have never been called into question. But is his athleticism actually that rare? And is his talent worth gambling millions of dollars and the 1st overall pick on? This article aims to quantify exactly how rare Jadeveon Clowney’s athleticism is in a historical sense.

Jadeveon Clowney set the NFL draft world on fire at this year’s combine when he delivered one of the most talked-about combine performances in recent memory, primarily driven by his blistering 40-yard dash time of 4.53. Over the years, however, I recall players like Vernon Gholston, Mario Williams, and even Ziggy Ansah displaying mind-boggling athleticism in drills. But if a player displays never-before-seen athleticism at the combine every year, who is really impressive enough to be deemed “once-in-a-decade”?

Probability ranking allows me to estimate how likely it is to encounter a given measurable within a group of athletes. For instance, I probability ranked NFL combine 40-yard dash times for 341 defensive ends from 1999-2014 (Table 1 shows the top 50). In this case, Jadeveon Clowney’s 40 time of 4.53 had a probability rank of 99.12, meaning his speed is in the 99th percentile of all DEs over this time span.

To quantify how uncommon Jadeveon Clowney’s overall athleticism is, I probability ranked the individual drills and overall workouts of 82 defensive end prospects from the past decade. These 82 defensive ends were selected based on the availability of quality data for their complete NFL combine workout. This group of 82 was used in a previous article of mine (http://www.advancedfootballanalytics.com/2014/04/draft-prospect-evaluation-using.html#more) and will be used in future analyses. I applied a Weibull ranking* to all 82 players’ 40 time, bench press, vertical leap, broad jump, shuttle run, and 3-cone drill.

*I used the equation P = 100 × i/(n + 1) to rank combine drill performance, where i is the rank of the data point within the set, n is the total number of data points in the set, and P is the probability rank of that data point.
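As a quick check of the footnote's formula, here is a minimal Python sketch. The function name is hypothetical and is not taken from the original analysis. The example assumes a player holds rank 339 out of 341 when the group is ordered so that the best performance receives the highest rank; that rank is inferred here rather than stated in the article, but it reproduces the reported probability rank of 99.12.

```python
def weibull_rank_from_position(i: int, n: int) -> float:
    """Return P = 100 * i / (n + 1) for rank i among n data points."""
    if not 1 <= i <= n:
        raise ValueError("rank i must satisfy 1 <= i <= n")
    return 100.0 * i / (n + 1)


# Assumed rank of 339 among the 341 defensive ends (an inference, see above).
print(round(weibull_rank_from_position(339, 341), 2))  # 99.12

# Even the top-ranked value stays below 100 because of the n + 1 denominator.
print(round(weibull_rank_from_position(341, 341), 2))  # 99.71
```

Dividing by n + 1 rather than n keeps every probability rank strictly between 0 and 100, so no observed value is treated as the absolute limit of what could occur.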

What I saw was that Clowney’s 40 time was indeed VERY rare and truly “once-in-a-decade.” However, his overall combine performance shows that he wasn’t all that different from even last year’s top DE prospect. Below I list six high-profile picks from recent years plus Jadeveon Clowney, with their raw combine results (Table 2) and associated ranks (Table 3) among the class of 82 defensive ends:

Jadeveon Clowney’s 40-yard dash time registered in the 99th percentile of the class. Likewise, his leaping ability and shuttle run were in the 90th percentile; this is truly elite lower-body explosion. However, his height, weight, bench press, and 3-cone drill ranked average to below average within the class. This lowered his average rank for the combine to 66. For comparison, Mario Williams had the highest overall average ranking at 83, and Ziggy Ansah also had an average ranking of 66. The chart below shows where Clowney’s average rank places him among the DEs from the group of 82 with an average ranking of 40 or better (Figure 1):

This is actually pretty impressive company. Among the players ranked ahead of Clowney, J.J. Watt is one year removed from NFL Defensive POY, while Margus Hunt is a world-class track-and-field athlete. Of the players ranked below Clowney, Chandler Jones is a member of possibly the most athletic family in sports (see Jon and Arthur Jones) and Robert Quinn led the NFC in sacks last year.
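The average rank used in this comparison is simply the unweighted mean of a player's individual probability ranks. A minimal Python sketch follows; the function name and the player's numbers are hypothetical placeholders chosen for illustration, not values from Table 3.

```python
from statistics import mean


def average_rank(drill_ranks: dict[str, float]) -> float:
    """Unweighted mean of a player's probability ranks across measurables."""
    if not drill_ranks:
        raise ValueError("at least one probability rank is required")
    return mean(drill_ranks.values())


# Hypothetical player: elite in some measurables, below average in others.
hypothetical_player = {
    "40-yard dash": 95.0,
    "vertical leap": 80.0,
    "broad jump": 75.0,
    "shuttle run": 40.0,
    "bench press": 35.0,
    "3-cone drill": 20.0,
}

print(average_rank(hypothetical_player))  # 57.5
```

Because every measurable carries equal weight, a single elite result is diluted by weaker ones, which is how a 99th-percentile 40 time can coexist with an average rank in the 60s.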

For a prospect like Jadeveon Clowney, many superlatives are thrown around. There are videos on YouTube of Clowney anchoring his high school 4x100-meter relay team. Everyone remembers his helmet-projecting hit against Michigan in the Outback Bowl. We saw him display rarely seen speed during his 40-yard dash in February. But across all combine drills, he performed quite similarly to Ziggy Ansah from last year’s draft. So, how rare a prospect is he exactly? Although he did run faster than nearly everyone his size in the past 10 years, his overall workout was merely as good as last year’s best. By creating probabilistic distributions, we can see convincing evidence that Jadeveon Clowney is an elite athletic specimen, but not exactly “once-in-a-decade.”

For those interested, here is the general format and Excel formula for performing Weibull Probability Ranking.

=RANK(B2,B$2:B$82,1)*100/(COUNT(B$2:B$82)+1)
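For readers working outside Excel, the same calculation can be sketched in Python. This is an illustrative equivalent under stated assumptions, not the code used for the article: the function name, the `higher_is_better` parameter, and the sample numbers are all hypothetical. Ties receive the same rank, as with Excel's RANK. The article's high probability rank for a fast 40 time suggests that timed drills were ranked in the opposite direction from jumps and bench press, so the sketch exposes the direction as an option; that reading is an inference.

```python
def probability_rank(value, values, higher_is_better=True):
    """Weibull probability rank of `value` within `values`.

    The rank is 1 plus the number of entries that `value` beats, so ties
    share a rank. The result is 100 * rank / (n + 1).
    """
    if higher_is_better:
        rank = 1 + sum(1 for v in values if v < value)
    else:
        # For timed drills, a smaller number is the better performance.
        rank = 1 + sum(1 for v in values if v > value)
    return 100.0 * rank / (len(values) + 1)


# Hypothetical vertical leaps in inches (higher is better).
verticals = [30.0, 33.5, 35.0, 37.5]
print(probability_rank(37.5, verticals))  # 80.0

# Hypothetical 40 times in seconds (lower is better).
forty_times = [4.53, 4.70, 4.85, 4.95]
print(probability_rank(4.53, forty_times, higher_is_better=False))  # 80.0
```

With `higher_is_better=True` the function mirrors RANK with an ascending order argument of 1; passing `False` corresponds to ranking in descending order instead.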

Feel free to contact me at Casan_Scott@Baylor.edu or casanscott@gmail.com with any comments, questions, or advice. I’d love to share any methods, coding, etc. with anyone interested.

12 Responses to “Using Probabilistic Distributions to Quantify NFL Combine Performance”

BillBagley says:: Monday, May 19, 2014; Where is the cognitive speed data, such as the Wonderlic? Athleticism is overrated in the NFL. I will bet on a fast thinker much quicker than a fast runner. Brady turned in the second-worst QB time in the 40 since 1999. You know Tom Brady, the greatest draft pick in history, chosen late in the 6th round. The NFL has chosen some of the greatest athletes in history but has missed some of the greatest players.
Jared Doom says:: Tuesday, May 20, 2014; Is it possible to incorporate the results from the Harvard analysis on the combine measurables that are statistically significant in this position group?

http://harvardsportsanalysis.wordpress.com/2012/02/28/does-the-nfl-combine-matter-defense/

According to this article, the only combine measurables that are statistically significant for DEs are the 40-yard dash, 3-cone drill, and weight. Ideally, these measurables should be given more weight in this analysis.
Unknown says:: Tuesday, May 20, 2014; Thanks for your interest! I agree that the Wonderlic is a valuable measurement. However, here I sought to use the same data set I've been using for PCA and Quantile Regression to illustrate the utility of these tools.
Unknown says:: Tuesday, May 20, 2014; I do like that study's approach! However, I wasn't trying to predict anything using only combine numbers, but rather sought a way to quantify a combine performance within a historical context. My other article on Quantile Regression used combine numbers and NCAA stats as predictors and did a better job than that of the Harvard Study's (quite a bit higher r2). Having said that, these tools are best used in an exploratory way rather than a predictive one, as they help us dissect trends from confusing data. Thanks again for your interest!
Nathan Lazarus says:: Friday, May 23, 2014; @Bill Bagley
No one has ever found a positive relationship between Wonderlic scores and player performance. So while coaches may want players who have high "football IQs", testing their math and reading skills seems valueless.

Also, I'd suggest that averaging percentile scores may not be the best method. Guys with high scores in certain areas and low scores in others may be more valuable than guys with mediocre scores across the board. For example, Clowney and Ansah bring elite speed with low strength, so in the right scheme they can succeed as pass rushers and one gap run defenders.
Jared Doom says:: Tuesday, May 27, 2014; "My other article on Quantile Regression used combine numbers and NCAA stats as predictors and did a better job than that of the Harvard Study's (quite a bit higher r2)."

Thanks for responding. Could you show me where I can find this article (apologies if it is obvious and I'm missing it)?
Jared Doom says:: Tuesday, May 27, 2014; Nevermind, looks like it is obvious, I look forward to reading it.
Unknown says:: Tuesday, May 27, 2014; Completely agree Nathan. Weighting is needed.
Anonymous says:: Thursday, May 29, 2014; Nathan-

There actually has been a slight correlation demonstrated between Wonderlic and player performance, but not the one the poster above you is implying. There is actually a negative correlation between player performance and Wonderlic for DBs and TEs. That's right Bill- Wonderlic is completely meaningless at most positions, but you'd do better to pick a DB or TE with a lower Wonderlic score than an equivalent guy with a higher score.
scp1957 says:: Friday, June 06, 2014; "Also, I'd suggest that averaging percentile scores may not be the best method. Guys with high scores in certain areas and low scores in others may be more valuable than guys with mediocre scores across the board. For example, Clowney and Ansah bring elite speed with low strength, so in the right scheme they can succeed as pass rushers and one gap run defenders."

Common wisdom, however imperfectly or inconsistently applied; better that he do one or two things well, than that he do everything okay. Why? Because you can scheme around a player's imperfections, but you can never make soup from rocks.

As a Lions fan, the presence, on this list, of five of their guys struck me: Lo Jack, Willie Young, Ziggy, Devin Taylor, and Larry Webster. I find my initial faith in Mayhew confirmed: know what you want and stick to it, until foul circumstance proves the need to change something.

I find myself curious as to how well the modification of their Wide-9 scheme, from two open ends to open/closed, will accommodate existing personnel. Now, I've got to examine their metrics with an eye to this dichotomy.

This past season, the success of Cliff Avril, after he moved from Detroit to Seattle, confirmed my own prediction for and judgment of him. His efforts in Detroit were always counterbalanced by his inability to play the run from the Wide-9. His metrics suggested that he would benefit from the shorter route to the passer and the lesser responsibility to contain. Bingo. From the POV of the Detroit F/O, he was a good keep, so long as his price was modest and the alternatives were worse, which ceased to be the case in the 2013 off-season. Good thing for all concerned. (Cliff caught a lot of flak from some Lions fans, for his poor run defense, but it was never really his fault. He is what he is.)
jditoro says:: Thursday, June 26, 2014; Hi Casan, thought you might find this article on PCA as applied to WR scouting interesting: http://rotoviz.com/2014/05/using-principal-component-analysis-to-identify-high-quality-wide-receivers/
Phil Carlitz says:: Tuesday, August 19, 2014; Just curious why you use a Weibull ranking instead of just a simple normal distribution? Something like this:
=1-NORM.DIST(B2,AVERAGE($B$2:$B$82),STDEV($B$2:$B$82),1).

Do you have reason to believe these stats aren't normally distributed? Not attacking the methodology. Just wondering.
