Tony Profile Banner
Tony Profile
Tony

@TonyElHabr

Followers
4,304
Following
686
Media
412
Statuses
1,861
Explore trending content on Musk Viewer
@TonyElHabr
Tony
2 months
#CopaAmerica2024 and #EUROS2024 are literally different sports
Tweet media one
48
150
1K
@TonyElHabr
Tony
1 year
i forced a bot to watch 3 million soccer events. here's the lineup it thinks has created the most on-ball value in the EPL this season.
Tweet media one
23
61
933
@TonyElHabr
Tony
2 years
quantifying the art of time-wasting. yes, leading teams do take their sweet time on throw-ins.
Tweet media one
15
77
740
@TonyElHabr
Tony
2 years
eye test data 🤝 nice dawgs
Tweet media one
9
52
608
@TonyElHabr
Tony
2 years
"aLl xG mOdeLs ArE thE sAme" my brother in christ wut is this then
Tweet media one
15
41
534
@TonyElHabr
Tony
3 years
The numbers say that the Premier League (not the Russian one) is the most difficult league to play in. (See y-axis.) Numbers never lie, so of course this is correct. Please harass @canzhiye if you disagree.
Tweet media one
23
90
491
@TonyElHabr
Tony
3 years
I didn't watch every #EPL match this season, so I had a machine do it for me. Here are it's best XI. 1/7
Tweet media one
14
46
462
@TonyElHabr
Tony
3 years
João Cancelo has committed the most "rage" fouls in the Premier League so far this season (tied with Emmanuel Dennis at 4). Does he get salty when he isn't saucy? 🤔🤨 🤡
Tweet media one
24
25
399
@TonyElHabr
Tony
2 years
what even is an "expected point"? i wrote blog posts showing how to: 1⃣ compute expected points (xPts) from expected goals (xG) without simulation 2⃣ compare xG sources using {worldfootballR} 3⃣ quantify how unlikely team placings were (🚨 2020/21 Brighton trigger warning 🚨)
Tweet media one
Tweet media two
Tweet media three
4
50
312
@TonyElHabr
Tony
1 year
✍️New blog post on "meta-metrics" in soccer, replicating part of a 2017 JQAS article from Franks, D'Amour, Cervone, and Bornn. ⚖️Does a metric reliably differentiate between players? 📈Does a metric measure a quantity that is stable over time? (see reply for link)
Tweet media one
5
28
226
@TonyElHabr
Tony
3 years
I scraped all of @xGPhilosophy 's end-of-match xG tweets and quantified hwo much Brighton's habit of "winning the xG" but losing the game has driven Twitter engagement.
Tweet media one
@xGPhilosophy
The xG Philosophy
4 years
Brighton have scored one goal from 7.79(xG) in their last three matches.
74
326
8K
13
21
215
@TonyElHabr
Tony
3 years
it's almost that time of the season when we can start taking screenshots of xG tables and not get flamed
Tweet media one
4
24
197
@TonyElHabr
Tony
2 years
Remember that weird thing from last season where Liverpool #LFC didn't block shots from outside the box? I wonder if that continues this season 🤔
Tweet media one
6
11
182
@TonyElHabr
Tony
2 years
xG model calibration: is there a blind spot in public models? ✍:
Tweet media one
Tweet media two
5
18
175
@TonyElHabr
Tony
3 years
The Southgate Effect #EURO2020
Tweet media one
1
16
171
@TonyElHabr
Tony
3 years
@johnspacemuller he created a lot of space for a man at the far post
Tweet media one
1
2
161
@TonyElHabr
Tony
4 years
Formation matchups are a common point of discussion in soccer. Voronoi tessellations indicate that a 4-3-3 with a false 9 generally is the "best" against other structures. Data (and idea): @SamGoldberg1882 CC: @accidental__aRt #rstats
Tweet media one
6
24
155
@TonyElHabr
Tony
2 years
Fine, since no one has been brave enough to do it, I will. I'm calling out Opta for stat-padding for attackers, giving them more credit than StatsBomb for aerial duel success.
Tweet media one
8
12
150
@TonyElHabr
Tony
4 years
Pitch control + VAEP for the winning goal of this year's #UCL final. Sure, pitch control is cool, but it doesn't quantify the significance of actions given situation context. Enter VAEP. Data: @lastrowview VAEP: @jessejdavis1 , @TomDecroos , @JanVanHaaren , @LotteBransen
6
26
153
@TonyElHabr
Tony
3 years
Has @watke_ played a bigger role in the growth of #USMNT popularity than Ted Lasso? Let's embrace debate.
Tweet media one
4
8
144
@TonyElHabr
Tony
3 years
speaking VARnalytics into existence
Tweet media one
7
10
137
@TonyElHabr
Tony
10 months
Expected goal difference by game state in the EPL this season. What stands out? 1⃣ Newcastle have been most dominant when leading. 2⃣ Man City have been at their best when tied or trailing. 3⃣ Spurs have been mediocre when leading, but otherwise very good.
Tweet media one
4
15
131
@TonyElHabr
Tony
3 months
Is xG in neutral gamestates a better predictor of future performance than xG across all gamestates? ⚽️🤔 tldr: no, but it's closer than you might think
Tweet media one
3
20
128
@TonyElHabr
Tony
3 years
Nobody: Absolutely no one: Me: Here's a plot of the most controversial posts from reddit for each #EPL team last season
Tweet media one
8
10
120
@TonyElHabr
Tony
6 years
#rstats Want to see an example of a start-to-end analysis? I wrote a 5-part series of posts breaking down academic competition results in Texas, starting here:
Tweet media one
Tweet media two
Tweet media three
1
20
113
@TonyElHabr
Tony
3 years
i choose violence
Tweet media one
12
11
109
@TonyElHabr
Tony
4 years
Pitch control is cool, but how much does it matter which model we use? I compared @JaviOnData et al.'s 2018 model, @the_spearman 's 2017 model, and a naïve "nearest" model (only using player position) to try to find out. #rstats 1/n
3
10
99
@TonyElHabr
Tony
3 years
We say "player x is outscoring their xG" all the time, but how often is it actually significant over-performance? Not that often, although Kane and Lukaku are coming in to today's matchup over-performing at significant rates. (Kane is benefiting some from recent UEFA matches.)
Tweet media one
6
13
97
@TonyElHabr
Tony
2 years
i'm addicted to scraping data from the internet for absolutely no reason
Tweet media one
4
8
95
@TonyElHabr
Tony
7 years
#rstats peeps -- I analyzed my google search history using tidy text principles. See my post at . Thanks so much to @juliasilge and @drob for the awesome #tidytext package and the book.
Tweet media one
Tweet media two
1
19
87
@TonyElHabr
Tony
8 months
Should we account for team quality (Elo) in xG models? While I'm against this in principle, I tried it out. ✍️ (Link in first reply. 🔗)
Tweet media one
Tweet media two
9
17
91
@TonyElHabr
Tony
1 year
@ClayCopter the bot thought the game is played on a spreadsheet
0
0
90
@TonyElHabr
Tony
4 years
New #rstats post✍️ with soccer ⚽️ data - Translated @devinpleuler 's python tutorial on decomposition to R to learn a new-ish concept hands-on - Integrated "non-tidy" functions into a tidy workflow - Made some charts along the way
1
16
88
@TonyElHabr
Tony
7 months
To add to Jon's thread, this chart also shows % of time in each game state. West Ham are good at getting a lead and holding on to it (4th most time leading), so a negative xGD when ahead may simply reflect a more passive play style when leading.
Tweet media one
@Jon_Mackenzie
Jon Mackenzie
7 months
West Ham are 6th in the Premier League table but putting up relegation-worthy underlying numbers to get there. Here's a little thread to explain what's going on:
Tweet media one
42
87
1K
4
8
82
@TonyElHabr
Tony
4 years
Pitch control animation for the game-winning goal in this year's PSG - Atalanta UCL match. See how field ownership changes as Mbappé makes his run to receive Neymar's pass. h/t: @lastrowview (data) and @robwhickman for porting @AnEnglishGoat 's code to #rstats #sportsvizsunday
4
10
79
@TonyElHabr
Tony
3 years
UMAP + GMM > PCA + kmeans, my column. (Also, a Jadon Sancho similarity table because I woke up and chose violence.)
Tweet media one
Tweet media two
6
5
80
@TonyElHabr
Tony
3 years
We can use a similar approach, swapping out xG for VAEP, to learn something more specific. For example, a young FW moving from the Bundesliga to the EPL *cough* Haaland *cough* might expect their xG/90 rate to drop by 0.07 (22% relative to the median).
Tweet media one
4
5
73
@TonyElHabr
Tony
2 years
home field advantage is real. if i had accounted for game state here, then the effect would likely be bigger.
Tweet media one
2
1
72
@TonyElHabr
Tony
1 year
if you (yes, YOU🫵) want to create cool graphs like this for the women's world cup, then boy do i have a treat for you (see reply)
Tweet media one
Tweet media two
1
7
68
@TonyElHabr
Tony
1 year
I compared the club rankings from 538 and The Analyst (Opta) and found some interesting differences. 1. 538 seems to think more highly of the top Brazilian league. 2. Opta seems to rate top teams from non-Big 5 leagues more favorably. 1/3
Tweet media one
Tweet media two
4
5
67
@TonyElHabr
Tony
3 years
La Liga and EPL titans face off today. Given how each league's referees call the game differently (EPL refs seem to allow more pHySiCaLiTy), one side is bound to be frustrated by the end of the match. #UCL #Chelsea #RealMadrid
Tweet media one
1
9
64
@TonyElHabr
Tony
1 year
many of the top attacking players get about half of their on-ball value from shots, although it varies. some get more value out of purely passes and pass receptions, e.g. Bruno
Tweet media one
3
9
64
@TonyElHabr
Tony
2 years
I scraped over 10k ⚽️ matches and found that 1. Italy's Serie B and other Tier 2 leagues make more "bad" fouls resulting in yellow and red cards 2. The US's Tier 2 league, the USL, is most likely to have cards described as "violent" Tier 2 leagues play harder, not smarter?
Tweet media one
Tweet media two
2
6
63
@TonyElHabr
Tony
3 years
Tweet media one
0
0
56
@TonyElHabr
Tony
1 year
✍️ New blog post: Calibrating model predictions with the {probably} R package (link in reply) ⚽️ Case study: women's football match forecasts (RIP 538)
Tweet media one
2
10
57
@TonyElHabr
Tony
1 year
as the unofficial public arbiter of model calibration, looks like SB's and Opta's xG performed equally well for this tourney
Tweet media one
@StatsBomb
StatsBomb
1 year
StatsBomb Announce The Release Of Free 2023 Women's World Cup Data
Tweet media one
2
43
183
2
6
58
@TonyElHabr
Tony
6 years
Hey #rstats and #NBA fans, I made a #flexdashboard analyzing NBA team Twitter account content since the beginning of 2018: . (This is right down your alley @dataandme .)
Tweet media one
2
9
52
@TonyElHabr
Tony
2 years
yes, the home keepers behavior certainly changes based on game state. didn't see much of a difference for away keepers, or more generally, when not accounting for home/away status. with throw-ins, the time wasting was also more profound for the home team
Tweet media one
@ChrisFagan1988
Christopher Fagan
2 years
@TonyElHabr Do one for goal kicks . Do keepers take longer for goal kicks when leading/when game is coming towards the end .
0
0
1
6
4
55
@TonyElHabr
Tony
3 years
Brentford is sticking with a lot of guys who got them here, while Watford and Norwich aren't. Is this moneyball,,,,,,,,
Tweet media one
0
3
51
@TonyElHabr
Tony
4 months
Cooked up a new blog post on ⚽️ players underperforming xG. Contents include: 🔴Unnecessary LaTeX usage 🔴Double-digit footnotes 🔴An appendix no one asked for 🟢Sick plots 🟡Questionable insights
Tweet media one
Tweet media two
4
6
50
@TonyElHabr
Tony
3 years
only the top 4 teams have positive GD?
Tweet media one
Tweet media two
0
4
46
@TonyElHabr
Tony
3 years
been a while since we've had some #EPL matches. recent league form based on in-game xG difference: 📈 Arsenal, Brighton, Aston Villa 📉 Leicester, Everton, Chelsea caveat: xG difference is influenced by the actual score (e.g. playing more conservatively when having the lead)
Tweet media one
6
3
47
@TonyElHabr
Tony
3 years
Does Major League "Golazos Only" Soccer really have a higher number of banger goals than other leagues? Yes, but only on direct free kick shots. #DecisionDay
Tweet media one
Tweet media two
Tweet media three
2
6
44
@TonyElHabr
Tony
5 years
Every need to read a long, arduous report and want to identify just the most important parts? You can use #rstats (and #text2vec ) for that! Check out my write-up here: .
Tweet media one
2
4
45
@TonyElHabr
Tony
3 years
This thread is brought to you by @KDTrey5
Tweet media one
1
0
45
@TonyElHabr
Tony
3 years
i had some requests to describe the methodology in more details, so i put together a blog post: @thecomeonman had an interesting (better) take on the same topic: some are comparing our collab to xavi and iniesta 😉
@TonyElHabr
Tony
3 years
The numbers say that the Premier League (not the Russian one) is the most difficult league to play in. (See y-axis.) Numbers never lie, so of course this is correct. Please harass @canzhiye if you disagree.
Tweet media one
23
90
491
1
8
44
@TonyElHabr
Tony
3 years
tried to come up with an #absolutelysick stat based on ⚽️ pass networks. found that it's no more descriptive than just counting passes. i wrote about it anyways. 👁️👄👁️ 🔗
Tweet media one
Tweet media two
2
1
42
@TonyElHabr
Tony
3 years
20 - @OptaJoe 's most common stat-line finisher is "opener", used in 20 separate tweets over the past few months. Overall, the account's most common one-word summaries describe in-game events, player achievements, and team activities. Interesting.
Tweet media one
@OptaJoe
OptaJoe
3 years
2 - Harry Wilson has scored in consecutive Wales appearances for the first time in his career, while Gareth Bale, who provided the assist, has now assisted in each of his last three games for his country. Opener.
Tweet media one
1
22
242
1
6
44
@TonyElHabr
Tony
5 months
✍️ ⚽️Are all 20 yard passes judged equally by possession value (PV) models? Can you still get positive PV from incomplete passes? Answers to these questions and more in my new blog post. (L I N K I N R E P L Y.)
Tweet media one
Tweet media two
5
10
45
@TonyElHabr
Tony
3 years
please clap
Tweet media one
Tweet media two
Tweet media three
Tweet media four
@TonyElHabr
Tony
3 years
streaming today at 6 PM ET is for true futbol fans (gtfo NFL). i'll be (attempting to) replicate @Worville 's transfer plots
0
2
11
6
4
39
@TonyElHabr
Tony
4 months
Season-ending xGD by game state for the EPL Some non-top 4 observations: 🔴Brentford were average at worst in all game states 🔵 Everton did best when drawing ⚪️ Nottingham Fores did best when trailing 🟣West Ham and Man U were outperformed in all game states
Tweet media one
2
10
44
@TonyElHabr
Tony
2 years
@johnspacemuller last chart, mostly for my own sanity... the difference between being ahead and behind becomes slightly more exaggerated for the home team as the match winds down
Tweet media one
0
1
39
@TonyElHabr
Tony
1 year
how it started vs how it's going
Tweet media one
Tweet media two
Tweet media three
0
1
38
@TonyElHabr
Tony
2 years
industry nerds: we want the dots academic nerds: here are some worms
@GoogleDeepMind
Google DeepMind
2 years
Today in @SciReports , the graph imputer, a new model that can predict where players will run, on and off-screen, using past and future available observations of subsets of players: #AISports 1/
15
205
1K
1
3
38
@TonyElHabr
Tony
3 years
#CONCACAF has developed such a strong reputation for gamesmanship and aggressive play that it's become a verb. ("The #USMNT got CONCACAF'd and still won.") But Brazil and the rest of the CONMEBOL have truly set the precedent for chaos.
Tweet media one
1
5
38
@TonyElHabr
Tony
3 years
Foul rate has been slightly up in the Premier League since the 2017-18 season. El jogo bonito is being ruined 😭
Tweet media one
2
3
38
@TonyElHabr
Tony
3 years
quick maths show that border town Whataburgers are the "best". (are out-of-state visitors jealous?!?) what i did: 1⃣ scraped yelp 2⃣ emperical bayes adjustment for low review counts (h/t @drob ) 3⃣ matched with McDonalds locations to account for regional confounding 4⃣ regression
Tweet media one
@TonyElHabr
Tony
3 years
back again at 6 PM ET after a week off agenda: 1⃣ do nerd stuff to find where the best Whataburgers are in Texas 2⃣ discuss my #SLICED finals hot takes 3⃣ ??? 4⃣ profit
0
1
8
2
1
36
@TonyElHabr
Tony
4 years
This year's #NBA playoffs ended as the 4th most exciting of the last 24 seasons, as measured by total change in win probability. Data: @inpredict
@TonyElHabr
Tony
4 years
How exciting have this year's #NBA playoffs been? So far, it's been the 3rd most exciting since 97-98. Last night's #FearTheDeer win over Miami was the 2nd most exciting game this playoffs. Data: @inpredict #rstats
Tweet media one
Tweet media two
0
0
5
0
6
37
@TonyElHabr
Tony
3 years
Leeds has the largest difference in foul rate relative to their opponent when ahead vs. when behind. Is this the magic behind Bielsa ball?,,,,,,,,
Tweet media one
2
3
35
@TonyElHabr
Tony
2 months
ideally we would have "ball-in-play" time to quantify pacing, but pass volume seems like an ok heuristic. most extreme cases: Italy 809 - 372 Albania Ecuador 275 - 332 Jamaica is this due to just play styles? or maybe physicality of the matches and officiating?
5
2
36
@TonyElHabr
Tony
3 years
Chelsea and Man City dominating a European title, part 2
Tweet media one
2
1
35
@TonyElHabr
Tony
3 years
The best XI is chosen based on VAEP, which is basically a framework for quantifying how every on-ball player action influences the game's outcome. (Admittedly this doesn't work as well for keepers, although Pope and Martinez ranked 2nd and 3rd, so maybe it's not that bad.) 2/7
Tweet media one
5
1
31
@TonyElHabr
Tony
6 years
Blue dress or yellow dress? Yanny or laurel? Now, for #dataviz , the question is bubble grid or choropleth? New #rstats post: . Thanks to @rettungstweet for the inspiration.
Tweet media one
Tweet media two
5
12
34
@TonyElHabr
Tony
3 years
Death, taxes, and diminishing returns are the 3 pillars of life. With ⚽️, the 3rd principle is evident in the trade-off between non-penalty xG per shot (quality) and shots per 90 (quantity).
Tweet media one
1
6
34
@TonyElHabr
Tony
3 years
All kidding aside, it's sort of remarkable how we can use a PV metric (e.g. atomic VAEP) to infer league power rankings that don't seem that off. Perhaps the coolest thing about all of this—no match results were used 🤯. (We use before-and-after transfer numbers.)
2
1
32
@TonyElHabr
Tony
3 years
Who had the best forecast for #EURO2020 prior to the tournament? 🏆 @recspecs730 🥈 @AchimZeileis 🥉 KU Leuven's sports analytics lab ( @jessejdavis1 , @p_robberechts ) 🏅 @OptaAnalyst Disclaimer: Forecasts are hard. And the margins between forecasts were relatively small.
Tweet media one
1
6
33
@TonyElHabr
Tony
3 years
@mckayjohns This is what we tell ourselves to cope with the pain
1
0
31
@TonyElHabr
Tony
3 years
Premier League + #EURO2020 analysis that no one asked for: 📈 Aside from 🏴󠁧󠁢󠁥󠁮󠁧󠁿, 🇵🇹 and 🏴󠁧󠁢󠁳󠁣󠁴󠁿 have the most Premier League players among Euros lineups 📉 4 countries (🇷🇺, 🇫🇮 , 🇭🇺, 🇦🇹) don't have any representation 1⃣ 3 countries (🇭🇷, 🇨🇿, 🇲🇰) have 1 Premier League player
Tweet media one
5
4
30
@TonyElHabr
Tony
2 years
part 1: part 2:
0
5
31
@TonyElHabr
Tony
4 years
With under two weeks left before the NBA starts up again, here's a look at projected playoff seedings for the Western Conference. (Eastern Conference is mostly locked in🥱.) Data from @JacobEGoldstein . Inspiration from @experimental361 . #rstats
Tweet media one
3
4
28
@TonyElHabr
Tony
4 years
Tweet media one
0
0
29
@TonyElHabr
Tony
3 years
If we used un-adjusted xG/90 rates for players moving from league to league, we would have a table that's more sparse and noisy. Outliers (like the large increase in xG/90 for attackers moving from Brazil's Serie A to Turkey's Süper Lig) show the effect of sampling bias.
Tweet media one
3
1
28
@TonyElHabr
Tony
3 years
Despite missing several matches, Jack Grealish still leads the EPL in fouls drawn (100) by a wide margin. (Second is Sadio Mané with 67). He's taken down most in the left attacking half.
Tweet media one
Tweet media two
2
4
28
@TonyElHabr
Tony
1 year
ok but who's going to update this chart
Tweet media one
@abhisheksh_98
Abhishek Sharma
1 year
Official Football (Analytics) Twitter Power Rankings 2023™️ is in now. Can you find yourself here (click to zoom)? If not, try using the interactive version here:
Tweet media one
6
19
167
1
3
27
@TonyElHabr
Tony
4 years
If #MUFC fans (👀 @utdarena ) are upset about the red card for Martial last weekend, just know that the referee for the match (Anthony Taylor) is not known for handing out red cards. That honor belongs to good ol' Mike Dean. h/t @FootballSlices for the colors
Tweet media one
Tweet media two
4
4
28
@TonyElHabr
Tony
2 years
i want to know what Bruno Fernandes did to the Man Utd admin last year. mans was crucified by his own team's end-of-match tweets last season 😭
Tweet media one
1
2
27
@TonyElHabr
Tony
4 years
Scoring goals⚽️ in the Champions League #UCL is harder due to stronger competition, but when you adjust for competition, most attackers actually have better goal scoring rates. Data courtesy of @21stClub + h/t @rwohan and his weekly newsletter. #rstats
Tweet media one
1
5
27
@TonyElHabr
Tony
4 years
My mom is a fan
Tweet media one
1
1
27
@TonyElHabr
Tony
2 years
Not only did they not block long-range shots at a historically low clip, they were among the best in limiting such shots. This has been going on for the past 3 seasons.
Tweet media one
1
2
26
@TonyElHabr
Tony
3 years
We can plot (transfer)market values and VAEP to identify players who may be over- and under-valued. It seems that Kane, Salah, and TAA are still under-valued, despite everyone knowing how good they can be. 4/7
Tweet media one
1
1
25
@TonyElHabr
Tony
3 years
An in-game example, courtesy of Kevin "Let me talk" De Bruyne
0
0
22
@TonyElHabr
Tony
2 years
They are making our darling Dan Burn look bad.
Tweet media one
2
1
25
@TonyElHabr
Tony
3 years
To please xGod ( @rwohan ), we can compare VAEP with DAVIES (another "all-in-one" metric). There's some correlation, but there are some notable differences and oddities. DAVIES loves Werner and Mané this year (lol), and VAEP thinks Westwood is the English Iniesta (also lol). 5/7
Tweet media one
1
1
24
@TonyElHabr
Tony
2 years
As Messi preps for the World Cup next week, I'm sure he's tuning into the USL Championship final tonight. San Antonio "Sickos" FC will play a physical, counter-attacking style against Louisville City
Tweet media one
2
1
24
@TonyElHabr
Tony
1 year
1
3
24
@TonyElHabr
Tony
2 years
Brazil v. Serbia on Thursday was the biggest height mismatch of the World Cup Group stages. Will other vertically-challenged kings be able overcome their shortcomings? (see Messi, Lionel)
Tweet media one
1
1
24
@TonyElHabr
Tony
3 years
Here's a visual example for a possession in which Kane is assigned a net negative VAEP. Note that this is not xG—actions can have negative values (especially missed shots)! YouTube link: 3/7
Tweet media one
1
0
22