Kevin Chen Profile Banner
Kevin Chen Profile
Kevin Chen

@jiafengkevinc

Followers
4,013
Following
1,774
Media
275
Statuses
2,008

Postdoc @SIEPR '24 | Assistant Professor @StanfordEcon '25 | PhD @HarvardHBS '24 | AB/SM @Harvard '19 | Econometrics, causal inference, and stats

Joined July 2019
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
@jiafengkevinc
Kevin Chen
9 months
how it started // how it's going
Tweet media one
@QJEHarvard
QJE
9 months
Recently accepted by #QJE , “Logs with Zeros? Some Problems and Solutions,” by Chen ( @jiafengkevinc ) and Roth ( @jondr44 ):
4
195
723
12
63
1K
@jiafengkevinc
Kevin Chen
4 years
how it started how it's going
Tweet media one
Tweet media two
12
94
963
@jiafengkevinc
Kevin Chen
2 years
We've all encountered this problem: "I want to take logs, but my data have zeros => ???" Commonly (myself included), people use an ad hoc transformation m(.) (e.g. m(x) = arcsinh(x)) that approximates log, and interprets the resulting effect as a % 🧵of paper with @jondr44 1/n
@pedrohcgs
Pedro H. C. Sant'Anna
2 years
Twitter is fun because we can promote the work of friends and people we admire! Today I want to call your attention to a brand new paper by @jondr44 and @jiafengkevinc , available at: What is the gist? Check the abstract!
Tweet media one
6
78
387
8
154
785
@jiafengkevinc
Kevin Chen
6 months
I'm very excited to join @StanfordEcon as an assistant professor in 2025, after a year @SIEPR . I'm stressed out about moving my 🐱 across the continent, but otherwise beyond thrilled for what's to come! (also, definitely welcome all advice re: bay area)
46
10
495
@jiafengkevinc
Kevin Chen
3 years
Seems like today is a good day to talk about why the bootstrap works Here's my intuition---
Tweet media one
5
34
397
@jiafengkevinc
Kevin Chen
4 years
only an economist would call the sample mean "OLS coefficient on a constant"
7
9
370
@jiafengkevinc
Kevin Chen
2 years
@KhoaVuUmn 🥴: "Hello, I am (G'WG)^-1 G'WΩWG (G'WG)^-1. I want to find the W that makes me as small as possible, but, look at me, I'm a mess"
3
12
318
@jiafengkevinc
Kevin Chen
2 years
👀🤩
@ecmaEditors
Econometrica
2 years
Synthetic control can be interpreted as an online learning algorithm. Even without assumptions on outcomes, results from online learning imply that synth predictions perform almost as well as those made by the best weighted match. @jiafengkevinc
Tweet media one
1
60
246
4
14
238
@jiafengkevinc
Kevin Chen
3 years
as a professional econometrician, i think the WHO's notation choices are amateurish compared to mine
Tweet media one
8
7
223
@jiafengkevinc
Kevin Chen
10 months
the people who always nod in interviews are my personal heroes
5
6
220
@jiafengkevinc
Kevin Chen
3 years
Excited to share a new working paper taking a closer look at causal inference in school choice settings, where students are matched to schools via algorithms. The paper derives the identified causal estimands under heterogeneous treatment effects (1/n)
Tweet media one
2
33
172
@jiafengkevinc
Kevin Chen
2 years
because otherwise no one can write a paper, obviously
@kpomerleau
Kyle Pomerleau
2 years
Seriously though, why should audits be less likely for taxpayers earning $399,999 than those earning $400,000?
6
0
13
1
6
159
@jiafengkevinc
Kevin Chen
3 years
Thanks @jondr44 ! A thread on this short paper 🧵 (no econometric toys are harmed in the writing of this paper)
@jondr44
Jonathan Roth
3 years
Very cool new paper by @jiafengchen42 that gives a justification for synthetic control methods He shows that when treatment timing is random and you have many time periods, SC has close to optimal regret when an adversary chooses the potential outcomes
Tweet media one
1
25
129
3
28
157
@jiafengkevinc
Kevin Chen
4 years
what is often missing in the statistical software debate is that the labor market value of python and r is positive
@Raeda_Anderson
Raeda Anderson
4 years
Grad students planning to go non-academic Learn Python and R. That is all. Thank you for coming to my TedTalk.
12
31
388
4
8
152
@jiafengkevinc
Kevin Chen
4 years
I want to open a discussion about this post by Gelman ( @StatModeling ) and various aspects of RDD inference. Gelman criticizes a paper that shows a close-election victory effect of 2-10 years on "days alive after election" (thread 1/n) #EconTwitter
Tweet media one
1
29
151
@jiafengkevinc
Kevin Chen
5 months
I too drop 15 new results in "The Tortured Job Market Paper: The Appendix"
@AP
The Associated Press
5 months
Taylor Swift drops 15 new songs on double album, 'The Tortured Poets Department: The Anthology'
25
43
168
2
8
151
@jiafengkevinc
Kevin Chen
10 months
big day for asymptotics
Tweet media one
0
4
142
@jiafengkevinc
Kevin Chen
2 years
Say what you found at its finest
Tweet media one
3
16
138
@jiafengkevinc
Kevin Chen
1 year
New paper on statistical inference with data from (batched) adaptive experiments with Isaiah Andrews [1/n]
Tweet media one
1
21
143
@jiafengkevinc
Kevin Chen
3 years
all of statistics
Tweet media one
Tweet media two
1
7
129
@jiafengkevinc
Kevin Chen
4 years
Most regression estimators in the presence of heterogeneity
Tweet media one
0
7
128
@jiafengkevinc
Kevin Chen
3 years
your daily “regress two independent I(1) against each other” tweet
1
8
126
@jiafengkevinc
Kevin Chen
4 years
foolish: use this to instrument trade flows wise: use this to instrument new trade papers
1
3
114
@jiafengkevinc
Kevin Chen
4 years
Since the Twitter exploit must be worth more than $118,000 to a profit-maximizing attacker, the fact that the realized profit is only $118,000 illustrates that the criminal market is not efficient. In this essay I will
@LazaroGamio
Lazaro Gamio
4 years
That bitcoin scam racked up at least $118,000 in just three hours =>
Tweet media one
7
51
105
2
10
105
@jiafengkevinc
Kevin Chen
4 years
i'll have to speak to my therapist about this
@ccanonne_
Clément Canonne
4 years
Here is one: If X has the same distribution as X', Y as Y', and Z as Z', do we have 𝔼[X+Y+Z]=𝔼[X'+Y'+Z']?
31
134
783
3
5
101
@jiafengkevinc
Kevin Chen
3 years
I'm thrilled to announce that I'll be offering difference-in-difference offsets, where I promise to not write DID papers for a price For a higher price I'll start working on projects like "Why TWFE is More Robust Than You Think" and "Why Constant Treatment Effect is Innocuous"
8
8
102
@jiafengkevinc
Kevin Chen
4 years
(while I have your attention, here's a PSA to use weak IV robust procedures)
Tweet media one
0
11
100
@jiafengkevinc
Kevin Chen
4 months
Live footage of conda dependency conflicts
@AMAZlNGNATURE
Nature is Amazing ☘️
4 months
King cobra bites python. Python constricts cobra. Cobra gets crushed to death. Python dies from the cobra’s venom. 🤯
Tweet media one
5K
23K
274K
1
6
88
@jiafengkevinc
Kevin Chen
4 years
today i learned \frac1n is valid TeX you're welcome
3
1
88
@jiafengkevinc
Kevin Chen
3 years
Excited to share a new preprint with @DRitzwoller on semiparametric estimation (read, *machine learning* estimation) of long-term treatment effects, such as those identified by a latent unconfoundedness or a surrogacy assumption (1/n)
1
19
84
@jiafengkevinc
Kevin Chen
3 years
i sense a market for this
Tweet media one
@jiafengkevinc
Kevin Chen
3 years
can someone make the “X as seen by Y” of {statisticians, biostatisticians, econometricians, polmeth, machine learning}
0
1
19
3
14
79
@jiafengkevinc
Kevin Chen
2 years
Tweet media one
0
0
64
@jiafengkevinc
Kevin Chen
1 year
i'm also on the job market this year 😬 website @ [(n+1)/n]
2
9
71
@jiafengkevinc
Kevin Chen
1 year
views about real analysis Did kinda cut the discipline in half
9
4
67
@jiafengkevinc
Kevin Chen
3 years
*crosses out “partial identification in causal inference” from research interests because that job is, um, automated*
@jonmummolo
Jonathan Mummolo
3 years
Our new working paper: tell the computer what you know (and don't know!) about a causal question w/ discrete data → automatically get most precise possible answer (bounds, or a point estimate). Joint w/ @guilhermejd1 @nsfinkelstein @dean_c_knox Shpitser.🧵
Tweet media one
8
125
533
2
2
68
@jiafengkevinc
Kevin Chen
2 years
so the good news is that github copilot does work in stata, the bad news is that it gives you R code
Tweet media one
1
1
69
@jiafengkevinc
Kevin Chen
4 years
i think it's probably just me
Tweet media one
1
8
67
@jiafengkevinc
Kevin Chen
4 years
my duck confit came out so well I feel compelled to show it off
Tweet media one
1
0
64
@jiafengkevinc
Kevin Chen
2 years
T̶a̶y̶l̶o̶r̶'̶s̶ ̶t̶h̶e̶o̶r̶e̶m̶ Mean-value theorem (Taylor's version)
2
0
64
@jiafengkevinc
Kevin Chen
3 years
@ben_golub vector machines
1
2
63
@jiafengkevinc
Kevin Chen
4 years
@itaisher the first hand is invisible
3
0
63
@jiafengkevinc
Kevin Chen
4 years
possibly really dumb question: does IV "work" if the instrument Z is confounded with treatment W? i.e. DAG is Z -> W -> Y, W <- V -> Y, Z <- U -> W, In PO notation, Z = Z(U, e1) W = W(Z, U, V, e2) Y = Y(W, V, e3) and (e1, e2, e3, U, V) are jointly independent
10
14
64
@jiafengkevinc
Kevin Chen
4 years
Been learning DAGs lately and wanted to be more fluent in translating between DAGs and POs I think this is a reasonably short proof of the front-door formula/napkin problem that uses only potential outcomes..? (it can conceivably fit on a big napkin?)
Tweet media one
2
2
62
@jiafengkevinc
Kevin Chen
4 years
i wonder if anyone has explained the sample variance as 1. regress data on a constant, take residuals 2. regress the squared residuals on a constant
3
0
57
@jiafengkevinc
Kevin Chen
2 years
Tweet media one
4
6
57
@jiafengkevinc
Kevin Chen
10 months
@paulgp this sounds like a new chatgpt prompt series
1
1
55
@jiafengkevinc
Kevin Chen
2 years
Another way of saying this is that we can't have all three: (a) an average effect, (b) scale-invariance, and (c) point identification. In practice, then, we must decide which two properties we want. [5/n]
Tweet media one
1
5
54
@jiafengkevinc
Kevin Chen
2 years
that's just econometrics
@juskhoo
Justin Khoo
2 years
Dang
Tweet media one
27
25
308
0
7
57
@jiafengkevinc
Kevin Chen
4 years
just regular days at the office
Tweet media one
1
0
55
@jiafengkevinc
Kevin Chen
2 years
in short: no alternative transformation can define away the log(0) problem, but sensible alternatives do exist! [n/n]
3
1
52
@jiafengkevinc
Kevin Chen
3 years
your daily macroeconomics tweet
@matthen2
Matt Henderson
3 years
the small angle approximation solves the dynamics of the pendulum by approximating sin(x) = x. How good an approximation is this? This shows the true dynamics (blue) and the approximation (white) for various initial angles
14
156
781
0
5
51
@jiafengkevinc
Kevin Chen
4 years
i've got my synthetic wisconsin ready to go
Tweet media one
@MollyBeck
Molly Beck
4 years
BREAKING: Wisconsin Supreme Court strikes down Gov. Tony Evers' order to stay home in a 4-3 decision.
2K
6K
25K
2
0
47
@jiafengkevinc
Kevin Chen
4 years
intro statistics rarely emphasizes the difference between an estimand and an estimator (usually it's fairly clear, e.g. sample mean estimates the mean), but the difference becomes super pronounced and subtle in causal inference
@paulgp
Paul Goldsmith-Pinkham
5 years
This is what I try to tell people now:
Tweet media one
5
28
143
1
9
48
@jiafengkevinc
Kevin Chen
2 years
Tweet media one
0
2
44
@jiafengkevinc
Kevin Chen
3 years
we missed the deadline to all be bayesians
Tweet media one
3
2
44
@jiafengkevinc
Kevin Chen
3 years
abolish measure theory
2
2
44
@jiafengkevinc
Kevin Chen
4 years
gonna be that guy in the seminar with @economeager but in meme form [this is actually @jondr44 's example and i just memeified it]
Tweet media one
2
1
43
@jiafengkevinc
Kevin Chen
4 years
Debate 2020: "Mr. President, please stop"
0
7
42
@jiafengkevinc
Kevin Chen
4 years
@ben_golub cauchy random variables: has cancel culture gone too far?
1
0
41
@jiafengkevinc
Kevin Chen
3 years
live view of the behavior of the implicit estimand of TWFE under treatment effect heterogeneity @KhoaVuUmn
@platini954
Certified Vakabon
3 years
???
5K
108K
540K
1
1
40
@jiafengkevinc
Kevin Chen
2 years
Well, a percent is invariant to units, and so a basic requirement is that differences in m(.) should be invariant to scaling. It turns out the opposite is true: For any m(.) that approximates log, but is defined at 0, one can choose the scaling to get anything one wants 😬[2/n]
Tweet media one
2
2
40
@jiafengkevinc
Kevin Chen
2 years
Super excited to be presenting this paper at the NBER SI Labor Studies Methods session tomorrow morning!
Tweet media one
@jiafengkevinc
Kevin Chen
3 years
Thanks @jondr44 ! A thread on this short paper 🧵 (no econometric toys are harmed in the writing of this paper)
3
28
157
2
5
37
@jiafengkevinc
Kevin Chen
3 years
If you're at CODE @MIT , I'll be presenting some work (joint with Isaiah Andrews) at 11:15am ET today, come check it out!
Tweet media one
2
2
38
@jiafengkevinc
Kevin Chen
4 years
@paulgp from isaiah's slides
Tweet media one
Tweet media two
4
4
37
@jiafengkevinc
Kevin Chen
4 years
meanwhile in econometrics, most of what I do is writing P in different fonts
@ShengwuLi
Shengwu Li
4 years
Defining notation for a theory paper is halfway between making up a language and choosing an opening in chess. Small early choices affect what moves are available later.
4
17
382
1
0
37
@jiafengkevinc
Kevin Chen
4 years
A buddy and I passed a bunch of SEC 10-K filings through a big pre-trained neural network and examined its brain---because why not, university compute is free and we can't mine BTC anyway Turns out the network understands which companies are close to which #EconTwitter ⬇️⬇️⬇️
@SuproteemSarkar
𝗦𝘂𝗽𝗿𝗼𝘁𝗲𝗲𝗺 𝗦𝗮𝗿𝗸𝗮𝗿
4 years
How can transformer models help us understand economic and financial information? New workshop paper with @jiafengchen42 develops an industry classification index from text with greater informativeness about financial metrics than existing text/expert-based classifications [1/2]
Tweet media one
1
7
37
2
6
36
@jiafengkevinc
Kevin Chen
4 years
𝙉𝙤𝙬 𝙞𝙣 𝙥𝙤𝙨𝙩𝙚𝙧 𝙛𝙤𝙧𝙢 meme form coming soon... (since this is just linear IV under the hood, it's compatible with AMIP!)
Tweet media one
@jiafengkevinc
Kevin Chen
4 years
New short paper documenting an easy recipe for incorporating machine learning into linear IV to boost instrument strength *essentially for free*: Joint work with Daniel Chen and @econ_greg #EconTwitter #CausalTwitter
2
47
212
1
3
35
@jiafengkevinc
Kevin Chen
3 years
@otis_reid A lot of times fancy estimators are actually just comparisons of the correct set of means
3
0
36
@jiafengkevinc
Kevin Chen
2 years
i can't believe no one brought up radon-nikodym derivative yet
5
0
34
@jiafengkevinc
Kevin Chen
4 years
for some particular reason the harvard-mit econometrics seminar seems to have 13x attendance today
Tweet media one
4
1
33
@jiafengkevinc
Kevin Chen
3 years
@ben_golub i'd argue the Rao-Blackwell theorem is better-known, based on me knowing Rao-Blackwell but not knowing the Blackwell information ordering 😂
1
1
32
@jiafengkevinc
Kevin Chen
2 years
@jondr44 Maybe not any more!
Tweet media one
Tweet media two
1
0
32
@jiafengkevinc
Kevin Chen
4 years
Sir, this is a landscaping company
@realDonaldTrump
Donald J. Trump
4 years
71,000,000 Legal Votes. The most EVER for a sitting President!
342K
95K
722K
0
0
33
@jiafengkevinc
Kevin Chen
3 years
@ben_golub well, we did warn you it was only mostly harmless
0
0
31
@jiafengkevinc
Kevin Chen
2 years
i think that's what "large-T asymptotics" means
@maredia
Mywish K. Maredia
2 years
I am reviewing a paper and came across the word “eternal validity,” which is a typo…but its a brilliant typo. Why us economists only talk about internal and external validity? Why not eternal validity???? 😏 #EconTwitter #AcademicChatter
23
26
476
2
2
31
@jiafengkevinc
Kevin Chen
3 years
it's also a good time to finally open yellow van der vaart and build some more intuition why bootstrap can fail---
Tweet media one
0
1
32
@jiafengkevinc
Kevin Chen
1 year
@AnthonyLeeZhang seems like most results in real analysis can be understood without real analysis
0
0
32
@jiafengkevinc
Kevin Chen
4 years
Tweet media one
3
0
31
@jiafengkevinc
Kevin Chen
2 years
@ben_golub @CaltechEconThry Proof I first saw from @stat110 's stat 210: WLOG, assume f(X), g(X) mean zero. Observe that E[f(X) g(X)] + E[f(Y) g(Y) ]= E[(f(X) - f(Y)) (g(X) - g(Y))] for X, Y independent and Y ~ X Conclude by noting that (f(x) - f(y))(g(x) - g(y)) ≥ 0 a.e.
0
0
32
@jiafengkevinc
Kevin Chen
4 years
@Andrew___Baker Theory, but always allowed to take derivatives
0
0
32
@jiafengkevinc
Kevin Chen
3 years
this is why the pscore-as-dimension-reduction thing is a bit of mental gymnastics and word sorcery---estimating the pscore is a high-dimensional problem!
2
3
31
@jiafengkevinc
Kevin Chen
2 years
@ben_golub Just need @3blue1brown to make enough videos
2
1
31
@jiafengkevinc
Kevin Chen
2 years
@ShengwuLi this doesn’t matter to me one epsilon whatsoever
1
0
31
@jiafengkevinc
Kevin Chen
2 years
TIL from @conlon_chris that one could be using a slower BLAS on Apple silicon, here's a fix that's worked for me, so far, from our thread
Tweet media one
@conlon_chris
Chris Conlon
2 years
@jiafengkevinc @paulgp Compare to this -- the point is not that that NumPy is 180x faster than R. The point is that my NumPy setup is using the correct BLAS and my R setup is not:
Tweet media one
3
0
8
0
8
31
@jiafengkevinc
Kevin Chen
4 years
Calculated the IV estimand in the binary case:
Tweet media one
6
5
28
@jiafengkevinc
Kevin Chen
5 months
P̶e̶n̶a̶l̶i̶z̶e̶d̶ ̶r̶e̶g̶r̶e̶s̶s̶i̶o̶n̶ ̶e̶s̶t̶i̶m̶a̶t̶o̶r̶s̶ The Tortured Predictors Department #TSTTPD
0
2
27
@jiafengkevinc
Kevin Chen
4 years
just realized supreme court rulings and the QJE have the same (i think?) typeface
2
0
26
@jiafengkevinc
Kevin Chen
2 years
"i have discovered a truly marvelous proof of the results in the main text, which this online appendix has too few pages to contain"
@RevEconStudies
The Review of Economic Studies
2 years
📢 **REStud Page Limit** With effect from 1st July 2022, a page limit policy applies to all submissions. Papers should be under 45 pages. Online appendices should not exceed 30 pages. A “grace period” is in place until 15th August 2022. For more details:
2
42
272
1
0
27
@jiafengkevinc
Kevin Chen
2 years
@jondr44 @ShengwuLi the easy mnemonic is that Jensen's inequality is always in the direction where you don't want it to be
0
1
26
@jiafengkevinc
Kevin Chen
5 years
Happy to share my new NBER working paper with Ed Glaeser and @davidmwessel . We find that the Opportunity Zone program, as a part of the 2017 Tax Cuts and Jobs Act, has not (so far) generated a response in residential housing prices. Link: . #econtwitter
1
1
27
@jiafengkevinc
Kevin Chen
2 years
👀
Tweet media one
0
1
26
@jiafengkevinc
Kevin Chen
3 years
Tweet media one
0
0
25
@jiafengkevinc
Kevin Chen
3 years
Now, I can't possibly pass on an opportunity to promote some recent work on Twitter, to Twitter, by quote-tweeting @TwitterEng (joint work with @DRitzwoller )
@XEng
Engineering
3 years
We built a causal estimation framework on the idea of statistical 'surrogacy' (Athey et al 2016) - when we can’t wait to observe long-run outcomes, we create a model based on intermediate data.
2
23
254
1
5
26
@jiafengkevinc
Kevin Chen
3 years
@ben_golub Indeed, the quickest road to serfdom is probably a linear program
0
0
26
@jiafengkevinc
Kevin Chen
4 years
@arpitrage "One Billion American Billionaires"-- @mattyglesias
1
0
23
@jiafengkevinc
Kevin Chen
1 year
Tweet media one
Tweet media two
0
0
24
@jiafengkevinc
Kevin Chen
3 years
Suppose (Xn, Yn) converges in distribution to (X, Y), and (Yn, Zn) converges in distribution to (Y', Z). (This implies Y' ~ Y). Suppose Xn is independent of Zn given Yn. T/F: (Xn, Yn, Zn) converges in distribution to some (X'', Y'', Z'')?
4
0
24
@jiafengkevinc
Kevin Chen
5 months
I will be presenting this paper on inference post adaptive experiments this Thursday (11:30 am ET) at the International Seminar on Selective Inference. Come check it out!
Tweet media one
1
1
24