We've all encountered this problem: "I want to take logs, but my data have zeros => ???"
Commonly (myself included), people use an ad hoc transformation m(.) (e.g. m(x) = arcsinh(x)) that approximates log, and interpret the resulting effect as a %
🧵 on a paper with
@jondr44
1/n
Twitter is fun because we can promote the work of friends and people we admire!
Today I want to call your attention to a brand new paper by
@jondr44
and
@jiafengkevinc
, available at:
What is the gist?
Check the abstract!
I'm very excited to join
@StanfordEcon
as an assistant professor in 2025, after a year
@SIEPR
. I'm stressed out about moving my 🐱 across the continent, but otherwise beyond thrilled for what's to come!
(also, definitely welcome all advice re: bay area)
Synthetic control can be interpreted as an online learning algorithm. Even without assumptions on outcomes, results from online learning imply that synth predictions perform almost as well as those made by the best weighted match.
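Here's a minimal simulated sketch of that online-learning view, using exponentiated-gradient updates over donor weights on the simplex (one standard no-regret algorithm; the data, true weights, and learning rate below are illustrative, not the paper's exact construction):

```python
# Sketch: synthetic-control-style prediction as online learning over donors.
# Simulated data; exponentiated gradient is one standard no-regret algorithm,
# not necessarily the construction in the paper.
import numpy as np

rng = np.random.default_rng(0)
T, J = 500, 5                                   # pre-treatment periods, donors
true_w = np.array([0.6, 0.3, 0.1, 0.0, 0.0])    # oracle convex combination
donors = rng.uniform(0.0, 1.0, size=(T, J))     # bounded donor outcomes
target = donors @ true_w + 0.05 * rng.normal(size=T)

w = np.ones(J) / J                              # start uniform on the simplex
eta = np.sqrt(np.log(J) / T)                    # conservative learning rate
preds = np.empty(T)
for t in range(T):
    preds[t] = donors[t] @ w                    # predict before seeing target[t]
    grad = 2.0 * (preds[t] - target[t]) * donors[t]  # gradient of squared loss
    w = w * np.exp(-eta * grad)                 # exponentiated-gradient step
    w /= w.sum()                                # renormalize onto the simplex

print(f"online MSE {np.mean((preds - target) ** 2):.4f} "
      f"vs best-weights MSE {np.mean((donors @ true_w - target) ** 2):.4f}")
print("learned weights:", np.round(w, 2))
```

The regret guarantee for this family of updates is O(sqrt(T log J)) against the best fixed weight vector on the simplex, which is the "best weighted match" benchmark above.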
@jiafengkevinc
New short paper documenting an easy recipe for incorporating machine learning into linear IV to boost instrument strength *essentially for free*:
Joint work with Daniel Chen and
@econ_greg
#EconTwitter
#CausalTwitter
Excited to share a new working paper taking a closer look at causal inference in school choice settings, where students are matched to schools via algorithms. The paper derives the identified causal estimands under heterogeneous treatment effects (1/n)
Very cool new paper by
@jiafengchen42
that gives a justification for synthetic control methods
He shows that when treatment timing is random and you have many time periods, SC has close to optimal regret when an adversary chooses the potential outcomes
I want to open a discussion about this post by Gelman (
@StatModeling
) and various aspects of RDD inference. Gelman criticizes a paper finding that winning a close election adds 2-10 years of life ("days alive after election") (thread 1/n)
#EconTwitter
Since the Twitter exploit must be worth more than $118,000 to a profit-maximizing attacker, the fact that the realized profit is only $118,000 illustrates that the criminal market is not efficient. In this essay I will
I'm thrilled to announce that I'll be offering difference-in-difference offsets, where I promise to not write DID papers for a price
For a higher price I'll start working on projects like "Why TWFE is More Robust Than You Think" and "Why Constant Treatment Effect is Innocuous"
Excited to share a new preprint with
@DRitzwoller
on semiparametric estimation (read, *machine learning* estimation) of long-term treatment effects, such as those identified by a latent unconfoundedness or a surrogacy assumption (1/n)
Our new working paper: tell the computer what you know (and don't know!) about a causal question w/ discrete data → automatically get the most precise possible answer (bounds, or a point estimate). Joint w/
@guilhermejd1
@nsfinkelstein
@dean_c_knox
Shpitser. 🧵
possibly really dumb question: does IV "work" if the instrument Z is confounded with treatment W?
i.e. the DAG is Z -> W -> Y, W <- V -> Y, Z <- U -> W.
In PO notation,
Z = Z(U, e1)
W = W(Z, U, V, e2)
Y = Y(W, V, e3)
and (e1, e2, e3, U, V) are jointly independent
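A quick check of the linear case (my simulation; all hypothetical coefficients set to 1 except the true effect): since U never enters Y directly, the exclusion restriction still holds, and simple IV recovers the effect of W on Y even though Z is confounded with W. This doesn't settle the nonparametric / heterogeneous-effects case.

```python
# Linear simulation of the DAG above: Z <- U -> W, Z -> W -> Y, W <- V -> Y.
# U confounds Z and W but never enters Y, so exclusion still holds for Z.
import numpy as np

rng = np.random.default_rng(42)
n, beta = 1_000_000, 2.0                  # sample size, true effect of W on Y
U, V = rng.normal(size=n), rng.normal(size=n)
e1, e2, e3 = rng.normal(size=(3, n))

Z = U + e1                                # Z = Z(U, e1)
W = Z + U + V + e2                        # W = W(Z, U, V, e2)
Y = beta * W + V + e3                     # Y = Y(W, V, e3)

iv = np.cov(Z, Y)[0, 1] / np.cov(Z, W)[0, 1]   # simple IV / Wald estimand
ols = np.cov(W, Y)[0, 1] / np.var(W)           # biased by V, the W-Y confounder
print(f"true beta {beta}, IV {iv:.3f}, OLS {ols:.3f}")
```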
Been learning DAGs lately and wanted to be more fluent in translating between DAGs and POs
I think this is a reasonably short proof of the front-door formula/napkin problem that uses only potential outcomes…?
(it can conceivably fit on a big napkin?)
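For reference, this is the target of the proof, the standard front-door adjustment for X → M → Y with an unobserved confounder of X and Y:

```latex
P\big(y \mid \mathrm{do}(x)\big)
  = \sum_{m} P(m \mid x) \sum_{x'} P\big(y \mid m, x'\big)\, P(x')
```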
Another way of saying this is that we can't have all three: (a) an average effect, (b) scale-invariance, and (c) point identification. In practice, then, we must decide which two properties we want. [5/n]
the small angle approximation solves the dynamics of the pendulum by approximating sin(x) ≈ x. How good an approximation is this? This shows the true dynamics (blue) and the approximation (white) for various initial angles
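A sketch of that comparison (not the original animation code; g/L and the initial angles are illustrative):

```python
# Exact pendulum dynamics vs. the small-angle approximation sin(x) ~ x.
import numpy as np
from scipy.integrate import solve_ivp

g_over_L = 1.0
t = np.linspace(0.0, 20.0, 1000)

for theta0 in (0.1, 0.5, 1.5):                 # initial angles in radians
    exact = solve_ivp(                          # theta'' = -(g/L) sin(theta)
        lambda _, y: [y[1], -g_over_L * np.sin(y[0])],
        (t[0], t[-1]), [theta0, 0.0], t_eval=t,
    )
    approx = theta0 * np.cos(np.sqrt(g_over_L) * t)   # linearized solution
    err = np.max(np.abs(exact.y[0] - approx))
    print(f"theta0 = {theta0}: max |exact - small-angle| = {err:.3f}")
```

As expected, the approximation is excellent for small initial angles and drifts badly out of phase for large ones, since the true period grows with amplitude.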
intro statistics rarely emphasizes the difference between an estimand and an estimator (usually it's fairly clear, e.g. sample mean estimates the mean), but the difference becomes super pronounced and subtle in causal inference
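A toy simulation makes the contrast concrete (all numbers illustrative): the estimand is a fixed population quantity defined via potential outcomes; the estimator is a random quantity computed from the sample we actually observe.

```python
# Estimand vs. estimator in a randomized experiment, in miniature.
import numpy as np

rng = np.random.default_rng(1)
n = 10_000
y0 = rng.normal(0.0, 1.0, size=n)              # potential outcome, control
y1 = y0 + 1.0 + rng.normal(0.0, 1.0, size=n)   # potential outcome, treated
ate = np.mean(y1 - y0)                         # estimand: average treatment effect

d = rng.integers(0, 2, size=n)                 # randomized assignment
y_obs = np.where(d == 1, y1, y0)               # only one outcome is observed
diff_means = y_obs[d == 1].mean() - y_obs[d == 0].mean()   # estimator
print(f"estimand {ate:.3f} vs estimate {diff_means:.3f}")
```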
Well, a percent is invariant to units, and so a basic requirement is that differences in m(.) should be invariant to scaling.
It turns out the opposite is true: For any m(.) that approximates log, but is defined at 0, one can choose the scaling to get anything one wants 😬[2/n]
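A quick numerical sketch of the problem (illustrative numbers: y moves from 10 to 15, a 50% increase):

```python
# Differences in log are unit-invariant; differences in arcsinh are not.
import numpy as np

y0, y1 = 10.0, 15.0
print(f"log difference {np.log(y1) - np.log(y0):.3f} (same in any units)")

for c in (0.01, 1.0, 100.0):                   # re-measure y in other units
    diff = np.arcsinh(c * y1) - np.arcsinh(c * y0)
    print(f"scale {c:>6}: arcsinh difference {diff:.3f}")
# Small c: arcsinh(x) ~ x, so the difference shrinks toward 0;
# large c: arcsinh(x) ~ log(2x), so it approaches log(1.5) ~ 0.405.
```

The same rescaling freedom is what lets the implied "percent effect" be pushed wherever one wants once zeros force you off the pure log.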
Defining notation for a theory paper is halfway between making up a language and choosing an opening in chess. Small early choices affect what moves are available later.
A buddy and I passed a bunch of SEC 10-K filings through a big pre-trained neural network and examined its brain---because why not, university compute is free and we can't mine BTC anyway
Turns out the network understands which companies are close to which
#EconTwitter
⬇️⬇️⬇️
How can transformer models help us understand economic and financial information?
New workshop paper with
@jiafengchen42
develops an industry classification index from text with greater informativeness about financial metrics than existing text/expert-based classifications [1/2]
I am reviewing a paper and came across the word “eternal validity,” which is a typo… but it's a brilliant typo. Why do we economists only talk about internal and external validity? Why not eternal validity???? 😏
#EconTwitter
#AcademicChatter
@ben_golub
@CaltechEconThry
Proof I first saw from
@stat110
's stat 210:
WLOG, assume f(X), g(X) are mean zero.
Observe that
E[f(X) g(X)] + E[f(Y) g(Y)] = E[(f(X) - f(Y)) (g(X) - g(Y))] for X, Y independent and Y ~ X
Conclude by noting that (f(x) - f(y))(g(x) - g(y)) ≥ 0 everywhere, since f and g are both nondecreasing
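Spelling out the middle identity:

```latex
\begin{align*}
\mathbb{E}\big[(f(X)-f(Y))(g(X)-g(Y))\big]
  &= \mathbb{E}[f(X)g(X)] + \mathbb{E}[f(Y)g(Y)] \\
  &\quad - \mathbb{E}[f(X)]\,\mathbb{E}[g(Y)] - \mathbb{E}[f(Y)]\,\mathbb{E}[g(X)] \\
  &= 2\,\mathbb{E}[f(X)g(X)]
\end{align*}
```

where the cross terms factor by independence and vanish by the mean-zero normalization, so E[f(X) g(X)] = Cov(f(X), g(X)) ≥ 0.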
this is why the pscore-as-dimension-reduction thing is a bit of mental gymnastics and word sorcery---estimating the pscore is a high-dimensional problem!
@jiafengkevinc
@paulgp
Compare to this -- the point is not that NumPy is 180x faster than R. The point is that my NumPy setup is using the correct BLAS and my R setup is not:
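For anyone who wants to run the same diagnosis: np.show_config() prints the BLAS/LAPACK that NumPy was built against, and a large matmul makes the difference visible (matrix size arbitrary; in R, sessionInfo() reports the BLAS/LAPACK in use):

```python
# Check which BLAS NumPy is linked against, then time a dense matmul.
import time
import numpy as np

np.show_config()                       # prints BLAS/LAPACK build information

x = np.random.default_rng(0).normal(size=(2000, 2000))
t0 = time.perf_counter()
_ = x @ x                              # dispatched to BLAS dgemm
print(f"2000x2000 matmul: {time.perf_counter() - t0:.3f}s")
```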
📢 **REStud Page Limit**
With effect from 1st July 2022, a page limit policy applies to all submissions. Papers should be under 45 pages. Online appendices should not exceed 30 pages. A “grace period” is in place until 15th August 2022. For more details:
Happy to share my new NBER working paper with Ed Glaeser and
@davidmwessel
. We find that the Opportunity Zone program, as a part of the 2017 Tax Cuts and Jobs Act, has not (so far) generated a response in residential housing prices. Link: .
#econtwitter
Now, I can't possibly pass on an opportunity to promote some recent work on Twitter, to Twitter, by quote-tweeting
@TwitterEng
(joint work with
@DRitzwoller
)
We built a causal estimation framework on the idea of statistical 'surrogacy' (Athey et al. 2016): when we can’t wait to observe long-run outcomes, we create a model based on intermediate data.
Suppose (Xn, Yn) converges in distribution to (X, Y), and (Yn, Zn) converges in distribution to (Y', Z). (This implies Y' ~ Y). Suppose Xn is independent of Zn given Yn.
T/F: (Xn, Yn, Zn) converges in distribution to some (X'', Y'', Z'')?
I will be presenting this paper on inference post adaptive experiments this Thursday (11:30 am ET) at the International Seminar on Selective Inference.
Come check it out!