Sarah Wiegreffe (on faculty job market!) @sarahwiegreffe profile

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

Followers

4K

Following

10K

Media

70

Statuses

1K

Research in language model explainability & interpretability since 2017. Postdoc @allen_ai @uwnlp PhD from @mlatgt @gtcomputing Views my own, not my employer's.

Joined September 2013

Don't wanna be here? Send us removal request.

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 months

✨I am on the faculty job market for the 2024-2025 cycle!✨. I’m at COLM @COLM_conf until Wednesday evening. Would love to hear about faculty openings (or general advice about being on the job market)!.

3

48

194

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

I am recruiting a PhD research intern for summer 2024! Please apply by Nov. 15th and mention me in your application. Due to volume I won't be able to respond to DMs or emails. Topics of interest: .1. utility of textual explanations for improving model performance, . 🧵1/2.

Aristo Team at AI2

@ai2_aristo

1 year

Time is running out: Apply for a summer 2024 Aristo Research Internship before the November 15th deadline and work with top mentors on building machines that can reason and learn. Visit this link to apply -->

7

86

403

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

Yesterday, I defended my PhD dissertation! Special thanks to my advisor @mark_riedl and committee members @cocoweixu @alan_ritter @sameer_ @nlpnoah for the valuable advice & feedback. Looking forward to what's next!

35

4

395

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

I am recruiting interns to work with me on interpretability-related research at AI2 in summer 2023!. If interested, please apply by Nov. 1st here— Retweets/shares appreciated 🥰.

8

98

393

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

Happy to share our new preprint (with @anmarasovic) “Teach Me to Explain: A Review of Datasets for Explainable NLP”.Paper: Website: It’s half survey, half reflections for more standardized ExNLP dataset collection. Highlights:. 1/6.

5

78

309

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

- How good is GPT-3 at generating human-acceptable free-text explanations?.- Can we produce good explanations with few-shot prompting? .- Can human preference modeling produce even better explanations from GPT-3?. We answer these questions and more in our #NAACL2022 paper. 🧵1/12.

5

49

262

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

Now that I'm officially on the website, this seems like a good time to announce that I've started as a young investigator (postdoc) at @allen_ai @ai2_aristo!

8

2

207

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

I'm officially a PhD candidate

Mark Riedl

@mark_riedl

3 years

Congratulations to @sarahwiegreffe for passing her thesis proposal today! 🎉.

11

1

191

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

Booking a train ticket in Germany like:

0

3

151

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

8 months

Thanks to everyone who participated in our tutorial & asked great questions! If you missed it, the recording is now available: Slides:

Zining Zhu

@zhuzining

8 months

Our NAACL 24 tutorial "Explanation in the Era of Large Language Models" will be presented on June 16 morning at Don Alberto 3! The tutorial website is at w/ @hanjie_chen @xiye_nlp @ChenhaoTan @anmarasovic @sarahwiegreffe @VeronicaLyu

3

20

110

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

Such an honor, and definitely a 2020 highlight!.

12

1

106

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 months

Have you ever wondered what ✨mechanistic interpretability✨ is, & how it differs from other NLP interpretability research? @nsaphra and I have the paper for you!. Check out our paper (which I'll present @BlackboxNLP @emnlpmeeting in Miami next month!).

Naomi Saphra 🧈🪰

@nsaphra

4 months

What makes some LM interpretability research “mechanistic”? In our new position paper in @BlackboxNLP, @sarahwiegreffe and I argue that the practical distinction was never technical, but a historical artifact that we should be—and are—moving past to bridge communities.

2

16

109

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

Not even 5 minutes into April Fools Day and I fell for this 🥲

2

5

101

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

I now have something useful to say when someone asks me about the real-world utility of my research😮😅

4

3

92

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

Seems like people at #eacl2023 mostly just took pictures of cats, effectively pushing all the AI influencers off my Twitter feed #thankyou 💯🙏.

3

93

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

Alaska was amazing! Here are some pictures from a hike with views of Spencer Glacier, including the friendly (?) black bear we woke up to outside our cabin.

4

0

83

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

Excited to be in Seattle for both #NAACL2022 and my birthday! 🥳

4

0

81

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Happy to announce “Attention is not not Explanation”, accepted to #emnlp2019! Work by myself and @yuvalpi . 1/n.

1

12

74

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

New paper: "Attentiveness to Answer Choices Doesn’t Always Entail High QA Accuracy" 📊💬. Something I've been thinking a lot about recently is the relationship between distributions over vocabularies produced by language models and the various ways. 1/5

1

18

72

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

Have you ever wondered whether localization and ROME/MEMIT-style model editing can work for tasks beyond factual recall (like commonsense plausibility prediction)? If so, come to our poster tomorrow 9-10:30am for our paper "Editing Common Sense in Transformers" #EMNLP2023 😃

1

10

72

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

Anyone else still enjoying the ACL/EMNLP 2020 recorded videos? I still go back to watch them for new papers I'm reading. It helps to see how the authors synthesize their own work and makes reading so much faster.

5

2

71

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

8 months

What are your plans for Sunday morning at 9am CST? Come join us bright & early on the first day of #NAACL2024 for our tutorial on explanation in the era of LLMs!

Zining Zhu

@zhuzining

8 months

Our NAACL 24 tutorial "Explanation in the Era of Large Language Models" will be presented on June 16 morning at Don Alberto 3! The tutorial website is at w/ @hanjie_chen @xiye_nlp @ChenhaoTan @anmarasovic @sarahwiegreffe @VeronicaLyu

1

7

68

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

Georgia Tech at #EMNLP2021! @cjziems @yuvalpi @mlatgt @ICatGT @gtcomputing

4

3

58

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

11 months

Vacation ✅

0

60

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

Checkout our recent work on easy-to-hard generalization with LMs, led by outstanding intern @peterbhase :.

Peter Hase

@peterbhase

1 year

Can LLMs generalize from easy to hard problems?. Models actually solve college test questions when trained on 3rd grade questions!. 🚨New paper: “The Unreasonable Effectiveness of Easy Training Data for Hard Tasks”.🧵1/6

0

10

58

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

.#EMNLP2021 paper with @anmarasovic @nlpnoah: Measuring Association Between Labels and Rationales. Paper: (updated experiments!).Video: 🧵Why free-text explanations? We argue that free-text explanations are going to be important. .

3

11

57

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

If you missed my talk at EMNLP, come chat at the #WiML2019 poster session tomorrow 6:30-8pm in East Hall B! #NeurIPS2019

3

6

58

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

I got hooded!.

5

0

57

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

6 months

I'm biased (🤷‍♀️) but 👇👇👇.

Wei Xu

@cocoweixu

6 months

Which one do you like it better?.#ACL2024 @aclmeeting. 2/2

6

1

58

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

I'm looking for funded opportunities for interpretability + #NLProc research for summer 2021 (internships + other). If you know of anything, please pass along!.

1

9

52

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

Checkout our newest work: Self-Refine: Iterative Refinement with Self-Feedback. We show that LLMs such as GPT-3.5 can:.1. Generate a draft output.2. Generate textual feedback critiquing it.3. Refine the draft output using the feedback.4. Iterate, as needed. More below ⬇️.

Aman Madaan

@aman_madaan

2 years

Can LLMs enhance their own output without human guidance? In some cases, yes! With Self-Refine, LLMs generate feedback on their work, use it to improve the output, and repeat this process. Self-Refine improves GPT-3.5/4 outputs for a wide range of tasks.

1

7

53

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

I'll be at #ACL2023NLP in Toronto next week! Would love to meet/catch up with people- please reach out. I'm giving an invited talk "Two Views of LM Interpretability" at the NLRSE workshop Thursday at 4:40pm. Come hear my thoughts on prompting & mechanistic interpretability.

Greg Durrett

@gregd_nlp

2 years

🔥 The program for the NLRSE workshop at #ACL2023NLP is posted 🔥.In addition to the 5 invited talks, we have a fantastic technical program with 70 papers being presented! Make sure you stick around on Thursday after the main conference!.

2

7

54

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 months

I’m giving a talk about this paper today in the 3- 3:30 session @BlackboxNLP (Jasmine ballroom) and will stick around for the afternoon poster session as well. Would love to hear your opinions 😊.

Naomi Saphra 🧈🪰

@nsaphra

4 months

What makes some LM interpretability research “mechanistic”? In our new position paper in @BlackboxNLP, @sarahwiegreffe and I argue that the practical distinction was never technical, but a historical artifact that we should be—and are—moving past to bridge communities.

1

7

53

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

Sarthak & I have a ✨fun✨ talk for you all at the Big Picture workshop #EMNLP2023! It's Thurs. Dec. 7th 11am-12pm Singapore time (Wed. Dec. 6th 7-8pm Pacific).

Yanai Elazar

@yanaiela

1 year

@sarahwiegreffe and @successar_nlp have joined forces to dispute once and for all the question "Is "Attention = Explanation" and the Role of Interpretability in NLP".

0

7

52

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

Ever wondered whether surface form competition is something you should be worried about in zero-/few-shot LLM eval? We have answers 😄. Come to our poster "Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy" tomorrow from 10:30am-12pm! #EMNLP2023.

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

New paper: "Attentiveness to Answer Choices Doesn’t Always Entail High QA Accuracy" 📊💬. Something I've been thinking a lot about recently is the relationship between distributions over vocabularies produced by language models and the various ways. 1/5

4

7

50

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Code now available!

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Happy to announce “Attention is not not Explanation”, accepted to #emnlp2019! Work by myself and @yuvalpi . 1/n.

0

9

51

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

I really enjoyed giving this talk-- recording is now available.

NLP with Friends

@NLPwithFriends

4 years

And our next speaker is. 🥁🥁🥁. 🗣Sarah Wiegreffe (@sarahwiegreffe) will talk with us about "Measuring Association Between Labels and Free-Text Rationales".🗓February 17th, 14:00 UTC.📝 Sign up here: .

0

5

46

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

I officially deem myself too verbose for conference paper page limits 🤷‍♀️.

3

0

46

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

.@yanaiela and I are hosting a Birds of a Feather social on #interpretability at #NAACL2022 on Tuesday 2-3pm PT. It will be hybrid both in-person and on Zoom (more info here: Come chat!.

1

8

45

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 months

Nice to see Anthropic more rigorously testing their previously-qualitative results on steering and being open about its mixed results. One thing I wonder is: how do these results compare to steering model representations directly (no SAE)?.

Anthropic

@AnthropicAI

4 months

New Anthropic research: Evaluating feature steering. In May, we released Golden Gate Claude: an AI fixated on the Golden Gate Bridge due to our use of “feature steering”. We've now done a deeper study on the effects of feature steering. Read the post:

2

3

46

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

Thanks to @BastingsJasmijn, #BlackBoxNLP workshop now has a Youtube channel! If you missed any of our keynote talks at EMNLP last week, check them out here: More to be added soon :) .@yanaiela @_dieuwke_ @boknilev @nsaphra.

0

7

44

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

If you're at #NeurIPS2021, I'm presenting our explainable NLP datasets survey for the next 1.5 hours at the Datasets & Benchmarks poster session!. Landing page: Gathertown (spot A2): Camera-ready:

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

Happy to share our new preprint (with @anmarasovic) “Teach Me to Explain: A Review of Datasets for Explainable NLP”.Paper: Website: It’s half survey, half reflections for more standardized ExNLP dataset collection. Highlights:. 1/6.

1

6

42

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

New #acl2020 paper "Learning to Faithfully Rationalize with Construction" with @successar_nlp @byron_c_wallace @yuvalpi - preprint coming soon!.

3

39

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Check out my feature on the ML@GT blog:.

Machine Learning at Georgia Tech

@mlatgt

5 years

🚨 NEW BLOG POST 🚨. ML@GT Ph.D. students @sarahwiegreffe and @yuvalpi discuss their #NLP work on plausible vs. faithful reasoning and why it's important to understand a model's reasoning process. Trust us, it's a good post. 📝:

0

6

37

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Didn't get done what I planned to today, but at least managed to cut down my code runtime from >24h to ~3h. Pretty sure the compute saved will come back to help me later. Daily reminder that research accomplishment comes in many forms 😃.

2

0

35

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

Great talk by @claranahhh on “To Build Our Future, We Must Know Our Past” about software lotteries, explore/exploit phases, funding incentives, evaluation benchmarking culture, etc. in NLP research! .Co- @_sireesh @abertsch72

2

3

31

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

0

2

33

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

I will be giving an invited talk at the @USC_ISI @nlp_usc Seminar this Thursday (11am PT)- the abstract and livestream link can be found here:

1

3

31

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

If text-davinci-001 is a rough approximate to the model reported in the NeurIPS 2020 paper, and text-davinci-002 is ~InstructGPT in the 2022 preprint, then what is just "davinci"? 🤯. Trying to reproduce results from a time before this naming existed.

5

3

30

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

“Reframing Human-AI Collaboration for Generating Free-Text Explanations” with @jmhessel @swabha @mark_riedl @YejinChoinka . Paper: Code/data:

1

7

31

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

I will also highlight 2 recent EMNLP papers if you're curious what I've been up to recently:.- Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy (.- Editing Common Sense in Transformers (.

0

2

28

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

The ACL Anthology goes down the *one day* I actually need anthology-version PDFs for an application 😅.

2

1

27

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

. , human-AI collaboration, or general human understanding of LM behavior.2. text generation, sampling, uncertainty estimation to understand & improve black-box models .3. mechanistic/bottom-up understanding of models beyond specific tasks. 🧵2/2 Please retweet & share!.

2

3

28

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Enjoyed presenting at #emnlp2019 in Hong Kong last week. Thanks @ICatGT @gtsga for travel support

1

2

27

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

This talk is today 4:40pm in Pier 4! (to the left of the hall with the sponsorship booths on the main hotel side of the venue).

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

I'll be at #ACL2023NLP in Toronto next week! Would love to meet/catch up with people- please reach out. I'm giving an invited talk "Two Views of LM Interpretability" at the NLRSE workshop Thursday at 4:40pm. Come hear my thoughts on prompting & mechanistic interpretability.

0

5

28

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

@jacobandreas @srush_nlp FWIW, I gave a talk at ACL in July on this topic. The framework in the talk doesn't capture everything, but I think it gives some credence as to why the terminology might be useful. "Two Views of LM Interpretability" (starting at 7:46): .

0

2

28

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 months

Interested in notions of correctness in multiple choice datasets for commonsense tasks? Come check out our poster, happening now!.

Shramay Palta

@PaltaShramay

3 months

Happening in less than 20 minutes at Board 217 in Riverfront! .Drop by!.#EMNLP2024 #NLProc

0

2

28

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

@fhuszar You can email women-in-machine-learning@googlegroups.com-- a lot of people advertise PhD positions there. >4000 members.

1

0

26

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

7 months

Teaching and measuring LM noncompliance should be about more than just safety and outright refusal -- checkout our paper for our taxonomy, dataset, and lots of benchmarking results!.

Faeze Brahman

@faeze_brh

7 months

🤖 When and how should AI models not comply with user requests?.Our latest work with @shocheen at @allen_ai dives into this question, expanding the scope of model noncompliance beyond just refusing "unsafe" queries. 1/n🧵. #LLMs #refusal #noncompliance #responsible_ai

0

4

27

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

.@emnlpmeeting Can you clarify #EMNLP2023 "presenter" vs. "non-presenter" registration costs? Does every author on a paper have to register as a presenter, just the presenting author(s), or just one author per paper?.

2

24

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

We’re right at the entrance (2A)

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

Have you ever wondered whether localization and ROME/MEMIT-style model editing can work for tasks beyond factual recall (like commonsense plausibility prediction)? If so, come to our poster tomorrow 9-10:30am for our paper "Editing Common Sense in Transformers" #EMNLP2023 😃

0

4

25

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

I've so enjoyed my internship at AI2 @ai2_allennlp with Noah, @anmarasovic and team! Consider applying:.

1

0

24

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

6 months

TIL about Baker Island, and now I can't stop imagining how much easier my life would be if I moved there 😂

2

1

25

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

6 years

A great piece of scientific writing for a non-technical audience (& exciting developments for ML from #NeurIPS2018). Imagining how much more accessible the field would be if all published papers had this kind of intuitive breakdown. @techreview @_KarenHao.

0

10

24

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 months

Help: how can I search conference proceedings in the ACL Anthology by *track*? Like how the orals and posters are nicely organized at the physical conference?.@aclanthology.

10

3

24

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Lectured on attention and transformers for NLP applications in @GeorgiaTech's Deep Learning course last Thursday! Slides available here: 1/2.

1

5

24

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

As a side note, it's so difficult to do research based on an API with poor documentation.

2

0

22

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

An updated & more condensed (10-page) version is available here: Camera-ready coming soon! w/ @anmarasovic @ai2_allennlp.

Mark Riedl

@mark_riedl

4 years

Interested in getting started in Explainable NLP (ExNLP)? This paper by @sarahwiegreffe reviews the datasets. 11 pages of references! . To appear in the NeurIPS 2021 Benchmarks and Datasets track.

0

10

21

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

Nice that NAACL shared updated reviews, but here's what's unfortunate: those who had reasonable scores did not withdraw and/or submit an abstract to ACL; if rejected will have to wait months to resubmit. Those with decisively bad scores were able to resubmit almost immediately.

2

0

20

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

To participate in the #PeopleofNLProc series we're running for #NAACL2021 publicity, fill out the Google form here:

NAACL HLT 2025

@naaclmeeting

4 years

Our first #PeopleofNLProc researcher is Vered Shwartz @VeredShwartz. Vered's Bio: I'm a postdoc at AI2 and the University of Washington. I got my PhD from Bar-Ilan University. Besides research, I like to work out, feed birds, and listen to audiobooks. 1/4

2

6

20

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

with @mark_riedl, advisor extraordinaire

0

19

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

Trending topic because of AI influencers, or the #EMNLP2023 anonymity deadline? 🤔😅

1

0

19

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

6 years

Passed my qualifying exam! Woohoo 🎉.

5

0

18

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

New paper at the #NAACL2021 narrative understanding workshop.

Mark Riedl

@mark_riedl

4 years

5. Inferring the Reader: Guiding Automated Story Generation with Commonsense Reasoning.@beckypeng6 @sarahwiegreffe @Sylvia_Sparkle . Using commonsense reasoning to guide a neural story generator. (We'll have more to say about the importance of _reader models_ later, stay tuned).

2

0

17

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Come see my talk at #EMNLP this Tuesday; I’ll be re-framing some of the takeaways of our work as a direct contribution to the faithful explainability literature.

Yuval Pinter

@yuvalpi

5 years

@sarahwiegreffe @alon_jacovi To be stated more clearly in the talk :).Tuesday 10:48 session 1A, come one come all!.

3

17

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Check out our paper at ACL! Q&A sessions are Tuesday at 8am and 5pm ET.

1

0

16

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

10 months

Not me thinking ACL was hosting a climbing competition 🧗⛰️

0

16

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Super cool app find of the day (or maybe I'm behind the times): @MathpixApp Lets you screenshot rendered LaTeX in a PDF, and converts it to raw LaTeX for you (with surprising accuracy!).

1

14

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Excited to see that "Attention is not not Explanation" is the #2 top recent paper of the past month on Arxiv Sanity Preserver! @yuvalpi.

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Happy to announce “Attention is not not Explanation”, accepted to #emnlp2019! Work by myself and @yuvalpi . 1/n.

0

2

15

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

(Also-- this is the general application for Aristo and we will review them all, but if you're specifically interested in interpretability please highlight that in your application and/or reach out to me directly!).

2

0

14

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

I want to speak out in solidarity against the explicit symbols of racism we've seen so clearly in the last few weeks. I am learning that these are representations of realities that many of my friends experience on a daily basis, something that I was naively previously unaware of.

0

15

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

6 months

Good take.

Andreas Madsen

@andreas_madsen

7 months

After the Mechanistic Interpretability workshop I feel tempted to make my 2024 statement early:. There are no low hanging fruits in interpretability. Everything requires so much skepticism, critical thinking, and is mega hard to properly validate.

0

14

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

6 months

Go work with Tuhin! 😄.

Tuhin Chakrabarty

@TuhinChakr

6 months

📣 I will be in New York 🗽 for the foreseeable future, and join @stonybrooku @sbucompsc @SUNY as an Assistant Professor in the Fall of 2025. I plan to recruit 1-2 PhD students starting next fall. Come tackle exciting problems with me on Human Centered AI :-)

1

2

13

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

Reading a JMLR paper from 2010:

1

0

13

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

Really excited for this effort :).

Ai2

@allen_ai

2 years

Today we're thrilled to announce our new undertaking to collaboratively build the best open language model in the world: AI2 OLMo. Uniquely open, 70B parameters, coming early 2024 – join us!.

0

13

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

@AndrewLampinen Very cool work! You might be interested in our paper (to appear at NAACL) on generating few-shot free-text explanations from GPT-3 for passing crowdworker judgements of explanation acceptability. w/ @swabhz @jmhessel @mark_riedl @YejinChoinka .

1

0

13

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

Guess I need to start scheduling time for "being angry at the political divides now characterizing the interpretability research community" on my weekly calendar 🤔😪.

2

0

13

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

I hope we continue producing this resource. It's a great way to get up to speed on a wider scope of papers than I typically read.

0

11

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

6 years

@theodunayo @jmhessel @alethioguy @yuvalpi @RTomMcCoy @ACL2019_Italy My favorite hit has to be “I’ll never break your gradient flow” by the Backprop Boys.

1

0

12

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

I complained to @USPS that I haven't received any mail in over a month. After not having heard back via phone or email as I was told, I login online to see that they have sent me a *letter* regarding my case.🤦‍♀️❓❓.

1

0

11

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

In today's work-from-home saga: spilled milk & cereal all over myself + desk + chair. Laptop was spared 🙏 Multitasking is hard people! 😆.

1

0

12

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

We had ~20 hours of sunlight, which also meant seeing some alpenglow 🌄

0

12

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

2 years

@janleike Oh wow, thanks for this insight. I don't see that model listed in my view of the API, so I assume I'd have to request access.

2

0

11

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

6 years

Shout into the void: looking for some degree/career planning advice, preferably from someone in broad ML/NLP academia. Does anyone know of anyone willing to lend me ~30 minutes of their time?.

1

3

11

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

1 year

Ooooh this looks exciting!.

Thomas G. Dietterich

@tdietterich

1 year

#ICML will have a Position Paper track. "The goal of this track is to highlight papers that stimulate (productive, civil) discussion on timely topics that need our community’s input" #AI. Read more here:

0

11

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

3 years

Takeaways:.- GPT-3 shows potential for automatically creating free-text explanation datasets. - Acceptability can be improved with high-quality prompts and trained filter model operating on over-generations. - Despite its subjectivity, crowdworker acceptability can be modeled.🔚.

4

2

10

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 months

In my experience, like model editing, steering is extremely hyper-parameter sensitive, and we don't have good grounding for selecting meaningful coefficients that keep model representations in-distribution and prevent breaking models' general capabilities.

1

10

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

4 years

@tallinzen Yeah, time isn't infinite and the research community's rewards are often misaligned with incentivizing thorough, high-quality research.

1

0

9

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

5 years

Had a great time presenting at #WiML2019 yesterday!.I'll be around #NeurIPS2019 all week; come say hi if you see me.

0

10

Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

8 months

@jeremyphoward @aryaman2020 This is the age-old struggle between academia and industry. Anthropic is pushing a specific flavor of interpretability work that is attracting a lot of funding and attention from junior researchers & those outside the field. It is not the only viable (or existing) direction.

1

0

10