Sarah Wiegreffe (on faculty job market!)

@sarahwiegreffe

Followers: 4K
Following: 10K
Media: 70
Statuses: 1K

Research in language model explainability & interpretability since 2017. Postdoc @allen_ai @uwnlp PhD from @mlatgt @gtcomputing Views my own, not my employer's.

Joined September 2013
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 months
✨I am on the faculty job market for the 2024-2025 cycle!✨ I’m at COLM @COLM_conf until Wednesday evening. Would love to hear about faculty openings (or general advice about being on the job market)!
3
48
194
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
I am recruiting a PhD research intern for summer 2024! Please apply by Nov. 15th and mention me in your application. Due to volume I won't be able to respond to DMs or emails. Topics of interest: 1. utility of textual explanations for improving model performance, 🧵1/2
@ai2_aristo
Aristo Team at AI2
1 year
Time is running out: Apply for a summer 2024 Aristo Research Internship before the November 15th deadline and work with top mentors on building machines that can reason and learn. Visit this link to apply -->
7
86
403
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
Yesterday, I defended my PhD dissertation! Special thanks to my advisor @mark_riedl and committee members @cocoweixu @alan_ritter @sameer_ @nlpnoah for the valuable advice & feedback. Looking forward to what's next!
Tweet media one
Tweet media two
35
4
395
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
I am recruiting interns to work with me on interpretability-related research at AI2 in summer 2023! If interested, please apply by Nov. 1st here. Retweets/shares appreciated 🥰
8
98
393
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
Happy to share our new preprint (with @anmarasovic) “Teach Me to Explain: A Review of Datasets for Explainable NLP”. Paper: Website: It’s half survey, half reflections for more standardized ExNLP dataset collection. Highlights: 1/6
5
78
309
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
- How good is GPT-3 at generating human-acceptable free-text explanations?
- Can we produce good explanations with few-shot prompting?
- Can human preference modeling produce even better explanations from GPT-3?
We answer these questions and more in our #NAACL2022 paper. 🧵1/12
5
49
262
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
Now that I'm officially on the website, this seems like a good time to announce that I've started as a young investigator (postdoc) at @allen_ai @ai2_aristo!
Tweet media one
8
2
207
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
I'm officially a PhD candidate
@mark_riedl
Mark Riedl
3 years
Congratulations to @sarahwiegreffe for passing her thesis proposal today! 🎉
11
1
191
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
Booking a train ticket in Germany like:
Tweet media one
0
3
151
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
8 months
Thanks to everyone who participated in our tutorial & asked great questions! If you missed it, the recording is now available: Slides:
Tweet media one
@zhuzining
Zining Zhu
8 months
Our NAACL 24 tutorial "Explanation in the Era of Large Language Models" will be presented on June 16 morning at Don Alberto 3! The tutorial website is at w/ @hanjie_chen @xiye_nlp @ChenhaoTan @anmarasovic @sarahwiegreffe @VeronicaLyu
Tweet media one
3
20
110
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
Such an honor, and definitely a 2020 highlight!
12
1
106
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 months
Have you ever wondered what ✨mechanistic interpretability✨ is, & how it differs from other NLP interpretability research? @nsaphra and I have the paper for you! Check out our paper (which I'll present @BlackboxNLP @emnlpmeeting in Miami next month!)
@nsaphra
Naomi Saphra 🧈🪰
4 months
What makes some LM interpretability research “mechanistic”? In our new position paper in @BlackboxNLP, @sarahwiegreffe and I argue that the practical distinction was never technical, but a historical artifact that we should be—and are—moving past to bridge communities.
Tweet media one
2
16
109
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
Not even 5 minutes into April Fools Day and I fell for this 🥲
Tweet media one
2
5
101
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
I now have something useful to say when someone asks me about the real-world utility of my research😮😅
Tweet media one
4
3
92
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
Seems like people at #eacl2023 mostly just took pictures of cats, effectively pushing all the AI influencers off my Twitter feed #thankyou 💯🙏
3
3
93
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
Alaska was amazing! Here are some pictures from a hike with views of Spencer Glacier, including the friendly (?) black bear we woke up to outside our cabin.
Tweet media one
Tweet media two
Tweet media three
Tweet media four
4
0
83
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
Excited to be in Seattle for both #NAACL2022 and my birthday! 🥳
Tweet media one
Tweet media two
4
0
81
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Happy to announce “Attention is not not Explanation”, accepted to #emnlp2019! Work by myself and @yuvalpi. 1/n
1
12
74
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
New paper: "Attentiveness to Answer Choices Doesn’t Always Entail High QA Accuracy" 📊💬. Something I've been thinking a lot about recently is the relationship between distributions over vocabularies produced by language models and the various ways. 1/5
Tweet media one
1
18
72
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
Have you ever wondered whether localization and ROME/MEMIT-style model editing can work for tasks beyond factual recall (like commonsense plausibility prediction)? If so, come to our poster tomorrow 9-10:30am for our paper "Editing Common Sense in Transformers" #EMNLP2023 😃
Tweet media one
Tweet media two
1
10
72
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
Anyone else still enjoying the ACL/EMNLP 2020 recorded videos? I still go back to watch them for new papers I'm reading. It helps to see how the authors synthesize their own work and makes reading so much faster.
5
2
71
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
8 months
What are your plans for Sunday morning at 9am CST? Come join us bright & early on the first day of #NAACL2024 for our tutorial on explanation in the era of LLMs!
Tweet media one
@zhuzining
Zining Zhu
8 months
Our NAACL 24 tutorial "Explanation in the Era of Large Language Models" will be presented on June 16 morning at Don Alberto 3! The tutorial website is at w/ @hanjie_chen @xiye_nlp @ChenhaoTan @anmarasovic @sarahwiegreffe @VeronicaLyu
Tweet media one
1
7
68
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
Tweet media one
4
3
58
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
11 months
Vacation ✅
Tweet media one
0
0
60
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
Check out our recent work on easy-to-hard generalization with LMs, led by outstanding intern @peterbhase:
@peterbhase
Peter Hase
1 year
Can LLMs generalize from easy to hard problems? Models actually solve college test questions when trained on 3rd grade questions! 🚨New paper: “The Unreasonable Effectiveness of Easy Training Data for Hard Tasks” 🧵1/6
Tweet media one
0
10
58
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
#EMNLP2021 paper with @anmarasovic @nlpnoah: Measuring Association Between Labels and Rationales. Paper: (updated experiments!). Video: 🧵Why free-text explanations? We argue that free-text explanations are going to be important.
3
11
57
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
If you missed my talk at EMNLP, come chat at the #WiML2019 poster session tomorrow 6:30-8pm in East Hall B! #NeurIPS2019
Tweet media one
3
6
58
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
I got hooded!
5
0
57
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
6 months
I'm biased (🤷‍♀️) but 👇👇👇
@cocoweixu
Wei Xu
6 months
Which one do you like better? #ACL2024 @aclmeeting 2/2
Tweet media one
6
1
58
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
I'm looking for funded opportunities for interpretability + #NLProc research for summer 2021 (internships + other). If you know of anything, please pass along!
1
9
52
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
Check out our newest work: Self-Refine: Iterative Refinement with Self-Feedback. We show that LLMs such as GPT-3.5 can:
1. Generate a draft output
2. Generate textual feedback critiquing it
3. Refine the draft output using the feedback
4. Iterate, as needed
More below ⬇️
@aman_madaan
Aman Madaan
2 years
Can LLMs enhance their own output without human guidance? In some cases, yes! With Self-Refine, LLMs generate feedback on their work, use it to improve the output, and repeat this process. Self-Refine improves GPT-3.5/4 outputs for a wide range of tasks.
Tweet media one
1
7
53
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
I'll be at #ACL2023NLP in Toronto next week! Would love to meet/catch up with people- please reach out. I'm giving an invited talk "Two Views of LM Interpretability" at the NLRSE workshop Thursday at 4:40pm. Come hear my thoughts on prompting & mechanistic interpretability.
@gregd_nlp
Greg Durrett
2 years
🔥 The program for the NLRSE workshop at #ACL2023NLP is posted 🔥 In addition to the 5 invited talks, we have a fantastic technical program with 70 papers being presented! Make sure you stick around on Thursday after the main conference!
2
7
54
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 months
I’m giving a talk about this paper today in the 3-3:30 session @BlackboxNLP (Jasmine ballroom) and will stick around for the afternoon poster session as well. Would love to hear your opinions 😊
@nsaphra
Naomi Saphra 🧈🪰
4 months
What makes some LM interpretability research “mechanistic”? In our new position paper in @BlackboxNLP, @sarahwiegreffe and I argue that the practical distinction was never technical, but a historical artifact that we should be—and are—moving past to bridge communities.
Tweet media one
1
7
53
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
Sarthak & I have a ✨fun✨ talk for you all at the Big Picture workshop #EMNLP2023! It's Thurs. Dec. 7th 11am-12pm Singapore time (Wed. Dec. 6th 7-8pm Pacific).
@yanaiela
Yanai Elazar
1 year
@sarahwiegreffe and @successar_nlp have joined forces to dispute once and for all the question "Is "Attention = Explanation" and the Role of Interpretability in NLP".
Tweet media one
0
7
52
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
Ever wondered whether surface form competition is something you should be worried about in zero-/few-shot LLM eval? We have answers 😄. Come to our poster "Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy" tomorrow from 10:30am-12pm! #EMNLP2023.
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
New paper: "Attentiveness to Answer Choices Doesn’t Always Entail High QA Accuracy" 📊💬. Something I've been thinking a lot about recently is the relationship between distributions over vocabularies produced by language models and the various ways. 1/5
Tweet media one
4
7
50
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Code now available!
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Happy to announce “Attention is not not Explanation”, accepted to #emnlp2019! Work by myself and @yuvalpi. 1/n
0
9
51
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
I really enjoyed giving this talk-- recording is now available.
@NLPwithFriends
NLP with Friends
4 years
And our next speaker is 🥁🥁🥁 🗣 Sarah Wiegreffe (@sarahwiegreffe) will talk with us about "Measuring Association Between Labels and Free-Text Rationales". 🗓 February 17th, 14:00 UTC. 📝 Sign up here:
Tweet media one
0
5
46
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
I officially deem myself too verbose for conference paper page limits 🤷‍♀️
3
0
46
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
.@yanaiela and I are hosting a Birds of a Feather social on #interpretability at #NAACL2022 on Tuesday 2-3pm PT. It will be hybrid, both in-person and on Zoom (more info here: ). Come chat!
1
8
45
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 months
Nice to see Anthropic more rigorously testing their previously-qualitative results on steering and being open about its mixed results. One thing I wonder is: how do these results compare to steering model representations directly (no SAE)?
@AnthropicAI
Anthropic
4 months
New Anthropic research: Evaluating feature steering. In May, we released Golden Gate Claude: an AI fixated on the Golden Gate Bridge due to our use of “feature steering”. We've now done a deeper study on the effects of feature steering. Read the post:
Tweet media one
2
3
46
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
Thanks to @BastingsJasmijn, #BlackBoxNLP workshop now has a Youtube channel! If you missed any of our keynote talks at EMNLP last week, check them out here: More to be added soon :) @yanaiela @_dieuwke_ @boknilev @nsaphra
0
7
44
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
If you're at #NeurIPS2021, I'm presenting our explainable NLP datasets survey for the next 1.5 hours at the Datasets & Benchmarks poster session! Landing page: Gathertown (spot A2): Camera-ready:
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
Happy to share our new preprint (with @anmarasovic) “Teach Me to Explain: A Review of Datasets for Explainable NLP”. Paper: Website: It’s half survey, half reflections for more standardized ExNLP dataset collection. Highlights: 1/6
1
6
42
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
New #acl2020 paper "Learning to Faithfully Rationalize with Construction" with @successar_nlp @byron_c_wallace @yuvalpi - preprint coming soon!
3
3
39
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Check out my feature on the ML@GT blog:
@mlatgt
Machine Learning at Georgia Tech
5 years
🚨 NEW BLOG POST 🚨 ML@GT Ph.D. students @sarahwiegreffe and @yuvalpi discuss their #NLP work on plausible vs. faithful reasoning and why it's important to understand a model's reasoning process. Trust us, it's a good post. 📝:
Tweet media one
0
6
37
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Didn't get done what I planned to today, but at least managed to cut down my code runtime from >24h to ~3h. Pretty sure the compute saved will come back to help me later. Daily reminder that research accomplishment comes in many forms 😃
2
0
35
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
Great talk by @claranahhh on “To Build Our Future, We Must Know Our Past” about software lotteries, explore/exploit phases, funding incentives, evaluation benchmarking culture, etc. in NLP research! Co- @_sireesh @abertsch72
Tweet media one
Tweet media two
Tweet media three
2
3
31
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
Tweet media one
0
2
33
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
I will be giving an invited talk at the @USC_ISI @nlp_usc Seminar this Thursday (11am PT)- the abstract and livestream link can be found here:
1
3
31
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
If text-davinci-001 is a rough approximation of the model reported in the NeurIPS 2020 paper, and text-davinci-002 is ~InstructGPT in the 2022 preprint, then what is just "davinci"? 🤯 Trying to reproduce results from a time before this naming existed.
5
3
30
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
“Reframing Human-AI Collaboration for Generating Free-Text Explanations” with @jmhessel @swabha @mark_riedl @YejinChoinka. Paper: Code/data:
1
7
31
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
I will also highlight 2 recent EMNLP papers if you're curious what I've been up to recently:
- Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy
- Editing Common Sense in Transformers
0
2
28
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
The ACL Anthology goes down the *one day* I actually need anthology-version PDFs for an application 😅
2
1
27
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
, human-AI collaboration, or general human understanding of LM behavior
2. text generation, sampling, uncertainty estimation to understand & improve black-box models
3. mechanistic/bottom-up understanding of models beyond specific tasks
🧵2/2 Please retweet & share!
2
3
28
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Enjoyed presenting at #emnlp2019 in Hong Kong last week. Thanks @ICatGT @gtsga for travel support
Tweet media one
Tweet media two
Tweet media three
Tweet media four
1
2
27
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
This talk is today 4:40pm in Pier 4! (to the left of the hall with the sponsorship booths on the main hotel side of the venue).
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
I'll be at #ACL2023NLP in Toronto next week! Would love to meet/catch up with people- please reach out. I'm giving an invited talk "Two Views of LM Interpretability" at the NLRSE workshop Thursday at 4:40pm. Come hear my thoughts on prompting & mechanistic interpretability.
0
5
28
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
@jacobandreas @srush_nlp FWIW, I gave a talk at ACL in July on this topic. The framework in the talk doesn't capture everything, but I think it gives some credence as to why the terminology might be useful. "Two Views of LM Interpretability" (starting at 7:46):
0
2
28
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 months
Interested in notions of correctness in multiple choice datasets for commonsense tasks? Come check out our poster, happening now!
@PaltaShramay
Shramay Palta
3 months
Happening in less than 20 minutes at Board 217 in Riverfront! Drop by! #EMNLP2024 #NLProc
Tweet media one
0
2
28
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
@fhuszar You can email women-in-machine-learning@googlegroups.com -- a lot of people advertise PhD positions there. >4000 members.
1
0
26
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
7 months
Teaching and measuring LM noncompliance should be about more than just safety and outright refusal -- check out our paper for our taxonomy, dataset, and lots of benchmarking results!
@faeze_brh
Faeze Brahman
7 months
🤖 When and how should AI models not comply with user requests? Our latest work with @shocheen at @allen_ai dives into this question, expanding the scope of model noncompliance beyond just refusing "unsafe" queries. 1/n🧵 #LLMs #refusal #noncompliance #responsible_ai
Tweet media one
0
4
27
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
.@emnlpmeeting Can you clarify #EMNLP2023 "presenter" vs. "non-presenter" registration costs? Does every author on a paper have to register as a presenter, just the presenting author(s), or just one author per paper?
2
2
24
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
We’re right at the entrance (2A)
Tweet media one
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
Have you ever wondered whether localization and ROME/MEMIT-style model editing can work for tasks beyond factual recall (like commonsense plausibility prediction)? If so, come to our poster tomorrow 9-10:30am for our paper "Editing Common Sense in Transformers" #EMNLP2023 😃
Tweet media one
Tweet media two
0
4
25
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
I've so enjoyed my internship at AI2 @ai2_allennlp with Noah, @anmarasovic and team! Consider applying:
1
0
24
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
6 months
TIL about Baker Island, and now I can't stop imagining how much easier my life would be if I moved there 😂
Tweet media one
2
1
25
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
6 years
A great piece of scientific writing for a non-technical audience (& exciting developments for ML from #NeurIPS2018). Imagining how much more accessible the field would be if all published papers had this kind of intuitive breakdown. @techreview @_KarenHao.
0
10
24
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 months
Help: how can I search conference proceedings in the ACL Anthology by *track*? Like how the orals and posters are nicely organized at the physical conference? @aclanthology
10
3
24
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Lectured on attention and transformers for NLP applications in @GeorgiaTech's Deep Learning course last Thursday! Slides available here: 1/2.
1
5
24
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
As a side note, it's so difficult to do research based on an API with poor documentation.
2
0
22
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
An updated & more condensed (10-page) version is available here: Camera-ready coming soon! w/ @anmarasovic @ai2_allennlp.
@mark_riedl
Mark Riedl
4 years
Interested in getting started in Explainable NLP (ExNLP)? This paper by @sarahwiegreffe reviews the datasets. 11 pages of references! To appear in the NeurIPS 2021 Benchmarks and Datasets track.
Tweet media one
0
10
21
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
Nice that NAACL shared updated reviews, but here's what's unfortunate: those who had reasonable scores did not withdraw and/or submit an abstract to ACL; if rejected will have to wait months to resubmit. Those with decisively bad scores were able to resubmit almost immediately.
2
0
20
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
To participate in the #PeopleofNLProc series we're running for #NAACL2021 publicity, fill out the Google form here:
@naaclmeeting
NAACL HLT 2025
4 years
Our first #PeopleofNLProc researcher is Vered Shwartz @VeredShwartz. Vered's Bio: I'm a postdoc at AI2 and the University of Washington. I got my PhD from Bar-Ilan University. Besides research, I like to work out, feed birds, and listen to audiobooks. 1/4
Tweet media one
2
6
20
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
with @mark_riedl, advisor extraordinaire
Tweet media one
Tweet media two
0
0
19
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
Trending topic because of AI influencers, or the #EMNLP2023 anonymity deadline? 🤔😅
Tweet media one
1
0
19
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
6 years
Passed my qualifying exam! Woohoo 🎉
5
0
18
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
New paper at the #NAACL2021 narrative understanding workshop.
@mark_riedl
Mark Riedl
4 years
5. Inferring the Reader: Guiding Automated Story Generation with Commonsense Reasoning. @beckypeng6 @sarahwiegreffe @Sylvia_Sparkle. Using commonsense reasoning to guide a neural story generator. (We'll have more to say about the importance of _reader models_ later, stay tuned).
2
0
17
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Come see my talk at #EMNLP this Tuesday; I’ll be re-framing some of the takeaways of our work as a direct contribution to the faithful explainability literature.
@yuvalpi
Yuval Pinter
5 years
@sarahwiegreffe @alon_jacovi To be stated more clearly in the talk :) Tuesday 10:48 session 1A, come one come all!
3
3
17
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Check out our paper at ACL! Q&A sessions are Tuesday at 8am and 5pm ET.
1
0
16
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
10 months
Not me thinking ACL was hosting a climbing competition 🧗⛰️
Tweet media one
0
0
16
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Super cool app find of the day (or maybe I'm behind the times): @MathpixApp Lets you screenshot rendered LaTeX in a PDF, and converts it to raw LaTeX for you (with surprising accuracy!).
1
1
14
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Excited to see that "Attention is not not Explanation" is the #2 top recent paper of the past month on Arxiv Sanity Preserver! @yuvalpi.
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Happy to announce “Attention is not not Explanation”, accepted to #emnlp2019! Work by myself and @yuvalpi. 1/n
0
2
15
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
(Also-- this is the general application for Aristo and we will review them all, but if you're specifically interested in interpretability please highlight that in your application and/or reach out to me directly!).
2
0
14
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
I want to speak out in solidarity against the explicit symbols of racism we've seen so clearly in the last few weeks. I am learning that these are representations of realities that many of my friends experience on a daily basis, something that I was naively previously unaware of.
0
0
15
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
6 months
Good take.
@andreas_madsen
Andreas Madsen
7 months
After the Mechanistic Interpretability workshop I feel tempted to make my 2024 statement early: There are no low hanging fruits in interpretability. Everything requires so much skepticism, critical thinking, and is mega hard to properly validate.
0
0
14
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
6 months
Go work with Tuhin! 😄
@TuhinChakr
Tuhin Chakrabarty
6 months
📣 I will be in New York 🗽 for the foreseeable future, and join @stonybrooku @sbucompsc @SUNY as an Assistant Professor in the Fall of 2025. I plan to recruit 1-2 PhD students starting next fall. Come tackle exciting problems with me on Human Centered AI :-)
Tweet media one
1
2
13
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
Reading a JMLR paper from 2010:
1
0
13
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
Really excited for this effort :)
@allen_ai
Ai2
2 years
Today we're thrilled to announce our new undertaking to collaboratively build the best open language model in the world: AI2 OLMo. Uniquely open, 70B parameters, coming early 2024 – join us!
0
0
13
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
@AndrewLampinen Very cool work! You might be interested in our paper (to appear at NAACL) on generating few-shot free-text explanations from GPT-3 for passing crowdworker judgements of explanation acceptability. w/ @swabhz @jmhessel @mark_riedl @YejinChoinka
1
0
13
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
Guess I need to start scheduling time for "being angry at the political divides now characterizing the interpretability research community" on my weekly calendar 🤔😪
2
0
13
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
I hope we continue producing this resource. It's a great way to get up to speed on a wider scope of papers than I typically read.
0
0
11
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
6 years
@theodunayo @jmhessel @alethioguy @yuvalpi @RTomMcCoy @ACL2019_Italy My favorite hit has to be “I’ll never break your gradient flow” by the Backprop Boys.
1
0
12
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
I complained to @USPS that I haven't received any mail in over a month. After not having heard back via phone or email as I was told, I log in online to see that they have sent me a *letter* regarding my case. 🤦‍♀️❓❓
1
0
11
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
In today's work-from-home saga: spilled milk & cereal all over myself + desk + chair. Laptop was spared 🙏 Multitasking is hard, people! 😆
1
0
12
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
We had ~20 hours of sunlight, which also meant seeing some alpenglow 🌄
Tweet media one
Tweet media two
Tweet media three
Tweet media four
0
0
12
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
2 years
@janleike Oh wow, thanks for this insight. I don't see that model listed in my view of the API, so I assume I'd have to request access.
2
0
11
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
6 years
Shout into the void: looking for some degree/career planning advice, preferably from someone in broad ML/NLP academia. Does anyone know of anyone willing to lend me ~30 minutes of their time?
1
3
11
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
1 year
Ooooh this looks exciting!
@tdietterich
Thomas G. Dietterich
1 year
#ICML will have a Position Paper track. "The goal of this track is to highlight papers that stimulate (productive, civil) discussion on timely topics that need our community’s input" #AI. Read more here:
0
0
11
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
3 years
Takeaways:
- GPT-3 shows potential for automatically creating free-text explanation datasets.
- Acceptability can be improved with high-quality prompts and trained filter model operating on over-generations.
- Despite its subjectivity, crowdworker acceptability can be modeled.
🔚
4
2
10
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 months
In my experience, like model editing, steering is extremely hyper-parameter sensitive, and we don't have good grounding for selecting meaningful coefficients that keep model representations in-distribution and prevent breaking models' general capabilities.
1
1
10
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
4 years
@tallinzen Yeah, time isn't infinite and the research community's rewards are often misaligned with incentivizing thorough, high-quality research.
1
0
9
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
5 years
Had a great time presenting at #WiML2019 yesterday! I'll be around #NeurIPS2019 all week; come say hi if you see me.
Tweet media one
0
0
10
@sarahwiegreffe
Sarah Wiegreffe (on faculty job market!)
8 months
@jeremyphoward @aryaman2020 This is the age-old struggle between academia and industry. Anthropic is pushing a specific flavor of interpretability work that is attracting a lot of funding and attention from junior researchers & those outside the field. It is not the only viable (or existing) direction.
1
0
10