Sabrina J. Mielke

@sjmielke

Followers: 4,292
Following: 628
Media: 459
Statuses: 4,170

#NLProc #ML @PrescientDesign @Genentech ␥ 💻🧩🔡🔬🏳️‍🌈 ␥ she/her

Brooklyn, NY
Joined September 2012
Pinned Tweet
@sjmielke
Sabrina J. Mielke
4 years
You've seen JAX differentiate numpy one-liners, and it looked cool, but... how does one build actual neural net models like we know them from PyTorch or Tensorflow 2.0? “From PyTorch to JAX: towards neural net frameworks that purify stateful code”
12
264
1K
@sjmielke
Sabrina J. Mielke
3 years
Tweet media one
0
87
1K
@sjmielke
Sabrina J. Mielke
3 years
Tokenization—the least interesting #NLProc topic? Hell no! We, members of the @BigScienceW tokenization group are proud to present: ✨Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP✨ What's in it? [1/10]
Tweet media one
15
139
702
@sjmielke
Sabrina J. Mielke
4 years
I finally watched all the talks I wanted to, ended up importing 56 papers to my bib, and now present to you: 🎉 My 13 favorite papers (sorted alphabetically) at #EMNLP2020 ! 🔥 [1/15]
7
128
662
@sjmielke
Sabrina J. Mielke
3 years
Tweet media one
0
21
407
@sjmielke
Sabrina J. Mielke
4 years
"Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics too!" EMNLP short paper by @suzyahyah , @adalmia96 , and me! The idea is obvious, but never worked as well as LDA... until now! How? [1/6]
Tweet media one
7
80
417
@sjmielke
Sabrina J. Mielke
11 months
🎉 PhD defense done! ✅ Job? 🙏👀 Know a place for an #ML person who's been all over the place (cf. messy Fig. 2) in #NLProc around language models? After thesis revisions, I'm looking for an industry job, either remote or, better yet, based in NYC!
Tweet media one
Tweet media two
40
32
370
@sjmielke
Sabrina J. Mielke
4 years
Building k-means in comfortable python-JAX and it's 12% faster on CPU than the half-written-in-C SciPy implementation, while being much nicer to look at 🤩 #TheLittleThings #ShinyNewToysForDaysInIsolation
6
38
344
@sjmielke
Sabrina J. Mielke
4 years
Playing with jit in JAX, sampling a length-500 seq from an LSTM-LM (avg of 10x): - no jit: 17.9s - LSTM recurrence jit: 2.47s - overall (layers&sampling) recurrence jit: 0.68s runtime ...which is roughly as fast as un-jit-ed PyTorch, except you pay 0.5s jitting time. But...
3
25
246
@sjmielke
Sabrina J. Mielke
3 years
Excited to join @huggingface part-time to do some fun research with @srush_nlp & @YJernite ! 🤗 (I would post an office photo, but alas... soon 🤞)
Tweet media one
4
5
237
@sjmielke
Sabrina J. Mielke
3 years
Nature is healing.
1
21
232
@sjmielke
Sabrina J. Mielke
4 years
With @iclr_conf #ICLR2020 over and a bit of sleep under my belt, I'd like to give my short summary of a truly great event---and offer a list of the papers I enjoyed seeing (for those who are into that kind of thing).
4
49
226
@sjmielke
Sabrina J. Mielke
4 years
Hot take: every academic today should learn at least basic Pinyin. It’s not hard to not completely butcher names of students you’re announcing! (This old gripe presented by: senior researcher asking people questions in a zoom call and nobody knows who he means.)
10
22
228
@sjmielke
Sabrina J. Mielke
4 years
Industry internships when you're a grad student.
@PDLComics
poorly drawn lines
4 years
a coin
Tweet media one
24
2K
9K
2
4
206
@sjmielke
Sabrina J. Mielke
4 years
Excited and relieved that I got to start my internship with @facebookai under Emily Dinan today! 🎉🤩🙏 (...even if FAIR NYC turned into "FAIR Baltimore" 😅) (unrelated picture: the one good use for the touchbar that my intern laptop comes with...)
Tweet media one
6
7
190
@sjmielke
Sabrina J. Mielke
4 years
Excited to share the project that's been carrying me through much of 2020: "Linguistic calibration through metacognition: aligning dialogue agent responses with expected correctness" 🤖📊 w/ Arthur Szlam, Y-Lan Boureau, and Emily Dinan ( @em_dinan ) [1/8]
Tweet media one
1
26
184
@sjmielke
Sabrina J. Mielke
5 years
tl;dr: My name is Sabrina and my pronouns are she/her -- I'm trans. 🏳️‍⚧️ Wish me luck and help me kick ass! 💪 Wanna know what exactly that means, where things will go, what voice training is, and why it's so damn hard? I wrote a little blog post:
15
6
179
@sjmielke
Sabrina J. Mielke
6 years
We now have a general-purpose open anonymous preprint server on OpenReview: A good way for "fast science" while respecting double blind of *ACL venues?
9
72
172
@sjmielke
Sabrina J. Mielke
3 years
The recording of this #JAX talk is up here now: It ended up, yes, based on the blogpost---but more concise and punchy... and including an example of flax' new linen API! slides notebook
@sjmielke
Sabrina J. Mielke
3 years
Catch me #talk about my first steps with #JAX tomorrow, 2021-07-01: "From stateful code to purified JAX: how to build your neural net framework" 9.00am-9.30am PST / 12.00pm-12.30pm EST / 6.00pm-6.30pm CEST (based on , see: )
Tweet media one
0
2
31
0
20
151
@sjmielke
Sabrina J. Mielke
5 years
"What Kind of Language Is Hard to Language-Model?" (ACL 2019 long paper with @ryandcotterell , @wellformedness , @BrianRhoArc , and @adveisner ) A fancy regression model! BPE and char-RNNLMs! Two corpora spanning 69 languages! And a surprise ILP! Thread:
Tweet media one
3
32
149
@sjmielke
Sabrina J. Mielke
4 years
undergrads joining the lab for a bit of research and quickly outpacing the PhD student like
2
22
137
@sjmielke
Sabrina J. Mielke
2 months
The doctoral hooding ceremony finally concludes this long PhD chapter of my life. What a trip! The horrors persist, but so do I.
Tweet media one
Tweet media two
Tweet media three
10
2
128
@sjmielke
Sabrina J. Mielke
5 years
This is the kind of paper I love reading: clear exposition, a simple method, clever visualizations, and a ton of experiments, especially those that say "you would've thought this might work as well, but we tried and it doesn't, here are the numbers." Just a great read all around.
@gneubig
Graham Neubig
6 years
#ICLR2019 paper "Lagging Inference Networks and Posterior Collapse in VAEs". VAEs collapse to trivial solutions; we find this is because the inference network is poor at the beginning of training, then propose a simple solution of "aggressive update":
Tweet media one
Tweet media two
Tweet media three
Tweet media four
2
61
255
0
24
129
@sjmielke
Sabrina J. Mielke
4 years
#ICML2020 mentoring session w/ @shakir_za : Training yourself to learn to write is something people focus on less than learning to code---even though it's probably more important to be successful in academia! Write a blog, write essays if just for yourself, practice & judge! 💯
1
18
123
@sjmielke
Sabrina J. Mielke
2 years
Finally finished drawing, coloring, and animating my artisanal 100% hand-drawn organic #NAACL2022 slides 😌 Check them out live in just about 9 hours in session 1E, Elwha B (5th floor) and find out why the bull is shitting... #NLProc #ConferencesMeanSleepDeprivation
Tweet media one
2
4
124
@sjmielke
Sabrina J. Mielke
3 years
For this #TDOV ... I'm visibly celebrating passing my qualifying exams! 🎉😌 Finally a PhD *candidate*---all but dissertation! ✨ (enjoy the plot of my heart rate over the 2h exam 😅)
Tweet media one
12
0
123
@sjmielke
Sabrina J. Mielke
5 years
Instead of: "Markov Random Field (MRF) (Koller and Friedman, 2009)" Try: "Markov Random Field (MRF; Koller and Friedman, 2009)" Using the optional arguments for citep: \citep[MRF;][]{KolFri09} #LaTeX #reviewing #ACL2019 #ItDoesLookNicer
5
13
110
@sjmielke
Sabrina J. Mielke
6 years
“Spell Once, Summon Anywhere: A Two-Level Open-Vocabulary Language Model”, with @adveisner , updated version on arXiv: We augment a word-level RNN with a character-dependent prior on embeddings to build a SOTA open-vocab LM. What does that look like? (1/5)
Tweet media one
2
18
105
@sjmielke
Sabrina J. Mielke
4 years
My first time being *last author* paper, accepted at #emnlp2020 ! 🥳🎉😭 Incredibly proud of @suzyahyah and @adalmia96 who made it happen---and trusted me to advise them! 🥰 First paper at EMNLP for all of us, funnily enough 😅 Details to follow once we clean it up real nice 💪
Tweet media one
3
5
103
@sjmielke
Sabrina J. Mielke
4 years
My #acl2020nlp in numbers: 25 pages of scratchpad Google doc 100 talks watched or skimmed 59 papers deemed cool 110 public RocketChat channels joined 53 people in DMs Actual summary later maybe, first: sleep. 😅 Good thing there’s exactly one day of rest before #ICML2020 🎉
2
0
104
@sjmielke
Sabrina J. Mielke
4 years
From sketches to figures: what do my illustrations look like when I first draw them (by hand) vs. when they're complete (in TikZ or Inkscape)? Lighthearted thread time! :) [1/11]
Tweet media one
4
6
102
@sjmielke
Sabrina J. Mielke
4 years
My first #ICML2020 was different from my n-th #acl2020nlp , but, or perhaps because of that, I did try to look for interesting papers that I could relate to but that might still teach me something new! Papers, in roughly chronological order---each with a short summary :) [1/42]
3
18
97
@sjmielke
Sabrina J. Mielke
4 years
You wanna know how I wasted half a year building a complex Bayesian heteroscedastic model with HMC inference... ...only for it to give essentially the same result as averaging all my data? 😬 Happening tomorrow, time to finish up those slides 💪
@NLPwithFriends
NLP with Friends
4 years
A new month!! We are very excited to announce our first September speaker! 🗣Sabrina J Mielke ( @sjmielke ), talking with us about "Fair comparisons for generative language models—with a bit of Information Theory" 🗓Sept 2nd, 14:00 UTC 📝Sign up here!
4
14
136
3
3
95
@sjmielke
Sabrina J. Mielke
4 years
Thinking about applying to PhDs? 🙋 I'll talk about my academic story at #acl2020nlp : 1️⃣ Sun, Jul 4, 7am PT (panel for undergrads, moderated by @arnaik19 and @khyathi_chandu ) 2️⃣ Mon, Jul 5, 9am PT (mentoring session with @sebgehr and @ssgrn ) ...or message me anytime! 😊
Tweet media one
7
9
90
@sjmielke
Sabrina J. Mielke
4 years
🚨Nov 16--Dec 18: a month of #NLProc / #ML conferences 🚨 EMNLP: Nov 16-18 CoNLL: Nov 19-20 AACL: Dec 4-7 NeurIPS: Dec 6-12 COLING: Dec 8-13 INLG: Dec 15-18 What are you doing to prepare for an exciting but probably super exhausting month? 😨
6
8
85
@sjmielke
Sabrina J. Mielke
1 month
Excited about Day 1 at Genentech's Prescient Design accelerator! Couldn't be more stoked to work on LLMs for pharmaceutical drug discovery (aka curing cancer) with some really smart people 👩‍💻👩‍🔬
Tweet media one
3
5
83
@sjmielke
Sabrina J. Mielke
7 years
NLP/CL Twitter Megathread - consolidated and readable: ! Interesting discussions about syntax, engineering and cake.
Tweet media one
7
26
77
@sjmielke
Sabrina J. Mielke
1 year
in today's episode of "people of your class don't belong in academia" i present: "two missed paychecks aren't a problem, right?" completely related: i'm looking for #ml / #nlproc industry jobs starting in november! lmk if you have openings or leads :)
Tweet media one
6
5
78
@sjmielke
Sabrina J. Mielke
3 years
new hair who dis 💚
Tweet media one
Tweet media two
4
0
78
@sjmielke
Sabrina J. Mielke
2 years
end of a beautiful week in the Toronto @CohereAI office! can't believe how much i missed working in an actual office with other people you can talk to 😅 (and can't believe my internship is almost over...)
Tweet media one
0
3
76
@sjmielke
Sabrina J. Mielke
4 years
#NLProc in a nutshell.
@yoavgo
(((ل()(ل() 'yoav))))👾
4 years
@GuillaumeLample @MaLachaux @b_roziere @LowikChanussot the work is very cool, but the title is extremely over-claiming.
5
2
31
1
4
75
@sjmielke
Sabrina J. Mielke
3 years
it's called data science
Tweet media one
3
8
74
@sjmielke
Sabrina J. Mielke
4 years
First time reviewing for ICML seems to have gone well! I guess it's silly but this little cute PDF really is making my day 😊
Tweet media one
1
0
74
@sjmielke
Sabrina J. Mielke
4 years
"Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics too!" A hot take in form of a brand-new (read: unreviewed) preprint, by Suzanna Sia ( @suzyahyah ), Ayush Dalmia ( @adalmia96 ), and me. Feedback welcome!
Tweet media one
4
15
70
@sjmielke
Sabrina J. Mielke
4 years
My first own desktop PC, assembled in 2h from self-selected pieces 😌 Project $1000 PowerPoint teaching machine is making progress 😍 (do I need a GTX 1660 for PowerPoint you ask? shhhhhhhh)
Tweet media one
3
0
72
@sjmielke
Sabrina J. Mielke
3 years
belated new year’s resolution: dare to wear the femme clothes @VasundharaNLP gave me 🥺❤️
Tweet media one
3
0
70
@sjmielke
Sabrina J. Mielke
8 years
Natural Language Processing == English Language Processing? 69% of ACL 2016 long papers only evaluate on English...
Tweet media one
3
64
66
@sjmielke
Sabrina J. Mielke
4 years
Happening now 😍
Tweet media one
@NLPwithFriends
NLP with Friends
4 years
A new month!! We are very excited to announce our first September speaker! 🗣Sabrina J Mielke ( @sjmielke ), talking with us about "Fair comparisons for generative language models—with a bit of Information Theory" 🗓Sept 2nd, 14:00 UTC 📝Sign up here!
4
14
136
6
2
66
@sjmielke
Sabrina J. Mielke
2 years
Perfect opportunity to announce that I'll be interning with @CohereAI this summer, performing #JAX magic with @_joanna_yoo and team! Very excited 😍
@cohere
cohere
2 years
Cohere, @OpenAI & @AI21Labs have announced a set of best practices for responsible deployment of large language models. The joint statement is a first step towards fostering an industry-wide conversation to bring alignment to the community. #AI #aiforgood
7
84
280
4
1
67
@sjmielke
Sabrina J. Mielke
5 years
How easy it is to run into numerical issues, today: sampling from a Beta/Dirichlet in @PyTorch . Why do very *low* concentrations yield these odd samples that look like they came from *infinite* concentrations? And why is this not a bug per se? Time to find out! (Thread)
Tweet media one
3
12
64
@sjmielke
Sabrina J. Mielke
4 years
Beam search gives boring answers... A) "...unlike humans!" (Holtzman+, 2019, "The Curious Case of Neural Text Degeneration", ) B) "...just like humans!" (Meister+, 2020, "If beam search is the answer, what was the question?", ) 🧐
2
10
66
@sjmielke
Sabrina J. Mielke
4 years
did any of you ever travel a few hours by train to an airport only to realize you forgot your passport? cause i just did
14
0
65
@sjmielke
Sabrina J. Mielke
4 years
Proud to have worked on this last summer @GoogleAI : "Dakshina:" 2GB of transliterated sentences & transliteration lexica (preserving task-inherent ambiguity, giving you frequencies for variants!) in 12 South Asian languages! ➡️ ➡️
@BrianRhoArc
Brian Roark
4 years
We publicly released a dataset last month and just now getting around to announcing it here. The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages: 1/5
9
112
413
1
4
65
@sjmielke
Sabrina J. Mielke
5 years
End of a great time ⁦ @GoogleAI NYC,⁩ as the first fellow research interns leave… (ltr: Sirui, ⁦ @nouhadziri ⁩, ⁦ @rajarshi_nitc ⁩, ⁦ @agarwal_oshin ⁩, ⁦ @nitish_gup ⁩, ⁦me⁩, Yoon, ⁦ @hila_gonen ⁩, ⁦ @kalpeshk2011 ⁩, ⁦ @anjalie_f ⁩, Dongxu)
Tweet media one
0
4
64
@sjmielke
Sabrina J. Mielke
4 years
Any JAX folks who are willing to read a draft for a tutorial on how to get from differentiating numpy one-liners to actually building models, ending in understanding why frameworks like flax or haiku do the things they do? LMK if you're interested! :)
10
2
62
@sjmielke
Sabrina J. Mielke
5 months
Great opportunity to read our 2021 survey on tokenization in the modern age ;) "Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP"
@karpathy
Andrej Karpathy
5 months
We will see that a lot of weird behaviors and problems of LLMs actually trace back to tokenization. We'll go through a number of these issues, discuss why tokenization is at fault, and why someone out there ideally finds a way to delete this stage entirely.
Tweet media one
61
301
3K
1
9
61
@sjmielke
Sabrina J. Mielke
6 years
Today in LaTeX/TikZ packages you never knew you needed: TikZducks! /
Tweet media one
Tweet media two
0
16
59
@sjmielke
Sabrina J. Mielke
6 years
Does the best number of BPE merges for language modeling¹ depend on the language²? Yes! ______________ ¹ Why use BPE for LMs? It's a surprisingly good open-vocab baseline, see . ² Ooh, multiple languages? Yup, this is follow-up to !
Tweet media one
@sjmielke
Sabrina J. Mielke
6 years
Our poster for "Are All Languages Equally Hard to Language-Model?" at NAACL :) Short paper with @_shrdlu_ @adveisner @BrianRhoArc : . Swing by at the Sunday 2pm poster session in Elite Hall B!
Tweet media one
3
10
38
4
15
60
@sjmielke
Sabrina J. Mielke
3 years
So it begins. Changing my Twitter handle to FOMO-Sabrina mode. 😅 #EMNLP2021
2
0
59
@sjmielke
Sabrina J. Mielke
4 years
The #acl2020nlp tutorial: "Integrating ethics into the NLP curriculum" by @emilymbender , @dirk_hovy , and @XandaSchofield ...was AMAZING---and half as an exercise for me and half to be useful to y’all I’ll briefly summarize my 3pgs of notes! 😅 [1/20]
@sjmielke
Sabrina J. Mielke
4 years
#acl2020nlp T7: "Integrating ethics into the NLP curriculum" looks perfect for what I need to learn right now 😱 Really glad I took that intro to university teaching class last semester, so I know good ol' Bloom's taxonomy... But I didn't know it had an updated version! 🧐🤓
1
2
17
3
16
59
@sjmielke
Sabrina J. Mielke
4 years
Papers that give new names to old things citing them as "Inspired by [exactly the same thing] of [citation]" or "These techniques bear a resemblance to ours" even though they're literally the same 😡 Just be honest and say you *use* their idea, it only makes your paper stronger!
4
2
58
@sjmielke
Sabrina J. Mielke
4 years
Another 23% on top if you *parallelize* the (usually 20) iterations of the algorithm to find the best local minimum. How? Just `jax.vmap` over the split PRNG keys:
Tweet media one
2
3
56
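The code in that tweet was attached as an image and is not recoverable here. As a rough sketch of the idea only (not the author's implementation), the following runs several k-means restarts in parallel by mapping one run over split PRNG keys with `jax.vmap` and keeping the lowest-loss result. The function names `kmeans_once` and `kmeans_restarts`, the toy data, and the reading of "(usually 20) iterations" as 20 random restarts are all assumptions made for illustration.

```python
import jax
import jax.numpy as jnp


def kmeans_once(key, points, k=3, n_iters=20):
    """One k-means run from a random init; returns (centroids, loss). Hypothetical helper."""
    # Initialize centroids by sampling k distinct data points with this run's key.
    init_idx = jax.random.choice(key, points.shape[0], shape=(k,), replace=False)
    centroids = points[init_idx]

    def step(_, centroids):
        # Squared distances between every point and every centroid: shape (n, k).
        dists = jnp.sum((points[:, None, :] - centroids[None, :, :]) ** 2, axis=-1)
        one_hot = jax.nn.one_hot(jnp.argmin(dists, axis=1), k)  # (n, k) assignments
        counts = one_hot.sum(axis=0)                            # (k,) cluster sizes
        sums = one_hot.T @ points                               # (k, d) per-cluster sums
        means = sums / jnp.maximum(counts, 1.0)[:, None]
        # Keep the old centroid if a cluster ends up empty.
        return jnp.where(counts[:, None] > 0, means, centroids)

    centroids = jax.lax.fori_loop(0, n_iters, step, centroids)
    dists = jnp.sum((points[:, None, :] - centroids[None, :, :]) ** 2, axis=-1)
    loss = jnp.sum(jnp.min(dists, axis=1))  # within-cluster sum of squares
    return centroids, loss


def kmeans_restarts(key, points, n_restarts=20):
    """Run all restarts in parallel by vmapping one run over split PRNG keys."""
    keys = jax.random.split(key, n_restarts)
    all_centroids, all_losses = jax.vmap(lambda run_key: kmeans_once(run_key, points))(keys)
    best = jnp.argmin(all_losses)
    return all_centroids[best], all_losses[best], all_losses


# Toy data: three well-separated 2-D blobs of 50 points each (illustrative only).
data_key, run_key = jax.random.split(jax.random.PRNGKey(0))
centers = jnp.array([[0.0, 0.0], [5.0, 5.0], [-5.0, 5.0]])
points = (centers[:, None, :] + 0.3 * jax.random.normal(data_key, (3, 50, 2))).reshape(-1, 2)
best_centroids, best_loss, all_losses = kmeans_restarts(run_key, points)
```

Each restart differs only in its key, so the per-run function stays pure and `jax.vmap` can batch the restarts without any change to the single-run code.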
@sjmielke
Sabrina J. Mielke
4 years
Just found this gem in my old bookmarks: "FuckIt․py" "Some code has an error? Fuck it." "Still getting errors? Chain fuckit calls. This module is like violence: if it doesn't work, you just need more of it."
Tweet media one
Tweet media two
2
8
53
@sjmielke
Sabrina J. Mielke
3 years
4/20 blazing the second shot today (ignoring a tacl rejection)
Tweet media one
2
1
55
@sjmielke
Sabrina J. Mielke
4 years
FWIW, speedup by batching using vmap (which of course took me an hour or two to really get working for my hacky LSTM-LM...): - CPU, batchsize 10: 3.14sec - CPU, batchsize 10000: 2:46min - TPU, batchsize 10: 2.36s - TPU, batchsize 10000: 7.6s!!! (all on @GoogleColab 's free tier)
2
3
54
@sjmielke
Sabrina J. Mielke
6 years
Paper accepted and coming to #AAAI2019 in Honolulu! 🎉 Hope to see many of you there :)
@sjmielke
Sabrina J. Mielke
6 years
“Spell Once, Summon Anywhere: A Two-Level Open-Vocabulary Language Model”, with @adveisner , updated version on arXiv: We augment a word-level RNN with a character-dependent prior on embeddings to build a SOTA open-vocab LM. What does that look like? (1/5)
Tweet media one
2
18
105
1
7
55
@sjmielke
Sabrina J. Mielke
5 years
Exciting: my first paper withdrawal! (It just isn't ready enough for EMNLP.) Definitely unrelated: I've now started my @GoogleAI internship in NYC with @BrianRhoArc and team. If you want to see me wear a silly propeller hat, let's meet up! :D
0
2
52
@sjmielke
Sabrina J. Mielke
4 years
\newcommand{\textt}[1]{\texttt{ #1 }}
1
1
50
@sjmielke
Sabrina J. Mielke
3 months
i know i said i didn't care for the piece of paper but i guess i ended up framing it anyway :)
Tweet media one
2
0
51
@sjmielke
Sabrina J. Mielke
4 years
Final grades submitted; my first semester of teaching my very own class is done! 😥🎉 And even with challenging uncurved exams and it being 2020, more than half the class is walking out with an A and not a single person came even close to failing 😍 Hopkins undergrads, y'all...
1
0
52
@sjmielke
Sabrina J. Mielke
4 years
I made it after all because Rachel and @CA_Pocasangre are goddamn *saints* and drove up to Newark with a rental car to bring me my passport! 😍😭 I truly don't deserve my friends. Well, time to finish this paper on beautiful German trains 🇩🇪 Home...
Tweet media one
Tweet media two
Tweet media three
@sjmielke
Sabrina J. Mielke
4 years
did any of you ever travel a few hours by train to an airport only to realize you forgot your passport? cause i just did
14
0
65
2
0
52
@sjmielke
Sabrina J. Mielke
3 years
✍️ How do you write a good #NLProc paper? What is the lifecycle of a conference submission? And who is this Reviewer 2? @VasundharaNLP and I will demystify the process and share advice on writing extended abstracts at WiNLP's #EACL2021 event, which will also have a cool panel 👇
@WiNLPWorkshop
WiNLP
3 years
Less than three weeks until our #WiNLP satellite event at #EACL2021 on April 19! To sign up, read about our awesome panel on underrepresented languages in #nlproc , and find out more about our extended writing workshop, visit our page:
0
6
13
0
8
52
@sjmielke
Sabrina J. Mielke
4 years
I feel like I'm tweeting spoilers but Kathy McKeown's #acl2020nlp keynote is EXTREMELY GOOD. Whenever you're ready.
8
2
49
@sjmielke
Sabrina J. Mielke
3 years
I've been selected to attend the #MIT #EECS #RisingStars2021 workshop today and yesterday (check me and my poster today at 12 out: ) and... Wow, this has been so much more useful than I thought for me personally in just so many ways!
2
1
49
@sjmielke
Sabrina J. Mielke
3 years
"eh reviews are only due in march"
0
2
48
@sjmielke
Sabrina J. Mielke
4 years
In theory I hate pretentious stuff like this in papers. 😡 ...but of course I've done this particular one plenty of times myself in the past 😬
1
3
49
@sjmielke
Sabrina J. Mielke
2 years
going through the biggest and most painful breakup of my life 💔 please recommend fun media and new activities to fill the massive hole in my heart 🥲 also advice on finding my own place to stay beyond my current temporary shelter with leaking roof 😬
14
0
48
@sjmielke
Sabrina J. Mielke
3 years
Regular reminder: you can *mute* words or accounts for as little as just a day! ☝️ It's your timeline and it's okay to make it work for you! 💕 (Maybe this is obvious to you, but it took me a long time to stop feeling bad about it)
@sjmielke
Sabrina J. Mielke
3 years
Tweet media one
0
0
17
2
0
46
@sjmielke
Sabrina J. Mielke
2 years
Making the promised hand-drawn slides for my Monday morning #NAACL2022 talk on the flight to Seattle... (feat. biblically accurate chatbot face) ✍️🎨📱
Tweet media one
2
1
46
@sjmielke
Sabrina J. Mielke
4 years
Pronouns in bio, mask in profile picture. #NewProfilePic
Tweet media one
0
0
45
@sjmielke
Sabrina J. Mielke
8 years
"NLP == English Language Processing?", continued: Language diversity in ACL 2004 - 2016. A small blog post:
Tweet media one
3
29
43
@sjmielke
Sabrina J. Mielke
4 years
I ended my 14-day quarantine with a negative COVID test, so it's time to post pictures from my... 🛣 Fall 2020 US Roadtrip! 🇺🇸 Colorado, Wyoming, Montana, Idaho, Oregon, Washington 35 pictures from the 3000+ ones I took 😅
Tweet media one
1
0
45
@sjmielke
Sabrina J. Mielke
4 years
...jit the *entire* sampling computation (which takes 5 mins as opposed to .5 seconds) and you get: 0.08s!!😅
2
2
44
@sjmielke
Sabrina J. Mielke
4 years
Unpopular opinion: Google Slides is *good* actually---if you put in at least a little bit of effort. (coming from someone who's been obsessively beamer/TikZ-ing EVERYTHING for many years and is only slowly recovering)
7
0
43
@sjmielke
Sabrina J. Mielke
6 years
Excited to announce that I'll be spending the Summer at @GoogleAI in NYC with @BrianRhoArc and others doing cool language modeling research!
3
1
44
@sjmielke
Sabrina J. Mielke
3 years
Tweet media one
0
0
45
@sjmielke
Sabrina J. Mielke
4 years
Classic (not only) ML tool from today, but in the 1970s: The Singular Value Decomposition! "Today the SVD is widely used in scientific and engineering computation, but in 1976 the SVD was relatively unknown." Very 70s graphics, music, and narration 😍
Tweet media one
Tweet media two
Tweet media three
Tweet media four
5
10
42
@sjmielke
Sabrina J. Mielke
2 years
diff <(cat *.tex | grep -o -E '\\[a-z]*cite[a-z]*(\[[^]]*\])?(\[[^]]*\])?{[^}]+}' | sed 's/.*{\(.*\)}/\1/' | tr , '\n' | sort -u) <(grep @ ../*.bib | sed -r 's/\s*@[a-z]+\{//' | sed 's/,.*//' | sort -u) | grep '>' | sed 's/> //' # find unused bib entries (hacks upon hacks)
4
1
44
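For readers who find the shell one-liner above hard to follow, here is a hedged Python sketch of the same idea: collect every key cited in the TeX source, collect every entry key in the bibliography, and report the bibliography keys that are never cited. The function names and regular expressions are illustrative assumptions, not a translation checked against the original command, and the sketch works on strings rather than on `*.tex` and `../*.bib` files.

```python
import re

# Matches \cite, \citep, \citet, ... with up to two optional [..] arguments,
# and captures the comma-separated key list inside the braces.
CITE_RE = re.compile(r"\\[a-zA-Z]*cite[a-zA-Z]*\*?(?:\[[^\]]*\]){0,2}\{([^}]+)\}")

# Matches the start of a BibTeX entry such as "@article{key," and captures type and key.
BIB_RE = re.compile(r"^\s*@([a-zA-Z]+)\s*\{\s*([^,\s]+)\s*,", re.MULTILINE)

# Entry types that do not define citable keys.
NON_ENTRIES = {"string", "comment", "preamble"}


def cited_keys(tex_source):
    """Return the set of keys that appear in any cite-like command."""
    keys = set()
    for group in CITE_RE.findall(tex_source):
        keys.update(part.strip() for part in group.split(",") if part.strip())
    return keys


def bib_keys(bib_source):
    """Return the set of entry keys defined in the bibliography text."""
    return {key for kind, key in BIB_RE.findall(bib_source) if kind.lower() not in NON_ENTRIES}


def unused_bib_entries(tex_source, bib_source):
    """Keys defined in the bibliography but never cited, sorted alphabetically."""
    return sorted(bib_keys(bib_source) - cited_keys(tex_source))


# Small made-up example: "unused1" is defined but never cited.
tex = r"See \citep[MRF;][]{KolFri09} and \citet{a, b}."
bib = "@book{KolFri09,\n  title={x}\n}\n@article{a,\n}\n@inproceedings{unused1,\n}\n"
print(unused_bib_entries(tex, bib))  # ['unused1']
```

To run it on a real project, the two strings could be built by concatenating the contents of the relevant files before calling `unused_bib_entries`.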
@sjmielke
Sabrina J. Mielke
3 years
seeing x new papers on #arXiv every day like "< #NLProc task> with Transformers"
0
2
44
@sjmielke
Sabrina J. Mielke
4 years
On my way to #EMNLP2020 , checking the papers! Can't wait for Punta Cana! 🤩 (j/k heading back to our Baltimore bunker to weather the Thanksgiving pandemic storm 🥲)
Tweet media one
4
0
43
@sjmielke
Sabrina J. Mielke
3 years
Post-shower curly hair is getting closer to the ultimate transition goals... 😈👩‍🔬🏳️‍⚧️
Tweet media one
Tweet media two
3
0
42
@sjmielke
Sabrina J. Mielke
5 years
New #NLProc blog post: “Can you compare perplexity across different segmentations?” Short answer: Not immediately. Long answer: Yes, as long as you have equal denominators and the same support!
Tweet media one
2
10
43
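The "equal denominators" point in the perplexity tweet above can be written out in a few lines. This is a sketch in notation chosen here, not taken from the blog post: $p(x)$ is the probability a model assigns to a whole text $x$, $N_{\text{seg}}$ is the number of tokens $x$ has under a given segmentation, and $C$ is its number of characters.

```latex
% Perplexity under one particular segmentation divides by that segmentation's token count:
\mathrm{PPL}_{\text{seg}}(x) = \exp\!\left(-\frac{1}{N_{\text{seg}}}\,\log p(x)\right)

% Two segmentations of the same text generally give N_{\text{word}} \neq N_{\text{BPE}},
% so the two perplexities differ even when p(x) is identical.

% Dividing by a segmentation-independent count, e.g. characters, gives a comparable quantity:
\mathrm{BPC}(x) = -\frac{1}{C}\,\log_2 p(x)
```

The "same support" condition is the other half: the comparison is only meaningful if both models assign probability to the same set of possible strings, for example if neither collapses distinct strings into a shared unknown-word symbol.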
@sjmielke
Sabrina J. Mielke
4 years
Notebook: If you have opinions on good JAX code, I'd appreciate any feedback you have on it! :)
2
4
42
@sjmielke
Sabrina J. Mielke
3 years
This Page-1-table is one of the best pitches I've seen in a while. Excited to read why #BPEsucks in even more ways 😁 #NLProc
Tweet media one
1
8
43
@sjmielke
Sabrina J. Mielke
3 years
got to hang out with @annabelle_cs 's and @nsaphra 's four adorable foster kittens last night 🐾😸
Tweet media one
2
1
43
@sjmielke
Sabrina J. Mielke
5 years
Michael I. Jordan's NeurIPS 2005 tutorial "Dirichlet Processes, Chinese Restaurant Processes and All That" () is the clearest summary of Bayesian nonparametrics, inference, and DPs/HDPs in particular that I have seen. Beautiful!
1
6
41
@sjmielke
Sabrina J. Mielke
3 years
ICYMI: the recording of our tutorial is up! »How to write an #NLProc paper: from CfP to publication!« (presented by @VasundharaNLP & @sjmielke at a @WiNLPWorkshop event) 🎥 Recording: 🎥 Mirror: 📄 Slides:
Tweet media one
0
10
42
@sjmielke
Sabrina J. Mielke
4 years
tired: KN95 n-gram language modeling wired: KN95 masked language modeling
1
2
40