galvanize (gail weiss) πποΈ
@gail_w
Followers
1K
Following
18K
Media
754
Statuses
4K
Trying to get off of here, dms checked rarely. Better to send an email postdoc at EPFL, understanding neural networks. https://t.co/lnaKoR7vmc .
Joined March 2009
Seems like as good a time as ever to make (and maintain) a thread of publications, to keep them easy to find. First is extraction of DFAs from recurrent neural networks, using lstar. Challenge was equivalence queries.
We have a cool new algorithm for extracting automata from RNNs (LSTMs, GRUs. ). Turns out that for many simple languages, RNNs actually learns quite large and weird DFAs that have many blind-spots which our algo discovers. (w/ Gail Weiss, @yahave ).
1
6
32
My first ever email (correctly) addressing me as doctor went straight to junk, and then I couldn't attend my graduation on account of being in an entirely different country, but: hypers and onlookers, I am now a β¨ doctor β¨!. Thanks to @yoavgo and @yahave for bringing me here!
6
2
87
We all know that LSTMs can count and GRUs canβt. But what about QRNNs? And what if we make it a bit harder? And how about WFAs? And how does it tie in with rational recurrences? And what if w. (work w/ @lambdaviking,@yoavgo,@royschwartz02,@nlpnoah,@yahave).
A Formal Hierarchy of RNN Architectures -- in which we address questions like "what can an LSTM do that a QRNN can't?". Joint work with @gail_w, @yoavgo, @royschwartz02, @nlpnoah, and @yahave #acl2020nlp . Blog: Paper:
0
21
58
@plain_simon for some people in fields without open access as a standard its a good way to share their work.
0
1
48
Hi all! A reminder that we have a (formal language theory X NLP X neural models) discord going, and you're welcome to join if you like that intersection! Bonus!: This Thursday @ryandcotterell will be presenting . Join link:
2
15
45
@Danitni @BetterPocker ΧΧΧΧ Χ Χ©ΧΧ Χ©ΧΧ’Χͺ Χ’Χ Χ§ΧΧ ΧΧΧ Χ©Χ Χ§Χ¨Χ’. ΧΧ ΧΧ ΧΧΧΧ ΧΧΧ¨ΧΧ Χ’Χ ΧΧ Χ©ΧΧ ΧΧΧ¨ΧΧ ΧΧΧΧΧ, ΧΧΧ, ΧΧ ΧΧΧΧ ΧΧΧΧ‘ΧΧ Χ©ΧΧΧΧ ΧΧΧ¦Χ ΧΧΧΧ ΧΧΧ€Χ ΧΧΧ Χ©Χ§Χ¨Χ ΧΧΧ ΧΧΧΧΧ ΧΧ ΧΧΧ©ΧΧ’ΧΧͺ Χ©Χ ΧΧΧΧ€ΧΧ.
2
0
37
@yoavgo Honestly donβt really see any credibility in an institute that counts @timnitGebru & @alexhanna , one of which has been spreading horrific libel & the other which openly celebrated the acts of October 7th, among its ownβ¦ makes you wonder if they have anyone serious there at all.
6
5
34
Dumb question: can we call βtutorialsβ something else? I came here thinking itβd be a here-learn-python kind of thing & I wouldnβt need it. But it seems theyβre actually the really long interesting talks? Why not βoverviewsβ, βopening talksβ, etc? #ICML2018.
2
2
28
Ever found yourself staring at completely nonsensical looking attention in a transformer and wondering how it could possibly be doing what it is? Abnar and Zuidema have a really cool paper here showing how the 'accumulated' attention over all layers makes a lot more sense! QA now.
If you are attending ACL, today is the Q&A session for our paper on ``Quantifying Attention Flow in Transformers'':. Also check out our blogpost:.
1
7
25
@timnitGebru Hard to see this being said in any form of good faith when you have yet to say a single word on the atrocities of Saturday.
1
0
25
Very nice talk from @OfirPress at @boknilevβs group today! Ofir told us about ALiBi, a simple method to help transformers deal w/ input sequences longer than seen in training. The trick: instead of pos. embeddings, apply a relative pos. bias to attn. patterns, @ varying strengths
1
3
23
@emilymbender Emily, if indeed you want to engage sincerely, I want to know we have common ground, so I will appreciate your response on the following: do you think the sadistic massacre of innocents by Hamas on October 7 was ok or possibly justified? Or do you see it as disgusting and evil?.
3
3
21
The first QA session for our ACL paper, "A formal hierarchy of RNN architectures", starts in 15 minutes! Excited to talk about formal analysis/draw weird proof sketches/speculate about RNNs with everyone there :).
A Formal Hierarchy of RNN Architectures -- in which we address questions like "what can an LSTM do that a QRNN can't?". Joint work with @gail_w, @yoavgo, @royschwartz02, @nlpnoah, and @yahave #acl2020nlp . Blog: Paper:
0
4
22
@emilymbender @sarahbmyers @alexhanna Will you be exploring @alexhanna βs hype of rape, torture, sadism and massacre?
0
1
20
Looking forward to talking with everyone at AIPLANS! The workshop will be on the 14th of December and my talk starts at 7am EST (12 noon GMT) - hope to see you all there!.
Did you know that a transformer architecture can be thought of as a programming language? And do you know how to sort a sequence in that language? π€π. Gail Weiss (@gail_w), an author of "Thinking Like Transformers", will be speaking at AIPLANS to tell us all about this!
1
0
17
@timnitGebru Glad youβre seeing this! We also donβt care to have actual genocide done against us, so weβll keep on speaking up against this sick inversion of the situation. Let us know if it ever gets through!.
6
0
18
@wtfis2bdone @yoavgo @timnitGebru @alexhanna Alex Hanna retweeting a celebration of the sadistic massacre of October 7
2
2
17
@emilymbender Come one come all @alexhanna is in the house and the bloodthirsty sadistic massacre wonβt celebrate itself
0
2
15
Distractors hate him: this one weird trick* helps language models solve reasoning tasks even in the presence of irrelevant information!. *memorising knowledge instead of holding it in context. w/ @ZemingChen5, @ericmitchellai, @real_asli, and @ABosselut.
0
6
18
@emilymbender I am so very sick and tired of people imploring Israel to βdo moreβ and βbe betterβ while we are literally being murdered. This is not the first time and every time you criticise our attempts at self defence without offering a better idea you aid the next time.
1
2
14
@roydanroy I donβt know about in general, but definitely donβt agree during review process: blanket permission to share reviews breaks blindness about who authors are, lending itself to all kinds of gaming (in particular by big names).
2
0
14
Motivated by @ojahnn βs quest to find increasingly convoluted string-reversal code, we present RASP,,.
EXTREMELY excited to announce RASP, a programming language whose goal is to provide a computational model for transformers in much the same way that automata have served for RNNs. Work with @yoavgo and @yahave , accepted into ICML 2021.
1
0
16