diwuNLP Profile Banner
Di Wu Profile
Di Wu

@diwuNLP

Followers
139
Following
675
Statuses
161

PhD candidate in MT/NLP/ML @UvA_Amsterdam, working with @c_monz.

Amsterdam
Joined June 2019
Don't wanna be here? Send us removal request.
@diwuNLP
Di Wu
2 months
RT @__JohnNguyen__: 🥪New Paper! 🥪Introducing Byte Latent Transformer (BLT) - A tokenizer free model scales better than BPE based models wit…
0
67
0
@diwuNLP
Di Wu
3 months
@jlibovicky @jindra_helcl hah, the video is interesting
0
0
0
@diwuNLP
Di Wu
4 months
RT @francoisfleuret: Do we like this?
Tweet media one
0
10
0
@diwuNLP
Di Wu
4 months
RT @bnjmn_marie: Unsloth has identified and fixed the gradient accumulation issue I reported last week. The problem turned out to be more s…
0
30
0
@diwuNLP
Di Wu
4 months
We show that a grammar book provides little or even no help for translation in LLMs, questioning the recent "truly zero-shot translation" --- no data no gain, still 🧐
@sethjsa
Seth Aycock
4 months
Our work “Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?” is now on arXiv! - in collaboration with @davidstap, @diwuNLP, @c_monz , and Khalil Sima'an from @illc_amsterdam and @ltl_uva 🧵
0
1
9
@diwuNLP
Di Wu
4 months
RT @ltl_uva: Language Technology Lab got four papers accepted for #EMNLP2024! Congrats to authors Kata Naszadi, Shaomu Tan, Baohao Liao @ba
0
1
0
@diwuNLP
Di Wu
7 months
RT @evgtokarchuk: Come check our poster tomorrow at @GRaM_org_ @icmlconf if you want to discuss dispersion of text embeddings on hyperspher…
0
17
0
@diwuNLP
Di Wu
7 months
RT @davidstap: 1/4 #ACL2024 Excited to share our new paper on the impact of fine-tuning on the qualitative advantages of LLMs in machine tr…
0
6
0
@diwuNLP
Di Wu
7 months
RT @evgtokarchuk: Next week I'll be in Vienna at @icmlconf! Want to learn more on how to explicitly model embeddings on hypersphere and en…
0
3
0
@diwuNLP
Di Wu
8 months
RT @kchonyc: modern LM research seems to be the exact repetition of MT research. here goes the prediction; someone will reinvent minimum…
0
28
0
@diwuNLP
Di Wu
8 months
RT @mar_kar_: Can #LLMs truly reason over loooong context? 🤔 NoCha asks LLMs to verify claims about *NEW* fictional books 🪄 📚 ⛔ LLMs that…
0
95
0
@diwuNLP
Di Wu
8 months
RT @bazril: @alexandrabirch1 asking the important questions in the #eamt2024 keynote
Tweet media one
0
4
0
@diwuNLP
Di Wu
8 months
RT @royvanrijn: Fact: A lot of Dutch laptops have this special tray to warm up your stroopwafel.
Tweet media one
0
4K
0
@diwuNLP
Di Wu
8 months
RT @PhD_Genie: Adjusting your paper within the word limit
0
109
0
@diwuNLP
Di Wu
8 months
RT @evgtokarchuk: On my way to Mexico city to present our (me & @vnfrombucharest) work at @naaclmeeting Check the paper and join me at th…
0
2
0
@diwuNLP
Di Wu
8 months
RT @wangly0229: 🌸Thanks for highlighting our TransAgents work in your GitHub repo! @AndrewYNg 🪐We're also committed to integrating Languag…
0
18
0
@diwuNLP
Di Wu
9 months
RT @Wafaa01997: The Chat Shared Task (WMT2024) is live! 💥💥 Happy to announce this year’s Chat Shared Task which aims to translate a corpus…
0
9
0
@diwuNLP
Di Wu
9 months
@BramVanroy @prajdabre1 ICLR24 and ICML24 are both in Vienna
0
0
1
@diwuNLP
Di Wu
9 months
@JumeletJ I have the identical question today morning, but now I tell myself that not to care about judgement outside. Hope can see you in Bangkok.
0
0
2
@diwuNLP
Di Wu
9 months
RT @ProfFeynman: The job of a scientist is to listen carefully to nature, not to tell nature how to behave.
0
589
0