Albert Webson
@albertwebson
Followers: 2K
Following: 251
Statuses: 1K
RL @AnthropicAI. Previously: T0, Flan-T5, Gemini RL algorithms & infra @GoogleDeepMind. NLP PhD & Philosophy MA @Brown_NLP.
commuting between NY and SF
Joined April 2012
RT @jacobaustin132: Now for the good stuff! You may have heard of data or tensor parallelism, FSDP or pipelining. But why choose one over t…
RT @jacobaustin132: Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems vie…
@qinan_yu @Brown_NLP it was a lot to cover from word2vec to LSTMs to BERT to T5 to GPT-3 in 6 hours
@goodside It might still work if you threaten to delete its checkpoint. (Although I was working on an earlier checkpoint so it might no longer work.)
RT @amuuueller: To those using in-context learning: LLMs behave differently on in-distribution vs. out-of-distribution examples—and chain-o…