Colin Raffel

@colinraffel

Followers
31K
Following
3K
Statuses
2K

https://t.co/mStEmzIJH0

Joined March 2017
@colinraffel
Colin Raffel
1 year
📢Life update:📢 I moved to Toronto, where I'm now an associate professor at the University of Toronto and an associate research director at the Vector Institute. I wrote a blog post about the long winding path that led me here:
128
43
1K
@colinraffel
Colin Raffel
7 days
RT @Ar_Douillard: now that ICML deadline is over, time to submit to the MCDC workshop for ICLR!
0
8
0
@colinraffel
Colin Raffel
26 days
RT @Ar_Douillard: Workshop alert 🚨 We'll host in ICLR 2025 a workshop on modularity, encompassing collaborative + decentralized + continua…
0
38
0
@colinraffel
Colin Raffel
2 months
Application link:
0
0
10
@colinraffel
Colin Raffel
2 months
@giffmana I dunno, back in the day during my PhD we also had gaming desktop machines with gaming GPUs in our lab and we also called them "servers". I think any computer that is used for long-running jobs/experiments that you mainly use by ssh'ing into should be called a "server".
1
0
10
@colinraffel
Colin Raffel
3 months
Application link: Please share widely.
0
0
9
@colinraffel
Colin Raffel
3 months
RT @AdaptiveML: Instead of mitigating length bias in LLM-as-judge, what if you could simply 🙋ask models to output comparisons of the same l…
0
3
0
@colinraffel
Colin Raffel
3 months
@JayAlammar We have always referred to this diagram as the "octopus". I used to keep an informal list of all of the papers that had an octopus-style diagram in it.
0
1
22
@colinraffel
Colin Raffel
3 months
RT @mciccone_AI: 🚨 Life update 🚨 I moved to Toronto 🇨🇦and joined @VectorInst as a Postdoctoral Fellow to work with @colinraffel and his lab…
0
4
0
@colinraffel
Colin Raffel
3 months
RT @prateeky2806: I'm on the job market! Please reach out if you are looking to hire someone to work on - RLHF - Efficiency - MoE/Modul…
0
59
0
@colinraffel
Colin Raffel
6 months
RT @prateeky2806: We just released our survey on "Model MoErging", But what is MoErging?🤔Read on! Imagine a world where fine-tuned model…
0
45
0
@colinraffel
Colin Raffel
10 months
RT @arankomatsuzaki: 🚀 Introducing Pile-T5! 🔗 We (EleutherAI) are thrilled to open-source our latest T5 model trained on 2T tokens from th…
0
109
0
@colinraffel
Colin Raffel
1 year
@madiator Good question. I think 1) bandwagonism/inertia (it's anti-zeitgeist) and 2) it works well for classification tasks and is less proven for open-ended generation. But I've heard T-few has been implemented and is in use by various LLM startups, they just don't advertise it as such.
0
0
2
@colinraffel
Colin Raffel
1 year
RT @ada_rob: I love music most when it’s live, in the moment, and expressing something personal. This is why I’m psyched about the new “DJ…
0
104
0
@colinraffel
Colin Raffel
1 year
RT @AlbalakAlon: {UCSB|AI2|UW|Stanford|MIT|UofT|Vector|Contextual AI} present a survey on🔎Data Selection for LLMs🔍 Training data is a clos…
0
77
0
@colinraffel
Colin Raffel
1 year
@jeremyphoward @Muqeeth10 @liu_haokun Lots more work coming from us along these lines! Would love to sync up sometime.
1
0
0
@colinraffel
Colin Raffel
1 year
@sivil_taram Thank you! It took us a long time - turns out to be challenging in the zero-shot setting. The LoraHub approach makes a lot of sense in the few-shot setting. Ultimately both settings are very important!
1
0
5