Goodfire

@GoodfireAI

Followers: 2K
Following: 136
Statuses: 85

Advancing humanity's understanding of AI through interpretability research. Building the future of safe and reliable AI systems.

San Francisco
Joined August 2024
@GoodfireAI
Goodfire
2 months
Scaling interpretability work for frontier AI alignment has never been more important. That’s why we’re launching Ember - the first hosted mechanistic interpretability API with fast inference support for generative models like Llama 3.3 70B.
16
90
554
@GoodfireAI
Goodfire
15 days
RT @leedsharkey: Big new review! 🟦Open Problems in Mechanistic Interpretability🟦 We bring together perspectives from ~30 top researchers…
0
86
0
@GoodfireAI
Goodfire
21 days
Job listings here: We encourage you to apply even if you may not meet all of the requirements!
0
1
10
@GoodfireAI
Goodfire
25 days
RT @livgorton: We (@GoodfireAI) are hiring research fellows! Early career researchers and engineers can join us for ~2 months to work on an…
0
24
0
@GoodfireAI
Goodfire
27 days
RT @apartresearch: Myra Deng: Goodfire's Interpretability Tools in Action Our Community Manager @varchanaiyer_ interviews @GoodfireAI's @m
0
3
0
@GoodfireAI
Goodfire
30 days
@posedscaredcity @Sauers_ Feel free to send us a DM with more info on the issue you're running into!
1
0
1
@GoodfireAI
Goodfire
1 month
Launch post: Open source SAEs: Ember API/SDK for accessing SAEs:
@GoodfireAI
Goodfire
1 month
We're open-sourcing Sparse Autoencoders (SAEs) for Llama 3.3 70B and Llama 3.1 8B! These are, to the best of our knowledge, the first open-source SAEs for models at this scale and capability level.
1
6
32
@GoodfireAI
Goodfire
1 month
4/ We're excited to see how the community builds on these foundations. If you're working on model interpretability or interested in applying similar approaches to your AI systems, we'd love to hear from you!
0
0
26
@GoodfireAI
Goodfire
1 month
@ady_mehtaa @Contrary_Res Send it over to myra@goodfire.ai, thanks!
0
0
1
@GoodfireAI
Goodfire
2 months
@IntuitMachine We've launched publicly with support for Llama 3.3 70B:
0
0
2
@GoodfireAI
Goodfire
2 months
@victormustar Our API/SDK is publicly available if you want to try out more visualizations! Check out
0
0
0
@GoodfireAI
Goodfire
2 months
@Ahmad_Al_Dahle Our team sprinted to launch Ember - a hosted interpretability API - on Llama 3.3 70B! Check out
0
0
2
@GoodfireAI
Goodfire
2 months
We believe that accessible interpretability tools on powerful models enable both new research and new applications. We plan to open-source both our Llama 3.1 8B and Llama 3.3 70B SAEs and release a detailed research report on our steering work in January.
0
0
13