Katherine Tian Profile Banner
Katherine Tian Profile
Katherine Tian

@kattian_

Followers
864
Following
558
Media
3
Statuses
131

cs/stat @harvard , working on calibration & factuality of LLMs, prev @GoogleAI tensorflow, golden state @warriors fan

Bay Area, CA
Joined January 2018
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
Pinned Tweet
@kattian_
Katherine Tian
4 years
Thanks Paige! Great working on the @TensorFlow team
@DynamicWebPaige
👩‍💻 Paige Bailey
4 years
Announcing a cool new feature in @TensorFlow : differentiable map ops! 🙌✨ If you've ever wanted to store and train embeddings in an embedding model, this should make the process *much* simpler. 😄 Amazing work from our TF intern, @kattian_ ( @Harvard )!
Tweet media one
Tweet media two
Tweet media three
Tweet media four
2
49
199
1
3
44
@kattian_
Katherine Tian
11 months
Turns out we can directly train language models to hallucinate less without any human annotation -- for around 50% error reduction compared to RLHF!! Check out our paper for the approaches and full results 😃
@_akhaliq
AK
11 months
Fine-tuning Language Models for Factuality paper page: The fluency and creativity of large pre-trained language models (LLMs) have led to their widespread use, sometimes even as a replacement for traditional search engines. Yet language models are prone
Tweet media one
5
48
255
12
36
247
@kattian_
Katherine Tian
11 months
How can you tell how reliable an LLM output is? Let’s talk about calibration—calibrated confidences give an accurate probability of whether an LLM output is correct AKA how reliable it is Our EMNLP paper shows we can get calibrated confidences from RLHF LLMs by *just asking* 🧵
5
14
99
@kattian_
Katherine Tian
2 years
Check it out!! It was great working with Jaehwan + the team and thanks to Pranav for advising 😊
@pranavrajpurkar
Pranav Rajpurkar
2 years
1/ Introducing X-REM, our new AI method to generate radiology reports from chest X-ray images. Our method uses a multimodal language-image model to capture fine-grained interaction between text and image for higher accuracy. #AI #radiology #MIDL2023 🧵
4
36
155
1
2
15
@kattian_
Katherine Tian
4 years
Check out the JAX transformer! Exciting to see how JAX did in MLPerf
@JeffDean
Jeff Dean (@🏡)
4 years
Very excited to see the MLPerf 0.7 results released today, where Google TPUs set records in six of the eight benchmarks! We need bigger benchmarks, because we can now train the ResNet-50, BERT, Transformer, & SSD benchmarks each in under 30 seconds.
20
279
1K
0
2
8
@kattian_
Katherine Tian
2 years
@kipperrii @natfriedman lawful evil, chaotic good, lawful good
0
0
6
@kattian_
Katherine Tian
4 years
Had a great time presenting this interesting NLP interpretability paper for Harvard Undergrad ML Group's journal club today! Slides:
@zacharylipton
Zachary Lipton
4 years
Join the discussion now! My PhD student @danish037 & CMU alum @im_mansigupta are discussing our work on Learning to Deceive with Attention-Based Explanations
0
1
15
1
0
6
@kattian_
Katherine Tian
2 years
Excited!!
@pranavrajpurkar
Pranav Rajpurkar
2 years
Course Launch📢 AI Research Experiences 🗓️ Over the past 5 years, I've been training students to get their start in applied AI engineering and research. Over the next 4 months, as I launch CS197 at Harvard, we will be making our notes publicly available:
6
55
245
1
0
5
@kattian_
Katherine Tian
2 years
@O42nl 3 standard paperclips so... 1 human = ~1 OpenAI paperclip?
0
0
5
@kattian_
Katherine Tian
11 months
0
0
6
@kattian_
Katherine Tian
3 years
great perspective worth hearing! thanks for sharing, av
“On Motivational Debt, Shame Avoidance, Impostor Syndrome and Living Well” Alternative title: Av does SURPRISE THERAPY at a weekly group meeting Response was very positive so decided to record & share! Consider discussing in your group meeting :-)
7
5
32
0
1
5
@kattian_
Katherine Tian
11 months
Can we avoid log probs and *just ask* the model to verbalize its confidence as text? Yes! Verbalization can cut expected calibration error in half for models including GPT-4, Claude-2, Llama2-70B-Chat
1
0
4
@kattian_
Katherine Tian
11 months
However, evidence suggests that RLHF destroys the calibration of LM’s output distribution. See the GPT-4 technical report and our confirmation on Llama2-70B
Tweet media one
Tweet media two
1
0
6
@kattian_
Katherine Tian
11 months
👇👇Raaz is a great mentor and super fun to work with!
@raazdwivedi
Raaz Dwivedi
11 months
PhD Applicants: Consider the vibrant research @cornell_tech , where @cornell meets New York City! Interested in statistical problems in causal inference & reinforcement learning for personalized decision-making & healthcare? Mention me! RTs appreciated!
0
47
168
0
0
4
@kattian_
Katherine Tian
11 months
Other observations: - Asking the model for multiple guesses and probabilities (inspired by human psychology) reduces overconfidence - Surprisingly, despite LLM’s challenges with numbers, asking for verbalization with numbers worked as well or better than with phrases
1
1
4
@kattian_
Katherine Tian
9 months
@nibnalin @amirbolous that’s a reasonable hypothesis but it doesn’t seem like that’s the full story - evan chen reviewed a subset of proofs and verified they’re not all just coordinate bashing
Tweet media one
0
0
5
@kattian_
Katherine Tian
4 years
go @amymjang ! 🤩
@DynamicWebPaige
👩‍💻 Paige Bailey
4 years
✨👩‍🎨 Check out this new @TensorFlow 2.x and @Kaggle competition from our intern @AmyMJang !
1
3
16
0
2
3
@kattian_
Katherine Tian
11 months
How? We just ask the model for its best guess and probability that it is correct
1
0
2
@kattian_
Katherine Tian
11 months
@conradev @AriX congrats!! excited to see what you create
0
0
2
@kattian_
Katherine Tian
11 months
Typically, you can get pretty well-calibrated confidences just from the model output distribution log probabilities or sampling from the output distribution
1
0
2
@kattian_
Katherine Tian
2 years
@SinaHartung are you running up that hill?
1
0
2
@kattian_
Katherine Tian
11 months
@kundan_official Good question it could be! For intuition, the max frequency ignores the randomness of the model's sample at that token and rewards good prefixes; Also both options should be correlated since the max frequency is likely to be chosen. Will probably check empirically sometime
1
0
2
@kattian_
Katherine Tian
3 years
0
0
2
@kattian_
Katherine Tian
1 year
0
0
2
@kattian_
Katherine Tian
2 years
0
0
2
@kattian_
Katherine Tian
3 years
@valor_zhang wholesome ✨
0
0
1
@kattian_
Katherine Tian
10 months
0
0
1
@kattian_
Katherine Tian
3 years
@rajivmovva @2plus2make5 @NikhGarg Congrats Raj!!! So excited for you :)
1
0
1
@kattian_
Katherine Tian
11 months
@anshulkundaje Thank you Anshul 😄
0
0
1
@kattian_
Katherine Tian
1 year
@ekzhang1 yay congrats!! excited for you
0
0
1
@kattian_
Katherine Tian
3 years
@vvhuang_ lmao big mood
0
0
1
@kattian_
Katherine Tian
4 years
@catyeo18 Cat ur amazing!! we stan
0
0
1
@kattian_
Katherine Tian
11 months
0
0
1
@kattian_
Katherine Tian
2 years
@tyleryzhu @Princeton @orussakovsky late to the party but congrats tyler!!
1
0
1
@kattian_
Katherine Tian
2 years
@chelseabfinn @hseas @boazbaraktcs @ShamKakade6 I really enjoyed this talk! Thank you 😊
0
0
1
@kattian_
Katherine Tian
4 years
@kbarley66 looks so good 😋
0
0
1
@kattian_
Katherine Tian
2 years
@valor_zhang jump rope and cartwheels are more fun than running (also spikeball if you can get 3 others to regularly play with u)
0
0
1
@kattian_
Katherine Tian
4 years
1
0
1
@kattian_
Katherine Tian
2 years
@SiCaPu @rhodes_trust @UniofOxford huge congrats sílvia!! <333
1
0
1
@kattian_
Katherine Tian
11 months
@yar_vol It's a good question! Bigger models will have better starting points but as long as the RL feedback is better it should help - I'm also curious to see how the method can scale 😎 Will update if I look into this soon :)
0
0
1
@kattian_
Katherine Tian
3 years
0
0
1
@kattian_
Katherine Tian
1 year
@leonardtang_ let’s go!!
0
0
1
@kattian_
Katherine Tian
3 years
@kompletechaos watermelon salt, high
1
0
1
@kattian_
Katherine Tian
2 years
@haothehantato super cool 🙌🙌
0
0
1
@kattian_
Katherine Tian
1 year
@chenxcynthia tweeted by AK 😮‍💨😮‍💨 nice work! :)
0
0
1
@kattian_
Katherine Tian
11 months
@anshulkundaje Thanks Anshul!! :)
0
0
1
@kattian_
Katherine Tian
11 months
@EllaMinzhiLi hi ella! yes this is the work - will DM you :))
0
0
1
@kattian_
Katherine Tian
1 year
@citizentaro yoo congrats!!
0
0
1
@kattian_
Katherine Tian
3 years
@ekzhang1 that’s so cool!
0
0
1
@kattian_
Katherine Tian
3 years
@vvhuang_ you are cool
0
0
1
@kattian_
Katherine Tian
2 years
@kbarley66 nice post 🙌 hope u have a fulfilling 2023!
0
0
1
@kattian_
Katherine Tian
4 years
@joannaqlin i read this at first as snowflake stock is going down
1
0
1
@kattian_
Katherine Tian
2 years
0
0
1
@kattian_
Katherine Tian
1 year
1
0
1
@kattian_
Katherine Tian
1 year
@ekzhang1 let’s go!!! booking my plane ticket rn
0
0
1
@kattian_
Katherine Tian
11 months
@in_quanta very cool - congrats!
0
0
1
@kattian_
Katherine Tian
2 years
@kipperrii HAHA this is a good post
0
0
1
@kattian_
Katherine Tian
3 years
0
0
1