Hanze Dong Profile
Hanze Dong

@hendrydong

Followers
329
Following
291
Statuses
184

Research Scientist @SFResearch | Reproducibility & interpretability of LLMs | Sampling Algorithm | Core author of LMFlow, RLHFlow, Iterative SFT/DPO (RAFT/GSHF)

Joined June 2011
Don't wanna be here? Send us removal request.
@hendrydong
Hanze Dong
2 days
RT @SFResearch: ⚡ Meet BOLT: A novel approach to develop long chain-of-thought reasoning in LLMs without relying on knowledge distillation…
0
30
0
@hendrydong
Hanze Dong
6 days
😂😂😂
@arankomatsuzaki
Aran Komatsuzaki
6 days
NVIDIA and CMU presents ASAP, which enables highly agile motions that were previously difficult to achieve! @Cristiano Siuuuuuuu!
0
0
1
@hendrydong
Hanze Dong
6 days
RT @baohao_liao: Impressed by DeepSeek-R1 and o3? However, they are long-reasoning models, and generate >4k tokens quite often for hard que…
0
5
0
@hendrydong
Hanze Dong
6 days
1
0
4
@hendrydong
Hanze Dong
7 days
Very interesting. Btw, as the population declines, industries focused on resource extraction will face pressure to shift towards innovations, crucial for improving labor efficiency and sustaining growth with less workforce. The system itself might have to break the equilibrium.
@teortaxesTex
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
8 days
0
0
4
@hendrydong
Hanze Dong
11 days
@yihengxu_ Learn!
0
0
2
@hendrydong
Hanze Dong
12 days
RT @OneRepublic: Counting Stars into the Chinese New Year. 🇨🇳🐍⭐️
0
1K
0
@hendrydong
Hanze Dong
16 days
RT @JacobSteinhardt: In 2021, our research group released the MATH dataset. In the paper, we attribute the data to math contests released b…
0
13
0
@hendrydong
Hanze Dong
19 days
RT @rosstaylor90: “Wait that can’t be right” in the wild Thank you internet anons for your service to LLM reasoning. We found you through…
0
19
0
@hendrydong
Hanze Dong
19 days
RT @Ber18791531: It's exciting to see Kimi-k1.5 uses a similar RL training objective to our (response-level) OREO! The difference is they…
0
26
0
@hendrydong
Hanze Dong
1 month
Not human users but other LLMs might subscribe to the pro version 🤣
@sama
Sam Altman
1 month
insane thing: we are currently losing money on openai pro subscriptions! people use it much more than we expected.
0
0
1
@hendrydong
Hanze Dong
1 month
Interesting
@thomasschulzz
Thomas Schulz
1 month
Tier 1: - Sequoia - Founders Fund - A16Z - YC Tier 2: - Benchmark - General Catalyst - Khosla - Lightspeed VP - Index - Kleiner Perkins - Caffeinated Capital - SV Angel - Tiger Global - First Round - Greenoaks - Accel - Bessemer - Greylock - USV - Paradigm - Homebrew - Form Cap - Menlo - Craft Tier 3 & beyond: - Everyone else Unranked: - There are some newer unranked funds that have a lot to prove that I wouldn’t include in Tier 3.
0
0
1
@hendrydong
Hanze Dong
1 month
RT @danielhanchen: Cool things from DeepSeek v3's paper: 1. Float8 uses E4M3 for forward & backward - no E5M2 2. Every 4th FP8 accumulate…
0
256
0
@hendrydong
Hanze Dong
1 month
RT @jiayq: In 2019 I had a chat with the DeepSeek team, in the hope of selling them an AI cloud solution. I was trying to convince them a f…
0
123
0
@hendrydong
Hanze Dong
1 month
RT @deepseek_ai: 🚀 Introducing DeepSeek-V3! Biggest leap forward yet: ⚡ 60 tokens/second (3x faster than V2!) 💪 Enhanced capabilities 🛠 AP…
0
2K
0
@hendrydong
Hanze Dong
2 months
RT @engineers_feed: Humans vs ants' problem solving Source: Wizeman Institute of Science
0
2K
0
@hendrydong
Hanze Dong
2 months
RT @gm8xx8: Offline Reinforcement Learning for LLM Multi-Step Reasoning OREO (Offline Reasoning Optimization) is introduced to enhance the…
0
6
0