Chi Chen Profile
Chi Chen

@chc273

Followers
471
Following
430
Statuses
147

Principal Quantum Engineering Manager at Microsoft. Interested in AI and Science. Views are mine.

Redmond, WA
Joined November 2020
Don't wanna be here? Send us removal request.
@chc273
Chi Chen
2 months
Impressed by deepseek-v3 performance and more impressed by the company culture that drives innovations
@wzihanw
Zihan Wang - on RAGEN
2 months
[Long Tweet Ahead] I just have to say, I’m genuinely impressed by DeepSeek. 💡 It’s no wonder their reports are so elegant and fluffless. Here’s what I noticed about their culture, a space where real innovation thrives, during my time there ↓ — — — — — 🌟 1. Be nice and careful to talents - The recruiting teams seek top talent from China & globally. Many are PhD / grad / undergrads from Chinese top 10 universities e.g., Tsinghua / Peking University. - Hiring is minimalist: My interview took only a few rounds. They basically check two criteria: Do you genuinely WANT to push fundamental AI problems forward? CAN you make it happen (at least one standout skill + solid skills to get things done)? - Roles seem shaped around the talent, instead of vice versa. Not like “we need a role, so we find a talent”, they basically ask: “Here’s an exceptional talent; how can they contribute?” This can lead to something unconventional: they can hire someone with expertise in MBTI who finally focuses on creating more personalized / role-playing models. - Something basic: Top-tier benefits in China, including for interns, allowing them to concentrate on work matters and worry less about material concerns. 🤝 2. Individualized HR culture - With above talent-first hiring logistics, even with a 200-people scale, I still feel everyone is unique and there is no such thing like a standardization where everyone can be replaced like a cog-in-machine. - No pressure or forced KPIs. I hardly feel any sense like “this must be done by this Thursday” from my mentor / seniors / colleagues. - Being collaborative. DeepSeek tries its best to forbid race inside the company. It’s like everyone contributes to the final model with their own (orthogonal) ideas and everyone hopes their idea is useful. If an idea is proved useful, everyone celebrates, and everyone is happy about it. ⚙️ 3. Disentangled development systems - DeepSeek covers a highly diverse set of talent directions. It’s like how “expert specialization” happens in their MoE models. People focus on what they’re best at, and it’s natural to ask others things out of their expertise. Helping others with one's expertise is not what people only do after completing their own work. - There is a shared basic pipeline that works pretty well for everyone. When a group adds new things to the system, they do really good documentation so others can know what happens in a minute and how it affects their own roles (most of the time, this won’t affect their work; they just feel things improve automatically). - Feedback loops are FAST: To verify whether ideas could work, is basically just to test whether it could work on the super-latest simplified baseline. I strongly feel whenever I have an idea in the morning, I can realize whether it’s effective in the afternoon -- no organization approval, no hard GPU utilization restrictions, little debugging (thanks to the rigorously debugged baseline), just try to seamlessly add my own idea to the model. This makes working there super reflective and feedback-rich at the beginning of an idea, even if many ablations are required later to finally merge the idea to the giant model. So all of the above makes the organization super Spontaneous-person-friendly, and maybe this is why you can always trust their tech paths even when many improvements / ideas are applied in each single model release. I do appreciate such disentangled organization, which makes fast and solid iterations at different angles in the model. 4. 🌍Diversity sparks innovation It’s not really about something like “we must consider every party”. They pay attention to inclusion but it’s not the biggest matter. The biggest matter lies in “How can people from diverse backgrounds contribute to the DeepSeek model?” I have many colleagues called know-it-all “百晓生”, a role-of-talent that DeepSeek hires. As an AI company, it’s interesting to see so many AI developers just from literature / social science backgrounds. They know little about machine learning formulas and could understand model training based on their intuition of babysitting a child. It’s fun to discuss Zhenhuan Zhuan (a Chinese history drama) during lunch and do a lot of mind-practice like how to survive in a squid game. The initial idea of this role-of-talent is to build a global knowledge base on history, culture, and science to expand AGI capabilities. However, I do feel how they contribute to working efficiency / nurturing ideas of all the team, at least, making everyone happy and more focused when getting back to work from lunch. — — — — — Something random I hope to share at the end: It’s fun to solve some challenges to realize individual value or get a sense of achievement. In fact, it matters what “challenge” you are facing. The “challenge” here could just be “how to achieve AGI” – in such case, you actually do not need to worry too much about “what if this idea has been tried by someone else”, “what if someone achieves AGI faster than me”, “what if this idea is too simple” or “what if someone get paid more than me” – things many are indeed worried about. When what someone care is about achieving AGI, they could just try relentlessly about what is really useful and incorporate them into the model. — — — — — Resources and References: Two interviews with DeepSeek founder Liang Wenfeng: DeepSeek hiring ads: And my experiences there.
0
0
1
@chc273
Chi Chen
2 months
Great insights
@labenz
Nathan Labenz
2 months
"The only thing that works 100% of the time is measuring in a lab" @andrewwhite01, Head of Science @FutureHouseSF explains why they're building an AI Scientist to run real-world experiments -vs- pursuing a more in-silico strategy for biological discovery. Excellent episode!
0
0
3
@chc273
Chi Chen
2 months
2024 has been a landmark year for AI in Science. We are excited about AI in materials science - from accelerating quantum simulations to bridging scales, automating experiments & empowering scientists with AI agents. @Matter_CP
0
1
11
@chc273
Chi Chen
2 months
RT @MilesCranmer: 🧵 Could this be the ImageNet moment for scientific AI? Today with @PolymathicAI and others we're releasing two massive…
0
89
0
@chc273
Chi Chen
3 months
@XirtamEsrevni Thanks for the insights. At the base layer, AI can be used as an accelerator, and we don’t have to limit to atomistic AI/ML. I see lots of opportunities for AI in materials engineering too
0
0
3
@chc273
Chi Chen
4 months
Keqiang has been doing impactful work in the AI for Science space. Don’t miss him in case your department is hiring for new faculties.
@KeqiangY
Elio (Keqiang) Yan
4 months
I am on the faculty job market for Fall 2025. My research focuses on scientific ML and AI for science. I use science to guide AI/ML/LLMs and develop AI/ML to accelerate scientific discovery. Homepage: Opportunities are greatly appreciated!
Tweet media one
2
2
11
@chc273
Chi Chen
4 months
@OpenCatalyst This is awesome. Great contribution to the field!
0
0
2
@chc273
Chi Chen
4 months
@KathyYWei1 huge demand soon @andrewwhite01
@andrewwhite01
Andrew White 🐦‍⬛
3 years
We've completed our first draft of a protein emoji that we'll be submitting at end of month. Please let me know if you have feedback and/or show your support for a protein emoji! Art by Michael Osadciw
Tweet media one
0
0
1
@chc273
Chi Chen
4 months
AI work winning Nobel Prizes in physics and chemistry is encouraging for us in the AI for science space. It's exciting to imagine that AI for materials discovery might earn similar recognition when the field achieves significant real-world impacts.
@NobelPrize
The Nobel Prize
4 months
BREAKING NEWS The Royal Swedish Academy of Sciences has decided to award the 2024 #NobelPrize in Chemistry with one half to David Baker “for computational protein design” and the other half jointly to Demis Hassabis and John M. Jumper “for protein structure prediction.”
Tweet media one
0
0
14
@chc273
Chi Chen
4 months
@FrankNoeBerlin Congrats Frank! Well deserved recognition
1
0
1
@chc273
Chi Chen
4 months
Great opportunity to work on AI for materials science. Would highly recommend
@xie_tian
Tian Xie
4 months
Interested in working with a highly collaborative, interdisciplinary team to push the state of the art of generative AI for materials design? Join us as an intern by applying through this link! We are the team behind the MatterGen and MatterSim models from Microsoft Research AI for Science.
0
6
18
@chc273
Chi Chen
6 months
@TimothyDuignan defensive patenting
2
0
5
@chc273
Chi Chen
7 months
@hongbin_zhang We just hope to help and, if necessary, show people that we can :)
0
0
0
@chc273
Chi Chen
8 months
We are looking for a senior ML engineer to help the team push the boundaries of scientific innovation. Please apply if you are passionate about building new and impactful solutions #buidl #ai4science #aiforscience #chemjob #ai4chemistry #ai4materials
0
8
21
@chc273
Chi Chen
8 months
@timmyrupert Congrats Tim!
0
0
0
@chc273
Chi Chen
8 months
@xie_tian @jrib_ And you come back strong on top with MatterSim :) ICYMI, I co-created BOWSR at -2 spot, lol
1
0
2
@chc273
Chi Chen
8 months
0
0
1