![Rosmine Profile](https://pbs.twimg.com/profile_images/1824113994703024128/lp6JjWme_x96.jpg)
Rosmine
@rosmine_b
Followers
1K
Following
2K
Statuses
697
Senior ML Scientist @ FAANG working on LLMs DM me your ML questions
Joined October 2023
@Kylec1215 @TheXeophon I like to use it for lit reviews and checking if projects are already done. Anything that requires searching through large amounts of information. Here's an example lit review I've also used it for health topics, e.g. research on the best way to exercise
5
0
0
@cccntu For comparison, curriculum learning (SFT with samples ordered by difficulty) also often leads to better generalization
0
0
7
@tensorqt Fun fact, the probability of that is 10^{-3.6x10^8}. It's so small you need exponents within the exponential notation. Anthropic really does have the mandate of heaven.
What are the chances you'd get a fully functional language model by randomly guessing the weights? We crunched the numbers and here's the answer:
0
0
5
@iScienceLuvr From Karpathy himself: "It took me last ~6 weeks to get a from-scratch policy gradients implementation to work 50% of the time on a bunch of RL problems." source:
0
0
10
@OpenAI Feature request: In the sidebar, can we have a way to filter Deep Research chats vs. other chats? I want to return to Deep Research queries, but sometimes have trouble finding them in all the other chats
2
0
7
@nrehiew_ Just posted this in response to another rl post:
@andersonbcdefg RL is the the automation of reward hacking
0
0
1
@I_loves_deep_nn Unhappy people send hate because they want other people to be on their level, and it's easier to make other people sad than make themselves happier. Fight back by emitting positivity!
0
0
1
@abacaj Basically every LLM reasoning result looks so obvious in retrospect. Prompt engineering, CoT, STaR, Self-Consistency. Every paper your read is "why didn't I think of that"
8
8
178
@Aizkmusic I wanted to try and bought a pro subscription. It's really useful. lmk if you have a query you want to try
0
0
1