ray @DrRayZha profile

ray

@DrRayZha

Followers

8

Following

93

Statuses

45

Voice AI Reseacher

Joined December 2024

Don't wanna be here? Send us removal request.

ray

@DrRayZha

16 days

@fofrAI A Pokemon

0

ray

@DrRayZha

17 days

Mind-blowing to imagine what 80% accuracy would mean - that would be truly revolutionary!

Dan Hendrycks

@DanHendrycks

17 days

We’re releasing Humanity’s Last Exam, a dataset with 3,000 questions developed with hundreds of subject matter experts to capture the human frontier of knowledge and reasoning. State-of-the-art AIs get <10% accuracy and are highly overconfident. @ai_risk @scaleai

0

1

ray

@DrRayZha

17 days

1. pick a photography artwork you like. 2. have Sonnet critique it. 3. get Sonnet to turn that critique into a text-to-image prompt. 4. throw that prompt into Imagen3. Then, you will get:

0

1

ray

@DrRayZha

17 days

@cat_amsha @fofrAI try it on Gemini

1

0

ray

@DrRayZha

17 days

@labsdotgoogle @henrydaubrez Veo2 is the best

0

ray

@DrRayZha

17 days

@eustachelb impressive! is this compared with faster-whisper-large-v3?

0

ray

@DrRayZha

17 days

outcome-oriented RL sounds similar to what R1 did. i guess the new Sonnet won't explicitly seperate reasoning and general models, but it will be smarter at reasoning naturally to create a smoother response.

Nathan Lambert

@natolambert

17 days

Dario Amodei on Anthropic's coming reasoning models / methods (lightly edited auto transcription): To say a little about reasoning models, our perspective is a little different, which is that there’s been this whole idea of reasoning models and test-time compute as if they’re a totally different way of doing things. That’s not our perspective. We see it more as a continuous spectrum — the ability for models to think, reflect on their own thinking, and ultimately produce a result. If you use Sonnet 3.5, sometimes it already does that to some extent. But I think the change we’re going to see is a larger-scale use of reinforcement learning, and when you train the model with reinforcement learning, it starts to think and reflect more. It’s not like reasoning or test-time compute — or whatever it’s called — is a totally new method. It’s more like an emergent property, a consequence of training the model in an outcome-based way at a larger scale. I think that will lead to something that continuously interpolates between reasoning and other tasks, fluidly combining reasoning with everything else models do. As you’ve said, we’ve often focused on making sure using the model is a smooth experience, allowing people to get the most out of it. I think with reasoning models, we may take a similar approach and do something different from what others are doing.

0

1

ray

@DrRayZha

18 days

@thepatwalls different places, same goal

0

ray

@DrRayZha

18 days

@real_kai42 节哀，看别人赚几十倍确实很上头

0

1

ray

@DrRayZha

18 days

@lowstz 文心落后deepseek可不止一点点

1

0

3

ray

@DrRayZha

18 days

@zizhpan 妥了，很棒！

0

ray

@DrRayZha

18 days

@nrehiew_ very valuable thread! BTW R1 seems sensitive to the input prompt and few-shot prompting would degrade the performance, it may be a promising direction to make it more robust to input prompts

0

ray

@DrRayZha

23 days

@poetengineer__ yeah, but that is the results of crazy 996

0

ray

@DrRayZha

23 days

@mymind my new wallpaper thanks

1

0

1

ray

@DrRayZha

23 days

the most impressive thing i have seen this week

Jesus Plaza

@JesusPlazaX

23 days

I have just created my first Fashion Film with Veo2 and I'm blown away 🤯 I have directed several Fashion Films in my career and I am extremely impressed about what I have been able to create using Veo2, it is really good with human physics. Here is my first fashion film test using Veo2 👀👇🏼

1

0

1

ray

@DrRayZha

23 days

@JesusPlazaX This is insane! Did Veo2 handle the music too, or was that a separate addition?

1

0

ray

@DrRayZha

23 days

@_omermirza love the tecky look! thanks for posting

1

0

1

ray

@DrRayZha

23 days

@alwriterla 英文效果不错，但是中文质量不如Cosyvoice，我以为Kokoro的亮点在于参数量低而不在于绝对的生成质量

0

6

ray

@DrRayZha

26 days

@DementedApple5 @fofrAI english french japanese korean chinese are all available

0

1