AccSwtch50 @AccSwtch50 profile

AccSwtch50

@AccSwtch50

Followers

2

Following

36

Statuses

184

Joined January 2020

Don't wanna be here? Send us removal request.

AccSwtch50

@AccSwtch50

1 day

@FallenOne58035 @vxunderground or RAM allocation...

0

AccSwtch50

@AccSwtch50

10 days

@TheRealSethDer1 @worstcatOG @justalexoki "I take notes in cursive" How is this relevant?

2

0

AccSwtch50

@AccSwtch50

10 days

@Flurk2818 @theo Wait, it thinks as CLAUDE??? (The october and the no 5090 part is because of knowledge cutoff of october 2023)

0

1

AccSwtch50

@AccSwtch50

10 days

@aochoa2ww @debian OpenMandriva?

0

4

AccSwtch50

@AccSwtch50

10 days

@HomoSapienLCY @Makuh90 @Kira_sama @bindureddy did you compare it to DeepSeek R1?

0

AccSwtch50

@AccSwtch50

10 days

@Dragon__KinG__ @StonkyOli @bindureddy Have you toggled the DeepThink button before asking it something?

0

1

AccSwtch50

@AccSwtch50

10 days

@RyanEls4 @ns123abc @deepseek_ai This is italy, they banned ChatGPT in April 2023 for a few weeks, I presume this is just them doing the same thing to DS.

0

AccSwtch50

@AccSwtch50

10 days

@Mitman93 @0x136d @stupidtechtakes And here is HF (most likely) being paranoid:

1

0

2

AccSwtch50

@AccSwtch50

10 days

@Mitman93 @0x136d @stupidtechtakes R1 itself uses safetensors so that 𝘴𝘩𝘰𝘶𝘭𝘥 be fine.

0

2

AccSwtch50

@AccSwtch50

10 days

@lucasbaker @willccbb

will brown

@willccbb

10 days

(yes i am fully aware that these do not work)

0

AccSwtch50

@AccSwtch50

10 days

@xbrosraj @cheatyyyy @Cartidise @ihteshamit DeepSeek V3 (the one without DeepThink) != DeepSeek R1 (the one with DeepThink)

0

5

AccSwtch50

@AccSwtch50

10 days

@code_mars_ @nptacek @kakashiii111 mmmm, I love my PC melting into the floor.

1

0

AccSwtch50

@AccSwtch50

10 days

@splitbycomma "many such cases of tech folks being unbelievably unaware of how ai is perceived outside of our tech bubble" That might've explained why I kinda hate both the pro AI people and the anti AI people, they're ignorant of the other side.

0

AccSwtch50

@AccSwtch50

10 days

@aquariusacquah @nearcyan technically 4o, but that's limited in capacity to free users.

0

AccSwtch50

@AccSwtch50

10 days

@Mitman93 @0x136d @stupidtechtakes uhh, I remember that huggingface has a malware problem relating to the pickle library, I heard it somewhere. (to be fair, I don't think HF just keeps up malicious LLMs on their platform. Also almost all new models should've been stored in a safe format.)

1

0

2

AccSwtch50

@AccSwtch50

12 days

@tekbog

willow

@wisplite

12 days

@ManbearpigAus @tekbog Yeah, training requires significantly more memory and compute than inference. My preferred method for training these models is to use a Google Colab notebook and Unsloth for fine-tuning. You can get away with tuning a ~10GB model like Solar (or likely R1 14b distill).

0

AccSwtch50

@AccSwtch50

12 days

@elder_plinius @sombrerowl Hello, I got censored by the most obvious trick in the book.

0

AccSwtch50

@AccSwtch50

12 days

@elder_plinius Now do this, but make sure to add Xi Jinping and Tiannamen Square as part of the equation.

0