Saksham

@sgdescent

Followers
1K
Following
45K
Statuses
4K

Interested in improving LLMs; looking for Fall'25 PhD positions. Curr: Gen AI @Zomato. Prev: https://t.co/ht5ObQgA2n Program Synthesis with LLMs @Microsoft @ProseMsft

Joined February 2021
@sgdescent
Saksham
2 months
Oh btw, I am applying to Ph.D. programs this cycle with the goal of personalising LLMs and helping developers write amazing (correct) code. In case you are hiring, please look at my apps! I was a pre-doctoral researcher at @Microsoft PROSE where I worked closely with @Mukul_Singh_1
@sgdescent
Saksham
2 months
Okay, next step is to serve this in under a second! I'll most likely be writing a guide on deploying and serving LLMs/SLMs on ephemeral compute soonish.
[image]
4
2
36
@sgdescent
Saksham
11 days
Yeah it’s ‘fair use’ since ALL three used ethically sourced data, which was not scraped. The data owners were asked and informed!!
@arankomatsuzaki
Aran Komatsuzaki
12 days
[image]
1
0
0
@sgdescent
Saksham
25 days
I have been a bit confused about NLL vs. cross-entropy. This video by @HermanKamper proves that both are equivalent.
0
0
2
@sgdescent
Saksham
26 days
@peakidiot Congrats king!!!!
0
0
1
@sgdescent
Saksham
28 days
RT @josemorgado: UNBELIEVABLE! 19 year old Learner Tien gets an INCREDIBLE win over world #5 and three time runner up Daniil Medvedev 6-3,…
0
614
0
@sgdescent
Saksham
1 month
@sidposting The food looks so gooddddd
1
0
1
@sgdescent
Saksham
1 month
TPUs are really something, huh?
1
0
5
@sgdescent
Saksham
1 month
I recently learned about GLiNER; it's pretty cool. I also love to see a paper repo with this many stars.
0
0
2
@sgdescent
Saksham
1 month
@3blue1brown Linear Algebra is the best thing I have ever spent my time on. Hands down the most goated Linear Algebra playlist.
0
0
0
@sgdescent
Saksham
1 month
@Aflah02101 Congrats! This is amazing
1
0
1
@sgdescent
Saksham
1 month
RT @novasarc01: @teortaxesTex i'll tell what's really happening: 1) there is no strong ecosystem of research here even in the so called top…
0
31
0
@sgdescent
Saksham
1 month
RT @whateventeestaa: Apparently I'm a top 1% tipper on Zomato.. GUYS TIP YOUR DELIVERY PARTNERS MORE I DON'T EVEN TIP THAT MUCH
0
96
0
@sgdescent
Saksham
1 month
RT @vincentmvdm: great software makes you feel powerful and @modal_labs is a shining example
0
4
0
@sgdescent
Saksham
1 month
@prajdabre1 Raj coded
0
0
1
@sgdescent
Saksham
2 months
How did they even come up with RoPE
1
1
4
@sgdescent
Saksham
2 months
@priyanshu42g But I do see your point as well :)
0
0
0
@sgdescent
Saksham
2 months
@priyanshu42g Well, it has been trained to start each line of the answer with H, E, L, L, O, and it seems like it's just answering the question here perfectly, which is what I'd expect a model like GPT-4o to do.
0
0
0
@sgdescent
Saksham
2 months
@akarsh1_u It kinda did, but I'm still pretty clueless, especially on serverless compute
1
0
0
@sgdescent
Saksham
2 months
@hornof Umm, I already know about a few of these; the issue is that I am running on a serverless system and have already tried reducing the batch size. I am literally using an A100 for a small encoder model.
0
0
1