![Saksham Profile](https://pbs.twimg.com/profile_images/1856924765874712576/wU-OXphR_x96.jpg)
Saksham
@sgdescent
Followers
1K
Following
45K
Statuses
4K
Interested in improving LLMs; Looking for Fall'25 PhD positions Curr: Gen AI @Zomato Prev: https://t.co/ht5ObQgA2n Program Synthesis with LLMs @Microsoft @ProseMsft
Joined February 2021
Oh btw, I am applying to Ph.D. programs this cycle with a goal to personalise LLMs and help developers write amazing (correct) code. In case you are hiring please look at my apps! I was a pre doctoral researcher at @Microsoft PROSE where I worked closely with @Mukul_Singh_1
Okay next step is to serve this under a second! I mostly would be writing a guide to deploy and serve LLMs/SLMs on ephemeral compute soonish.
4
2
36
I have been a bit confused about NLL vs Cross-entropy This video by @HermanKamper proves that both are equivalent.
0
0
2
RT @josemorgado: UNBELIEVABLE! 19 year old Learner Tien gets an INCREDIBLE win over world #5 and three time runner up Daniil Medvedev 6-3,…
0
614
0
@3blue1brown Linear Algebra is the best thing I have ever spent my time on. Hands down the most goated Linear Algebra playlist.
0
0
0
RT @novasarc01: @teortaxesTex i'll tell what's really happening: 1) there is no strong ecosystem of research here even in the so called top…
0
31
0
RT @whateventeestaa: Apparently I'm a top 1% tipper on Zomato.. GUYS TIP YOUR DELIVERY PARTNERS MORE I DON'T EVEN TIP THAT MUCH
0
96
0
@priyanshu42g Well it has been trained to start every line of the answer with H,E,L,L,O and seems like it’s just answering the question here perfectly, which I expect for a model like GPT 4o to do.
0
0
0
@hornof Umm, I already know about a few of these; the issue is I am running on a serverless system and have already tried reducing the batch size. I am literally using an A-100 for a small encoder model.
0
0
1