Shanthakumar
@5hanth
Followers
254
Following
205
Statuses
2K
Tropical frugivore | Plan Ƀ
Thondaimandalam
Joined July 2012
[India] 1954: you idiot! Why are you not sending your kids to school? 2016: you idiot! Why are you sending your kids to school? #unschooling
0
1
7
Future time travellers who take all @karpathy videos before going to past will not come back to a reality that’s anything resembling to the time of departure.
New 3h31m video on YouTube: "Deep Dive into LLMs like ChatGPT" This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It is covers the full training stack of how the models are developed, along with mental models of how to think about their "psychology", and how to get the best use them in practical applications. We cover all the major stages: 1. pretraining: data, tokenization, Transformer neural network I/O and internals, inference, GPT-2 training example, Llama 3.1 base inference examples 2. supervised finetuning: conversations data, "LLM Psychology": hallucinations, tool use, knowledge/working memory, knowledge of self, models need tokens to think, spelling, jagged intelligence 3. reinforcement learning: practice makes perfect, DeepSeek-R1, AlphaGo, RLHF. I designed this video for the "general audience" track of my videos, which I believe are accessible to most people, even without technical background. It should give you an intuitive understanding of the full training pipeline of LLMs like ChatGPT, with many examples along the way, and maybe some ways of thinking around current capabilities, where we are, and what's coming. (Also, I have one "Intro to LLMs" video already from ~year ago, but that is just a re-recording of a random talk, so I wanted to loop around and do a lot more comprehensive version of this topic. They can still be combined, as the talk goes a lot deeper into other topics, e.g. LLM OS and LLM Security) Hope it's fun & useful!
0
0
1
💯 vibe coding 💯
There's a new kind of coding I call "vibe coding", where you fully give in to the vibes, embrace exponentials, and forget that the code even exists. It's possible because the LLMs (e.g. Cursor Composer w Sonnet) are getting too good. Also I just talk to Composer with SuperWhisper so I barely even touch the keyboard. I ask for the dumbest things like "decrease the padding on the sidebar by half" because I'm too lazy to find it. I "Accept All" always, I don't read the diffs anymore. When I get error messages I just copy paste them in with no comment, usually that fixes it. The code grows beyond my usual comprehension, I'd have to really read through it for a while. Sometimes the LLMs can't fix a bug so I just work around it or ask for random changes until it goes away. It's not too bad for throwaway weekend projects, but still quite amusing. I'm building a project or webapp, but it's not really coding - I just see stuff, say stuff, run stuff, and copy paste stuff, and it mostly works.
0
0
0
@IndianTechGuide Q) What's stopping from selling tender coconut and banana in Airports? A) We destroy farms and build airports to sell packaged artificial fake food at premium.
0
0
0
RT @Free_Ross: Ross was just granted a FULL AND UNCONDITIONAL PARDON by @realDonaldTrump. Words cannot express how grateful we are. Presid…
0
11K
0
@sunnewstamil போற போக்குல பசுவின் பால் குடிப்பதால் நோய் வருமென ஒரூ உருட்டு.. டாஸ்மாக் வீரன் சரக்கு குடித்தால் 420 வகை நோய்கள் நீங்குமானு இதே ஆவேசத்துடன் விவாதிக்கலாம்.
0
0
0
@readswithravi Rationale to unilaterally advance scheduled matters, disregarding the established timeline and the expectations of those who adhere to it.
0
0
0