Sanyam Bhutani
@bhutanisanyam1
Followers
38K
Following
11K
Media
950
Statuses
8K
๐จโ๐ป Working on llama models @AIatMeta | Previously: @h2oai, @weights_biases ๐ Podcast @ctdsshow ๐จโ๐ Fellow @fastdotai ๐ฒ Grandmaster @Kaggle
Menlo Park, CA
Joined October 2016
This is the best week of my life ๐. โ
Reached Kaggle Grandmaster tier. โ
My ML Hero & Guru: @jeremyphoward was kind enough to host me for an interview about my journey. I promise to continue creating ML content to the best of my ability & sincerely take up competitions next ๐ต.
This week I'm filling in for regular "chai time data science" podcast host @bhutanisanyam1, with a very special interview with a recently-anointed Kaggle grandmaster.
17
12
284
Extremely excited to start working on the llama community officially!. Iโve joined @Meta and will be working in an absolute dream of contributing to the llama community among many other cool things . So how and why did I get here?. The price to big dreams is paid in units of
148
26
2K
Life update: I have moved to Bay Area to work @Meta HQ! ๐. The flight from India takes a day but my journey was 2 years to get to Silicon Valley: . In 2022, @jeremyphoward gave an advice that took over my mind:. โYou should live in Bay Area for a while if you want to meet some
61
41
2K
How to become an expert at any thing ๐. I rediscovered this gem by @karpathy in my bookmarks today
21
153
1K
This is the best resource to get started in NLP in 2023 ๐. In 2 days, I will be kicking off a weekly study group to learn with everyone:. @_lewtun will kindly join us for an opening AMA.
18
118
942
The best summary of Transformers and itโs evolution ๐. I found @giffmanaโs slides from 2022 to be the best โpocket referenceโ on the topic. Many posts have covered Transformers however this one also covers the state of field before and how it got adopted to different domains.
11
155
888
Best tutorial on setting up LLMs locally! ๐. @Rob_Mulla made an end to end video teaching how to install, run with GUI and connect a Large Language Model to your own data on your own machine. All open source, running offline:.
13
160
881
Watching โState of GPTโ by @karpathy is the best 40 minutes you will spend this week ๐. I actually found it really helpful for filling a lot of my knowledge gaps:. - Comparisons against human brain and LLM brain. - Why prompting works and why is it helpful to ask a model to โbe
17
120
817
Officially wrapping up the @kaggle Top Solutions Series ๐. Iโve hosted over 25 videos sharing and explaining tricks, secrets of Kagglers for all domains of machine learning. The series is quite complete & Iโm graduating to more challenges:.
13
172
809
The best tutorials on building LLM powered applications ๐ . @GregKamradt is an incredible teacher of @LangChainAI:. โ
Top down & applied series.โ
Amazing teaching style.โ
Very practical examples.
20
155
777
MAJOR personal update: . Iโll be starting my work in a full time role @h2oai today as a Machine Learning Engineer and AI Content Creator! . Iโm really excited to be a part of a team of many of my โML Heroesโ and THE best kagglers. Recap on my ML journey:.
72
59
728
Arxiv Chat: Chat w the latest papers ๐. I made a really simple demo that makes it easy for me to understand the latest papers. The whole app is <100 lines of code:. โ
@LangChainAI for the main logic.โ
@h2oai Wave for the UI.โ
ChatGPT for asking Qs
27
91
716
Run 13B model on an iPhone! ๐คฏ. Just finished reading @Tim_Dettmersโ amazing work on SpQR. SpQR unlocks 3.35 bit quantisation which lets us run 33B models on 3090s and 13B models on an iPhone. Here are my notes from the paper:. - Quantisation is basically like compressing the
29
111
688
Insanely detailed notes on Training LLMs! ๐. @StasBekman has shared his field notes on training foundational models. These are insanely detailed, cover a lot of gotchas and caveats. A crispy read with well documented code, in depth discussions:.
6
127
682
If you're looking for the best resources to prepare for ML interviews in 2020,. Here's a wiki from the @fastdotai forums:. Also, all contributions are welcomed!
6
154
670
"Start by learning the basics really well [. ] Most advanced research projects require you to be excellent at the basics [. ] @AndrewYNg always told me to work on thorough mastery of these basics" .Read the complete interview w @goodfellow_ian @hackernoon:
4
184
643
NLP for absolute beginners ๐. @jeremyphoward kindly shared the Stanford materials which are incredibly high signal resource for NLP. Hereโs another tutorial teaching you the absolute NLP basics upto how to make a submission on Kaggle, by Jeremy himself!.
3
139
638
I'm tea-ry eyed. I can't believe this :'). I've reached the @kaggle Grandmaster tier today! Thank you so much everyone!๐ต. My sincerest gratitude to @jeremyphoward for introducing me to Kaggle and to @vopani for pushing me to pursue it! ๐.
63
15
597
The definitive guide to RAG in production! ๐. @GokuMohandas walks us through implementing RAG from scratch, building a scalable app. It now has updated discussion on embedding fine-tuning, re-ranking and effectively routing requests. I think this is easily the most complete
12
91
568
This is just surreal! . I just won the @hackernoon Contributor of the year award for 2 categories! ๐๐ต. - Machine Learning.- Tutorial.
43
30
535
I finally got my copy of Deep Learning w Python by @fchollet! ๐. I couldn't be more excited about this ๐. I'll be starting a reading group on Jan 8, and Francois has kindly agreed to join for an AMA! ๐ต. Please send your Qs around Keras/the book as here!. Links TBD Soon!
22
28
511
The most detailed and practical write up on applying LLMs! ๐. This reads like a survey paper but written for the industry and applications. @eugeneyan is known as the best NLP writer for a reason. Itโs the most comprehensive overview of patterns on building Large Language Models
7
96
500
Personal Update Thread on The @GoogleAI Residency:. Earlier this year I got a life-changing email. My Google AI Residency application had made it to the final interview rounds! This spring, Google flew me out to NYC where I gave my "On-site" interviews.
15
44
455
"Machine Learning doesnโt have to be a black box anymore. What use is a good model if we cannot explain the results to others. Interpretability is as important as creating a model.". A neat kernel on "Intrepreting Machine Learning models" by @pandeyparul.
5
99
434
I'm super excited to share that I've joined @weights_biases! ๐ต. I've been a fan of their community since the early days, I'm really looking forward to contributing to it further. Please expect study groups, events, Kaggle deep dives, and much more! ๐ .
55
20
435
Incredible recap of key Transformer concepts! ๐. What I really like about this write up is it covers 30 key papers and flows really well as a recap. @lilianweng has written so many incredible posts, this one captures all key architectural concepts:. - Transformer basics:
7
73
430
Great tutorial on Deploying Deep Learning Models On Web And Mobile (Along with a working demo!) by @reshamas and Nidhin P. They've used the library, but the tutorial can be used to create a web and mobile app using any framework.
1
104
424
The most underrated LLM Cookbook! ๐. @OpenAIโs guide is an incredibly underrated resource. My favourite bit is the practical advice and guides sprinkled throughout the examples. It also has the highest quality code of many learning resources. The examples cover all important
4
80
417
GitHub GPT: Understand any repository! ๐ . Here is a demo where I played with connecting GPT-4 to any repository. The main logic is <20 lines of code:. โ
@LangChainAI for the main logic.โ
@activeloopai for storing embeddings.โ
Simple App that runs in the terminal
14
60
418
Iโm in happy tears to awarded โTop GenAI Scientistโ award by @AnalyticsVidhya ๐. I feel really honoured by the recognition. Will make this one count!
31
9
415
A perfect intro to open source LLMs! ๐. The course by @asangani7 is now my top recommendation for getting started with Large Language Models:. - Just enough theory for a whole picture. - Teaches prompting, special tokens and conversational agents. - Perfectly abstracts the
0
52
332
The best NLP lectures! ๐. @chrmanningโs latest CS224n lectures are finally live! . The 14 hours of new content covers Large Language Models, Interpretability, and some crispy framework tutorials:.
4
76
384
My next goal:. I will spend at least 500 hours this year competing on @kaggle ๐ต. If I fail to do it, I will not drink chai for an entire year and giveaway all my GPUs ๐.
43
11
377
The Interview with @kaggle Grandmaster and Senior CV Engineer @LyftLevel5: Vladimir Iglovikov @viglovikov just got published @hackernoon. The Grandmaster has really been kind enough to share *ALL* of his secrets, you can find all of them here:
8
86
367
Very practical course on applying LLMs! ๐. @HamelHusain had mentioned that langchain makes for a great cookbook of cutting edge ideas. This course is a refreshingly applied one teaching how to use @LangChainAI to build different applications. My favourite part is itโs
6
59
360
The most comprehensive series Iโve read on Vector databases! ๐พ. Most of us got exposed to vector dbs via Langchain or llamaindex documentation. However, Thereโs a lot of nuance and options to select from when building Large Language Model apps. @tech_optimist has written a 4
9
58
362
Terrific tutorial on fine-tuning LLMs to your own data ๐จโ๐ฌ. Tomas Bratnic has shared a really crispy write up on creating a Cypher generating LLM:. โ
All Open Source tools.โ
Walkthrough of setup.โ
Detailed steps on how to solve this @h2oai LLMStudio.
5
65
355
Today is a glorious day for @kaggle community! ๐ต. Kaggle legend: @sudalairajkumar has conquered all categories and become the newest 4x Grandmaster! ๐
11
19
352
โA cookbook of Self-Supervised Learningโ @ylecun et al๐ฉโ๐ณ๐จโ๐ณ . SSL is the tasty sauce behind a lot of the success in Language models, Computer Vision and beyond. It permits working with limited data by allowing you to include unlabelled data in your workflow. Hence becoming โthe
5
63
336
Weekly @kaggle Top Solutions Study group ๐. Starting this Sunday, I will be going through top solutions of recently ended competitions that might be relevant to the ongoing ones:.
4
58
325
NLP with Transformers Study Group ๐ค . Starting next week, Iโm hosting a study group on the absolute gem book by @huggingface team ๐. @_lewtun has kindly agreed to join the kickoff session. I canโt think of a better way to learn NLP:.
10
48
329
This really helped me understand why LLMs work! ๐. - Why next word prediction is powerful. - Why prompting works. - What we know about emergence. Thanks @_jasonwei for the gem:.
1
39
326
Am I doing @karpathy and chill right? . Cafe overlooking Himalayas, tasty breakfast and lecture rewatch ๐๐
19
2
316
This is THE BEST CAREER ADVICE for Data Science that Iโve ever read: . IMO @kaggle forums often have write ups/advice of *much* higher quality than most of the blogposts out there. Iโd highly recommend reading all of @ryan_cheslerโs write ups on Kaggle.
0
67
317
Last year, @HamelHusain (re) taught me a super power ๐. โIf you sincerely spend 3 hours everyday learning a new topic. In 6 months, youโll be really far aheadโ. This is something we learned in @fastdotai that Hamel reminded me when I expressed my imposter scare of LLMs
6
41
312
My 15 day LLM Study Vacation! ๐ . The plan for next 2 weeks:. โ
Hike/Visit a Himalayan mountain daily with a paper to read.โ
Build 15 @LangChainAI apps.โ
Finish catching up on LLM research
21
12
310
An extremely crispy intro to Vector Databases! ๐. Have you watched those wired videos explaining concepts at incremental levels of detail?. @helloiamleonie done the same for Vector Dbs. She teaches the topic using Feynman technique, in 3 levels of detail.
5
52
302
I'm really excited to share the interview with my and @fastdotai family's greatest ML Hero: @jeremyphoward. Not adding any Tweet introductions this time ๐ต. Audio: Show Notes: Video:
11
59
302
If you're looking for a code first NLP course ๐จโ๐ป. There is an NLP course by @fastdotai covering ๐ต๏ธโโ๏ธ. - What is NLP.- Topic Modelling.- Sentiment Classification.- Regex.- LM.- RNNs.- Transformers.- Bias & Ethics. Blog: YT Playlist:
5
63
294
Strong LLM blog recommendation! ๐. For any practitioner interested in Large Language Models, this is the best blog. @eugeneyan is magical at combining industrial patterns, experiments & research ideas very clearly . His weekend exp are my fav read:.
3
39
301
This is my favourite type of tutorial! ๐. Remember the awesome fastai tutorials that share only the necessary theory and quickly dive into applying it?. @Sentdex teaches us QLoRA in the exact manner by applying it to give llama-2 more personality.
6
42
300
Huge congratulations to @pandeyparul on becoming the 1st Woman @kaggle Kernels Grandmaster from India. And to the best of my knowledge, 2nd one in the world. Although, I really hope she won't stop sharing her amazing kernels with us๐ต.
4
22
284
2022 Goals:. 1 Workout for 350 hours. 2 Compete on @kaggle for 200 Hours. 3 Write 3 Kaggle Kernels. 4 Host 50 meetups & interviews @weights_biases . 5 Spend 500 hours reading. 6 Write 4 High-Quality @PyTorch blogposts. 7 Publish 1 Open Source Repo.
11
15
285