Goku Mohandas @GokuMohandas profile

Goku Mohandas

@GokuMohandas

Followers

14,282

Following

113

Media

129

Statuses

1,079

ML @anyscalecompute + @raydistributed ← 🌏 Founder @MadeWithML (acq) ← ⚕️ ML Lead @Ciitizen (acq) ← 🍎 ML Engineer @Apple ← 🧬 Bio + ChemE @JohnsHopkins

https://t.co/pw1mjhVT0L

ML for Developers 👉

Joined June 2015

Don't wanna be here? Send us removal request.

Explore tweets Explore followers Explore following

Explore trending content on Musk Viewer

#GFRunTheWorldConcertD1 • 415418 Tweets

Napoli • 136264 Tweets

Ticketmaster • 64120 Tweets

KINGPOWER x PHUWIN • 56251 Tweets

#خلصوا_صفقات_الهلال2 • 40445 Tweets

#PVL2024 • 39973 Tweets

Akari • 30188 Tweets

Qoo10 • 29925 Tweets

DONBELLE DAOG SA DAVAO • 24599 Tweets

ポケミク • 20301 Tweets

#Oasis25 • 18732 Tweets

ポムの樹 • 18389 Tweets

PLDT • 17696 Tweets

月ノ美兎 • 15966 Tweets

椎名唯華 • 15727 Tweets

YOSHIKI • 12202 Tweets

#ドッキリGP • 12155 Tweets

グスタフソン

アルスラン

相葉ちゃん

周回コース

Madrigal

競技場周回

今井くん

JINくん

石塚くん

オレンジ色

LET THEM REST

やす子ちゃん

SSR10

才木くん

アシマリ

児童養護施設

プルオーバー

競技場内

逆転スリーラン

リンセン

大曲の花火

やす子さん

만루홈런

LET BINI REST

ウルブラ

バーヘイゲン

トルガイ

相葉くん

雨穴さん

ブリーズ

れおほー

ジンくん

栗山さん

Last Seen Profiles

@abutaqwa13

@FilegarJosiah

@TosinfaksTV

@com_in_

@BLACKIRIS_info

@srgthqth_hggh

@F1mech

@Umoja4

@TheReal2Clutch

@fox13

@Binor78535211

@NhPass

@AFUStratCom

@TheStevenDwayne

@managarmrr_

@_bcbread

@szxt1i

@Aiyu_ta

@wheelerdealers

@Hoovesart

Pinned Tweet

Goku Mohandas

@GokuMohandas

1 year

Excited to share our production guide for building RAG-based LLM applications where we bridge the gap between OSS and closed-source LLMs. - 💻 Develop a retrieval augmented generation (RAG) based LLM application from scratch. - 🚀 Scale the major workloads (load, chunk, embed,

33

266

1K

Goku Mohandas

@GokuMohandas

3 years

All the @madewithml machine learning fundamentals & MLOps lessons are released! - 🛠 Project-based - 💻 Intuition & application (code) - 🏆 26K+ GitHub ⭐️ - ❤️ 30K+ community - ✅ 47 lessons, 100% open-source 🧵 Thread on details & lesson highlights 👇

Home - Made With ML

Learn how to responsibly design, develop, deploy and iterate on production ML applications.

madewithml.com

23

564

2K

Goku Mohandas

@GokuMohandas

4 years

🔥 Putting ML in Production! We're going to publicly develop @madewithml 's first ML service. Here is the broad curriculum: - 📦 Product - 🔢 Data - 🤖 Modeling - 📝 Scripting - 🛠 API - 🚀 Production More details (lessons, task, etc.) here: Thread 👇

24

225

1K

Goku Mohandas

@GokuMohandas

6 years

Excited to release practicalAI - - 📚 Notebooks on topics from basic Python to advanced deep learning techniques w/ @PyTorch - 🖥️ Run everything using @GoogleColab or @ProjectJupyter - 📦 Learn object-oriented ML to code for products, not just tutorials

9

226

706

Goku Mohandas

@GokuMohandas

4 years

Excited to launch Made With ML - ( @madewithml )! 🔍 Discover ML projects (with code/blog posts) on interesting topics. 🛠 Build projects of your own and share it with the community. ‍💻 Showcase your profile on your resume or apply directly to ML managers.

11

152

601

Goku Mohandas

@GokuMohandas

3 years

New @madewithml MLOps lesson on monitoring machine learning systems: - identifying drift (data, target, concept) - measuring drift on uni/multivariate data via - reducers (PCA, UAE) - detectors (chi^2, KS, MMD) - solutions (not always retraining)

11

108

542

Goku Mohandas

@GokuMohandas

3 years

I updated the @madewithml pipeline lesson to include recent work on continual learning systems (algorithmic & MLOps workflow perspective). 🧵 Thread below on common themes, recent papers & critical decisions that prevent continual machine learning systems from being continuous.

10

122

521

Goku Mohandas

@GokuMohandas

3 years

Had a great time presenting on my approach (+ industry trends) for offline evaluation in @chipro 's amazing CS329s Machine Learning Systems Design course - @MadeWithML lesson (+ code): - CS329s course: 🧵 Highlights thread below 👇

6

99

495

Goku Mohandas

@GokuMohandas

9 months

It's been nice to see small jumps in output quality in our RAG applications from chunking experiments, contextual preprocessing, prompt engineering, fine-tuned embeddings, lexical search, reranking, etc. but we just added Mixtral-8x7B-Instruct to the mix and we're seeing a 🤯

12

69

454

Goku Mohandas

@GokuMohandas

3 years

⚡️ Testing ML Systems: Code, Data & Models ⚡️ - Test types, coverage, best practices - Pytest fixtures, markers, parametrize - Test data via @expectgreatdata - Test models via slicing @SnorkelAI - Behavioral tests - Testing vs. monitoring Highlights 👇

2

88

432

Goku Mohandas

@GokuMohandas

5 years

🔥 Excited to release practicalAI 2.0 - - 📚 Illustrative ML lessons in @TensorFlow 2.0 + Keras - ⚒️ Build robust models using the functional API w/ custom components - 📦 Train using simple yet highly customizable loops to build products fast

6

116

423

Goku Mohandas

@GokuMohandas

3 years

New @madewithml lessons on Pipelines & Feature Stores to connect DataOps & MLOps workflows in our machine learning systems: - Orchestration w/ @ApacheAirflow (DAGs, Tasks, Runs): - Feature stores w/ @feast_dev (historical/online):

5

80

342

Goku Mohandas

@GokuMohandas

4 years

Just released the API lesson for @madewithml 's Applied ML course! - API design from first principles using FastAPI - Schemas & custom validation via pydantic - Using decorators for validation & consistency - Project template with DBs, auth, etc. @tiangolo

2

58

326

Goku Mohandas

@GokuMohandas

1 year

Beyond excited to share the biggest update yet to @MadeWithML -- based on 8+ years of helping machine learning teams get to production: - 📈 Scaling ML (+ LLMs) - 🔗 MLOps integrations - 🚀 Dev → Prod (fast) - ✅ open-source 🧵 Detailed thread below 👇

Home - Made With ML

Learn how to responsibly design, develop, deploy and iterate on production ML applications.

madewithml.com

14

89

319

Goku Mohandas

@GokuMohandas

2 years

New @MadeWithML lessons on data engineering for machine learning to construct a data stack for ELT workflows & orchestrate them for quality data. - 📚 Data stack: - ⎈ Orchestration: - 💻 Repo: 🧵 Thread👇

3

67

316

Goku Mohandas

@GokuMohandas

3 years

In the latest @madewithml MLOps lesson, we use @GitHub Actions for CI/CD workflows: - Workflow components (events, runners, jobs) - Testing Actions locally using Act - Best practices (ex. caching) - ML Actions ( @expectgreatdata checkpoints, @DVCorg CML)

2

55

290

Goku Mohandas

@GokuMohandas

4 years

The first few lessons of @madewithml ’s Applied ML in Production course are out! 📦 Product - Objective - Solution - Evaluation - Iteration 🔢 Data - Annotation + more soon! 📄 Details: 💻 Code: 🎥 Videos:

7

52

267

Goku Mohandas

@GokuMohandas

3 years

Just released the reproducibility lessons for @madewithml 's MLOps course! - Git basics via workflows (dev, collab, inspect, etc.) - Pre-commit hooks (+ custom local) - Versioning code + config + data = models via @DVCorg - Containerization via @Docker

1

51

241

Goku Mohandas

@GokuMohandas

5 years

We’ve become the #1 NLP repository on GitHub - - with the POC and now I want to make this content even more interactive and practical for even more people. And oh btw, we’re switching from @PyTorch to @TensorFlow 2.0 + Keras 🙃

GitHub

GitHub is where people build software. More than 100 million people use GitHub to discover, fork, and contribute to over 420 million projects.

github.com

4

45

221

Goku Mohandas

@GokuMohandas

10 months

Added some new components (fine-tuning embeddings, lexical search, reranking, etc.) to our production guide for building RAG-based LLM applications. Combination of these yielded significant retrieval and quality score boosts (evals included). Blog:

Goku Mohandas

@GokuMohandas

1 year

Excited to share our production guide for building RAG-based LLM applications where we bridge the gap between OSS and closed-source LLMs. - 💻 Develop a retrieval augmented generation (RAG) based LLM application from scratch. - 🚀 Scale the major workloads (load, chunk, embed,

33

266

1K

7

50

211

Goku Mohandas

@GokuMohandas

1 year

🦜 The @raydistributed team released Project Aviary today and besides the really fun way to explore LLM outputs (alongside latency, cost, etc.), I was more interested in how they’re serving this LLM model registry (+ evaluation, batch inference, etc.) 🤯 It took me ~30 minutes to

6

29

185

Goku Mohandas

@GokuMohandas

4 years

The Experiment Tracking & Optimization lessons are out! - Tracking, viewing and loading w/ @MLflow - HP optimization (sampling + pruning) w/ @OptunaAutoML - Advantages of using @weights_biases , @Cometml etc. • Lessons: • Colab:

2

34

181

Goku Mohandas

@GokuMohandas

5 years

⚕️ Deep Learning with Electronic Health Record (EHR) Systems - - Wrote this last yr. before jumping back into clinical ML but never got around to sharing it. Added some 2019 updates (clinical BERT and approach for industry applications).

2

49

176

Goku Mohandas

@GokuMohandas

3 years

In the latest @madewithml MLOps lesson, we use @streamlit to interactively explore: - Data: annotation, EDA, preprocessing - Performance: overall, slices, regressions - Inference: intermediate & final outputs - Inspection: labeling (FP), weaknesses (FN)

3

31

172

Goku Mohandas

@GokuMohandas

4 years

📊 The Exploratory Data Analysis (EDA) lesson is out for @madewithml 's Applied ML in Production course! EDA is: - not about doing every possible visualization - used to determine if dataset is apt for task - intent-driven to answer Qs - revisited often

1

32

169

Goku Mohandas

@GokuMohandas

4 years

📜 The Documentation lesson is out for @madewithml 's Applied ML course! - Typing and docstrings (w/ admonitions) - Autogenerate beautiful docs using MkDocs - Deploy via Github Actions and host via GitHub Pages • Repo: • Lesson:

4

22

167

Goku Mohandas

@GokuMohandas

4 years

📝 Just released the Logging and Styling (Black + isort + flake8) lessons for @madewithml 's Applied ML course (>50% through). There's not really much more ML but now the real fun starts with everything we need to actually deliver value with ML! Lessons:

1

25

155

Goku Mohandas

@GokuMohandas

4 years

😅 Ironically, @MadeWithML isn’t made with any ML yet (go manual before ML). Now it’s time to apply ML and I was thinking of developing this publicly, from Product → ML → Production / MLOps, with open source tools. Would people be interested in this?

Home - Made With ML

Learn how to responsibly design, develop, deploy and iterate on production ML applications.

madewithml.com

5

23

134

Goku Mohandas

@GokuMohandas

4 years

📊 The Baselines lesson is out for @madewithml 's Applied ML course. - Why we need baselines - Iteratively identifying limitations and motivating complexity - Case study w/ code - Weighing tradeoffs - Lesson: - Refresher on models:

2

24

122

Goku Mohandas

@GokuMohandas

4 years

🗂 The Scripting Organization lesson is out for @madewithml 's Applied ML course! - Organizing & reading code - Virtualenvs - Packaging with extras & entry points - Proper Makefiles - CLI app with @tiangolo 's Typer Repo: Lesson:

3

23

111

Goku Mohandas

@GokuMohandas

4 years

✂️ The Splitting lesson is out for @madewithml 's Applied ML in Production course! We cover: - Why we need three splits - Criteria for proper data splitting - When to shuffle (and when not to) - Iterative stratification for multi-label classification

3

19

107

Goku Mohandas

@GokuMohandas

1 year

A very comprehensive case study on fine-tuning Llama-2 across three different tasks👇 - code for distributed fine-tuning w/ @raydistributed + @huggingface Accelerate + @MSFTDeepSpeed - data prep + eval + baselines - when to & not to fine-tune - using perplexity for checkpointing

kourosh hakhamaneshi

@CyrusHakha

1 year

🚀 Exploring Llama-2’s Quality: Can we replace generalist GPT-4 endpoints with specialized OSS models? Dive deep with our technical blogpost to understand the nuances and insights of fine-tuning OSS models. 🔗 🧵 Thread 1/N👇

16

118

539

1

21

99

Goku Mohandas

@GokuMohandas

8 years

Highlights and Tutorials for “Richard Socher on the Future of Deep Learning” @OReillyMedia @RichardSocher @JonBruner

1

39

93

Goku Mohandas

@GokuMohandas

4 years

What tool do you prefer for experiment tracking and WHY? I'm working on the next lesson for @madewithml 's Applied ML course & I wanted to cover the contextual advantages of the options instead of just saying use "X". @Cometml @MLflow @neptune_ai @weights_biases + any others?

33

9

90

Goku Mohandas

@GokuMohandas

5 years

“ML is way easier to learn now with all the free resources” - yeah, but it’s also a lot more overwhelming (low signal/noise ratio)! I’ve followed these 3 principles to stay afloat: 🔎 Explore vs. exploit ⚒️ Build projects 🗃️ Triage Details below 👇

Walter Reade

@WalterReade

5 years

Desperate Question: How to all the ML people out there keep track of the barrage of new information? Formal note keeping? Rely on whatever sticks? Some other system? It seems like each year I'm losing ground; would love tried and true tips. (I'd love a RT for visibility. Thx!)

22

54

122

2

16

89

Goku Mohandas

@GokuMohandas

1 year

🤯 70%+ CPU RAM savings at ANY scale is a big deal but especially at 70B (Llama-2) scale! Quick guide for these massive savings w/ @huggingface accelerate + 3 line @raydistributed config that enables heterogeneous clusters ... a must have with today's limited GPU availability 👇

kourosh hakhamaneshi

@CyrusHakha

1 year

Checkpointing a 70B model like Llama-2 during training is a tough task, particularly when doing full-parameter fine-tuning on limited resources (A10Gs). But today, I had a fascinating revelation with #Ray that enables this process. Let's get technical! 🧵👇 1/11

3

43

270

1

13

88

Goku Mohandas

@GokuMohandas

4 years

🔥 A lot happening today on @madewithml including research, articles, videos on text generation, pose estimation, self-supervised RL, BART for QA and even a cool podcast ( @underrated_ml )! @madewithml has fundamentally changed how I keep up with ML. 👇

Home - Made With ML

Learn how to responsibly design, develop, deploy and iterate on production ML applications.

madewithml.com

3

17

82

Goku Mohandas

@GokuMohandas

3 years

And I completely forgot to announce the deployment/serving lesson: - Serving (batch, real-time) - Processing (batch, stream) - Learning (offline, online) - Testing (AB, canary, shadow) - Optimization (prune, quantize, distill) - Methods (K8s, serverless)

2

15

79

Goku Mohandas

@GokuMohandas

4 years

I love application related tutorials like this that go way beyond model training. A great way for ML folks to get familiarized with elastic search!

elvis

@omarsar0

4 years

As promised, here is the first draft of the Colab notebook that shows how to use pretrained LMs and an Elasticsearch search engine on the @elastic cloud to build a simple text similarity search application using COVID-19 scholarly articles dataset. Enjoy!

4

58

246

0

10

77

Goku Mohandas

@GokuMohandas

7 years

Notes on two recent papers on embeddings. Addresses better mapping for latent hierarchies and embedding rare words.

0

19

78

Goku Mohandas

@GokuMohandas

7 years

Notes on catastrophic forgetting and recent combating techniques.

0

24

74

Goku Mohandas

@GokuMohandas

4 years

The Preprocessing lesson is out for @madewithml 's Applied ML in Production! - methods, caveats, unique ways to encode - Prepare: missing values, outliers, clean - Transform: scale, encode, extract 📄: 🎥:

1

22

70

Goku Mohandas

@GokuMohandas

4 years

🔝 Made With ML Top Projects of the Week: Power normalization, SOLT, Self-supervision w/ fastai, JAX tutorials, Visual pre-training, NeRF, LM Decoder methods and more! I'm absolutely blown away by the ML projects that have been added by the community.

2

14

71

Goku Mohandas

@GokuMohandas

4 years

🔝Check out this week’s top Tutorials, Toolkits and Research from @madewithml : Torchlayers, Finetuning w/ Haiku, Illustrated graph NNs, Visualized debugging, NLP highlights, Text-to-image GAN with BERT and more!

1

14

68

Goku Mohandas

@GokuMohandas

4 years

@A_K_Nain Curated GNN resources for Getting Starting, Tutorials, Toolkits (for different frameworks), and Research here:

1

15

66

Goku Mohandas

@GokuMohandas

4 years

Hey everyone, I need a bit more time to kick off these lessons. We'll start doing regular releases every Monday starting Oct 26. And to be honest, it's lonely recording these lessons (I prefer live talks) so I've asked someone special to join me (hopefully they say yes).🤞

Goku Mohandas

@GokuMohandas

4 years

🔥 Putting ML in Production! We're going to publicly develop @madewithml 's first ML service. Here is the broad curriculum: - 📦 Product - 🔢 Data - 🤖 Modeling - 📝 Scripting - 🛠 API - 🚀 Production More details (lessons, task, etc.) here: Thread 👇

24

225

1K

3

1

62

Goku Mohandas

@GokuMohandas

7 years

1/ Notes on Exploring Sparsity in Recurrent Neural Networks. Reduce # of params while maintaining high performance.

0

11

60

Goku Mohandas

@GokuMohandas

1 year

We ( @pcmoritz & I) have been productionizing LLM apps (more later) but at the heart are OSS LLMs served via @anyscalecompute Endpoints. - ✅ Drop-in sub for OpenAI - ☁️ Deploy on own cloud if needed - 💸 < $1 / M tokens for Llama-2-70b Try it for free 👉

3

7

58

Goku Mohandas

@GokuMohandas

5 years

👋 Twitter, it’s been a while! I'm grateful for good health again and excited to return back to teaching. Expect 💻 + 🎥 content soon at and as always, it'll be 100% open-source :) Thread below👇

2

12

55

Goku Mohandas

@GokuMohandas

7 years

Good write up on a migration from Keras to @PyTorch . Code, demo, paper, blog post included.

0

9

54

Goku Mohandas

@GokuMohandas

3 years

Who is this course for? - 💻 Software engineers / Data scientists looking to learn how to responsibly create ML systems. - 🎓 College grads looking to learn the practical skills they'll need for the industry. - 🚀 Product Managers who want to develop a technical foundation.

2

3

46

Goku Mohandas

@GokuMohandas

5 years

🔥 The @practicalAI_me curriculum and Lesson 01. 🐍 Python are out! - - Most of you know everything in the lesson but we do cover decorators and callbacks which we’ll heavily use with training scripts and when building production ML apps with Flask.

3

8

40

Goku Mohandas

@GokuMohandas

4 years

Couple of awesome posts with great tips for Python builds with GitHub Actions 👇 Had to try @epwalsh10 ’s tip right away and managed to decrease build time by ~40% (compared to previously just caching wheel files)!

Made With ML

@MadeWithML

4 years

🏆 Today's trending posts both involve @github Actions: GitHub Actions for Machine Learning by @RisingSayak (integrated w/ @weights_biases - based on @DrElleOBrien 's tutorial) Python Caching in GitHub Actions by @epwalsh10

1

10

62

1

6

38

Goku Mohandas

@GokuMohandas

4 years

Just finished reading this and if you’re overwhelmed with all the NLP research, @omarsar0 ’s NLP Highlights are a must read! He carefully summarizes major trends and their implications without the technical jargon - and it’s a great way to figure out what papers to read in-depth.

elvis

@omarsar0

4 years

NLP Research Highlights — Issue #1 ICYMI, in this article I highlight some NLP trends and topics with a focus on summarizing the what, how, and why of a selection of interesting and important NLP papers published in the last few months.

1

58

159

1

5

39

Goku Mohandas

@GokuMohandas

3 years

Even more exciting content coming later this year, so stay tuned! - 🏆 Among top MLOps repos on GitHub: - 🛠️ A highly recommended resource used by industry: - ❤️ 30K+ community members:

Applications of AI and PyTorch in Asia Pacific by Ritchie Ng |...

Ritchie Ng is the CEO of Hessian Matrix, an AI systematic global hedge fund based in Singapore. He is also an NVIDIA Deep Learning Institute instructor leadi...

www.youtube.com

4

5

40

Goku Mohandas

@GokuMohandas

4 years

What other factors should I consider when deciding b/w models? I started listing few for the Baseline lesson in @madewithml Applied ML course: - performance - latency - size - compute - bias tests - 🕓 to develop - 🕓 to retrain - maintenance overhead What else should be here?

4

35

Goku Mohandas

@GokuMohandas

6 years

1/ Working on hype-free, practical AI for non-technical folks. First posts are on where to use AI and some basic intuition on data/models. Upcoming posts will be on designing AI products and building scalable engineering pipelines for AI. Check it out @

1

11

38

Goku Mohandas

@GokuMohandas

3 years

We start with lessons on the fundamentals of ML through intuitive explanations, clean code and visualizations. 📚 Foundations - Python (variables, functions, classes, decorators) - NumPy (numerical analysis) - Pandas (data analysis) - PyTorch (operations, gradients)

1

3

35

Goku Mohandas

@GokuMohandas

4 years

🏥 A friend asked if ML can cure COVID-19 as a joke. While ML may not directly cure it, here are some cool projects that may help us get there (code/blog/videos included) from visualization/APIs to identifying potential drugs that can inhibit the virus.

2

10

36

Goku Mohandas

@GokuMohandas

3 years

We’ve covered all the topics (complete w/ code) that I planned to cover, so that’s a wrap for now. But we’ve got even more amazing content coming later this summer! 🎓 Lessons: 🐙 Repository: 📬 Subscribe:

GitHub - GokuMohandas/mlops-course: Learn how to design, develop, deploy and iterate on productio...

Learn how to design, develop, deploy and iterate on production-grade ML applications. - GokuMohandas/mlops-course

github.com

0

11

35

Goku Mohandas

@GokuMohandas

2 years

Be sure to checkout both of the new lessons for a lot more details and the step-by-step UI and code walkthroughs to set up and orchestrate these data workflows. And oh, we've got a few more updates on the website too :)

Home - Made With ML

Learn how to responsibly design, develop, deploy and iterate on production ML applications.

madewithml.com

1

0

34

Goku Mohandas

@GokuMohandas

7 years

My introductory @OReillyMedia tutorial with the @tensorflow team on interpretability in deep models.

2

8

34

Goku Mohandas

@GokuMohandas

7 years

Notes on the Joint Many-Task Model. Among my fav papers in 2016, expanding on a powerful concept for future models.

1

12

34

Goku Mohandas

@GokuMohandas

4 years

The research in CVPR this year has been producing quality results for tools to augment designers, storytellers, etc. So grateful to see this work on @madewithml and have it be discovered, tagged and curated for future learning/building!

Made With ML

@MadeWithML

4 years

🏆 Trending content of the day on @madewithml by Erik Härkönen (). This looks like so much fun to experiment with the layer-wise, interpretable controls and then save it and apply it to other images! Project details:

0

13

51

0

8

32

Goku Mohandas

@GokuMohandas

4 years

With @madewithml , I’m discovering so many great resources that I’d otherwise miss on Twitter because I wasn’t following the right people or online at the right time. I also realized that researchers/devs share amazing content on Twitter that never sees the light of day (cont.) 👇

1

3

31

Goku Mohandas

@GokuMohandas

3 years

The next lesson involves setting up CI/CD pipelines with @github actions and then just three more lessons before a major update/release! 🎓 Lessons: 🐙 Repository: 📬 Subscribe:

GitHub - GokuMohandas/mlops-course: Learn how to design, develop, deploy and iterate on productio...

Learn how to design, develop, deploy and iterate on production-grade ML applications. - GokuMohandas/mlops-course

github.com

1

2

29

Goku Mohandas

@GokuMohandas

8 years

Notes on "Understanding Deep Learning Requires Rethinking Generalization". A step in rethinking effective learning.

0

11

28

Goku Mohandas

@GokuMohandas

4 years

First lesson releases next week (📦 Product) & subsequent lessons will follow a weekly cadence. Be sure to follow me or @madewithml for updates, discussions & feedback because I'll be creating the course content dynamically using the community's feedback.

Made With ML

Learn how to responsibly deliver value with machine learning.

www.youtube.com

8

1

28

Goku Mohandas

@GokuMohandas

4 years

I learned the basics of JAX today (to counter cabin fever) and I found these resources really helpful: and especially this recent one by @RobertTLange . Let me know if there are any other notable resources I should check out!

Robert Lange

@RobertTLange

4 years

Puuuh. What are you up to these days? 💭 I try to stay sane, clean my place 🧹& write✍️. Todays edition - 'Getting started with #JAX '. Learn how to embrace the 'jit-grad-vmap' powers 💻 and code your own GRU-RNN in JAX. Stay safe & home. 🤗

3

49

215

2

13

28

Goku Mohandas

@GokuMohandas

4 years

It will be a hands-on, code-driven case study for the task we're trying to solve but also a guide for production ML / MLOps and clean software engineering. Next steps: If there’s a lot of interest, I’ll detail the specific task and the curriculum in a few days and share it.

6

2

24

Goku Mohandas

@GokuMohandas

1 year

🚀 No doubt that we’re headed towards multi-modal LLMs (text, image, etc.) but scaling those models (10B+) for inference (200TB) is a different story. 👇 Here's a detailed read on how @BytedanceTalk (TikTok's parent org) uses @raydistributed to achieve this (+ cost reductions).

ray

@raydistributed

1 year

@BytedanceTalk , the company behind TikTok, uses Ray for fast & cheap offline inference with multi-modal #LLMs . They generate embeddings for a staggering 200 TB of image and text data using a model with >10B parameters. 🧵 Thread below 👇

2

31

119

0

5

27

Goku Mohandas

@GokuMohandas

2 years

@chipro @lmoroney @josh_wills @sh_reya @jacopotagliabue Congratulations Chip! The amount of effort I've seen you pour into this is going to truly help so many to master ML systems!

0

24

Goku Mohandas

@GokuMohandas

3 years

Then we dive into implementing basic ML algorithms 1⃣ from scratch then 2⃣ in PyTorch. Starting from simple models → complex models. 📈 Modeling - Linear Regression - Logistic Regression - Neural Networks - Data Quality (⚠️ very important) - Utilities (for loading and training)

2

1

26

Goku Mohandas

@GokuMohandas

4 years

I like to build self-sustaining tools that enable others to learn. So far with @madewithml , we have the: 🗞 Newspaper: trending content of the day/week. 📚 Library: automatically up-to-date collection of the best resources by topic.

Made With ML

@MadeWithML

4 years

📚 Extremely excited to launch Made With ML Topics - a collection of the best ML tutorials, toolkits and research organized by topic. It’s all automatically organized, always up-to-date and curated by the community. Details below 👇

8

109

370

1

6

24

Goku Mohandas

@GokuMohandas

3 years

We wrap up the fundamentals by implementing deep learning algorithms in PyTorch. 🤖 Deep Learning - CNNs - Embeddings - RNNs - Transformers 💡 We motivate the need for specific architectures and additional complexity as we implement each method.

1

26

Goku Mohandas

@GokuMohandas

3 years

@sudalairajkumar @madewithml Thank you Sudalai! It’s going to be updated every few months to reflect the best practices from industry with a continued focus on the fundamentals so that readers can adapt to the changing landscape.

0

25

Goku Mohandas

@GokuMohandas

3 years

We want to ensure that our work is entirely reproducible by anyone. ♻️ Reproducibility - Git basics via workflows (dev, inspect, merge, etc.) - Pre-commit hooks (+ custom local) - Versioning code + config + data = models via DVC - Containerization via Docker

1

3

24

Goku Mohandas

@GokuMohandas

6 years

Storage, compute and interface with notebooks! "the most popular tool at Netflix for working with data — across the entire data platform user base —is now notebooks."

0

9

24

Goku Mohandas

@GokuMohandas

4 years

They developed a self-supervised framework that de-occludes scenes: 1. Extract the order of the objects via pair-wise relations 2. Compose objects to completion (amodal and content) 3. Recompose the entire scenery (!!!) For more by @ccloy et al.:

1

4

21

Goku Mohandas

@GokuMohandas

7 years

Super cool repertoire of computer graphics research and inspiration for assisted creative AI tools:

0

10

23

Goku Mohandas

@GokuMohandas

4 years

We're moving on to Scripting next week! - OOPs - Logging - Styling - Packaging - Makefile

1

2

21

Goku Mohandas

@GokuMohandas

3 years

ML is not a separate industry, instead, it's a powerful way of thinking about data. The foundations we've laid out will continue to hold but the methods and avenues of application will evolve. So these lessons are by no means "complete" and we'll continue to keep them up-to-date.

1

0

23

Goku Mohandas

@GokuMohandas

1 year

🎉 I'm thrilled to announce that @MadeWithML and I are now part of the @anyscalecompute team! Anyscale is the team behind Ray ( @raydistributed ) -- a framework for scaling & productionizing ML, used by companies like @OpenAI , @Spotify , @netflix + more.

Machine Learning for Developers

Anyscale is the leading AI application platform. With Anyscale, developers can build, run and scale AI applications instantly.

www.anyscale.com

2

3

23

Goku Mohandas

@GokuMohandas

8 years

Implementation of [1610.06258] Using Fast Weights to Attend to the Recent Past @geoff_hinton

GitHub - GokuMohandas/fast-weights: 🏃 Implementation of Using Fast Weights to Attend to the Recent...

🏃 Implementation of Using Fast Weights to Attend to the Recent Past. - GokuMohandas/fast-weights

github.com

1

12

21

Goku Mohandas

@GokuMohandas

1 year

🤔 A lot of people are wondering about the ROI on LLMs/GenAI. Can't imagine a better lineup to see it in action! 🚀

Robert Nishihara

@robertnishihara

1 year

Ray Summit this month will be 🔥🔥 🤯 ChatGPT creator @johnschulman2 🧙‍♀️ @bhorowitz on the AI landscape 🦹‍♂️ @hwchase17 on LangChain 🧑‍🚀 @jerryjliu0 on LlamaIndex 👨‍🎤 @zhuohan123 and @woosuk_k on vLLM 🧜 @zongheng_yang on SkyPilot 🧑‍🔧 @MetaAI on Llama-2 🧚‍♂️ @Adobe on Generative AI in

8

45

207

0

2

22

Goku Mohandas

@GokuMohandas

7 years

Notes on Opening the Black Box of Deep Neural Networks via Information. Nice way of looking at DNNs with an IB lens

0

11

22

Goku Mohandas

@GokuMohandas

4 years

Another extremely well written post by @jeremyjordan , this time on testing ML. He nails it with the image below. We'll be heavily focusing on a lot of what he touches on in @madewithml 's upcoming release next month!

Jeremy Jordan

@jeremyjordan

4 years

machine learning systems can be more difficult to test than traditional software systems since we're shoving most of the logic into parameters of a model. i wrote a blog post collecting my thoughts on the subject, let me know what you think!

18

179

645

1

7

22

Goku Mohandas

@GokuMohandas

3 years

Then, we wrap all of the CI/CD workflows we’ve created with @GitHub Actions: 🔄 CI/CD workflows - Workflow components (events, runners, jobs) - Testing Actions locally using Act - Best practices (ex. caching) - ML Actions (Great Expectations checkpoints, DVC CML)

1

3

20

Goku Mohandas

@GokuMohandas

7 years

Notes on Question Answering from Unstructured Text by Retrieval and Comprehension.

0

4

21

Goku Mohandas

@GokuMohandas

3 years

Next, we explore the infra needed to deploy & serve ML applications. 🛠️ Infrastructure: - Serving (batch, real-time) - Processing (batch, stream) - Learning (offline, online) - Testing (AB, canary, shadow) - Optimization (prune, quantize, distill) - Methods (K8s, serverless)

1

2

19

Goku Mohandas

@GokuMohandas

3 years

Behavioral testing via NLP Checklist: - invariance: change should not affect outputs - directional: change should affect outputs - minimum functionality: simple combination of inputs and expected outputs. - Creating tests interactively via Checklist:

1

5

19

Goku Mohandas

@GokuMohandas

4 years

@bhutanisanyam1 @madewithml I have non-CS background → founded a small (failed) startup → worked at a large company → worked at a small startup. Happy to share my journey, tips and challenges for those in research/industry and what I think really matters at the end of the day on your learning journey

1

0

20

Goku Mohandas

@GokuMohandas

4 years

A lot of people have been asking if their project is “good enough” to post on @madewithml . The answer is always YES because there’s always a crowd that cares about your niche project and they can now discover it via search or the Topics page. (cont.) 👇

1

2

19

Goku Mohandas

@GokuMohandas

3 years

The first MLOps lessons are on the Product development and iteration cycle. 📦 Product - Identify the core objective. - Design a solution with constraints. - Evaluation strategies that avoid bias. - Iterate via feedback and motivate adding complexity.

1

18

Goku Mohandas

@GokuMohandas

4 years

I’m recording the lesson on EDA and I’m curious how people work with notebooks and testing/formatting? N = @ProjectJupyter or @GoogleColab notebooks S = Python scripts → = transfer code If you're doing #3 , are you calling functions that are written in scripts into notebooks?

N → S (w/ tests + format)

72

N (w/ tests + format) → S

26

N + S (w/ autoreload)

23

only S (tests + format)

26

4

18

Goku Mohandas

@GokuMohandas

3 years

I currently work closely with early-stage & mid-sized companies in helping them deliver value with ML while diving into the best & bespoke practices of this rapidly evolving space. I want to share that knowledge with the rest of the world so we can accelerate overall progress.

1

0

18

Goku Mohandas

@GokuMohandas

3 years

I'll be updating many of the existing lessons with what I'm learning through my work but also from what I'm learning via collaborations w/ teams across many industries. It's an amazing time to witness how quickly research transitions to production-grade!

2

0

15

Goku Mohandas

@GokuMohandas

3 years

This thread was an abbreviated summary of the lessons in @madewithml ’s MLOps course, so be sure to check that out for much more detailed and up-to-date content. 🎓 Lessons: 🐙 Repository: 📬 Subscribe:

GitHub - GokuMohandas/Made-With-ML: Learn how to design, develop, deploy and iterate on productio...

Learn how to design, develop, deploy and iterate on production-grade ML applications. - GokuMohandas/Made-With-ML

github.com

1

4

18

Goku Mohandas

@GokuMohandas

3 years

Over the past 7 years, I've worked on ML and product at @Apple , health tech startups and ran my own venture in the rideshare space. I've worked with brilliant developers and managers and learned how to responsibly develop and iterate on ML systems across various industries.

1

0

18

Goku Mohandas

@GokuMohandas

8 years

Interesting look at parallelizing RNNs across timesteps. Very different approach from Quasi-RNNs.

0

4

18