Matt Harrison

@__mharrison__

Followers
159,752
Following
888
Media
1,809
Statuses
26,950

Python 🐍 + Data Science 🚀 trainer @__metasnake__ 🦜 Speaker ✍ Author 👨‍🏫 Instructor ( @Stanford ) 📣 DM for Sponsorship

SLC, Ut
Joined January 2010
Pinned Tweet
@__mharrison__
Matt Harrison
7 months
The book is out! After almost a year of work, Effective Pandas 2 is here. Covers: - New features of Pandas 2.2! - PyArrow types! - .case_when method! - Chapter on testing Pandas - Chapter on refactoring Pandas - Leveraging Cython & Numba If you are serious about taking
Tweet media one
28
59
458
@__mharrison__
Matt Harrison
3 years
Hi 👋 if you are interested in: 🐍 Python 🪄 Data Science 🔨 Career Advice Follow me. ✔ I'm planning on using Twitter to share a lot of content that you won't want to miss. 🎉
151
401
5K
@__mharrison__
Matt Harrison
3 years
Hi 👋 if you are interested in: 🐍 Python 🪄 Data Science 🔨 Career Advice Follow me. ✔ I'm planning on using Twitter to share a lot of content that you won't want to miss. 🎉
133
370
4K
@__mharrison__
Matt Harrison
3 years
Hi 👋 if you are interested in: 🐍 Python 🪄 Data Science 🔨 Career Advice Follow me. ✔ I'm planning on using Twitter to share a lot of content that you won't want to miss. 🎉
107
273
3K
@__mharrison__
Matt Harrison
3 years
Hello, I'm Matt 👋 I'm a Python and Data Science author and trainer. 🐍 Follow me to learn more about: 🖥 Python 🙈 Data analysis 🎨 Visualization 🤯 Machine Learning
38
138
2K
@__mharrison__
Matt Harrison
2 years
Hello, I'm Matt 👋 I'm a Python and Data Science author and trainer. 🐍 Follow me to learn more about: 🖥 Python 🙈 Data analysis 🎨 Visualization 🤯 Machine Learning
30
175
2K
@__mharrison__
Matt Harrison
2 years
Once you master groupbys, it will open a new world to you. 🤯🐼
Tweet media one
38
178
2K
@__mharrison__
Matt Harrison
2 years
Basic plots are easy. Making them look nice often takes a bit of work...
Tweet media one
Tweet media two
Tweet media three
20
138
1K
@__mharrison__
Matt Harrison
2 years
I passed 100K followers! 🤯🎉 I will randomly give away three copies of Effective Pandas to retweeters who reply with how they use Pandas. 🙏🐼
Tweet media one
181
321
1K
@__mharrison__
Matt Harrison
2 years
Hi I'm Matt 👋 I'm a Python & Data Science trainer.
Tweet media one
27
119
1K
@__mharrison__
Matt Harrison
2 years
A thread about pandas... (The library not the animal) 🐼🧵
Tweet media one
19
233
1K
@__mharrison__
Matt Harrison
3 years
Hi 👋 if you are interested in: 🐍 Python 🪄 Data Science 🔨 Career Advice Follow me. ✔ I'm planning on using Twitter to share a lot of content that you won't want to miss. 🎉
53
99
1K
@__mharrison__
Matt Harrison
3 years
Hello, I'm Matt 👋 I'm a Python and Data Science author and trainer. 🐍 Follow me to learn more about: 🖥 Python 🙈 Data analysis 🎨 Visualization 🤯 Machine Learning
28
108
1K
@__mharrison__
Matt Harrison
2 years
Here's a Pandas trick to add order to categorical values. 🤯🐼
Tweet media one
15
145
1K
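The code in the tweet's image is not available here, so the following is only a minimal sketch of one common way to add order to categorical values in Pandas, assuming a hypothetical column of T-shirt sizes. The names `sizes` and `size_type` are illustrative, not taken from the tweet.

```python
import pandas as pd

# Hypothetical data: T-shirt sizes stored as plain strings.
sizes = pd.Series(["M", "S", "XL", "L", "S"])

# An ordered categorical dtype declares that S < M < L < XL.
size_type = pd.CategoricalDtype(categories=["S", "M", "L", "XL"], ordered=True)
ordered = sizes.astype(size_type)

# Sorting and comparisons now follow the declared order, not the alphabet.
sorted_sizes = ordered.sort_values().tolist()
at_least_large = (ordered >= "L").tolist()

print(sorted_sizes)
print(at_least_large)
```

Without the ordered dtype, sorting the strings would place "L" before "M" and "S", which is rarely what a size column should do.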
@__mharrison__
Matt Harrison
2 years
Hi, I'm Matt 👋 I'm a Python & Data Science corporate trainer. I tweet about: 🐍 Python 🐍 Python warts 🐍 Python features 📊 Visualization 🐼 Pandas 🐼 Pandas warts 🐼 Writing better Pandas code 🚀 Machine Learning 📚 Books 🐃 Emacs (🦆🦆🦆) Thanks for following me! 🙏
25
110
1K
@__mharrison__
Matt Harrison
3 years
The book is out!
Tweet media one
67
150
1K
@__mharrison__
Matt Harrison
2 years
I often teach about Decorators in Python. Many know how to use them, but few can write them. These are tricky because nested functions make our brains hurt. Here are some hints for grokking them. 1/
Tweet media one
15
189
1K
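The thread's hints are in the attached image and are not reproduced here. As a rough illustration of the nested-function pattern the tweet mentions, here is a small sketch of a decorator; `count_calls` and `add` are hypothetical names chosen for the example.

```python
import functools


def count_calls(func):
    """Decorator that records how many times func has been called."""

    @functools.wraps(func)  # keep the wrapped function's name and docstring
    def wrapper(*args, **kwargs):
        wrapper.calls += 1
        return func(*args, **kwargs)

    wrapper.calls = 0
    return wrapper  # the outer function returns the inner function


@count_calls
def add(x, y):
    return x + y


total = add(2, 3)
add(10, 20)
print(total, add.calls)
```

The outer function runs once, when `@count_calls` is applied. The inner `wrapper` runs on every call to `add`, which is the part that tends to make nested functions confusing at first.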
@__mharrison__
Matt Harrison
2 years
From the Effective Pandas book ...
Tweet media one
25
139
1K
@__mharrison__
Matt Harrison
2 years
Big O notation (pronounced Big-Oh) is a way to discuss how long an algorithm takes to run. It is also called "runtime complexity". Understanding this is key for technologists. Let's look at it. 1/
21
199
1K
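The rest of the thread is not included on this page. As one possible illustration of runtime complexity, the sketch below counts the comparisons made by a linear search, which is O(n), and a binary search, which is O(log n), on the same sorted list. The function names are made up for this example.

```python
def linear_search_steps(items, target):
    """Count comparisons made by a linear scan: O(n)."""
    steps = 0
    for item in items:
        steps += 1
        if item == target:
            break
    return steps


def binary_search_steps(items, target):
    """Count comparisons made by binary search on sorted items: O(log n)."""
    steps = 0
    lo, hi = 0, len(items) - 1
    while lo <= hi:
        steps += 1
        mid = (lo + hi) // 2
        if items[mid] == target:
            break
        if items[mid] < target:
            lo = mid + 1
        else:
            hi = mid - 1
    return steps


data = list(range(1_000_000))
linear = linear_search_steps(data, 999_999)
binary = binary_search_steps(data, 999_999)
print(linear, binary)
```

Searching for the last element takes a million comparisons with the linear scan and at most about twenty with binary search, which is the kind of gap Big O notation is meant to describe.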
@__mharrison__
Matt Harrison
1 year
Recently, I've been working on a new course for data analytics. This Python code stands out for its ability to turn raw sales data into insightful visualizations. I want to share why this code is crucial and how it could help transform your data analysis process.
Tweet media one
10
189
1K
@__mharrison__
Matt Harrison
2 years
Dealing with dictionary data in Pandas.
Tweet media one
19
115
1K
@__mharrison__
Matt Harrison
2 years
PyCon is next week. For the first time in a while I'm not teaching or presenting... but I do have some books to give away.
Tweet media one
30
91
966
@__mharrison__
Matt Harrison
2 years
A slight tweak to use NumPy results in a massive speedup. 16,000X faster! 😱
Tweet media one
13
130
938
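The tweet's actual code and benchmark are in the image and are not shown here, so the 16,000X figure cannot be reproduced from this page. The following is only a generic sketch of the idea of replacing a Python-level loop with a NumPy operation, assuming a sum of squares as the hypothetical workload.

```python
import numpy as np

# Hypothetical workload: sum of squares over an array of floats.
values = np.arange(1_000, dtype=np.float64)


def loop_sum_of_squares(arr):
    """Python-level loop: one interpreter step per element."""
    total = 0.0
    for x in arr:
        total += x * x
    return float(total)


def vectorized_sum_of_squares(arr):
    """Same calculation pushed into NumPy's compiled code."""
    return float(np.dot(arr, arr))


print(loop_sum_of_squares(values), vectorized_sum_of_squares(values))
```

Both functions return the same value. How much faster the vectorized version runs depends on the array size and the machine, so it is worth timing on real data (for example with `%timeit` in Jupyter).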
@__mharrison__
Matt Harrison
2 years
Soon after learning Python you should learn how to test Python. Very soon after learning how to test Python code you should learn the pytest library!
18
74
829
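For readers who have not seen pytest, here is a minimal sketch of what a test file might look like. `slugify` is a hypothetical function under test, not something from the tweet. pytest collects functions whose names start with `test_` and reports any failing plain `assert`.

```python
def slugify(title):
    """Hypothetical function under test: lowercase words joined by hyphens."""
    return "-".join(title.lower().split())


def test_slugify_joins_words_with_hyphens():
    assert slugify("Effective Pandas") == "effective-pandas"


def test_slugify_collapses_extra_whitespace():
    assert slugify("  Big   O  ") == "big-o"
```

Saved as a file such as `test_slugify.py`, this can be run with the `pytest` command from the same directory.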
@__mharrison__
Matt Harrison
10 months
Want to speed up your Pandas code by 10-1000x? With no code change? The folks from @nvidia have created cuDF pandas accelerator mode. By using this line in Jupyter, you automatically leverage your GPU to run Pandas code: %load_ext cudf.pandas From command-line: python -m
Tweet media one
15
133
840
@__mharrison__
Matt Harrison
3 years
Tell me you are a Python programmer without telling me you are a Python programmer... 🤔🐍
680
72
817
@__mharrison__
Matt Harrison
3 years
Hi I'm Matt 👋 I'm a Python & Data Science trainer.
Tweet media one
10
63
803
@__mharrison__
Matt Harrison
2 years
One of my students (their code at top) was struggling with converting an HH:MM column to duration in minutes. Here's how I would do it... 🐼🤯
Tweet media one
17
92
793
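The student's code and the suggested fix are in the image and are not available here. One way this conversion could be done, assuming a hypothetical string column in HH:MM form, is sketched below.

```python
import pandas as pd

# Hypothetical HH:MM strings; the real column is not shown in the tweet text.
durations = pd.Series(["01:30", "00:45", "10:05"])

# Split on the colon into two integer columns: hours (0) and minutes (1).
parts = durations.str.split(":", expand=True).astype(int)

# Total duration in minutes.
minutes = parts[0] * 60 + parts[1]
print(minutes.tolist())
```

This stays vectorized, so there is no need to loop over rows or call `.apply` with a Python function.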
@__mharrison__
Matt Harrison
2 years
Pandas tip 🐼🤓 Sometimes you want to get all of the values for each category, but there might be a different amount for each category. This example shows each recorded daily snowfall for every month. Combine .pivot with .apply! 🤯🐼
Tweet media one
9
103
768
@__mharrison__
Matt Harrison
2 years
When do you use lambdas in Python? 🤔
168
72
754
@__mharrison__
Matt Harrison
3 years
Hi 👋, if you are intrigued by: 🐍 Python 🪄 Data Science 🔨 Career Advice Follow me. ✔ I'm planning on using Twitter to share a lot of content that you won't want to miss. 🎉
25
67
744
@__mharrison__
Matt Harrison
1 year
Challenge accepted... 😉
Tweet media one
@tayloramurphy
Taylor A Murphy
1 year
sure your data tool has export to csv, but does it have this??
Tweet media one
16
13
273
15
82
724
@__mharrison__
Matt Harrison
2 years
I just passed 130K followers on Twitter 1.0. 🤯🙏 AMA ❓ I will give away three copies of Effective Pandas to those who retweet. 😉
Tweet media one
52
707
657
@__mharrison__
Matt Harrison
2 years
PCA is such a handy tool to have in your toolbelt. 📈🤯 Here's some code to filter down the number of components and visualize them.
Tweet media one
17
110
692
@__mharrison__
Matt Harrison
2 years
You think my Pandas code is bad, here is some SQL that I wrote 15 years ago...
Tweet media one
42
45
678
@__mharrison__
Matt Harrison
2 years
How do you practice coding?
134
78
657
@__mharrison__
Matt Harrison
1 year
I created a simple example data science project. It includes a makefile that you can copy and use to easily make your own. It includes jupyter, tests, docker, dash, pipelines, CI, and codespaces.
Tweet media one
6
133
674
@__mharrison__
Matt Harrison
2 years
PCA and clustering. Two awesome tools that go well together. 🤯
Tweet media one
12
51
664
@__mharrison__
Matt Harrison
2 years
In my Machine Learning courses I often discuss PCA. I love having this tool in my toolbelt for EDA and learning from the data. 🤯
Tweet media one
17
68
646
@__mharrison__
Matt Harrison
2 years
How would I normally visualize this data? With a bar plot. 🐼👍
Tweet media one
Tweet media two
12
65
623
@__mharrison__
Matt Harrison
2 years
One of my students (their code at top) was struggling with converting an HH:MM column to duration in minutes. Here's how I would do it... 🐼🤯
Tweet media one
9
58
618
@__mharrison__
Matt Harrison
2 years
Soon after learning Python you should learn how to test Python. Very soon after learning how to test Python code you should learn the pytest library!
17
89
603
@__mharrison__
Matt Harrison
3 years
Last week I taught a course that covered Decorators in Python. Many know how to use them, but few can write them. These are tricky because nested functions make our brains hurt. Here are some hints for grokking them. 1/
Tweet media one
7
121
615
@__mharrison__
Matt Harrison
2 years
Looking for practical data science books? 🎓Ace the Interview by @NickSinghTech 🚀ML by @rasbt 🥇Kaggle by @tng_konrad @lucamassaron 🐼Pandas by @__mharrison__
Tweet media one
10
106
590
@__mharrison__
Matt Harrison
2 years
This is a wonderful overview of plotting with various Python libraries! 📈🐍
17
128
583
@__mharrison__
Matt Harrison
2 years
Working on a new Pandas course. It is meant to teach beginners correct first principles from the start. This will be a live, cohort-based class where you get to work on your real-world data while learning Pandas! 🤯🐼
Tweet media one
10
58
585
@__mharrison__
Matt Harrison
2 years
Here are my recommendations for modern data analysis: 👉 Python 🐍 👉 Jupyter 🪐 👉 Pandas 🐼 👉 Matplotlib 📊 👉 Sklearn 🚀 You can't go wrong with them.
16
95
584
@__mharrison__
Matt Harrison
2 years
What is your favorite Python 3 feature? 🤔 Mine is probably f-strings.
76
24
580
@__mharrison__
Matt Harrison
2 years
I need to hire this person as a copywriter 🤯
Tweet media one
17
36
575
@__mharrison__
Matt Harrison
2 years
Tell me you are a Python programmer without telling me you are a Python programmer... 🤔🐍
473
33
549
@__mharrison__
Matt Harrison
3 years
What is your favorite Python 3 feature? 🤔 Mine is probably f-strings.
73
11
552
@__mharrison__
Matt Harrison
2 years
Hi 👋 if you are interested in: 🐍 Python 🪄 Data Science 🔨 Career Advice Follow me. ✔ I'm planning on using Twitter to share a lot of content that you won't want to miss. 🎉
21
71
538
@__mharrison__
Matt Harrison
2 years
Roll your own candles. 🎂📉
Tweet media one
10
75
542
@__mharrison__
Matt Harrison
2 years
Use Pandas to combine a bunch of CSV files, drop dupes, then export to the clipboard! 🐼🤯
Tweet media one
10
58
545
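The code from the image is not reproduced on this page. A minimal sketch of the workflow the tweet describes might look like the following, assuming a folder of CSV files with identical columns. The file names and data are made up so the example is self-contained.

```python
import glob
import os
import tempfile

import pandas as pd

with tempfile.TemporaryDirectory() as folder:
    # Create two small hypothetical CSV files with one overlapping row.
    pd.DataFrame({"id": [1, 2], "qty": [10, 20]}).to_csv(
        os.path.join(folder, "a.csv"), index=False
    )
    pd.DataFrame({"id": [2, 3], "qty": [20, 30]}).to_csv(
        os.path.join(folder, "b.csv"), index=False
    )

    # Read every CSV in the folder, stack them, and drop duplicate rows.
    paths = sorted(glob.glob(os.path.join(folder, "*.csv")))
    combined = (
        pd.concat([pd.read_csv(path) for path in paths], ignore_index=True)
        .drop_duplicates()
        .reset_index(drop=True)
    )

print(combined)
# combined.to_clipboard(index=False)  # needs a system clipboard, so left commented out
```

The final `to_clipboard` call is commented out because it fails on machines without a clipboard backend, such as headless servers.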
@__mharrison__
Matt Harrison
2 years
After my Python "training", my students still don't know how to use the %s formatting specifier in a print statement. 🤦 That's because I teach them f-strings. 😜
39
23
540
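For comparison, here is a short sketch of the two formatting styles mentioned above, using made-up values.

```python
# Hypothetical values for illustration.
name = "Ada"
followers = 159752

# Older printf-style formatting with the % operator.
old_style = "%s has %d followers" % (name, followers)

# f-string: expressions go inline inside the braces.
new_style = f"{name} has {followers} followers"

# Format specifiers still work, e.g. a thousands separator.
with_commas = f"{name} has {followers:,} followers"

print(old_style)
print(new_style)
print(with_commas)
```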
@__mharrison__
Matt Harrison
2 years
Tell me you are a Python programmer without telling me you are a Python programmer... 🤔🐍
432
38
528
@__mharrison__
Matt Harrison
2 years
What do you use Python for?
230
44
515
@__mharrison__
Matt Harrison
2 years
Are you looking for the best free courses in Python, ML, or Visualization? ✅🐍😊
14
96
515
@__mharrison__
Matt Harrison
2 years
If you are considering learning Pandas, I'm doing a cohort-based course at the end of this month. This is a great way to learn Pandas in a group-based setting.
Tweet media one
7
54
523
@__mharrison__
Matt Harrison
2 years
Looking for practical data science books? 🎓Ace the Interview by @NickSinghTech 🚀ML by @rasbt 🥇Kaggle by @tng_konrad @lucamassaron 🐼Pandas by @__mharrison__
Tweet media one
5
70
518
@__mharrison__
Matt Harrison
2 years
Ever wanted to make a dumbbell plot with Pandas? 🐼🏋️‍♀️
Tweet media one
9
50
511
@__mharrison__
Matt Harrison
1 year
Creating my monthly sales report is 4 lines of code...
Tweet media one
26
51
511
@__mharrison__
Matt Harrison
2 years
Sad that people take content and don't give credit... 😢
Tweet media one
27
39
496
@__mharrison__
Matt Harrison
2 years
What language do you use when you don't use Python?🐍🤔
469
33
479
@__mharrison__
Matt Harrison
2 years
Customizing a Plotly plot is no easier (nor harder) than customizing a Matplotlib plot. Choose the former if you need more interactivity or the latter if you want to create publication-quality plots.
Tweet media one
Tweet media two
Tweet media three
Tweet media four
18
71
491
@__mharrison__
Matt Harrison
3 years
Big O notation (pronounced Big-Oh) is a way to discuss how long an algorithm takes to run. It is also called "runtime complexity". Understanding this is key for technologists. Let's look at it. 1/
6
80
478
@__mharrison__
Matt Harrison
2 years
Sometimes you need a loop to perform a calculation. With a little nudging from numba, you can get a 30x speed improvement. 🤯
Tweet media one
13
53
465
@__mharrison__
Matt Harrison
2 years
Use .set_sticky with .style to keep the index in view on wide dataframes.
Tweet media one
6
45
462
@__mharrison__
Matt Harrison
2 years
Love seeing happy readers. "Harrison’s book presents the most comprehensive collection of “best practices” for cleaning and preparing data using Pandas. WARNING: might turn you into a Pandas snob…" Walter U.
Tweet media one
5
49
463
@__mharrison__
Matt Harrison
1 year
Great week teaching Pandas (and Matplotlib and Seaborn). The new book should be coming out next week!
Tweet media one
12
54
441
@__mharrison__
Matt Harrison
2 years
How do you log in Python? What libraries do you use for it? 🤔
114
49
440
@__mharrison__
Matt Harrison
2 years
Annotated Matplotlib...
Tweet media one
10
51
443
@__mharrison__
Matt Harrison
2 years
Just passed 77k followers. 🎉🙏 I'm a Python & Data Science corporate trainer. I tweet about: 🐍 Python 🐍 Python warts 🐍 Python features 📊 Visualization 🐼 Pandas 🐼 Pandas warts 🐼 Writing better Pandas code 🚀 Machine Learning 📚 Books 🐃 Emacs (🦆🦆🦆)
7
46
427
@__mharrison__
Matt Harrison
2 years
What are your favorite Jupyter tips?
81
58
435
@__mharrison__
Matt Harrison
2 years
Creating my monthly sales report is 4 lines of code...
Tweet media one
10
37
420
@__mharrison__
Matt Harrison
2 years
What are your favorite data podcasts? 🤔
37
39
428
@__mharrison__
Matt Harrison
3 years
Hi 👋, if you are intrigued by: 🐍 Python 🪄 Data Science 🔨 Career Advice Follow me. ✔ I'm planning on using Twitter to share a lot of content that you won't want to miss. 🎉
12
50
404
@__mharrison__
Matt Harrison
2 years
Pandas tip 🐼🤓 Sometimes you want to get all of the values for each category, but there might be a different amount for each category. This example shows each recorded daily snowfall for every month. Combine .pivot with .apply! 🤯🐼
Tweet media one
7
62
406
@__mharrison__
Matt Harrison
3 years
What is the most common issue you have when coding with Python? 🤔🐍
166
34
391
@__mharrison__
Matt Harrison
1 year
I love seeing this plotting improvement from my student. With a bit of tweaking from Matplotlib, you tell a different story.
Tweet media one
23
25
409
@__mharrison__
Matt Harrison
2 years
Did you teach yourself to code? What would you change if you were to do it again?
141
39
389
@__mharrison__
Matt Harrison
2 years
Code is usually written once or twice but read often. You should optimize for readability when you write code. (This includes data scientists writing Pandas or ML code). 🤯
10
41
390
@__mharrison__
Matt Harrison
8 months
Working on a new feature engineering course...
Tweet media one
4
41
390