Data Analytics can be divided into 5 types based on the questions it can answer.
Descriptive analytics
What happened?
Descriptive analyticsย answers questions about what happened.
Descriptive analytics techniques summarize large datasets to present insights to stakeholders.
Data Science Study Plans e-book for FREE
A well-researched and thorough collection of study plans for Excel, SQL, Python, Power BI, and Tableau.
As someone who transitioned to a Data Science career from a non-IT background, I have made sure it covers all topics from the
5 Types of Analytics
Data Analytics can be divided based on the 5 types of questions it can answer.
1) Descriptive analytics
โWhat happened?
Descriptive analyticsย answers questions about what happened.
Descriptive analytics techniques summarize large datasets to present
๐ค Confusion Matrix - How good was the prediction? ๐ค
This topic is repeatedly asked in many Data Science interviews
A confusion matrix is a table that is often used to describe the performance of a classification model on a set of test data for which the true values are known.
SQL for Data Science Complete Study Plan ๐
The timeline of 28 days and you have to dedicate at least 1.5 hours a day.
Week 1: Fundamentals of SQL
Day 1-3: Introduction to SQL syntax, SELECT statements, filtering, and sorting.
Resource: Khan Academy's "Intro to SQL" course on
๐งฉ SQL Joins Explained with Query + Visual ๐งฉ
Joins in SQL is a fundamental concept to combine data from different tables based on related columns.
/๐งต/
This is going to change Data Analytics!
A Dashboard UI inside the Jupyter Notebook which makes Visualizing Easy!
Using just one line of code and is easy to deploy as well!
Power BI for Data Science Complete Study Plan
10 hours a week to dedicate to this, here's a 12-week study plan
Week 1: Introduction to Power BI
Day 1-3: Begin with the official Power BI Guided Learning resources to get familiar with the platform.
Link:
๐ Python for Data Science Complete Study Plan ๐
Timeline of 12 weeks and you have to dedicate at least 7 hours a week.
๐๏ธWeek 1-2: Python Basics and Data Structures
Resource: YouTube playlist - Python Crash Course by Corey Schafer.
Link:
Watch
5 Types of Data Analytics Based on 5 Questions
1/ Descriptive analytics
What happened?
Descriptive analyticsย answers questions about what happened.
Descriptive analytics techniques summarize large datasets to present insights to stakeholders.
The presentation of data related
Excel for Data Science Complete Study Plan ๐
Timeline of 30 days and you have to dedicate at least 1 hour a day.
Week 1: Basics of Excel
Day 1-2: Excel Basics
Day 3-4: Understanding Formulas, Functions and Formatting
Day 5-7:
Storytelling with data is a KEY skill for a Data Scientist
But how do you actually tell a story and present your data in front of an audience?
Here's how :
/๐งต/
๐Tableau for Data Science Complete Study Plan
Timeline of 4 weeks and you have to dedicate at least 1 hour a day
Week 1: Introduction to Tableau
Day 1-3: Begin with the official Tableau Training Videos
Complete all the tutorials under โCreatorโ
Link:
Correlation is fundamental when it comes to the relationship between two variables
Correlation provides insights into how changes in one variable may relate to changes in another.
A correlation coefficient of +1 indicates a perfect positive correlation, meaning the variables
SQL for Data Science Complete Study Plan 2024๐
The timeline is 28 days, and you must dedicate at least 1.5 hours daily.
Week 1: Fundamentals of SQL
Day 1-3: Introduction to SQL syntax, SELECT statements, filtering, and sorting.
Resource: Khan Academy's "Intro to SQL"
Data Analyst vs Data Scientist: Whatโs the difference and how to choose one?
If youโre interested in working with data, you might be wondering what the difference is between a data analyst and a data scientist, and which one is right for you.
Hereโs a quick overview of the two
Good Resume = Data Science Interview
Don't let your hours of hard work go waste with a BAD resume
Watch these 5 Youtube Videos to craft the perfect one
/๐งต/
Python for Data Science Complete Study Plan
Timeline of 12 weeks and you have to dedicate at least 7 hours a week.
Week 1-2: Python Basics and Data Structures
Resource: YouTube playlist - Python Crash Course by Corey Schafer.
Link:
Watch video
Power BI Complete Study Plan
10 hours a week to dedicate to this, here's a 12-week study plan
Week 1: Introduction to Power BI
Day 1-3: Begin with the official Power BI Guided Learning resources to get familiar with the platform.
Link:
Day 4-7: Watch
If anybody says you don't need to be good at stats for Data Science they are lying!
But you can become good at stats by practicing!
Here are the topics:
KNN and K-Means
What do these two have in common?
These are important algorithms one must know and often confusing ones.
K-means and K-nearest neighbors (KNN) are both popular techniques in machine learning and data analysis, but they serve different purposes and belong to
๐งฉ SQL Joins Explained with Query + Visual
Joins in SQL is a fundamental concept to combine data from different tables based on related columns.
/๐งต/
In Data Warehouses, Dimension data is De-normalized. But why?
In most transactional databases that are used, the data isย normalizedย to reduce duplication.
In a data warehouse, however, the dimension data is generallyย de-normalizedย to reduce the number of joins required to query
ELT vs ETL: What difference does the order make?
A common problem that organizations face is how to gather data from multiple sources, in multiple formats.
Extract, transform, and load (ETL) pipelines first collect data from various sources.
It then transforms the data and
Data warehouse schema designs
In most transactional databases that are used, the data isย normalizedย to reduce duplication.
In a data warehouse, however, the dimension data is generallyย de-normalizedย to reduce the number of joins required to query the data.
Often, a data
Data Lakes vs Data Warehouse
How are they selected based on the Data Management Strategy?
Data Lakes:
Storage Approach: Data lakes store data in its raw, unstructured or semi-structured format.
Data Variety: Data lakes can handle diverse types of data, including structured,
Hypothesis testing isย an inferential statistical method.
Itโs often employed to make informed decisions based on available evidence.
At its core, hypothesis testing involves assessing the validity of a proposed hypothesis by evaluating sample data.
The process typically begins
Hackers can use SQL to hack your website!
SQL injection is a type of security vulnerability that occurs in web apps that use SQL databases
It happens when the attacker inserts a malicious SQL query in a web form OR URL parameter
For example, if a website uses a login form that
Python for Data Science Complete Study Plan
Timeline of 12 weeks and you have to dedicate at least 10 hours a week.
Only follow this if you are disciplined!
In SQL, what's the difference between:
DELETE, DROP, and TRUNCATE?
This might be asked in your next DS interview
๐ DELETE is a Data Manipulation Language (DML) command.
It can delete all rows or certain rows under conditions using WHERE.
DELETE cannot delete the
If you aren't using ChatGPT for Data Analysis you are already falling behind
Be it SQL Query Optimization or Chart Suggestions there are so many use cases that aren't used effectively
That's why I have compiled "ChatGPT for Data Science" to help you explore the integration of
A Random Forest is like consulting a group of experts before a decision.
A random forest isย a machine-learning technique that uses many decision trees to solve classification and regression problems.
It's a supervised learning algorithm that uses ensemble learning, which is a
Logistic Regression is for Yes or No, 1 or 0, Black or White.
Imagine you're trying to predict if it will rain or not.
You have data about the past, like the temperature, humidity, and whether it rained or not.
You want to use this data to predict if it will rain tomorrow.
Logistic Regression is for binary outcomes, like Yes or No, 1 or 0, Black or White.
Imagine you're trying to predict if it will rain or not.
You have data about the past, like the temperature, humidity, and whether it rained or not.
You want to use this data to predict if it
Hey everyone!
I'm thrilled to share some amazing news with you all!
Over the next 14 days, I'll be starting a journey to help you ace the Microsoft PL 300 certification exam.
In just two weeks, you'll be well-prepared to be a Microsoft-certified Power BI Analyst!
Now, you
Biggest pain point in landing a Data Science job?
A lot of candidates have very poor Resumes.
Their skill/ability is not conveyed properly to the recruiters
Writing Data Science resumes to get a job has never been this complicated before.
This is why I have created an e-book
Univariate, Bivariate, and Multivariate Analysis: Unveiling Data Dimensions
Univariate, bivariate, and multivariate analysis areย types of exploratory data analysis.ย
They are based on the number of variables being analyzed
Univariate analysis is the simplest and easiest form
Data Warehouse schema designs - Starโญ & Snowflake โ๏ธ
In most transactional databases that are used, the data isย normalizedย to reduce duplication.
In a data warehouse, however, the dimension data is generallyย de-normalizedย to reduce the number of joins required to query the
๐งฉ Simplifying SQL Joins: Lets deep dive ๐งฉ
Joins in SQL is a powerful technique that allows us to combine data from different tables based on related columns.
1/ INNER JOIN
This type of join returns only the matching rows from both tables.
It's like finding the intersection
A box plot is a chart that visually displays the distribution of numerical data.
Box plots, also known as box-and-whisker plots, are a fantastic way to display the distribution and key characteristics of a dataset.
They provide a clear and concise summary of:
๐Central
Data Analytics can be divided based on the 5 types of questions it can answer.
Descriptive analytics
What happened?
Descriptive analyticsย answers questions about what happened.
Descriptive analytics techniques summarize large datasets to present insights to stakeholders.
The
ETL vs. ELT: What's the difference?
Let's understand with business Examples
ETL
Extract, transform, and load (ETL) is a data pipeline used to collect data from various sources.
It then transforms the data according to business rules and loads the data into a destination data
Tableau Study Plan - 28 Days Timeline
Week 1: Introduction to Tableau
Day 1-3: Begin with the official Tableau Training Videos
Complete all the tutorials under โCreatorโ
Link:
Day 4-7: Watch the "Tableau Tutorials for Beginners" series
Link:
Feature importance helps us understand which features or variables impact the model's predictions most.
Feature importance is a crucial concept in data science, especially when building models for predictive tasks.
Let's break it down in simpler terms.
Imagine you're trying to
EDA is used to understand the datasets before diving into advanced analysis.
Exploratory Data Analysis is crucial, as it unveils:
>Characteristics of data
>Aids in the formulation of hypotheses
>Identification of patterns or anomalies
The steps of EDA include:
1.
SQL has two built-in functions for converting data types:
CAST and CONVERT
Type conversion is the process of converting one data type into another.
In SQL, data types can be converted either implicitly or explicitly.
Implicit conversions are not visible to the user.
SQL