How to become a Data Analyst FAST (By 2025)

Rohan Adus
22 Sept 202415:40

Summary

TLDRThis video offers a comprehensive roadmap for aspiring data analysts to secure their first role by 2025. It emphasizes the importance of consistent effort, understanding statistics as the language of data, and mastering Excel for data manipulation. The speaker also highlights the necessity of learning SQL for database management and Python or R for scripting. The video suggests practical steps, including joining Discord communities, practicing with real datasets, and building a portfolio to showcase skills. It assures viewers that breaking into the field is achievable with dedication and the right resources, even without a formal degree.

Takeaways

  • 😀 The video provides a roadmap to land a data analyst role by 2025, emphasizing the importance of consistency and discipline in learning.
  • 🎓 You don't need a relevant degree or extensive experience to break into data analytics; the focus should be on practical skills.
  • đŸ€ Joining a community like a Discord server can provide valuable networking opportunities and support in learning analytics.
  • 📊 Understanding statistics is crucial as it forms the basis for data analysis, including concepts like descriptive statistics, correlation, and probability.
  • 📈 Excel and Google Sheets are essential tools for data analysts, with pivot tables, conditional formatting, and visualization being key skills to master.
  • đŸ’Ÿ SQL is a must-have skill for data analysts, used for retrieving and manipulating data from relational databases, and it's important to learn its basics and efficiency.
  • 🐍 Learning Python or R is beneficial, with Python being more versatile for various fields including data analysis, and key libraries to learn include pandas, numpy, and matplotlib.
  • 📊 Visualization tools like Tableau, Looker, or PowerBI are important for presenting data insights effectively, and storytelling with data is a valuable skill.
  • đŸ’Œ Building a portfolio showcasing your data analysis projects can significantly improve your job prospects in the analytics field.
  • 🔗 Consistent practice and application of learned skills are necessary for mastering data analysis, and showing your work is more impactful than just stating your abilities.

Q & A

  • What is the main goal of the video?

    -The main goal of the video is to provide a roadmap to help viewers land their first data analyst role before 2025, regardless of their current profession or educational background.

  • Why is consistency and discipline emphasized in the video?

    -Consistency and discipline are emphasized because breaking into the field of data analytics requires daily effort and commitment, rather than relying on quick fixes or shortcuts.

  • What is the significance of joining a Discord server for beginners in data analytics?

    -Joining a Discord server is significant because it offers a community where beginners can receive referrals, collaborate on group projects, and get support from others in the field, which is beneficial for breaking into tech and data science.

  • Why is having a grasp of statistics important for a data analyst?

    -Statistics is important because it is the language of data and analytics. It helps in understanding and interpreting data, conducting analyses, and ensuring that the analysis is not plagued by bias.

  • What are the basic statistical concepts a data analyst should understand?

    -A data analyst should understand descriptive statistics like mean, median, mode, and standard deviation, as well as concepts of correlation, probability, and hypothesis testing.

  • How can one learn the necessary statistical concepts without a formal degree?

    -One can learn the necessary statistical concepts through free online courses on platforms like Coursera, Khan Academy, or by watching educational videos on YouTube.

  • Why is Excel or Google Sheets a crucial skill for a data analyst?

    -Excel or Google Sheets is crucial because it is a ubiquitous tool used across industries for data manipulation, analysis, and visualization, and is often required for day-to-day tasks in data analysis.

  • What are some key Excel or Google Sheets skills that a data analyst should master?

    -Key skills include creating pivot tables for data summarization, using conditional formatting to highlight data patterns, and creating visualizations and dashboards for data presentation.

  • What is SQL and why is it essential for a data analyst?

    -SQL (Structured Query Language) is a tool used to retrieve, manipulate, and analyze data stored in relational databases. It is essential because it allows data analysts to communicate with and extract insights from large datasets.

  • How can one practice SQL effectively?

    -One can practice SQL effectively by using online platforms like W3Schools, Codecademy, or by participating in coding challenges on websites like LeetCode or HackerRank. Additionally, creating databases with multiple tables using tools like SQLite can help practice joins and other SQL queries.

  • Why is learning Python or R recommended for a data analyst?

    -Python or R are recommended because they are scripting languages with powerful libraries for data manipulation, numerical operations, and visualization, which are crucial for advanced data analysis and automation.

  • What are some important Python libraries for data analysis that the video suggests learning?

    -The video suggests learning libraries such as pandas for data manipulation, numpy for numerical operations, matplotlib or seaborn for visualization, and scikit-learn if interested in machine learning.

  • How can one learn data visualization tools like Tableau?

    -One can learn data visualization tools like Tableau through online courses on platforms like Udemy or LinkedIn Learning, by practicing with Tableau Public, or by using free trials of other business intelligence tools.

  • What is the importance of creating a portfolio in the context of job hunting for data analyst roles?

    -Creating a portfolio is important because it showcases practical experience and technical abilities to potential employers, giving job candidates a competitive edge over others who may not have tangible examples of their work.

Outlines

00:00

🚀 Introduction to Breaking into Data Analytics

The speaker introduces a roadmap to secure a data analyst role by 2025, emphasizing the video's relevance to both working professionals and students. They highlight that despite changes in the job market, it's possible to break into data analytics without a relevant degree or extensive experience. The speaker encourages viewers to be consistent and disciplined in their efforts, dismissing the idea of a 'magic pill' for success. They recommend joining a Discord server for networking and group projects, and stress the importance of understanding statistics as the language of data analytics, suggesting that a fundamental grasp is sufficient rather than expertise.

05:02

📊 Importance of Statistics and Excel Proficiency

The speaker discusses the necessity of understanding descriptive statistics, correlation, and probability for data analysis. They clarify that correlation does not imply causation, using an example of marketing budget and sales. The speaker then transitions into the importance of Excel and Google Sheets, stating that these tools are ubiquitous across industries. They suggest that viewers should practice using Excel for pivot tables, conditional formatting, and visualizations to solidify their learning. The speaker also recommends using online resources and YouTube for learning Excel, and emphasizes the value of hands-on practice with datasets from kaggle.com.

10:03

đŸ’Ÿ Mastering SQL for Data Manipulation

The speaker moves on to SQL, explaining that it's essential for retrieving and analyzing data from relational databases. They simplify SQL learning by breaking it down into basic components like SELECT, FROM, WHERE, JOIN, GROUP BY, and HAVING clauses. The speaker insists that SQL is a must-have skill for data analysts and that efficiency in querying is critical when dealing with large datasets. They recommend using W3Schools and YouTube for learning SQL and suggest practicing on platforms like LeetCode and HackerRank. The speaker also advises creating mock databases with multiple tables for practice, as real datasets often come as single tables.

15:03

🐍 Python and R for Advanced Data Analysis

The speaker recommends learning Python or R for more advanced data analysis, with a personal preference for Python due to its versatility and large community. They highlight key libraries such as pandas for data manipulation, numpy for numerical operations, and matplotlib or seaborn for data visualization. The speaker also touches on the utility of Python for automation and ETL processes. They suggest learning through practical application and recommend using resources like DataCamp and online courses for learning. The speaker concludes by emphasizing the importance of learning a BI tool like Tableau for data visualization, suggesting that storytelling with data is crucial for effective communication of insights.

📈 Visualization and Building a Portfolio

The speaker stresses the importance of data visualization and storytelling, recommending Tableau for its interactive capabilities and the availability of a free public version. They advise learning the science of visualization and practicing with tools like Google Sheets before moving on to more advanced tools. The speaker also encourages viewers to build a portfolio showcasing their projects, which can significantly boost their job market prospects. They conclude by emphasizing the time and consistent effort required to master these skills, suggesting that by 2025, viewers who follow the roadmap will be well-prepared for a career in data analytics.

Mindmap

Keywords

💡Data Analyst

A Data Analyst is a professional who collects, processes, and interprets complex digital data to help businesses make informed decisions. In the video, the main theme revolves around guiding viewers on how to secure a Data Analyst role, emphasizing the importance of various skills and tools necessary for the job.

💡Statistics

Statistics is a branch of mathematics dealing with the collection, analysis, interpretation, presentation, and organization of data. In the context of the video, statistics is described as the 'language of data and analytics,' highlighting its fundamental role in understanding and communicating data insights effectively.

💡Descriptive Statistics

Descriptive statistics summarize and organize data to give an overall picture. The video mentions that understanding descriptive statistics like mean, median, mode, and standard deviation is crucial for painting a holistic view of a dataset before delving into detailed analysis.

💡Correlation

Correlation measures the relationship between two variables. The video script clarifies that while correlation indicates a relationship, it does not imply causation, which is a critical concept for data analysts to understand when interpreting data.

💡Probability

Probability is used to express the likelihood of a given event and is fundamental in making inferences and predictions from data. The video suggests that a basic understanding of probability and hypothesis testing is necessary for data analysts to test their assumptions about the data.

💡Excel

Excel is a widely used spreadsheet program for data organization, analysis, and visualization. The video emphasizes Excel's importance in data analysis, mentioning pivot tables, conditional formatting, and data visualization as key features that data analysts should master.

💡SQL

SQL (Structured Query Language) is used for managing and manipulating databases. The video stresses that SQL is a must-have skill for data analysts, as it allows them to retrieve, manipulate, and analyze data stored in relational databases, which is a common task in the role.

💡Python

Python is a versatile programming language that is highly recommended for data analysis due to its simplicity and the powerful libraries available for data manipulation and analysis. The video suggests learning Python for tasks such as data cleaning, numerical operations, and automation.

💡Pandas

Pandas is a Python library for data manipulation and analysis. It is mentioned in the video as an essential tool for data analysts to clean, organize, and analyze data efficiently, allowing for tasks like filtering and joining tables similar to SQL.

💡Visualization

Visualization refers to the graphical representation of data to make it easier to understand and interpret. The video discusses the importance of visualization in data analysis, suggesting that tools like Tableau and Python libraries can be used to create interactive and insightful visual representations of data.

💡Portfolio

A portfolio showcases an individual's work and skills. In the video, building a portfolio is recommended as a way to demonstrate practical experience and skills to potential employers, which can significantly enhance a job candidate's appeal in the job market.

Highlights

A guaranteed roadmap to land your first data analyst role before 2025 is presented, applicable for both working professionals and students.

The video promises a 99% success rate if viewers follow through with the tips and make them actionable.

It's emphasized that breaking into data analytics is possible from unrelated fields, and viewers are encouraged to commit to consistent effort.

Joining a Discord server with over 5,000 members can provide valuable networking and collaboration opportunities.

Statistics is described as the language of data and analytics, with a focus on understanding descriptive statistics like mean, median, mode, and standard deviation.

The importance of recognizing correlation without assuming causation is highlighted.

Probability and hypothesis testing are introduced as fundamental concepts for making inferences and testing them with data.

Excel and Google Sheets are touted as essential tools for data analysts, with a focus on pivot tables, conditional formatting, and visualizations.

SQL is a must-have skill for data analysts, with a focus on understanding queries, joins, and aggregation functions.

Efficiency in SQL querying is stressed as crucial when dealing with large datasets.

Learning Python or R is recommended, with a focus on libraries like pandas, numpy, and matplotlib for data manipulation and visualization.

Data visualization tools like Tableau, Looker, and PowerBI are discussed, with Tableau being recommended for its free public version.

The importance of storytelling with data and creating dashboards that simplify complexity and provide actionable insights is emphasized.

Building a portfolio and showcasing practical experience is advised as a key strategy for standing out in the job market.

Consistency in learning and practicing is highlighted as the key to success in breaking into data analytics by 2025.

Transcripts

play00:00

so today I'm going to be walking you

play00:01

through a guaranteed road map to land

play00:02

your first dat analyst role before 2025

play00:05

whether you're working professional

play00:06

student this video is for you now the

play00:09

information in this video is 99%

play00:11

guaranteed if you actually follow

play00:12

through to the video to the end take

play00:14

notes and also take the tips and make

play00:16

them actionable listen times have

play00:17

changed and so has the job market I've

play00:19

seen countless of people break into Data

play00:21

from unrelated Fields unrelated Gees in

play00:24

2024 and I know you can too you might

play00:26

think you need this relevant degree or

play00:28

you might need years of experience or

play00:31

100 projects on the side to land your

play00:32

first rooll but I can tell you firsthand

play00:34

you can break in without these now

play00:36

before we get into the specifics I want

play00:37

you to make a commitment to yourself and

play00:39

myself if you do want to break into

play00:41

analytics you do need to be consistent

play00:43

and you need to be disciplined and put

play00:44

in the work every single day to actually

play00:46

break in there is no magic pill there is

play00:48

no magic formula that you can just watch

play00:50

a 8 Hour course and become a data

play00:51

analyst that's not true you didn't make

play00:52

a consistent effort every single day now

play00:55

if you are completely new to the field I

play00:56

recommend joining the Discord server

play00:58

down below there over 5,000 people in it

play01:00

people provide referrals people work on

play01:01

group projects together and it's

play01:02

honestly one of the best is good servers

play01:04

for actually breaking into Tech and data

play01:06

science and analytics now let's talk

play01:07

statistics now it might be pretty

play01:09

intimidating you might be thinking math

play01:11

uh it sounds too complicated it's not

play01:13

for me but you have to understand

play01:14

statistics is the language of data and

play01:16

analytics well you don't need to be an

play01:17

expert by any means you don't need a PhD

play01:19

you do need to have a fundamental grasp

play01:21

of the basics of Statistics you might be

play01:22

asked question like what is the average

play01:24

customer spent what is the variance in

play01:26

the data can you conduct the split test

play01:28

for me and actually to answer these

play01:30

questions you do need to have some grasp

play01:32

of Statistics to make sure you aren't

play01:33

plagued by bias and the analysis

play01:35

actually makes sense for your

play01:36

stakeholders so the first part of stats

play01:38

is understanding descriptive statistics

play01:40

this can be the mean median mode and

play01:42

standard deviation of the data set they

play01:43

basically paint a holistic picture of

play01:45

the data set before you actually dive

play01:47

into the specific analysis you're

play01:48

actually doing next I want you to

play01:50

understand what correlation means it's

play01:51

basically a relationship between the two

play01:52

variables but I do want you to note that

play01:55

correlation does not equal causation so

play01:57

for example correlation could be does

play01:59

marketing budget actually lead to higher

play02:01

sales you can actually figure out this

play02:02

correlation and report it to your

play02:04

manager and if it does have a high

play02:05

correlation or relationship you can

play02:07

actually increase your marketing budget

play02:08

and hopefully increase sales now the

play02:10

next thing you need to know is

play02:11

probability and maybe a bit of

play02:13

hypothesis testing this is basically

play02:15

when you make some inference or

play02:17

hypothesis on the data set and you

play02:18

actually want to conduct some sort of

play02:20

experiment to test it out so this could

play02:22

be like hey does this new marketing

play02:23

campaign actually have an effect on the

play02:25

bottom line sales for the company so you

play02:27

actually don't need a University degree

play02:28

to learn these Concepts that pretty

play02:30

straightforward and simple I recommend

play02:31

just going on Con Academy or taking a

play02:33

free course and auditing it on corsera

play02:35

if you do want to you don't need to

play02:37

spend thousands of dollars if you don't

play02:38

want to there are tons of self-paced

play02:40

courses online and there are tons of

play02:42

videos on YouTube itself for statistics

play02:44

I've also released videos on hypothesis

play02:46

testing and split testing as well now as

play02:48

I said you don't need a PhD in math or

play02:49

statistics or computer science to

play02:51

succeed in this field you just need to

play02:52

know the basics in order to interpret

play02:54

and present data in a nice way to your

play02:56

customers and your fellow stakeholders

play02:57

all right now that you have a

play02:58

fundamental grasp of what you need to

play03:00

learn statistics the next thing I want

play03:01

you to learn is Excel and you might have

play03:03

some background in Excel but Excel is

play03:05

one of those things or Google Sheets is

play03:06

one of those things that will be

play03:08

ubiquitous in almost all of Industry no

play03:10

matter if you're data analyst or your

play03:11

investment banker or your sales or

play03:13

marketing or even customer service you

play03:15

probably have used Excel to some degree

play03:17

now do you remember all the concepts we

play03:18

covered in statistics earlier you can

play03:20

actually use them in Excel you can

play03:21

figure out the sum the max the Min all

play03:23

these descriptive statistics you can

play03:25

figure out V lookups index match and so

play03:28

many other functions that you can

play03:29

actually use on a day-to-day basis I

play03:31

would say the most important Concepts to

play03:32

cover in Excel or Google sheet are one

play03:34

pivot tables it helps you actually

play03:35

summarize the data set and visualize it

play03:37

very easily normally when I'm making

play03:39

visualizations in Excel and Google

play03:40

Sheets I typically make a pivot table of

play03:42

the data to begin with and then I

play03:44

visualize that pivot table for my end

play03:46

user most of the time data sets are very

play03:48

large they could be thousands if not

play03:49

tens of thousands of rows and pivot

play03:50

tables just summarize the data so easily

play03:53

for your customer so they don't have to

play03:54

just look through tens of thousands of

play03:55

rows of data they can just see a nice

play03:57

summarized aggregated field of data in a

play03:59

pivot table the next thing I think you

play04:00

should learn in Excel and Google Sheets

play04:02

is this idea of conditional formatting

play04:04

conditional formatting is basically in

play04:06

the name itself you set a condition and

play04:08

then it formats the text so the way I

play04:10

use it is to highlight anomalies or

play04:11

outliers in the data set so let's say

play04:13

the sales is actually going down for the

play04:14

quarter over quarter you'd actually

play04:16

maybe do red conditional formatting to

play04:18

indicate bad green to be good and yellow

play04:20

if it just doesn't make much of a

play04:22

difference so I typically like to do

play04:24

this in summary this feature just helps

play04:25

you see patterns in the data and

play04:27

summarize it easier so once you've done

play04:29

the basics with V lookups pivot tables

play04:31

conditional formatting the next step is

play04:33

actually work on visualizations and

play04:35

actually visualizing your data set so

play04:36

once you actually figure out

play04:37

visualization with bar charts pie charts

play04:39

line charts you actually want to build

play04:41

what are we call dashboards dashboard is

play04:43

basically a visual hierarchy to present

play04:44

a paino or problem statement to someone

play04:46

with a group of visuals big numbers and

play04:49

it's just a basically a way for like

play04:51

let's say a sales team wants to see how

play04:52

many sales they're closing per quarter

play04:54

this could have like a conversion table

play04:56

this can also have a big number saying

play04:58

how many sales calls were booked how

play05:00

many sales calls were closed and it's

play05:01

basically the way for a team to get to

play05:03

make better decisions based on

play05:05

visualization and data a welld design

play05:07

dashboard can make the difference even

play05:09

if your visualizations are amazing or

play05:10

storytelling's amazing if your dashboard

play05:12

is not visually appealing or welld

play05:14

designed often times stakeholders will

play05:15

be a bit biased and it won't actually

play05:17

take suggestions from that dashboard so

play05:18

make sure you're spending the extra time

play05:20

to design it well and communicate your

play05:22

findings effectively so where can you

play05:24

actually learn Excel I recommend you can

play05:25

audit a course on course era as well um

play05:27

Excel is one of those things where you

play05:29

can just honestly learn on YouTube I

play05:30

think there is some documentation for

play05:32

Microsoft as well on how to use

play05:33

functions in Excel there's also

play05:34

literally a help feature in Excel that

play05:36

you can use to learn certain features

play05:38

pretty easily one more thing I don't

play05:40

want you to learn Excel just in theory

play05:41

watching these courses I actually prefer

play05:43

if you actually practice it so go on

play05:44

kaggle.com download a data set and just

play05:46

import into Google Sheets or Excel and

play05:48

just start making pivot tables start

play05:50

making visualizations and start using

play05:52

maybe descriptive statistics and just

play05:53

playing around the data set so

play05:55

everything you actually learn in terms

play05:56

of documentation you can actually

play05:58

practice and then that way it it'll be

play05:59

much easier to remember it and apply it

play06:01

in your actual work now that you've

play06:03

mastered statistics in Excel the next

play06:04

step is actually move on to SQL SQL

play06:06

stands for structured query language and

play06:09

this is basically how you retrieve

play06:10

manipulate and analyze data stored in

play06:13

relational databases so think about it

play06:14

like this companies have tons of data

play06:16

sitting in various databases whether it

play06:18

be sales Data customer information

play06:20

product inventory SQL is the tool to

play06:22

actually communicate with this data set

play06:24

and bring useful insight for your

play06:26

customers in my previous experience SQL

play06:28

is one of those things that isn't a nice

play06:30

to have it is a must have it is a

play06:32

non-negotiable to learn SQL and be

play06:34

effective as a data analyst that is the

play06:36

tool that I use by far the most at as my

play06:39

job as a data analyst then when I move

play06:41

to a data scientist it is truly my bread

play06:43

and butter here's the thing though I

play06:44

don't think SQL is that difficult to

play06:46

learn I think you can learn it fairly

play06:47

quickly but you do have to understand

play06:49

the basics first so the anatomy of a SQL

play06:51

query you always want to use a select

play06:53

before everything this basically allows

play06:55

you to choose certain columns that you

play06:57

actually want to Output in your query

play06:59

next every SQL query should have a from

play07:01

statement which basically chooses which

play07:03

database or which table you actually

play07:04

want to pull data from after this you

play07:06

have the option of using aware Clause

play07:08

think of this as like a filter you can

play07:10

basically filter data Down based on

play07:12

certain conditions you ask for it so

play07:13

let's say you want to pull from a sales

play07:15

data set you can only pull data from a

play07:17

certain sales rep or a certain threshold

play07:19

amount or a certain date it's just

play07:21

basically a filter now often times

play07:22

you'll have to pull for multiple data

play07:24

sets and this is where the idea of a

play07:25

join comes in join allows you to bring

play07:27

in multiple data sets and then use them

play07:30

together to actually pull make a query

play07:31

joins are very common and they're

play07:33

different types called inner left join

play07:36

right join and outer join are the most

play07:37

common joins you'll probably be using

play07:39

and lastly you want to know what group

play07:41

by and having functions are these are

play07:43

basically aggregation tools so let's say

play07:45

you want to group by months so your date

play07:47

actually has individual rows for each

play07:50

date but let's say you want to figure

play07:51

out the total sales in a month you would

play07:52

Group by month and all those days would

play07:54

actually go into one row for that month

play07:57

and then you can actually use an

play07:58

aggregation function like a sum average

play08:00

min max then you can have that value for

play08:02

each of the months so you it's basically

play08:04

called aggregation you don't want to

play08:05

filter with wear Clauses on a group buy

play08:08

you want to use a having for group buy

play08:10

filtering now even if you master the

play08:12

basics and you're able to make queries

play08:14

out of nothing efficiency matters a lot

play08:15

in this game a lot of the times you're

play08:17

playing with large large data sets like

play08:19

even hundreds of thousands of rows of

play08:20

data so efficiency matters a lot and you

play08:22

want to create the most efficient SQL

play08:24

query as you possibly can using less

play08:26

compute the better make the most memory

play08:28

optimized SQL query

play08:29

all of this stuff matters and once you

play08:31

learn the basics the next step is to

play08:32

learn efficiency of SQL querying so this

play08:34

can be learning how to index tables or

play08:36

how to use subqueries properly or how to

play08:38

use common table expression these are

play08:39

all tools to you work with to make

play08:42

queries more efficient and during that

play08:43

technical interview you're going to have

play08:44

for data analyst knowing how to query

play08:46

tables efficiently is what separates a

play08:48

data analyst from another data analyst

play08:50

even if they can both output the right

play08:52

output now there are countless amount of

play08:54

free tutorials and learning SQL W3

play08:56

schools Con Academy and so many anym I

play09:00

recommend W3 schools I learned on W3

play09:02

schools I also learn on YouTube

play09:03

tutorials but honestly the best way to

play09:06

learn SQL is just practicing going on

play09:08

lead code going on hacker ring these

play09:10

interview prep tools and just doing

play09:11

problem after problem until you get

play09:13

comfortable I actually have a video up

play09:14

on my channel of how I learn SQL in 10

play09:16

or 15 days and I basically had to go

play09:19

into work on the first day as a data

play09:21

analyst right after school and I didn't

play09:23

know SQL prior to joining that company

play09:25

and my first task was to write SQL code

play09:27

and bring the answers to the my boss's

play09:29

question so I didn't really have the

play09:31

luxury to take in a class I just had to

play09:33

learn on the job and this was the best

play09:35

way for me force me to learn so the one

play09:37

thing I will note SQL probably isn't the

play09:39

easiest tool to practice because often

play09:41

times you want to download data sets

play09:42

from kaggle but SQL it's really

play09:44

important to be able to join tables

play09:45

together and kaggle often times only has

play09:47

one table depending on what data set

play09:49

you're pulling so I recommend even using

play09:50

chat GPD to populate your own databases

play09:53

with multiple tables and then actually

play09:54

practicing how to do joins and SQL

play09:56

querying this data may not be the most

play09:58

accurate it may not make the most sense

play09:59

because it is AI generated but it is a

play10:01

first step now after you learned

play10:03

statistics Google Sheets Excel and SQL

play10:06

the next step is to Learn Python or R

play10:09

and these are what we call scripting

play10:10

languages I personally recommend python

play10:12

it has a really large community it's a

play10:14

lot more versatile if you want to go

play10:15

into software development or data

play10:17

engineering python is just a much better

play10:18

tool to learn in terms of Versatility R

play10:20

is more for like Academia and statistics

play10:23

youd probably use it more as a data

play10:24

scientist if you know you want to stay

play10:26

on that path but you can't really go

play10:27

wrong with either uh for the purpose of

play10:29

of this video we are going to just be

play10:31

talking about python now you don't need

play10:32

to be a software engineer by any means

play10:34

but you should know like the basics and

play10:35

a few packages that are data analytics

play10:37

specific the two of the most important

play10:39

libraries that I think you should learn

play10:40

are pandas numpy and also maybe map plot

play10:44

lib for visualization and pyit learn if

play10:46

you want to go into more machine

play10:47

learning so pandas is actually a data

play10:49

manipulation library that allows you to

play10:50

clean organize data quickly and

play10:52

efficiently so you can basically think

play10:53

of it as Excel pretty much on steroids

play10:56

or a SQL typee package you can use

play10:58

pandas to group data filter data and

play11:01

join tables together and so much more

play11:03

and also read csvs read Excel files and

play11:05

import data into a data frame numpy is a

play11:08

package you'd use for more numerical

play11:09

operations I like to think of this as

play11:11

like an advanced calculator you could

play11:12

use in Python that's probably a good way

play11:14

of looking at it now once you've

play11:15

actually imported the data with pandas

play11:17

and you've done your numerical

play11:18

calculations with numpy I believe the

play11:20

next step is probably visualization I

play11:21

would use a package like Matt plot lib

play11:23

or caborn to help visualize the data

play11:25

with whatever type of visualizations you

play11:26

want it's a lot more customizable than

play11:28

tools like tableau or Google Sheets in

play11:30

Python you can actually customize so

play11:32

much more now this is a bonus once

play11:34

you've learned these packages I think

play11:35

you're good enough for data analysis but

play11:37

python is also amazing at automation so

play11:39

let's say you're doing a bit of ETL

play11:41

which is extract transform load this is

play11:43

more of a data engineering task python

play11:44

can help you do this and also run

play11:46

reports on saying hey has the data

play11:48

loaded properly so once you learn these

play11:50

basic packages I would also highly

play11:52

recommend All of You To Learn Python for

play11:54

automation as well now in terms of

play11:56

actually learning data Camp's a great

play11:57

resource for this of course is great but

play12:00

as I said before the best way and the

play12:01

best way that I've learned python is

play12:03

personally on the job itself once I got

play12:05

the job I would volunteer for python

play12:07

projects the job itself didn't require

play12:09

python to get but once I got it I used

play12:11

to just volunteer to use Python cuz I

play12:13

knew that was the best way to learn at

play12:14

the time now the last thing I want you

play12:16

to learn is some sort of bi tool or data

play12:18

visualization tool this could be Tableau

play12:20

looker powerbi or honestly any other

play12:23

tool I've used probably four or five

play12:25

tools throughout my career and just

play12:27

depends on the company you're at each

play12:28

company it own preferences like I've

play12:30

used R shiny which is basically using R

play12:32

for more customized visualizations I've

play12:34

used Spotfire I've used Tableau I've

play12:37

used looker I've used data Studio I've

play12:40

used so many tools and One Thing Remains

play12:42

the Same is the science of visualization

play12:44

will stay the same no matter what tool

play12:46

you have to use but being able to stay

play12:48

versatile and be nimble and learn a new

play12:50

tool quickly is very very important now

play12:52

for the purposes of this video I

play12:54

recommend that you actually learn

play12:55

Tableau because Tableau public is free

play12:57

for you to learn on and you can learn

play12:58

the science visualization you can learn

play13:00

the basics of visualization and then you

play13:01

can apply for different sorts of uh VI

play13:04

tools later down in your career I think

play13:05

the best part about these visualization

play13:07

tools is they're highly interactive they

play13:08

can actually connect straight to your

play13:09

database whether you're using SQL Presto

play13:12

Hive any of these like SQL engines you

play13:14

can actually make it as interactive as

play13:15

you want you can include filters buttons

play13:17

and you can do you can drag your mouse

play13:19

over it and you can see some results so

play13:21

it is very very interactive compared to

play13:23

other visualization tools now I think

play13:25

the best book on this is called

play13:27

storytelling with data figuring out how

play13:29

how to actually visualize data properly

play13:30

I would learn how to create charts how

play13:32

to create dashboards and how to actually

play13:35

like present these cuz not only is

play13:37

creating the dashboard's important

play13:38

presentation is just as important when

play13:40

it comes to your customers but here's

play13:41

the most important thing to keep in mind

play13:43

visualization is not just about making

play13:45

simple and pretty charts it's about

play13:47

making data easy to understand and

play13:48

telling a story so whenever you're

play13:50

making a dashboard you're making a

play13:51

visualization you need to ask yourself

play13:53

what is the key takeaway of this

play13:54

visualization what value does this add

play13:57

to the my audience the best dashboards

play13:59

in my opinion simplify complexity as

play14:01

much as possible and they provide clear

play14:03

and actionable insights to your audience

play14:05

in terms of learning I recommend just

play14:07

going on Udi or LinkedIn learning I

play14:09

think course there has a few courses on

play14:10

this I would download Tableau public and

play14:12

just start playing around with it before

play14:13

you even get to Tableau public maybe

play14:15

just start playing around with Google

play14:16

Sheets that is free and start making

play14:18

many dashboards there then slowly you

play14:20

can progress check a bi tool like

play14:21

Tableau or powerbi powerbi also has like

play14:23

a free trial you can also probably use

play14:25

if you want to get into it now I've

play14:26

actually had videos on how to use Tablo

play14:28

public with full tutorials I think we

play14:29

used Airbnb data in New York city so you

play14:32

can download kagle data an Excel or CSV

play14:34

file upload it to Tableau and start

play14:36

making dashboards immediately and follow

play14:38

a walkr on YouTube like the one I have

play14:40

once you have it you can actually just

play14:41

put this on your resume your GitHub and

play14:43

start branding yourself employers love

play14:44

to see what you do it's much better for

play14:46

an employer to see what you can do

play14:48

rather than you tell them what you can

play14:49

do they can actually see like your

play14:51

abilities your technical abilities they

play14:53

know like it's very hard to communicate

play14:55

how good you are so often times just

play14:56

practicing and and showing it off to

play14:58

employers is the way to go now let's be

play15:00

very clear this will not happen

play15:02

overnight this will take anywhere

play15:03

between 3 and 12 months depending on how

play15:05

much time you have to dedicate and your

play15:06

existing Background by the time 2025

play15:08

rolls around you should have enough

play15:10

experience if you followed this to a te

play15:11

you took notes and you actually

play15:13

practiced this every single day

play15:14

consistently my last tip for you is to

play15:16

build some sort of portfolio and put all

play15:18

of these projects on that portfolio and

play15:20

build a brand around yourself in data

play15:22

analytics and data science employers

play15:24

love to see practical experience and

play15:26

will help you so much in the job market

play15:27

if another candidate doesn't have this

play15:29

portfolio that you do so if you Meed

play15:31

this r on the video if you got any value

play15:32

of this video please leave a like

play15:33

subscribe and I'll see you next one

play15:36

[Music]

Rate This
★
★
★
★
★

5.0 / 5 (0 votes)

Étiquettes Connexes
Data AnalyticsCareer RoadmapSkill DevelopmentJob MarketStatistics BasicsExcel SkillsSQL MasteryPython for DataData VisualizationPortfolio Building
Besoin d'un résumé en anglais ?