How to learn Machine Learning (ML/AI Roadmap 2024)

Kylie Ying
25 May 202426:01

Summary

TLDRIn this machine learning course introduction, Kylie Ying outlines a roadmap for effectively learning the field. She emphasizes the importance of foundational math, including probability, statistics, calculus, and linear algebra, as well as programming skills, particularly in Python. The course covers core machine learning concepts, types of learning, andๆ•ฐๆฎๅค„็†. It also touches on various models like neural networks and CNNs, and concludes with the necessity of practice, research, and community engagement for expertise.

Takeaways

  • ๐Ÿ“š Start with a strong foundation in basic math, including probability and statistics, calculus, and linear algebra, to understand the theory behind machine learning.
  • ๐Ÿง  Probability and statistics are essential for making predictions and understanding the most probable outcomes in machine learning.
  • ๐Ÿ” Learn about concepts like conditional probability, Bayes' Rule, and statistical distributions to model the unpredictable world effectively.
  • ๐Ÿ“ˆ Calculus, particularly optimization problems and gradient descent, is vital for training models in machine learning.
  • ๐Ÿ“‰ Derivatives play a crucial role in adjusting parameters to achieve desired outcomes in machine learning models.
  • ๐Ÿ”ก Linear algebra is key for handling large-scale data computations and operations in machine learning.
  • ๐Ÿค– Programming skills, especially in Python, are necessary for implementing machine learning models.
  • ๐Ÿ Python is favored for machine learning due to its popularity, extensive documentation, and rich library ecosystem.
  • ๐Ÿ”ง Understand programming concepts such as variables, functions, classes, and how to utilize libraries like pandas, numpy, and machine learning frameworks.
  • ๐ŸŒŸ Explore core machine learning concepts including types of machine learning (supervised, unsupervised, reinforcement), tasks (classification, regression), and models (KNN, logistic regression, SVM, etc.).
  • ๐Ÿ”ง Data is crucial in machine learning; understand data types, the importance of training, validation, and testing datasets, and data manipulation techniques like cleaning, scaling, and feature engineering.
  • ๐Ÿ‘จโ€๐Ÿซ Practice and research are key to deepening expertise in machine learning; engage with projects, datasets, and communities like Kaggle, and read research papers to stay updated.

Q & A

  • What is the primary focus of Kylie Ying's machine learning course?

    -The primary focus of Kylie Ying's machine learning course is to teach students how to learn machine learning effectively by providing a roadmap that covers concepts from the fundamentals to becoming an expert in the field.

  • Why is foundational math important in machine learning?

    -Foundational math is important in machine learning because it provides the necessary understanding of probability, statistics, calculus, and linear algebra, which are essential for modeling and predicting outcomes in the unpredictable world.

  • What are the two main areas of mathematics that Kylie Ying emphasizes as foundational for machine learning?

    -The two main areas of mathematics emphasized are probability and statistics, and calculus. These areas are crucial for understanding predictions, optimization problems, and the behavior of machine learning models.

  • How does the Brilliant platform help in learning foundational math for machine learning?

    -The Brilliant platform helps by offering thousands of interactive lessons that allow learners to gain an intuition in different areas of math, data analysis, programming, and AI, focusing on hands-on problem-solving and critical thinking skills.

  • What programming language does Kylie Ying recommend for beginners in machine learning?

    -Kylie Ying recommends Python for beginners in machine learning due to its popularity, extensive documentation, supportive resources, and great libraries for processing data and working with models.

  • Why is Python considered a good starting point for learning machine learning?

    -Python is considered a good starting point because it is widely used in the machine learning community, has a vast ecosystem of libraries, and is beginner-friendly, making it easier to prototype and experiment with machine learning models.

  • What are the core concepts Kylie Ying mentions in the machine learning roadmap?

    -The core concepts mentioned include understanding what machine learning is, types of machine learning, tasks such as classification and regression, the importance of data, data manipulation techniques, various machine learning models, neural networks, and training and evaluating models.

  • What is the significance of understanding data types and data manipulation in machine learning?

    -Understanding data types and manipulation is significant because it impacts the quality and relevance of the data fed into machine learning models. Good data practices ensure that the models are trained effectively and produce accurate predictions.

  • What are some of the machine learning models Kylie Ying lists in the script?

    -Some of the machine learning models listed include K-nearest neighbors, logistic regression, support vector machines (SVM), linear regression, neural networks, convolutional neural networks (CNN), recurrent neural networks (RNN), GRUs, LSTMs, and principal component analysis (PCA).

  • How does Kylie Ying suggest one becomes an expert in machine learning?

    -Kylie Ying suggests that becoming an expert in machine learning involves a combination of practice, research, and learning from experts. This includes working on projects, using resources like the UCI machine learning repository and Kaggle, reading papers, and implementing the findings to gain deep expertise in a chosen area.

Outlines

00:00

๐Ÿ“š Introduction to Machine Learning Course

Kylie Ying introduces her machine learning course, focusing on teaching effective learning strategies for machine learning. She outlines a roadmap covering fundamentals to expert-level knowledge. The first step is establishing a strong foundation in basic math, emphasizing probability, statistics, and their relevance to prediction and modeling in machine learning. Kylie recommends the Brilliant platform for interactive math lessons to build intuition and critical thinking, offering a discount for the first 30 days through her link.

05:01

๐Ÿ”ข The Importance of Foundational Math in Machine Learning

The paragraph delves into the necessity of a solid math foundation for machine learning, particularly calculus and linear algebra. Calculus, with its focus on optimization and gradient descent, is crucial for model tuning. Linear algebra accelerates model computation through vector and matrix operations. Kylie explains the importance of understanding these mathematical concepts to grasp the theory behind machine learning algorithms, even though practical implementation relies on existing libraries.

10:05

๐Ÿ’ป Developing Programming Skills for Machine Learning

Kylie highlights the importance of programming skills, especially Python, for coding machine learning models. She covers basic programming concepts such as variables, functions, and classes, and the necessity of understanding object-oriented programming due to its prevalence in Python libraries. Kylie mentions libraries like pandas, numpy, and matplotlib for data manipulation and visualization, as well as scikit-learn, TensorFlow, and PyTorch for machine learning tasks.

15:06

๐ŸŒŸ Core Concepts of Machine Learning

This section introduces the core concepts of machine learning, starting with understanding the types of machine learning: supervised, unsupervised, and reinforcement learning. It also touches on tasks such as classification and regression. The importance of data quality is emphasized, with a้ข„ๅ‘Š of topics like data types, data manipulation, and feature engineering to be covered in future videos.

20:08

๐Ÿ–ผ๏ธ Exploring Neural Networks and Model Types

Kylie discusses various machine learning models, starting with simple ones like K-nearest neighbors and logistic regression, to more complex models like neural networks. She provides a brief overview of different neural network architectures, including convolutional neural networks (CNNs) for image data and recurrent neural networks (RNNs) for sequential data. She also mentions GRUs and LSTMs, which are advanced types of RNNs.

25:08

๐Ÿ“ˆ Training and Evaluating Machine Learning Models

The paragraph focuses on the iterative process of training and evaluating machine learning models. It discusses the importance of using metrics to evaluate model performance and the challenge of overfitting. Kylie stresses the need to understand why overfitting occurs and how to mitigate it to ensure models perform well in real-world scenarios.

๐Ÿš€ Advancing in Machine Learning Through Practice and Research

The final section of the roadmap emphasizes the importance of practice and research for advancing in machine learning. Kylie suggests starting with projects found online, using resources like the UCI Machine Learning Repository and Kaggle for datasets and community interaction. She also recommends reading research papers and attempting to reproduce results to deepen understanding and expertise in the field.

Mindmap

Keywords

๐Ÿ’กMachine Learning

Machine learning is a subset of artificial intelligence that allows computers to learn from data and make decisions or predictions without being explicitly programmed. In the video, it is the central theme, with the course aiming to teach viewers how to learn machine learning effectively. The script discusses various aspects of machine learning, such as its foundational math, programming skills, core concepts, and models.

๐Ÿ’กFoundational Math

Foundational math refers to the basic mathematical concepts necessary to understand and apply machine learning algorithms. The video emphasizes the importance of probability and statistics, calculus, and linear algebra as the mathematical bedrock for machine learning. For instance, probability is crucial for making predictions, while calculus, particularly optimization problems and gradient descent, helps in training models.

๐Ÿ’กProgramming Skills

Programming skills are essential for implementing machine learning models. The video highlights Python as a key language for machine learning due to its popularity, extensive documentation, and rich set of libraries. Basic programming concepts such as variables, functions, classes, and the use of libraries like pandas, numpy, and machine learning frameworks like scikit-learn, TensorFlow, and PyTorch are mentioned as necessary skills.

๐Ÿ’กProbability and Statistics

Probability and statistics are mathematical fields that deal with the understanding of data and the likelihood of events. In the context of the video, they are foundational to machine learning as they provide the framework for predicting outcomes and understanding the behavior of data, which is exemplified by discussing concepts like conditional probability and statistical distributions.

๐Ÿ’กCalculus

Calculus is a branch of mathematics that deals with rates of change and accumulation. The video script mentions calculus as important for machine learning because it helps in solving optimization problems, particularly through the use of derivatives and gradient descent, which are fundamental to training neural networks.

๐Ÿ’กLinear Algebra

Linear algebra involves the study of vectors, matrices, and linear transformations. The video explains that linear algebra is critical for machine learning as it allows for efficient computation, especially when dealing with large datasets. It enables the manipulation and transformation of data in a way that is fundamental to the operation of machine learning models.

๐Ÿ’กNeural Networks

Neural networks, also known as artificial neural networks, are a set of algorithms designed to recognize patterns. They are inspired by the human brain and are a core topic in the video. The script discusses different types of neural networks, such as convolutional neural networks (CNNs) for visual data and recurrent neural networks (RNNs) for sequential data, and mentions concepts like perceptrons and backpropagation.

๐Ÿ’กData

Data is the raw material that machine learning models learn from. The video script underlines the importance of having good quality data, understanding data types, and the process of data manipulation, including cleaning, feature scaling, and feature engineering. The phrase 'junk in equals junk out' is used to emphasize the critical role of data quality in the success of machine learning models.

๐Ÿ’กModel Training and Evaluation

Model training is the process of teaching a machine learning model to make predictions or decisions based on data, while evaluation is the process of assessing the model's performance. The video script discusses the iterative nature of training and evaluating models, the importance of using metrics to evaluate them, and the challenge of overfitting, which occurs when a model performs well on training data but poorly on new, unseen data.

๐Ÿ’กPractice and Research

Practice and research are the final steps in the machine learning roadmap presented in the video. They involve applying the knowledge gained through the course to real-world projects and staying up-to-date with the latest research in the field. The script encourages viewers to engage with online resources, such as the UCI Machine Learning Repository and Kaggle, and to read and implement research papers to deepen their expertise.

Highlights

Introduction to a machine learning course aimed at teaching effective learning strategies in the field.

Emphasis on the importance of a strong foundation in basic math for developing machine learning skills.

Explanation of the relevance of probability and statistics in predicting outcomes in machine learning.

Introduction of Brilliant.org as a platform for interactive learning in math, data analysis, and AI.

The necessity of understanding calculus for dealing with optimization problems in machine learning.

Importance of derivatives in performing gradient descent to train models.

Fundamental role of linear algebra in machine learning for efficient data manipulation.

Overview of programming skills, particularly Python, essential for implementing machine learning models.

Discussion on the significance of understanding object-oriented programming concepts in Python.

Introduction of key Python libraries such as pandas, numpy, and matplotlib for data processing and visualization.

Core concepts of machine learning including types of learning and tasks like classification and regression.

The concept of 'junk in equals junk out' highlighting the importance of quality data in machine learning.

Explanation of data manipulation techniques including data cleaning, feature scaling, and feature engineering.

Overview of prevalent machine learning models such as K-nearest neighbors, logistic regression, and neural networks.

In-depth look at neural networks, including convolutional neural networks (CNNs) for visual data.

Introduction to recurrent neural networks (RNNs) for handling sequential data and advanced models like GRUs and LSTMs.

Discussion on training and evaluating machine learning models, including avoiding overfitting.

Recommendation for practice and research through projects, online resources, and reading academic papers.

Encouragement to engage with the machine learning community on platforms like Kaggle for exposure and expertise.

Conclusion summarizing the machine learning roadmap from foundational math and programming to core concepts and practice.

Transcripts

play00:00

hi everyone I'm Kylie ying and welcome

play00:02

to my machine learning course many of

play00:05

you out there are wondering well how do

play00:06

I learn machine learning effectively and

play00:09

so that is exactly what my course

play00:12

strives to teach in today's video I'm

play00:14

going to go over a machine learning road

play00:16

map to help you learn machine learning

play00:19

effectively and these will cover

play00:21

Concepts from like the fundamentals all

play00:25

the way through how do I become an

play00:27

expert in this field so with that let's

play00:30

get

play00:33

started so the first thing for our

play00:35

machine learning road map is that we

play00:37

want to really lay down a good

play00:40

foundation so the first thing that we're

play00:42

going to go over are our

play00:45

foundations now there's two areas that

play00:48

we really need a strong foundation in

play00:50

order to develop our machine learning

play00:52

skills and the first area is actually

play00:55

going to be basic math so I'm going to

play00:58

call this foundational math

play01:03

under foundational math we have a few

play01:05

different areas of mathematics the first

play01:07

one is actually probability and

play01:14

statistics now why probability and

play01:17

statistics how are these even relevant

play01:19

to machine

play01:21

learning machine learning is all about

play01:24

trying to figure out a prediction it's

play01:27

about trying to come up with the most

play01:30

probable outcome and in order to do that

play01:34

well we kind of have to have a really

play01:36

good foundation in probability we have

play01:38

to understand things such as conditional

play01:41

probability you know if these things are

play01:43

true then how does that impact this

play01:45

other thing um we need a good

play01:48

understanding of something called base

play01:49

Rule and we need a good understanding of

play01:52

statistical distribution such as the

play01:54

normal distribution or um the binomial

play01:57

distribution and these things will help

play02:00

you gain an intuition for how do we

play02:02

model you know the unpredictable world

play02:05

around us and ultimately that's going to

play02:08

translate into how does machine learning

play02:11

help us predict this unpredictable world

play02:13

around us even though I'm planning to

play02:15

teach a little bit about probity and

play02:17

statistics as well as the other machine

play02:19

learning prerequisites I highly

play02:21

recommend checking out today's sponsor

play02:24

brilliant brilliant is a platform where

play02:26

you can learn by doing they have

play02:28

thousands of inter active lessons so

play02:31

that you can actually gain an intuition

play02:33

in different areas of math data analysis

play02:36

programming and AI brilliant platform is

play02:40

incredibly effective because their

play02:42

lessons are filled with Hands-On problem

play02:44

solving their lesson plans actually help

play02:47

you build critical thinking skills not

play02:50

just memorizing brilliant is designed so

play02:52

that you can learn a little bit every

play02:53

day and that is probably one of the most

play02:56

important things that you can do for

play02:57

yourself brilliant recently launched a

play03:00

ton of new content in data including

play03:02

probability and statistics it's perfect

play03:05

for Learners of any level to start or

play03:08

continue learning how to work with data

play03:11

and interpret that data if you actually

play03:13

are able to visualize a lot of these

play03:16

Concepts that we just talked about in

play03:18

probability and statistics so to get

play03:21

started on learning these machine

play03:22

learning Basics and to try everything

play03:25

that brilliant has to offer for a full

play03:27

30 days visit brilliant.org / Kylie Ying

play03:31

or click on the link in the description

play03:34

you'll also get 20% off an annual

play03:36

premium subscription so probability and

play03:39

statistics great that's one area but

play03:41

that's not it so the second area that

play03:44

you'll really need to gain a foundation

play03:45

in is

play03:50

calculus and the reason why we need

play03:53

calculus is because um a lot of machine

play03:56

learning is also about okay given our

play03:59

data how do

play04:00

get the best model and whenever you're

play04:03

looking for the best something that

play04:05

becomes an optimization problem so you

play04:08

know a lot of calculus actually helps us

play04:11

deal with these optimization

play04:14

problems such as one area of calculus is

play04:18

called gradient

play04:19

descent and that will help us eventually

play04:23

tune neural Nets and be able to train

play04:26

our models so that's gradient

play04:30

descent and the foundation of gradient

play04:32

descent is actually derivatives and that

play04:34

is the most calculus thing ever so under

play04:38

here we have

play04:42

derivatives um and how do derivatives

play04:45

help us well when we actually go and

play04:47

perform gradient descent what we're

play04:48

doing is we have some number and we want

play04:51

to adjust this number a little bit like

play04:53

we have some parameter that affects our

play04:55

outcome and we want to adjust this

play04:57

parameter so that we we get closer to

play05:01

our desired outcome and how do we adjust

play05:04

it which way do we step a little bit to

play05:06

the left a little bit to the right we

play05:08

can actually use derivatives to help us

play05:10

figure out which way we want to go so

play05:13

derivatives really powerful tool all

play05:16

over you know neural networks machine

play05:18

learning um in order to understand the

play05:21

mathematics behind like how these things

play05:23

work you will need to understand

play05:26

derivatives and now finally

play05:31

the last piece of math that's really

play05:34

critical to machine learning is linear

play05:40

algebra a lot of machine learning is

play05:44

just basic algebraic computation so for

play05:48

example um a lot of the models that you

play05:50

will deal with are usually just adding

play05:53

numbers multiplying numbers and doing

play05:57

some simple operation on on those

play06:00

well you know if we have a huge model

play06:03

and we have a ton of data which that's a

play06:06

foundation of machine learning it's a

play06:08

ton of data and we want to use that data

play06:11

to try to make some model better and

play06:13

better if we were to just go through

play06:16

each like each operation like addition

play06:19

multiplication if we were to go through

play06:21

these one by one that would take forever

play06:23

so we actually want to parallelize these

play06:26

operations and basically what that means

play06:28

is instead of calculating these one at a

play06:31

time we're just going to calculate a

play06:33

bunch at the same time and maybe at the

play06:36

very end we can sum all of these things

play06:38

together rather than having to go

play06:40

through each one and then at the end sum

play06:42

all of those together so that just saves

play06:44

us a lot of time and the way that we do

play06:47

that is through linear algebra so linear

play06:50

algebra helps us basically solve for

play06:54

things a lot faster and so in linear

play06:58

algebra the things that you will have to

play06:59

understand our vectors and

play07:03

matrices and how you know how to

play07:05

multiply these things together um how to

play07:09

take the inverse how to solve basic uh

play07:14

systems of equations stuff like that

play07:18

also relevant from linear algebra will

play07:21

be igen vectors and Ian

play07:27

values and now the these will eventually

play07:30

help us take a lot of data and figure

play07:34

out what are the most important

play07:36

components of this data so what I mean

play07:38

by that is let's say I have a million

play07:41

different factors that go into some

play07:43

model I have a million different things

play07:44

that I want to consider when I build my

play07:47

model well can I pick out maybe certain

play07:50

things that are the most important as

play07:52

far as math goes having probability and

play07:55

statistics calculus and linear algebra

play07:58

that should suff suffice in terms of

play08:01

having a good like foundation for

play08:03

machine learning foundational math will

play08:06

provide you with the basis in order to

play08:08

learn the mathematics behind a lot of

play08:10

the concepts in machine learning so you

play08:14

know you're never actually going to do

play08:17

the math by hand when you're trying to

play08:20

like build something using machine

play08:23

learning because that's just not

play08:24

feasible there's already code out there

play08:28

they're called libraries like somebody

play08:29

else has already optimized these things

play08:31

and you're just calling their functions

play08:34

but you should have a good foundation of

play08:36

like okay how does this work behind the

play08:38

scenes so that you know what you're

play08:40

doing now in order to actually use those

play08:43

libraries and to put machine learning

play08:45

into practice well that's where the

play08:47

second foundational area of machine

play08:49

learning comes in so that second area is

play08:52

going to be programming skills um you

play08:55

need programming skills basically in

play08:57

order to code pretty much anything

play09:00

including your machine learning models

play09:03

so that's why it's important hopefully

play09:05

that's pretty clear that if you want to

play09:07

code a model you will need programming

play09:10

in order to do it I would say the

play09:11

language that's probably the most

play09:13

important is

play09:15

python I would say for beginners that

play09:17

python is a really great starting point

play09:20

um to improve your programming skills

play09:22

and the reason why I say that especially

play09:24

in the context of machine learning is

play09:26

that python is popular you're going to

play09:28

find a lot of documentation online for

play09:31

it there's a lot of resources out there

play09:33

to support you in your journey To Learn

play09:36

Python also python has really great

play09:38

libraries so they have good packages of

play09:43

code that will help you process data or

play09:46

um or work with models and python is

play09:48

really great for non-production code so

play09:51

when you're trying to just prototype

play09:53

something when you're playing around

play09:54

with data Python's really great because

play09:57

it's pretty Bare Bones and it's really

play09:59

um

play10:01

understandable so yeah Learn Python and

play10:04

if you don't know python I have a bunch

play10:06

of courses on that as well so under

play10:08

programming skills the things that are

play10:10

important are basic concepts and these

play10:14

concepts are things such as

play10:17

[Applause]

play10:18

variables um you got to know how

play10:20

variables work in order to create a

play10:22

program uh same with

play10:27

functions and classes becomes a little

play10:30

bit more advanced stuff but I mean

play10:33

python is an objectoriented programming

play10:35

language which means that it's based on

play10:38

objects and these objects are built

play10:41

using

play10:42

classes now uh while you yourself

play10:45

probably won't have to deal with classes

play10:47

a lot of these libraries use classes so

play10:49

you have to at least be able to

play10:52

understand how those

play10:53

work and then finally

play10:58

libraries and and um basically how to

play11:01

use these

play11:02

libraries now the libraries in Python

play11:05

that are relevant for machine learning

play11:08

there's a few of them there's a library

play11:09

called pandas and this one will let you

play11:13

basically import some data and be able

play11:16

to look at that in a table and do

play11:18

vectorized operations so instead of

play11:22

multiplying um a single item by a number

play11:25

for every single Row in your in your

play11:27

table you can just multiply that entire

play11:30

column by some number um and also

play11:35

numpy and numai is really built for uh

play11:39

really large multi-dimensional arrays

play11:41

and matrices and being able to do um

play11:45

math operations on these data types

play11:49

after that um you have a few libraries

play11:52

that actually like give you machine

play11:55

learning models that are readily

play11:57

available to use and those include s kit

play12:03

learn tensor

play12:07

flow and P

play12:11

torch and then finally there's a really

play12:13

great plotting library in Python um

play12:17

called Matt plot

play12:21

lib and especially under this there's

play12:24

something called P plot that you can use

play12:27

to plot in Python these are really the

play12:30

foundations that you should have to

play12:32

learn machine learning the first is

play12:34

foundational math so you can understand

play12:35

the theory of machine learning and then

play12:38

the second is programming skills so you

play12:39

can actually implement the models

play12:41

yourself now that we have a solid

play12:43

foundation for Math and programming we

play12:46

can move on to actually learning The

play12:49

Core Concepts of machine learning so the

play12:51

next category on this road map are those

play12:53

Core Concepts the first area that's

play12:56

really important is what is machine

play12:58

learning

play13:00

so it's understanding what is available

play13:02

to you out there and just like what are

play13:05

the different techniques that you can

play13:07

use what are the different broad

play13:09

categories of things that you're trying

play13:11

to do with your

play13:13

models so that includes the types of

play13:17

machine learning um and the three types

play13:20

are

play13:22

supervised

play13:26

unsupervised and reinforcement

play13:29

[Music]

play13:33

oops and then on top of the types of

play13:37

machine learning there's also different

play13:39

tasks so what are you trying to achieve

play13:42

in each of these types and um I won't

play13:46

talk about reinforcement learning yet

play13:48

but in supervised and sometimes

play13:51

unsupervised these tasks boil down to

play13:56

something called classification

play14:01

and you also have something called

play14:05

regression again I won't go over these

play14:08

but these are important Concepts to know

play14:10

and I'll go over them in a separate

play14:12

video the next core concept is data now

play14:15

junk in equals junk out which is why

play14:18

it's really critical to know what is

play14:20

going into your model and to make sure

play14:22

that you have a good data set going in

play14:25

it's important to gain an intuition for

play14:28

understand understanding what is good

play14:30

data and where is my data coming from

play14:33

and to ask yourself these things before

play14:35

just assuming that the data that you

play14:37

have in front of you is good there have

play14:40

been you know multi-billion dollar

play14:42

companies established just to solve the

play14:44

problem of how do I get good data for

play14:47

machine

play14:48

learning so first under data you have to

play14:52

understand the types of data

play14:59

and this might include something known

play15:01

as

play15:04

qualitative

play15:05

data as well as

play15:09

quantitative data so those are different

play15:13

and you have to handle each of those

play15:15

differently again I'll go over this in a

play15:18

different

play15:19

video next it's understanding this

play15:22

concept of

play15:24

training validation

play15:29

and

play15:30

testing

play15:32

data why do you have three different

play15:35

types of data sets why is one not enough

play15:38

finally the last thing that's really

play15:40

important to understanding this concept

play15:42

of data is manipulating your

play15:45

data so some form of manipulation of

play15:48

your data

play15:50

set and that might include data cleaning

play15:54

so making sure that your data set is

play15:56

good or getting rid of certain things

play15:58

that are not not good about your data

play16:00

set there's something known as feature

play16:03

scaling so maybe you want to you know

play16:07

the scale of um something in your data

play16:10

set is not quite where it wants to be

play16:13

like if I have a whole data set where

play16:15

many things are judged on a scale of 0

play16:17

to 100 and then something else is judged

play16:19

on a scale of 0 to 5 well I might want

play16:22

to multiply everything in that's 0 to

play16:24

five by 20 so that this is also on a

play16:27

scale of 0 to 100

play16:30

and then finally there's something

play16:31

called feature

play16:34

engineering where that means that in

play16:37

your entire data set you're choosing

play16:39

what pieces of data do I actually use or

play16:44

can I use the data that I have to build

play16:47

you know some

play16:49

intermediary piece of data some new

play16:51

piece of data that I can actually feed

play16:53

into my bottle so that my model performs

play16:56

better um so for example if I have stock

play16:59

prices of something like a feature that

play17:02

I can engineer is instead of the price

play17:06

having the return from day-to-day so

play17:08

that's the difference between today and

play17:11

yesterday and so

play17:13

on so that kind of summarizes uh this

play17:17

concept of data that's really critical

play17:20

to machine learning next after data we

play17:23

can start looking at models because you

play17:25

know we have good stuff coming in and

play17:28

now the next part is okay well what does

play17:29

it go into so those are the

play17:33

models in this road map I'm just going

play17:36

to list a bunch of models that are

play17:38

prevalent in machine learning and these

play17:40

are things that would be good to learn

play17:42

in order to have a good foundation in

play17:44

machine learning so there's things such

play17:47

as K

play17:49

nearest

play17:51

neighbors that's an algorithm um there's

play17:55

another model called logistic regression

play17:58

or other otherwise known as log

play18:02

regression and again I'm not going to go

play18:04

over each of these here but in a later

play18:07

video I will so stay tuned subscribe you

play18:11

know and again don't worry about

play18:14

understanding these here and now these

play18:16

will come that's why this is just a road

play18:19

map um there's something called

play18:22

svm or

play18:24

support Vector machine

play18:29

that's another

play18:30

model there's something called linear

play18:34

regression which you might be wondering

play18:36

hey I already know what that is but I

play18:38

didn't think that was machine learning

play18:40

well surprise some people consider it a

play18:41

machine learning

play18:46

model and then there's a big one that

play18:49

everybody talks about neural

play18:52

networks otherwise known as neural Nets

play18:54

or artificial neural networks artificial

play18:56

neural nets all of those things things

play18:59

but neural networks and the

play19:03

individual like neuron that makes up

play19:06

this neural network that's something

play19:08

called a

play19:14

perceptron and there's two more

play19:16

algorithms that um I'll add to this list

play19:18

as well that might be K

play19:21

means and finally something called

play19:25

PCA or principle

play19:31

component

play19:34

analysis so this list of models will

play19:37

provide you with a really great

play19:38

foundation and toolkit for basic machine

play19:41

learning and now we can get a little bit

play19:44

more in depth into neural

play19:47

networks because there's actually many

play19:49

different types of neural networks and

play19:52

there's a few that are pretty popular so

play19:54

the first one is known as a

play19:57

CNN and that C stands for

play20:00

convolutional

play20:03

convolutional so convolutional neural

play20:05

networks and what this is doing it's

play20:07

actually taking an

play20:10

image um and trying to run a neural

play20:13

network on that image so you can work

play20:15

with visual

play20:17

data and then there's this concept of an

play20:19

RNN and this R stands for

play20:24

recurrent and RNN are really great for

play20:27

sequential data so they can handle

play20:31

sequences um when your data has some

play20:33

sort of ordering that is relevant so

play20:36

sequential data and you can get a little

play20:39

bit more complex with this RNN stuff um

play20:43

you can go into grus and lstms and then

play20:48

from there it opens a whole new world

play20:50

and eventually you get to things such as

play20:54

uh chat GPT the final part of this

play20:57

machine learning road map app is

play20:59

training and evaluating a

play21:10

model now the reason why I put them

play21:13

together is because the training and the

play21:16

evaluating kind of go hand inand we have

play21:19

to evaluate a model to see how it does

play21:21

and then if it doesn't do so well we

play21:23

want to train it again so that you know

play21:25

maybe we can see if it improves and at

play21:28

the very end we have a final evaluation

play21:31

but this is a this is a recursive

play21:33

process it's we train something we see

play21:36

how it does okay it doesn't do exactly

play21:38

what we want it to do so we retrain it

play21:40

and we try to see okay does it do a

play21:42

little bit better hopefully yes and we

play21:44

keep going like

play21:46

that so um there are things here that

play21:50

are important such as metrics like how

play21:53

do you evaluate a model what are

play21:55

important things to look at and then

play21:58

also

play21:59

things such as

play22:01

overfitting what does that mean it means

play22:04

okay I've trained this model the

play22:06

evaluation looks really good but when I

play22:09

actually go put it out in the real world

play22:11

it doesn't work as well so what is going

play22:13

on there and being able to understand

play22:17

okay what is leading to this overfitting

play22:18

how can I combat that overfitting those

play22:22

are really critical pieces

play22:25

to to just building a good model that

play22:28

kind of wraps up this Core Concepts

play22:31

piece of our road map once you

play22:33

understand these Core Concepts in

play22:35

machine learning from there it's really

play22:38

just a lot of practice and reading

play22:40

research because you already have the

play22:42

basic building blocks um in order to go

play22:46

and learn more and now at this point

play22:49

it's all about okay which direction do

play22:51

you want to go in so I would call this

play22:53

next step as just

play22:55

practice and research

play23:00

so for practice a good place to start

play23:02

would just be projects that you find

play23:04

online try to get an understanding of

play23:06

how do other people do this and you can

play23:10

use

play23:11

YouTube as a resource hopefully my

play23:14

channel um you can use other like blog

play23:17

posts I mean the internet is full of

play23:20

people who are really good teachers so

play23:23

YouTube slash other things on the

play23:26

internet there is one data set that's

play23:29

really great for introductory machine

play23:31

learning and just trying to play around

play23:34

with code and data and building simple

play23:37

models so that is the

play23:42

UCI

play23:44

machine learning

play23:47

repository great resource to you know

play23:50

try dabble

play23:52

yourself um and then finally there's a

play23:55

lot of data on

play23:56

kaggle and kagle also makes it really

play24:00

easy to learn from other people and to

play24:02

see what other people are doing so

play24:04

that's a really great place to interact

play24:06

with a community of people all

play24:08

passionate about machine learning from

play24:11

there it's all about papers so a lot of

play24:16

cutting edge research people will be

play24:18

writing papers about those people will

play24:20

be publishing their work and literally

play24:22

describing oh this is what we do in

play24:25

order to achieve these results and I had

play24:28

recommend papers and honestly trying to

play24:31

implement those papers for yourself and

play24:33

to see if you can reproduce those

play24:35

results and this is really the area

play24:38

where you know I can't tell you how to

play24:40

become an expert you just have to find

play24:42

something that you really like and go

play24:44

deep into it and this is where honestly

play24:48

in this section the practice and

play24:50

research section this is about just

play24:52

scouring the internet and learning from

play24:54

other people who are experts in machine

play24:58

learning

play24:59

in this machine learning road map we

play25:00

started out with foundations of things

play25:03

that you have to learn in order to get

play25:05

started on machine learning and those

play25:08

include math and programming and then

play25:11

after that we talked about Core Concepts

play25:13

in machine learning and I named a few

play25:15

different models that are pretty

play25:16

fundamental to a lot of more complex

play25:20

machine learning and finally at the very

play25:23

end we talked about how do you practice

play25:26

how do you gain more exposure and

play25:28

expertise and all of that is about you

play25:30

know practice and reading and trying to

play25:33

learn from experts on the Internet or

play25:36

you know if you know them in real life

play25:38

like that's great too but the internet

play25:40

is a great resource this road map makes

play25:42

for a great syllabus for machine

play25:43

learning course so stay tuned I'm going

play25:46

to be teaching some of these Concepts

play25:48

and don't forget to subscribe

Rate This
โ˜…
โ˜…
โ˜…
โ˜…
โ˜…

5.0 / 5 (0 votes)

Related Tags
Machine LearningFoundational MathProgramming SkillsCore ConceptsData AnalysisNeural NetworksModel TrainingKylie YingAI EducationML RoadmapPython Coding