How I Became A Data Scientist (No CS Degree, No Bootcamp)

Egor Howell
5 Jan 202412:28

Summary

TLDRThis video details the speaker's journey into becoming a data scientist, offering advice for aspiring data scientists in 2024. With a strong math and physics background, the speaker shares their academic struggles and eventual discovery of data science through an AI documentary. They emphasize the importance of hard work, learning in small chunks, and the practical steps taken to master machine learning, Python, SQL, and data science packages. The speaker also discusses the challenges of landing the first job and suggests three key strategies to stand out: maintaining a GitHub profile, writing blog posts, and participating in Kaggle competitions.

Takeaways

  • 😀 The speaker emphasizes the allure of being a data scientist, especially with the rise of AI, and offers personal insights into the profession.
  • 🌟 The journey into becoming a data scientist is unique for everyone, but the video aims to provide inspiration and general guidance.
  • 🏫 The speaker's background in mathematics and physics was foundational to their path in data science, highlighting the importance of a strong STEM foundation.
  • 📚 Early exposure to complex scientific concepts through shows like 'The Big Bang Theory' sparked an interest in physics, leading to academic pursuits in the field.
  • 🎓 Despite not getting into their top university choices, the speaker's determination led to a place at the University of Surrey, where they learned the value of hard work.
  • 🔬 The realization that physics research wasn't as envisioned came from a year of research at the National Physical Laboratory, prompting a shift in career focus.
  • 🤖 A documentary on DeepMind's AlphaGo inspired the speaker to explore AI, leading to the discovery of data science as a field of interest.
  • 📈 The speaker's physics background provided a solid foundation in prerequisite knowledge for data science, such as linear algebra, calculus, and statistics.
  • 💻 Learning Python and SQL was crucial for the speaker's transition into data science, with practical experience gained through online courses and projects.
  • 🔑 The speaker applied to over 300 roles to secure their first data science job, underscoring the importance of persistence in job hunting.
  • 🏆 Standing out as a data scientist can be achieved by having a GitHub profile, writing blog posts, and participating in Kaggle competitions, showcasing practical skills and initiative.

Q & A

  • What is the speaker's background in terms of education and family influence?

    -The speaker comes from a strong math and science background with a family history of studying physics and engineering. They were naturally inclined towards STEM subjects due to their family's academic pursuits.

  • What sparked the speaker's interest in physics?

    -The speaker's interest in physics was sparked at the age of 12 when they watched 'The Big Bang Theory' and became fascinated by topics like quantum mechanics, gravity, and general relativity.

  • What were the speaker's A-Level results and how did it affect their university choices?

    -The speaker achieved four A*s in Maths, Physics, Further Maths, and Chemistry, and three As and five Bs in other subjects. They initially applied to Oxford, Imperial, Nottingham, Manchester, and Southampton, but only received offers from Manchester, Southampton, and Nottingham. They chose Manchester as their firm choice and Southampton as their insurance choice.

  • How did the speaker's performance in A-Levels affect their university admission?

    -The speaker received a C in Physics in their A-Levels, which led to rejections from both Manchester and Southampton, their firm and insurance choices, respectively. They were then offered a place at the University of Surrey through the clearing process.

  • What was the speaker's experience like during their undergraduate studies at the University of Surrey?

    -At Surrey, the speaker improved their work ethic and achieved first-class honors in their first two years. They were accepted into a master's program, which included a year of research at the National Physical Laboratory.

  • How did the speaker discover data science and what inspired them to pursue it?

    -The speaker discovered data science after watching a documentary on DeepMind's AlphaGo on YouTube. They became fascinated with how AI was trained and started researching machine learning, reinforcement learning, and related fields.

  • What prerequisites did the speaker have that helped them in learning data science?

    -The speaker had a strong foundation in linear algebra, calculus, and statistics from their physics background, which allowed them to dive directly into machine learning and understand the algorithms.

  • What was the first course the speaker took to learn machine learning, and how did it impact their learning journey?

    -The first course the speaker took was Andrew Ng's 'Machine Learning Specialization'. It provided them with a solid foundation in machine learning algorithms and introduced them to programming in Python.

  • How did the speaker learn Python and which resources did they use?

    -The speaker learned Python by asking a university lecturer for course notes from a computational physics course and taking an online course called 'Tutorial Sprint Python'. They learned basic Python syntax, functions, loops, and classes.

  • What was the speaker's approach to learning SQL and how did it benefit them?

    -The speaker learned SQL through an online course called 'Tutorial Sprint SQL'. The course covered everything they use in their day-to-day job and helped them prepare for interviews and work as a data scientist.

  • What strategies did the speaker use to secure their first data science job, and how challenging was it?

    -The speaker applied to over 300 roles during their final year of university to secure their first data science job. They believe that securing the first job is a numbers game and requires persistence, practice in interviews, and taking assessments.

  • What advice does the speaker give for standing out as a data scientist, especially for those looking for entry-level jobs?

    -The speaker recommends having a GitHub profile showcasing projects, writing blog posts to demonstrate learning and curiosity, and participating in Kaggle competitions to show the ability to solve business problems using data science.

Outlines

00:00

🌟 Journey to Becoming a Data Scientist

The speaker shares their personal journey into the field of data science, which is currently a highly sought-after career due to the rise of AI. They aim to provide inspiration and guidance for aspiring data scientists, emphasizing that everyone's path is unique. The speaker's background in mathematics and physics, sparked by an interest in STEM subjects from a young age, led to academic achievements and eventual disillusionment with physics research. This realization prompted a shift towards data science after watching a documentary on AI by DeepMind, which piqued their interest in machine learning and its applications.

05:01

📚 Learning Path in Data Science

The speaker outlines their learning process in data science, which involved leveraging their existing knowledge in mathematics, statistics, and physics. They took Andrew Ng's Machine Learning specialization course and transitioned from Fortran to Python, learning essential programming concepts and data science-specific packages. The speaker also learned SQL and engaged in practical projects by applying machine learning models to datasets from Kaggle. They highlight the challenge of securing the first job in data science, having applied to over 300 roles, and stress the importance of persistence and hands-on experience.

10:02

🚀 Standing Out as a Data Scientist

To stand out as a data scientist, the speaker recommends three key actions: maintaining an active GitHub profile to showcase skills and projects, writing blog posts to document learning and demonstrate interest in the field, and participating in Kaggle competitions to apply data science skills to real-world problems. They argue that these activities offer significant rewards relative to the effort invested and can differentiate candidates, especially those seeking entry-level positions. The speaker concludes by encouraging viewers to take these steps and to follow for more insights into breaking into the field of data science.

Mindmap

Keywords

💡Data Scientist

A data scientist is a professional who utilizes scientific and statistical methods to extract insights from structured and unstructured data. In the video, the speaker details their journey into becoming a data scientist, which is central to the theme of career development and the rise of AI and data-driven decision-making.

💡STEM

STEM stands for Science, Technology, Engineering, and Mathematics. It represents a broad educational and professional domain that the speaker was naturally inclined towards, as indicated by their family background in math and physics, and is foundational to the video's narrative of pursuing a career in data science.

💡Machine Learning

Machine learning is a subset of artificial intelligence that provides systems the ability to learn and improve from experience without being explicitly programmed. The speaker's interest in machine learning was sparked by a documentary on AI, and it is a key component of the data science field they eventually entered.

💡Reinforcement Learning

Reinforcement learning is a type of machine learning where an agent learns to make decisions by taking actions in an environment to maximize some notion of cumulative reward. The speaker mentions it as one of the concepts they found interesting, which is part of the broader theme of learning and applying complex algorithms in data science.

💡Deep Learning

Deep learning is a branch of machine learning based on artificial neural networks with representation learning. The speaker's fascination with how AI was trained to be a world champion in Go led them to explore deep learning, which is integral to the video's message about the allure and complexity of AI technologies.

💡Programming Language

A programming language is a formal language comprising a set of instructions used to produce various kinds of output. The speaker discusses their experience with Fortran and Python, emphasizing the importance of learning programming languages in the context of data science.

💡Python

Python is a high-level, interpreted, and general-purpose programming language that is widely used for back-end web development, data analysis, machine learning, and more. The speaker's learning of Python is highlighted as a crucial step in their journey to becoming a data scientist.

💡SQL

SQL (Structured Query Language) is a domain-specific language used in programming and designed for managing data held in a relational database management system. The speaker mentions learning SQL as part of their skill development for a career in data science.

💡GitHub

GitHub is a web-based hosting service for version control using Git. It is a platform where developers can share code and collaborate on projects. The speaker advises having a GitHub profile to showcase one's coding projects and skills, which is a practical tip for job seekers in the data science field.

💡Kaggle

Kaggle is an online community of data scientists and machine learning practitioners that provides a platform for predictive modeling and analytics competitions. The speaker recommends participating in Kaggle competitions as a way to demonstrate practical data science skills and problem-solving abilities.

💡Blog Post

A blog post is an individual written article or story posted to a blog. The speaker suggests writing blog posts as a means to document and showcase one's learning process and interests in data science, which can help in standing out as a job candidate.

Highlights

The speaker details their journey into becoming a data scientist amidst the rise of AI.

A strong math background influenced by family members with degrees in math and physics.

Early interest in physics sparked by watching The Big Bang Theory and further research on quantum physics.

Achieved four A*s, 3 As, and 5 Bs in GCSEs, reflecting a natural inclination towards STEM subjects.

Lack of work ethic in school days and the realization of being overconfident in abilities.

Rejection from top universities and acceptance into the University of Surrey through clearing.

The importance of hard work realized during university studies, leading to first-class honors.

Master's program involvement in research at the National Physical Laboratory on acoustic thermometry.

A shift in career interest from physics to data science after watching a documentary on AlphaGo.

The discovery of data science as a field that intersects with various disciplines like math, statistics, and computer science.

Learning machine learning algorithms and the importance of prerequisite knowledge in linear algebra and calculus.

Taking Andrew Ng's Machine Learning specialization course and transitioning from Fortran to Python.

Learning Python syntax, functions, loops, and classes through university resources and online courses.

Acquiring SQL skills through online courses and its importance in day-to-day data science tasks.

Building simple projects on Kaggle to apply machine learning models and gain practical experience.

The challenge of securing the first data science job with over 300 applications during university.

Starting the first data science role at an insurance company and the importance of hands-on experience.

Advice on standing out as a data scientist by having a GitHub profile, writing blog posts, and participating in Kaggle competitions.

Emphasizing that there is no one-size-fits-all path to becoming a data scientist and the value of personal effort.

Transcripts

play00:00

it's no lie that being a data scientist

play00:02

is probably one of the coolest jobs out

play00:04

there at the moment particularly with

play00:06

the rise of AI last year in this video I

play00:09

want to detail my journey into how I

play00:11

became addicted scientist and offer

play00:13

advice for how you become one in 2024

play00:16

it's important to mention that not

play00:17

everyone's journey is the same but

play00:19

hopefully this video will strike some

play00:21

inspiration or some general guidance on

play00:23

how you could become a data scientist

play00:25

we'll start with covering my background

play00:28

how I discover data science How I

play00:30

Learned data science how I got my first

play00:32

job and general advice for you looking

play00:35

to break into the field so let's get

play00:37

into

play00:39

it I come from quite a heavy math

play00:42

background my mom has a math degree both

play00:45

my grandparents studied physics and my

play00:47

great Grandad was even engineer so I was

play00:49

always naturally inclined and pushed

play00:52

into that direction of stem subjects I

play00:54

remember when I was 12 years old is what

play00:56

really got me into physics I was

play00:58

watching The Big Bang Theory and they

play01:00

were talking about things such as

play01:03

quantum blute gravity general relativity

play01:06

and these topics were really interesting

play01:08

so I went and Googled all about them

play01:10

obviously I didn't know much of anything

play01:13

at that point but they were still very

play01:15

interesting to me and that's when I

play01:17

decided to pursue physics for my gcss I

play01:20

got four a Stars 3 A's and 5 BS which is

play01:23

not bad at all however it wasn't exactly

play01:26

genius level and at that time I thought

play01:29

was much smarter than I was and my work

play01:31

eth think was really poor my 4 a stars

play01:34

were in maths physics further maths and

play01:37

chemistry so those were the four

play01:39

subjects I took to a level at a level

play01:42

nothing much changed I was still being

play01:44

quite lazy wasn't working too hard

play01:47

because in reality I thought I was

play01:48

smarter than I was I was quite

play01:51

delusional at the time and I thought I

play01:53

can get into the best universities even

play01:55

though like I said I wasn't working very

play01:57

hard so in the UK you get AO choice of

play02:00

five universities that you can apply to

play02:02

so my five choices were Oxford Imperial

play02:05

Nottingham Manchester and Southampton

play02:09

now obviously both Oxford and Imperial

play02:11

both rejected me as they are the top

play02:13

institutions in the world and our point

play02:15

in time my grades and work ethic like I

play02:18

said went very good however Manchester

play02:21

Southampton and notam all gave me offers

play02:24

I firmed Choice Manchester which

play02:26

required me to get AAR AAR a and my

play02:29

insurance Choice was Southampton where I

play02:31

needed 3 A's so results day comes around

play02:34

for my a level results and let's just

play02:36

say I didn't do too well as I was

play02:38

expecting I got a star in maths a b in

play02:42

further maths but a c in physics so for

play02:44

someone who wants to study Physics at

play02:46

University that's not great and

play02:49

Manchester and Southampton both rejected

play02:51

me in the UK we have something called

play02:53

clearing now clearing is where on

play02:56

results day universities may have some

play02:58

space on certain courses and people who

play03:00

didn't get into their firm or Insurance

play03:02

Choice can apply to those universities

play03:05

and luckily the University of Sur offer

play03:07

me a place to study Physics of astronomy

play03:10

come September 2017 at s is where I

play03:13

really learned that there is no

play03:15

substitute for hard work I know it's a

play03:17

cliche but it's true in my first two

play03:19

years I had a much better work ethic

play03:22

than I did in my school days and in my

play03:25

first year I got off first and in my

play03:27

second year I also got off first at the

play03:29

end of my second year I got accepted

play03:31

onto the master's program as part of the

play03:33

Masters program you have to do a year of

play03:36

research it's kind of like a mini PhD

play03:39

and my year was done at the national

play03:41

physical laboratory in Teddington or MPL

play03:43

for short and my thesis was on measuring

play03:47

air temperature gradients using acoustic

play03:49

thermometry during this year I enjoyed

play03:52

it however physics research from that

play03:55

small snippet I had wasn't quite I

play03:57

envisioned it being like and in real

play03:59

reality I found things to move a lot

play04:01

slower than I would have liked and that

play04:05

wasn't quite for me and I kind of fell

play04:06

out of love of

play04:10

physics I still remember to this day

play04:12

exactly how I discovered data science

play04:15

after I com back from a day of work at

play04:17

npl a video appeared on my YouTube

play04:19

homepage and it was deep Minds alphao

play04:22

documentary on where they trained an AI

play04:25

bot to be the go world champion leak of

play04:28

all after watching a documentary I

play04:30

became fascinated about how they train

play04:33

this this AI like what Al GRS did they

play04:35

use and what kind of process did they

play04:37

take I was looking into reinforcement

play04:39

learning deep learning markof chains all

play04:42

these things obviously at that point in

play04:44

time I didn't understand everything but

play04:47

I found it also interesting so I looked

play04:49

online of basically opportunities and

play04:52

what kind of people or what kind of

play04:54

professions use machine learning and

play04:56

that's how I stumbled across data

play04:58

science

play05:01

like most people I had the age- old

play05:03

question how do I learn data science

play05:06

data science cuts and intersects into so

play05:08

many field maths statistics computer

play05:11

science that it seems overwhelming

play05:14

however if you break down you're

play05:15

learning small chunks is very doable

play05:18

coming from a physics background I

play05:21

pretty much had all the prerequisite

play05:22

knowledge I needed I new linear algebra

play05:25

I new calculus and I new statistics so

play05:28

that means I could straight into the

play05:30

machine learning and and understanding

play05:32

how the algorithms work the first course

play05:35

I took was Andrew n's course called The

play05:37

Machine learning specialization I took

play05:40

this course back in

play05:42

2020 and this is when it was still the

play05:44

2012 version and all the exercises were

play05:47

in octave or mat lab it's been revamped

play05:49

and it's got more Cutting Edge

play05:51

algorithms in there such as

play05:53

reinforcement learning recommended

play05:54

systems and it's also taught in Python

play05:57

at this point in time I only only had

play05:59

experience in one programming language

play06:02

and that was Fortran so we got taught

play06:04

Fortran in my first 2 years of

play06:07

University and for those of you who

play06:09

don't know what Fortran is it's probably

play06:11

one of the oldest highlevel programming

play06:14

languages out there it was written in

play06:16

the 1950s with it being my first

play06:19

programming language it made me not

play06:21

really like coding that much because

play06:23

everything was manual hard there's not

play06:26

many packages available for Tran

play06:28

reflecting on it learning forr was kind

play06:31

of a blessing in disguise because it

play06:33

really got me to really think

play06:36

programmatically and like I said

play06:37

everything had to be done from scratch

play06:39

and so when I went about learning python

play06:42

it was so much easier for me the way I

play06:45

learned python was by simply contacting

play06:48

one of the lecturers at my University

play06:50

who taught a computational Physics

play06:53

course basically asked them for the

play06:55

course note and it was just an

play06:57

introduction to python in reality any

play07:00

intro to python course would have been

play07:02

sufficient I also took the tutorial

play07:04

Sprint uh python course and it basically

play07:06

Tau me all the things that was in those

play07:08

lecture notes the main things I learned

play07:11

were python syntax functions Loops

play07:14

classes all kind of the regular things

play07:16

you need to know behind a programming

play07:18

language to design or build anything I

play07:21

then went to learn a bit more of the

play07:24

data science specific packages numpy

play07:27

pandas M poop lib and also psyit learn

play07:31

these were done on the kagle courses and

play07:33

these are very useful these are kind of

play07:35

like the main packages you would use day

play07:37

to day as a data scientist after python

play07:39

it was then to learn the other language

play07:42

of data science which is SQL the way I

play07:44

learned SQL was that again I took the

play07:47

tutorial Sprint uh online course to SQL

play07:50

it took me around a few days and to be

play07:52

honest that course literally covers

play07:54

everything I use now in my day-to-day

play07:56

job it teaches you all the basics and

play07:58

more some that you will likely use in

play08:01

any interview and also in most jobs uh

play08:04

nowadays when you're a data

play08:06

scientist after upscaling in machine

play08:09

learning Python and SQL I then basically

play08:12

started building some really simple

play08:14

projects what I did is that I would get

play08:17

some data set from kagle and I would

play08:19

just randomly apply just loads of

play08:22

machine learning models to these data

play08:23

sets I will link in the description

play08:25

below a lot of these projects but

play08:28

comparing them to my ability now they

play08:30

weren't very good but they allowed me to

play08:32

get my hands dirty and just try out

play08:34

loads of models I built linear

play08:36

aggression listic regression decision

play08:38

treats just a range of algorithms and it

play08:41

really taught me how they work and how

play08:42

to apply them to a real life

play08:47

problem the hardest part by far is

play08:49

securing the first job you dedicate a

play08:52

lot of time to learning all these skills

play08:53

with the hope that you will learn that

play08:55

first role I'm not joking when I said I

play08:57

appli to over 300 roles in my final year

play09:00

of University trying to get this first

play09:03

data science job so when it comes to

play09:05

your first role I I honestly believe

play09:08

it's purely a numbers game you really

play09:10

just have to put yourself out there have

play09:12

practice in interviews have practice in

play09:14

these take of assessment to land their

play09:16

first role I've got my first role at an

play09:19

insurance company kind like mid-level

play09:20

sites in the UK it wasn't some fancy

play09:23

Fang or Quant hedge font like I said it

play09:26

was just a regular firm that was you

play09:29

know really good and I worked with some

play09:31

amazing people you don't need to work at

play09:33

one of these top companies particularly

play09:35

at the start because in reality is in

play09:37

some of these smaller companies you may

play09:39

learn more because you may be asked to

play09:41

do more things be more Hands-On of a lot

play09:43

of the infrastructure like anything in

play09:46

life it's really up to you to Excel and

play09:48

put an effort you can not grow at all in

play09:50

big companies and you can grow a lot in

play09:52

small

play09:55

companies the final thing I want to

play09:57

discuss is how you can stand out as a

play09:59

data scientist in my opinion these three

play10:01

things are very simple to do and they

play10:04

give you so much more rewards than the

play10:05

effort you put into them the first one

play10:07

is make sure you have a GitHub profile

play10:09

and populate it on the screen is what

play10:12

mine looks like again mine's not that

play10:14

fancy but it does have some you know it

play10:17

looks good and it has some nice things

play10:19

added to it what what languages I know U

play10:22

what I do and some basic reposts of my

play10:25

past projects you can do the same easily

play10:28

in fact just my template and add some

play10:30

basic repos of you basically learning

play10:33

python it doesn't need to be too

play10:35

complicated but I promise you most

play10:37

people applying for entry-level jobs

play10:39

won't even have this the second one is

play10:42

write a blog post I'm still amazed at

play10:45

why people think this is so much harder

play10:47

than it really is the goal of writing a

play10:49

blog post particularly if you're just

play10:51

trying to land a job you're not trying

play10:53

to make the post go viral it's more just

play10:56

to Showcase your learning and show that

play10:58

you're interested in your you're curious

play10:59

and willing to document your work the

play11:02

simplest way you can write a blog post

play11:04

is for example say you learn how to

play11:07

implement functions in Python write a

play11:09

blog post about how you implement

play11:11

functions in Python it really is how

play11:13

simple don't over complicate it the

play11:16

final one which is a bit more tricky and

play11:18

that is enter a kagle competition and do

play11:22

reasonably well now what kagle will show

play11:24

to the employer is that you're able to

play11:26

break down a business problem into code

play11:29

in a data science way and that's really

play11:31

useful because your job as a data

play11:34

scientist is to unify business with data

play11:38

and to solve that

play11:40

problem the thing I want to stress is

play11:42

that there is no one best way to become

play11:45

a dicta scientist but my journey

play11:48

hopefully gives you some inspiration or

play11:50

some guidance or even some tips on how

play11:52

you can tailor your learning or the

play11:54

steps you can take to become one in

play11:56

2024 I highly recommend you action on

play11:59

those three key things I mentioned at

play12:01

the end of the video that is get GitHub

play12:04

profile write a simple blog post and

play12:07

maybe even enter a kago competition

play12:09

these three things will set you apart

play12:11

from pretty much every other candidate

play12:13

particularly for entry level jobs so I

play12:16

really really recommend you try them if

play12:19

you enjoy this video and want to learn

play12:20

more about data science and how to break

play12:22

into data science then make sure you

play12:24

click the like And subscribe button and

play12:26

I'll see you in the next video

Rate This

5.0 / 5 (0 votes)

Related Tags
Data ScienceAI JourneyCareer AdviceMachine LearningPython CodingSQL SkillsGitHub ProfileKaggle CompetitionSTEM BackgroundEducational Guidance