Data Engineers = 80% Software Engineers

The Data Tech
17 Jul 202409:43

Summary

TLDRThis video explores the evolving skill set expectations in the data engineering job market from 2024 onwards. It highlights the increasing demand for software engineering skills like API services and basic web development, in addition to traditional data engineering competencies. The speaker emphasizes the importance of adapting to these changes to enhance job prospects and stay relevant in the industry.

Takeaways

  • 😀 The video discusses the evolving skill set expectations in the data engineering job market, particularly from 2024 onwards.
  • 🔍 Predictions from 2020-2023 suggested that AI and data science skills would be crucial for data engineers, but current trends indicate a shift.
  • 💼 The job market now expects data engineers to have a broader set of skills, including software engineering competencies.
  • 👨‍💻 Primary skills of software engineers include programming languages, SQL, data structures and algorithms, databases, version control, software development life cycle, web development, cloud computing, testing, debugging, and mobile development.
  • 🤔 Data engineers typically already possess many of these skills, such as programming languages, data structures and algorithms, databases, version control, DevOps, operating systems (Linux), and cloud computing.
  • 🚀 However, data engineers may lack skills in API services and basic web development, which are becoming increasingly important.
  • 🏢 Companies are building internal products for developers, which often require basic web UIs and API services, leading to a demand for these skills in data engineers.
  • 💡 The video suggests that while these additional skills are not a major requirement, having them can be an advantage and can increase job opportunities.
  • 📈 The impact of not knowing these additional skills in interviews is minimal, as the primary focus remains on core data engineering skills.
  • 📚 Learning new skills like basic web development and API services can be done with minimal effort and can provide an edge in the job market.
  • 🌐 The speaker encourages viewers to stay updated with new skills and technologies, as the tech industry is constantly evolving.

Q & A

  • What is the main focus of the video?

    -The video focuses on the evolving skill set requirements for data engineers, particularly highlighting the increasing expectation of software engineering skills in data engineering roles.

  • What was the prediction for the data engineering job market in 2020-2025?

    -The prediction suggested that people should learn AI and data science and add these to their data engineering skill set to increase job opportunities.

  • What has changed in the data engineering job market from 2024 onwards?

    -From 2024, the data engineering job market has started to expect more software engineering skills as part of the job description (JD).

  • What are some primary skills of software engineers mentioned in the video?

    -The primary skills mentioned include programming languages, SQL, data structures and algorithms, databases, version control systems, software development life cycle, methodologies like Agile and Scrum, web development, cloud computing, testing and debugging, and operating systems like Linux.

  • Which skills do data engineers typically already possess according to the video?

    -Data engineers typically already possess skills in programming languages, data structures and algorithms, databases (especially SQL), version control systems, DevOps (including Kubernetes and Docker), operating systems (like Linux), software development methodologies (like Agile and Scrum), and cloud computing.

  • What skills are data engineers currently lacking according to the video?

    -Data engineers are currently lacking skills in API services and web development, as these are not typically part of their skill set.

  • Why are companies expecting data engineers to know API services and basic web development?

    -Companies are building internal products for developers, which often require a basic UI for user interaction. Data engineers are expected to have some knowledge in these areas to contribute to these projects without the need for hiring additional software engineers or frontend developers.

  • How does the video suggest data engineers can acquire knowledge in API services and web development?

    -The video suggests that spending a few hours on the internet and YouTube can help data engineers learn the basics of API services and web development.

  • What impact does not knowing HTML, CSS, or API services have on a data engineer's interview?

    -Not knowing these skills does not significantly impact the interview outcome. Interviewers are primarily interested in the candidate's data engineering skills, but having some knowledge in these areas can be an added advantage.

  • What is the advice given to data engineers regarding learning new skills in the video?

    -The advice is to be open to learning new skills, such as API services and basic web development, as this can increase visibility and opportunities in the job market. It is also beneficial for cost-saving within companies.

Outlines

00:00

💼 Evolving Skill Sets in Data Engineering

This paragraph discusses the changing landscape of data engineering job market and the evolving skill set expectations. Initially, it highlights the importance of AI and data science in the past, but shifts focus to the current trend where software engineering skills are increasingly being integrated into data engineering roles. The speaker emphasizes the need for data engineers to be proficient in programming languages, SQL, data structures, algorithms, databases, version control systems, software development life cycles, methodologies like Agile and Scrum, web development, cloud computing, testing, debugging, and operating systems like Linux. It also points out that while data engineers are expected to have some knowledge of API services and web development, these are not core requirements but rather additional skills that can enhance their profile. The speaker concludes by stating that the primary focus should remain on data engineering, but having a broader skill set can be advantageous.

05:01

🏢 Internal Product Development and Its Impact on Job Market

The second paragraph delves into how companies are building internal products for developers, which are essentially tools and frameworks that facilitate data movement, replication, and management. These products are designed to be user-friendly for developers within the organization, reducing the need for external hiring for specific tasks like front-end development or API services. The speaker explains that while these skills are not mandatory, having a basic understanding of HTML, CSS, and web development can be beneficial. The paragraph also touches on the interview process, where these additional skills are considered but not critical. The speaker suggests that even if a candidate lacks these skills, their ability to learn and adapt is more important. The emphasis is on the cost-effectiveness of having multi-skilled employees and the potential for data engineers to contribute to internal product development, thereby increasing their value within the company.

Mindmap

Keywords

💡Data Engineering

Data Engineering is the practice of designing, building, and maintaining the systems and processes that enable the collection, storage, and transformation of data. In the video, it is highlighted as a field where job market expectations are evolving, requiring data engineers to also possess skills traditionally associated with software engineering.

💡Software Engineering

Software Engineering refers to the systematic approach to the design, development, and maintenance of software. The video emphasizes that data engineers are increasingly expected to have a software engineering skill set, which includes knowledge of programming languages, databases, and software development methodologies.

💡AI and Data Science

Artificial Intelligence (AI) and Data Science are fields that involve using algorithms, statistical models, and machine learning techniques to analyze and interpret complex data sets. The script mentions that these areas were previously emphasized as essential for data engineers, but the current trend is shifting towards software engineering skills.

💡Job Market Predictions

Job Market Predictions refer to forecasts about future trends in employment opportunities and skill requirements. The video discusses how predictions for the data engineering field have changed, moving from a focus on AI and Data Science to a greater emphasis on software engineering skills.

💡Programming Languages

Programming Languages are formal languages designed to communicate instructions to a computer. The script notes that knowledge of programming languages is a primary skill set for software engineers and is increasingly expected in data engineers as well.

💡SQL

SQL (Structured Query Language) is a standard language for managing and manipulating relational databases. The video highlights SQL as a key skill that data engineers already possess, which is also relevant to software engineering.

💡Version Control Systems

Version Control Systems are tools that help software teams manage changes to source code over time. The script mentions that familiarity with version control systems like Git or GitHub is a common requirement for both software and data engineers.

💡DevOps

DevOps is a set of practices that combines software development (Dev) and IT operations (Ops) to shorten the system development life cycle and provide continuous delivery of high-quality software. The video discusses how DevOps skills, including knowledge of Kubernetes and Docker, are relevant in the data engineering field.

💡API Services

API (Application Programming Interface) Services are sets of protocols and tools for building software applications. The script points out that data engineers are now expected to understand how to create and work with APIs, which is traditionally a software engineering skill.

💡Web Development

Web Development involves creating and maintaining websites, typically using languages like HTML, CSS, and JavaScript. The video suggests that basic web development skills are becoming more important for data engineers, especially in the context of building internal tools and products.

💡Cloud Computing

Cloud Computing is the delivery of computing services, including servers, storage, databases, networking, software, analytics, and intelligence, over the Internet. The script mentions that working with cloud platforms like Microsoft Azure and Google Cloud is increasingly relevant for data engineers.

Highlights

Data engineers and software engineers are increasingly expected to have overlapping skill sets.

In 2022 and 2023, the job market predicted a need for AI and data science skills in data engineering roles.

From 2024 onwards, the data engineering job market is expecting additional skills beyond AI and data science.

Software engineering skills are now part of the expectations in data engineering job descriptions.

Primary skills of software engineers include programming languages, SQL, data structures, algorithms, databases, version control, and software development methodologies.

Data engineers typically already possess skills in programming languages, data structures, algorithms, databases, and version control systems.

DevOps skills, including Kubernetes, Docker, and workflow schedulers like Airflow, are also commonly known among data engineers.

Linux operating system proficiency is a must-have for data engineers.

Cloud computing skills are increasingly being integrated into data engineers' skill sets.

API services and web development are areas where data engineers might lack expertise and are now being sought after in the job market.

Understanding and creating APIs is becoming crucial for data engineers as companies build internal developer-focused products.

Basic web development skills, such as HTML, CSS, and JavaScript, are being expected for building internal tools and UIs.

Mobile development is not a focus area even for software engineers currently, and can be excluded from the essential skills list for data engineers.

Companies are building products for internal use by developers, such as data movement and replication frameworks.

Data engineers are expected to have some knowledge of front-end development to contribute to these internal product developments.

Interviewers are considering knowledge of software engineering skills as 10-20% of the evaluation criteria for data engineering roles.

Even without prior knowledge of certain software engineering skills, data engineers can still be considered if they show a willingness to learn.

Learning additional skills like API services and basic web development can be an add-on and provide an advantage in the job market.

Data engineers are encouraged to stay updated with new technologies and integrate them into their skill sets for better job opportunities.

The speaker has an Instagram page called 'the data Tech' where they upload useful short videos related to data technology.

Transcripts

play00:00

hi welcome to the data Tech 100% of data

play00:03

Engineers or 80% of software Engineers

play00:07

100% of software Engineers are just 50%

play00:10

of data engineer okay what is this all

play00:12

about you're saying something like a

play00:14

calculation okay so this video is going

play00:17

to be completely an interesting fact so

play00:19

please do watch this

play00:20

completely so roughly in 2022 and

play00:24

2023 the prediction of data engineering

play00:27

job market and work Market in 2020 25

play00:30

and 2026 was like okay people should

play00:32

learn uh AI data science and they have

play00:35

to add this as part of their data

play00:37

engineering skill set so then only like

play00:39

we will be getting lot of job

play00:41

opportunities and data engineering but

play00:43

what happening now because always the

play00:45

prediction may it change so what is

play00:48

happening now from the beginning of 2024

play00:52

right the data engineering job market

play00:54

have adding few more skill set as part

play00:56

of the expectation in the JD so what is

play00:59

that is that AI data science no it's not

play01:01

like it's not that

play01:03

actually so recently so I do have many

play01:07

contacts in my LinkedIn and people used

play01:09

to share their interview experience with

play01:10

me and many of my friends and my ex-

play01:12

colleagues have attended lot of

play01:13

interviews in the recent times so this

play01:15

is not just within this month it's it

play01:17

happened I I've been I have been doing

play01:19

this research for almost last three

play01:21

months with my connections and what I

play01:23

come to know is people are expecting

play01:26

software engineering skill set as part

play01:28

of the data engineering skill skill set

play01:31

okay so now as a data engineer how

play01:34

should I know whether what are all the

play01:36

skill sets or software engineering and

play01:38

whether do I already know that or not so

play01:40

that's why I started with 100% data

play01:43

Engineers or 80% software Engineers

play01:46

already okay so what are the primary

play01:49

skill set of software Engineers okay

play01:51

let's start doing the list programming

play01:54

language and SQL and then you need to

play01:58

know data structures andal algorithms

play02:00

and then databases Version Control

play02:03

System Dev apps software development

play02:07

life cycle and methodology like you need

play02:09

to know the zajil methodology scrum and

play02:11

all those stuff web development cloud

play02:14

computing testing and debugging so you

play02:17

need to know API services and how to

play02:19

create an API how to write an API in any

play02:22

any coding it could be on what is API in

play02:24

first place and then you need to know

play02:25

operating system especially Linux and

play02:28

last mobile development so these are all

play02:30

the some of the primary skill set for

play02:32

software engineer a typical software an

play02:35

average software engineer should know

play02:36

all this as part of their skill okay so

play02:39

now as a data engineer what I know

play02:42

already from this list is very maximum

play02:45

right if you see we know programming

play02:46

languages and we know data structures

play02:48

and algorithms right and we know

play02:50

databases when we take databases we we

play02:52

know like SQL and even know sqls so we

play02:55

know databases right predominantly SQL

play02:57

and then version control system so any

play02:59

developers like nowadays like software

play03:01

engineer or a data engineer or a data

play03:03

scientist should know Version Control

play03:04

System like get GitHub bit bucket and

play03:07

everything so we already aware of it

play03:09

right and then uh devops so devops

play03:13

includes like kubernets Docker or

play03:15

workflow schedulers like airflow or any

play03:17

other automation schedulers so which is

play03:19

we are currently working on and we are

play03:21

somehow we are using it in our job so

play03:23

that is also fine we are doing it and

play03:25

then operating system Linux of course we

play03:27

as a data Engineers we work with Linux

play03:29

right predominantly Linux is everything

play03:31

for us software development and

play03:33

methodologies yeah we are working in

play03:35

agile scrum so we are aware of it

play03:38

testing and debugging so anyway we are

play03:40

doing testing and debugging even we

play03:42

write a spar job or a SQL job we do test

play03:44

we write test Frameworks and we do

play03:46

testing and debugging it's part of our

play03:48

any developer life and then cloud

play03:50

computing so of course nowadays people

play03:52

are already started adding cloud

play03:54

computing as part of their skill set and

play03:57

people are working with Microsoft Azu

play04:00

Google cloud and Google Cloud again data

play04:02

Pro bigquery so we anyways like we are

play04:04

in and out of cloud as well so what is

play04:07

not there with us out of this list is

play04:10

API services so we are not into API we

play04:13

don't know how to write an API and what

play04:15

is an API first of all so that is very

play04:17

important nowadays and I'll tell you why

play04:18

people are asking this so that is the uh

play04:21

the core part of the video actually and

play04:24

then web development so do I need to

play04:25

know a web development seriously so do I

play04:28

need to know HTML C CSS or JavaScript or

play04:32

any web server like flask or djo so why

play04:36

I need to know so I I'll give you the

play04:38

answer for this and then mobile

play04:40

Computing so mobile Computing is

play04:41

something even a software Engineers

play04:43

nowadays they are not concentrating on

play04:44

that so just we can remove it from the

play04:46

list okay let's come to the place like

play04:49

why should I have to write an API

play04:51

Services an API call and secondly why

play04:54

should I have to go for web

play04:56

development okay so that's why I said

play04:58

you are already 80% of soft engineer

play05:00

right because you are a 100 percentage

play05:02

of data engineer right so now if you

play05:05

take companies right they started

play05:06

building products and you you can say

play05:08

like yeah I know people are building

play05:10

products okay this products is all about

play05:12

developer perspective so here the end

play05:14

users are Developer within the

play05:16

organization it's not outside the world

play05:19

outside the end user is not like like

play05:21

non- Tech person so we are building

play05:23

products for the developers who work

play05:25

with us within our organization so this

play05:27

is happening this always happens in

play05:29

Anyan parall people will build like

play05:31

continuous deployment Frameworks data

play05:34

movement and replication Frameworks or

play05:36

they used to create this 360° data

play05:38

product and then DM and so many other

play05:41

stuff people create so these are not

play05:43

business benefit projects or products so

play05:46

we create a product within our company

play05:48

people can use it for example you have a

play05:50

database name and you need to know how

play05:52

many uh replication has been there for

play05:54

this database or you have a table name

play05:56

and you need to know this table belongs

play05:58

to which database and which which how

play06:00

many replications we do have for this so

play06:02

build a product for this when you search

play06:04

for the table name you will get all the

play06:06

details about the table whether it could

play06:07

be my SQL or Terra data or no SQL

play06:10

database whatever it is so I created a

play06:13

product so now similar to this people

play06:15

are building so many products like data

play06:17

movement and replication so you want a

play06:20

copy of the source table from your

play06:22

database so you build a UI in which I'll

play06:24

get the request from the user saying

play06:27

that I need this table name uh one

play06:29

replic in my system so I give all this I

play06:31

fill all this information and in backend

play06:34

it creates a schedule for this and then

play06:36

it will start creating the movement data

play06:38

movement and it also create the

play06:40

application so these are all the some

play06:43

products companies are developing

play06:44

internally for the developers to easily

play06:47

complete their work so now this is also

play06:49

become a very good part of the project

play06:52

and people are allocating budget for

play06:53

this and now when you are going for an

play06:56

interview right so the entire team is

play06:58

almost like 15 members in your teamr

play06:59

data engineers and new people are

play07:01

started building such products and for

play07:03

just for uh the building a front end

play07:06

with HTML CSS or basic uh web UI because

play07:09

this is an internal you don't want to

play07:11

perfectly design a very creative UI and

play07:13

all it's not required a basic uh UA is

play07:16

wide up so for this we cannot go for uh

play07:19

hiring right so we cannot go for a

play07:21

software engineer or a frontend

play07:23

developer right or an API service

play07:25

writing an API service just for this I

play07:27

I'm not going to hire a software

play07:29

engineer with my budget what I have so

play07:31

now people are expecting uh data

play07:33

Engineers should know some of the skill

play07:35

set especially it comes to AP services

play07:38

and basic web development information

play07:40

just if you spend one or two hour of

play07:42

time in Internet and YouTube you will

play07:44

get all these so this is what like

play07:46

people are expecting okay so how much

play07:49

this is going to impact in my interview

play07:51

if I say I don't know uh HTML CSS or I

play07:54

don't know API service or any other

play07:56

skill set that a software engineer knows

play07:58

but I don't know but they asking me in

play07:59

the data engineering uh interview so

play08:02

this is not impacting a lot okay this is

play08:04

not at all impacting they are just

play08:05

considering this knowledge 10 10 to 20%

play08:08

from you and if you still say no I don't

play08:11

know but I can able to learn they are

play08:13

ready so they are going to judge you

play08:16

select you based on the data engineering

play08:18

skill set and how you performed in your

play08:21

data engineering skill set in your

play08:22

interview but why I'm telling this to

play08:24

you right so uh preparing this is

play08:27

actually going to be an add-on for you

play08:29

so you are not going to lose any

play08:31

opportunities and you may think I'm I'm

play08:33

I I'm already tired of learning all

play08:35

these new new stuff but that is how

play08:36

system works right so we are in open

play08:39

source world and like any new text tack

play08:41

comes and when we see okay that if I add

play08:44

it as part of my skill set and I'm going

play08:46

to get more uh visibility then obviously

play08:49

we have to do it okay but our primary

play08:52

stack is still data engineering right

play08:54

end of the day the motto company's motto

play08:56

is cost saving so if I know that skill

play08:58

already then it's going to be an added

play08:59

advantage and even if you see in my team

play09:02

people are building gen it's an internal

play09:04

product so if you know how to create a

play09:06

gen very basic thing and then you can

play09:09

able to apply that in your project and

play09:11

if you know some visualization tools and

play09:13

you know how to uh project that as part

play09:16

of your data and that again going to

play09:18

give you some added advantage and that's

play09:20

what I wanted to tell you so the agenda

play09:23

of the video is completed and thanks for

play09:24

watching if you really like this video

play09:26

please do subscribe my channel and

play09:28

forward this to your friends and

play09:29

colleagues and please do share and tag

play09:31

me in LinkedIn as well and I do have an

play09:33

Instagram page called the data Tech

play09:35

where I used to upload lot of short res

play09:38

videos which will be very useful so

play09:40

please do follow me over there thanks

play09:42

for watching

Rate This

5.0 / 5 (0 votes)

Étiquettes Connexes
Data EngineeringSoftware SkillsJob MarketAIData ScienceAPI ServicesWeb DevelopmentInterview InsightsSkill SetTech Trends
Besoin d'un résumé en anglais ?