Bell's data modernization journey: executing for the future with Google

Google Cloud
1 Jul 202413:03

Summary

TLDREric from Bell's Enterprise data platform team shares his journey from intern to architect and discusses the company's transition to Google Cloud. He highlights three key mistakes made during their on-premise data management: proliferating data silos, neglecting developer productivity, and poor data governance practices. Eric outlines how Bell is addressing these issues by standardizing on a single data platform, improving developer tools, and adopting data cataloging with Google Cloud's Dataplex.

Takeaways

  • 📊 Bell is Canada's largest telecom provider, serving multiple provinces and regions.
  • 🕹️ The speaker, Eric, works in Bell's Enterprise Data Platform team and has been with the company for 17 years, starting as an intern.
  • 🏆 The speaker used his experience in competitive gaming to draw parallels between analyzing mistakes in games and reviewing past data management strategies.
  • ☁️ Bell has recently migrated to Google Cloud and is reviewing past mistakes to avoid repeating them in the new environment.
  • 🏗️ The first major mistake identified was the creation of numerous data silos, leading to inefficiencies and challenges in data management.
  • 🚧 The second mistake was the lack of focus on improving developer productivity, leading to redundant and inefficient coding practices.
  • 🛠️ Bell is now working on standardizing their data platform, consolidating their data into Google BigQuery, and improving data accessibility.
  • 💻 The company is also implementing tools like Terraform modules and Backstage to standardize and streamline developer workflows.
  • 🔄 Bell plans to create config-driven libraries to reduce redundant coding and improve the efficiency of data pipeline creation.
  • 📑 The third mistake involved poor data governance practices, with metadata managed in Excel and Confluence. Bell is now standardizing on DataPlex for better data cataloging and governance.

Q & A

  • Who is the speaker, and what is his role at Bell?

    -The speaker is Eric, who works in the Enterprise Data Platform team at Bell. He has been with the company for 17 years, starting as an intern and then moving into roles such as Data Engineer and currently, an Architect.

  • What is Bell, and what regions does it serve?

    -Bell is the largest telecommunications provider in Canada, serving regions like Quebec, Ontario, Atlantic provinces, and Manitoba.

  • Why does the speaker mention playing competitive video games, and how does it relate to the talk?

    -The speaker mentions playing competitive video games like Rocket League to draw a parallel between analyzing mistakes in gaming and analyzing mistakes made during Bell's data platform migration. The idea is that reviewing mistakes is crucial for improvement in both contexts.

  • What were the three major mistakes Bell made with their on-premise data systems?

    -The three major mistakes were: 1) Deploying dozens of data environments and creating data silos. 2) Failing to improve developers' productivity and experience. 3) Poor data governance practices, including the management of metadata in Excel spreadsheets and Confluence.

  • What was the impact of deploying multiple data environments at Bell?

    -Deploying multiple data environments led to a complex web of data pipelines, making it difficult for engineers to create value and requiring them to spend a lot of time moving data around instead of solving core problems.

  • How did Bell plan to address the issue of multiple data environments in their migration to Google Cloud?

    -Bell planned to standardize on a single data platform in Google Cloud, specifically using Google BigQuery. This would centralize data and reduce the need for engineers to move data across different environments.

  • What problems did Bell encounter with developer productivity, and how are they addressing these in the cloud migration?

    -Bell found that developers were repeatedly solving the same problems and writing redundant code, particularly in areas like data compaction. To address this, they are building a solid foundation with Terraform modules and using tools like Backstage to standardize and automate processes, improving productivity.

  • What is Backstage, and how is Bell utilizing it in their cloud migration?

    -Backstage is an open-source developer portal created by Spotify that allows engineers to use templates to quickly spin up applications. Bell is using Backstage to simplify and standardize their development processes during their cloud migration.

  • What issue did Bell face with data governance, and what tool are they using to improve it?

    -Bell struggled with managing table metadata using Excel spreadsheets and Confluence, leading to inefficiencies. To improve data governance, they are now standardizing with Google DataPlex to catalog assets and provide a centralized location for data governance.

  • What is the speaker's main message to the audience regarding their own experiences with large-scale migrations?

    -The speaker encourages the audience to learn from Bell's mistakes and shares their experiences to help others avoid similar pitfalls. He invites others with experience in large-scale migrations to reach out and share their learnings, particularly as Bell prepares for a significant migration next month.

Outlines

00:00

🎤 Introduction and Background

Eric introduces himself as a member of the Enterprise Data Platform team at Bell, where he has worked for 17 years, starting as an intern. He gives a brief overview of Bell as Canada’s largest telecom provider, emphasizing its extensive service coverage. Eric then shares a personal anecdote about playing competitive video games like Rocket League, using it as a metaphor to discuss learning from mistakes, which ties into the main theme of his talk: reflecting on past errors in Bell’s data management and sharing three major mistakes made during their transition to Google Cloud.

05:01

🏗️ Mistake 1: Data Silos and Fragmented Systems

Eric delves into the first major mistake made at Bell: the proliferation of data environments and silos over the years. Initially, the goal was to create a unified data lake when implementing Hadoop in 2016. However, the effort was abandoned midway, leading to a fragmented system where engineers worked in isolated tech stacks, resulting in a complex web of data pipelines. This fragmentation hindered efficient data usage and slowed down processes like data monetization. The solution, Eric explains, is to standardize on a single data platform in Google Cloud, consolidating data to streamline access and reduce the need for redundant pipelines.

10:01

🚀 Mistake 2: Neglecting Developer Productivity

The second mistake Eric discusses is the failure to prioritize developer productivity on Bell’s on-premise systems. Developers were burdened with managing infrastructure and repeatedly solving the same problems due to a lack of standardized tools. For example, multiple teams independently developed similar libraries for tasks like data compaction in Hadoop. To address this, Bell is now focused on building a solid foundation in the cloud, starting with standardized Terraform modules and leveraging tools like Backstage, an open-source developer portal created by Spotify, to improve the efficiency of application development and reduce code redundancy.

🛠️ Mistake 3: Poor Data Governance Practices

The third and final mistake highlighted is Bell’s inadequate data governance practices. Eric admits that table metadata was being managed in Excel spreadsheets and Confluence, leading to inefficiencies and confusion among engineers. Moving to the cloud, Bell is now standardizing with Dataplex to properly catalog and manage data assets, ensuring that information is easily accessible and well-organized. This shift aims to eliminate the chaos of managing data across multiple platforms and improve overall data governance. Eric concludes by inviting the audience to share their experiences with large-scale migrations and offering to connect for further discussions.

Mindmap

Keywords

💡Data Silos

Data silos refer to isolated data storage systems or environments where data is kept separate from other data. In the video, the speaker mentions that Bell deployed dozens of data silos over the years, which led to inefficiencies and difficulties in managing and accessing data. These silos made it challenging to create value and implement technologies like AI/ML, as engineers spent more time managing data pipelines rather than solving core problems.

💡Cloud Migration

Cloud migration is the process of moving data, applications, and other business elements from on-premises environments to a cloud computing environment. The video discusses Bell's ongoing cloud migration to Google Cloud, emphasizing the opportunity to correct past mistakes made with on-premise systems, such as data silos and inefficient data governance practices. The migration aims to standardize and modernize Bell's data infrastructure.

💡Data Governance

Data governance involves the management and oversight of data assets to ensure data quality, consistency, and security across an organization. The speaker highlights poor data governance practices at Bell, such as managing metadata in Excel spreadsheets and Confluence, which led to inefficiencies. The migration to Google Cloud includes adopting better data governance tools like Dataplex to catalog and manage data more effectively.

💡Developer Productivity

Developer productivity refers to the efficiency and effectiveness of software engineers in creating and maintaining code. In the video, the speaker stresses that Bell did not initially focus on improving developer productivity, leading to redundant work and inefficiencies. By modernizing their approach in the cloud, including using tools like Backstage and standardizing processes with Terraform, Bell aims to significantly enhance developer productivity.

💡Terraform

Terraform is an open-source infrastructure as code software tool that enables users to define and provision data center infrastructure using a high-level configuration language. The video mentions that Bell is using Terraform to build standardized modules that enforce best practices, naming conventions, and other standards, helping to prevent the repetitive coding issues they experienced on-premise.

💡Backstage

Backstage is an open-source platform created by Spotify that helps developers manage and streamline their work by providing templates and tools in a single interface. The speaker mentions that Bell conducted a proof of concept with Backstage and found it promising for improving developer experience. Backstage will be used to allow developers to quickly spin up applications, reducing redundancy and increasing efficiency.

💡Data Pipelines

Data pipelines are a series of data processing steps that move data from one system to another. The video describes how Bell's legacy systems resulted in a 'spaghetti' of data pipelines, leading to inefficiencies and challenges in creating value. The cloud migration aims to simplify and centralize data pipelines, reducing the need for engineers to focus on data movement and allowing them to concentrate on more value-added tasks.

💡ETL (Extract, Transform, Load)

ETL stands for Extract, Transform, Load, a process in data warehousing that involves extracting data from various sources, transforming it into a suitable format, and loading it into a destination database. The speaker notes that Bell's ETL processes were only partially migrated to their data lake in 2016, contributing to the fragmented data environment. The cloud migration includes plans to improve and modernize ETL processes.

💡Data Monetization

Data monetization is the process of using data to generate revenue. In the video, the speaker mentions that Bell is heavily involved in data monetization, which has been hampered by the fragmented and inefficient data systems. By centralizing data in the cloud, Bell aims to streamline data access and analysis, thereby enhancing their ability to monetize data effectively.

💡Data Lake

A data lake is a storage system that holds vast amounts of raw data in its native format until it is needed. Bell attempted to create a data lake in 2016 with Hadoop, intending to consolidate their data for easier access and analysis. However, the initiative was only partially completed, leading to continued data silos. The cloud migration includes plans to complete this vision and create a more unified data environment.

Highlights

Introduction by Eric, who has worked at Bell for 17 years, starting as an intern and now an architect in the Enterprise Data Platform team.

Bell is the largest telecom provider in Canada, serving regions including Quebec, Ontario, Atlantic provinces, and Manitoba.

Eric's playful icebreaker about competitive video games, sharing his personal interest in Rocket League.

Bell's recent migration to Google Cloud, aiming to learn from past mistakes made on-premises.

Highlighting three major mistakes made by Bell in their data practices: deploying data silos, neglecting developer productivity, and poor data governance.

Mistake 1: Creating multiple data silos over time, leading to fragmented data environments and inefficient data management.

The shift to a standardized data platform on Google BigQuery to centralize data and streamline access.

Mistake 2: Neglecting developer productivity by focusing too much on infrastructure management rather than improving developer experience.

Efforts to modernize by building Terraform modules for standardized naming conventions, best practices, and labeling in the cloud.

Bell's proof of concept with Backstage, an open-source developer portal created by Spotify, to streamline application development.

Mistake 3: Poor data governance practices, including managing metadata in Excel spreadsheets and Confluence.

Adoption of Google DataPlex for standardized data governance, improving metadata management and reducing reliance on outdated methods.

Encouragement to the audience to avoid these common mistakes, emphasizing the importance of learning from Bell's experiences.

Eric's invitation to the audience to share their experiences with large-scale migrations, expressing a desire to learn from others.

Closing remarks with an open invitation for questions and further discussion on the topic.

Transcripts

play00:01

[Music]

play00:10

all right hello everyone hopefully

play00:13

you're all having a great time uh what a

play00:15

wonderful event right really lots of

play00:18

nice talk so um I'm Eric um I work in

play00:21

the Enterprise data platform team in

play00:23

Bell uh I've been there for the past 17

play00:26

years uh started there as a intern and

play00:31

then uh data engineer with the Microsoft

play00:34

SQL Server stack uh before data was

play00:37

considered pretty nice right and then

play00:40

grew as a data engineer with the dupe

play00:43

and then here I am as an architect right

play00:47

um so

play00:49

uh quickly uh what what is Bell right

play00:52

it's the largest tcom provider in in

play00:54

Canada uh bigger than the other

play00:57

competitor right before and uh we

play01:00

essentially uh we've been uh providing

play01:05

services to like Quebec Ontario Atlantic

play01:08

provinces um

play01:10

Manitoba so uh yeah so essentially today

play01:14

uh I'm going to just ask a quick

play01:17

question just to break the eyes right I

play01:19

see a few younger folks here uh so I

play01:22

want to ask is anybody here uh plays

play01:26

online video games like competitive

play01:28

video games specifically like um uh

play01:32

League of Legend fortnite Counter Strike

play01:37

anybody

play01:39

nobody okay wow we don't have lots of

play01:42

competitive players here okay okay so I

play01:44

do I I'll I'll admit it I do and uh I'm

play01:48

a huge fan of Rocket League personally

play01:50

and uh the reason um one of the thing

play01:53

that I've started when I played this

play01:55

game five years ago is um I would often

play01:59

like uh come in play my games and then a

play02:03

bad goal happen I lose the game and then

play02:06

what do I do just blame the teammate

play02:09

right who else could it

play02:10

be so um but it turns out that when you

play02:15

play quite a bit of of competitive game

play02:17

one of the things you learn to do is um

play02:20

watch your only replays right it's the

play02:23

only way where you're going to be able

play02:25

to figure out your mistake and then not

play02:28

reproduce them obviously so um that's

play02:32

what I did a bit right and the the

play02:34

reason I'm I'm talking a bit about this

play02:36

is we've had the opportunity for the

play02:38

past one year to uh migrate to Google

play02:41

cloud and then sort of look at what like

play02:46

what we did wrong on premise like what

play02:48

are the big mistakes we did on premise

play02:51

and uh what we should change so that it

play02:55

doesn't happen again when we're going to

play02:57

the cloud right so I decided for today's

play03:01

talk to just put a list of a short list

play03:05

of three mistakes we did and then I'm

play03:07

here to share this with you and then I'm

play03:10

going to share our experience and then

play03:12

hopefully you'll learn something from

play03:14

this so the first one and I think it's a

play03:17

pretty common mistake in the Telecom

play03:20

industry is we've um essentially

play03:24

deployed dozens of data environments

play03:26

data silos through the years and uh this

play03:29

just grew over time more and more right

play03:33

so that's number one number two is um we

play03:37

uh we didn't really take time to improve

play03:40

our developers productivity and

play03:43

experience right it was a second thought

play03:45

for us and then last one but not not the

play03:51

least it's the our data governance

play03:53

practices right and tooling that were uh

play03:56

pretty bad let's say so I'm going to go

play04:00

through like all these three uh three uh

play04:04

mistakes we did explain where we went

play04:06

wrong and what we're going to change by

play04:09

going to the cloud so first one right I

play04:12

mentioned are dozens of data warehouse

play04:15

data environments so um for this one uh

play04:21

we we started like um obviously we have

play04:25

lots of Legacy systems in Bell right and

play04:29

um when we implemented a dupe back in

play04:32

2016 the premise was well let's let's

play04:36

build the data Lake let's migrate our

play04:38

ETL through there and then we'll just

play04:41

have a single place to consume our data

play04:44

right uh not exactly what happened okay

play04:47

so what we did is we actually

play04:49

implemented a dupe we migrated a lot of

play04:53

ETL we migrated a lot of data set and

play04:55

then we just stopped Midway we just

play04:59

stopped and and then move on to

play05:00

something else right just focus on value

play05:04

creation so um so yeah obviously what

play05:09

happened is all these data Engineers

play05:11

data Specialists uh were just focusing

play05:15

the they were in their own Tex stack

play05:17

right so imagine you have an an engineer

play05:19

who works in the Microsoft SQL

play05:21

environment and he plays nice there and

play05:25

whenever he needs data he's just going

play05:26

to develop an ETL pipeline to move it to

play05:29

his own data world so what happened is a

play05:31

huge spaghetti a mess of data pipeline

play05:34

all over the place and um it's it's

play05:38

really long for us right like whenever

play05:40

you want to create value you want to do

play05:43

a IML you want to do uh we're we're

play05:46

doing lots of data monetization in Bell

play05:49

and obviously this is hard for us

play05:50

because our Engineers are just focusing

play05:53

time on not solving this problem they're

play05:56

focusing on moving the data

play05:58

around so what we've decided to change

play06:01

on the cloud is we decided to end it

play06:05

there and then just standardize on a

play06:08

single data platform so instead of

play06:09

having okay well we have orle we're

play06:13

going to move orle to Google Cloud we

play06:15

have SQL Server we're going to move that

play06:17

to Google Cloud let's take the time to

play06:20

modernize these Legacy environments move

play06:22

them to Google big query and then once

play06:26

Engineers have their data all in one

play06:29

place it's very simple right you if if

play06:32

if if somebody needs access to the data

play06:35

a simple I am permission and you're done

play06:37

nothing else right no need to develop

play06:39

pipelines to move data around so um so

play06:42

yeah we're we're really eager to have

play06:45

more and more data there so that we can

play06:47

stop just injuring pipelines over and

play06:50

over

play06:51

again okay so second one right second

play06:54

mistake we did is um and I think that's

play06:57

a really important one right you're

play06:59

developers

play07:00

productivity uh so your your developers

play07:03

your engineers are so important you need

play07:06

to improve their experience so what we

play07:09

did on premise is we have a dup we just

play07:12

manage infrastructure we manage server

play07:15

we manage Hardware uh and then that's

play07:18

that's all we do right and we did a

play07:21

little bit of shell scripting here and

play07:23

there to create database to um to create

play07:27

like uh help us with permission but

play07:30

nothing nothing else really so for us it

play07:33

was really not a good experience for

play07:35

developers and what we've realize is

play07:38

when we uh did an assessment recently

play07:41

for migrating our adup we've realized we

play07:44

look at different git repositories look

play07:47

at the source code and realize wow our

play07:50

Engineers are just solving the same

play07:52

problem over and over again right uh

play07:55

just as an

play07:56

example um we had uh a

play07:59

um if you've ever worked with the dup

play08:02

you might have heard of data compaction

play08:04

to to compact small files right into

play08:07

larger files um we essentially had six

play08:11

separate libraries just to do the same

play08:13

thing right all these teams they build

play08:16

the same Library without speaking with

play08:18

each others and here we are right a huge

play08:21

mess of lots of of code so we're

play08:24

essentially going to use that

play08:26

opportunity to modernize so how are we

play08:29

going to solve it few things right first

play08:32

thing we did when we uh migrated is

play08:35

let's right away build a solid

play08:38

foundation and that starts with uh for

play08:41

example building terraform module to

play08:43

standardize on um anything that's like

play08:48

um naming conventions standards best

play08:51

practices labeling so all of that right

play08:54

we would bake it right in so that people

play08:57

don't repeat that code over and and over

play09:00

again second thing is I highly encourage

play09:03

you look at it we recently did a proof

play09:06

of concept with the tool called uh

play09:08

backstage so I'm not sure if anybody

play09:10

knows what backstage is if you don't

play09:12

know there's an interesting talk

play09:14

tomorrow from uh um I think it's HC

play09:18

Healthcare so um they have an

play09:21

interesting talk about how they um they

play09:24

use this tool to help developers It's

play09:26

actually an open-source um developer

play09:29

port that was created by Spotify and

play09:32

then they open source it and then you

play09:34

can sort of have it a bunch of templates

play09:37

and then you can uh templae your

play09:40

applications and then Engineers they go

play09:42

in a UI they simply fill a form and then

play09:45

there we are they can spin up

play09:47

applications very fast right so um after

play09:50

this proof of concept we've been very

play09:52

excited with that and I think we're

play09:55

about to roll it out to more and more

play09:57

type of templates right

play10:01

um so yeah so essentially um the other

play10:05

thing was um that we're going to do to

play10:08

help our developers is we've realized

play10:11

when we did our adup migration that

play10:13

there's um I think there was hundreds of

play10:16

data pipeline almost a thousand

play10:18

pipelines that were just strictly for um

play10:22

ingesting relational database data MySQL

play10:25

Orco postgres db2 right and all of

play10:29

Engineers um they would just copypaste

play10:32

code over and over again so what are we

play10:35

doing for that well um we're going to

play10:38

make config driven uh libraries to do

play10:41

that right to help Engineers be more

play10:44

productive and avoid having to write

play10:46

custom code all the time so what with

play10:49

all of these right it should make our

play10:52

engineer way more productive on the

play10:54

cloud and and stop just writing code

play10:57

focus on other things

play11:00

okay so last last thing but not least is

play11:03

um the last mistake I want to mention is

play11:06

our data governance practice okay so I'm

play11:09

sure we're lots of us are in the same

play11:11

boat so I'll admit it we do manage uh

play11:16

table metadata in Excel spreadsheets and

play11:20

Confluence okay I'll say it and I'm sure

play11:23

lots of you maybe uh have the same

play11:25

situation so the first thing we did by

play11:28

going to Cloud is go standardize with

play11:31

data Plex right make sure our Engineers

play11:34

are um are just um cataloging their

play11:40

assets their tables their pii

play11:43

hspi everything is is in in there right

play11:48

so we can stop having people asking

play11:51

questions on slack email where is this

play11:54

table where is it does it exist well you

play11:57

you will have it somewhere right it will

play11:59

be catalog in there right so um so yeah

play12:04

so essentially that that was what I

play12:06

wanted to discuss today right so three

play12:08

mistakes that we did that hopefully uh

play12:12

you're not going to make right hopefully

play12:14

this talk with Will resonate with a few

play12:16

of you and um I want to really um

play12:20

encourage you to reach out to me talk to

play12:23

me like if you've ever done like a large

play12:25

scale migration we're about to do a

play12:27

large scale migration of our ado

play12:29

um we've migrated just a few use case so

play12:32

far but we're about to do like a huge

play12:35

migration next month I really would like

play12:38

if you if you have any any experience

play12:40

with that reach out to me right share

play12:43

your learnings I would love to hear

play12:45

about it right and then um yeah so thank

play12:49

you for assisting for this this talk and

play12:53

um if you have any questions feel free

play12:55

to ask

play12:58

away

play13:01

w

Rate This

5.0 / 5 (0 votes)

Related Tags
Cloud MigrationData SilosDeveloper ProductivityData GovernanceBell CanadaEnterprise DataTelecom IndustryGoogle CloudTech TalksLessons Learned