Google Cloud infrastructure

Qwiklabs-Courses
16 Apr 2024 08:01

Summary

TLDR: Google Cloud's infrastructure is built on three layers: networking and security, compute and storage, and data and AI/ML products. It offers a variety of computing services, including Compute Engine, Kubernetes Engine, App Engine, Cloud Functions, and Cloud Run, catering to different needs from server management to serverless execution. Google's Tensor Processing Units (TPUs) provide specialized hardware for machine learning, enhancing efficiency and speed. For storage, Google Cloud provides options like Cloud Storage with various classes, and databases such as Bigtable, Cloud SQL, Spanner, Firestore, and BigQuery, tailored to structured and unstructured data needs.

Takeaways

  • 🌐 Google has worked with data and AI since its founding in 1998, and launched Google Cloud in 2008 to offer secure and flexible cloud computing and storage services.
  • 📚 The Google Cloud infrastructure is structured into three layers: networking and security, compute and storage, and data and AI/ML products.
  • 🔒 Networking and security form the foundational layer of Google Cloud, supporting all infrastructure and applications.
  • 💻 Compute and storage are decoupled in Google Cloud, allowing them to scale independently based on demand.
  • 🧠 The top layer focuses on AI and machine learning, providing tools for data ingestion, storage, processing, and business insights.
  • 🛠️ Compute Engine offers IaaS with virtual resources similar to a physical machine, providing maximum flexibility for server instance management.
  • 🚀 Google Kubernetes Engine (GKE) runs containerized applications in the cloud, differing from Compute Engine's individual virtual machine approach.
  • 📦 App Engine is a PaaS offering that binds code to libraries for infrastructure needs, allowing developers to focus on application logic.
  • 🎯 Cloud Functions is a serverless execution environment that executes code in response to events without the need for server management.
  • 🏃 Cloud Run is a fully managed platform for running stateless workloads, abstracting infrastructure management and automatically scaling.
  • 💡 Google introduced TPUs in 2016 to overcome scaling limitations for ML workloads, offering higher efficiency and speed over CPUs and GPUs.
  • 🗃️ Google Cloud offers various database and storage services, including Cloud Storage, Bigtable, Cloud SQL, Spanner, Firestore, and BigQuery, each suited to different data types and business needs.
  • 📈 Cloud Storage has four storage classes catering to different access frequencies and cost considerations: standard, nearline, coldline, and archive.
  • 🔑 Choosing the right Google Cloud service depends on whether the data is structured or unstructured and the specific business requirements.

Q & A

  • What is the history of Google's involvement with data and artificial intelligence?

    -Google has been working with data and artificial intelligence since its early days as a company, starting in 1998, and launched Google Cloud in 2008 to provide secure and flexible cloud computing and storage services.

  • How does Google Cloud infrastructure organize its services?

    -Google Cloud infrastructure is organized into three layers: networking and security at the base, compute and storage in the middle, and data and AI/machine learning products at the top.

  • What is the significance of separating compute and storage in Google Cloud?

    -Compute and storage are decoupled in Google Cloud to allow them to scale independently based on need, providing flexibility and efficiency for users.

  • What are the different compute services offered by Google Cloud?

    -Google Cloud offers Compute Engine (IaaS), Google Kubernetes Engine (for containerized applications), App Engine (PaaS), Cloud Functions (serverless execution), and Cloud Run (fully managed compute platform).

  • What is the purpose of Google's Tensor Processing Unit (TPU)?

    -TPUs are custom-developed ASICs used to accelerate machine learning workloads, providing higher efficiency and performance for AI and ML applications compared to CPUs and GPUs.

  • How does Cloud Run differ from other compute services in terms of infrastructure management?

    -Cloud Run is a fully managed compute platform that abstracts away all infrastructure management, allowing users to focus on writing code and automatically scaling up and down from zero.

  • What is the role of hardware in providing processing power for Google Cloud services?

    -The processing power comes from hardware, such as computer chips like CPUs, GPUs, and Google's custom TPUs, which are designed to meet the specific computation needs of domains like machine learning.

  • Why is it important to decouple compute and storage in cloud computing?

    -Decoupling compute and storage allows for proper scaling capabilities, enabling compute and storage to scale separately according to the application's requirements.

  • What are the different storage classes offered by Google Cloud Storage and their use cases?

    -Google Cloud Storage offers standard storage for frequently accessed data, nearline storage for infrequently accessed data, coldline storage for data accessed less than once every 90 days, and archive storage for data accessed less than once a year.

  • How does one choose between Google Cloud's database and storage services?

    -The choice depends on the data type (unstructured or structured), the nature of the workloads (transactional or analytical), and whether SQL access is needed.

  • What is the difference between transactional and analytical workloads in the context of databases?

    -Transactional workloads require fast data inserts and updates for maintaining a system snapshot with standardized queries, while analytical workloads involve reading entire datasets and performing complex queries like aggregations.

Outlines

00:00

🌐 Google Cloud Infrastructure Overview

This paragraph introduces Google Cloud's infrastructure, noting that Google has worked with data and AI since 1998 and launched Google Cloud in 2008. The infrastructure is structured into three layers: networking and security, compute and storage, and data and AI/ML products. The compute layer includes services like Compute Engine, Google Kubernetes Engine, App Engine, Cloud Functions, and Cloud Run, each tailored to different computing needs. The introduction of Tensor Processing Units (TPUs) in 2016 is emphasized, showcasing Google's commitment to providing efficient hardware for machine learning workloads. The paragraph concludes by discussing the decoupling of compute and storage for scalable cloud solutions.

05:05

💾 Google Cloud Storage Solutions

This paragraph delves into Google Cloud's storage offerings, emphasizing the decoupling of compute and storage for scalable solutions. It outlines the four primary storage classes available in Cloud Storage: standard, nearline, coldline, and archive, each designed for different access frequencies and cost considerations. The paragraph also differentiates between unstructured and structured data, explaining the suitability of Cloud Storage for unstructured data. For structured data, it discusses transactional and analytical workloads, and how to choose between Google Cloud's database services such as Cloud SQL, Spanner, Firestore, and BigQuery based on SQL access needs and the nature of the workload. Bigtable is also mentioned for its real-time, high-throughput capabilities.

Keywords

💡 Google Cloud

Google Cloud refers to the suite of cloud computing services that Google offers to businesses, developers, and individuals. It includes infrastructure, platform, and software services. In the video, Google Cloud is central to the discussion as it provides the infrastructure and services for data and AI/ML applications, showcasing its role in modern computing.

💡 Infrastructure as a Service (IaaS)

IaaS is a form of cloud computing where the service provider offers virtualized computing resources over the internet. In the context of the video, Google Compute Engine is an example of IaaS, providing virtual machines and associated infrastructure to users, allowing them to manage server instances as if they were physical machines.

💡 Containerization

Containerization is the process of packaging software applications and their dependencies into a container that can run consistently across different computing environments. The video mentions Google Kubernetes Engine (GKE), which facilitates running containerized applications in a cloud environment, emphasizing the efficiency and portability of containers.

💡 Platform as a Service (PaaS)

PaaS is a category of cloud computing services that provides a platform allowing customers to develop, run, and manage applications without the complexity of building and maintaining the infrastructure typically associated with developing and launching an app. Google App Engine is highlighted as a fully managed PaaS, abstracting away infrastructure management and enabling developers to focus on application logic.

💡 Serverless Computing

Serverless computing is a cloud computing execution model where the cloud provider dynamically manages the allocation and provisioning of servers. The video describes Google Cloud Functions as a serverless environment, where users can execute code in response to events without managing servers, emphasizing the ease of use and cost-effectiveness of this model.

💡 Tensor Processing Unit (TPU)

A Tensor Processing Unit is an application-specific integrated circuit developed by Google specifically for accelerating machine learning workloads. The video explains that TPUs are more efficient than traditional CPUs and GPUs for AI and ML applications due to their domain-specific architecture, which is tailored for tasks like matrix multiplication.

💡 Cloud Storage

Cloud storage is a service in which data is maintained, managed, and backed up in remote data centers. In the video, Google Cloud Storage is discussed as a solution for storing unstructured data, with different storage classes catering to various access needs and cost considerations.

💡 Structured Data

Structured data refers to information that is organized into a formatted repository, typically a database, where it can be easily accessed, managed, and updated. The video contrasts structured data with unstructured data and discusses how different Google Cloud services are suited to handle structured data based on transactional or analytical workloads.

💡 BigQuery

BigQuery is a fully managed data warehouse solution that enables scalable analysis of petabyte-scale datasets. The video positions BigQuery as an ideal service for analytical workloads that require SQL commands, highlighting its ability to handle large-scale data analysis.

💡 Firestore

Firestore is a transactional, NoSQL, document-oriented database service offered by Google Cloud. It is mentioned in the video as a suitable option for transactional data that will be accessed without SQL, emphasizing its real-time capabilities and scalability.

💡 Cloud SQL

Cloud SQL is a fully managed relational database service for various database engines. The video discusses Cloud SQL as an option for transactional data that requires SQL access, noting its suitability for local to regional scalability.

Highlights

Google has been working with data and artificial intelligence since 1998.

Google Cloud was launched in 2008 to provide secure and flexible cloud computing and storage services.

Google Cloud infrastructure is structured in three layers: networking and security, compute and storage, and data and AI/machine learning products.

Compute and storage are decoupled in Google Cloud to scale independently based on need.

Google offers a range of computing services including Compute Engine, Google Kubernetes Engine, App Engine, Cloud Functions, and Cloud Run.

Compute Engine provides maximum flexibility for managing server instances.

Google Kubernetes Engine runs containerized applications in a cloud environment.

App Engine is a fully managed PaaS offering that binds code to libraries for infrastructure access.

Cloud Functions executes code in response to events in a serverless environment.

Cloud Run is a fully managed platform for running stateless workloads without infrastructure management.

Google introduced the Tensor Processing Unit (TPU) in 2016 to accelerate machine learning workloads.

TPUs are custom-developed ASICs for higher efficiency in AI and ML applications compared to CPUs and GPUs.

Cloud TPUs are integrated across Google products, making advanced hardware available to Google Cloud customers.

Compute and storage in cloud computing can scale separately, unlike desktop computing.

Google Cloud offers fully managed database and storage services tailored to different data types and business needs.

Cloud Storage has four primary storage classes for different access needs: standard, nearline, coldline, and archive.

Structured data can be managed using SQL-based services like Cloud SQL and Spanner, or NoSQL solutions like Firestore and Bigtable.

BigQuery is Google's data warehouse solution for analyzing large datasets with SQL commands.

Transcripts

00:00

Let’s explore Google Cloud infrastructure.

00:03

Google has been working with data and artificial intelligence since its early days as a company

00:07

in 1998.

00:09

Ten years later, in 2008, Google Cloud was launched to provide secure and flexible cloud

00:14

computing and storage services.

00:17

You can think of the Google Cloud infrastructure in terms of three layers.

00:20

At the base layer is networking and security, which lays the foundation to support all of

00:25

Google’s infrastructure and applications.

00:29

On the next layer sit compute and storage.

00:32

Google Cloud separates, or decouples, as it’s technically called, compute and storage so

00:37

they can scale independently based on need.

00:40

The top layer includes data and AI/machine learning products, which enable you to perform

00:45

tasks to ingest, store, process, and deliver business insights, data pipelines, and ML

00:51

models.

00:52

Thanks to Google Cloud, these tasks can be accomplished without a need to manage and

00:57

scale the underlying infrastructure.

00:59

Let’s begin with compute.

01:02

Organizations with growing data needs often require lots of compute power to run data

01:06

and AI jobs.

01:08

And as organizations design for the future, the need for compute power only grows.

01:13

Google offers a range of computing services.

01:16

The first is Compute Engine.

01:18

Compute Engine is an infrastructure as a service, or IaaS, offering which provides compute,

01:23

storage, and network resources virtually that are similar to a physical machine.

01:28

You use the virtual compute and storage resources in the same way as you would manage them locally.

01:34

Compute Engine provides maximum flexibility for those who prefer to manage server instances

01:38

themselves.

01:40

The second is Google Kubernetes Engine, or GKE. GKE runs containerized applications in

01:47

a cloud environment, as opposed to on an individual virtual machine like Compute Engine.

01:53

A container represents code packaged up with all its dependencies.

01:57

The third computing service offered by Google is App Engine, a fully managed PaaS, or platform

02:02

as a service, offering.

02:04

PaaS offerings bind code to libraries that provide access to the infrastructure an application

02:09

needs.

02:11

This allows more resources to be focused on application logic.

02:15

Then there is Cloud Functions, which executes code in response to events, like when a new

02:20

file is uploaded to Cloud Storage.

02:22

It’s a completely serverless execution environment, which means you don’t need to install any

02:27

software locally to run the code and you are free from provisioning and managing servers.

02:33

Cloud Functions is often referred to as Functions as a Service.
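
To make the event-driven model a little more concrete, here is a minimal sketch of a handler that reacts to a "new file uploaded" event. It is plain Python and does not use the real Cloud Functions libraries or deployment steps; the function name `on_file_uploaded` and the event fields `bucket` and `name` are assumptions chosen for this illustration.

```python
def on_file_uploaded(event: dict) -> str:
    """Handle a hypothetical 'file uploaded' event and describe the work to do."""
    # Field names are illustrative assumptions, not a documented event schema.
    bucket = event.get("bucket", "unknown-bucket")
    name = event.get("name", "unknown-object")
    return f"Processing gs://{bucket}/{name}"


# Simulate the platform invoking the handler once per event.
sample_event = {"bucket": "example-bucket", "name": "reports/2024.csv"}
message = on_file_uploaded(sample_event)
print(message)
```

The point of the sketch is the shape of the code: there is no server loop, only a function that receives an event and runs to completion.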

02:37

And, finally, there is Cloud Run, a fully managed compute platform that enables you

02:42

to run request- or event-driven stateless workloads without having to worry about servers.

02:47

It abstracts away all infrastructure management so you can focus on writing code, and it automatically

02:53

scales up and down from zero, so you never have to worry about scale configuration.

02:59

Cloud Run charges only for the resources you use, so you never pay for over-provisioned

03:04

resources.
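
A "stateless, request-driven workload" can be sketched as a small HTTP handler whose response depends only on the incoming request. The example below uses only the Python standard library; the names `build_response`, `StatelessHandler`, and `serve` are hypothetical, and reading the listening port from a `PORT` environment variable (defaulting to 8080) is an assumption about how the container is configured, not something stated in the video.

```python
import os
from http.server import BaseHTTPRequestHandler, HTTPServer


def build_response(path: str) -> str:
    """Compute the reply from the request alone; nothing is kept between calls."""
    name = path.strip("/") or "world"
    return f"Hello, {name}!"


class StatelessHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        # Each request is handled independently, so any instance can serve it.
        body = build_response(self.path).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve() -> None:
    """Start the server on the port given by PORT (assumed), else 8080."""
    port = int(os.environ.get("PORT", "8080"))
    HTTPServer(("0.0.0.0", port), StatelessHandler).serve_forever()
```

Calling `serve()` starts the server locally. Because the handler holds no state, a platform is free to start or stop copies of it as traffic changes, which is what makes scaling to and from zero practical.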

03:05

Where does the processing power come from?

03:07

It’s from the hardware: from computer chips.

03:11

However, traditional computer chips, like central processing units, or CPUs, and even

03:17

the more recent graphics processing units, or GPUs, may no longer scale to adequately

03:23

meet the rapid demand for ML.

03:27

To help overcome this challenge, in 2016, Google introduced the Tensor Processing Unit,

03:33

or TPU.

03:34

TPUs are Google’s custom-developed application-specific integrated circuits (ASICs) used to accelerate

03:41

machine learning workloads.

03:43

TPUs act as domain-specific hardware, as opposed to general-purpose hardware like CPUs and

03:49

GPUs.

03:50

This allows for higher efficiency by tailoring the architecture to meet the computation needs

03:55

in a domain, such as the matrix multiplication in machine learning.

03:59

TPUs are generally faster than current GPUs and CPUs for AI and ML applications.

04:06

They are also significantly more energy-efficient.

04:10

Cloud TPUs have been integrated across Google products, making this state-of-the-art hardware

04:15

and supercomputing technology available to Google Cloud customers.
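
For readers unfamiliar with the operation mentioned above, the following is a small, unoptimized sketch of matrix multiplication in plain Python. It only shows the arithmetic pattern (repeated multiply-and-add) that ML hardware is designed to speed up; it says nothing about how a TPU is actually built. The function name `matmul` and the sample values are made up for this illustration.

```python
def matmul(a, b):
    """Multiply matrix a (m x k) by matrix b (k x n) with explicit loops."""
    m, k, n = len(a), len(b), len(b[0])
    if any(len(row) != k for row in a):
        raise ValueError("inner dimensions must match")
    result = [[0.0] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            for p in range(k):
                # One multiply-accumulate step; m * k * n of these in total.
                result[i][j] += a[i][p] * b[p][j]
    return result


# Two input rows with two features each, mapped to three outputs by a weight matrix.
inputs = [[1.0, 2.0], [3.0, 4.0]]
weights = [[0.5, -1.0, 0.0], [1.5, 2.0, 1.0]]
outputs = matmul(inputs, weights)
print(outputs)  # [[3.5, 3.0, 2.0], [7.5, 5.0, 4.0]]
```

Even this tiny case needs 12 multiply-accumulate steps, and the count grows with the product of the three dimensions, which is why hardware specialized for this one pattern can pay off.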

04:20

Let’s now examine storage.

04:22

For proper scaling capabilities, compute and storage are decoupled.

04:26

That is one major difference between cloud and desktop computing.

04:31

With cloud computing, compute and storage can scale separately.

04:35

Most applications require a database and storage solution of some kind.

04:39

Google Cloud offers fully managed database and storage services.

04:43

These include: Cloud Storage, Cloud Bigtable, Cloud SQL, Cloud

04:48

Spanner, Firestore, and BigQuery. How do you choose from these products and services?

04:55

Well, it depends on the data type and business needs.

04:58

Let’s look at the data type, which includes unstructured versus structured data.

05:04

Unstructured data is information stored in a non-tabular form such as documents, images,

05:09

and audio files.

05:11

Unstructured data is usually suited to Cloud Storage.

05:15

Cloud Storage has four primary storage classes.

05:18

The first is standard storage.

05:20

Standard storage is considered best for frequently accessed, or “hot,” data.

05:25

It’s also great for data that is stored for only brief periods of time.

05:30

The second storage class is nearline storage.

05:33

This is best for storing infrequently accessed data, like reading or modifying data once

05:38

per month or less, on average.

05:40

Examples include data backups, long-tail multimedia content, or data archiving.

05:46

The third storage class is coldline storage.

05:49

This is also a low-cost option for storing infrequently accessed data.

05:53

However, as compared to nearline storage, coldline storage is meant for reading or modifying

05:59

data at most once every 90 days.

06:03

The fourth storage class is archive storage.

06:06

This is the lowest-cost option, used ideally for data archiving, online backup, and disaster

06:11

recovery.

06:12

It’s the best choice for data that you plan to access less than once a year, because it

06:16

has higher costs for data access and operations and a 365-day minimum storage duration.
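
The access-frequency guidance above can be summarized as a simple rule of thumb. The sketch below encodes it as a hypothetical helper, `suggest_storage_class`; the exact cut-offs (30, 90, and 365 days between accesses) are a simplification of the transcript's wording, and a real decision would also weigh cost and minimum storage duration.

```python
def suggest_storage_class(days_between_accesses: float) -> str:
    """Map an expected access interval to a storage class (simplified rule of thumb)."""
    if days_between_accesses < 0:
        raise ValueError("days_between_accesses must be non-negative")
    if days_between_accesses >= 365:
        return "archive"   # accessed less than about once a year
    if days_between_accesses >= 90:
        return "coldline"  # accessed at most about once every 90 days
    if days_between_accesses >= 30:
        return "nearline"  # accessed about once per month or less
    return "standard"      # frequently accessed ("hot") data


for days in (1, 45, 120, 730):
    print(days, suggest_storage_class(days))
```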

06:23

Alternatively, there is structured data, which represents information stored in tables, rows,

06:29

and columns.

06:30

Structured data comes in two types: transactional workloads and analytical workloads.

06:36

Transactional workloads stem from online transaction processing systems, which are used when fast

06:41

data inserts and updates are required to build row-based records.

06:45

This is usually to maintain a system snapshot.

06:48

They require relatively standardized queries that impact only a few records.

06:53

Then there are analytical workloads, which stem from online analytical processing systems,

06:58

which are used when entire datasets need to be read.

07:01

They often require complex queries, for example, aggregations.
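
The difference between the two workload types shows up in the queries themselves. The sketch below uses Python's built-in `sqlite3` module purely as a local stand-in (it is not one of the Google Cloud services discussed here), and the `orders` table and its rows are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, amount REAL, status TEXT)"
)
conn.executemany(
    "INSERT INTO orders (customer, amount, status) VALUES (?, ?, ?)",
    [("ana", 20.0, "new"), ("ben", 35.0, "new"), ("ana", 15.0, "new")],
)

# Transactional style: a small, standardized update that touches a single record.
with conn:
    conn.execute("UPDATE orders SET status = 'shipped' WHERE id = ?", (2,))
shipped = conn.execute("SELECT status FROM orders WHERE id = 2").fetchone()[0]

# Analytical style: read the whole table and aggregate across it.
totals = conn.execute(
    "SELECT customer, SUM(amount) FROM orders GROUP BY customer ORDER BY customer"
).fetchall()

print(shipped)  # shipped
print(totals)   # [('ana', 35.0), ('ben', 35.0)]
```

The update locates one row by key and changes it, while the aggregation has to scan every row, which is why the two patterns tend to be served by differently designed systems.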

07:06

Once you’ve determined if the workloads are transactional or analytical, you then

07:09

need to identify whether the data will be accessed using SQL or not.

07:13

So, if your data is transactional and you need to access it using SQL, then two options

07:19

are Cloud SQL and Spanner.

07:21

Cloud SQL works best for local to regional scalability, while Spanner works best to scale

07:26

a database globally.

07:28

If the transactional data will be accessed without SQL, Firestore might be the best option.

07:34

Firestore is a transactional, NoSQL, document-oriented database.

07:38

If you have analytical workloads that require SQL commands, BigQuery is likely the best

07:43

option.

07:44

BigQuery, Google’s data warehouse solution, lets you analyze petabyte-scale datasets.

07:49

Alternatively, Bigtable provides a scalable NoSQL solution for analytical workloads.

07:55

It’s best for real-time, high-throughput applications that require only millisecond

08:00

latency.
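
The selection process described in the transcript is essentially a small decision tree, which can be sketched as follows. The function `choose_service` and its parameter names are hypothetical, and the logic covers only the criteria mentioned in the video (data type, workload type, SQL access, and global scale); real choices usually involve more factors.

```python
def choose_service(
    structured: bool,
    workload: str = "",
    uses_sql: bool = False,
    global_scale: bool = False,
) -> str:
    """Suggest a storage or database service using the video's criteria only."""
    if not structured:
        return "Cloud Storage"
    if workload == "transactional":
        if uses_sql:
            # Cloud SQL for local to regional scale, Spanner for global scale.
            return "Spanner" if global_scale else "Cloud SQL"
        return "Firestore"
    if workload == "analytical":
        return "BigQuery" if uses_sql else "Bigtable"
    raise ValueError("workload must be 'transactional' or 'analytical'")


print(choose_service(structured=False))
print(choose_service(True, "transactional", uses_sql=True, global_scale=True))
print(choose_service(True, "analytical", uses_sql=True))
```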
