What is Object Storage?

IBM Technology
15 Sept 202110:27

Summary

TLDRThis script delves into the concept of object storage, ideal for low-cost, low-performance storage needs, particularly for internet workloads like web applications and content delivery. It explains the fundamental components of an object, including its unique ID, data, metadata, and attributes, and how they are organized into scalable 'buckets.' The script also highlights the benefits of object storage, such as global replication for improved latency in video streaming, efficient file sharing for global collaboration, and secure hosting for web assets and regulatory data.

Takeaways

  • πŸ’Ύ Object storage is designed for low-cost, low-performance storage suitable for Internet workloads, including web applications and hosting.
  • πŸ—‚οΈ The term 'object' in object storage refers to any kind of file, which can be stored without file restrictions, but requires certain characteristics.
  • πŸ”‘ Every object in object storage must have a unique identifier (ID) for retrieval, along with the actual data, metadata, and attributes.
  • πŸ“ Metadata is crucial for object storage as it provides information about the file, such as creation date, file type, and size, enabling search and indexing.
  • πŸ›‘οΈ Attributes in object storage define permissions and access rights, such as who can override, download, or delete the object.
  • πŸ—‘οΈ Objects are stored in 'buckets' which can scale to hold billions of objects and are virtual constructs in the storage system.
  • πŸ”„ The replication of objects across multiple physical devices ensures data integrity and security, providing redundancy in case of hardware failure or network issues.
  • πŸ“ˆ Object storage pricing and metering are unique and can vary based on performance needs, with different tiers for 'cold' and 'cold-ish' storage.
  • 🌐 Object storage is ideal for global use cases like video streaming, where replication across different geographical locations can reduce latency.
  • 🀝 It supports file sharing and collaboration across the globe, allowing for version control and easy access for distributed teams.
  • πŸ›οΈ Suitable for regulatory data and digital archiving, object storage provides a secure and cost-effective solution for long-term data retention.

Q & A

  • What is the primary purpose of object storage?

    -The primary purpose of object storage is to provide a low-cost, low-performance storage solution designed for Internet workloads, such as web applications and hosting, as well as a secure and durable place for data that needs to be retained for long periods due to regulatory or legal reasons.

  • What are the four essential components of an object in object storage?

    -The four essential components of an object in object storage are: 1) A unique identifier (ID) for the object, 2) The actual data content of the object, 3) Metadata that provides information about the object, and 4) Attributes that define permissions and other properties of the object.

  • How does object storage handle data replication for data integrity and security?

    -Object storage replicates the data across multiple physically separated devices. This ensures that even if one device fails or there is a network outage, the data remains accessible from the other replicated locations, thus maintaining data integrity and security.

  • What is a 'bucket' in the context of object storage?

    -In object storage, a 'bucket' is a virtual container where objects are stored. Buckets can scale to hold billions of objects and are a fundamental part of organizing data in object storage systems.

  • How does object storage pricing and metering work?

    -Object storage pricing and metering are based on factors such as the number of 'gets' (data retrievals), 'puts' (data uploads), and the amount of storage used. Different providers may have unique metrics for billing, but performance level and data access frequency are key considerations that affect cost.

  • What are the different storage tiers mentioned in the script, and how do they differ in cost and use case?

    -The script mentions 'cold times 10' as the coldest and least expensive tier, suitable for data archived for long periods. 'Cold' is the next tier, slightly more expensive, for data accessed occasionally. 'Cold-ish' is for more frequently accessed data, like website assets, and is more costly than the colder tiers.

  • How can object storage be used for video streaming?

    -Object storage can be used for video streaming by replicating video files globally across different locations. This reduces latency and improves the streaming experience for users by hosting the content closer to them geographically.

  • What benefits does object storage offer for file sharing and collaboration?

    -Object storage facilitates file sharing and collaboration by allowing multiple versions of a file to be uploaded and stored with metadata tracking changes. This enables secure, low-cost, and efficient global collaboration on documents, spreadsheets, and other files.

  • How is object storage useful for web hosting and digital archiving?

    -Object storage is useful for web hosting as it can host website assets at a lower cost and with high availability. For digital archiving, it provides a secure and durable place to store and access large volumes of data that are not frequently accessed, such as digital photographs of manuscripts.

  • What is the role of APIs in interacting with object storage?

    -APIs are the primary method of interaction with object storage. Users perform operations such as uploading, downloading, and managing objects through API calls, which allows for automation and integration with other systems.

  • How can object storage help with regulatory data and cold storage requirements?

    -Object storage can store large volumes of data securely and durably, making it suitable for regulatory data that must be retained for long periods. The ability to replicate data across devices ensures that the data is protected against hardware failures or other issues.

Outlines

00:00

πŸ—ƒοΈ Object Storage Basics and Components

The first paragraph introduces the concept of object storage, which is a storage tier designed for low-performance, low-cost storage suitable for Internet workloads such as web applications and hosting. It discusses the need for a secure and long-term storage solution for regulatory and legal reasons. The paragraph delves into the definition of an 'object' in object storage, explaining that it can be any kind of file without restrictions. It then outlines the four essential components of an object: a unique identifier (ID), the data itself, metadata which includes details about the file such as creation date and file type, and attributes that pertain to access and permissions. The paragraph concludes with an explanation of how objects are stored in 'buckets', which can scale to hold billions of objects and are replicated across different physical devices for data integrity and security.

05:01

πŸ“Š Object Storage Pricing, Performance, and Use Cases

The second paragraph explores the complexities of object storage, particularly its pricing and metering structures, which can vary among providers. It emphasizes the importance of response time and performance, noting that higher performance options are more expensive. The paragraph categorizes data storage into different tiers based on access frequency, from the 'coldest of cold tiers' for rarely accessed data to 'cold-ish' for more frequently accessed data like website assets. It also highlights the benefits of object storage, such as its API-driven nature, which enables front-end customization. The paragraph discusses various use cases for object storage, including video streaming with global replication to reduce latency, file sharing for global collaboration, and web hosting for regulatory data and digital archiving projects, all while maintaining security and cost-effectiveness.

10:01

πŸ“ž Closing Remarks and Further Resources

The final paragraph serves as a closing statement, thanking the audience for their time and inviting them to ask questions in the comments section. It encourages viewers to like, subscribe, and provide feedback for more content like the current video. Additionally, it promotes IBM Cloud Labs, which are free, browser-based interactive Kubernetes labs, and provides a call to action for viewers to grow their skills and earn badges. The paragraph concludes with a reminder for viewers to check the provided links for more information on IBM Cloud Labs.

Mindmap

Keywords

πŸ’‘Object Storage

Object storage is a type of storage architecture that manages data as objects, as opposed to other storage architectures like file systems which manage data as a hierarchy of files and directories. It is designed for scalability, flexibility, and is often used for internet-scale applications. In the video, object storage is described as a relatively low-cost, low-performance storage solution ideal for internet workloads, such as web applications and hosting, as well as for archiving large volumes of data.

πŸ’‘Internet Workloads

Internet workloads refer to the demands and processes associated with running applications and services over the internet. This includes activities like web browsing, online transactions, and data transfers. The video discusses how object storage is tailored to serve these types of workloads, emphasizing its use for web applications, website hosting, and content delivery across the internet.

πŸ’‘Metadata

Metadata in the context of object storage is data that describes other data. It includes information such as who created the file, when it was created, its purpose, file type, and size. The video explains that metadata is crucial for searching, indexing, and retrieving objects within object storage, as it provides the necessary details to manage and understand the stored data.

πŸ’‘Attributes

Attributes in object storage are properties or characteristics of an object that define its behavior or access permissions. The video mentions that attributes might include information like whether certain users are allowed to override, download, or delete an object. These attributes are distinct from metadata, focusing more on the object's permissions and less on the data itself.

πŸ’‘Bucket

In object storage, a bucket is a basic container that holds objects. The video describes buckets as scalable and capable of holding billions of objects. They are virtual constructs that can be used to organize and manage the storage of data objects, with the ability to replicate objects across different physical devices for data integrity and security.

πŸ’‘Replication

Replication in object storage refers to the process of making copies of data and storing them in different physical locations. The purpose, as explained in the video, is to ensure data integrity and security. If one physical device fails, the replicated copies on other devices ensure that the data remains accessible.

πŸ’‘API

API stands for Application Programming Interface, which is a set of rules and protocols for building and interacting with software applications. In the context of the video, object storage is interacted with via an API, allowing users to manage objects programmatically without direct interaction with the storage buckets.

πŸ’‘Cold Storage

Cold storage in the context of data storage refers to the practice of storing data that is infrequently accessed, often for long-term retention. The video discusses how object storage can be used for cold storage, particularly for data that needs to be retained for regulatory or legal reasons, and how it can be cost-effective for such use cases.

πŸ’‘Responsiveness

Responsiveness in the video refers to the speed and performance of data retrieval from object storage. It is a factor that affects the cost of using object storage services, with higher performance levels incurring higher costs. The video explains that the responsiveness needed depends on how often the data is accessed.

πŸ’‘Versioning

Versioning in object storage is the process of keeping multiple versions of an object, allowing users to track changes and revert to previous states if necessary. The video uses the example of file sharing and collaboration across the globe, where different versions of a file are created and stored with updated metadata, facilitating easy collaboration and version control.

πŸ’‘Digital Archiving

Digital archiving is the process of preserving digital materials, such as documents, photographs, and manuscripts, in a digital format. The video mentions object storage as an ideal solution for digital archiving projects, particularly when working with institutions like universities or libraries, due to its scalability, security, and cost-effectiveness for storing and accessing large volumes of data.

Highlights

Object storage is designed for low-performance, low-cost storage suitable for Internet workloads.

Internet workloads include web applications, website hosting, and content delivery across the Internet.

Object storage serves as a modern alternative to tape storage for preserving large volumes of data.

An object in object storage can be any kind of file without restrictions.

Every object requires a unique identifier for retrieval.

Objects must contain actual data, which can be any file type.

Metadata is crucial for understanding the file's details and enabling search and indexing.

Attributes define permissions and access controls for the object.

Objects are stored in 'buckets' which can scale to hold billions of objects.

Buckets are virtual constructs that can be replicated across multiple physical devices for data integrity and security.

Object storage is accessed via an API, not through direct interaction with the storage buckets.

Different pricing tiers exist based on performance needs and access frequency.

Cold storage is the most cost-effective for data that is infrequently accessed.

Object storage is ideal for hosting assets for websites due to its scalability and cost-efficiency.

Replication of objects can be done globally to reduce latency for users in different locations.

Object storage supports versioning for collaborative file sharing across the globe.

It is cost-effective for file sharing compared to traditional methods like email or physical media.

Object storage is suitable for web hosting, regulatory data storage, and digital archiving projects.

IBM Cloud Labs offers free, browser-based interactive Kubernetes labs for skill enhancement.

Transcripts

play00:00

If we think about object storage,

play00:01

which is kind of a new-ish tier,

play00:03

"object storage",

play00:06

the idea is for relatively

play00:08

low performance storage at a

play00:10

relatively low cost

play00:12

that is designed to serve the needs

play00:15

of the Internet workload.

play00:17

And so by Internet workloads, I'm

play00:19

talking about Web applications.

play00:20

I'm talking about website and

play00:22

website hosting and delivering

play00:24

content across the Internet,

play00:26

but also a way to find

play00:29

a new home for all of that data

play00:31

that we used to store on tape.

play00:32

Right. We have to have a way to

play00:34

take big volumes of data,

play00:37

write that down, put it somewhere

play00:38

that is safe, that is secure,

play00:40

and where we can hold on to it for a

play00:42

long time for regulatory reasons,

play00:44

legal reasons and other

play00:47

requirements that make it

play00:49

necessary for us to keep that data

play00:51

for a long amount of time.

play00:52

So let's start off with,

play00:54

"what is an object?" and

play00:56

why do we call it "object storage"?

play00:57

So, the concept

play00:59

in object storage is that you have

play01:02

an object, right.

play01:03

Now, that object

play01:05

can be any kind

play01:06

of file.

play01:07

There aren't really any file

play01:09

restrictions or anything like that.

play01:11

There are some characteristics that

play01:13

are very important.

play01:14

But as a general rule, any

play01:16

kind of file can be an object.

play01:18

So, we'll throw

play01:20

an object in there now, there

play01:22

are 4 essential components

play01:24

to that object that make

play01:26

it usable in the computing sense.

play01:28

The first one is for every

play01:30

object, we have to have an ID,

play01:32

we have to have some sort of a

play01:34

unique identifier that lets

play01:36

us know what this

play01:38

object is when it comes time to

play01:39

retrieve it. That's the first piece.

play01:41

The second piece is, obviously,

play01:43

you've got to have some data.

play01:46

What's the point of having a file

play01:47

and all of this information about it

play01:48

if you don't have the data itself?

play01:50

So this could be an Excel file

play01:52

and could be a video file.

play01:53

It could be an audio file, any kind

play01:55

of file. But you've got to have the

play01:57

actual data itself.

play01:59

The third piece that you

play02:01

have to have is

play02:02

metadata.

play02:04

So what is metadata?

play02:05

Metadata is everything that

play02:07

you need to know about

play02:10

this file and about the

play02:11

data itself.

play02:13

Who created it?

play02:14

When was it created?

play02:15

What is it used for?

play02:16

What is the file type?

play02:18

How large is it?

play02:19

The metadata is what makes it

play02:21

possible to go out and find

play02:23

it, search it, index it

play02:25

so that you can bring it up and use

play02:27

it whenever you need to.

play02:28

Because, again, all that's being

play02:29

displayed out right now is the ID.

play02:31

The ID is not terribly useful.

play02:32

And then the last thing that you

play02:34

have to have are your attributes.

play02:36

Now, attributes are related to

play02:38

metadata. They aren't exactly the

play02:40

same thing, but they're close.

play02:41

Attributes can be, "are certain

play02:43

users are allowed to override

play02:45

it?", "are certain users are allowed

play02:46

to download it?", "are certain users

play02:48

are allowed to delete it?".

play02:49

So the attributes are about the

play02:51

object itself rather than about

play02:53

the data.

play02:54

But again, those are those 4

play02:56

essential components.

play02:57

This is what you use

play02:59

when you actually create something.

play03:01

So you've got your object here,

play03:03

right.

play03:04

And then what do you do with it?

play03:05

Well, in the world of object

play03:07

storage, which is unstructured,

play03:09

you take all of your objects and you

play03:11

drop them into a bucket.

play03:13

So let's do this on this

play03:15

side. So, imagine

play03:17

I have a bucket, and

play03:21

my bucket can have a handle, why

play03:23

not?

play03:24

And so into my bucket, I'm

play03:26

going to put all of these objects

play03:28

and now this is the neat part: these

play03:30

buckets can be as big as you

play03:32

need them to be.

play03:33

They can scale to hold billions

play03:36

of objects.

play03:36

Now, it's not always a good idea to

play03:38

put billions of objects in it, but

play03:40

they can, in fact, scale to that

play03:41

size.

play03:42

And then what's really neat about

play03:44

object storage - because remember,

play03:45

we want to be able to use this for

play03:46

cold storage, too, right, we need to

play03:48

put things in it that can never be

play03:49

deleted - is whenever you,

play03:52

the user, you're going to interact

play03:54

with this thing via an API.

play03:56

Right. So you're not ever going to

play03:57

interact with the bucket

play03:58

individually here.

play04:00

We'll throw you in.

play04:01

There you are.

play04:03

You're using object storage.

play04:04

It's great.

play04:05

So you're going to take your

play04:06

objects. You're going to drop them

play04:07

into this bucket. Now, on the back

play04:10

end, the magic of object

play04:11

storage is your object

play04:14

here. Call it "O".

play04:16

It is going to replicate out of this

play04:18

bucket, the bucket is a virtual

play04:20

construct, but we're going to take

play04:22

it and we are going to say, all

play04:24

right, well, I have

play04:26

3 physically

play04:28

separated devices.

play04:31

And I'm going to take a copy of your

play04:33

object and I'm going to replicate

play04:35

it out into all 3 of them,

play04:37

and my purpose in replicating

play04:39

them out into all 3 of those

play04:41

locations is data integrity,

play04:43

data security.

play04:44

That way, if the physical device

play04:46

here, right, each of these buckets

play04:47

is going to reside on a physical

play04:49

device. So we'll call this P1,

play04:51

P2, and P3.

play04:55

So we're going to replicate it out,

play04:57

your object is going to be on 3

play04:58

physical, different devices, so if

play05:00

there's a hardware problem or a

play05:02

network outage or whatever in P1,

play05:05

that's OK. You've still got copies

play05:06

in P2 and P3, you can still get

play05:08

the data that you need.

play05:10

Now, when we get into

play05:12

the details of object storage, all

play05:14

of a sudden you start saying, man,

play05:16

there's a lot of complexity in this.

play05:17

Right? So not only do I have

play05:20

the object buckets, but the way

play05:22

that it's priced and metered and

play05:24

build is kind of unique.

play05:26

And people are asking me about how

play05:27

many "gets" and how many "puts" and

play05:28

how much storage am I taking up.

play05:31

And the metering and billing around

play05:33

object storage, everyone that sells

play05:35

it has their own different metrics

play05:36

on it. But here's the important

play05:37

parts to consider.

play05:39

The first is what

play05:41

is your responsiveness or your

play05:42

response time?

play05:43

Right. So from a price perspective,

play05:45

the higher performance that you want

play05:47

and this is true for all cloud

play05:48

storage, but the more performance

play05:50

you want, the more it's

play05:52

going to cost.

play05:53

If you have data that you

play05:55

have packaged up and it's data that

play05:57

for 20 years your company has been

play05:58

taking and writing it to tape,

play06:00

sending that tape to vaults and

play06:02

storing it in vaults where it will

play06:03

probably never be seen again.

play06:05

That can go to the very coldest of

play06:07

cold tiers, right.

play06:08

And that is going to be the least

play06:09

expensive.

play06:11

And so we're going to call that

play06:12

"cold times 10".

play06:15

That is the coldest of cold tiers.

play06:18

Up from that, where you need

play06:20

to get to it every so often, but you

play06:22

don't really need to get to it much,

play06:24

that's going to be a two dollar sign

play06:25

one, right? We're going to charge a

play06:27

little bit more for that.

play06:28

We're just going to call that

play06:29

"cold".

play06:30

Maybe this is data that you need to

play06:31

get to once every six months,

play06:33

something like that.

play06:35

But maybe you're earning a website

play06:37

and you are using these object

play06:38

storage buckets to host all of the

play06:40

assets for your website.

play06:42

Obviously that is going to cost

play06:44

a little bit more, right, because

play06:45

you are sending data out on a

play06:47

regular basis.

play06:49

So that's going to get our three

play06:50

dollar signs.

play06:52

And I'm going to call this

play06:54

one called "cold-ish".

play06:56

It's not really warm because

play06:58

the concept of a warm here doesn't

play07:00

really exist an object storage, but

play07:02

it's cold-ish.

play07:03

And so this is

play07:05

videos, it is pictures,

play07:08

it is physical files,

play07:10

and what makes object storage really

play07:12

neat is on the back end through

play07:14

the magic - because remember, you,

play07:15

you're interacting via an API,

play07:16

you're not interacting with these

play07:18

things directly - you

play07:20

can do all kinds of nifty

play07:22

front-end things for it.

play07:24

And that's the really neat part of

play07:25

object storage and that's the

play07:27

benefits of it.

play07:28

So let's talk about some of them,

play07:29

right?

play07:30

Hosting videos for streaming over

play07:31

the Internet.

play07:32

Perfect.

play07:33

Perfect use case for object storage,

play07:35

because like I said before, we can

play07:37

replicate these things out.

play07:38

Right. So for your video streaming,

play07:41

rather than just replicating it into

play07:42

three buckets that are all in the

play07:43

same place, let's replicate it

play07:45

out globally. Let's replicate it out

play07:47

so that we are hosting those videos

play07:49

not just in their primary location,

play07:51

but in three different locations

play07:53

in seven different countries around

play07:55

the world, decreasing the latency

play07:57

for your users so that their

play07:58

streaming experience is better.

play08:00

Let's talk about another good use

play08:01

case: file

play08:03

sharing. So let's imagine

play08:06

a company where you're collaborating

play08:07

on spreadsheets and PowerPoint, and

play08:09

things like that, and you've got

play08:10

collaborators that work all

play08:12

over the world.

play08:13

Well, for versioning

play08:15

purposes, you can have

play08:17

different versions that overwrite

play08:18

the previous version as long as

play08:20

those attributes allow for it.

play08:22

And so if I've got my

play08:24

friend Eric, who is working

play08:26

in London, can create a file,

play08:28

I can get a copy of that file,

play08:30

make my changes to it, upload it

play08:32

back in. And I've got version two.

play08:34

That's the metadata.

play08:35

And then I can have a colleague

play08:38

in Singapore, Sam, who downloads

play08:40

that file, make some more changes to

play08:41

it, uploads it back in version

play08:43

three, again, incrementing the

play08:44

metadata.

play08:45

But that allows for this Cross Globe

play08:47

collaboration on a single file type,

play08:50

and it makes everybody's lives

play08:51

easier. But we're doing it at a very

play08:53

low cost, much less expensive than

play08:55

trying to send files

play08:57

around via email because emails have

play08:59

to get stored in a much more

play09:00

expensive storage tier, much

play09:02

less expensive than FedExing thumb

play09:04

drives around.

play09:05

And again, being able to be done

play09:07

in a secure manner because we have

play09:09

control over who can see and access

play09:11

these objects.

play09:13

Other Web hosting, obviously, is

play09:15

going to be important. We talked

play09:16

about regulatory data before, the

play09:18

regulatory data and cold storage.

play09:21

Let's say that you involved in a

play09:23

digital archiving project

play09:25

and you're working with a university

play09:27

or a library and you're taking

play09:29

digital photographs of manuscript

play09:31

documents. And you want to host

play09:32

those out for everyone in the world

play09:34

to be able to see.

play09:35

Object storage, perfect place to

play09:37

store that kind of data - because

play09:39

again, it's not getting accessed

play09:40

very often.

play09:41

It is write

play09:43

once and then read many,

play09:45

many, many times and you can host

play09:47

it in a secure manner.

play09:48

So hopefully you've enjoyed this.

play09:50

This has been a good overview of

play09:52

object storage.

play09:53

As always, if you have any

play09:54

questions, please feel free to leave

play09:55

them in the comments. I'll do my

play09:56

best to answer, or some of my

play09:58

colleagues at IBM will.

play09:59

And if you have any other questions,

play10:00

just let us know.

play10:03

Thank you so much for your time

play10:04

today. If you have any questions,

play10:06

please drop us a line below.

play10:08

If you want to see more videos like

play10:10

this in the future, please do like

play10:11

and subscribe and let us know.

play10:14

And don't forget, you can grow your

play10:15

skills and earn a badge with IBM

play10:17

Cloud Labs, which are free

play10:19

browser-based interactive Kubernetes

play10:21

labs that you can find more

play10:23

information on by looking below.

play10:25

Thanks again.

Rate This
β˜…
β˜…
β˜…
β˜…
β˜…

5.0 / 5 (0 votes)

Related Tags
Object StorageCloud StorageData IntegrityInternet WorkloadsWeb HostingAPI InteractionData ReplicationCold StorageFile SharingDigital Archiving