Queueing theory (simple)

Liz Thompson
11 Nov 202008:37

Summary

TLDRIn this video, Liz Thompson introduces queuing theory, a fundamental concept in industrial engineering and operations research. She explains the basics of queuing theory, focusing on a single server, single line system with a first-in, first-out approach. Key terms like arrival rate (lambda) and service rate (mu) are defined, along with their exponential distribution. Thompson demonstrates how to calculate capacity utilization, the number of items in the system, and waiting time using simple equations derived from complex mathematical models. She concludes with a practical example of applying queuing theory to a milling machine scenario, showing how to determine the machine's capacity utilization and the space needed for a waiting area.

Takeaways

  • 📚 Queuing theory is a mathematical approach used in industrial engineering and operations research to model and analyze the management of queues.
  • 🔄 It involves the study of items, people, or parts that need to be processed by a server, which performs work that takes time.
  • 📈 The theory includes concepts like arrival rates (lambda) and service rates (mu), which are used to describe the flow and processing of items in a queue.
  • 📉 Queuing systems can be configured in various ways, such as single-server, multiple-server, first-in-first-out (FIFO), or last-in-first-out (LIFO).
  • 📊 Queuing theory uses equations to describe system performance, including capacity utilization, number of items in the system, and time in the system.
  • 🔢 The capacity utilization (ρ) is calculated as the arrival rate (λ) divided by the service rate (μ), indicating how busy the server is.
  • 📏 The number of items in the system is determined by the formula (λ / μ) / (1 - λ / μ), which accounts for both waiting and processing items.
  • ⏱ The waiting time in the system is calculated as the number of items in the system divided by the arrival rate (λ), providing insight into the efficiency of the process.
  • 🏭 An example provided in the script involves parts arriving at a milling machine, with calculations demonstrating how queuing theory can be applied to real-world industrial scenarios.
  • 📐 The script also discusses the practical application of queuing theory in facilities design, such as determining the necessary waiting area size based on the number of items and their footprint.

Q & A

  • What is queuing theory?

    -Queuing theory is a branch of mathematics that deals with the study of waiting lines, or queues, and is used in industrial engineering and operations research to analyze and optimize service systems.

  • What is the basic concept of queuing theory?

    -The basic concept of queuing theory involves items, people, or parts that need to be processed by a server, which performs work that takes time. These items form a line, or queue, and are served by the server one at a time.

  • What is the difference between arrival rate (lambda) and service rate (mu) in queuing theory?

    -The arrival rate (lambda) is the expected number of arrivals per unit of time, while the service rate (mu) is the average number of items that can be served per unit of time. Lambda is associated with the input rate to the system, and mu with the output rate.

  • What does a 'first in, first out' (FIFO) queue mean?

    -A 'first in, first out' (FIFO) queue means that the order in which items arrive is the same order in which they are served, ensuring that the first item to arrive is the first to be processed.

  • What is the significance of the capacity utilization (rho) in a queuing system?

    -Capacity utilization (rho) is the ratio of the arrival rate (lambda) to the service rate (mu), indicating the proportion of time the server is busy. It helps to understand how efficiently the system is being used.

  • How is the number of items in a queuing system calculated?

    -The number of items in a queuing system is calculated using the formula L = (lambda / (1 - rho)), where L is the average number of items in the system, lambda is the arrival rate, and rho is the capacity utilization.

  • What does the waiting time in a queuing system represent?

    -The waiting time in a queuing system represents the average time an item spends in the queue before being served, which can be calculated as W = L / lambda, where W is the waiting time, L is the number of items in the system, and lambda is the arrival rate.

  • Why is it important to understand the distribution of arrival and service times in queuing theory?

    -Understanding the distribution of arrival and service times is important because it allows for more accurate predictions of system performance, such as queue length and waiting times, and helps in making informed decisions about system design and resource allocation.

  • How can queuing theory be applied in an industrial setting?

    -Queuing theory can be applied in industrial settings to optimize production lines, manage customer service queues, and improve the efficiency of resource allocation, leading to better utilization of machinery and personnel.

  • What is an example of how queuing theory can be used to calculate space requirements for a waiting area?

    -In the provided example, queuing theory is used to calculate the space needed for a waiting area by determining the average number of items waiting and multiplying it by the space required per item, which in this case was 4.9 items times 2 square feet per item, resulting in a 10 square feet area.

Outlines

00:00

📚 Introduction to Queuing Theory

The video introduces queuing theory, a fundamental concept in industrial engineering and operations research. Liz Thompson explains the basics of queuing theory, which involves items or people waiting in a line to be processed by a server. The video aims to simplify the concept and demonstrate its application through an example. Queuing theory is visualized as items forming a line (queue) and being processed by a server. The video emphasizes the importance of defining the type of queue, such as a single-server, single-line, first-in-first-out (FIFO) system. Key parameters like lambda (arrival rate) and mu (service rate) are introduced, with lambda representing the expected time between arrivals and mu indicating the service rate. These rates are assumed to follow an exponential distribution. The video promises to show how these parameters can be used to analyze the system's capacity, utilization, and other metrics using simple equations derived from complex mathematical models.

05:01

🔍 Queuing Theory Application Example

In the second paragraph, the video script delves into a practical example of applying queuing theory. The scenario involves parts arriving at a milling machine, with an arrival rate of 100 parts per hour, following a Poisson process. The service time, or the time taken to process each part, is exponentially distributed with an average of 30 seconds. The video calculates the capacity utilization of the machine, which is the proportion of time the machine is in use, using the formula lambda over mu. The result is 83%, indicating a high utilization rate. Further calculations are made to determine the number of items in the system, both waiting and being processed, which is approximately 4.9 items. The waiting time in the system for each part is also calculated to be about three minutes. Finally, the video addresses the space required for the waiting area, given the footprint of each part and the number of parts expected to be waiting. The example illustrates how queuing theory can be used to analyze and optimize processes in industrial settings.

Mindmap

Keywords

💡Queuing Theory

Queuing Theory is a branch of operations research that deals with the analysis of waiting lines or queues. It is used to optimize service delivery by predicting the number of customers and their waiting times. In the video, Liz Thompson uses queuing theory to explain how items or people wait in line to be processed by a server, which is a fundamental concept in industrial engineering and operations management.

💡Server

In the context of queuing theory, a server refers to the entity that provides service to the items in the queue. It could be a person, a machine, or a system that processes the queue's elements. The video script describes a scenario where a milling machine acts as a server, processing parts that arrive at a certain rate.

💡Queue

A queue in queuing theory is the line or sequence of items or customers waiting to be served. The video script uses the term to describe the line that forms as items approach the server for processing. It's a key element in the analysis of service systems, as it helps determine the efficiency and waiting times.

💡First In First Out (FIFO)

FIFO is a scheduling discipline where the first item or customer in the queue is the first to be served. This principle is essential in maintaining order and fairness in service systems. The video mentions a one-server, one-line FIFO system as an example of a queuing configuration.

💡Arrival Rate (Lambda)

The arrival rate, denoted by lambda (λ), is the average number of items arriving per unit of time. It's a critical parameter in queuing theory as it influences the length of the queue and waiting times. In the video, the arrival rate of parts to the milling machine is given as 100 per hour, illustrating how this rate is calculated and used in analysis.

💡Service Rate (Mu)

The service rate, denoted by mu (μ), is the average number of items that can be served per unit of time. It's a measure of the server's capacity to process items. The video explains how to calculate the service rate based on the time it takes to process an item, which in the example is 30 seconds per part.

💡Exponential Distribution

The exponential distribution is a probability distribution that models the time between events in a Poisson process, such as arrivals in a queue. It's used in queuing theory to describe the variability in inter-arrival and service times. The video script mentions that both arrival and service times are exponentially distributed.

💡Capacity Utilization

Capacity utilization is the ratio of the arrival rate to the service rate, expressed as a percentage. It indicates how efficiently a server is being used. The video calculates capacity utilization for the milling machine example, showing that the machine is running at 83% of its capacity.

💡Waiting Time

Waiting time in queuing theory refers to the time an item spends in the queue before being served. It's a crucial measure of system performance and customer satisfaction. The video calculates the expected waiting time for parts in the system, which is approximately three minutes.

💡Poisson Process

A Poisson process is a statistical process that models the number of events occurring in a fixed interval of time or space. In queuing theory, it's often used to model the random arrivals of customers or items. The video mentions that the arrival of parts to the milling machine follows a Poisson process.

💡Facilities Design

Facilities design involves planning and designing physical spaces to optimize workflow and efficiency. The video uses queuing theory to inform facilities design, such as calculating the space needed for a waiting area based on the expected number of items in the queue.

Highlights

Queuing theory is a fundamental concept in industrial engineering and operations research.

It involves the study of items, people, or parts that need to be processed by a server.

The system is defined by a queue and a server, excluding items in transit to the system.

A one-server, one-line, first-in-first-out (FIFO) queue is a common type of queuing system.

Queuing theory uses mathematical models to analyze and predict system performance.

Lambda (λ) represents the arrival rate, and Mu (μ) represents the service rate in the system.

Arrival and service rates are often modeled using exponential distributions.

Capacity utilization is calculated as the ratio of lambda to mu (ρ = λ/μ).

The number of items in the system is given by the formula (λ/μ) / (1 - λ/μ).

The waiting time in the system can be determined by the formula (λ/μ) / λ.

Queuing theory can be applied to analyze real-world scenarios, such as parts arriving at a milling machine.

The arrival rate is determined by the number of parts per hour following a Poisson process.

Service rate is calculated by converting the service time per part into units per hour.

The example demonstrates calculating capacity utilization, number of items in the system, and waiting time.

Space allocation for the waiting area can be determined based on the number of items and their footprint.

Queuing theory provides insights into system efficiency and resource allocation.

The mathematical equations derived from queuing theory are complex but provide valuable system insights.

Transcripts

play00:00

hi my name is liz thompson and this is a

play00:03

quick video on an introduction to

play00:05

queuing theory

play00:06

um sort of the basics of using math in

play00:09

industrial engineering

play00:10

queuing theory is also a part of

play00:13

operations research

play00:15

so i'm going to explain what cueing

play00:17

theory is in the simplest form and the

play00:20

simplest

play00:21

aspects of it i'm also going to then

play00:23

work through an example

play00:25

and show how it might be used in

play00:27

analysis

play00:28

so cuic theory is the concept of there's

play00:32

items people or parts or

play00:35

anything that needs to be processed by a

play00:38

server

play00:39

who does some work that takes some time

play00:43

on the items or with the people

play00:46

so the way we usually conceptualize this

play00:49

is

play00:50

these the three dots on the left is what

play00:52

i'm saying are

play00:53

the items that need to be served they

play00:56

start to move towards the server and

play00:58

form a line and then as that as the

play01:01

server is available

play01:03

um the process one person in the line or

play01:06

one item in the line

play01:07

is served by the server and then exits

play01:09

the system

play01:10

so we call this um line a cue and that's

play01:14

what we call

play01:15

um queuing theory and then the whole

play01:18

system is we define as the line

play01:22

and the server so we don't include the

play01:24

items that are traveling to

play01:26

the systems we draw the system this way

play01:29

one of the important things about

play01:30

queuing theory

play01:31

is to really define the type of queue

play01:34

you can imagine there's

play01:35

all kinds of different types of queue

play01:37

this particular type of queue is a one

play01:39

server

play01:40

one line first in first out so that

play01:43

means

play01:43

the first items to get in the line are

play01:47

the next ones that are going to be

play01:49

processed so we travel through the line

play01:51

this way

play01:52

now there's all kinds of other types of

play01:55

configurations and

play01:56

for queuing theory there could be

play01:58

multiple server

play01:59

one line first in first out there could

play02:02

be

play02:02

um multiple server one line last in

play02:06

first

play02:06

out so instead of going to the end of

play02:08

the line the the next item goes to the

play02:10

front of the line

play02:11

now all of those indicate a different

play02:14

kinds of calculation

play02:15

but the calculations i'm going to talk

play02:17

about right now are a one

play02:19

one server one line last in first out

play02:22

type

play02:22

q and in this particular orientation

play02:25

what we do is we define

play02:27

a lamba and a mu which is the lambda

play02:31

is the arrival rates and the arrival

play02:33

rates what we

play02:34

say is lambda is an expected time of a

play02:37

arrival a arrival per hour

play02:41

and that but these vary there's a

play02:43

distribution associated with it and we

play02:45

define the distribution as

play02:46

exponential so that looks sort of like

play02:49

this where we have a lambda that is

play02:51

the the expected value but it's

play02:54

distributed over time

play02:56

mu is an indication of service rate and

play02:59

that's

play03:00

there's also an expected value and that

play03:02

varies over time

play03:03

so we can think about these things as

play03:06

distributions like

play03:07

sometimes um you there's two people that

play03:10

arrive together

play03:12

and then sometimes there's like 10

play03:13

minutes until the next person arrives

play03:16

and then in the service rate sometimes

play03:18

the service takes

play03:19

10 seconds and sometimes it takes three

play03:22

minutes so there's a variety of

play03:24

distribution of the service rate so what

play03:27

we do is we have these lambda and mu

play03:30

where the arrivals are expressed in

play03:32

items per unit time

play03:34

and the service is also expressed in

play03:35

items per unit time

play03:37

so the input in a queuing system is the

play03:40

queuing type which

play03:41

in this case we have a one line one

play03:43

server

play03:44

first and first out and then we also

play03:47

have an arrival rate

play03:48

and we have a service rate and that's

play03:49

kind of all we need

play03:51

and then through some um really kind of

play03:54

neat math

play03:55

that you will understand if you actually

play03:58

study this in depth we can actually

play04:00

describe things about the system in

play04:02

pretty simple equations

play04:04

and examples of the ways that we can

play04:06

describe the system and there's actually

play04:07

a lot of

play04:08

ways to describe the system but examples

play04:11

are we can talk about the capacity

play04:13

utilization we can talk about the number

play04:15

of people in the system

play04:16

we can talk about the time that that

play04:19

people or

play04:20

items we can talk about the time in the

play04:22

system so

play04:23

simple equations like the capacity

play04:25

utilization is lambda over mu pretty

play04:28

simple equation

play04:29

the number of people or number of items

play04:31

in the system

play04:32

is a row or the capacitor utilization

play04:34

divided by one minus row

play04:36

and then the um waiting time in the

play04:38

system is

play04:40

the the number of people in this system

play04:42

divided by lambda

play04:43

so we have these pretty simple equations

play04:45

that are actually

play04:46

complex to derive but they're based on

play04:49

distribution of arrival rates

play04:51

and um service time so it's really kind

play04:53

of cool that you can

play04:55

describe the system in this way so i'm

play04:57

going to go through a really quick

play04:58

example of this

play04:59

well not maybe not maybe really quick

play05:01

i'm going to go through an example

play05:03

so in this example it's um parts are

play05:05

being arriving to a mill a milling

play05:07

machine

play05:08

and they're going to be processed and

play05:10

they arrive at a rate of a hundred per

play05:13

hour

play05:14

this arrival follows a poisson process

play05:17

and the processing time for the items on

play05:19

the mill have an exp

play05:21

expected um time of processing of 30

play05:24

seconds and the processing times are

play05:26

exponentially distributed so the first

play05:29

question is what's the capacity

play05:30

utilization of the machine

play05:32

that means how much of the time the

play05:33

machine is being used

play05:35

and then what is what are the number of

play05:37

items in the system

play05:38

both waiting and processing and what is

play05:41

the time in the system for each part

play05:43

like how long are the items going to be

play05:44

waiting there

play05:46

and if each part is large with a

play05:47

footprint of two square feet how much

play05:49

space should you assign to the waiting

play05:51

area

play05:51

so these are some of the questions we

play05:53

can get from this simple

play05:55

cueing problem so in this queuing

play05:57

problem

play05:58

we initially can just establish lamba

play06:01

which is the arrival rate as a hundred

play06:03

units per hour

play06:05

the mu is a little bit harder because

play06:07

what we see is it takes 30 seconds to

play06:09

process the part that's the service time

play06:12

and we have to make sure that the mu is

play06:14

actually in

play06:15

units per hour and the 30 seconds per

play06:18

part is actually seconds

play06:20

per part which is not the right units so

play06:22

we have to take into account that

play06:24

there's 60 seconds in a minute and 60

play06:26

minutes in an hour

play06:27

so matt that means it takes .0083

play06:31

hours to process a unit but still that's

play06:33

not quite the right units for mu

play06:35

the right units per mu is one unit per

play06:38

0.0083 hours

play06:40

which means 100 uni 120 units per hour

play06:43

so that's our lambda values that we got

play06:46

so now we can do the calculation i mean

play06:48

that's the mu

play06:49

value so now we can do the calculation

play06:51

of capacity utilization

play06:52

by taking just lambda over mu for rho

play06:56

and we get 100 over 120 which gives us

play06:59

83

play07:00

or 0.83 as the capacity utilization that

play07:04

means the machine is running 83

play07:06

of the time that's pretty good i mean

play07:08

that's a pretty good utilization

play07:10

um so the next thing we then can do

play07:13

is we can calculate out the um the

play07:16

length or the number of items in the

play07:18

system so the system is both

play07:20

weighting and processing and that's

play07:22

going to be row

play07:23

divided by 1 minus row and it and so

play07:26

that calculation says there's going to

play07:28

be 4.9

play07:29

items in the system so about five units

play07:31

are going to be waiting there'll be four

play07:33

units waiting

play07:34

and then there'll be um one being

play07:36

processed

play07:37

um and then the next thing we can

play07:39

calculate is the

play07:41

um the waiting and this is the time

play07:43

waiting in the system

play07:45

so that's l of s divided by lambda and

play07:48

so that's 4.9 divided by 100

play07:50

or 0.049 hours or about three minutes

play07:53

in the system and that gives us an idea

play07:56

of how quickly things are being

play07:57

processed

play07:59

the next thing we might want to do is

play08:00

calculate what's the area

play08:02

that we should be waiting and i assumed

play08:04

there would be 4.9 units waiting

play08:07

but actually one of those is probably on

play08:09

the machine so it would

play08:10

maybe be 3.9 but either way we make that

play08:13

assumption that 4.9 times 2

play08:15

feet 2 square feet means that we need

play08:17

about 10

play08:18

square feet available so we would if we

play08:21

were doing a facilities design we would

play08:22

allocate a space

play08:24

that's about that that is about 10

play08:26

square feet now i hope that gives you an

play08:28

idea of

play08:29

some of the ways that we can use

play08:32

queuing theory to get some some good

play08:35

analysis going

Rate This

5.0 / 5 (0 votes)

الوسوم ذات الصلة
Queuing TheoryIndustrial EngineeringOperations ResearchMath in EngineeringFirst In First OutLambda MuArrival RateService RateCapacity UtilizationQueuing AnalysisProcess Efficiency
هل تحتاج إلى تلخيص باللغة الإنجليزية؟