In order to analyze the relationship between diversion status and other factors within the

Similar documents
Root Cause Analysis of Emergency Department Crowding and Ambulance Diversion in Massachusetts

Emergency-Departments Simulation in Support of Service-Engineering: Staffing, Design, and Real-Time Tracking

QUEUING THEORY APPLIED IN HEALTHCARE

Models and Insights for Hospital Inpatient Operations: Time-of-Day Congestion for ED Patients Awaiting Beds *

AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE

Optimizing the planning of the one day treatment facility of the VUmc

STUDY OF PATIENT WAITING TIME AT EMERGENCY DEPARTMENT OF A TERTIARY CARE HOSPITAL IN INDIA

The impact of size and occupancy of hospital on the extent of ambulance diversion: Theory and evidence

Proceedings of the 2010 Winter Simulation Conference B. Johansson, S. Jain, J. Montoya-Torres, J. Hugan, and E. Yücesan, eds.

Hospital Patient Flow Capacity Planning Simulation Model at Vancouver Coastal Health

How to deal with Emergency at the Operating Room

CAPACITY PLANNING AND MANAGEMENT IN HOSPITALS

A Queueing Model for Nurse Staffing

Using Computer Simulation to Study Hospital Admission and Discharge Processes

Department of Mathematics, Sacred Heart College, Vellore Dt 3

Balancing Supply and Demand in Health and Social Care Organisations

BRIGHAM AND WOMEN S EMERGENCY DEPARTMENT OBSERVATION UNIT PROCESS IMPROVEMENT

A QUEUING-BASE STATISTICAL APPROXIMATION OF HOSPITAL EMERGENCY DEPARTMENT BOARDING

Homework No. 2: Capacity Analysis. Little s Law.

Comparison of the Performance of Inpatient Care for Chemotherapy Patients in RSUP Dr. Hasan Sadikin Bandung West Java Using Queuing Theory

Risk themes from ATAM data: preliminary results

Identifying step-down bed needs to improve ICU capacity and costs

LV Prasad Eye Institute Final Presentation

A Mixed Integer Programming Approach for. Allocating Operating Room Capacity

STOCHASTIC MODELING AND DECISION MAKING IN TWO HEALTHCARE APPLICATIONS: INPATIENT FLOW MANAGEMENT AND INFLUENZA PANDEMICS

Southwest Texas Regional Advisory Council

Guided Study Program in System Dynamics System Dynamics in Education Project System Dynamics Group MIT Sloan School of Management 1

APPLICATION OF SIMULATION MODELING FOR STREAMLINING OPERATIONS IN HOSPITAL EMERGENCY DEPARTMENTS

HOW TO USE THE WARMBATHS NURSING OPTIMIZATION MODEL

Simulation of 48-Hour Queue Dynamics for A Semi-Private Hospital Ward Considering Blocked Beds

Queueing Theory and Ideal Hospital Occupancy

Proceedings of the 2005 Systems and Information Engineering Design Symposium Ellen J. Bass, ed.

Care on demand in nursing homes: a queueing theoretic approach

SSF Call for Proposals: Framework Grants for Research on. Big Data and Computational Science

Final Report. Karen Keast Director of Clinical Operations. Jacquelynn Lapinski Senior Management Engineer

Applied Simulation Model for Design of Improving Medical Record Area in Out-Patient Department (OPD) of a Governmental Hospital

ModelingHospitalTriageQueuingSystem. Modeling Hospital Triage Queuing System. By Ibrahim Bedane Maddawalabu University

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds.

Models for Bed Occupancy Management of a Hospital in Singapore

The Pennsylvania State University. The Graduate School ROBUST DESIGN USING LOSS FUNCTION WITH MULTIPLE OBJECTIVES

Matching Capacity and Demand:

ESSAYS ON EFFICIENCY IN SERVICE OPERATIONS: APPLICATIONS IN HEALTH CARE

Nursing Manpower Allocation in Hospitals

Scenario Planning: Optimizing your inpatient capacity glide path in an age of uncertainty

3M Health Information Systems. 3M Clinical Risk Groups: Measuring risk, managing care

High Risk Operations in Healthcare

Inpatient Bed Need Planning-- Back to the Future?

An Empirical Study of the Spillover Effects of Workload on Patient Length of Stay

A SIMULATION MODEL FOR BIOTERRORISM PREPAREDNESS IN AN EMERGENCY ROOM. Lisa Patvivatsiri

Hospital Inpatient Quality Reporting (IQR) Program

Applying queueing theory to the study of emergency department operations: a survey and a discussion of comparable simulation studies

Queueing Model for Medical Centers (A Case Study of Shehu Muhammad Kangiwa Medical Centre, Kaduna Polytechnic)

available at journal homepage:

Measuring healthcare service quality in a private hospital in a developing country by tools of Victorian patient satisfaction monitor

Applying Critical ED Improvement Principles Jody Crane, MD, MBA Kevin Nolan, MStat, MA

A Mixed Integer Programming Approach for. Allocating Operating Room Capacity

Engaging Students Using Mastery Level Assignments Leads To Positive Student Outcomes

Repeater Patterns on NCLEX using CAT versus. Jerry L. Gorham. The Chauncey Group International. Brian D. Bontempo

Comparative Study of Waiting and Service Costs of Single and Multiple Server System: A Case Study on an Outpatient Department

Offshoring and Social Exchange

Emergency Medicine Programme

Surgery Scheduling with Recovery Resources

Make the most of your resources with our simulation-based decision tools

Unscheduled care Urgent and Emergency Care

Running Head: READINESS FOR DISCHARGE

ew methods for forecasting bed requirements, admissions, GP referrals and associated growth

Tackling Patient Wait-Times at LVPEI with Systems Thinking. Ali Kamil, SDM 12, HKS 14 Dmitriy Lyan, SDM 11, Sr. Product Manager, Amazon

RESEARCH METHODOLOGY

time to replace adjusted discharges

Nursing Theory Critique

Waiting Patiently. An analysis of the performance aspects of outpatient scheduling in health care institutes

Improving operational effectiveness of tactical master plans for emergency and elective patients under stochastic demand and capacitated resources

HEALTH WORKFORCE SUPPLY AND REQUIREMENTS PROJECTION MODELS. World Health Organization Div. of Health Systems 1211 Geneva 27, Switzerland

Energy Trust Background

Using discrete event simulation to improve the patient care process in the emergency department of a rural Kentucky hospital.

How Allina Saved $13 Million By Optimizing Length of Stay

SIMULATION-BASED MODELS OF EMERGENCY DEPARTMENTS: REAL-TIME CONTROL, OPERATIONS PLANNING AND SCENARIO ANALYSIS

Factorial Design Quantifies Effects of Hand Hygiene and Nurse-to-Patient Ratio on MRSA Acquisition

MODELING THE EFFECT OF RESIDENT LEARNING CURVE IN THE EMERGENCY DEPARTMENT ROBERT MICHAEL RICHARDS. B.S., Kansas State University, 2010 A THESIS

DISTRICT BASED NORMATIVE COSTING MODEL

An Application of Factorial Design to Compare the Relative Effectiveness of Hospital Infection Control Measures

Maine Nursing Forecaster

LWOT Problem Tool. Quotes Surge Scenarios LWOT. Jeffery K. Cochran, PhD James R. Broyles, BSE

Measuring the relationship between ICT use and income inequality in Chile

Guidelines for Development and Reimbursement of Originating Site Fees for Maryland s Telepsychiatry Program

Henry Ford Hospital Inpatient Predictive Model

Dimensioning hospital wards using the Erlang loss model

The Effect of Emergency Department Crowding on Paramedic Ambulance Availability

DIVISION OF EMERGENCY MEDICINE DEPARTMENT OF ACUTE MEDICINE

NHS Greater Glasgow and Clyde Alison Noonan

Determining Like Hospitals for Benchmarking Paper #2778

Patients Experience of Emergency Admission and Discharge Seven Days a Week

Improving Patient s Satisfaction at Urgent Care Clinics by Using Simulation-based Risk Analysis and Quality Improvement

LESSON ELEVEN. Nursing Research and Evidence-Based Practice

Report on Feasibility, Costs, and Potential Benefits of Scaling the Military Acuity Model

Dynamic optimization of chemotherapy outpatient scheduling with uncertainty

TC911 SERVICE COORDINATION PROGRAM

A Game-Theoretic Approach to Optimizing Behaviors in Acquisition

Data-Driven Patient Scheduling in Emergency Departments: A Hybrid Robust Stochastic Approach

Barriers & Incentives to Obtaining a Bachelor of Science Degree in Nursing

Transcription:

Root Cause Analysis of Emergency Department Crowding and Ambulance Diversion in Massachusetts A report submitted by the Boston University Program for the Management of Variability in Health Care Delivery under a grant from the Massachusetts Department of Public Health October, 22 Emergency Room Diversion Study: Analysis and Findings Phase I Phase I of these investigations involved formulation of a conceptual model that would permit data collection and analysis germane to the problem of ambulance diversion. As preparation for this study, a wide range of relevant medical publications, policy statements and commissioned studies were reviewed. This was followed by personal interviews with representatives in government, hospital administration, public health and the Emergency Medicine community. Information was gathered from throughout Massachusetts and from other key states. Particular attention was given to experience in areas where crowding is particularly severe including metropolitan Boston, San Francisco, Los Angeles and the states of Arizona and Florida. Overall, numerous potential root causes of diversion had been articulated both in the medical literature and lay press, but empirical data to support them were lacking. Available research tended to be descriptive, documenting the extent of crowding without clear delineation of its sources. Various solutions had been proposed and implemented, all without consistent benefit. A partial summary of this analysis has been previously released by the Massachusetts Health Policy Forum of Brandeis University. An operations management perspective suggested straightforward input-throughputoutput analysis. Hospital utilization data provided by the Division of Health Care Finance and Policy was therefore reviewed alongside diversion data provided by regional EMS providers. Analysis of this information revealed the likely operation of mechanisms both internal and external to emergency departments. In addition to simple supply/demand imbalances for emergency care, diversion and utilization patterns suggested bottlenecks and backlogs related to the competition of emergency and non-emergency patients for similar resources. The interrelationships of hospital services then became the focus of attention and patient care pathways were explored with administrators from the two study hospitals. Two paradigms for the quantitative study of interrelationships among hospital departments were considered. The first involved an analytical approach wherein each relationship was identified, its stochastic character estimated, and appropriate Page of 29 mathematical models applied. The second involved a simulation approach, wherein stochastic relationships were embedded into computer software that translated real patient flow inputs into utilization and capacity information. Computer simulation was ultimately selected as the route of choice because of its scalability and adaptability. Phase II Data Collection/Analysis Effort: The study was performed at two hospitals in Massachusetts: Hospital A, a large tertiary academic hospital, and Hospital B, a medium-sized acute care community hospital. The following data were collected: 42 days of information covering: 6+ admissions 8+ ED visits 2+ staffing/capacity data points 3,+ patient movement/status data points In order to analyze the relationship between diversion status and other factors within the hospital environment all measures were split into observations at one hour increments. The study period of 42 days, with 24 hours per day, yielded a total of 8 full sets of observations. The analysis required collection of patient flow data well beyond the usual capabilities of contemporary hospital information systems. Point-biserial coefficients of correlation, with diversion status as the binary variable, were examined against a variety of factors. Comparisons when using full hours of diversion versus partial hours as the true condition did not reveal significant differences, so partial diversion hours were evaluated as the true binary throughout the analysis for the sake of consistency. It is important to note that in the real world the decisions to commence or cease diversion status are, but their nature, highly subjective. Because the purpose of the study was to examine the root causes of diversion, we did not approach the task from the standpoint of critiquing or attempting to influence this inherent operational subjectivity. As a result, any such analysis is itself subjective to a certain degree. Because both hospitals straddled EMS regional borders and diversion rules vary by region, each hospital s data was used for the sake of determining diversion status rather than using centralized EMS data. Also, all diversions were considered equally rather than separately analyzing the factors related to each individual diversion type. Patterns of diversion were also examined as averages across the hours of the day and the days of the week in order to ascertain relevant hour of the day and day of the week patterns. This data analysis was performed separately for each of the hospitals. Page 2 of 29

Hospital A: Diversion Pattern Hospital A - Diversion Minutes by Hour There were a total of 22 episodes of diversion which started and ended within the study, with an average length of 84 minutes. There was one episode that began prior to the study and ended after the study began and so is not included in this calculation, nor in any calculations which involve the beginning of diversion episodes. The hourly diversion pattern shows diversion is highest in the evening hours, settles back down during the early morning hours, and then stays steady until the mid to late afternoon (see Fig. ). The goal of the project was to determine the drivers which create this pattern. 3 Hospital A - Avg Diversion Minutes by Hour 25 2 5 5 Divert Minutes Fig. The following 3 hypotheses were tested to determine the drivers of diversions:. ED arrival rate is too high, leading to diversion when the ED becomes full. 2. ED processing of patients is too slow, causing backups that lead to diversion 3. ED arrival and processing rates are fine, but there are not enough beds in the hospital to accommodate the admissions. Page 3 of 29 There are seven sets of data (see Fig. 2), each representing a different view of arrivals into the ED. The "Arrivals_" category only includes new arrivals from the hour in question. Each subsequent category, from "Arrivals_" to "Arrivals_6" includes one more hour s worth added to the total. In other words, "Arrivals_" includes arrivals from the current hour added to the arrivals from the previous hour, "Arrivals_2" includes all of "Arrivals_" plus the arrivals from two hours ago, and so on. This is what accounts for the stacked shape as each additional hour is layered on top. Because average length of stay was 34 minutes, 6 hours is used as the maximum lag. Correlation coefficients from each of these cumulatives to Avg Diversion Minutes by hour are as follows: Arrivals_ = -.73 Arrivals_ =. Arrivals_2 =.78 Arrivals_3 =.65 Arrivals_4 =.259 Arrivals_5 =.359 Arrivals_6 =.46 335 Hospital A - ED Diversion vs. Arrivals to ER by Hour 3 25 25 2 2 5 5 5 5 Arrivals_ Arrivals_ Arrivals_2 Arrivals_3 Arrivals_4 Arrivals_5 Arrivals_6 Divert Minutes Fig. 2 There is also a possible corollary to hypothesis #, that overall ED census is a driver of diversion. When counting the non-boarding census and comparing it to diversion status, however, the resulting point-biserial coefficient (r = -.5) makes clear that this potential explanation should be rejected as well. Page 5 of 29

again points towards examining hospital capacity as the primary determinate of diversion. Census/Admissions/Discharges: Hospital B The overall relationship between inpatient census and ED boarders in Hospital B was similar to that of Hospital A. However, detailed analysis of admission sources in Hospital B is not presented because scheduled demand played a far smaller role than that observed in Hospital A. During the study period, there were,58 weekday unscheduled admissions (average: 38.6/day) and 28 weekday scheduled admissions (average: 6.9/day). This suggests very little operational flexibility in controlling the variability or timing of scheduled arrivals. This likely reflects a fundamental difference between most community hospitals and larger academic centers. Hospital B Conclusions: The findings at Hospital B are consistent with and reinforce those at Hospital A. Specifically, there was no evidence that ED process times were temporally or mechanistically related to ED diversion while the relationship between ED arrival rate and diversion was weak. Instead, the data suggest that factors outside of the ED that combine to increase boarders and limit ED capacity are more important. Phase II Summary: Detailed flow analysis in two very different types of hospitals yielded similar findings with respect to the root cause of emergency department crowding and ambulance diversion. Neither increased patient inflow nor increased process time could be strongly gy related to diversion status. Instead, diversion was seen as an outflow problem, with busy emergency departments crowding as patients await transfer to crowded inpatient services. This problem is exacerbated in hospitals with large volumes of scheduled admissions, since these necessarily compete for the same resources. The collision of scheduled and unscheduled patient flows results in diversion patterns that are specific and reproducible. Because scheduled patient flows are theoretically controllable, better understanding of this phenomenon may suggest means of decreasing diversion. If the experience here may be generalized, we conclude that institutions with small (or uncontrollable) scheduled patient flows will require addition of resources on the inpatient side if diversion is to be substantially reduced. Page 26 of 29 Calls arrive by different scenarios: sometimes arrivals during a day have a bell-form with peak around 2: (scenario II) and in some days we can see peaks in evenings around 8: (scenario I). 8 Number of arrivals (Retail, scenario I) 6 4 2 8 6 4 8: 8:3 9: 9:3 : :3 : :3 2: 2:3 3: 3:3 4: 4:3 5: 5:3 6: 6:3 7: 7:3 8: 8:3 4.2.8 5.2.8 6.2.8 8.2.8 2.2.8 22.2.8 25.2.8 2..9 3..9 4..9 9..9 2..9 2..9 22..9 25..9 average 6 Number of arrivals (Retail, scenario II) 4 2 8 6 4 8: 8:3 9: 9:3 : :3 : :3 2: 2:3 3: 3:3 4: 4:3 5: 5:3 6: 6:3 7: 7:3 8: 8:3 average 7.2.8 23.2.8 24.2.8 28.2.8..9 8..9 5..9

At the same time, the form of the agents' staffing does not changed for the days with different scenario for arrivals. Number of agents (Retail) 6 55 5 45 4 35 3 25 2 5 8: 8:3 9: 9:3 : :3 : :3 2: 2:3 3: 3:3 4: 4:3 5: 5:3 6: 6:3 7: 7:3 8: 8:3 9: Averages over two weeks (..9-25..9) Retail service 2 8 6 4 2 8: 8:3 9: 9:3 : :3 : :3 2: 2:3 3: 3:3 4: 4:3 5: 5:3 6: 6:3 7: 7:3 8: 8:3 arrivals agents (HOT) agents (actual) There are two type of the number of agents with tags HOT and actual. The first number was calculated from the HOT data in the following way: Number of agents (Net)* % of occupancy = agents (HOT). The actual number was calculated in the following way: Number of answered calls * (average service time +3 seconds) = agents (actual). This number describes the number of servers working without idle time. Our conclusion from this figure and the following in the analysis below that the number of agents (HOT) is much bigger than it was in reality. 8: 8:3 9: 9:3 : :3 : :3 2: 2:3 3: 3:3 4: 4:3 5: 5:3 6: 6:3 7: 7:3 8: 8:3 Retail service 25 23 2 9 7 5 3 9 7 5 offered load (scenario I) agents offered load (scenario II) The figure above shows inconsistency in forms of the offered load and agents' staffing. All values presented in this figure are averages of the offered load in two different scenarios and the number of agents, calculated in each 3 minutes interval. The following 4 figures show the average number of arrivals, agents, waiting time, answered calls over two weeks period (..9-25..9) Retail service 2 8 6 4 2 8: 8:3 9: 9:3 : :3 : :3 2: 2:3 3: 3:3 4: 4:3 5: 5:3 6: 6:3 7: 7:3 8: 8:3 arrivals agents waiting time answered

Number of calls/agents 2 3 4 5 6 Retail service 8: 8:3 9: 9:3 : :3 : :3 2: 2:3 3: 3:3 4: 4:3 5: 5:3 6: 6:3 7: 7:3 8: 8:3 offered load agents waiting time Retail service 2 5% 45% 4% 35% 8 3% 6 25% 2% 4 5% % 2 5% % 8: 8:3 9: 9:3 : :3 : :3 2: 2:3 3: 3:3 4: 4:3 5: 5:3 6: 6:3 7: 7:3 8: 8:3 arrivals agents % of abandoned calls Percent of abandoned calls arrivals agents waiting time answered 8: 8:3 9: 9:3 : :3 : :3 2: 2:3 3: 3:3 4: 4:3 5: 5:3 6: 6:3 7: 7:3 8: 8:3 4 3 2 9 8 7 6 5 4 3 2 Retail service (8..9, Sunday) An example of the best case scenario We choose Sunday 8..9 as a day with the best performance characteristics between weekdays 8-5..9. Even in this day we see problems in the service in morning and evening hours. offered load agents % of abandoned calls % 8: 8:3 9: 9:3 : :3 : :3 2: 2:3 3: 3:3 4: 4:3 5: 5:3 6: 6:3 7: 7:3 8: 8:3 5% Number of calls/agents 2 5 5 4% 35% 3% 25% 2% 5% % Percent of abandoned calls 45% 25 5% Retail service

8: 8:3 9: 9:3 : :3 : :3 2: 2:3 3: 3:3 4: 4:3 5: 5:3 6: 6:3 7: 7:3 8: 8:3 8: 8:3 9: 9:3 : :3 : :3 2: 2:3 3: 3:3 4: 4:3 5: 5:3 6: 6:3 7: 7:3 8: 8:3 4 Retail service 2 8 6 4 2 arrivals (scenario I) agents arrivals (scenario II) The figure above shows inconsistency in forms of the arrivals and agents' staffing. All values presented in this figure are averages of the number of arrivals coming by two different scenarios and the number of agents, calculated in each 3 minutes interval. Average service time changes dramatically during the day. We can see that in the evening average service time is much bigger than in the midday. This means that the offered load in the evening is bigger than in the midday. Average service time (Retail service) 4 39 37 35 33 3 29 service time Predictable Variability = "Piecewise Stationary" load Goal: Predictably Stable Performance, but HOW? Arrivals Queues Waiting 8 5

Staffing Time-Varying Queues: Two Common Approaches: SSA Simple Stationary Approximation. Constant staffing levels, based on steady-state M/M/N, with =long-run average number of arrivals. PSA Point-wise Stationary Approximation. Time-varying staffing levels, based on steady-state M/M/N, with = (t) at each time t. Could result in time-varying (highly oscillating) performance (utilization, service), which is undesirable. 3 25 2 5 5 95 9 85 8 75 7 65 6 55 5 45 4 35 3 25 2 5 5.8.7.6.5.4.3.2. Simple Stationary Approximation (SSA, =.2).8.7.6.5.4.3.2. 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Arrived Staffing Offered Load Delay Probability 9 8 7 6 5 4 3 2 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Abandon Probability Average Queue Average Wait.7.6.5.4.3.2.

4 35 3 25 2 5 5 95 9 85 8 75 7 65 6 55 5 45 4 35 3 25 2 5 5..9.8.7.6.5.4.3.2. Point-wise Stationary Approximation (PSA, =.2).8.7.6.5.4.3.2. 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Arrived Staffing Offered Load Delay Probability 9 8 7 6 5 4 3 2 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Abandon Probability Average Queue Average Wait.9.8.7.6.5.4.3.2. exponential having mean. (6 min.) Assume: Service and abandonment rates are both Hour of Day 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 5 Calls per Hour 5 2 25 (Adapted from Green L., Kolesar P., Soares J. for benchmarking.) Two-hump arrival functions are common Example: "Real" Call Center

25 2 5 5 25 2 5 5 QD Staffing (=.) 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Arrived Staffing Offered Load ED Staffing (=.9) 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Arrived Staffing Offered Load 25 2 5 5 25 2 5 5 5 5 2 25 Calls per Hour 25 2 5 5 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Hour of Day QED Staffing (=.5) 25 2 5 5 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Arrived Staffing Offered Load

.8.7.6.5.4.3.2..4.2..8.6.4.2 Abandon Probability Abandon Probability 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Taget Alpha=. Taget Alpha=.2 Taget Alpha=.3 Taget Alpha=.4 Taget Alpha=.5 Taget Alpha=.6 Taget Alpha=.7 Taget Alpha=.8 Taget Alpha=.9 Abandon Probability 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Taget Alpha=. Taget Alpha=.2 Taget Alpha=.3 Taget Alpha=.4 Taget Alpha=.5 Taget Alpha=.6 Taget Alpha=.7 Taget Alpha=.8 Taget Alpha=.9.95.9.85.8.75.7.65.6.55.5.99.98.97.96.95.94.93.92.9.9 Utilization Utilization 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Target Alpha=. Target Alpha=.2 Target Alpha=.3 Target Alpha=.4 Target Alpha=.5 Target Alpha=.6 Target Alpha=.7 Target Alpha=.8 Target Alpha=.9 Utilization 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Target Alpha=. Target Alpha=.2 Target Alpha=.3 Target Alpha=.4 Target Alpha=.5 Target Alpha=.6 Target Alpha=.7 Target Alpha=.8 Target Alpha=.9

QD =. Negligible QED =.5 Seconds ED =.9 Minutes Congestion (Queue, Wait).2.4.35.3.8.25.6.2.4.5..2.5 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Average Queue Average Wait 7.8 6.6 5.4.2 4. 3.8 2.6.4.2 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Average Queue Average Wait 25.35 2.3.25 5.2.5. 5.5 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Average Queue Average Wait Erlang-A: Moderate (Im)patience M/M/N + M queue, with service rate µ equals abandonment rate Lt: number-in-system at time t (Birth & Death) For any N, transition-rates for {Lt, t }: 2 N- N N+ µ 2µ Nµ Nµ+ (N+)µ Note: The same transition rates as M/M/...

Square-Root Staffing: Motivation P { W q ( M / M / N M ) } PASTA P { L ( M / M / N M ) N } P { L ( M / M / ) N } Fact: L ( M / M / ) ~ Poisson(R); R= /µ offered load For R not too small: d d L(M/M/ ) Normal(R,R) R Z R N N R R P { W q } P Z R R Given target delay-probability = N R R N R R, with ( ) N is the "least integer for which" P { W q } Time-Varying Arrivals Extension: M t / M / N t + M (µ=) N R? t t R t Fact: t L ~ Poisson( t R ) R t the offered load at time t, namely: R t E (t S e ) E(S) E (u) du t t S Se excess service t t R t c (S e ) E(S) 2 E 2 s R, t. ( Fluid Dynamics ) d L t t N(R t, R ) hence, as before: N R, ( ) t t R t hopefully yields time-stable delay probability : Indeed, but in fact TIME-STABLE PERFORMANCE! What if µ? Use an Iterative Algorithm that is Simulation-Based

Time-Varying Arrivals Extension: M t / M / N t + M (µ=) N R? t t R t Fact: t L ~ Poisson( t R ) R t the offered load at time t, namely: R t E (t S e ) E(S) E (u) du t t S Se excess service t t R t c (S e ) E(S) 2 E 2 s R, t. ( Fluid Dynamics ) d L t t N(R t, R ) hence, as before: N R, ( ) t t R t hopefully yields time-stable delay probability : Indeed, but in fact TIME-STABLE PERFORMANCE! What if µ? Use an Iterative Algorithm that is Simulation-Based Performance Measures Delay probability in interval t, calculated by the fraction of customers who are not served immediately upon arrival, out of all arriving customers during the t time-interval Average waiting time in interval t, calculated by the average waiting time of all customers arriving during the t time-interval. Average queue length in interval t, taken constant over the time-interval. The queue length is averaged over all replications Tail probability in interval t, calculated as the probability that queue size equals or exceeds some threshold (e.g. 3 times average queue) Servers' Utilization in interval t, calculated as the fraction of busy-servers during a time-interval (accounting for servers who are busy only a fraction of the interval) Service grade t in interval t, which arises from the following "Square-Root Staffing" rule: N t R t t R t

exponential having mean. (6 min.) Service and abandonment rates are both Hour of Day 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 5 Calls per Hour 5 2 25 (Adapted from Green L., Kolesar P., Soares J. for benchmarking.) Two-hump arrival functiona are typical Example: "Real" Call Center Delay Probability Delay Probability.9.8.7.6.5.4.3.2. 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Target Alpha=. Target Alpha=.2 Target Alpha=.3 Target Alpha=.4 Target Alpha=.5 Target Alpha=.6 Target Alpha=.7 Target Alpha=.8 Target Alpha=.9 Service Grade Beta Global Performance.8.6.4.2.8.6.4.2 -.2 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 -.4 -.6 -.8 - -.2 -.4 -.6 -.8 Target Alpha=. Target Alpha=.2 Target Alpha=.3 Target Alpha=.4 Target Alpha=.5 Target Alpha=.6 Target Alpha=.7 Target Alpha=.8 Target Alpha=.9

Real Call Center: Empirical waiting time, given positive wait () α=. (QD) (2) α=.5 (QED) (3) α=.9 (ED)

5 5 2 25 Calls per Hour 25 2 5 5 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Hour of Day QED Staffing (=.5) 25 2 5 5 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 Arrived Staffing Offered Load P{Wait>} Erlang-A: Theoretical vs. Empirical P{Wait>}= vs. (N=R+R) Moderate Patience.9.8.7.6.5.4.3.2. -2 -.5 - -.5.5.5 2 Beta Empirical GMR

P{Wait>} Erlang-A: P{Wait>}= vs. (N=R+R).9.8.7.6.5.4.3.2. -3-2.5-2 -.5 - -.5.5.5 2 2.5 3 Halfin-Whitt GMR(.) GMR(.5) GMR() GMR(2) GMR(5) GMR() GMR(2) GMR(5) GMR() GMR(x) describes the asymptotic probability of delay as a function of when x. Here, and µ are the abandonment and service rate, respectively. P{Abandon}*N Erlang-A: P{Abandon}*N vs. 3.2 3 2.8 2.6 2.4 2.2 2.8.6.4.2.8.6.4.2-3 -2.5-2 -.5 - -.5.5.5 2 2.5 3 GMR(.) GMR(.5) GMR() GMR(2) GMR(5) GMR() GMR(2) GMR(5) GMR()

Iterative Algorithm Inputs System primitives: arrival function, service-time distribution, patience distribution (when relevant) ; Target delay probability ; Time horizon [,T]. Outputs Staffing function, aiming at a delay probability is over [,T]. Starting point: The infinite-server heuristics by Jennings, M., Massey, Whitt (996) Algorithm (cont.) Notation: t [, T ] (practically t=,, 2, ) Ni(t) staffing level at time t, determined in iteration i=,2, Li(t) number in the system at t, under staffing function si(t). Algorithm: () i=; N (t) (delay probability =) (2) Evaluate the distribution of L i (t), using simulation. (3) Determine N i+ (t) as follows: N i i (t) arg min{c : P{L (t) c} }, t T. (4) Check stopping condition: if N ( ) ( ), then N i+ (.) is our staffing level; i N i else i := i+, and go back to (2). () Last iteration. The algorithm converges to a Staffing Function N (.) least for which P { L ( t ) N ( t )}, tt.