Information Technology Incident Management

Similar documents
MEDICAL CARE BRANCH DIRECTOR

Creating A Centralised Operations Centre

Page 1 of 5 Version No: 6 Authorised by: General Counsel

I. Definition of Terms

CASUALTY CARE UNIT LEADER

INFRASTRUCTURE BRANCH DIRECTOR

BLINN COLLEGE ADMINISTRATIVE REGULATIONS MANUAL

Sanilac County Community Mental Health Authority

Business Continuity Plan

Self-Assessment Questionnaire: Establishing a Health Information Technology Safety Program

FINANCE/ADMINISTRATION SECTION CHIEF

Impacting Key Hospital Performance Metrics Through Leveraging a Hospitalist Program Becker s Hospital Review April 14, 2018

STAGING MANAGER. Organize and manage the deployment of supplementary resources, including personnel, vehicles, equipment, supplies, and medications.

Position Description Executive Director of Mission 1. THE ORGANISATION AND OUR MISSION

Top 10 Considerations For Incident Response. By: Tom Brennan, ProactiveRISK

Accountable Care Atlas

Duty Nurse Manager Waitemata Central Position Description

Child Care Program (Licensed Daycare)

Reviewing Methods Used in Patient Safety Research: Advantages and Disadvantages. This SPSRN work is funded by

Information System Security

Ontario School District 8C

Internal Audit. Health and Safety Governance. November Report Assessment

INCIDENT COMMANDER. Hospital Command Center (HCC): Phone: ( ) - Fax: ( ) - Signature: Initials: End: : hrs. Signature: Initials: End: : hrs.

Transforming Delivery Systems for Population Health

Course Module Objectives

Mission. Directions. Objectives

Programmatic Policy and Procedure

2017 Multifamily Executive Awards

ICS-200.b: ICS for Single Resources and Initial Action Incidents Final Exam

Practical Application of High Reliability Principles in Healthcare to Promote Clinical Quality and Safety Outcomes

PMA Business Continuity Plan

UCL MAJOR INCIDENT TEAM MAJOR INCIDENT PLAN. Managing and Recovering from Major Incidents

Staff member: an individual in an employment relationship with CYM or a contractor who is paid for services to CYM.

Local Health Integration Network Authorities under the Local Health System Integration Act, 2006

Refer to Appendix A for definitions of the terminology used throughout this policy.

PATIENT REGISTRATION UNIT LEADER

DOCUMENTATION UNIT LEADER

University of Maryland Baltimore Emergency Management Plan Version 1.7

Quick Guide to A3 Problem Solving

PROCEDURE Health and Safety - Incident Investigation. Number: J 0103 Date Published: 12 June 2017

COMPLIANCE WITH THIS PUBLICATION IS MANDATORY

A GLOWING RESEMBLANCE A COMPARE AND CONTRAST OF MEDICAL AND NUCLEAR PERFORMANCE IMPROVEMENT INITIATIVES

ALASKA PACIFIC UNIVERSITY EMERGENCY RESPONSE PLAN

Manatee County Continuity of Operations Plan (COOP) Animal Services. for

May Emergency Operations Standard Operating Guideline

Navy Medicine Strategic Plan FY14 U.S. Navy Bureau of Medicine & Surgery

Preparing for the CMS Emergency Preparedness Rule Changes

ICS-200: ICS for Single Resources and Initial Action Incidents

USE FOR REFERENCE ONLY Military Services Complaint Processing Procedures USE FOR REFERENCE ONLY

HEALTH & SAFETY ORGANISATION AND ARRANGEMENTS

BETHESDA HEALTH. Commitment to Care: Partnering with Care Logistics to Adopt a Patient-First System for Care

INCIDENT COMMANDER. Date: Start: End: Position Assigned to: Signature: Initial: Hospital Command Center (HCC) Location: Telephone:

INCIDENT COMMANDER. Date: Start: End: Position Assigned to: Signature: Initial: Hospital Command Center (HCC) Location: Telephone:

Midwest Alliance for Patient Safety Patient Safety Organization Getting Started with a PSO. An Illinois Hospital Association Company

2016 Multifamily Executive Awards

NORTH CAROLINA FAMILIES ACCESSING SERVICES THROUGH TECHNOLOGY (NC FAST)

4/12/2016. High Reliability and Microsystem Stress. We have no financial, professional or personal conflict of interest to disclose.

Mission. Directions. Objectives

SFGH Strategic Plan

Service Description: Cisco Managed Services for Application Centric Infrastructure (ACI)

MEDICAL-TECHNICAL SPECIALIST: BIOLOGICAL/INFECTIOUS DISEASE

WORKPLACE VIOLENCE PREVENTION CHECKLIST

EFFECTIVE ROOT CAUSE ANALYSIS AND CORRECTIVE ACTION PROCESS

OCTOBER 2014 NBA TOOLKIT NBA PRF TOOLKIT. BC NURSES UNION NBA PROFESSIONAL RESPONSIBILITY PRFs

INTEGRATED DATA ANALYTICS AND CARE WORKFLOW OPTIMIZATION

DOD INSTRUCTION DOD LOW-LEVEL RADIOACTIVE WASTE (LLRW) PROGRAM

THE RFP PROCESS: STEPS FOR GETTING THE MOST ACCURATE BIDS

Board Report Agreed Management Actions Status Update

Eliminating Common PACU Delays

Accident/Incident Investigation Policy

Mission. Directions. Objectives

Continuous Quality Improvement (CQI) Plan Whatcom County EMS and Trauma Care Council

A Regional Payer/Provider Partnership to Reduce Readmissions The Bronx Collaborative Care Transitions Program: Outcomes and Lessons Learned

Adopting an All Hazard Approach to Emergency Management Productively Utilizing the National Incident Management System (NIMS)

National Incident Management System (NIMS) & the Incident Command System (ICS)

The National ACO, Bundled Payment and MACRA Summit. Success in Physician Led Bundles

Gantt Chart. Critical Path Method 9/23/2013. Some of the common tools that managers use to create operational plan

UW MEDICINE ICD-10 Program UW MEDICINE ICD-10

Emergency Preparedness in Senior Care

SafetyFirst: The Journey to High Reliability

3. Does the institution have a dedicated hospital-wide committee geared towards the improvement of laboratory test stewardship? a. Yes b.

Moving the Needle on Hospital Throughput: Breaking Through the Status Quo. Session ID: 325

Sample Written Program For. Safety Committee

Organization Review Process Guide Perinatal Care Certification

Presentation to the Helicopter Offshore Safety Inquiry

Presentation Objectives

Healthcare Solutions Nuance Clintegrity Quality Management Solutions. Quality. The Discipline to Win.

Chapter 23 Saskatoon Regional Health Authority 1.0 MAIN POINTS 2.0 INTRODUCTION 3.0 AUDIT CONCLUSIONS, SCOPE AND FINDINGS

VERIFICATION OF READINESS TO START UP OR RESTART NUCLEAR FACILITIES

MDUFA Performance Goals and Procedures Process Improvements Pre-Submissions Submission Acceptance Criteria Interactive Review

IT Managed Services Provider

COOK COUNTY HEALTH & HOSPITALS SYSTEM

Patient Care Coordination Variance Reporting

City and County of San Francisco Nonprofit Contractor Corrective Action Policy

Sub-title: Monitoring of Optimal Use of MCH e Registry, Evaluation and Action Plans. Effective date: 15 th January 2017 Review date: 1 st May 2017

FLORIDA DEPARTMENT OF JUVENILE JUSTICE PROCEDURE

CIP Cyber Security Incident Reporting and Response Planning

Who we are: Objective. An Innovative Shared Decision-Making Process Led to Improved Staff Satisfaction Session: C913

UNIVERSITY OF MISSISSIPPI MEDICAL CENTER PATIENT SAFETY PLAN

Responding to Healthcare Industry Regulations Date: May 9, 2013

Transcription:

Information Technology Incident Management Charles S Sawyer, MD, FACP Justin Meadows Jay Capodiferro IT Incident Management I Becker s Hospital Review 2018 I 1

Disclosures All of the presenters are full time employees of Mission Health System and have no conflicts of interest to disclose. IT Incident Management I Becker s Hospital Review 2018 I 2

Our BIG(GER) Aim: To get every person to their desired outcome, first without harm, also without waste and always with an exceptional experience for each person, family and team member. IT Incident Management I Becker s Hospital Review 2018 I 3

MAP OF MISSION HEALTH SYSTEM MAP OF MISSION HEALTH SYSTEM Western North Carolina 18-County Service Area Population (2016): 882,581 Percent over 65: 22% IT Incident Management I Becker s Hospital Review 2018 I 4

Mission Health System 6 th largest health system in North Carolina and the only tertiary care regional referral center in Western North Carolina. Region s only Level II Trauma Center 17 th largest employer in North Carolina Accounting for 1 in every 16 jobs in Buncombe and Madison Counties 1 in every 39 jobs in the 18 county service region Creates more than $1.04B economic activity in Buncombe and Madison counties and nearly $2B across the region IT Incident Management I Becker s Hospital Review 2018 I 5

MISSION BY THE NUMBERS* Total Patient Days 235,490 Total Discharges 48,027 Average LOS 4.9 Average Daily Census 645 Case Mix Index 1.6993 Total Surgery Cases 46,421 Total ED Visits 169,648 Total OP Visits 475,158 Total MAMA Flights 1,035 Total Physician Visits(employed) 537,354 *FY 16 as of 7/1/16 IT Incident Management I Becker s Hospital Review 2018 I 6

IT Incident Management I Becker s Hospital Review 2018 I 7

Incident Management BEFORE Documented in SharePoint (if at all) No cross-reference to ticketing/incoming support calls Management/leadership managed by rotating technical and application managers (7 resources). No categorization, reporting or post-incident follow-up RCA left up to manager or owning group Poor change management contributing to self-inflicted incidents and concurrent incidents. Poor internal and external communication regarding recognition, updates and closure of incidents. Senior IT leadership often informed of incidents by other health system leadership before IT even aware IT Incident Management I Becker s Hospital Review 2018 I 8

Recognition of Need A standardized approach to incident management Standardization of: Definitions and roles Evaluation of incidents Communication Documentation Root cause analysis Prevention of recurrences Identification of Trends IT Incident Management I Becker s Hospital Review 2018 I 9

Hospital Incident Command System A flexible, scalable, and adaptable system That can be used by all hospitals regardless of size, location, patient acuity, patient volume, or hazard type. HICS expands or contracts relative to the needs of the situation. By using HICS, hospitals adopt a nationally recognized system that promotes successful incident management http://hicscenter.org/shared%20documents/hics_guidebook_2014_7.pdf IT Incident Management I Becker s Hospital Review 2018 I 10

Hospital Incident Command System Assigns positions only as determined by the scope and magnitude of the incident In keeping with the principle of scalability, which is important during an emergency. Staff assigned positions are returned to their normal work functions once their position is no longer needed for the incident response http://hicscenter.org/shared%20documents/hics_guidebook_2014_7.pdf IT Incident Management I Becker s Hospital Review 2018 I 11

Foundational Principles Predictable chain of command with a suggested span of control Accountability of position and team function, including prioritized action checklists Common language for promoting communication A flexible and scalable incident management system addressing planning and response needs of any size hospital with universal applicability Modular design and adaptability allowing planning and management of non-emergent incidents or events Management by Objectives (MBO) in which the problem encountered is evaluated, a plan to remedy the problem identified and implemented, and the necessary resources assigned http://hicscenter.org/shared%20documents/hics_guidebook_2014_7.pdf IT Incident Management I Becker s Hospital Review 2018 I 12

IT Incident Management I Becker s Hospital Review 2018 I 13

Could Hospital Incident Command serve as a framework for IT Incident Management? IT Incident Management I Becker s Hospital Review 2018 I 14

HICS + ITIL + ITSM Hospital Incident Command System = framework understood by our clinical and business areas Information Technology Infrastructure Library (ITIL) and Information Technology Service Management (ITSM) = framework well understood by IT industry We then formed a small team that worked together to create a Major Incident and SPRNT process that combined what we believe are the best of both frameworks! IT Incident Management I Becker s Hospital Review 2018 I 15

Major Incident Process This process aligns most closely with ITIL and ITSM. Integrates into our existing Incident Management process for everyday incidents. Incident: an unplanned interruption to an IT service or reduction in the quality, including reliability and availability, of an IT service or any component part of that service. Major Incident: an event which has significant impact or urgency, which demands a response beyond the routine Incident Management process. IT Incident Management I Becker s Hospital Review 2018 I 16

Major Incident Process Major Incident further defined a) May either cause, or have potential to cause, impact on business critical services or systems; b) Or be an incident that has significant impact to patient care or Mission Health System revenue; c) Or be an incident that has significant impact on reputation, legal compliance, regulation or security of the organization. IT Incident Management I Becker s Hospital Review 2018 I 17

Problem Management Process This process catches Major Incidents after restoration of service. In Problem Management we focus on a) Documenting the recurrence of incidents by associating them with a Problem. b) Documenting workarounds until a complete resolution can be implemented to prevent the incident in the future. c) Performing and documenting root cause analysis for each incident. d) Ensuring incidents do not keep recurring or that impact is minimized. IT Incident Management I Becker s Hospital Review 2018 I 18

Incident and Problem Manager Role Created a full-time position to manage day-to-day activities for Incident and Problem Management. This created a single point of contact for incident escalation. While also providing consistent and standardized management of the processes instead of rotating responsibility through existing managers. It also gave us the resources we needed to report out on and understand more about our incidents (which we ll cover later). IT Incident Management I Becker s Hospital Review 2018 I 19

Major Incident Process Major Incident further defined In order to operationalize the Major Incident qualification in our ticketing system, we provided criteria to guide the consistent designation of Impact and Urgency used by the Incident Manager. IT Incident Management I Becker s Hospital Review 2018 I 20

Major Incident Process Major Incident further defined In order to operationalize the Major Incident qualification in our ticketing system, we provided criteria to guide the consistent designation of Impact and Urgency used by the Incident Manager. IT Incident Management I Becker s Hospital Review 2018 I 21

Major Incident Process Major Incident further defined In order to operationalize the Major Incident qualification in our ticketing system, we provided criteria to guide the consistent designation of Impact and Urgency used by the Incident Manager. IT Incident Management I Becker s Hospital Review 2018 I 22

SPRNT Service and Performance Restoration and Normalization Team This process aligns most closely with the HICS system. In some incidents, a formalized response effort is required to mitigate impact, manage risk, communicate to the organization and implement fixes and workarounds. Colloquially this was referred to as an IT Command Center. This conflicted with our Hospital Incident Command nomenclature. IT Incident Management I Becker s Hospital Review 2018 I 23

SPRNT modeled after HICS While we changed our name, we borrowed heavily from HICS to structure our response team and enable it to snap-in to the HICS system when the Hospital Command Center was activated. A SPRNT is initiated for Severity 1 incidents at the discretion of the Incident Director upon escalation from the Incident Manager. 6 of our critical services require an automatic SPRNT if they cannot be resolved in 45 minutes. IT Incident Management I Becker s Hospital Review 2018 I 24

SPRNT Roles Similar to HICS, the SPRNT team has designated roles with documented responsibilities to be performed by each role. Incident Director Application Team Manager Application Team Member Informatics Manager Rounder Medical Advisor Technical Team Manager Architect Technical Team Member Problem Manager Communications Logistics Scribe IT Incident Management I Becker s Hospital Review 2018 I 25

SPRNT Response to WannaCry IT Incident Management I Becker s Hospital Review 2018 I 26

SPRNT Response to WannaCry IT Incident Management I Becker s Hospital Review 2018 I 27

SPRNT Briefings SPRNT briefings are formalized. Usually top of the hour, depending on timing of the incident. Report outs/updates communicated 15 minutes prior to the briefing. Incident Director reviews current status and documents any planned actions. Emergency Change Management procedures are overseen by the Incident Director. IT Incident Management I Becker s Hospital Review 2018 I 28

SPRNT Communications A formal communication process is executed. Initial briefing Initial status communication (internal to IT) Initial status communication (external to IT) Notification to House Supervisor App and Technical Status (:45 on the hour) Briefing (top of every hour) Ongoing internal and external status communications Final briefing on resolution Most internal communications are facilitated through an integration of our ITSM system with Everbridge. IT Incident Management I Becker s Hospital Review 2018 I 29

SPRNT + HICS Snap-In In the event that the Hospital Command Center (HCC) is activated SPRNT team becomes a sub-cell IT representative physically or remotely joins their team All external to IT communications are managed by the HCC Distribution many times is managed by SPRNT in coordination with the HCC Internal IT communications continue uninterrupted Our designated SPRNT conference room also serves as the backup Hospital Command Center. Equipped with staged-and-ready radio, telecom and wireless equipment as well as printed materials to support the HCC team. IT Incident Management I Becker s Hospital Review 2018 I 30

SPRNT After Action Review and MOCK For each SPRNT we follow-up with an After Action Review (AAR) to review what went well and what can be improved. We also schedule MOCK incidents quarterly to practice our response efforts and keep everyone fresh in the absence of major incidents to manage. IT Incident Management I Becker s Hospital Review 2018 I 31

What did the data tell us about all of this process that was implemented? IT Incident Management I Becker s Hospital Review 2018 I 32

Key Metric Focus Areas Downtime versus Non-Downtime Incidents Internally Responsible Vendor Responsible Caused by Change Time to Resolution Internal Response Time (Process) Vendor Response Time (Escalation) Root Cause Analysis What are you going to do with it? IT Incident Management I Becker s Hospital Review 2018 I 33

Statistical Results IT Incident Management I Becker s Hospital Review 2018 I 34

Resolution Time Improvement IT Incident Management I Becker s Hospital Review 2018 I 35

Process Improvements Monitoring/Event Management Proactive Incident/Problem Management Escalation Who s on-call? Vendor escalation paths. Communications Content Schedule IT Incident Management I Becker s Hospital Review 2018 I 36

Thank you Questions? IT Incident Management I Becker s Hospital Review 2018 I 37