Supplementary Material Economies of Scale and Scope in Hospitals

Similar documents
Clinical Use of Blood The AIM II Trial. Challenges of Near-Live Organisational Blood Use Monitoring

Economies of Scale and Scope in Hospitals:

Settling for Academia? H-1B Visas and the Career Choices of International Students in the United States

Admissions and Readmissions Related to Adverse Events, NMCPHC-EDC-TR

Technical Notes on the Standardized Hospitalization Ratio (SHR) For the Dialysis Facility Reports

Factors influencing patients length of stay

Physiotherapy outpatient services survey 2012

Methodology Notes. Identifying Indicator Top Results and Trends for Regions/Facilities

Differences in employment histories between employed and unemployed job seekers

Appendix. We used matched-pair cluster-randomization to assign the. twenty-eight towns to intervention and control. Each cluster,

USE OF APR-DRG IN 15 ITALIAN HOSPITALS Luca Lorenzoni APR-DRG Project Co-ordinator

Profit Efficiency and Ownership of German Hospitals

How Local Are Labor Markets? Evidence from a Spatial Job Search Model. Online Appendix

2013 Workplace and Equal Opportunity Survey of Active Duty Members. Nonresponse Bias Analysis Report

Frequently Asked Questions (FAQ) Updated September 2007

Burnout in ICU caregivers: A multicenter study of factors associated to centers

Fertility Response to the Tax Treatment of Children

A cluster-randomised cross-over trial

Objectives 2/23/2011. Crossing Paths Intersection of Risk Adjustment and Coding

Gender Pay Gap Report. March 2018

Is the HRG tariff fit for purpose?

Study population The study population comprised patients requesting same day appointments between 8:30 a.m. and 5 p.m.

Primary medical care new workload formula for allocations to CCG areas

ew methods for forecasting bed requirements, admissions, GP referrals and associated growth

Healthcare- Associated Infections in North Carolina

time to replace adjusted discharges

The Life-Cycle Profile of Time Spent on Job Search

Focus on hip fracture: Trends in emergency admissions for fractured neck of femur, 2001 to 2011

QUEUING THEORY APPLIED IN HEALTHCARE

Cost impact of hospital acquired diagnoses and impacts for funding based on quality signals Authors: Jim Pearse, Deniza Mazevska, Akira Hachigo,

Palomar College ADN Model Prerequisite Validation Study. Summary. Prepared by the Office of Institutional Research & Planning August 2005

Web Appendix: The Phantom Gender Difference in the College Wage Premium

Wage policy in the health care sector: a panel data analysis of nurses labour supply

Sampling Error Can Significantly Affect Measured Hospital Financial Performance of Surgeons and Resulting Operating Room Time Allocations

CHE Research Paper 144. Do Hospitals Respond To Rivals Quality And Efficiency? A Spatial Econometrics Approach

Enhancing Sustainability: Building Modeling Through Text Analytics. Jessica N. Terman, George Mason University

London CCG Neurology Profile

CAHPS Hospital Survey Podcast Series Transcript

Decision Fatigue Among Physicians

Irene Papanicolas, Alistair McGuire. Using a latent variable approach to measure the quality of English NHS hospitals

Module 13: Multiple Membership Multilevel Models. MLwiN Practical 1

Supplementary Online Content

The Performance of Worcester Polytechnic Institute s Chemistry Department

Big Data Analysis for Resource-Constrained Surgical Scheduling

Report on the Pilot Survey on Obtaining Occupational Exposure Data in Interventional Cardiology

Scottish Hospital Standardised Mortality Ratio (HSMR)

Supplemental materials for:

Employed and Unemployed Job Seekers and the Business Cycle*

The Internet as a General-Purpose Technology

Comparison of New Zealand and Canterbury population level measures

Care Quality Commission (CQC) Technical details patient survey information 2011 Inpatient survey March 2012

Exploring the impact of new medical technology on workforce planning

Services offshoring and wages: Evidence from micro data. by Ingo Geishecker and Holger Görg

New Joints: Private providers and rising demand in the English National Health Service

An Empirical Study of Economies of Scope in Home Healthcare

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds.

Development of Updated Models of Non-Therapy Ancillary Costs

Care Quality Commission (CQC) Technical details patient survey information 2012 Inpatient survey March 2012

Are R&D subsidies effective? The effect of industry competition

Healthcare- Associated Infections in North Carolina

Statistical Analysis for the Military Decision Maker (Part II) Professor Ron Fricker Naval Postgraduate School Monterey, California

Page 1 of 26. Clinical Governance report prepared for NHS Lanarkshire Board Report title Clinical Governance Corporate Report - November 2014

Journal of Business Case Studies November, 2008 Volume 4, Number 11

Policy Brief. Nurse Staffing Levels and Quality of Care in Rural Nursing Homes. rhrc.umn.edu. January 2015

Department of Economics Working Paper

General Practice Extended Access: September 2017

Benchmarking length of stay

Session 74 PD, Innovative Uses of Risk Adjustment. Moderator: Joan C. Barrett, FSA, MAAA

Utilisation patterns of primary health care services in Hong Kong: does having a family doctor make any difference?

Gantt Chart. Critical Path Method 9/23/2013. Some of the common tools that managers use to create operational plan

An Overview of Ohio s In-Home Service Program For Older People (PASSPORT)

Type of intervention Secondary prevention of heart failure (HF)-related events in patients at risk of HF.

Online Education for Home Care and Hospice from Educators You Trust. Page 1 of 7. General Education Catalog of Courses

Patient-mix Coefficients for December 2017 (2Q16 through 1Q17 Discharges) Publicly Reported HCAHPS Results

The Impact of Competition on Management Quality: Evidence from Public Hospitals

MaRS 2017 Venture Client Annual Survey - Methodology

Planning Calendar Grade 5 Advanced Mathematics. Monday Tuesday Wednesday Thursday Friday 08/20 T1 Begins

Reducing Risks in Wartime Through Capital-Labor Substitution: Evidence from World War II. Chris Rohlfs Ryan Sullivan 1 Thomas J.

E-BULLETIN Edition 11 UNINTENTIONAL (ACCIDENTAL) HOSPITAL-TREATED INJURY VICTORIA

AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE

The Effects of Medicare Home Health Outlier Payment. Policy Changes on Older Adults with Type 1 Diabetes. Hyunjee Kim

DISTRICT BASED NORMATIVE COSTING MODEL

Critique of a Nurse Driven Mobility Study. Heather Nowak, Wendy Szymoniak, Sueann Unger, Sofia Warren. Ferris State University

SCHOOL - A CASE ANALYSIS OF ICT ENABLED EDUCATION PROJECT IN KERALA

Inpatient Experience Survey 2012 Research conducted by Ipsos MORI on behalf of Great Ormond Street Hospital

June 22, Leah Binder President and CEO The Leapfrog Group 1660 L Street, N.W., Suite 308 Washington, D.C Dear Ms.

Summary of Findings. Data Memo. John B. Horrigan, Associate Director for Research Aaron Smith, Research Specialist

Measuring the relationship between ICT use and income inequality in Chile

T he National Health Service (NHS) introduced the first

Public Funding and Its Relationship to Research Outcomes. Paula Stephan Georgia State University & NBER UNU-MERIT/MGSoG Conference November 2014

The Intangible Capital of Serial Entrepreneurs

Trends in Consultation Rates in General Practice 1995 to 2006: Analysis of the QRESEARCH database.

Effects of the Ten Percent Cap in Medicare Home Health Care on Treatment Intensity and Patient Discharge Status

Free to Choose? Reform and Demand Response in the British National Health Service

An analysis of Medicare Provider Utilization and Payment Data: A focus on the top 5 DRGs and mental healthcare

Appendix H. Alternative Patient Classification Systems 1

Principles for Market Share Adjustments under Global Revenue Models

Impact of Financial and Operational Interventions Funded by the Flex Program

Care Quality Commission (CQC) Technical details patient survey information 2012 Inpatient survey March 2012

University of Michigan Health System MiChart Department Improving Operating Room Case Time Accuracy Final Report

Transcription:

Supplementary Material Economies of Scale and Scope in Hospitals Michael Freeman Judge Business School, University of Cambridge, Cambridge CB2 1AG, United Kingdom mef35@cam.ac.uk Nicos Savva London Business School, Regent s Park, London NW1 4SA, United Kingdom nsavva@london.edu Stefan Scholtes Judge Business School, University of Cambridge, Cambridge CB2 1AG, United Kingdom s.scholtes@jbs.cam.ac.uk 1. Introduction This appendix contains supporting material designed to accompany the investigation presented in the main paper. In 2 we provide random slope estimates for the effects of volume from other service lines, to augment those provided for volume of the same service line given in the 6.1 of the paper. In 3 we show that there is no evidence that the errors are autocorrelated. In 4 we investigate the possibility of non-linear volume effects, and find little evidence to suggest this is the case. In 5 we discuss the fact that elective service lines might be formed endogeneously based on financial viability, and show how we ac for this, provide additional robustness, and discuss how if anything this would be expected to work against our findings. In 6 we report on the results a number of additional tests that (i) are performed on a subset of data corresponding to hospital trusts that are more geographically isolated, (ii) limit the possibility of extreme cost outliers driving the results, (iii) compare hospital trusts based on a set of common (rather than all) HRGs that are performed by most (>8%) of trusts, (iv) re-run the models on a subset of the service-lines for which hospital trusts treat a high enough volume of patients, and (v) restrict the sample to trusts that operate only a single hospital site. The results from all of the models in 6 are in line with those reported in the paper. 2. Random slopes hospital trust volume effects In 6.1 of the paper we report on random slopes estimates for the effect of same service-line volume on hospital trust costs. In addition to this, it is interesting to consider whether the effect of volume from the other service lines on cost of the focal service line differs by service line. To do this we have estimated random slope estimates for the corresponding variables replacing the coefficients β2 h and β4 h in equation (7) in the paper with random, service-line dependent slopes: β h and 2,(C)[i] βh. As 4,(C)[i] 1

2 Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals Figure 1 Random slope coefficient estimates for the effect of volume from the other service lines on the cost of the focal service line, reported by service line (black) and combined (red), with bootstrapped 95% confidence intervals. HRG chapter A B C D E F G H J K L M P Q ALL Elect. volume Elective Costs.1..1.2 Slope estimates A B C D E F G H J K L M P Q ALL Emerg. volume.15.1.5..5.1 Slope estimates A B C D E F G H J K L M P Q ALL Elect. volume..1.2.3 Slope estimates Emergency Costs A B C D E F G H J K L M P Q ALL Emerg. volume.2.15.1.5. Slope estimates Note. A - Nervous system; B - Eyes and periorbita; C - Mouth, head, neck and ears; D - Respiratory system; E - Cardiac surgery and primary cardiac conditions; F - Digestive system; G - Hepatobiliary and pancreatic system; H - Musculoskeletal system; J - Skin, breast and burns; K - Endocrine and metabolic system; L - Urinary tract and male reproductive system; M - Female reproductive system; P - Diseases of childhood and neonates; Q - Vascular system. in the paper, to allow the random slopes to be correlated with the service-line specific intercepts we need to also replace the service-line fixed effect in equation (8) in the paper with the random effect, α (C)[i]. We then model (α (C)[i], β h, 2,(C)[i] βh 4,(C)[i] ) using a trivariate normal distribution, to allow for correlation between the random effects. The results are plotted together with bootstrapped 95% confidence intervals using 1, simulations from the posterior distribution of the MLMs in Figure 1. We have also plotted the combined slope estimates from the main estimations and comparing against this it can be seen that the direction of the individual effects are consistent with the combined estimates, with 95% confidence intervals overlapping in nearly all cases. 3. Autocorrelated errors One concern when working with a panel of time series data is that errors may be autocorrelated, i.e. costs change slowly and e.g. high costs in one year may indicate that costs will be high in the next year, also. The standard errors are often underestimated when autocorrelation of the error terms (at low lags) are positive. This is unlikely to be a major issue for our analysis, since results are highly significant and standard errors would have to be vastly underestimated for the results to be misidentified. A bigger concern, however, is that autocorrelation of the errors may bias the coefficient estimates in the within-between model formulation. We investigate this further in this

Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals 3 Figure 2 2 Plots of residuals (time t) against lagged residuals (time t 1) for elective costs (left) and emergency costs (right). Elective costs: residuals vs. lagged residuals 1. Emergency costs: residuals vs. lagged residuals Residuals (time t) 1 1 Residuals (time t).5..5 1. 2 2 1 1 Lagged residuals (time t 1) 1.5 1.5 1..5..5 1. Lagged residuals (time t 1) section. To do this, we have taken two approaches. In the first, we regress (using OLS) the residuals (at time t) from the within-between multilevel models (MLMs) against the lagged residuals (at time t 1). A plot of residuals vs. lagged residuals is provided in Figure 2, showing little evidence of any correlation and hence suggesting that our models ac for much of the within-trust and time-related correlation of the errors. This is confirmed by OLS models, with only 1.4% of the variance in the residuals for elective costs explained by the lagged residuals, and <.1% for emergency costs. We follow the informal approach described above with a traditional testing method. The standard test for the presence of first-order correlation is the Durbin-Watson statistic. However, this test can only be performed if the panel is balanced. For an unbalanced panel the recommended approach is to instead calculate the Baltagi-Wu locally best invariant (LBI) test statistic (Baltagi and Wu 1999). We estimate this using the xtregar command in Stata 12.1. Note that the models that we estimate this statistic for are not identical to those presented in the paper (since the particular command in Stata does not allow the estimation of multiple random effects, and so instead we are only able to include trust service-line REs), but is sufficiently similar (and, if anything, since the control structure in the paper includes additional time-related controls, the estimates reported here are likely to be conservative). Doing so we calculate the LBI statistic to be 1.82 for elective costs and 1.85 for emergency costs. While critical values are not available in Baltagi and Wu (1999), if there were no evidence of first-order autocorrelation then these should take value 2. Since the LBI statistics are close to 2 in value, we conclude there is little reason to be concerned about autocorrelated errors. 4. Non-linear volume effects In the paper we assume the effects of (log) volume on (log) cost is linear, i.e. a 1% increase in volume has an x% effect on cost, regardless of the initial level of volume. Here we discuss relaxing

4 Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals Figure 3 Plots of estimated (mean-centered) volume effects on elective costs in models with only linear volume effects (solid lines) and in models also with non-linear volume effects (dashed lines). % chg. in elective cost 3 2 1-1 -2 Effect of volume on elective costs % chg. in elective cost 5-5 -1-7.5-5. -2.5. 2.5 Elective volume, same service line -7.5-5. -2.5. 2.5 Emergency volume, same service line 1 % chg. in elective cost 2 1 % chg. in elective cost 5-5 -1-15 -2-1 1 2 Elective volume, other service lines -2-1 1 2 Emergency volume, other service lines this assumption to allow for non-linear volume effects. We do this by including the squared values of the between hospital-trust volume terms in the main multilevel models. In Figures 3 and 4 are plotted for the elective and emergency patient types, respectively, the estimated between-effects of volume in models with linear only volume effects (i.e. the estimated effects reported in the paper) and in models with the inclusion of non-linear (squared terms) volume between-effects. 95% confidence bands for the non-linear effects are also plotted. As shown, with the exception of the effect of same service-line emergency volume on emergency costs (see top-right of Figure 4), there is little evidence that the interpretation of the results would change significantly if we had instead used a non-linear formation of volume. For the effect of same service-line emergency volume on emergency costs, the discrepancy between the non-linear and the linear estimations deserves further investigation. One possibility is that the non-linear effects identified may in part by driven by extreme points / outliers. To investigate this further, we have re-run the estimations using a subset of the data for which the relevant volume (shown in the histogram in the top-right of Figure 4) is constrained to take values between 2.5 and 2.5. After doing so and re-estimating, we find no further evidence of non-linearity, suggesting that the non-linear effects identified in Figure 4 are most likely driven by the presence of these outliers. Returning to the linear formulation of this model and re-estimating the effect of same service-line emergency volume on emergency cost, we find this to equal.51 (p-value <.1), as compared to.121. We note this difference in scale in 6.3 of the paper. While the other non-linear estimations do not indicate strong sensitivity of the results to outliers,

Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals 5 Figure 4 Plots of estimated (mean-centered) volume effects on emergency costs in models with only linear volume effects (solid lines) and in models also with non-linear volume effects (dashed lines). Effect of volume on emergency costs % chg. in emergency cost 15 1 5-5 -1-15 % chg. in emergency cost 3 2 1-1 -2-3 3-7.5-5. -2.5. 2.5 Elective volume, same service line -7.5-5. -2.5. 2.5 Emergency volume, same service line % chg. in emergency cost 2 1-1 -2 % chg. in emergency cost 2 1-1 -2-3 -2-1 1 2 Elective volume, other service lines -2-1 1 2 Emergency volume, other service lines in 6.4 of this supplementary material document we also report results where volume is constrained to be above a specified minimum level. 5. Endogenous service line composition Not every hospital trust may offer every type of treatment, and while hospitals in the UK are not as financially driven as their US erparts, the choice of which treatments to offer (i.e. the composition of the service lines) might still be related to the financial viability of different treatment options. To determine the extent to which hospital trusts select to treat particular types of patients or conditions, we can re-use the weights which were employed to case-mix adjust costs and LOS in the paper. Specifically, we defined weights α tcp = n tcp c C n tcp, where n tcp is the total number of patients of type p with HRG c in year t across all trusts. In each hospital trust we can then define a new variable Prop thcp = c C thp α tcp, (1) where C thp is the subset of HRGs c in service line C for patients of type p that are observed in trust h in year t. Then (1) gives the proportion of the average elective (propel) and emergency (propem) patient case-mix that a hospital trust treats in that service line in that year. A plot of these proportions (for each of the service lines) is given in Figures 5 and 6 for the elective and emergency patient types, respectively. As can be seen in Figure 5 there is some evidence that not all elective treatments are offered at all hospital trusts, while Figure 6 shows that other than for Chapter B, which relates to conditions

6 Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals Figure 5 75 Proportion of the average elective case-mix offered in each service line by every trust in any year. Chapter A Nervous System Chapter B Eyes and periorbita 2 15 5 1 25 5..25.5.75 1...25.5.75 1. Chapter C Mouth, head, neck and ears Chapter D Respiratory system 15 4 3 1 2 5 1..25.5.75 1...25.5.75 1. Chapter E Cardiac surgery and primary cardiac conditions Chapter F Digestive system 5 6 4 4 2 3 2 1..25.5.75 1...25.5.75 1. Chapter G Hepatobiliary and pancreatic system Chapter H Musculoskeletal system 15 2 1 5 15 1 5..25.5.75 1...25.5.75 1. Chapter J Skin, breasts and burns Chapter K Endocrine and metabolic system 2 15 1 1 5 5..25.5.75 1...25.5.75 1. 1 Chapter L Urinary tract and male reproductive system Chapter M Female reproductive system 75 2 5 25 1..25.5.75 1...25.5.75 1. Chapter P Diseases of childhood and neonates Chapter Q Vascular system 3 2 1 1 5..25.5.75 1...25.5.75 1.

Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals 7 Figure 6 6 Proportion of the average emergency case-mix offered in each service line by every trust in any year. Chapter A Nervous System Chapter B Eyes and periorbita 6 4 2 4 2..25.5.75 1...25.5.75 1. 25 Chapter C Mouth, head, neck and ears 5 Chapter D Respiratory system 2 4 15 1 3 2 5 1..25.5.75 1...25.5.75 1. 4 3 2 Chapter E Cardiac surgery and primary cardiac conditions 5 4 3 2 Chapter F Digestive system 1 1..25.5.75 1...25.5.75 1. Chapter G Hepatobiliary and pancreatic system 5 Chapter H Musculoskeletal system 4 4 3 2 3 2 1 1..25.5.75 1...25.5.75 1. Chapter J Skin, breasts and burns 5 Chapter K Endocrine and metabolic system 4 2 1 3 2 1..25.5.75 1...25.5.75 1. 4 Chapter L Urinary tract and male reproductive system 5 Chapter M Female reproductive system 3 4 2 3 2 1 1..25.5.75 1...25.5.75 1. 5 4 3 2 Chapter P Diseases of childhood and neonates 2 15 1 Chapter Q Vascular system 1 5..25.5.75 1...25.5.75 1.

8 Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals Table 1 Estimated effect of proportion of HRGs treated within a service-line Costs Length of stay Within effects Prop. elect..177.144 (.28) (.14) Prop. emerg..14.29 (.45) (.29) Prop. elect..465.17 (.55) (.24) Prop. emerg. 1.221.427 (.83) (.59) p <.5; p <.1; p <.1; Coefficient estimates and standard errors reported for the main effects. of the eyes and periorbita there is little evidence of emergency treatments not being offered at all trusts (unsurprising, as the unpredictable nature of patient arrivals to the ED means that hospitals have little choice over which emergency patients they will serve). For elective service lines, though, it is possible that the service lines are formed endogenously, i.e. hospital trusts may choose to only offer treatment to more profitable types of patients. To ac for this in the paper we: 1. Construct the dependent variable by dividing actual costs by the average cost, with both calculated using the same weights (i.e. the same case-mix). So, if e.g. only 8% of the HRGs in a service-line appear in the numerator, then only the same 8% of HRGs will appear in the denominator also. In this way costs are adjusted for observable differences in the service offering. 2. Use hospital trust and/or trust service-line fixed- and/or random-effects, to capture systematic, time-invariant differences in the costs at different trusts due to e.g. unobservable differences in the types of treatment offered. 3. Control in the elective costs models with propel, and in the emergency costs models with propem, to capture the fact that costs may differ depending on the diversity of services offered within a service line. (In the within-between models we instead control with the hospital trust mean-centered proportions and mean proportions, i.e. propel propel and propel, and similarly for propem.) The estimated coefficients for propel and propem in the within-between models are given in Table 1. As is shown, the greater the proportion of services offered within a particular service line (i.e. the wider the scope and more varied the service offering), the greater also the cost. This indicates that, if anything, those hospital trusts that choose to offer a narrow range of services, and hence are likely to have lower volume (as indicated by the strong positive correlation of.77 between propel and ln(nelserv), and of.67 between propem and ln(nemserv)), tend also to offer those less costly services. This would serve to work in the opposite direction to the same service-line, same patient-type volume effects that we find in the paper, and if this were driving

Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals 9 Table 2 Model parameter estimates - with service-line:proportion interactions as additional controls Costs Length of stay Elect. vol. (focal SL).55.17.26.9 (.1) (.4) (.4) (.3) Emerg. vol. (focal SL).9.137.17.89 (.14) (.11) (.6) (.8) Elect. vol. (other SLs).58.134.1.64 (.31) (.28) (.14) (.26) Emerg. vol. (other SLs).13.84.18.36 (.34) (.31) (.16) (.29) Model fit Observations 15,339 15,354 15,339 15,354 Marginal R 2.114.19.125.11 Conditional R 2.519.624.468.744 Bayesian inf. crit. 559.6-11,74.3-2,496.8-23,867.4 p <.5; p <.1; p <.1; Coefficient estimates and standard errors reported for the between effects; Marginal R 2 describes the proportion of variance explained by the non-random factors only, and conditional R 2 describes the proportion of variance explained by both the non-random and random factors. our results we would expect instead that the greater the volume of activity of the same-type in the same service line, the wider the service offering, and hence the higher the cost. To confirm the robustness of the results, we have re-run the analysis in the paper under two alternative scenarios. In the first, we recognize that as per Figure 5 in the various elective service lines the degrees to which hospital trusts appear to restrict the scope of their services differ. For example, the vast majority of hospital trusts offer the full range of musculoskeletal services (Chapter H), while this is far more varied for diseases of the respiratory system (Chapter D). Therefore, we repeat the analysis in the paper by including propel (and propem also, for completeness) interacted by the service line indicator, to allow the coefficients to vary depending on the service line. The results (between effects reported only, for brevity) are presented in Table 2. Comparing the MLM model results to those in the paper we find them to be consistent. In addition to the robustness check above, as a second check we also re-run the analysis for a subset of the data in which we only include observations for which (i) propel, propem >.5, i.e. the hospital trust treats at least 5% of the expected case-mix, and (ii) propel, propem <.8, i.e. on average over the years observed the hospital trust treats at least 8% of the expected case-mix. This reduces the sample by 17.% for the elective patient type, and 4.2% for the emergency patient type. In this model we do not control for propel, propem, propel or propem. This alleviates concerns that the results in the paper may be spurious and caused by the high correlation between propel and ln(nelserv), and between propem and ln(nemserv). The findings (between effects reported only, for brevity) are reported in Table 3. Again, results are consistent with those documented in the paper.

1 Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals Table 3 Model parameter estimates - excluding observations where low proportion of case-mix treated Costs Length of stay Elect. vol. (focal SL).46.4.3.16 (.1) (.5) (.4) (.3) Emerg. vol. (focal SL).33.77.19.69 (.16) (.1) (.7) (.7) Elect. vol. (other SLs).61.124.14.63 (.3) (.28) (.14) (.26) Emerg. vol. (other SLs).35.123.11.61 (.35) (.31) (.16) (.29) Model fit Observations 12,732 14,78 12,732 14,78 Marginal R 2.95.151.117.13 Conditional R 2.561.627.561.766 Bayesian inf. crit. -3,2.6-11,496.5-22,46. -24,448.2 p <.5; p <.1; p <.1; Coefficient estimates and standard errors reported for the between effects; Marginal R 2 describes the proportion of variance explained by the non-random factors only, and conditional R 2 describes the proportion of variance explained by both the non-random and random factors. Overall, we have demonstrated that while endogenous service line formation is a valid concern, we have taken steps to alleviate this in our models through an appropriate control structure, and have discussed that, if anything, this should work against the effects we find since larger hospital trusts tend to offer more varied services and treat a wider range of conditions, which we show has the effect of increasing their costs. We have also fit two further models that (i) added additional controls to ac for the fact that the potential endogeneous effect might be service line specific, and (ii) restricted the sample to only those hospital trusts that offer a wide range of services. Results in both of these cases are consistent with those in the paper. 6. Modeling - data alternatives In this section we report the results from a number of other estimations made using 6.1. Geographically dispersed hospital trusts In Table 4 we report the within-effects estimated for a subset of hospital trusts constrained to be 2km apart (see 6.2 of the paper for details). The main results are comparable in sign and scale to those reported in the paper, though the significant reduction in sample size (a 63% decrease in trust-year observations from 1,97 to 46, and of observations in general from 15,4 to 5,7) means that significance has reduced (and may in some cases results no longer appear statistically significant, e.g. the effect of volume from other service lines on emergency costs). 6.2. Capped costs It is not uncommon for hospital trusts costs to be magnified (or shrunk) by a few extremely expensive (or low-cost) patients. Therefore, when government agencies calculate hospital trust compensation based on HRG tariffs, the costs are often trimmed to exclude extreme observations.

Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals 11 Table 4 Model parameter estimates - subset of geographically dispersed hospitals Costs Length of stay Elect. vol. (focal SL).54.23.42.6 (.19) (.9) (.8) (.7) Emerg. vol. (focal SL).4.119.36.79 (.27) (.21) (.12) (.15) Elect. vol. (other SLs).11.116.25.11 (.58) (.61) (.26) (.58) Emerg. vol. (other SLs).26.51.7.89 (.65) (.66) (.29) (.62) Model fit Observations 5,683 5,684 5,683 5,684 Marginal R 2.77.17.11.14 Conditional R 2.474.567.449.767 Bayesian inf. crit. 415.5-3,184.1-8,18.3-8,665.6 p <.5; p <.1; p <.1; Coefficient estimates and standard errors reported for the between effects; Marginal R 2 describes the proportion of variance explained by the non-random factors only, and conditional R 2 describes the proportion of variance explained by both the non-random and random factors. We adopt a similar approach by limiting the influence of extreme costs by capping them at a minimum or a maximum value. We do this by constraining in every hospital trust h the average cost of treating patients with HRG c and of patient type p (i.e. cost thcp ) to take maximum value equal to K multiplied by the across-trust median in that year t, and minimum value equal to 1/K multiplied by the across-trust median. This leave the same sample and number of observations as originally available, but limits the extent to which extreme values for individual cost can affect the results. We report in Table 5 the between-effects of the cost estimations where K is set equal to 1 (Columns 2 and 3) and 5 (Columns 4 and 5). The results are nearly identical to those reported in the paper, suggesting little evidence that they are overly influenced by the presence of extreme (i.e. very high or very low cost) observations. 6.3. Common HRGs In the paper we compared hospital trusts across the set of all HRGs c treated in year t. An alternative to this would have been to compare hospital trusts across a set of common HRGs, i.e. excluding those conditions that are more rare for which treatment is typically only offered in large, teaching or specialist hospitals. This has the additional benefit that it partially reduces the potential for bias caused by endogeneous service line formation (see 5 of this document), since we only compare costs against a base set of HRGs c that are widely provided (and so typically higher volume also, with less discretion in their provision). To achieve this, we specify that an HRG is only included in the comparison if it is provided by at least 8% of the hospital trusts in a particular year. This has the effect of reducing the total number of elective conditions against which hospital trusts are compared across the 9 years from

12 Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals Table 5 Cost model parameter estimates - using capped costs Costs, cap K = 1 Costs, cap K = 5 Elect. vol. (focal SL).55.19.54.17 (.9) (.4) (.9) (.4) Emerg. vol. (focal SL).7.121.1.119 (.13) (.1) (.13) (.1) Elect. vol. (other SLs).41.134.38.126 (.3) (.28) (.29) (.26) Emerg. vol. (other SLs).7.98.6.89 (.34) (.31) (.33) (.29) Model fit Observations 15,339 15,354 15,339 15,354 Marginal R 2.96.18.96.181 Conditional R 2.516.627.52.634 Bayesian inf. crit. -71.5-11,816.2-1,29.5-13,498.1 p <.5; p <.1; p <.1; Coefficient estimates and standard errors reported for the between effects; Marginal R 2 describes the proportion of variance explained by the non-random factors only, and conditional R 2 describes the proportion of variance explained by both the non-random and random factors. Table 6 Model parameter estimates - calculated on a set of common HRGs Costs Length of stay Elect. vol. (focal SL).31.38.2.12 (.9) (.4) (.4) (.3) Emerg. vol. (focal SL).1.54.22.55 (.15) (.1) (.6) (.7) Elect. vol. (other SLs).37.111.4.58 (.31) (.29) (.13) (.26) Emerg. vol. (other SLs).6.128.14.63 (.36) (.32) (.15) (.29) Model fit Observations 15,335 15,354 15,335 15,354 Marginal R 2.8.152.14.94 Conditional R 2.488.624.431.731 Bayesian inf. crit. 3,331.4-1,87.3-18,356.6-23,147.1 p <.5; p <.1; p <.1; Coefficient estimates and standard errors reported for the between effects; Marginal R 2 describes the proportion of variance explained by the non-random factors only, and conditional R 2 describes the proportion of variance explained by both the non-random and random factors. 11,969 to 3,888, and emergency conditions from 11,824 to 6,19. This does, however, represent 74% of the total elective activity and 87% of the total emergency activity, respectively. This suggests that those HRGs that are kept in the sample are those that are higher volume. The results, using the same estimation method as in the paper, are provided in Table 6 ( betweeneffects only). As can be seen, the sign, direction and significance of the estimation on the subset of more common HRGs are all as reported in the paper. 6.4. Minimum service line size In 6.3 above we consider the possibility that the composition of HRGs used to compare hospital trusts (in particular, the inclusion of rarer conditions) may affect the results. Another possibility which we saw some evidence of in 4 of this document when discussing potential non-linear volume effects is that the inclusion of hospital trusts which treat only a low volume of activity within

Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals 13 Table 7 Model parameter estimates - excluding hospital-years with low service line volume Costs Length of stay Elect. vol. (focal SL).54.41.3.19 (.11) (.6) (.5) (.4) Emerg. vol. (focal SL).7.15.17.94 (.15) (.11) (.7) (.8) Elect. vol. (other SLs).37.138.6.82 (.3) (.27) (.14) (.25) Emerg. vol. (other SLs).1.96.13.66 (.34) (.31) (.16) (.28) Model fit Observations 14,151 14,151 14,151 14,151 Marginal R 2.88.177.1.11 Conditional R 2.528.636.512.766 Bayesian inf. crit. -1,717. -11,263.4-22,294. -23,327.6 p <.5; p <.1; p <.1; Coefficient estimates and standard errors reported for the between effects; Marginal R 2 describes the proportion of variance explained by the non-random factors only, and conditional R 2 describes the proportion of variance explained by both the non-random and random factors. a particular service line (i.e. which provide no or only a limited scope of service) may be outliers (and not a fair comparison) and may be influencing the results. To examine this, we have re-run the cost models from the paper on a subset of the data such that only those years in which a trust treats at least 25% of the median elective volume and emergency volume of activity within a particular service-line are included in the sample. This reduces the sample by 8% from 15,354 observation to 14,151. The results provided in Table 7 are almost identical to those in the paper, suggesting the findings are not heavily influenced by the presence of trust service-lines with a low volume of activity. 6.5. Single hospital trusts The analysis in the paper was run on the set of all trusts operating in England. As mentioned in 4 of the paper, trusts may operate multiple hospitals across multiple sites. While often there is often a main hospital site that treats the vast majority of the patients, there are a number of hospital trusts (e.g. Guy s and St. Thomas in London) where the same trust operates multiple large hospitals. As it is not possible in our data to distinguish between which patients were treated at which site, it could in these cases that specialties and/or elective or emergency patients are split over multiple sites and that the scale effects we identify are affected by this. To investigate this further, we have repeated the analysis from the paper using a subset of the data corresponding to those trusts that only operate a single hospital site. This has the effect of reducing the sample by 54.%, from 15,354 observations to 8,285. The results, reported in Table 8, show that even when restricting the sample there is little change in the sign or scale of the main results reported in the paper. As such, in the paper we report the results from all trusts.

14 Freeman, Savva, Scholtes: Supplementary: Economies of Scale and Scope in Hospitals Table 8 Model parameter estimates - subset of trusts operating one hospital site Costs Length of stay Elect. vol. (focal SL).51.13.18.3 (.13) (.5) (.6) (.4) Emerg. vol. (focal SL).39.126.25.91 (.2) (.15) (.9) (.11) Elect. vol. (other SLs).5.134.17.116 (.43) (.43) (.18) (.41) Emerg. vol. (other SLs).1.9.24.117 (.49) (.47) (.2) (.46) Model fit Observations 8,271 8,285 8,271 8,285 Marginal R 2.87.171.114.113 Conditional R 2.481.615.45.731 Bayesian inf. crit. 1,458.3-5,259.2-8,852.9-11,214. p <.1; p <.5; p <.1; p <.1; Coefficient estimates and standard errors reported for the between effects; Marginal R 2 describes the proportion of variance explained by the non-random factors only, and conditional R 2 describes the proportion of variance explained by both the non-random and random factors. References Baltagi B, Wu P (1999) Unequally spaced panel data regressions with AR(1) disturbances. Econometric Theory 15(6):814 823.