ARTICLE. Physician Variation in Test Ordering in the Management of Gastroenteritis in Children. physicians is well documented. 1-3 This variation in

Similar documents
ARTICLE. Professional Interpreters and Bilingual Physicians in a Pediatric Emergency Department

Racial disparities in ED triage assessments and wait times

Type of intervention Secondary prevention of heart failure (HF)-related events in patients at risk of HF.

Performance Measurement of a Pharmacist-Directed Anticoagulation Management Service

IN EFFORTS to control costs, many. Pediatric Length of Stay Guidelines and Routine Practice. The Case of Milliman and Robertson ARTICLE

Undiagnosed Hypertension in the ED Setting An Unrecognized Opportunity by Emergency Nurses

Chan Man Yi, NC (Neonatal Care) Dept. of Paed. & A.M., PMH 16 May 2017

Hitting the mark... sometimes. Improve the accuracy of CPT code distribution. MGMA Connexion, Vol. 5, Issue 1, January 2005

Admissions with neutropenic sepsis in adult, general critical care units in England, Wales and Northern Ireland

Telephone consultations to manage requests for same-day appointments: a randomised controlled trial in two practices

The impact of nighttime intensivists on medical intensive care unit infection-related indicators

Analysis of Nursing Workload in Primary Care

Level of acuity in pediatric patients with recurrent emergency department visits

Outpatient Experience Survey 2012

Demographic Profile of the Officer, Enlisted, and Warrant Officer Populations of the National Guard September 2008 Snapshot

Chapter VII. Health Data Warehouse

APPENDIX 2 NCQA PCMH 2011 AND CMS STAGE 1 MEANINGFUL USE REQUIREMENTS

Pediatric Neonatology Sub I

Research Design: Other Examples. Lynda Burton, ScD Johns Hopkins University

Gantt Chart. Critical Path Method 9/23/2013. Some of the common tools that managers use to create operational plan

A comparison of two measures of hospital foodservice satisfaction

Demographic Profile of the Active-Duty Warrant Officer Corps September 2008 Snapshot

Rates of Emergency Department Use for Ambulatory Sensitive Conditions in the Los Angeles Healthy Kids Program

Quality of Care of Medicare- Medicaid Dual Eligibles with Diabetes. James X. Zhang, PhD, MS The University of Chicago

Primary Care Meets Population Health: The Parable of Preventable Hospitalizations

SEPTEMBER O NE-YEAR S URVEY SURVEY REPORT. Bachelor s Degree in Nursing Program

ARTICLE. Influence of Medicaid Managed Care Enrollment on Emergency Department Utilization by Children

Comparing Job Expectations and Satisfaction: A Pilot Study Focusing on Men in Nursing

s n a p s h o t Medi-Cal at a Crossroads: What Enrollees Say About the Program

EDUCATIONAL INTERVENTION. The Training of Pediatric Residents in the Care of Acutely Ill and Injured Children

Comparison of Anticoagulation Clinic Patient Outcomes With Outcomes From Traditional Care in a Family Medicine Clinic

The Florida KidCare Evaluation: Statistical Analyses

Avoidable Hospitalisation

Evaluation of an independent, radiographer-led community diagnostic ultrasound service provided to general practitioners

SEPTEMBER O NE-YEAR S URVEY SURVEY REPORT. Master of Science in Nursing Program

A Comparison of Methods of Producing a Discharge Summary: handwritten vs. electronic documentation

Issue Brief From The University of Memphis Methodist Le Bonheur Center for Healthcare Economics

CLOSING THE DIVIDE: HOW MEDICAL HOMES PROMOTE EQUITY IN HEALTH CARE

SEPTEMBER O NE-YEAR S URVEY SURVEY REPORT. Associate Degree in Nursing Program

Sampling Error Can Significantly Affect Measured Hospital Financial Performance of Surgeons and Resulting Operating Room Time Allocations

SAFE STAFFING GUIDELINE

Policy Brief. rhrc.umn.edu. June 2013

A Quantitative Correlational Study on the Impact of Patient Satisfaction on a Rural Hospital

Impact of Financial and Operational Interventions Funded by the Flex Program

Educating Physicians-in-Training About Resource Utilization and Their Own Outcomes of Care in the Inpatient Setting

January 1, 20XX through December 31, 20XX. LOINC(R) is a registered trademark of the Regenstrief Institute.

Potentially Avoidable Hospitalizations in Tennessee, Final Report. May 2006

2016 National NHS staff survey. Results from Wirral University Teaching Hospital NHS Foundation Trust

Pay-for-Performance: Approaches of Professional Societies

2017 National NHS staff survey. Results from Royal Cornwall Hospitals NHS Trust

Measuring Comprehensiveness of Primary Care: Past, Present, and Future

Fleet and Marine Corps Health Risk Assessment, 02 January December 31, 2015

2017 National NHS staff survey. Results from Dorset County Hospital NHS Foundation Trust

Minnesota Statewide Quality Reporting and Measurement System: Quality Incentive Payment System

FOCUS on Emergency Departments DATA DICTIONARY

CER Module ACCESS TO CARE January 14, AM 12:30 PM

2017 National NHS staff survey. Results from London North West Healthcare NHS Trust

Improving patient satisfaction by adding a physician in triage

Maximizing the Power of Your Data. Peggy Connorton, MS, LNFA AHCA Director, Quality and LTC Trend Tracker

Quality Management Building Blocks

Definitions/Glossary of Terms

Note, many of the following scenarios also ask you to report additional information. Include this additional information in your answers.

Using Secondary Datasets for Research. Learning Objectives. What Do We Mean By Secondary Data?

This matter was initiated by a letter from the complainant received on March 20, A response from Dr. Justin Clark was received on May 11, 2017.

PERFORMANCE MEASURE DATE / RESULTS / ANALYSIS FOLLOW-UP / ACTION PLAN

Hospital Discharge Data, 2005 From The University of Memphis Methodist Le Bonheur Center for Healthcare Economics

Appendix #4. 3M Clinical Risk Groups (CRGs) for Classification of Chronically Ill Children and Adults

2017 National NHS staff survey. Results from Nottingham University Hospitals NHS Trust

Cardiovascular Disease Prevention and Control: Interventions Engaging Community Health Workers

Healthcare- Associated Infections in North Carolina

Selected Measures United States, 2011

2017 National NHS staff survey. Results from Salford Royal NHS Foundation Trust

Scenario Planning: Optimizing your inpatient capacity glide path in an age of uncertainty

Health plans, employers, and government

January 1, 20XX through December 31, 20XX. LOINC(R) is a registered trademark of the Regenstrief Institute.

Supplementary Online Content

Issue Brief. Non-urgent Emergency Department Use in Shelby County, Tennessee, May August 2012

2016 Survey of Michigan Nurses

Emergency departments (EDs) are a critical component of the

The TeleHealth Model THE TELEHEALTH SOLUTION

Patients Being Weaned From the Ventilator: Positive Effects of Guided Imagery. Authors McVay, Frank; Spiva, Elizabeth; Hart, Patricia L.

Minnesota Statewide Quality Reporting and Measurement System: Quality Incentive Payment System

Physicians Views of the Massachusetts Health Care Reform Law A Poll

A Miracle of Modern Medicine. What medical discovery touches everyone in the United States?

2016 National NHS staff survey. Results from Surrey And Sussex Healthcare NHS Trust

Introduction and Executive Summary

The Trauma System. Prevention Pre-hospital care and transport Acute hospital care Rehab Research

Cost Effectiveness of Physician Anesthesia J.P. Abenstein, M.S.E.E., M.D. Mayo Clinic Rochester, MN

Variation in length of stay within and between hospitals

EXPERIENTIAL EDUCATION Medication Therapy Management Services Provided by Student Pharmacists

2017 National NHS staff survey. Results from The Newcastle Upon Tyne Hospitals NHS Foundation Trust

Quality of Care for Underserved Populations

The Development of the Oncology Symptom Management Clinic

Quality Management and Improvement 2016 Year-end Report

Impact of Scribes on Performance Indicators in the Emergency Department

The Nature of Emergency Medicine

The number of patients admitted to acute care hospitals

Thank you for joining us today!

MERMAID SERIES: SECONDARY DATA ANALYSIS: TIPS AND TRICKS

Health Quality Ontario

Transcription:

ARTICLE Physician Variation in Test Ordering in the Management of Gastroenteritis in Children Elizabeth C. Powell, MD, MPH; Louis C. Hampers, MD, MBA Objectives: To describe the variation among physicians in test ordering when caring for children with gastroenteritis and to explore the effect of hospital charge information on such variation. Design: Prospective, nonmasked, observational study and controlled trial of price information. Setting: Urban, university-affiliated pediatric emergency department. Participants: Pediatric emergency medicine faculty (n=10) and fellows (n=5). Methods: Test-ordering practices were reviewed during 3 periods: control, intervention, and washout. During the intervention period, test charge information was placed on patients emergency department records. Telephone contact with families was initiated 7 days after care. Results: We included 3198 visits. Individual physician mean test charges varied more than 2-fold during the control period (mean, $127; range, $82-$185). Based on their test charges (control period), physicians were assigned to the high (n=8) or low (n=7) test user group. Differences in mean charges in high vs low test users during the control period ($144 vs $112) persisted in the intervention period ($80 vs $52; Mann-Whitney P =.01), as did rates of intravenous fluid use (20% vs 14% in both periods). Among the lowest-acuity patients, low test users exhibited greater price sensitivity (vs high users). Patients treated by low test users did not differ in improved condition (82% vs 86%) or family satisfaction (93% vs 92%); they had more unscheduled follow-up (25% vs 17%; P.01), but were no more often admitted (5% vs 3%; P=.11). Conclusions: Physicians varied in resource use when treating children with gastroenteritis. High and low test users were sensitive to price information. This intervention did not seem to compromise patient outcome. Arch Pediatr Adolesc Med. 2003;157:978-983 From the Division of Pediatric Emergency Medicine, Northwestern University s Feinberg School of Medicine, Chicago, Ill. Dr Hampers is now with the Section of Pediatric Emergency Medicine, Children s Hospital, Denver, Colo. PRACTICE VARIATION among physicians is well documented. 1-3 This variation in part reflects uncertainty in medicine: although the number of clinical guidelines is increasing, in many specific clinical situations physicians determine the tests or interventions needed for their patients. 4 Thus, clinicians develop individual styles of patient management based on their experiences, their training, and their temperament. Some physicians are comfortable ordering fewer tests, whereas others prefer the additional information made available by using more studies. Inefficient use of diagnostic tests at academic institutions has been observed. 5 This seems in part to be a result of the academic environment, where the variety of payer relationships and hospitalwide billing procedures effectively insulate the individual physician from the relative costs of diagnostic testing. 6,7 In contrast, physicians in community practices face logistical and direct financial disincentives to ordering tests. 8,9 Test-ordering behavior may also reflect resident training, most of which occurs in the academic setting, and the career track of many academic pediatric emergency medicine physicians, who remain in an academic setting after completing their training. Although it is difficult to know the specific tests needed to provide highquality medical care, it is clear that inefficient use of diagnostic studies contributes to health care costs. 10,11 Providing test price information has been demonstrated to result in fewer tests being ordered. 12,13 In a previously published work, Hampers et al 13 described the general effect of price information in the emergency department (ED), without detailing inter- 978

physician variations. In this study, we describe the variation in test-ordering behavior in a single setting (the pediatric ED) among physicians with similar training (pediatric emergency medicine faculty and fellows) in managing a single disease (gastroenteritis). We also looked at the effects of the provision of test charge information on individual physician test-ordering practices and patient outcomes. METHODS Test Charges, Mean, $ 160 140 120 100 80 60 40 High Test Users Low Test Users This study was conducted in the ED of an urban, tertiary care pediatric hospital with an annual volume of 39000 patients. Boardcertified pediatric emergency medicine physicians or fellows, working independently, supervised the care of all patients studied. Although pediatric, emergency medicine, and family practice house staff participated in the care of many patients, they had limited autonomous decision-making authority. On arrival at the ED, patients were triaged by acuity of illness to 1 of 4 categories: emergent, urgent high, urgent low, and nonurgent. Between September 1, 1997, and March 31, 1998, a data form was attached to each patient chart at triage. The form asked physicians to identify patients aged 2 months to 10 years who had a complaint of vomiting, diarrhea, decreased oral intake, or fever. We defined gastroenteritis as vomiting or diarrhea, and we included all children with those complaints. Children with chronic illness (a history of immunosuppression or immunodeficiency, an inborn error of metabolism, or a ventriculoperitoneal shunt) were excluded. Data were collected during 3 periods. During the control period (September 1, 1997, to December 31, 1997), physicians were asked to circle the tests they ordered on the data form. During the intervention period (January 1, 1998, to March 31, 1998), physicians were asked to circle the tests they ordered on a similar data form that included the hospital charge for each test. During the washout period (April 1-30, 1998), the forms used in the control period were again used. The control period was 1 month longer than the intervention period because of seasonal variation in the incidence of gastroenteritis; this allowed enrollment of a similar number of cases in each of these 2 periods. Hospital charges for individual tests did not change during the study. One of the investigators (L.C.H.) or a research assistant reviewed the ED records after the visit to collect the following information: age, race or ethnicity (as recorded by registration personnel), insurance status, initial vital signs, diagnostic testing, use of intravenous fluids, disposition (admitted to the hospital or discharged from the ED), and attending or fellow supervising care. We calculated years in practice for supervising physicians as years since completion of residency. During the control and intervention periods, patient families were contacted by telephone approximately 1 week after the ED visit. Using a structured interview, caretakers were asked to describe the child s overall condition (better, same, or worse), whether they had taken the child to see a health care provider since the ED visit (scheduled or unscheduled and office or urgent care/ed), and whether the child had been admitted to the hospital. If the caretaker had been with the child at the time of the ED visit, they were asked to describe their overall satisfaction with their child s care (very unsatisfied, somewhat unsatisfied, somewhat satisfied, very satisfied). Caretakers who spoke only Spanish were interviewed in Spanish. Because the washout period was designed only to estimate the effect of the intervention on test charges, telephone interviews were not conducted during that time. Data were analyzed using statistical software (SPSS for Windows; SPSS Inc, Chicago, Ill). For comparisons between groups 20 0 Control Period of categorical data we used the 2 test; we used a 2-tailed t test for continuous variables. Test charges, which were not normally distributed, were compared between physicians and between periods using the Mann-Whitney test; 95% confidence intervals (CIs) for means are reported. Statistical significance was set at P.05. The study was approved by the institutional review board of Children s Memorial Hospital (Chicago). RESULTS Intervention Period Test charges associated with high and low test users during the control and intervention periods. Data were available for 1415 visits during the control period and 1429 visits during the intervention period. Review of daily ED records suggested that the data we report represented an estimated 90% of eligible cases; reasons for inappropriate exclusion included clerical error (no form was attached to the ED record) and physician error (the physician did not complete the form). The demographic and clinical characteristics of eligible patients who were not included in this study did not differ from those of patients who were included. Each physician (n=15) treated a mean (SD) of 94 (29) patients (range, 40-136) during the control period and 95 (31) patients (range, 44-159) during the intervention period. The 15 participating physicians exhibited wide variation in their clinical approaches. Physician-specific mean test charges during the control period ranged from $82.40 to $185.29 (overall mean, $126.66; median, $74.00). Based on this variation in test charges, physicians were divided into 2 groups: high test users (6 faculty physicians and 2 fellows) and low test users (4 faculty physicians and 3 fellows). Mean (SD) duration of practice was 7 (5) years for high test users and 4 (1) years for low test users (Mann-Whitney P=.20). Mean test charges associated with high vs low test users during the control period were $143.84 (95% CI, $130.39-$157.28) vs $112.43 (95% CI, $101.32-$123.54) (Figure). The demographic and clinical characteristics of children cared for by high and low test users are given in Table 1. There were no significant differences between groups in race or ethnicity, insurance status, triage category, or clinical characteristics. We also compared the case-mix of patients within each period; there were no significant differences between high and low test users in patient distribution by triage category during either 979

Table 1. Demographic and Clinical Characteristics of Children Cared for by 8 High and 7 Low Test Users* Characteristic High Users (n = 1351) Low Users (n = 1493) Race or ethnicity, No. (%) White 219 (16) 251 (17) Black 307 (23) 343 (23) Hispanic 701 (52) 775 (52) Other 74 (6) 71 (5) Not available 50 (4) 53 (4) Insurance, No. (%) Commercial 388 (29) 433 (29) Medicaid 801 (59) 903 (60) Uninsured 162 (12) 157 (10) Triage category, No. (%) Emergent 3 (0.2) 5 (0.3) Urgent high 266 (20) 277 (19) Urgent low 617 (46) 707 (47) Nonurgent 465 (34) 504 (34) Clinical characteristics, mean (SD) P Value Age, mo 33 (30) 34 (30).34 Temperature, C 37.8 (1.2) 37.8 (1.2).40 Heart rate, beats/min 137 (25) 136 (24).12 Respiratory rate, 34 (11) 34 (11) breaths/min.13 *Because of rounding, percentages may not all sum to 100. period ( 2, control period: P=.40; and intervention period, P=.98). Table 2 gives test charges for patients cared for by high and low users during the control and intervention periods stratified by triage category. Because emergent patients accounted for less than 0.5% of each group, they were combined with urgent high patients for ease of data analysis and presentation. Within triage categories, the charge differences between high and low test users were statistically significant for each category and within each period except for urgent high patients during the intervention period. Overall, the mean charges for tests were 48% (95% CI, 39%-56%) lower during the period when physicians were provided with price information. The biggest declines were observed among low test users ( 53% overall) and for patients who were triaged as nonurgent ( 58% overall and 67% for low test users). Among patients admitted to the hospital (17% during the control period and 11% during the intervention period), there were no significant differences in test charges between high and low test users during either period (control and intervention, Mann-Whitney, P=.56 for both), and there were no significant decreases in charges between periods (Mann-Whitney, P=.09). Among patients not admitted to the hospital, there were significant mean charge differences between periods (control period, $85.58; intervention period, $36.88; Mann-Whitney P.01) and within periods between high and low test users. More patients had no tests ordered during the intervention period (66% vs 41%; 2 P.01) (Table 3). The differences between high and low test users were significant for each test and for no tests during the control period. These differences were less marked during the intervention period..02.46.64 High test users administered intravenous fluids to 20% of patients during the control and intervention periods; low test users gave intravenous fluids to 14% of their patients during each period. Patients treated vs not treated with intravenous fluids had higher test charges during the control period (chemistry charges, $72.80 vs $5.34; and total charges, $91.57 vs $303.76). Among patients treated with intravenous fluids by high test users, charges declined significantly between the control and intervention periods (chemistry charges, $77.00 vs $47.80; and total charges, $314.56 vs $223.96; Mann-Whitney P.01 for each). Among patients treated with intravenous fluids by low test users, charges also declined significantly between periods (chemistry charges, $67.91 vs $43.86; and total charges, $291.17 vs $176.50; Mann- Whitney P.01 for each). We successfully obtained telephone follow-up information from 556 (39%) of 1415 families during the control period and 599 (42%) of 1429 families during the intervention period; 555 (48%) of these 1155 had been cared for by high test users and 600 (52%) had been treated by low test users. The percentage of children who were better was similar in practice style groups (86% for high users vs 82% for low users; 2 P=.20), as was the proportion of caretakers who reported that they were somewhat satisfied or very satisfied with their ED visit (92% vs 93%; P=.40). Among children treated by low test users, there was more unscheduled follow-up (25% vs 17%; P.01), mainly with primary care providers. Unscheduled returns to the ED occurred in 12% of patients cared for by low test users and 9% cared for by high test users (P=.08). The proportion of children who were admitted to the hospital during follow-up was similar (3% of patients treated by high test users and 5% of those treated by low test users; P=.11). A total of 354 patients were included in the brief washout period. Patients of high users (n=135) had charges similar to those observed during the intervention period (mean, $80.79, Mann-Whitney P=.99); low users (n=219) had a modest increase in charges (+11%), which was not statistically significant (mean, $58.71; Mann-Whitney P=.35). COMMENT There was enormous variation among physicians in tests ordered for pediatric patients with gastroenteritis, suggesting that some practiced in a resource-intensive manner and that others provided more efficient care. Variation in patient acuity does not explain these practice differences, and neither does family insurance status. As a result, average patient charges within triage categories vary 2-fold or greater. Providing price information was associated with a significant decrease in the number of tests ordered, reduced overall charges, and reduced some of the variation among physicians in test-ordering charges, particularly among visits triaged as urgent high. Hampers et al 13 previously described the general effect of price information in the ED, without detailing interphysician variations. The focus of the present study was to explore the implications of this intervention on the practice styles of faculty and fellows. In addition to 980

Table 2. Test Charges per Patient in the Control and Intervention Periods for Children Cared for by High and Low Test Users Control Period Intervention Period Variable Patients, No. Charges, Mean, $ Patients, No. Charges, Mean, $ % Change* Overall High users 641 143.84 710 80.42 44 Low users 774 112.43 719 52.43 53 Subtotal 1415 122.66 1429 66.33 48 Triage category Urgent high High users 148 283.09 121 178.98 36 Low users 159 228.16 123 154.85 32 Subtotal 307 254.64 244 167.31 34 P value.02.37 Urgent low High users 283 136.90 334 85.71 37 Low users 366 111.84 341 44.98 60 Subtotal 649 122.77 675 65.13 47 P value.03.01 Nonurgent High users 210 55.04 255 26.25 52 Low users 249 39.39 255 12.98 67 Subtotal 459 46.55 510 19.61 58 P value.05.01 *P =.01 for all. By Mann-Whitney test. reporting striking individual practice variation, we believe that this analysis provides information about the receptiveness of members of a group of academic practitioners to educational interventions designed to alter their behaviors. Results of the present analysis suggest that high and low test users are sensitive to price information. Among higher-acuity patients, high test users had greater declines than low test users, suggesting greater discretionary test use among high test users during the control period. During the intervention period, the mean charge difference between the high and low test users was less, suggesting a more consistent approach to test use in higher-acuity patients. Overall test charges declined the greatest among less acute patients ( 58% for nonurgent patients), suggesting that price information had the greatest impact on discretionary test ordering when physicians were treating children who were the least ill. Despite their lower utilization rates in the control period, low test users seemed to be even more sensitive to price information than were high users, further widening the gap between these 2 groups in the intervention period. It may be that the testordering practices of the low test users in part reflected a predisposition to price sensitivity. The high test users may have personality traits that made them less price sensitive, resulting in higher resource use in the control period and less sensitivity to charge information. This finding suggests an interesting paradox: price information may have the greatest impact on providers who are already practicing the most efficiently. Some of the variation in test ordering may reflect training environment: high test users, on average, had been in practice longer than low test users. Low test users may have been trained during a more cost-conscious time. Table 3. Tests Obtained by High and Low Users During the Control and Intervention Periods* Test Control Period (n = 1415) Patients, No. (%) Intervention Period (n = 1429) None High users (n = 641) 232 (36) 435 (61) Low users (n = 774) 347 (45) 513 (71) Subtotal 579 (41) 948 (66) P value.01.01 Complete blood cell count High users 162 (25) 97 (14) Low users 149 (19) 72 (10) Subtotal 311 (22) 169 (12) P value.01.03 Serum electrolytes High users 96 (15) 67 (9) Low users 78 (10) 43 (6) Subtotal 174 (12) 110 (8) P value.01.01 Urinalysis High users 133 (21) 74 (10) Low users 123 (16) 56 (8) Subtotal 256 (18) 130 (9) P value.02.08 *P.01 for all. By 2 test. We do not believe that the resource-efficient physicians we observed did not obtain needed tests. More specifically, the data for the low test users suggested that rates of family satisfaction, reports that the child was better, and rates of hospital admission were similar to those observed among their higher-spending peers. 981

Although mean charges declined for patients within each stratum, there were no significant differences between the control and intervention periods in charges for patients who were admitted to the hospital. Although the methods used to estimate dehydration in children are imprecise and were not standardized for this study, we observed consistency among physicians across periods in the percentage of children treated with intravenous fluids. Thus, test use among children admitted to the hospital and treatment with intravenous fluids were not sensitive to test price information. Specific charge information was provided only for tests; it is possible that additional charge information for other interventions (intravenous fluid administration and hospital admission) would alter the results we present. The washout period was brief and involved a limited number of patients. However, among high test users, overall charges during the washout period were similar to those in the intervention period, and low test users showed a modest increase. This suggests that short-term effects of the intervention on high and low test users persisted. Other pediatric studies have observed differences in test ordering specific to practice settings and to training. There is clear variation among care providers in the tests obtained on febrile infants. 14,15 Differences in the care of children with febrile seizures treated in community EDs vs university-affiliated children s hospitals have also been reported. 3 Practice variation between physicians trained in pediatric emergency medicine and those trained in emergency medicine in the management of croup has been demonstrated. 16 The present study, performed at a single institution among physicians who had completed or were in training to complete pediatric emergency medicine fellowships, suggests that individual behaviors also account for the significant variation in test-ordering practices. Although gastroenteritis is not a high-cost disease, it is a high-volume disease, and the variation in charges becomes significant over time. There were clear differences in the use of intravenous fluids among high and low test users that was unexplained by case-mix differences. During the intervention period, fewer patients had routine tests ordered (serum electrolytes, complete blood cell count, urinalysis, and urine culture). A higher fraction of patients had no tests ordered. The challenge is that the best test-ordering strategy, which allows for only needed tests and maximizes efficiency for children with gastroenteritis, is not known. High and low test users were sensitive to price information. Although these data suggest that information that caused efficient physicians to order even fewer tests resulted in satisfactory outcomes, additional study of this issue is warranted. Because most cases of gastroenteritis in infants and toddlers have a good prognosis, it is not surprising that the variation in testing we report did not seem to impact outcome or satisfaction. It is difficult to completely separate the effect of attending physician test-ordering behavior from that of the house staff they supervised. Some of the house staff (postgraduate years 1 and 2) participated during both periods; others spent a single month during either the control or the intervention period in the ED. Because each physician (attending or fellow) supervised all house staff, it is unlikely that systematic differences between house staff explain the variation among supervising physicians. Although the fellows supervised house staff independent of the attending, it is possible that the attending had an impact on the tests that the fellow ordered. However, because each attending was present with each fellow at various times, it is unlikely that this explains the variation we report within each stratum and between periods. It is also difficult to completely separate the effect of the incidence of gastroenteritis, which has seasonal variation, on attending physician test-ordering behavior. It is possible that physicians ordered fewer tests during the intervention period because gastroenteritis was more common then. Although this may account for some of the differences between periods, it is unlikely to explain the variation in charges between high and low test users during each period. Although our case-mix seems consistent in control and intervention periods, there may have been unrecognized differences in acuity, as the proportion of children admitted to the hospital was lower in the intervention period. However, as we reported in the results, significant differences in test charges persisted in a separate analysis of patients who were not admitted to the hospital. Because there are no generally accepted outcome measures for children with gastroenteritis treated and then discharged from the ED, we chose to obtain information about family perception that the child was better, family satisfaction with the ED encounter, rates of hospital admission during follow-up, and rates of unscheduled follow-up with primary care providers or in the ED. Although we observed a higher rate of unscheduled follow-up when fewer tests were ordered, there were no differences in hospital admission rates or in parent satisfaction. The reasons for the higher rates of unscheduled follow-up are not clear but may include unmet parental expectations for tests or physician failure to obtain needed studies. We do not know whether diagnostic studies were obtained during follow-up visits. Follow-up visits contribute to overall health care costs; thus, further study is needed to address this observation. Nevertheless, we believe it is unlikely that failure to obtain studies contributed to significant morbidity, as parent satisfaction and hospital admission rates were similar for both groups. Our study is limited by failure to obtain follow-up information on many patients. We used a fairly narrow time window, and many families provided incomplete contact information. Although this limits the generalizability of our results, there seem to be no systematic biases in follow-up rates between high and low test users, suggesting that the data we present are valid. Gastroenteritis represents a narrow range of complaints, and we limited this study to healthy children. In addition, we defined gastroenteritis broadly as vomiting, diarrhea, or both, and it is possible that children with other diagnoses were included in the data we present. For most children, this illness is self-limited, and thus we were unlikely to observe poor outcomes. For this reason, much test ordering in the management of gastroenteritis is discretionary. It is clear that within a single 982

What This Study Adds Data about physician-specific variation in test-ordering behavior and the effects of charge information are limited. We describe the variation in test use in a single ED among physicians with similar training in managing children with gastroenteritis. The data suggest that there was significant variation and that high and low test users were sensitive to price information. Low test use did not seem to be associated with poor outcomes. ED there was enormous variation among physicians in test ordering, contributing to significant variation in patient charges. Moreover, all physicians were sensitive to price information, and it reduced some of the variation among physicians, particularly when caring for higheracuity patients. Further study is needed to determine the best methods to allow physicians to provide costsensitive, efficient, high-quality care to their patients. Accepted for publication March 13, 2003. This study was funded in part by a special project grant from the Ambulatory Pediatric Association, McLean, Va. We thank Robert R. Tanz, MD, for his thoughtful review of this manuscript. Corresponding author: Elizabeth C. Powell, MD, MPH, Division of Pediatric Emergency Medicine, Children s Memorial Hospital, Box 62, Chicago, IL 60614 (e-mail: epowell@northwestern.edu). REFERENCES 1. Wennberg JE. Understanding geographic variations in health care delivery. N Engl J Med. 1999;340:32-39. 2. Petrack EM, Christopher NC, Kriwinsky J. Pain management in the emergency department: patterns of analgesic utilization. Pediatrics. 1997;99:711-714. 3. Hampers LC, Trainor JL, Listernick RL, et al. Setting-based practice variation in the management of simple febrile seizure. Acad Emerg Med. 2000;7:21-27. 4. National Guideline Clearinghouse Web site. Vol 2001. Rockville, Md: Agency for Healthcare Research and Quality; 2001. Available: at http://www.guideline.gov. Accessed July 3, 2003. 5. Cameron JM. The indirect costs of graduate medical education. NEnglJMed. 1985;312:1233-1238. 6. Greco PJ, Eisenberg JM. Changing physicians practices. N Engl J Med. 1993; 329:1271-1275. 7. Wong ET, McCarron MM, Shaw ST. Ordering of laboratory tests in a teaching hospital: can it be improved? JAMA. 1983;249:3076-3080. 8. Grumbach K, Bodenheimer T. Mechanisms for controlling costs. JAMA. 1995; 273:1223-1230. 9. Berwick DM, Coltin KL. Feedback reduces test use in a health maintenance organization. JAMA. 1986;255:1450-1454. 10. Grossman GM. A review of physician cost-containment strategies for laboratory testing. Med Care. 1983;21:783-802. 11. Grumbach K, Bodenheimer T. Painful vs painless cost control. JAMA. 1994;272: 1458-1464. 12. Sachdera RC, Jefferson LS, Coss-Bu J, et al. Effects of availability of patientrelated charges on practice patterns and cost containment in the pediatric intensive care unit. Crit Care Med. 1996;24:501-506. 13. Hampers LC, Cha S, Gutglass DJ, Krug SE, Binns HJ. The effect of price information on test-ordering behavior and patient outcomes in a pediatric emergency department. Pediatrics. 1999;103:877-882. 14. Kramer MS, Shapiro ED. Management of the young febrile child: a commentary on recent practice guidelines. Pediatrics. 1997;100:128-134. 15. Finkelstein JA, Christiansen CL, Platt R. Fever in pediatric primary care: occurrence, management, and outcomes. Pediatrics. 2000;105:260-266. 16. Hampers LC, Faries SG. Practice variation in the emergency management of croup. Pediatrics. 2002;109:505-508. 983