Predicting need for advanced illness or palliative care in a primary care population using electronic health record data.

Jung K, Sudat SEK, Kwon N, Stewart WF, Shan NH., J Biomed Inform. 92:103115., 2019 Apr 01

Investigators

Abstract

Timely outreach to individuals in an advanced stage of illness offers opportunities to exercise decision control over health care. Predictive models built using Electronic health record (EHR) data are being explored as a way to anticipate such need with enough lead time for patient engagement. Prior studies have focused on hospitalized patients, who typically have more data available for predicting care needs. It is unclear if prediction driven outreach is feasible in the primary care setting.

In this study, we apply predictive modeling to the primary care population of a large, regional health system and systematically examine the impact of technical choices, such as requiring a minimum number of health care encounters (data density requirements) and aggregating diagnosis codes using Clinical Classifications Software (CCS) groupings to reduce dimensionality, on model performance in terms of discrimination and positive predictive value. We assembled a cohort of 349,667 primary care patients between 65 and 90 years of age who sought care from Sutter Health between July 1, 2011 and June 30, 2014, of whom 2.1% died during the study period. EHR data comprising demographics, encounters, orders, and diagnoses for each patient from a 12 month observation window prior to the point when a prediction is made were extracted. L1 regularized logistic regression and gradient boosted tree models were fit to training data and tuned by cross validation. Model performance in predicting one year mortality was assessed using held-out test patients.

Our experiments systematically varied three factors: model type, diagnosis coding, and data density requirements. We found substantial, consistent benefit from using gradient boosting vs logistic regression (mean AUROC over all other technical choices of 84.8% vs 80.7% respectively). There was no benefit from aggregation of ICD codes into CCS code groups (mean AUROC over all other technical choices of 82.9% vs 82.6% respectively). Likewise increasing data density requirements did not affect discrimination (mean AUROC over other technical choices ranged from 82.5% to 83%). We also examine model performance as a function of lead time, which is the interval between death and when a prediction was made. In subgroup analysis by lead time, mean AUROC over all other choices ranged from 87.9% for patients who died within 0 to 3 months to 83.6% for those who died 9 to 12 months after prediction time.

Pubmed Abstract

Pubmed AbstractOpens New Window

Associated Topics

Related Publications

Clinician Perceptions of Receiving Different Forms of Feedback on their Opioid Prescribing

American Journal of Medical Quality
2022 Dec 29

Impact of implementing electronic prior authorization on medication filling in an electronic health record system in a large healthcare system

Lauffenburger JC, Stults CD, Mudiganti S, Yan X, Dean-Gilley LM, He M, Tong A, Fischer MA
J Am Med Inform Assoc.ocab119. doi: 10.1093/jamia/ocab119. Online ahead of print.
2021 Jul 19

Sutter Health

Book Appointment

Walk-In Care

Urgent Care

Video Visits

Primary Care

COVID-19

Patient Login

Patient Resources

Billing and Insurance

Patients and Visitors

My Health Online

My Health Online

Get Care

Featured Services

Treatments and Services

Schedule a Visit

Research

Education

Research and Education

Graduate Medical Education

Graduate Medical Education

About Sutter Health

Community

Careers

About Us

Work at Sutter Health