Using Machine Learning to Identify Geographic and Socioeconomic Disparities in Dialysis Facility Outcomes Across the United States

Despite progress in dialysis care, the patient outcomes of mortality, hospitalization, and readmission rates remain unsatisfactory because of complex clinical, demographic, and socioeconomic interactions. For this study, we used unsupervised machine learning to identify clusters of dialysis faciliti...

Full description

Saved in:
Bibliographic Details
Published inThe Ochsner journal Vol. 25; no. 3; pp. 170 - 180
Main Authors Ashkar, Ziad M., Gottumukkala, Raju
Format Journal Article
LanguageEnglish
Published United States Ochsner Clinic Foundation Academic Center - Publishing Services 01.10.2025
Subjects
Online AccessGet full text
ISSN1524-5012
1524-5012
DOI10.31486/toj.25.0040

Cover

More Information
Summary:Despite progress in dialysis care, the patient outcomes of mortality, hospitalization, and readmission rates remain unsatisfactory because of complex clinical, demographic, and socioeconomic interactions. For this study, we used unsupervised machine learning to identify clusters of dialysis facilities based on quality metrics and sociodemographic factors, with attention to racial and geographic disparities. We sourced facility-level data from data.cms.gov and sourced ZIP Code Tabulation Area-level sociodemographic data from the 2021 American Community Survey via the US Census Bureau application programming interface. Datasets were linked by ZIP code, standardized, and analyzed using principal component analysis and k-means clustering. We examined geographic patterns by US Census Bureau regions. Analyses were conducted in Python version 3.11.6 (Python Software Foundation) with the following libraries: pandas for data manipulation, scikit-learn for machine learning and principal component analysis, Matplotlib and Seaborn for data visualization, and GeoPandas for geographic mapping and spatial analysis. Two facility clusters emerged: Cluster 0 (n=4,609) and Cluster 1 (n=2,857). Cluster 1 was characterized by poorer outcomes (higher mortality, hospitalization, readmission, anemia, catheter use, and hyperphosphatemia); lower rates of fistula use; and lower dialysis adequacy compared to Cluster 0. Cluster 1 facilities were more prevalent in regions with lower income, higher unemployment, and lower college education, and they served populations with greater proportions of Black and Hispanic residents. Geographically, Cluster 1 facilities were concentrated in the southern and western United States. Compared to Cluster 0, a larger share of Cluster 1 facilities were for-profit facilities (91.4% vs 88.5%). This study highlights a distinct cluster of underperforming dialysis clinics serving socioeconomically disadvantaged and racially diverse populations. Addressing these disparities requires multifaceted strategies including patient-level, institutional, and policy-level interventions.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1524-5012
1524-5012
DOI:10.31486/toj.25.0040