Skip to main content
Journal Club5 min read

Where Hospital AI Actually Lands — and Why That Is the Finding

A geospatial study of 3,092 US hospitals asked not whether predictive AI works but where it goes. It pools in the better-connected, better-resourced places — and the top predictor was interoperability, not size or money.

Dr. Sven Jungmann

Dr. Sven Jungmann

CEO

Editorial collage of a navy US map with teal halftone dots clustered in a few regions and a single amber dot alone in an empty area.

Of 3,092 US hospitals that answered the 2023 American Hospital Association technology survey, about one in three — 35 percent — reported running AI-based predictive models. The interesting number is not that one. It is where those hospitals sit. Hwang and colleagues at Stanford put every facility on a map and asked a question accuracy benchmarks never reach: which patients live near a hospital that has these tools at all? The answer is a country of dense clusters and large blanks, and the blanks are not random.

For a physician this lands harder than any leaderboard. A model that helps only where help is already plentiful does not close a gap in care. It deepens it. So the right thing to study about a technology that is genuinely spreading is its distribution — and this preprint studies exactly that, with the appropriate humility about what a map can and cannot prove.

Design: a map, not a trial

This is an observational, cross-sectional analysis. The authors took the 2023 AHA Annual Survey and its information-technology supplement — which, for the first time, asked in detail whether a hospital uses predictive models — and graded each facility on a three-level scale: no predictive models, non-AI predictive models, or AI-based models. They joined this to community socioeconomic and shortage indicators and to 47 Centers for Medicare & Medicaid Services quality metrics measured quarterly from 2022 through mid-2025. Spatial statistics located the clusters; a random-forest model with SHAP values ranked what predicts adoption; and a longitudinal arm asked whether AI-adopting hospitals moved differently on quality over time. No patients were followed and nothing was randomised. Every result is an association.

What the data show

Adoption is sharply clustered — significant spatial autocorrelation on every implementation measure, with identifiable hotspots and coldspots rather than an even frontier. And the clustering runs the wrong way relative to need. Comparing implementation against need indicators, the authors report mismatch rates from 67.3 percent (social vulnerability) up to 72.4 percent (areas medically underserved for elderly and infant care). Read plainly: across most measures, more than two-thirds of deployment is misaligned with where the need is greatest, and the largest concentrations sit in the two worst quadrants — high need with low AI, and low need with high AI.

The predictor ranking is the part to keep. The single most influential factor was not bed count, ownership or community wealth. It was interoperability. The top SHAP feature was the Core Index, a measure of a hospital's ability to exchange health information; the inverse-signed Friction Index, capturing barriers to that exchange, was next, followed by delivery-system model, bed capacity and a minority-status indicator. Higher Core Index, more AI. And in the geographically weighted regression, Core Index was the one predictor that held a positive sign in every region while others flipped between positive and negative. Interoperability is where the relationship is stable; everything else is local. The quiet thesis is that AI settles where the data can already move.

What it cannot tell you

The quality arm is where the careful reader slows down. Of 20 metrics, 12 showed statistically significant trajectory differences for AI-adopting hospitals after correction. Some ran favourable — heart-failure 30-day mortality (slope difference -0.20) and pneumonia mortality (-0.32) drifted down, with fewer excess acute-care days after discharge. Others ran the wrong way: emergency-department median times lengthened (+1.16), more patients left the ED before being seen, and sepsis-care measures worsened (SEP-1 +0.91). That split is more believable than a clean win would be — and the authors are right to read it as association, not effect. Hospitals that adopt predictive AI differ from those that do not in staffing, case mix and baseline digitisation, any of which could move both adoption and outcomes. The study cannot separate the tool from the kind of hospital that buys it; its own random-forest model reached only an AUROC (area under the receiver-operating-characteristic curve) of 0.67, which the authors note reflects exactly such unmeasured confounding.

Two limits are structural and worth naming. The exposure is a survey checkbox: that a hospital "uses AI-based predictive models" says nothing about which model, for what task, on whom, or whether anyone acts on its output — the authors concede their categories are too coarse to tie to any specific outcome. And the unit is the hospital, not the patient, so inferring individual benefit from facility-level correlations would be an ecological overreach. None of this sinks the work. It fixes the ceiling on what the work can claim.

The strongest predictor of whether a hospital had adopted AI was not its size or its budget. It was whether it could already move data at all.

Why it travels

The figures are American and the survey instrument is American; neither maps onto a German or European system unedited. The mechanism does. If the ability to exchange data is the gate that decides which hospitals can use AI at all — and the most spatially consistent signal in the paper says it is — then the unglamorous work of standards, structured records and moving a discharge letter without a fax is not merely a precondition for AI. It is what determines whether AI ever reaches the places that need it most. The risk this preprint documents is not that the technology fails. It is that it succeeds unevenly and hardens a divide that existed before it arrived. For anyone allocating a digitisation budget, that is the case for funding the connections before the algorithms.

Source: Hwang Y-M, Ng MY, Pillai M, Sahai MP, Hernandez-Boussard T. AI Implementation in U.S. Hospitals: Regional Disparities and Health Equity Implications. medRxiv, posted 28 June 2025; doi:10.1101/2025.06.27.25330441; funded by The SCAN Foundation, no conflicts of interest reported. An unrefereed preprint with an observational, hospital-level design — its findings are associations, not causal effects; a peer-reviewed version has since appeared in Nature Health.

#Journal Club#Health Equity#Clinical AI#Interoperability#Health Policy

Keep reading

Editorial collage of a confident stack of clinical document fragments bound by a teal bracket that stops at a closed ward door, with a single amber accent.
Journal Club

Sixty-Five Studies Agree the Models Win. The Ward Hasn't Noticed.

A PRISMA review of 65 studies finds language models consistently beat classical methods at classifying clinical text. The honest reading is narrower: it is a synthesis of single-site accuracy studies that mostly never asked whether the models work at the bedside.

Dr. Sven JungmannCEO
Editorial collage of a clinical summary sheet torn down the middle, one half framed by a teal speech bubble and the other by a navy clipboard, with a single amber dot on the tear line.
Journal Club

Two Readers, One Summary: Who Should Grade Patient-Facing AI?

A small Stanford study had clinicians and parents rate the same AI-written clinical summaries. They disagreed, significantly — and that disagreement, not the scores, is the finding worth keeping.

Dr. Sven JungmannCEO

This analysis comes from the people behind Visite.

Our weekly newsletter on AI in medicine. Every Friday, rigorously checked.

By signing up you agree to receive Grand Rounds by email. Unsubscribe anytime. More in our privacy policy.

Want to see this in your hospital?

30 minutes. Your questions. Our physician-founder shows you the platform personally.

Book a demo

No commitment. No sales pitch. Physician to physician.