Pitfalls of AI Implementation in Risk Adjustment
Why AI Adoption Is Accelerating — and Why Caution Matters
Risk adjustment teams are being asked to do more with less: higher documentation scrutiny, expanding RADV (Risk Adjustment Data Validation) audits, larger volumes of unstructured clinical data, and tighter timelines. These pressures are driving Medicare Advantage Organizations (MAOs) toward AI-based solutions.
But while AI can meaningfully improve accuracy and efficiency, poorly implemented AI introduces real financial, compliance, and operational risk. In this post, we outline the most common pitfalls MAOs encounter when adopting AI for risk adjustment — and how to avoid them.
Pitfall #1: Overestimating What “AI Coding” Actually Means
Not all “AI coding” is created equal. Many tools surface diagnosis codes but do not evaluate whether the underlying documentation is defensible, clinically valid, or aligned with CMS expectations.
The core issues:
- AI may detect a mention of a disease but ignore MEAT (Monitor, Evaluate, Assess, Treat) criteria.
- Not all “mentions” translate into valid HCCs.
- Coding accuracy and documentation rigor still require human oversight.
- Marketing language often overstates what models can reliably interpret.
AI can assist coding, but it cannot autonomously replace coders and auditors — nor should it.
Pitfall #2: Treating Clinical Notes as Flat Text
Clinical notes are not simple text blobs. They contain temporal cues, uncertainty, negation, plan vs. history distinctions, and context that impacts coding decisions.
However, many poorly implemented AI systems struggle with:
- Distinguishing historical vs. active conditions.
- Identifying when a condition has resolved.
- Separating ruled-out diagnoses from confirmed ones.
- Understanding where in the note the provider is documenting assessment vs. family history vs. differential.
When AI flattens these nuances, it produces coding suggestions that are incorrect, misleading, or indefensible.
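To make this concrete, here is a minimal, hypothetical Python sketch of the kind of context checks a system needs before a raw mention becomes a coding candidate. The section names, cue lists, and Mention structure are illustrative assumptions, not how any particular product works; real systems rely on trained clinical NLP models rather than keyword rules.

```python
from dataclasses import dataclass

# Hypothetical, heavily simplified illustration: production systems use trained
# clinical NLP (negation, temporality, section detection), not keyword rules.

NON_CODABLE_SECTIONS = {"family history", "differential diagnosis", "past medical history"}
NEGATION_CUES = ("no evidence of", "denies", "ruled out", "negative for")
HISTORICAL_CUES = ("history of", "resolved", "s/p")

@dataclass
class Mention:
    text: str          # sentence containing the condition mention
    condition: str     # e.g., "CHF"
    section: str       # note section the sentence was found in

def is_codable_candidate(m: Mention) -> bool:
    """Return True only if the mention survives basic context checks."""
    sentence = m.text.lower()
    if m.section.lower() in NON_CODABLE_SECTIONS:
        return False                      # wrong part of the note (e.g., family history)
    if any(cue in sentence for cue in NEGATION_CUES):
        return False                      # negated or ruled-out condition
    if any(cue in sentence for cue in HISTORICAL_CUES):
        return False                      # historical or resolved, not active
    return True                           # still requires MEAT review by a coder

# A "flat text" system would surface both of these; only the second survives.
mentions = [
    Mention("Mother has a history of CHF.", "CHF", "Family History"),
    Mention("CHF, stable on furosemide; continue current dose.", "CHF", "Assessment and Plan"),
]
print([m.text for m in mentions if is_codable_candidate(m)])
```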
Pitfall #3: Ignoring Model Limitations
One of the fastest ways an AI deployment fails within an MAO is when the organization assumes models can do things they realistically cannot.
Here are the hard boundaries:
- AI cannot create documentation that does not exist. If a provider didn’t document it, neither humans nor AI can code it.
- AI cannot infer diagnoses from clinical hints. CMS requires explicit documentation.
- AI cannot replace coders for edge-case interpretation. Ambiguity still requires expert review.
Knowing these limitations is essential to deploying AI safely.
Pitfall #4: Overlooking Audit Defensibility
Many AI solutions increase code volume without increasing documentation fidelity. That’s a dangerous combination.
Common failure points:
- Mention-based systems that identify conditions without clinical substance.
- Algorithms that cannot produce a traceable, MEAT-compliant evidence trail.
- Outputs that are unreviewable or locked behind vendor interfaces.
- Lack of structured evidence that supports coding decisions during RADV review.
If AI elevates codes without elevating defensibility, it increases audit exposure — not accuracy.
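One way to picture a traceable, MEAT-compliant evidence trail is as structured data attached to every suggestion, so an auditor can re-find each excerpt in the source chart. The schema and example codes below are illustrative assumptions, not a standard format or any vendor's actual data model.

```python
from dataclasses import dataclass, field

@dataclass
class EvidenceSpan:
    """Verbatim chart excerpt with enough provenance to re-locate it during an audit."""
    excerpt: str
    document_id: str
    page: int
    meat_element: str   # one of: "Monitor", "Evaluate", "Assess", "Treat"

@dataclass
class CodeSuggestion:
    hcc_code: str
    icd10_code: str
    date_of_service: str
    provider_npi: str
    evidence: list[EvidenceSpan] = field(default_factory=list)

    def meat_elements_present(self) -> set[str]:
        return {span.meat_element for span in self.evidence}

    def is_defensible(self) -> bool:
        """Illustrative rule: require at least one MEAT element tied to a verbatim excerpt."""
        return len(self.evidence) > 0

# Illustrative example values only.
suggestion = CodeSuggestion(
    hcc_code="HCC 85",
    icd10_code="I50.22",
    date_of_service="2024-03-14",
    provider_npi="1234567890",
    evidence=[
        EvidenceSpan("Chronic systolic heart failure, stable.", "note_001", 3, "Assess"),
        EvidenceSpan("Continue lisinopril and furosemide.", "note_001", 3, "Treat"),
    ],
)
print(suggestion.meat_elements_present())   # contains "Assess" and "Treat"
```

A suggestion that cannot carry this kind of evidence is exactly the mention-based output that increases audit exposure.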
Pitfall #5: Betting on AI That Doesn’t Scale Operationally
Even high-performing AI can fail if it cannot handle real-world operational constraints.
Typical challenges:
- Ingesting millions of pages of heterogeneous clinical documentation.
- Processing multi-provider, multi-EHR chart styles.
- Managing latency and throughput for large retrospective reviews.
- Integrating with existing retrieval, coding, and QA workflows.
- Maintaining consistency as data volume grows.
AI that performs well in a demo but collapses under MAO-scale load is not useful.
Pitfall #6: Lack of Transparency and Explainability
Risk adjustment decisions cannot come from a black box. Coders, auditors, and compliance teams must understand:
- Why a code was suggested
- Where in the documentation the supporting evidence came from
- How the model interpreted ambiguous language
- What the reviewer should validate or correct
Tools that lack transparency lead to:
- Low coder trust
- Slow adoption
- Increased QA burden
- Greater audit uncertainty
Explainability is not a “nice to have” — it is a prerequisite for trust and compliance.
Pitfall #7: Misalignment Across Retrospective, Concurrent, and Prospective Workflows
Many AI solutions focus narrowly on retrospective code capture, leaving gaps across the full risk adjustment lifecycle.
When tools are fragmented:
- Retrospective outputs don’t align with prospective suspecting.
- Different systems produce conflicting code sets for the same member.
- Documentation is duplicated or inconsistent across workflows.
- Compliance teams lack a unified source of truth.
This fragmentation increases audit risk and operational inefficiency.
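As a hedged sketch of what a unified source of truth could look like in practice, the Python snippet below reconciles per-member code sets from separate retrospective and prospective tools and flags the disagreements compliance teams care about. The member IDs, codes, and field names are hypothetical.

```python
# Hypothetical illustration: reconciling per-member code sets produced by
# separate retrospective and prospective tools into one member-level view.

retrospective = {
    "member_123": {"E11.9", "I10", "I50.22"},
    "member_456": {"J44.1"},
}
prospective_suspects = {
    "member_123": {"E11.9", "N18.3"},        # CKD suspected but never captured retrospectively
    "member_456": {"J44.1", "I50.22"},
}

def reconcile(member_id: str) -> dict:
    retro = retrospective.get(member_id, set())
    prosp = prospective_suspects.get(member_id, set())
    return {
        "confirmed_both": sorted(retro & prosp),
        "retro_only": sorted(retro - prosp),        # captured but never suspected: verify documentation
        "prospective_only": sorted(prosp - retro),  # suspected but never captured: follow up or retire
    }

for member in sorted(set(retrospective) | set(prospective_suspects)):
    print(member, reconcile(member))
```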
A Practical Framework for Evaluating AI Tools in Risk Adjustment
To help MAOs evaluate solutions rigorously, here is a vendor-agnostic checklist:
Accuracy & Precision
- How often is the AI correct when it surfaces a code?
- Does it prioritize precision over volume? (A minimal way to measure this is sketched after the checklist.)
Evidence Quality
- Does the system produce MEAT-compliant, traceable evidence?
- Can coders easily review and validate each suggestion?
Coverage & Alignment
- Does the tool support retrospective, concurrent, and prospective use cases?
- Does it avoid conflicting outputs across workflows?
Compliance & Safety
- How does the system enforce CMS rules and guardrails?
- Does it avoid speculative or ambiguous coding behavior?
Transparency
- Can coders see exactly why the model surfaced a code?
- Are evidence packets auditor-friendly?
Scalability
- Can the platform handle multi-year chart volume at population scale?
Governance & Monitoring
- Are there controls for monitoring drift, managing updates, and maintaining audit logs?
Human-in-the-Loop
- Does the system strengthen coder review rather than bypass it?
This framework empowers MAOs to evaluate AI realistically and safely.
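As referenced under Accuracy & Precision, that first check can be made measurable with a small coder-validated sample. Below is a minimal sketch, assuming each AI suggestion has been labeled confirmed or rejected by a coder; the sample data is hypothetical.

```python
# Minimal sketch: estimate precision (and volume) from a coder-validated sample.
# "confirmed" means a coder upheld the suggestion with MEAT-supported evidence.

validated_sample = [
    {"suggested_code": "I50.22", "confirmed": True},
    {"suggested_code": "E11.9",  "confirmed": True},
    {"suggested_code": "N18.3",  "confirmed": False},  # mention only, no clinical substance
    {"suggested_code": "I10",    "confirmed": True},
]

confirmed = sum(1 for s in validated_sample if s["confirmed"])
precision = confirmed / len(validated_sample)

print(f"Suggestions reviewed: {len(validated_sample)}")
print(f"Precision: {precision:.0%}")  # 75% here; high volume with low precision inflates review burden
```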
How RAF Precision AI Addresses These Pitfalls — And Its Limits
Where RAF Precision AI Helps
- Surfaces diagnosis codes with high precision rather than raw volume.
- Produces MEAT-based, audit-ready evidence packets for each suggestion.
- Accelerates coding workflow and supports coder oversight rather than completely replacing human judgment.
- Enables efficient QA and internal mock audit workflows.
Where RAF Precision AI Has Limits
- It cannot correct missing or poor provider documentation.
- It cannot guarantee audit outcomes.
- It cannot replace coders for ambiguous or borderline cases.
- It cannot operate without clear evidence in the clinical record.
This transparency is critical for compliance-minded organizations.
Conclusion: AI Is Powerful — But Only When Implemented Thoughtfully
AI is reshaping risk adjustment — but success depends on careful, compliance-first implementation. When MAOs understand the pitfalls and deploy AI with the right expectations, they gain not just efficiency, but accuracy, defensibility, and operational resilience.
The goal isn’t automation for its own sake. The goal is better documentation, better coding decisions, and stronger audit readiness.


