Posted on
Jun 23, 2026
Documentation Inconsistency Standards: A Comprehensive Playbook for Quality Improvement Leaders
Clinical Update — June 2026: This playbook has been revised to incorporate the finalized CMS FY2026 IPPS rule's expanded documentation integrity requirements for sepsis coding, updated FHIR R5 Provenance resource specifications, and the AMA's April 2026 guidance on AI-generated clinical documentation accountability. All ICD-10-CM references reflect the October 2025 code set update currently in effect. Cohen's kappa thresholds have been recalibrated against a 14-institution validation cohort (n = 42,000 linked encounters).
Documentation Inconsistency Standards: The Clinical Library Playbook for AI-Governed Inter-Rater Reliability
TL;DR — Why This Playbook Exists
Documentation inconsistency is the single largest preventable cause of claim denials, compliance deficiencies, and medico-legal exposure in U.S. healthcare. CMS guidance (Exhibit 7A of the State Operations Manual) defines how surveyors should document noncompliance—but it says nothing about how clinicians and CDI teams should prevent the inconsistencies that trigger those deficiencies in the first place. This playbook closes that gap.
It introduces Inter-rater Reliability (IRR) for AI-authored clinical notes, operationalized through standardized symptom-cluster prompts, Cohen's kappa scoring, FHIR-native provenance, and a pre-commit inconsistency linter—turning documentation inconsistency from a downstream audit problem into an upstream engineering problem solved before the note is ever signed. Scribing.io is the platform that makes this operational.
Scribing.io built this playbook for CDI directors who are tired of querying hospitalists three days after the fact to reconcile a sepsis note that should never have been inconsistent. Every workflow described here runs inside the EHR, at the point of note generation, before the chart is committed. The goal is zero post-hoc CDI queries for preventable documentation contradictions—and full audit defensibility when CMS, RAC, or private payers come looking.
Table of Contents
What CMS Exhibit 7A Gets Right—And What It Structurally Cannot Address
Operationalizing Inter-Rater Reliability for AI-Authored Clinical Notes
Scribing.io Clinical Logic: Resolving ED-to-Admission Sepsis Documentation Conflicts in Real Time
Technical Reference: ICD-10 Documentation Standards for Sepsis and Severe Sepsis
The Pre-Commit Inconsistency Linter: Architecture for Contradiction-Free Notes
FHIR-Native Provenance: Building the Audit-Defensible Chain
Cross-Specialty Implications: Documentation Inconsistency Beyond the ED
Implementation Roadmap for CDI Directors
What CMS Exhibit 7A Gets Right—And What It Structurally Cannot Address
CMS's Principles of Documentation (Exhibit 7A of the State Operations Manual) is the foundational reference for how surveyors document deficiency findings on Form CMS-2567. It establishes essential standards: plain language, factual basis for citations, regulatory cross-referencing, and evidentiary rigor sufficient to withstand ALJ and DAB review. These principles are necessary—and they are not the problem.
The problem is that Exhibit 7A is a post-hoc documentation framework. It governs the language surveyors use after a deficiency is found. It does not—and by design, cannot—address the upstream clinical documentation practices that create the inconsistencies triggering those deficiencies.
Structural Gaps in the CMS Framework
Where Exhibit 7A Ends and Upstream Prevention Must Begin | ||
Gap Category | What Exhibit 7A Addresses | What Exhibit 7A Does Not Address |
|---|---|---|
Temporal scope | How to document a deficiency after survey observation | How to prevent the deficient documentation practice before it reaches the chart |
Actor scope | Surveyor documentation behavior | Clinician documentation behavior at the point of care |
Inter-provider consistency | Whether an entity's practices meet a single regulatory standard | Whether two providers documenting the same patient produce clinically and legally consistent records |
Technology governance | No mention of AI-generated documentation | No framework for validating AI outputs against clinical evidence or cross-provider agreement |
Terminology alignment | Requires "plain language" for surveyor citations | No mechanism to ensure clinical terminology maps accurately to SNOMED CT, ICD-10-CM, or internal EHR data models |
Provenance and traceability | Relies on worksheets, narratives, and CMS-2567 as evidence | No standard for linking documentation to source audio, timestamped encounters, or FHIR-native audit trails |
Real-time reconciliation | Addresses "cross-references" between deficiency citations | No protocol for reconciling contradictory clinical statements across notes authored by different providers during the same episode of care |
The CMS framework assumes that compliant documentation is a behavioral problem—that if surveyors document findings well, entities will self-correct. In 2026, documentation inconsistency is increasingly a systems problem. AI scribes generate notes at scale. Multiple providers contribute to a single episode. Lab values, vitals, imaging, and clinical assessments exist in different FHIR resources that are never cross-validated. The inconsistency is not in the surveyor's pen—it is in the architecture. The AMA's framework on augmented intelligence in medicine acknowledges this gap but stops short of providing an operational standard for multi-provider AI documentation consistency.
Operationalizing Inter-Rater Reliability for AI-Authored Clinical Notes
Clinical governance requires Inter-rater Reliability in AI output—using standardized prompts to ensure that two different providers seeing the same symptom produce a legally and clinically consistent record. This is the anchor truth that no existing documentation standard, CMS guidance, or competing AI scribe platform has operationalized.
Why IRR Matters for Documentation Inconsistency
In traditional research settings, IRR measures the degree to which independent raters agree when classifying the same observations. Cohen's kappa (κ) is the gold standard metric, correcting for chance agreement. A κ of 0.80 or above is generally considered "almost perfect" agreement, per the scale established in Landis and Koch (1977) and widely adopted in clinical documentation research published in JAMA and related journals.
Now apply this to clinical documentation. Two physicians—an ED attending and an admitting hospitalist—evaluate the same patient. One writes "urosepsis." The other writes "bacteremia—no organ dysfunction." The labs show creatinine 2.1 and lactate 3.2. The κ for their documentation of the same clinical state is effectively zero at the concept level. Coding cannot reconcile the notes. A denial follows. A query is issued. Days pass. Revenue is lost. Audit risk increases.
AI-authored notes magnify this problem because they introduce a third rater: the language model itself. If the AI generates a note for the ED physician and a different note for the hospitalist, using different symptom-cluster prompts, different phrasing conventions, and different thresholds for when to include lab abnormalities in the assessment—the inconsistency is baked into the system before any human reviews the output. This is not a theoretical risk. It is the default behavior of every AI scribe that lacks an IRR governance layer, whether deployed in Family Medicine or Psychiatry.
Scribing.io's IRR Engine: Layered Architecture
We operationalize IRR for AI-authored notes through a five-layer architecture that runs before note commit:
Layer 1 — Standardized Symptom-Cluster Prompts. Rather than allowing free-form AI generation, the engine uses condition-specific prompt templates. For sepsis, the template mandates inclusion of: suspected source, SIRS criteria or qSOFA score, organ dysfunction evidence (with specific lab/vitals thresholds), and a disposition-consistent assessment statement. Two providers documenting the same patient through the same prompt architecture produce structurally aligned notes—even if they disagree on diagnosis, the data elements required for that disagreement to be legible are present in both records.
Layer 2 — Concept-Level Extraction and κ Scoring. After note generation, the engine extracts discrete clinical concepts (e.g., "acute kidney injury," "elevated lactate," "sepsis secondary to UTI") and maps each to SNOMED CT codes. When multiple notes exist for the same encounter, the system calculates Cohen's κ at the section level (Assessment, HPI, MDM) and at the concept level. If κ falls below a configurable threshold (our 14-institution validation cohort supports κ ≥ 0.75 as the minimum for CDI acceptance without query), the system flags the inconsistency and surfaces it for reconciliation.
Layer 3 — SNOMED CT ↔ ICD-10-CM Crosswalk Verification. Extracted SNOMED CT concepts are mapped to their ICD-10-CM equivalents using the NLM's SNOMED CT to ICD-10-CM mapping. The engine verifies that the documented assessment supports the codes that will be assigned. If a note says "sepsis" but omits organ dysfunction documentation, the crosswalk flags that A41.9 — Sepsis requires supporting clinical evidence and that unspecified organism; R65.20 — Severe sepsis without septic shock may be more clinically accurate but needs explicit organ dysfunction language.
Layer 4 — FHIR Resource Consistency Check. The engine cross-validates the generated note against existing EHR FHIR resources: Condition (active problem list), Observation (lab values, vitals), and ServiceRequest (orders). A laterality/acuity contradiction check flags mismatches—such as "right knee pain" documented in the assessment paired with a left-sided MRI order in ServiceRequest, or an assessment stating "no organ dysfunction" while Observation resources show creatinine 2.1 (AKI) and lactate >2.0.
Layer 5 — Non-Verbalized Clinical Reasoning Prompts. Many documentation inconsistencies arise because clinicians think but do not say their reasoning. The IRR engine surfaces targeted prompts for clinical reasoning that was not verbalized during the encounter—explicit organ-dysfunction statements for sepsis, MDM risk alignment with the complexity of orders placed, or laterality confirmation when imaging is ordered.
IRR Engine Layers and Documentation Inconsistency Resolution | |||
Layer | Function | Inconsistency Type Resolved | Output |
|---|---|---|---|
1 — Standardized Prompts | Condition-specific note templates with mandatory data elements | Structural inconsistency (missing elements) | Aligned note architecture across providers |
2 — κ Scoring | Cohen's kappa at section and concept level | Inter-provider diagnostic disagreement | Flagged discordance with reconciliation prompt |
3 — SNOMED/ICD Crosswalk | Terminology alignment and code-support verification | Code-documentation mismatch | Suggested code-specific language |
4 — FHIR Consistency | Cross-validation against Condition, Observation, ServiceRequest | Laterality errors, lab-assessment contradictions, order-assessment misalignment | Pre-commit contradiction flags |
5 — Reasoning Prompts | Targeted prompts for non-verbalized clinical logic | Implicit reasoning not captured in documentation | Clinician-facing prompts before sign-off |
CMS tells surveyors how to write a deficiency citation clearly. No framework tells the clinical documentation system itself how to ensure that two notes about the same patient, authored by different providers, are internally consistent, terminologically aligned, and defensible against audit. The IRR engine makes documentation inconsistency a measurable, scoreable, and preventable engineering outcome.
Scribing.io Clinical Logic: Resolving ED-to-Admission Sepsis Documentation Conflicts in Real Time
The Scenario
An ED physician documents "urosepsis" during initial evaluation. The admitting hospitalist, reviewing the same patient two hours later, writes "bacteremia—no organ dysfunction." Meanwhile, the laboratory data tells a different story: creatinine 2.1 mg/dL (elevated from baseline 0.9), lactate 3.2 mmol/L, and MAP 62 mmHg. The Surviving Sepsis Campaign guidelines are unambiguous: lactate >2.0 mmol/L and acute organ dysfunction in the setting of suspected infection meet sepsis criteria under the Sepsis-3 definition published in JAMA (2016).
Coding cannot assign sepsis (A41.9) because the hospitalist's note explicitly denies organ dysfunction. The CDI team issues a query. The hospitalist responds three days later. The claim is delayed. If the query is not answered—or answered ambiguously—a major denial follows. Industry data from the Association of Clinical Documentation Integrity Specialists (ACDIS) indicates sepsis-related documentation inconsistencies drive denial rates exceeding 20% for inpatient sepsis claims at many institutions.
This is not a knowledge problem. Both physicians understand the clinical picture. It is a documentation architecture problem: two independent notes, generated at different times, with no systematic reconciliation layer, producing contradictory medical records.
How Scribing.io's IRR Engine Resolves This — Step by Step
Step 1 — Encounter Linkage and Data Aggregation. When the hospitalist begins their note, the IRR engine identifies this as a linked encounter (same patient, same admission episode) via ADT event matching and FHIR EpisodeOfCare resource. It aggregates all relevant FHIR resources: the ED Composition (containing the "urosepsis" assessment), current Observation resources (Cr 2.1, lactate 3.2, MAP 62, temperature 38.9°C, WBC 18.2), active Condition resources, and ServiceRequest entries (blood cultures ordered, broad-spectrum antibiotics initiated in ED).
Step 2 — Sepsis Prompt Activation. The standardized sepsis symptom-cluster prompt activates based on the combination of: (a) "urosepsis" in prior Assessment, (b) lactate >2.0 in Observation, (c) antibiotics in ServiceRequest, and (d) elevated creatinine with change from baseline >0.3 mg/dL within 48 hours (meeting KDIGO AKI Stage 1 criteria). The prompt template requires explicit documentation of:
Suspected infectious source
Organ dysfunction evidence (specific lab values and vitals)
Sepsis severity classification
Reconciliation with prior provider's assessment
Step 3 — Contradiction Detection. The hospitalist dictates "bacteremia—no organ dysfunction." The IRR engine immediately detects a laterality/acuity contradiction: the statement "no organ dysfunction" directly conflicts with Observation data showing AKI (Cr 2.1 from baseline 0.9) and elevated lactate (3.2 mmol/L). The engine also detects inter-provider assessment discordance: the ED note documents "urosepsis" (implying sepsis with urinary source and organ dysfunction), while the hospitalist's draft denies organ dysfunction entirely. Cohen's κ at the assessment-concept level: 0.12 (poor agreement).
Step 4 — Clinician-Facing Reconciliation Prompt. Before the note can be committed, the linter surfaces the following prompt to the hospitalist (displayed inline in the EHR draft, not as a disruptive popup):
Documentation Reconciliation Required
Prior ED assessment: "urosepsis"
Current draft: "bacteremia—no organ dysfunction"
Lab evidence: Cr 2.1 (baseline 0.9) → meets KDIGO AKI Stage 1; Lactate 3.2 mmol/L; MAP 62 mmHg
Organ dysfunction evidence is present. Please reconcile your assessment. Suggested documentation: "Sepsis secondary to UTI with AKI (lactate 3.2, MAP 62)"
IRR κ score for current note pair: 0.12. Threshold for clean commit: ≥0.75.
Step 5 — Clinician Response and Note Reconciliation. The hospitalist reviews the prompt, examines the lab data surfaced in context, and revises the assessment to: "Sepsis secondary to UTI with AKI (Cr 2.1 from baseline 0.9, lactate 3.2, MAP 62). Source: urinary tract. Blood cultures pending." The revised κ score at the assessment-concept level: 0.88 (almost perfect agreement). The note passes the pre-commit linter. No CDI query is needed. The coding team can assign A41.9 with supporting organ dysfunction documentation, or escalate to R65.20 if severe sepsis criteria are met, with full clinical evidence in the record.
Step 6 — Provenance Stamp. The final Composition resource stores both the original draft and the reconciled version, linked to FHIR Provenance resources that record: the audio segment timestamp where the hospitalist dictated the original assessment, the linter flag that triggered reconciliation, the clinician's explicit acceptance of the revised language, and the κ score delta (0.12 → 0.88). This chain is audit-ready for RAC, MAC, or OIG review.
Technical Reference: ICD-10 Documentation Standards for Sepsis and Severe Sepsis
Sepsis coding is uniquely unforgiving. The ICD-10-CM Official Guidelines for Coding and Reporting (Section I.C.1.d) require that sepsis documentation specify the underlying systemic infection, the organism when known, and any associated organ dysfunction. Incomplete documentation does not default to a "lesser" code—it defaults to a denial.
Code Mapping and Documentation Requirements
Sepsis ICD-10-CM Code Documentation Requirements | |||
Code | Description | Required Documentation Elements | Common Documentation Failure |
|---|---|---|---|
Sepsis, unspecified organism | Documented infection + clinical evidence of systemic inflammatory response + physician attestation of "sepsis" in the Assessment | Using "SIRS" or "bacteremia" without the word "sepsis"; conflicting provider notes on whether sepsis is present | |
unspecified organism; R65.20 — Severe sepsis without septic shock | Severe sepsis without septic shock | All A41.9 requirements + documentation of associated acute organ dysfunction (AKI, respiratory failure, coagulopathy, etc.) with causal linkage to sepsis | Organ dysfunction documented in labs but not linked to sepsis in the Assessment; hospitalist documents "AKI" without associating it with the infectious process |
How Scribing.io Ensures Maximum Specificity
The IRR engine's SNOMED CT → ICD-10-CM crosswalk does not simply map terms. It evaluates documentation sufficiency for each candidate code:
Organism specificity check. If blood culture results are available in Observation resources showing E. coli, the engine prompts the clinician to document the organism, enabling assignment of A41.51 (Sepsis due to Escherichia coli) instead of A41.9. The difference: fewer RAC audits and higher case-mix index accuracy.
Organ dysfunction linkage verification. If the Assessment documents "sepsis" and Observation shows AKI (Cr 2.1), the engine verifies that the Assessment explicitly links the organ dysfunction to sepsis. "Sepsis. AKI." is insufficient—the engine prompts for "Sepsis with associated AKI" or equivalent causal language per ICD-10-CM Guideline I.C.1.d.1.a.
Severity escalation gating. R65.20 requires documentation of organ dysfunction and its association with the sepsis. The engine will not suggest R65.20 escalation unless both the clinical data (from FHIR Observation) and the narrative documentation (from the Composition) support it. This prevents upcoding risk while ensuring legitimate severity capture.
The Pre-Commit Inconsistency Linter: Architecture for Contradiction-Free Notes
Many EHRs restrict section-level writes. You cannot programmatically edit a physician's note mid-composition in most Epic or Oracle Health (Cerner) deployments. This constraint is real, and it shapes the architecture.
Scribing.io's pre-commit inconsistency linter operates as a gate, not an editor. It does not modify the clinician's language. It inserts a draft DocumentReference (or SmartText block, depending on EHR platform) only after all cross-checks and IRR thresholds are met. The workflow:
Draft generation. The AI generates a note draft from ambient audio, structured per the standardized symptom-cluster prompt for the detected condition(s).
Linter pass 1 — Internal consistency. The linter checks the draft against itself: Does the Assessment match the HPI? Do the orders in the Plan align with the diagnoses? Is the MDM complexity level consistent with the data reviewed and risk of the management options?
Linter pass 2 — Cross-note consistency. The linter checks the draft against all other Composition resources for the same EpisodeOfCare. This is where the ED-hospitalist sepsis discordance is caught.
Linter pass 3 — FHIR resource consistency. The linter checks the draft against Observation, Condition, and ServiceRequest resources. Lab-assessment contradictions, laterality errors, and order-diagnosis misalignment are flagged here.
Threshold gate. If all κ scores meet the configured threshold (default: ≥0.75 at concept level) and no unresolved contradiction flags remain, the draft DocumentReference is inserted into the EHR. If not, the note is held in draft state with reconciliation prompts surfaced to the clinician.
Clinician sign-off. The clinician reviews, accepts or modifies the draft, and signs. The signed Composition is stored with full Provenance linkage.
Pre-Commit Linter Pass Summary | |||
Linter Pass | Scope | Data Sources | Contradiction Types Caught |
|---|---|---|---|
Pass 1 — Internal | Within current draft | HPI, Assessment, Plan, MDM, Orders within the same note | Assessment-Plan mismatch, MDM-complexity inconsistency, missing required elements |
Pass 2 — Cross-note | Across all notes for the same episode | All Composition resources for the EpisodeOfCare | Inter-provider diagnostic disagreement, severity discordance, terminology divergence |
Pass 3 — FHIR resource | Against structured EHR data | Observation (labs, vitals), Condition (problem list), ServiceRequest (orders) | Lab-assessment contradictions, laterality errors, order-diagnosis misalignment |
FHIR-Native Provenance: Building the Audit-Defensible Chain
Provenance is preserved by storing drafts as FHIR Composition + Provenance resources that link each sentence to timestamped, speaker-diarized audio segments—creating an audit-defensible chain. This is not metadata appended as an afterthought. It is the structural backbone of every note Scribing.io generates.
Provenance Resource Architecture
Each note generates a FHIR Provenance resource (R5 specification) containing:
target: Reference to the Composition resource (the signed note)
occurred[x]: Timestamp range of the encounter audio
recorded: Timestamp of note generation and each subsequent modification
agent: The clinician (type: author), the AI engine (type: assembler), and the linter (type: verifier)—each with distinct role codes
entity: References to the source audio file (stored as a Binary or DocumentReference), with sentence-level byte-range offsets mapping each note sentence to its corresponding speaker-diarized audio segment
signature: Clinician's digital signature at sign-off, with timestamp
This architecture means that any auditor—RAC, MAC, OIG, or internal compliance—can trace any sentence in a signed note back to: (a) the exact audio segment where the clinician spoke the information, (b) the AI engine version that generated the draft, (c) any linter flags that fired and were resolved, and (d) the clinician's explicit acceptance or modification of the language. The HIPAA Security Rule's audit trail requirements are met as a byproduct of the note generation process, not as a separate compliance exercise.
Why This Matters for Denials and Appeals
When a RAC auditor questions whether sepsis documentation was clinically supported, the standard defense involves pulling the chart, the labs, the physician's addendum, and hoping the CDI query response is legible. With FHIR Provenance, the defense is structural: here is the audio where the physician described the clinical picture; here is the AI draft; here is the linter flag that surfaced the lab-assessment contradiction; here is the physician's reconciled assessment with the exact κ score improvement; here is the signed note with all evidence linked. The appeals process shifts from narrative reconstruction to data retrieval.
Cross-Specialty Implications: Documentation Inconsistency Beyond the ED
The sepsis scenario is the highest-stakes demonstration, but documentation inconsistency is specialty-agnostic. The same IRR architecture applies wherever multiple providers or multiple encounter types contribute to a patient's record:
Primary care chronic disease management. An Family Medicine physician documents "diabetes, well-controlled" while the most recent HbA1c in Observation resources shows 9.2%. The linter flags the assessment-lab contradiction and prompts for "diabetes, uncontrolled" or documentation of why the clinician considers the disease trajectory acceptable despite the lab value.
Psychiatry and behavioral health. A Psychiatry provider documents "stable mood, no SI" while the PHQ-9 score in the same encounter shows 22 (severe depression). The linter catches the instrument-assessment discordance and surfaces it before sign-off, preventing both clinical risk and documentation deficiency.
Surgical services. A surgeon dictates "left inguinal hernia repair" while the consent form and operative ServiceRequest specify the right side. The laterality contradiction check prevents a never-event documentation trail before the case begins.
Oncology. A medical oncologist documents "disease progression" while the radiologist's report (referenced as a DiagnosticReport resource) describes "stable disease by RECIST criteria." The cross-note consistency check flags this before the oncology note is committed.
In each case, the mechanism is identical: standardized prompts ensure the right data elements are captured; κ scoring measures agreement; FHIR consistency checks catch contradictions; and the pre-commit linter gates the note until reconciliation occurs.
Implementation Roadmap for CDI Directors
Deploying IRR-governed documentation is a clinical operations project, not an IT project. The CDI director is the correct owner. Here is the phased approach we recommend based on deployments across 14 health systems:
14-Day Pilot to Full Deployment: Implementation Phases | |||
Phase | Duration | Activities | Success Metrics |
|---|---|---|---|
Phase 0 — Baseline | Week 1-2 (pre-pilot) | Measure current CDI query volume, sepsis denial rate, average days-to-query-response, κ score for existing multi-provider encounters (sampled) | Documented baseline for all target metrics |
Phase 1 — 14-Day Pilot | Week 3-4 | Deploy Scribing.io with IRR engine on a single unit (recommended: Medicine/Hospitalist service with ED admits). Linter runs in advisory mode (flags but does not gate). CDI team reviews all flags. | Linter flag accuracy ≥90%; clinician acceptance of prompted reconciliation ≥70%; zero workflow disruption complaints from physicians |
Phase 2 — Gating Activation | Week 5-8 | Enable pre-commit gating (notes held in draft if κ < threshold or unresolved contradictions exist). Expand to 2-3 additional units. CDI team monitors query volume reduction. | CDI query volume ↓ 40-60% on pilot units; sepsis denial rate ↓; κ scores ≥0.75 on 85%+ of multi-provider encounters |
Phase 3 — Enterprise Rollout | Week 9-16 | Deploy across all inpatient and ED services. Integrate Provenance resources with existing compliance and audit workflows. Train CDI staff on IRR dashboard interpretation. | Enterprise-wide CDI query reduction ≥50%; sepsis-specific denial rate <5%; full FHIR Provenance coverage on all AI-authored notes |
Phase 4 — Continuous Calibration | Ongoing | Monthly κ score reporting by unit, provider, and condition. Prompt template refinement based on flag patterns. Annual recalibration against updated ICD-10-CM guidelines and CMS policy changes. | κ drift <0.05 per quarter; prompt template revision cycle <30 days for new CMS guidance; zero unresolved linter flags at 48 hours |
Start Here
See a live run of our Inter-Rater Reliability dashboard with EHR-embedded inconsistency linter and FHIR Provenance audio-to-note traceability—pre-mapped to sepsis prompts and A41.9/R65.20—deployed as a 14-day pilot on your EHR. Request a pilot at Scribing.io.
Documentation inconsistency is not a training problem. It is not a query volume problem. It is not a coder problem. It is an architecture problem—and it requires an architectural solution. The IRR engine, the pre-commit linter, and FHIR-native provenance are that solution. The CDI director who deploys them stops chasing inconsistencies after the fact and starts preventing them at the point of generation.


