Posted on

Jun 16, 2026

How Scribing.io Works: A Complete Walkthrough for Health System CIOs

Health system CIO reviewing a walkthrough of Scribing.io clinical documentation platform integration with EHR systems
Health system CIO reviewing a walkthrough of Scribing.io clinical documentation platform integration with EHR systems

Clinical Update — June 2026: This guide has been revised to reflect CMS's updated G2211 audit guidance published in the CY 2026 OPPS/ASC Final Rule, athenahealth's expanded FHIR R4 Bulk Data endpoints, and Epic's November 2025 release enabling Composition write-back in Hyperdrive deployments. Section 3 (Clinical Decision Logic) has been substantially rewritten to address the increased denial rate for G2211 claims documented in AMA CPT Editorial Panel guidance published Q1 2026.

How Scribing.io Works: The Clinical Library Playbook for CMIOs

TL;DR: This walkthrough details how Scribing.io's Retrieval-Augmented Generation (RAG) pipeline—mapped to each clinician's historical "gold standard" notes—produces audit-resilient, style-matched documentation that generic ambient scribes cannot. It covers the end-to-end clinical logic from audio capture through FHIR write-back, explains why Medicare's G2211 add-on code demands longitudinal-relationship language that most AI scribes omit, provides ICD-10 documentation standards for two of the most common primary-care conditions, and gives CMIOs a reproducible deployment framework. If you read one section, read §3 — Clinical Decision Logic.

Contents

  • 1. What Competitors Miss: The Information-Gain Gap in Ambient AI Scribes

  • 2. How Scribing.io Works: End-to-End Pipeline Architecture

  • 3. Clinical Decision Logic: The Noisy-Hallway, Dual-Chronic 99215 + G2211 Scenario

  • 4. Technical Reference: ICD-10 Documentation Standards

  • 5. CMIO Deployment Framework

  • 6. See It in Your EHR

1. What Competitors Miss: The Information-Gain Gap in Ambient AI Scribes

The AMA's coverage of Kaiser Permanente's ambient scribe rollout captures a real sentiment: the technology "feels like magic." But the article—and the broader vendor narrative it represents—glosses over two make-or-break realities that every CMIO confronts in a production EHR environment. Scribing.io was architected specifically around these failure modes, not as an afterthought patched onto a general-purpose transcription model.

Scribing.io's core architecture—Retrieval-Augmented Generation mapped to each provider's verified historical notes—exists because generic transcription-to-template pipelines break down at exactly the points where revenue integrity and audit survival matter most. Three gaps define the problem space.

Gap 1: G2211 Audit Survival Requires Longitudinal-Relationship Language

Medicare's visit-complexity add-on code G2211, finalized in the CY 2024 Medicare PFS Final Rule, was designed to recognize the cognitive load of managing patients with ongoing, complex problems across multiple visits. CMS's own audit contractors have flagged three documentation elements that must be explicit in the note—not merely inferable:

  1. An ongoing, longitudinal patient relationship (not merely "established patient" status).

  2. Problem-focused complexity that extends beyond the single encounter.

  3. A clear connection between today's decisions and the arc of prior care.

Generic ambient scribes transcribe what was said. They do not prompt what was not said. If the internist treating a long-term diabetic patient never verbalizes "I have managed this patient's diabetes over the past four years with progressive therapy adjustments," the note will lack the longitudinal cue—and the claim will fail audit.

Scribing.io's approach: The RAG pipeline retrieves prior encounters from the provider's gold standard corpus where G2211 language was present and clean-billed. When the system detects a G2211-eligible scenario, it surfaces just-in-time prompts in the clinician's own nomenclature—for instance, "You typically document longitudinal management language here—confirm?" The clinician decides. Scribing.io never auto-inserts billing language. This is documentation fidelity, not upcoding.

Gap 2: FHIR Write-Back Is Not "Solved"

The AMA article notes that the biggest challenge has been "fine-tuning the ambient scribes for specialty-specific workflows." What it does not address is the technical write-back problem. Many production EHRs—including certain Epic configurations and athenahealth API endpoints—restrict FHIR Composition writes. An ambient scribe that can generate a note but cannot write it back as structured, auditable data is functionally incomplete.

Scribing.io uses a standards-safe fallback: a US Core R4 DocumentReference resource with sectioned, LOINC-labeled headings. Discrete data (diagnoses, conditions) are upserted via Condition and Encounter links with verificationStatus controls (provisional vs. confirmed), and every action carries author Provenance. The note is not a blob of text pasted into a free-text field—it is queryable, auditable, structured clinical data.

Gap 3: Audio Quality in Real Clinics

The AMA piece never mentions audio. Primary-care encounters happen in noisy hallways, shared exam rooms, and telehealth sessions with inconsistent connectivity. Published research on clinical speech recognition error rates consistently shows degradation above 20% word error rate in high-ambient-noise settings. Scribing.io's audio pipeline diarizes (separates speakers) and denoises before transcription, so that decision statements are time-stamped. This protects time-based E/M logic and prolonged-services billing without fabricating non-verbalized reasoning.

Capability Comparison: Generic Ambient Scribe vs. Scribing.io

Capability

Generic Ambient Scribe

Scribing.io (RAG-Mapped Pipeline)

Transcription

Single-model speech-to-text

Diarized, denoised, time-stamped

Style Matching

Specialty templates (vendor-managed)

RAG mapped to each clinician's historical gold standard notes

G2211 Longitudinal Cues

Not addressed

Just-in-time prompts in provider's own nomenclature

FHIR Write-Back

Vendor-dependent; often free-text paste

US Core R4 DocumentReference + Condition/Encounter links with Provenance

Discrete Data Upsert

Limited or manual

Automated with verificationStatus controls (provisional → confirmed)

Time-Based E/M Protection

Not documented

Time-stamped decision segments from diarized audio

Audit Resilience

Dependent on clinician review

Structured by design: LOINC sections, author Provenance, coded Conditions

2. How Scribing.io Works: End-to-End Pipeline Architecture

Tracing data from the moment a clinician opens a patient encounter to the final signed note in the EHR reveals five discrete stages. Each is designed to preserve clinical fidelity, protect reimbursement logic, and match the provider's documentation identity.

Stage 1: Audio Capture, Diarization, and Denoising

When the clinician initiates a session—via mobile device, desktop microphone, or integrated EHR widget—the audio stream enters Scribing.io's real-time processing layer.

  • Speaker diarization separates the clinician's voice from the patient's (and any third parties: interpreters, family, consulting specialists).

  • Environmental denoising removes background noise common in clinical settings: HVAC systems, hallway chatter, equipment alarms, paging systems.

  • Segment time-stamping tags every utterance with start/end times relative to the encounter. This is critical for time-based E/M documentation (99214 vs. 99215 thresholds) and prolonged-services logic under AMA's E/M guidelines.

The output of Stage 1 is not a monolithic transcript. It is a structured, speaker-attributed, time-indexed dialogue that serves as the evidentiary backbone for all downstream processing.

Stage 2: Retrieval-Augmented Generation (RAG) Against Gold Standard Notes

This is Scribing.io's core differentiator and the embodiment of its architectural thesis.

Rather than generating documentation from a generic LLM prompt, Scribing.io retrieves contextually similar passages from the specific clinician's prior notes—notes that have been vetted, signed, and successfully billed. These historical notes form the provider's "gold standard" corpus.

The RAG pipeline executes three operations:

  1. Embeds the current encounter's transcript segments into a vector space.

  2. Retrieves the most semantically similar sections from the provider's gold standard corpus—matching on clinical context, not just keyword overlap.

  3. Generates a draft note that mirrors the provider's existing documentation style, preferred nomenclature, section ordering, and level of detail.

A provider who always writes "Patient has been under my longitudinal care for management of their chronic conditions including…" will see that exact phrasing surfaced in the draft. A provider who prefers terse, problem-oriented SOAP notes gets terse, problem-oriented SOAP notes. The model does not impose a house style; it learns your style.

Stage 3: Clinical Intelligence Layer — Code Eligibility and Gap Detection

Before the draft reaches the clinician, two critical checks execute:

A. Documentation gap detection. The system compares the draft against documentation requirements for the anticipated E/M level. If the transcript supports a 99215 but the note lacks the explicit language required for G2211, the system flags the gap and suggests language drawn from the provider's own historical notes. The clinician decides whether to include it. Scribing.io never auto-inserts billing language—a design constraint rooted in OIG anti-kickback and false-claims compliance.

B. ICD-10 consistency verification. The system cross-references mentioned conditions against the patient's existing problem list and ensures that coded diagnoses reach maximum specificity. Details are in §4 — ICD-10 Documentation Standards.

Stage 4: Clinician Review and Attestation

The draft note is presented within the clinician's EHR workflow. Scribing.io supports in-EHR embedded review via Epic integration and athenahealth API connections, plus a standalone review interface for EHRs with limited widget support.

  • Highlighted decision moments linked to time-stamped audio, so the clinician can verify that a documented clinical decision was actually verbalized.

  • Gap flags with one-click accept/dismiss for longitudinal cues, complexity language, and diagnostic specificity.

  • Side-by-side comparison with the provider's most recent similar encounter note for rapid quality assurance.

Stage 5: FHIR Write-Back and Discrete Data Upsert

Once the clinician signs, Scribing.io writes data back using the most capable available standard:

  • Primary path: FHIR Composition resource (when the EHR supports it, including Epic Hyperdrive post-November 2025).

  • Fallback path: US Core R4 DocumentReference with LOINC-labeled sections (e.g., 18776-5 Plan of Treatment, 29299-5 Reason for Visit).

  • Discrete data: Condition resources linked to the Encounter, each with explicit verificationStatus (provisional for newly mentioned, confirmed for provider-affirmed). MedicationRequest, Observation, and other resource types upserted as appropriate.

  • Provenance: Every resource carries an author Provenance resource attributing content to the signing clinician, with the AI system listed as an assembler—not an author.

Scribing.io Pipeline: Stage-by-Stage Walkthrough

Stage

Input

Process

Output

1. Audio Capture

Raw encounter audio

Diarization, denoising, time-stamping

Speaker-attributed, time-indexed transcript

2. RAG Generation

Transcript + provider's gold standard corpus

Vector retrieval → style-matched draft generation

Draft note in clinician's own documentation style

3. Clinical Intelligence

Draft note + billing requirements + problem list

Gap detection (G2211, E/M level, ICD-10 specificity)

Flagged draft with suggested clinician-voice language

4. Clinician Review

Flagged draft + time-stamped audio links

Human attestation, accept/dismiss flags

Signed, attested clinical note

5. FHIR Write-Back

Signed note + discrete data

DocumentReference + Condition/Encounter + Provenance upsert

Structured, auditable EHR data

3. Clinical Decision Logic: The Noisy-Hallway, Dual-Chronic 99215 + G2211 Scenario

This section details the exact clinical scenario that CMIOs will recognize as a recurring revenue-integrity pain point—and shows how Scribing.io resolves it end to end.

The Scenario

A primary-care internist in a noisy hallway visit sees a 68-year-old patient with Type 2 diabetes mellitus (E11.9) and essential hypertension (I10). The visit involves medication adjustment, review of recent lab work, and discussion of lifestyle modifications. The encounter lasts 32 minutes total medical decision-making time, supporting a 99215. The internist has managed this patient for six years. The visit clearly meets G2211 criteria—clinically. But the dictated note lacks three things:

  1. Explicit "longitudinal relationship" language.

  2. Precise decision-time segments.

  3. Connection between today's adjustments and the multi-year treatment arc.

Result without Scribing.io: repeated denials of 99215 + G2211. The practice loses the G2211 add-on (~$16.05/claim at the 2026 national rate) and risks down-coding the 99215 itself.

Step-by-Step: How Scribing.io Solves This

Step 1 — Audio Processing in the Noisy Hallway

The encounter is captured via the clinician's lapel microphone. Background noise—hallway foot traffic, overhead paging—is stripped by the denoising layer. Speaker diarization identifies the internist, the patient, and a medical assistant who briefly enters to report a blood pressure reading. Each speaker's utterances are time-stamped.

Key transcript segments flagged by the system:

  • [Provider, 00:03:12–00:04:47]: Discussion of A1C results and decision to increase metformin.

  • [Provider, 00:08:22–00:10:15]: Review of home blood pressure log and decision to add amlodipine.

  • [Provider, 00:14:40–00:16:55]: Lifestyle counseling regarding sodium reduction and exercise.

  • [Provider, 00:22:10–00:24:30]: Coordination discussion about upcoming ophthalmology referral for diabetic retinopathy screening.

Total provider decision-making time captured: 32 minutes and 18 seconds. This exceeds the 40-minute threshold when combined with pre-visit chart review time (documented separately), firmly supporting 99215.

Step 2 — RAG Retrieval from Gold Standard Corpus

The RAG pipeline identifies this encounter as semantically similar to 47 prior encounters in this internist's gold standard corpus—specifically, established-patient visits involving diabetes management with comorbid hypertension. The system retrieves documentation patterns including:

  • The internist's preferred Assessment phrasing: "Type 2 diabetes mellitus, currently suboptimally controlled as evidenced by A1C of [value], which I have managed longitudinally since [year] with progressive pharmacotherapy adjustments."

  • The internist's Plan structure: problem-oriented, each condition addressed sequentially with medication changes, monitoring intervals, and return-visit rationale.

  • LOINC section ordering the internist has historically used: Chief Complaint → HPI → Review of Systems → Exam → Assessment → Plan.

The draft is generated in the internist's voice, not a vendor template voice.

Step 3 — Gap Detection Fires

The clinical intelligence layer compares the draft against G2211 documentation requirements and detects two gaps:

Gap A — Longitudinal relationship language is absent from the transcript. The internist did not verbalize "I have been managing your diabetes for six years." The system surfaces a prompt: "Your prior notes for this patient include: 'I have managed this patient's Type 2 diabetes longitudinally since 2020 with progressive therapy adjustments including lifestyle intervention, metformin titration, and most recently consideration of GLP-1 RA therapy.' Include similar language? [Accept] [Dismiss]"

Gap B — The connection between today's metformin increase and the multi-year treatment arc is implicit, not explicit. The system prompts: "Your gold standard notes for medication escalation typically include: 'This represents the next step in our longitudinal treatment plan, having previously optimized [prior interventions].' Include? [Accept] [Dismiss]"

Both prompts draw from the internist's own signed, billed notes. No language is fabricated. The clinician's documentation identity is preserved.

Step 4 — ICD-10 Specificity Check

The system verifies that the note supports the highest appropriate specificity for both conditions:

  • E11.9 — Type 2 diabetes mellitus without complications. But the transcript mentions "ophthalmology referral for diabetic retinopathy screening." The system flags: "Transcript references diabetic retinopathy screening. If retinopathy has been diagnosed, E11.319 (Type 2 DM with unspecified diabetic retinopathy without macular edema) or more specific code may be appropriate. Current problem list shows E11.9. Confirm or update?"

  • I10 — Essential (primary) hypertension. Confirmed as appropriate; no secondary cause or hypertensive crisis language is present in the transcript.

Step 5 — Clinician Review

The internist opens the draft in Epic's Hyperdrive interface. Three flags are visible, each with a one-click accept/dismiss toggle and a link to the time-stamped audio segment that triggered the flag. The internist:

  1. Accepts Gap A (longitudinal language), verifying it reflects clinical truth.

  2. Accepts Gap B (treatment-arc connection).

  3. Dismisses the E11.319 suggestion, confirming that retinopathy has not yet been diagnosed—the referral is for screening only.

Total review time: 2 minutes and 40 seconds. Compare this to the JAMA Internal Medicine finding that physicians spend an average of 16 minutes per encounter on documentation outside of the visit.

Step 6 — FHIR Write-Back

The signed note is written back to Epic as a Composition resource (Hyperdrive-enabled). Discrete data upserts:

  • Condition: E11.9 (verificationStatus: confirmed), linked to Encounter.

  • Condition: I10 (verificationStatus: confirmed), linked to Encounter.

  • MedicationRequest: Metformin 1000mg BID (updated from 500mg BID).

  • MedicationRequest: Amlodipine 5mg daily (new).

  • ServiceRequest: Ophthalmology referral for diabetic retinopathy screening.

  • Provenance: Author = Dr. [Internist]; Assembler = Scribing.io AI Pipeline v4.2.

The note's LOINC-sectioned structure, discrete coded Conditions, and author Provenance create a record that can survive a CMS CERT audit without requiring the clinician to reconstruct the encounter from memory months later.

4. Technical Reference: ICD-10 Documentation Standards

The two conditions in the above scenario—I10 — Essential (primary) hypertension; E11.9 — Type 2 diabetes mellitus without complications—represent the single most common diagnostic pairing in primary-care medicine. They are also the most frequently under-specified, leading to preventable denials and inaccurate risk-adjustment scoring.

I10 — Essential (Primary) Hypertension

I10 is appropriate when hypertension is primary (not secondary to renal disease, endocrine disorder, or other identifiable cause) and there is no documented hypertensive crisis. Documentation must include:

  • Confirmation of the diagnosis (not merely elevated blood pressure readings).

  • Current treatment regimen.

  • Control status (controlled, uncontrolled, or resistant).

Scribing.io's gap detection flags cases where the note describes medication changes for blood pressure management but uses only "high blood pressure" or "HTN" without linking to the I10 diagnosis on the problem list. Per CDC/NCHS ICD-10-CM coding guidelines, the diagnosis must be stated by the provider; lab values alone are insufficient.

E11.9 — Type 2 Diabetes Mellitus Without Complications

E11.9 is the default code when Type 2 diabetes is documented without any associated complications. However, this code is often over-used: if the patient has documented neuropathy (E11.40–E11.49), nephropathy (E11.21–E11.29), retinopathy (E11.31x–E11.35x), or other complications, the note must reflect the more specific code to support accurate CMS Hierarchical Condition Category (HCC) risk adjustment.

Scribing.io's pipeline cross-references the transcript against the patient's existing problem list, recent lab results (when accessible via FHIR), and the provider's dictated assessment. If the provider mentions complications or referrals suggesting complications, the system prompts for specificity—never auto-coding, always deferring to the clinician's judgment.

ICD-10 Specificity Escalation: Diabetes Example

Documentation Cue in Transcript

Default Code

Specificity Prompt

Potential Specific Code

"Diabetes, no complications mentioned"

E11.9

None required

E11.9

"Referring for retinopathy screening"

E11.9

"Has retinopathy been diagnosed? If yes, specify type."

E11.319 (if unspecified retinopathy confirmed)

"Urine microalbumin elevated"

E11.9

"Lab suggests nephropathy. Has diabetic kidney disease been diagnosed?"

E11.21 (if diabetic nephropathy confirmed)

"Numbness in feet, monofilament test abnormal"

E11.9

"Exam suggests neuropathy. Confirm diagnosis?"

E11.40 (if diabetic neuropathy, unspecified, confirmed)

5. CMIO Deployment Framework

Rolling out Scribing.io is not a "plug it in and go" exercise. The RAG pipeline depends on building each clinician's gold standard corpus, and the FHIR write-back must be configured per EHR environment. The following framework reflects production deployments across multi-specialty groups.

Phase 1: Gold Standard Corpus Curation (Weeks 1–3)

  1. Identify 30–50 signed notes per provider that represent clean-billed encounters across the provider's top 10 visit types by volume.

  2. Validate coding accuracy on selected notes with the billing/compliance team. Remove any notes with post-submission corrections or audit flags.

  3. Ingest into the RAG vector store. Notes are chunked by LOINC section (not arbitrary token windows), embedded, and indexed.

Phase 2: EHR Integration Configuration (Weeks 2–4, Overlapping)

  1. Determine write-back capability: Composition (Epic Hyperdrive, select Cerner Millennium builds) vs. DocumentReference fallback (athenahealth, legacy Epic, others).

  2. Map LOINC section codes to EHR-specific note templates. Scribing.io provides a default mapping based on LOINC Document Ontology; CMIOs customize for local conventions.

  3. Configure Provenance attribution per institutional policy: the signing clinician is always the author; Scribing.io is the assembler.

  4. Test discrete data upsert in the sandbox environment: Condition, MedicationRequest, ServiceRequest resources with verificationStatus workflows confirmed.

Phase 3: Pilot and Calibration (Weeks 4–8)

  1. Select 5–10 pilot providers across specialties. Primary care is the recommended starting point due to high G2211 eligibility volume.

  2. Run parallel documentation for the first two weeks: providers sign both their manual note and the Scribing.io-generated draft. Compliance reviews both for accuracy, style match, and coding consistency.

  3. Calibrate gap-detection sensitivity. Some providers find the initial prompting frequency too high. Threshold tuning is done per-provider based on their denial history and documentation completeness baseline.

  4. Measure: Documentation time per encounter, G2211 claim acceptance rate, denial rate delta, provider satisfaction (SUS or custom scale).

Phase 4: Scaled Rollout (Weeks 8–16)

  1. Expand to full provider cohort in phased waves (20–30 providers per wave).

  2. Activate ongoing corpus enrichment: Each signed note is evaluated for inclusion in the provider's gold standard corpus, keeping the RAG retrieval current with evolving documentation patterns.

  3. Monitor audit resilience: Track payer audit outcomes on Scribing.io-documented encounters vs. historical baseline. Share results with the Revenue Cycle team quarterly.

Deployment Timeline Summary

Phase

Duration

Key Deliverable

Owner

1. Corpus Curation

Weeks 1–3

30–50 validated gold standard notes per provider

CMIO + Compliance

2. EHR Integration

Weeks 2–4

FHIR write-back configured and sandbox-tested

IT/Informatics + Scribing.io

3. Pilot

Weeks 4–8

Parallel documentation validated, thresholds calibrated

Pilot providers + CMIO

4. Scaled Rollout

Weeks 8–16

Full cohort live, ongoing corpus enrichment active

CMIO + Operations

6. See It in Your EHR

Reading a playbook is not the same as watching the pipeline process one of your own provider's encounters. The scenario above—the noisy-hallway 99215 + G2211 with dual-chronic complexity—is available as a live walkthrough using your EHR's sandbox environment.

Book a 15-minute demo to see our Epic/Cerner FHIR write-back with LOINC-tagged sections, real-time G2211 eligibility prompts, and audit-ready Provenance mapped to your own "gold standard" notes. Your informatics team is welcome on the call; we will show the DocumentReference, Condition, and Provenance resources in FHIR format, not slides.

→ Schedule at Scribing.io

Still not sure? Book a free discovery call now.

Frequently

asked question

Answers to your asked queries

Can we get started today?

Can I edit or review notes before they go into my EHR?

Does Scribing.io work with telehealth and video visits?

Is Scribing.io HIPAA compliant?

Is patient data used to train your AI models?

Still not sure? Book a free discovery call now.

Frequently

asked question

Answers to your asked queries

Can we get started today?

Can I edit or review notes before they go into my EHR?

Does Scribing.io work with telehealth and video visits?

Is Scribing.io HIPAA compliant?

Is patient data used to train your AI models?

Still not sure? Book a free discovery call now.

Frequently

asked question

Answers to your asked queries

Can we get started today?

Can I edit or review notes before they go into my EHR?

Does Scribing.io work with telehealth and video visits?

Is Scribing.io HIPAA compliant?

Is patient data used to train your AI models?

Clinical Precision.
Zero Documentation Debt

Finish Your Charts - Go Home on Time.

Clinical Precision.
Zero Documentation Debt

Finish Your Charts - Go Home on Time.

Clinical Precision.
Zero Documentation Debt

Finish Your Charts - Go Home on Time.