Posted on
Jun 16, 2026
How Scribing.io Works: A Complete Walkthrough for Health System CIOs
Clinical Update — June 2026: This guide has been revised to reflect CMS's updated G2211 audit guidance published in the CY 2026 OPPS/ASC Final Rule, athenahealth's expanded FHIR R4 Bulk Data endpoints, and Epic's November 2025 release enabling Composition write-back in Hyperdrive deployments. Section 3 (Clinical Decision Logic) has been substantially rewritten to address the increased denial rate for G2211 claims documented in AMA CPT Editorial Panel guidance published Q1 2026.
How Scribing.io Works: The Clinical Library Playbook for CMIOs
TL;DR: This walkthrough details how Scribing.io's Retrieval-Augmented Generation (RAG) pipeline—mapped to each clinician's historical "gold standard" notes—produces audit-resilient, style-matched documentation that generic ambient scribes cannot. It covers the end-to-end clinical logic from audio capture through FHIR write-back, explains why Medicare's G2211 add-on code demands longitudinal-relationship language that most AI scribes omit, provides ICD-10 documentation standards for two of the most common primary-care conditions, and gives CMIOs a reproducible deployment framework. If you read one section, read §3 — Clinical Decision Logic.
Contents
1. What Competitors Miss: The Information-Gain Gap in Ambient AI Scribes
2. How Scribing.io Works: End-to-End Pipeline Architecture
3. Clinical Decision Logic: The Noisy-Hallway, Dual-Chronic 99215 + G2211 Scenario
4. Technical Reference: ICD-10 Documentation Standards
5. CMIO Deployment Framework
6. See It in Your EHR
1. What Competitors Miss: The Information-Gain Gap in Ambient AI Scribes
The AMA's coverage of Kaiser Permanente's ambient scribe rollout captures a real sentiment: the technology "feels like magic." But the article—and the broader vendor narrative it represents—glosses over two make-or-break realities that every CMIO confronts in a production EHR environment. Scribing.io was architected specifically around these failure modes, not as an afterthought patched onto a general-purpose transcription model.
Scribing.io's core architecture—Retrieval-Augmented Generation mapped to each provider's verified historical notes—exists because generic transcription-to-template pipelines break down at exactly the points where revenue integrity and audit survival matter most. Three gaps define the problem space.
Gap 1: G2211 Audit Survival Requires Longitudinal-Relationship Language
Medicare's visit-complexity add-on code G2211, finalized in the CY 2024 Medicare PFS Final Rule, was designed to recognize the cognitive load of managing patients with ongoing, complex problems across multiple visits. CMS's own audit contractors have flagged three documentation elements that must be explicit in the note—not merely inferable:
An ongoing, longitudinal patient relationship (not merely "established patient" status).
Problem-focused complexity that extends beyond the single encounter.
A clear connection between today's decisions and the arc of prior care.
Generic ambient scribes transcribe what was said. They do not prompt what was not said. If the internist treating a long-term diabetic patient never verbalizes "I have managed this patient's diabetes over the past four years with progressive therapy adjustments," the note will lack the longitudinal cue—and the claim will fail audit.
Scribing.io's approach: The RAG pipeline retrieves prior encounters from the provider's gold standard corpus where G2211 language was present and clean-billed. When the system detects a G2211-eligible scenario, it surfaces just-in-time prompts in the clinician's own nomenclature—for instance, "You typically document longitudinal management language here—confirm?" The clinician decides. Scribing.io never auto-inserts billing language. This is documentation fidelity, not upcoding.
Gap 2: FHIR Write-Back Is Not "Solved"
The AMA article notes that the biggest challenge has been "fine-tuning the ambient scribes for specialty-specific workflows." What it does not address is the technical write-back problem. Many production EHRs—including certain Epic configurations and athenahealth API endpoints—restrict FHIR Composition writes. An ambient scribe that can generate a note but cannot write it back as structured, auditable data is functionally incomplete.
Scribing.io uses a standards-safe fallback: a US Core R4 DocumentReference resource with sectioned, LOINC-labeled headings. Discrete data (diagnoses, conditions) are upserted via Condition and Encounter links with verificationStatus controls (provisional vs. confirmed), and every action carries author Provenance. The note is not a blob of text pasted into a free-text field—it is queryable, auditable, structured clinical data.
Gap 3: Audio Quality in Real Clinics
The AMA piece never mentions audio. Primary-care encounters happen in noisy hallways, shared exam rooms, and telehealth sessions with inconsistent connectivity. Published research on clinical speech recognition error rates consistently shows degradation above 20% word error rate in high-ambient-noise settings. Scribing.io's audio pipeline diarizes (separates speakers) and denoises before transcription, so that decision statements are time-stamped. This protects time-based E/M logic and prolonged-services billing without fabricating non-verbalized reasoning.
Capability Comparison: Generic Ambient Scribe vs. Scribing.io | ||
Capability | Generic Ambient Scribe | Scribing.io (RAG-Mapped Pipeline) |
|---|---|---|
Transcription | Single-model speech-to-text | Diarized, denoised, time-stamped |
Style Matching | Specialty templates (vendor-managed) | RAG mapped to each clinician's historical gold standard notes |
G2211 Longitudinal Cues | Not addressed | Just-in-time prompts in provider's own nomenclature |
FHIR Write-Back | Vendor-dependent; often free-text paste | US Core R4 DocumentReference + Condition/Encounter links with Provenance |
Discrete Data Upsert | Limited or manual | Automated with verificationStatus controls (provisional → confirmed) |
Time-Based E/M Protection | Not documented | Time-stamped decision segments from diarized audio |
Audit Resilience | Dependent on clinician review | Structured by design: LOINC sections, author Provenance, coded Conditions |
2. How Scribing.io Works: End-to-End Pipeline Architecture
Tracing data from the moment a clinician opens a patient encounter to the final signed note in the EHR reveals five discrete stages. Each is designed to preserve clinical fidelity, protect reimbursement logic, and match the provider's documentation identity.
Stage 1: Audio Capture, Diarization, and Denoising
When the clinician initiates a session—via mobile device, desktop microphone, or integrated EHR widget—the audio stream enters Scribing.io's real-time processing layer.
Speaker diarization separates the clinician's voice from the patient's (and any third parties: interpreters, family, consulting specialists).
Environmental denoising removes background noise common in clinical settings: HVAC systems, hallway chatter, equipment alarms, paging systems.
Segment time-stamping tags every utterance with start/end times relative to the encounter. This is critical for time-based E/M documentation (99214 vs. 99215 thresholds) and prolonged-services logic under AMA's E/M guidelines.
The output of Stage 1 is not a monolithic transcript. It is a structured, speaker-attributed, time-indexed dialogue that serves as the evidentiary backbone for all downstream processing.
Stage 2: Retrieval-Augmented Generation (RAG) Against Gold Standard Notes
This is Scribing.io's core differentiator and the embodiment of its architectural thesis.
Rather than generating documentation from a generic LLM prompt, Scribing.io retrieves contextually similar passages from the specific clinician's prior notes—notes that have been vetted, signed, and successfully billed. These historical notes form the provider's "gold standard" corpus.
The RAG pipeline executes three operations:
Embeds the current encounter's transcript segments into a vector space.
Retrieves the most semantically similar sections from the provider's gold standard corpus—matching on clinical context, not just keyword overlap.
Generates a draft note that mirrors the provider's existing documentation style, preferred nomenclature, section ordering, and level of detail.
A provider who always writes "Patient has been under my longitudinal care for management of their chronic conditions including…" will see that exact phrasing surfaced in the draft. A provider who prefers terse, problem-oriented SOAP notes gets terse, problem-oriented SOAP notes. The model does not impose a house style; it learns your style.
Stage 3: Clinical Intelligence Layer — Code Eligibility and Gap Detection
Before the draft reaches the clinician, two critical checks execute:
A. Documentation gap detection. The system compares the draft against documentation requirements for the anticipated E/M level. If the transcript supports a 99215 but the note lacks the explicit language required for G2211, the system flags the gap and suggests language drawn from the provider's own historical notes. The clinician decides whether to include it. Scribing.io never auto-inserts billing language—a design constraint rooted in OIG anti-kickback and false-claims compliance.
B. ICD-10 consistency verification. The system cross-references mentioned conditions against the patient's existing problem list and ensures that coded diagnoses reach maximum specificity. Details are in §4 — ICD-10 Documentation Standards.
Stage 4: Clinician Review and Attestation
The draft note is presented within the clinician's EHR workflow. Scribing.io supports in-EHR embedded review via Epic integration and athenahealth API connections, plus a standalone review interface for EHRs with limited widget support.
Highlighted decision moments linked to time-stamped audio, so the clinician can verify that a documented clinical decision was actually verbalized.
Gap flags with one-click accept/dismiss for longitudinal cues, complexity language, and diagnostic specificity.
Side-by-side comparison with the provider's most recent similar encounter note for rapid quality assurance.
Stage 5: FHIR Write-Back and Discrete Data Upsert
Once the clinician signs, Scribing.io writes data back using the most capable available standard:
Primary path: FHIR
Compositionresource (when the EHR supports it, including Epic Hyperdrive post-November 2025).Fallback path: US Core R4
DocumentReferencewith LOINC-labeled sections (e.g.,18776-5Plan of Treatment,29299-5Reason for Visit).Discrete data:
Conditionresources linked to theEncounter, each with explicitverificationStatus(provisionalfor newly mentioned,confirmedfor provider-affirmed).MedicationRequest,Observation, and other resource types upserted as appropriate.Provenance: Every resource carries an author
Provenanceresource attributing content to the signing clinician, with the AI system listed as an assembler—not an author.
Scribing.io Pipeline: Stage-by-Stage Walkthrough | |||
Stage | Input | Process | Output |
|---|---|---|---|
1. Audio Capture | Raw encounter audio | Diarization, denoising, time-stamping | Speaker-attributed, time-indexed transcript |
2. RAG Generation | Transcript + provider's gold standard corpus | Vector retrieval → style-matched draft generation | Draft note in clinician's own documentation style |
3. Clinical Intelligence | Draft note + billing requirements + problem list | Gap detection (G2211, E/M level, ICD-10 specificity) | Flagged draft with suggested clinician-voice language |
4. Clinician Review | Flagged draft + time-stamped audio links | Human attestation, accept/dismiss flags | Signed, attested clinical note |
5. FHIR Write-Back | Signed note + discrete data | DocumentReference + Condition/Encounter + Provenance upsert | Structured, auditable EHR data |
3. Clinical Decision Logic: The Noisy-Hallway, Dual-Chronic 99215 + G2211 Scenario
This section details the exact clinical scenario that CMIOs will recognize as a recurring revenue-integrity pain point—and shows how Scribing.io resolves it end to end.
The Scenario
A primary-care internist in a noisy hallway visit sees a 68-year-old patient with Type 2 diabetes mellitus (E11.9) and essential hypertension (I10). The visit involves medication adjustment, review of recent lab work, and discussion of lifestyle modifications. The encounter lasts 32 minutes total medical decision-making time, supporting a 99215. The internist has managed this patient for six years. The visit clearly meets G2211 criteria—clinically. But the dictated note lacks three things:
Explicit "longitudinal relationship" language.
Precise decision-time segments.
Connection between today's adjustments and the multi-year treatment arc.
Result without Scribing.io: repeated denials of 99215 + G2211. The practice loses the G2211 add-on (~$16.05/claim at the 2026 national rate) and risks down-coding the 99215 itself.
Step-by-Step: How Scribing.io Solves This
Step 1 — Audio Processing in the Noisy Hallway
The encounter is captured via the clinician's lapel microphone. Background noise—hallway foot traffic, overhead paging—is stripped by the denoising layer. Speaker diarization identifies the internist, the patient, and a medical assistant who briefly enters to report a blood pressure reading. Each speaker's utterances are time-stamped.
Key transcript segments flagged by the system:
[Provider, 00:03:12–00:04:47]: Discussion of A1C results and decision to increase metformin.
[Provider, 00:08:22–00:10:15]: Review of home blood pressure log and decision to add amlodipine.
[Provider, 00:14:40–00:16:55]: Lifestyle counseling regarding sodium reduction and exercise.
[Provider, 00:22:10–00:24:30]: Coordination discussion about upcoming ophthalmology referral for diabetic retinopathy screening.
Total provider decision-making time captured: 32 minutes and 18 seconds. This exceeds the 40-minute threshold when combined with pre-visit chart review time (documented separately), firmly supporting 99215.
Step 2 — RAG Retrieval from Gold Standard Corpus
The RAG pipeline identifies this encounter as semantically similar to 47 prior encounters in this internist's gold standard corpus—specifically, established-patient visits involving diabetes management with comorbid hypertension. The system retrieves documentation patterns including:
The internist's preferred Assessment phrasing: "Type 2 diabetes mellitus, currently suboptimally controlled as evidenced by A1C of [value], which I have managed longitudinally since [year] with progressive pharmacotherapy adjustments."
The internist's Plan structure: problem-oriented, each condition addressed sequentially with medication changes, monitoring intervals, and return-visit rationale.
LOINC section ordering the internist has historically used: Chief Complaint → HPI → Review of Systems → Exam → Assessment → Plan.
The draft is generated in the internist's voice, not a vendor template voice.
Step 3 — Gap Detection Fires
The clinical intelligence layer compares the draft against G2211 documentation requirements and detects two gaps:
Gap A — Longitudinal relationship language is absent from the transcript. The internist did not verbalize "I have been managing your diabetes for six years." The system surfaces a prompt: "Your prior notes for this patient include: 'I have managed this patient's Type 2 diabetes longitudinally since 2020 with progressive therapy adjustments including lifestyle intervention, metformin titration, and most recently consideration of GLP-1 RA therapy.' Include similar language? [Accept] [Dismiss]"
Gap B — The connection between today's metformin increase and the multi-year treatment arc is implicit, not explicit. The system prompts: "Your gold standard notes for medication escalation typically include: 'This represents the next step in our longitudinal treatment plan, having previously optimized [prior interventions].' Include? [Accept] [Dismiss]"
Both prompts draw from the internist's own signed, billed notes. No language is fabricated. The clinician's documentation identity is preserved.
Step 4 — ICD-10 Specificity Check
The system verifies that the note supports the highest appropriate specificity for both conditions:
E11.9 — Type 2 diabetes mellitus without complications. But the transcript mentions "ophthalmology referral for diabetic retinopathy screening." The system flags: "Transcript references diabetic retinopathy screening. If retinopathy has been diagnosed, E11.319 (Type 2 DM with unspecified diabetic retinopathy without macular edema) or more specific code may be appropriate. Current problem list shows E11.9. Confirm or update?"
I10 — Essential (primary) hypertension. Confirmed as appropriate; no secondary cause or hypertensive crisis language is present in the transcript.
Step 5 — Clinician Review
The internist opens the draft in Epic's Hyperdrive interface. Three flags are visible, each with a one-click accept/dismiss toggle and a link to the time-stamped audio segment that triggered the flag. The internist:
Accepts Gap A (longitudinal language), verifying it reflects clinical truth.
Accepts Gap B (treatment-arc connection).
Dismisses the E11.319 suggestion, confirming that retinopathy has not yet been diagnosed—the referral is for screening only.
Total review time: 2 minutes and 40 seconds. Compare this to the JAMA Internal Medicine finding that physicians spend an average of 16 minutes per encounter on documentation outside of the visit.
Step 6 — FHIR Write-Back
The signed note is written back to Epic as a Composition resource (Hyperdrive-enabled). Discrete data upserts:
Condition: E11.9 (verificationStatus:confirmed), linked to Encounter.Condition: I10 (verificationStatus:confirmed), linked to Encounter.MedicationRequest: Metformin 1000mg BID (updated from 500mg BID).MedicationRequest: Amlodipine 5mg daily (new).ServiceRequest: Ophthalmology referral for diabetic retinopathy screening.Provenance: Author = Dr. [Internist]; Assembler = Scribing.io AI Pipeline v4.2.
The note's LOINC-sectioned structure, discrete coded Conditions, and author Provenance create a record that can survive a CMS CERT audit without requiring the clinician to reconstruct the encounter from memory months later.
4. Technical Reference: ICD-10 Documentation Standards
The two conditions in the above scenario—I10 — Essential (primary) hypertension; E11.9 — Type 2 diabetes mellitus without complications—represent the single most common diagnostic pairing in primary-care medicine. They are also the most frequently under-specified, leading to preventable denials and inaccurate risk-adjustment scoring.
I10 — Essential (Primary) Hypertension
I10 is appropriate when hypertension is primary (not secondary to renal disease, endocrine disorder, or other identifiable cause) and there is no documented hypertensive crisis. Documentation must include:
Confirmation of the diagnosis (not merely elevated blood pressure readings).
Current treatment regimen.
Control status (controlled, uncontrolled, or resistant).
Scribing.io's gap detection flags cases where the note describes medication changes for blood pressure management but uses only "high blood pressure" or "HTN" without linking to the I10 diagnosis on the problem list. Per CDC/NCHS ICD-10-CM coding guidelines, the diagnosis must be stated by the provider; lab values alone are insufficient.
E11.9 — Type 2 Diabetes Mellitus Without Complications
E11.9 is the default code when Type 2 diabetes is documented without any associated complications. However, this code is often over-used: if the patient has documented neuropathy (E11.40–E11.49), nephropathy (E11.21–E11.29), retinopathy (E11.31x–E11.35x), or other complications, the note must reflect the more specific code to support accurate CMS Hierarchical Condition Category (HCC) risk adjustment.
Scribing.io's pipeline cross-references the transcript against the patient's existing problem list, recent lab results (when accessible via FHIR), and the provider's dictated assessment. If the provider mentions complications or referrals suggesting complications, the system prompts for specificity—never auto-coding, always deferring to the clinician's judgment.
ICD-10 Specificity Escalation: Diabetes Example | |||
Documentation Cue in Transcript | Default Code | Specificity Prompt | Potential Specific Code |
|---|---|---|---|
"Diabetes, no complications mentioned" | E11.9 | None required | E11.9 |
"Referring for retinopathy screening" | E11.9 | "Has retinopathy been diagnosed? If yes, specify type." | E11.319 (if unspecified retinopathy confirmed) |
"Urine microalbumin elevated" | E11.9 | "Lab suggests nephropathy. Has diabetic kidney disease been diagnosed?" | E11.21 (if diabetic nephropathy confirmed) |
"Numbness in feet, monofilament test abnormal" | E11.9 | "Exam suggests neuropathy. Confirm diagnosis?" | E11.40 (if diabetic neuropathy, unspecified, confirmed) |
5. CMIO Deployment Framework
Rolling out Scribing.io is not a "plug it in and go" exercise. The RAG pipeline depends on building each clinician's gold standard corpus, and the FHIR write-back must be configured per EHR environment. The following framework reflects production deployments across multi-specialty groups.
Phase 1: Gold Standard Corpus Curation (Weeks 1–3)
Identify 30–50 signed notes per provider that represent clean-billed encounters across the provider's top 10 visit types by volume.
Validate coding accuracy on selected notes with the billing/compliance team. Remove any notes with post-submission corrections or audit flags.
Ingest into the RAG vector store. Notes are chunked by LOINC section (not arbitrary token windows), embedded, and indexed.
Phase 2: EHR Integration Configuration (Weeks 2–4, Overlapping)
Determine write-back capability:
Composition(Epic Hyperdrive, select Cerner Millennium builds) vs.DocumentReferencefallback (athenahealth, legacy Epic, others).Map LOINC section codes to EHR-specific note templates. Scribing.io provides a default mapping based on LOINC Document Ontology; CMIOs customize for local conventions.
Configure Provenance attribution per institutional policy: the signing clinician is always the author; Scribing.io is the assembler.
Test discrete data upsert in the sandbox environment:
Condition,MedicationRequest,ServiceRequestresources withverificationStatusworkflows confirmed.
Phase 3: Pilot and Calibration (Weeks 4–8)
Select 5–10 pilot providers across specialties. Primary care is the recommended starting point due to high G2211 eligibility volume.
Run parallel documentation for the first two weeks: providers sign both their manual note and the Scribing.io-generated draft. Compliance reviews both for accuracy, style match, and coding consistency.
Calibrate gap-detection sensitivity. Some providers find the initial prompting frequency too high. Threshold tuning is done per-provider based on their denial history and documentation completeness baseline.
Measure: Documentation time per encounter, G2211 claim acceptance rate, denial rate delta, provider satisfaction (SUS or custom scale).
Phase 4: Scaled Rollout (Weeks 8–16)
Expand to full provider cohort in phased waves (20–30 providers per wave).
Activate ongoing corpus enrichment: Each signed note is evaluated for inclusion in the provider's gold standard corpus, keeping the RAG retrieval current with evolving documentation patterns.
Monitor audit resilience: Track payer audit outcomes on Scribing.io-documented encounters vs. historical baseline. Share results with the Revenue Cycle team quarterly.
Deployment Timeline Summary | |||
Phase | Duration | Key Deliverable | Owner |
|---|---|---|---|
1. Corpus Curation | Weeks 1–3 | 30–50 validated gold standard notes per provider | CMIO + Compliance |
2. EHR Integration | Weeks 2–4 | FHIR write-back configured and sandbox-tested | IT/Informatics + Scribing.io |
3. Pilot | Weeks 4–8 | Parallel documentation validated, thresholds calibrated | Pilot providers + CMIO |
4. Scaled Rollout | Weeks 8–16 | Full cohort live, ongoing corpus enrichment active | CMIO + Operations |
6. See It in Your EHR
Reading a playbook is not the same as watching the pipeline process one of your own provider's encounters. The scenario above—the noisy-hallway 99215 + G2211 with dual-chronic complexity—is available as a live walkthrough using your EHR's sandbox environment.
Book a 15-minute demo to see our Epic/Cerner FHIR write-back with LOINC-tagged sections, real-time G2211 eligibility prompts, and audit-ready Provenance mapped to your own "gold standard" notes. Your informatics team is welcome on the call; we will show the DocumentReference, Condition, and Provenance resources in FHIR format, not slides.


