A Randomized Controlled Trial of Ambient Artificial Intelligence Scribe Technologies (AIScribe RCT)

April 22, 2026 updated by: John N. Mafi, MD, MPH, University of California, Los Angeles

A Randomized Controlled Trial of Two Ambient Artificial Intelligence Scribe Technologies to Improve Documentation Efficiency and Reduce Physician Burnout

This is a three-arm pragmatic RCT of 238 outpatient physicians at a large academic health system, randomized 1:1:1 to one of two AI scribe tools or a usual-care control group. The two-month study will observe and compare the effects of each tool prior to system-wide roll out of selected tool (anticipated Spring 2025). We will use covariate-constrained randomization to balance the arms in terms of physician baseline time in notes, survey-measured level of burnout, and clinic days per week.

The primary purpose of the initiative is to improve quality, efficiency, and business operations at University of California, Los Angeles (UCLA) Health, and this initiative is not being done for research purposes. The results of this operational initiative will inform the widespread roll out of AI scribe tools across all providers within the UCLA Health System. Nevertheless, the UCLA study team plans to rigorously examine and publish the impact of this intervention across the health system, which is why the study team pre-registered the initiative.

Study Overview

Detailed Description

This study will assess operational-oriented outcomes across all groups. Notably, all groups will eventually receive all interventions over time in this observational study of a randomized roll out of a QI initiative. Moreover, the primary purpose of this initiative is operational. In other words, based on the results of this initiative, one of these tools will be eventually selected and operationalized widely across the health system.

Enrolled participants are randomized to one of three groups. Randomization was needed to overcome secular trends, seasonal and holiday effects in December, and other factors confounding the relationship between exposure to the AI tools and the outcomes.

The primary aim of this study is to evaluate the impact of two ambient AI scribe technologies on clinician change from baseline time spent on EHR documentation, comparing each scribe to a control group. Secondary objectives include assessing the AI scribes' impact on clinician metrics such as burnout, physician satisfaction, and productivity. Additionally, the study team intends to perform an economic evaluation analysis of the tools to guide business decision making. The study team will also analyze physician reported effects of the AI tools on patient safety, equity, and any unintended consequences of the initiative.

Study Type

Interventional

Enrollment (Actual)

238

Phase

  • Not Applicable

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Locations

    • California
      • Los Angeles, California, United States, 90024
        • UCLA Health System

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

  • Child
  • Adult
  • Older Adult

Accepts Healthy Volunteers

No

Description

Inclusion Criteria:

  • Ambulatory care physicians within the UCLA Health system who held at least one half-day of clinic per week

Exclusion Criteria:

  • Trainee providers (e.g., residents, medical students) and allied healthcare professionals (e.g., RNs, PAs)
  • Attendings who work exclusively with trainees

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

  • Primary Purpose: Health Services Research
  • Allocation: Randomized
  • Interventional Model: Single Group Assignment
  • Masking: Single

Arms and Interventions

Participant Group / Arm
Intervention / Treatment
Other: Nabla, Vendor of virtual AI scribe technology
Participants in this arm will utilize AI scribe tools from Nabla and will continue their usual clinical documentation processes, supported by the scribe software, which integrates with the EHR and automatically adds the generated text to the note. The Nabla AI scribe tool is transcriptional and does not provide clinical decision support.

AI Scribe technologies capture physician-patient conversations to create a transcript, then summarize the transcript in the form of a clinical notes. These tools are integrated into the EHR and automatically adds the generated text to the provider note. All physicians must inform patients about the recording and obtain their verbal consent, and instances of patients declining to consent are tracked.

Nabla leverages its proprietary speech-to-text to transform the conversation into a written context, combined with HIPAA compliant Large Language Models (LLM) like Azure OpenAI's GPT-4. Nabla does not store any audio.

Other: Vendor B of virtual AI scribe technology
Participants in this arm will utilize AI scribe tools from Vendor B and will continue their usual clinical documentation processes, supported by the scribe software, which integrates with the EHR and automatically adds the generated text to the note. The AI scribe tool is transcriptional and does not provide clinical decision support.
AI Scribe technologies capture physician-patient conversations to create a transcript, then summarize the transcript in the form of a clinical notes. These tools are integrated into the EHR and automatically adds the generated text to the provider note. All physicians must inform patients about the recording and obtain their verbal consent, and instances of patients declining to consent are tracked.
No Intervention: No Scribe
Participants in this arm will not have access to AI scribe tools and will continue their usual clinical documentation processes

What is the study measuring?

Primary Outcome Measures

Outcome Measure
Measure Description
Time Frame
Change in the Time in Notes Per Note
Time Frame: Study month 2
The primary outcome measure is the change in provider mean time in notes per note in the second month of the trial from the providers baseline mean time in notes per note for the six months prior to enrollment. This change will be computed on the natural log scale. No patient level information will be collected for this outcome measure.
Study month 2

Secondary Outcome Measures

Outcome Measure
Measure Description
Time Frame
Provider Burnout Score
Time Frame: Study month 2
The Mini Z 2.0 Survey is a validated 10-item instrument designed to measure key factors influencing workplace satisfaction and burnout among healthcare professionals. Each item is scored on a Likert scale (1-5), with higher scores generally indicating more positive outcomes - greater job satisfaction, sufficiency of time for electronic medical record documentation, and lower levels of stress. For negatively framed items (e.g., stress due to the job or frustration with the electronic medical record), higher scores indicate lower levels of dissatisfaction. The total score ranges from 10 to 50, with scores ≥40 representing a joyful workplace. No patient level information will be collected for this outcome measure.
Study month 2
Change in EHR Signal (Activity) Data - Pajama Time
Time Frame: Study month 2
We will examine change from a retrospective baseline 6 months prior to enrollment in Signal metrics including pajama time per scheduled day. Using this data will determine how a providers time is utilized in the EHR. No patient level information will be collected for this outcome measure.
Study month 2
Provider Task Load Score
Time Frame: Study month 2
Provider task load adapted from the NASA Task Load Index (TLX), a validated tool for assessing perceived workload across six sub-scales: mental demand, physical demand, temporal demand, performance, effort, and frustration. For this study, we adapted the TLX to focus on note-writing workload, including four sub-scales (mental demand, temporal demand, physical demand, and effort) as done previously. Each sub-scale is rated from 0 (low task load) to 100 (high task load) and summed together for a total score scale of 0 (low task load) to 400 (high task load), lower is better. No patient level information will be collected for this outcome measure.
Study month 2
Provider Professional Fulfillment
Time Frame: Study month 2
The Professional Fulfillment Index (PFI) is a validated 16-item instrument that uses a 5-point Likert scale (0-4) to measure professional fulfillment, work exhaustion, and interpersonal disengagement. For this study, we utilize the 4-item work exhaustion subscale which is a mean of the 4-items within that subscale, where a low score (0) indicates a lower level of exhaustion and a high (4) score indicates greater level of exhaustion. No patient level information will be collected for this outcome measure.
Study month 2
Number of Physicians Who Are Considered Detractors, Passive, or Promoters
Time Frame: Study month 2
Self-reported satisfaction survey that asks physicians to consider note accuracy, patient safety, equity, and other potential unintended consequences and rate their overall likelihood to recommend use of the tool on a 1-10 scale. Higher scores (10) indicate greater satisfaction and likelihood to recommend, whereas lower scores (1) indicate dissatisfaction and unlikelihood to recommend. Providers are grouped as "Promoters" if they respond 9-10, "Passive" if they respond 7-8, and "Detractors" if they respond with a value less than or equal to 6. This grouping matches commonly accepted "Net Promoter Score" groupings. No patient level information will be collected for this outcome measure.
Study month 2
Change in Provider RVU
Time Frame: Study month 2
The study team will use physician-level billing information via RVU to determine their change in productivity from a retrospective baseline 6 months prior to enrollment. No patient level information will be collected for this outcome measure.
Study month 2
Change in EHR Signal (Activity) Data - Time Outside Scheduled Hours
Time Frame: Study month 2
We will examine change from a retrospective baseline 6 months prior to enrollment in Signal metrics including time outside scheduled hours per scheduled day. Using this data will determine how a providers time is utilized in the EHR. No patient level information will be collected for this outcome measure.
Study month 2
Change in EHR Signal (Activity) Data - Time on Unscheduled Days
Time Frame: Study month 2
We will examine change from a retrospective baseline 6 months prior to enrollment in Signal metrics including time spent in the system on unscheduled days where . Using this data will determine how a providers time is utilized in the EHR. No patient level information will be collected for this outcome measure.
Study month 2

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Publications and helpful links

The person responsible for entering information about the study voluntarily provides these publications. These may be about anything related to the study.

General Publications

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Actual)

November 4, 2024

Primary Completion (Actual)

January 3, 2025

Study Completion (Actual)

January 15, 2025

Study Registration Dates

First Submitted

December 9, 2024

First Submitted That Met QC Criteria

January 24, 2025

First Posted (Actual)

January 27, 2025

Study Record Updates

Last Update Posted (Actual)

April 24, 2026

Last Update Submitted That Met QC Criteria

April 22, 2026

Last Verified

April 1, 2026

More Information

Terms related to this study

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

NO

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

No

Studies a U.S. FDA-regulated device product

No

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.

Clinical Trials on Physician Workflow

Clinical Trials on Use Nabla AI Scribe tool provided

  • University of Washington
    Completed
    Physician Work Environment | Physician Burnout | Artificial Intelligence (AI)
    United States
Subscribe