Preventing Medication Dispensing Errors in Pharmacy Practice With Interpretable Machine Intelligence

November 12, 2025 updated by: Corey Lester, University of Michigan
Pharmacists currently perform an independent double-check to identify drug-selection errors before they can reach the patient. However, the use of machine intelligence (MI) to support this cognitive decision-making work by pharmacists does not exist in practice. This research is being conducted to examine the effectiveness of the timing of machine intelligence (MI) advice on to determine if it results in lower task time, increased accuracy, and increased trust in the MI.

Study Overview

Detailed Description

Pharmacists currently perform an independent double-check currently to identify drug-selection errors before they can reach the patient. However, the use of machine intelligence (MI) to support this cognitive decision-making work by pharmacists does not exist in practice. Instead, pharmacists rely solely on reference images of the medication which they can compare to the prescription vial contents. Previous research has shown that decision support systems can effectively improve healthcare delivery efficiency and accuracy, while preventing adverse drug events. However, little is known about how MI technologies impact pharmacists' work performance and cognitive demand.

To facilitate the long-term symbiotic relationship between the pharmacists and the MI system, proper trust needs to be established. While trust has been identified as the central factor for effective human-machine teaming, issues arise when humans place unjustified trust in automated technologies do not place enough trust in them. Over trust in automation can lead to complacency and automation bias. For instance, the pharmacists may rely on the MI system to the extent that they blindly accept any recommendation by the system. Under trust can result in pharmacist disuse and potential abandonment of the MI system.

Furthermore, little is known about the timing of the MI advice on pharmacists' work performance. For example, showing the MI's advice while the pharmacist is performing the medication verification task may yield different results than showing the MI's advice after the pharmacist made their decision.

The study investigators have developed a MI system for medication images classification. The objective of this study is to examine the effectiveness of the timing of MI advice to determine if it results in lower task time, increased accuracy, and increased trust in the MI.

Study Type

Interventional

Enrollment (Actual)

68

Phase

  • Not Applicable

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Locations

    • Michigan
      • Ann Arbor, Michigan, United States, 48109
        • University of Michigan

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

  • Adult
  • Older Adult

Accepts Healthy Volunteers

No

Description

Inclusion Criteria:

  • Licensed pharmacist in the United States
  • Age 18 years and older at screening
  • PC/Laptop with Microsoft Windows 10 or Mac (Macbook, iMac) with MacOS with Google Chrome, Edge, Opera, Safari, or Firefox web browser installed on the device
  • Screen resolution of 1024x968 pixels or more
  • A laptop integrated webcam or USB webcam is also required for the eye tracking purpose.

Exclusion Criteria:

  • Participated in Wave 1 or Wave 2
  • Eyeglasses
  • Uncorrected cataracts, intraocular implants, glaucoma, or permanently dilated pupil
  • Require a screen reader/magnifier or other assistive technology to use the computer
  • Eye movement or alignment abnormalities (lazy eye, strabismus, nystagmus)

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

  • Primary Purpose: Other
  • Allocation: Randomized
  • Interventional Model: Crossover Assignment
  • Masking: None (Open Label)

Arms and Interventions

Participant Group / Arm
Intervention / Treatment
Experimental: No MI Help
No MI help will be presented during the verification tasks
Participants will complete the medication verification task without any MI help
Participants will receive MI in the form of a pop-up message if their decision differs from the MI's determination.
MI help will be displayed concurrently with the filled and reference images.
Experimental: Scenario #1
MI help will be presented in the form of a pop-up message the participant's decision differs from the MI's determination.
Participants will complete the medication verification task without any MI help
Participants will receive MI in the form of a pop-up message if their decision differs from the MI's determination.
MI help will be displayed concurrently with the filled and reference images.
Experimental: Scenario #2
MI help will be displayed concurrently with the filled and reference images.
Participants will complete the medication verification task without any MI help
Participants will receive MI in the form of a pop-up message if their decision differs from the MI's determination.
MI help will be displayed concurrently with the filled and reference images.

What is the study measuring?

Primary Outcome Measures

Outcome Measure
Measure Description
Time Frame
Reaction Time
Time Frame: Throughout the verification task
Difference in task time measured by the number of seconds from starting the task to accepting or rejecting a medication image
Throughout the verification task
Decision Accuracy
Time Frame: Throughout the verification task
Difference in detection rate measured by the number of medication verification errors across all participants in the Arm/Group.
Throughout the verification task
Trust Change
Time Frame: After every trial in Scenarios 1 and 2

Participants will complete 100 mock medication verification trials in each of the study arms (i.e., Scenario 1, Scenario 2, and No Help). After each trial in Scenario 1 and Scenario 2, participants will use a visual analog scale (VAS) to respond to the question: "How much do you trust the AI advice?" The endpoints of the 100-point VAS are 'Not at all' to 'Completely trust'. Participants indicate their level of trust in the MI advice after every trial on a scale from 1-100, with higher scores indicating greater levels of trust.

The trust change, as measured by the visual analog scale, will be calculated using the following formula:

Trust change (i) = Trust(i) - Trust(i - 1), where i=2, 3, ..., 100.

To compute a single, summarized value for the Trust Change variable within a specific scenario, the individual Trust Change scores measured from the trials are averaged. This averaging method provides a comprehensive measure of how trust shifted across the duration of the scenario.

After every trial in Scenarios 1 and 2
Trust
Time Frame: Post-intervention in Scenarios 1 and 2.
Trust will be assessed using the Muir & Moray's (1996) Trust in Automation scale. Scores range from 0 to 100 with higher scores indicating greater levels of trust.
Post-intervention in Scenarios 1 and 2.

Secondary Outcome Measures

Outcome Measure
Measure Description
Time Frame
Cognitive Effort
Time Frame: Throughout the verification task
Participants' eye movements were tracked using a browser-based online eye tracking system. The outcome measure is the difference in cognitive effort as measured by fixation count in the defined areas of interest: fill image, reference image, or MI plot. Higher fixation rates indicate repeated interest in a certain area.
Throughout the verification task
Cognitive Effort
Time Frame: Throughout the verification task
Participants' eye movements were tracked using a browser-based online eye tracking system. The outcome measure is the difference in cognitive effort as measured by the duration of fixations in the defined areas of interest: fill image, reference image, or MI plot. Longer fixation duration indicates a higher cognitive load.
Throughout the verification task
Workload
Time Frame: After completing 100 mock verification trials in each arm

Participants will complete 100 mock medication verification trials in each of the 3 arms. The workload of each arm will be measured by the NASA Task Load Index (TLX). The 5 TLX dimensions assessed are: mental demand, effort, temporal demand, performance, and frustration. For each dimension, participants will indicate their response to a single question. For 4 of the dimensions, the endpoints of the Likert scale are 'very low' and 'very high'. The performance dimension is reverse-scored, and the endpoints are 'perfect' and 'failure'. Participants then complete 10 pairwise comparisons of the dimensions by indicating which dimension they consider to be a more important factor (e.g., effort vs frustration).

Each category score multiplied by its respective pairwise comparison count is summed and divided by 10 to get an overall weighted workload score. The result is an overall workload score between 1 and 20, with higher scores indicating higher workload.

After completing 100 mock verification trials in each arm
Usability
Time Frame: After completing 100 mock verification trials in each arm
Participants will complete 100 mock medication verification trials in each of the 3 arms (No MI Help, Scenario 1, and Scenario 2). After completing 100 trials, participants will assess the mock verification interface using the System Usability Scale (SUS). The SUS is comprised of 10 statements that participants indicate their agreement with using a 5-point Likert scale ranging from strongly agree to strongly disagree. Odd-numbered questions have a positive response and even-numbered questions are reverse-scored. Scores are summed and multiplied by 2.5 to get a final SUS score. SUS scores range from 0 to 100 with higher scores indicating greater usability. An average SUS score is considered to be 68. Anything below 50 is "Not Acceptable. Scores between 51-70 are considered "Marginal", those above 71 are considered "Acceptable", and those at 80 or above are indicative of high usability.
After completing 100 mock verification trials in each arm

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Investigators

  • Principal Investigator: Corey A Lester, PharmD, PhD, University of Michigan

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Actual)

April 11, 2024

Primary Completion (Actual)

December 4, 2024

Study Completion (Actual)

December 4, 2024

Study Registration Dates

First Submitted

January 5, 2024

First Submitted That Met QC Criteria

February 5, 2024

First Posted (Actual)

February 7, 2024

Study Record Updates

Last Update Posted (Actual)

November 26, 2025

Last Update Submitted That Met QC Criteria

November 12, 2025

Last Verified

September 1, 2025

More Information

Terms related to this study

Other Study ID Numbers

  • HUM00241223
  • 5R01LM013624 (U.S. NIH Grant/Contract)

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

No

Studies a U.S. FDA-regulated device product

No

product manufactured in and exported from the U.S.

No

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.

Clinical Trials on Machine Intelligence in the Pharmacy

Clinical Trials on No MI Help

Subscribe