Tessa Herman, MD1, Giovanni Roldan Carrillo, MD2, Jesse Fletcher, MD2, Anders Westanmo, PharmD, MBA3, Amy Gravely, MA4, Mohammad Bilal, MD, FACG5, Daniela Guerrero Vinsard, MD6, Brian Hanson, MD6 1Vanderbilt University, Nashville, TN; 2University of Minnesota Medical School, Minneapolis, MN; 3Minneapolis VA Health Care System, Minneapolis, MN; 4Minneapolis VA Medical Center, Research Service, Minneapolis, MN; 5University of Colorado Anschutz Medical Campus, Denver, CO; 6University of Minnesota and Minneapolis VA Health Care System, Minneapolis, MN Introduction: Artificial intelligence (AI)-assisted colonoscopy for polyp detection is increasingly prevalent in colonoscopy practices. Every AI system is built on a proprietary algorithm using a unique primary dataset yielding variable performance characteristics. As a result, AI systems inherently perform differently, and these performance differences may affect important patient outcomes. Randomized trials and observational cohorts of AI-assisted colonoscopy versus non-AI abound but there is a dearth of head-to-head studies evaluating AI system performance. Methods: The aim of our prospective, single-center, pragmatic implementation trial was to perform a 90-day comparative analysis of two AI systems: System A (8/20-11/18/2024) and System B (11/19/2024-2/17/2025). We included all screening, surveillance, and positive fecal immunochemical test colonoscopies. The primary outcome was adenoma detection rate (ADR), which included both tubular adenomas and sessile serrated lesions (SSL). Secondary outcomes included the detection rates of tubular adenomas alone, advanced adenomas, SSL, advanced SSL, hyperplastic polyps, and other benign, non-adenomatous lesions, as well as withdrawal time and polyps per colonoscopy (PPC). Results: We evaluated 560 colonoscopies: 357 with System A and 203 with System B. Patient and procedure characteristics were similar between groups (Table 1). A non-significant trend toward higher ADR was seen with System B (80.3% vs 76.2%, p=0.26, Figure 1). System A resulted 3.8 PPC versus System B 2.9 (p=0.005). A notable but non-significant trend in the direction of lower detection rates of benign, non-adenomatous lesions was observed with System B. The systems performed comparably on all other secondary outcomes. Discussion: System A and System B performed similarly in the real-world setting. ADR for both systems was exceptionally high and while not statistically significant in this population the absolute difference of 4.1% (B > A) may be clinically relevant. System B also resulted in lower rates of benign, non-neoplastic detection rates. This result was not statistically significant, but the lower rate of needless resection is likely clinically meaningful and may also explain the lower PPC for System B. Our study suggests flexibility in choosing a system, leaving room for practical considerations (e.g. user preferences and cost). Our study also invites investigators to look beyond ADR and into the specificity for adenoma detection.
Figure: Table 1: Patient and Procedural Demographics Separated by Artificial Intelligence (AI)-Assisted Colonoscopy System – The patient and procedural demographics between System A and System B were similar.
Figure: Figure 1: System A vs System B Polyp Detection Rates– System A and System B had similar detection rates of all polyp types: adenoma detection rate (ADR), tubular ADR, advanced ADR, sessile serrated lesion detection rate (SSLDR), advanced SSLDR, hyperplastic detection rate, and benign, non-adenomatous, non-hyperplastic pathology detection rates.
Disclosures: Tessa Herman indicated no relevant financial relationships. Giovanni Roldan Carrillo indicated no relevant financial relationships. Jesse Fletcher indicated no relevant financial relationships. Anders Westanmo indicated no relevant financial relationships. Amy Gravely indicated no relevant financial relationships. Mohammad Bilal: Boston Scientific – Consultant. Cook endoscopy – Paid speaker. Steris Endoscopy – Consultant. Daniela Guerrero Vinsard indicated no relevant financial relationships. Brian Hanson indicated no relevant financial relationships.
Tessa Herman, MD1, Giovanni Roldan Carrillo, MD2, Jesse Fletcher, MD2, Anders Westanmo, PharmD, MBA3, Amy Gravely, MA4, Mohammad Bilal, MD, FACG5, Daniela Guerrero Vinsard, MD6, Brian Hanson, MD6. P0521 - Battle of the Bots: Real-World Comparison of Artificial Intelligence-Assisted Colonoscopy Systems Reveals Competitive Performance, ACG 2025 Annual Scientific Meeting Abstracts. Phoenix, AZ: American College of Gastroenterology.