Counterexample Guided Program Repair Using Large Language Models and MaxSAT-based Fault Localization

Name: Counterexample Guided Program Repair Using Large Language Models and MaxSAT-based Fault Localization
Start: 2025-06-23T12:00:00Z
End: 2025-06-23T13:00:00Z
Location: Instituto de Investigación en Inteligencia Artificial (IIIA), Consejo Superior de Investigaciones Científicas (CSIC)

Jun 23, 2025·

Pedro Orvalho

· 0 min read

Slides Video project paper

Abstract

Debugging and repair are costly in both software development and programming education. This talk presents two approaches that combine formal methods with large language models (LLMs) to improve accuracy and scalability. First, we introduce CFaults, a MaxSAT-based fault localization tool for C programs that ensures consistent, subset-minimal diagnoses across multiple failing tests. It outperforms existing tools like BugAssist and SNIPER in speed and precision. Second, we apply a hybrid method to automated repair of C code, using MaxSAT, CFaults, to localize bugs and LLMs to replace the faulty code via a counterexample-guided loop. This approach produces more targeted and effective repairs than existing symbolic or LLM-only methods, improving outcomes across over 1,400 incorrect student submissions. Together, these methods show how combining formal reasoning with generative models enables more efficient and precise debugging and repair.

Date

Jun 23, 2025 12:00 PM — 1:00 PM

Event

IIIA Seminars

Location

Instituto de Investigación en Inteligencia Artificial (IIIA), Consejo Superior de Investigaciones Científicas (CSIC)

IIIA-CSIC, Barcelona, Spain.

Last updated on Jun 23, 2025

LLM-Based Program Repair Automated Program Repair Fault Localisation Maximum Satisfiability Large Language Models Model-Based Diagnosis AI4SE