LLM-Driven Automated Program Repair Using MaxSAT-based Fault Localization

Abstract
Debugging and repair are costly in both software development and programming education. In this talk I will present two approaches that combine formal methods with large language models (LLMs) to improve accuracy and scalability. First, we introduce CFaults, a MaxSAT-based fault localization tool for C programs that ensures consistent, subset-minimal diagnoses across multiple failing tests. It outperforms existing tools like BugAssist and SNIPER in speed and precision. Second, we apply a hybrid method to automated repair of C code, using MaxSAT, CFaults, to localize bugs and LLMs to replace the faulty code via a counterexample-guided loop. This approach produces more targeted and effective repairs than existing symbolic or LLM-only methods, improving outcomes across over 1,400 incorrect student submissions. Together, these methods show how combining formal reasoning with generative models enables more efficient and precise debugging and repair.
Date
Dec 4, 2024 9:00 AM — 3:00 PM
Event
QAVAS Meeting, University of Oxford
Location
Department of Computer Science, University of Oxford
Wolfson Building, Parks Rd, Oxford, UK. (Online)