Formal Methods

Solving MaxSAT Problems from Natural Language Descriptions with LLMs and PySAT @ LLM-Solve @ FLoC 2026

In this talk, we will present a framework for solving MaxSAT problems expressed in natural language by combining large language models with the PySAT toolkit. We will show how LLMs …

Pedro Orvalho

• Jul 19, 2026 • 1 min read

Artificial Intelligence

📄🤖 2 Papers accepted @ EPIA 2026!! 🎉🎉

Excited to share that two of our papers have been accepted at EPIA 2026, covering the robustness of LLMs for code understanding and neuro-symbolic feedback for Vision-Language …

Pedro Orvalho

• Jul 18, 2026 • 1 min read

LLMs for Code Understanding

Large Language Models Are Not (Yet) Robust in Understanding Code Against Semantics-Preserving Mutations

In this paper we assess whether SOTA LLMs can reason about Python programs or are simply guessing. We apply five semantics-preserving code mutations, which maintain program …

Pedro Orvalho

• Jul 13, 2026 • 1 min read

Logic Programming

📄📄📄 3 Papers accepted @ FLoC 2026!! 🎉🎉🎉

Excited to share that three of our papers have been accepted at FLoC 2026, covering automated feedback for Prolog education, data-driven mutation testing for Prolog, and …

Pedro Orvalho

• May 30, 2026 • 2 min read

Large Language Models

Solving MaxSAT Problems from Natural Language Descriptions with LLMs and PySAT

In this paper, we study a neuro-symbolic approach in which an LLM translates a natural language description of an optimisation problem into executable Python code using PySAT. The …

Pedro Orvalho

• May 27, 2026 • 1 min read

LLMs for Code Understanding

From Brittle LLM Code Reasoning to MaxSAT-Based Verified Repairs @ UCL

In this talk, we examine the limitations of Large Language Models (LLMs) in semantic code reasoning, showing that their predictions may change under semantics-preserving code …

Pedro Orvalho

• May 20, 2026 • 1 min read

Software Verification

Towards Assessing and Repairing LLM-Generated Code via Model Checking and MaxSAT-Based Fault Localisation @ Dagstuhl 2026

LLMs for code often lack true semantic understanding, evidenced by their instability under semantics-preserving transformations, and we address this by integrating formal methods …

Pedro Orvalho

• May 5, 2026 • 1 min read

Neuro-Symbolic AI

PyVeritas: On Verifying Python via LLM-Based Transpilation and Bounded Model Checking for C @ P-AI-FM@AAAI 2026

In this talk I will present PyVeritas, a novel framework that leverages Large Language Models (LLMs) for high-level transpilation from Python to C, followed by bounded model …

Pedro Orvalho

• Jan 26, 2026 • 1 min read

Neuro-Symbolic AI

PyVeritas: On Verifying Python via LLM-Based Transpilation and Bounded Model Checking for C

In this paper, we propose PyVeritas, a novel framework that leverages Large Language Models (LLMs) for high-level transpilation from Python to C, followed by bounded model checking …

Pedro Orvalho

• Jan 26, 2026 • 1 min read

Software Verification

📄 Paper accepted @ the Post-AI Formal Methods Workshop @ AAAI 2026! 🎉

I’m excited to share that our paper on PyVeritas has been accepted to the P-AI-FM-26 Workshop @ AAAI 2026! 📄

Pedro Orvalho

• Nov 10, 2025 • 1 min read

Automated Program Repair

MENTOR: Automated Feedback for Introductory Programming Exercises

This PhD thesis presents MENTOR, a semantic automated program repair (APR) framework designed to provide Automated Feedback for Introductory Programming Exercises.

Pedro Orvalho

• Apr 10, 2025 • 1 min read

Large Language Models

Counterexample Guided Program Repair Using Zero-Shot Learning and MaxSAT-based Fault Localization

In this paper, we propose a novel approach that combines the strengths of both FM-based fault localization and LLMs, via zero-shot learning, to enhance APR for IPAs. Our method …

Pedro Orvalho

• Feb 25, 2025 • 1 min read

Formula-Based Fault Localisation

Model-Based Diagnosis for Software

Localising system faults has long been recognised as one of the most time-consuming and costly tasks in software engineering. Given a buggy system, fault localisation (FL) refers …

Pedro Orvalho

• Feb 1, 2025 • 4 min read

Large Language Models

📄 Paper accepted @ AAAI 2025!! 🎉

I am very happy to share that our paper that combines the strengths of both MaxSAT-based fault localisation and Large Language Models, via zero-shot learning, to enhance automated …

Pedro Orvalho

• Dec 15, 2024 • 1 min read

MENTOR

Automated Feedback for Introductory Programming Exercises

Delivering valuable and personalised feedback to students remains one of the greatest challenges in programming education, particularly in courses with large enrollments. Providing …

Pedro Orvalho

• Oct 1, 2024 • 3 min read

Formal Methods

CFaults: Model-Based Diagnosis for Fault Localization in C with Multiple Test Cases @ FM 2024

In this talk I will introduce a novel fault localization approach for C programs with multiple faults. CFaults leverages Model-Based Diagnosis (MBD) with multiple observations and …

Pedro Orvalho

• Sep 12, 2024 • 1 min read