MaxSAT-Based Fault Localisation

Large Language Models Are Not (Yet) Robust in Understanding Code Against Semantics-Preserving Mutations

In this paper, we assess whether state-of-the-art (SOTA) LLMs can reason about Python programs or are simply guessing. We apply five semantics-preserving code mutations, which maintain program …

Pedro Orvalho

• Jul 13, 2026 • 1 min read

LLMs for Code Understanding

From Brittle LLM Code Reasoning to MaxSAT-Based Verified Repairs @ UCL

In this talk, we examine the limitations of Large Language Models (LLMs) in semantic code reasoning, showing that their predictions may change under semantics-preserving code …

Pedro Orvalho

• May 20, 2026 • 1 min read

Software Verification

Towards Assessing and Repairing LLM-Generated Code via Model Checking and MaxSAT-Based Fault Localisation @ Dagstuhl 2026

LLMs for code often lack true semantic understanding, as evidenced by their instability under semantics-preserving transformations. We address this by integrating formal methods …

Pedro Orvalho

• May 5, 2026 • 1 min read

Neuro-Symbolic AI

PyVeritas: On Verifying Python via LLM-Based Transpilation and Bounded Model Checking for C @ P-AI-FM@AAAI 2026

In this talk, I will present PyVeritas, a novel framework that leverages Large Language Models (LLMs) for high-level transpilation from Python to C, followed by bounded model …

Pedro Orvalho

• Jan 26, 2026 • 1 min read

Neuro-Symbolic AI

PyVeritas: On Verifying Python via LLM-Based Transpilation and Bounded Model Checking for C

In this paper, we propose PyVeritas, a novel framework that leverages Large Language Models (LLMs) for high-level transpilation from Python to C, followed by bounded model checking …

Pedro Orvalho

• Jan 26, 2026 • 1 min read

Software Verification

📄 Paper accepted @ the Post-AI Formal Methods Workshop @ AAAI 2026! 🎉

I’m excited to share that our paper on PyVeritas has been accepted to the P-AI-FM-26 Workshop @ AAAI 2026! 📄

Pedro Orvalho

• Nov 10, 2025 • 1 min read