Semantic Robustness of Large Language Models

Artificial Intelligence for Reliable Code

Understanding the reasoning and robustness of AI systems, such as Large Language Models (LLMs), is critical for ensuring their reliable use in programming tasks. While recent …

Pedro Orvalho

• Aug 1, 2025 • 4 min read

LLMs for Code Understanding

Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations?

In this talk, I will present our evaluation on whether state-of-the-art LLMs with up to 8B parameters can reason about Python programs or are simply guessing.

Pedro Orvalho

• May 15, 2025 • 1 min read