Semantic Robustness of Large Language Models

Artificial Intelligence for Reliable Code featured image

Artificial Intelligence for Reliable Code

Understanding the reasoning and robustness of AI systems, such as Large Language Models (LLMs), is critical for ensuring their reliable use in programming tasks. While recent …

avatar
Pedro Orvalho
Read more
Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations? featured image

Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations?

In this talk, I will present our evaluation on whether state-of-the-art LLMs with up to 8B parameters can reason about Python programs or are simply guessing.

avatar
Pedro Orvalho
Read more