Cor Steging

Cor Steging profile

Title: Designing Responsible Artificial Intelligence: Hybrid Approaches for Aligning Learning and Reasoning

Location: Aula, Academiegebouw, Groningen 

Time: October 1st 2024, 11:00 am

Abstract: Artificial Intelligence (AI) has become an integral part of our society: we have smart assistants with speech recognition on our phones, self-driving cars, and online algorithms that recommend what we should buy, watch or listen to. Most of these AI systems learn to make decisions based on data: large quantities of examples from the past. The exact internal reasoning of such AI systems that learn from data is difficult to determine, however. This can cause the AI system to behave irresponsibly.

In this thesis, we introduce a method to evaluate the internal reasoning of AI systems that learn from data. We show that AI systems sometimes make the right decisions, but for the wrong reasons. For example, unbeknownst to us, an AI system can learn an undesirable, hidden bias from the data.

The method that we describe in our thesis can not only evaluate the internal reasoning of an AI system, but can also adjust it and steer it in the right direction. Additionally, we also show how one can create an AI system with predefined reasoning, rather than making it learn its reasoning from data. This way, the system cannot accidentally learn to make the decisions for the wrong reasons.

All of the methods we discuss in the thesis build upon the idea that we should use the domain knowledge of human experts when designing AI systems that learn from data. The thesis shows that this is essential for designing responsible artificial intelligence.

Publications:

Cor Steging Cover Thesis