Speaker
Description
The AI Alignment Problem concerns aligning AI behavior with human intentions, raising both technical and ethical questions. In the first part of my talk, I illustrate the AI Alignment Problem in the context of gender inequality in GPT-4o. The AI Alignment Problem also affects particle physics, where AI is essential for tasks such as event tagging and event generation, and has the potential to enable frontier precision predictions in the Standard Model. Ensuring the reliability of AI systems is therefore crucial for the future development of particle physics.
To avoid unaligned behavior, benchmarks are typically used to assess the performance of AI models. However, benchmarks alone cannot guarantee aligned AI systems. A particular failure mode is strategic underperformance on benchmarks, which can lead to misaligned behavior of the AI during deployment. In the second part of my talk, I present a novel experiment, inspired by deep learning theories rooted in physics, to detect this type of underperformance.