Talks and Seminars

Informal learning seminar on mathematics and AI

by Maximilian Schimpf (MPIM)

Europe/Berlin
0.008 (Mathezentrum)

0.008

Mathezentrum

Endenicher Allee 60 53115 Bonn
Description

Title: "Transformers as a dynamical system"

We start by noting that the repeated filtering of information through the attention layers of a transformer can be interpreted as a discrete dynamical system. The continuous limit of this system satisfies a differential equation, which can be interpreted under appropriate assumptions as a gradient flow. After explaining this, we will see how this gradient flow has a tendency to form (often metastable) clusters. This perhaps offers a partial explanation of why a chatbot can extract "meaning" from input text.

https://sites.google.com/view/informal-math-ai-bonn/home