Human-Ai cooperation requires a mutual understanding, and this will require machines that are better able at understanding human language, as well as other forms of communication, including gestures, to understand human preferences, intentions and goals. I think this idea supports why the [[System of a Sound - An interactive AudioVisual Installation using large Language Models, data sonification and pose recognition]] should be included in my thesis: it is a way to explore. **From** [[Cooperative AI machines must learn to find common ground (Dafoe et al., 2021)]] #Idea