... Prior work from 2021, the authors note, had already observed that CLIP’s embeddings don’t explicitly bind a concept’s attributes to the object itself. ‘Accordingly,’ they write, ‘they observe that reconstructions from the decoder often mix up attributes and objects.’ ...