Recent work showed that retrieval based on embedding similarity (e.g., for retrieval-augmented generation) is vulnerable to poisoning: …
We introduce a new type of indirect, cross-modal injection attacks against visual language models that enable creation of …
Multi-modal embeddings encode texts, images, thermal images, sounds, and videos into a single embedding space, aligning representations …