AudioHijack (auditory prompt injection attack)

The study shows that carefully crafted audio clips can elicit unauthorized actions from audio-language models (LALMs), including downloading files, sending emails, and performing web searches

malicious instructions are embedded in ordinary audio using adversarial perturbations that remain nearly imperceptible to human listeners

Amazing!

4 Likes

Nice!

Microsoft acknowledged the findings […], […] noting that developers can implement additional safeguards at the application layer.

Yes, and I wonder how many developers will actually do so.

And if the developers make safeguards an option available to users, will it default to “safe” and/or how many users will actually use the safeguards?

1 Like

If non-privacy phones weren’t always listening it wouldn’t be such a problem.

(Had to edit, forgot the “n’t” after “would”.)

En-shitty-AI-ification?

You must show me the proper pronunciation for this apropos new terminology!

/ɛnˌʃɪtɪ.eɪ.aɪ.ɪ.fɪ.'keɪ.ʃən/

:zany_face:

(… following the International Phonetic Alphabet (IPA) convention, more or less)

1 Like

Maybe it’s just enshittificAtIon.

Is apropos Greek or Latin? (With the ending “os” I suspect Greek.) Empasis on the middle syllable, short a, long o’s.

It’s French: a propos - Wiktionary, the free dictionary

and yeah I know that just pushes your question one language along.

French: à propos
Greek, courtesy of DeepL: σχετικά με (σχετικά)
Latin, courtesy of DeepL: De [or adpositum, acc. to WordHippo . com]

à propos
Stress falls on final voiced syllable, as with all French multisyllabic words, i.e. excluding words with a final unvoiced (barely voiced?) syllable, like Louvre, oeuvre, fenêtre, etc..

:slight_smile: