Task-in-Prompt (TIP) adversarial attacks
"Task-in-Prompt (TIP) adversarial attacks" is an episode of the Build Wiz AI Show podcast, hosted by Build Wiz AI. It was published on August 25, 2025, and runs 13 minutes.
Summary
Tune in to our latest episode, where we dive deep into Task-in-Prompt (TIP) adversarial attacks, a novel class of jailbreaks that embed sequence-to-sequence tasks within prompts to bypass LLM safeguards. We explore how these attacks elicit prohibited content from state-of-the-art models such as GPT-4o and LLaMA 3.2, revealing critical weaknesses in current defense mechanisms. Discover why traditional safeguards, including keyword-based filters, often fail against these indirect exploits.