On Adversarial Training & Robustness with Bhavna Gopal
An episode of the Thinking Machines: AI & Philosophy podcast, hosted by Daniel Reid Cahn, titled "On Adversarial Training & Robustness with Bhavna Gopal" was published on May 8, 2024 and runs 44 minutes.
May 8, 2024 ·44m · Thinking Machines: AI & Philosophy
Summary
"Understanding what's going on in a model is important to fine-tune it for specific tasks and to build trust."Bhavna Gopal is a PhD candidate at Duke, research intern at Slingshot with experience at Apple, Amazon and Vellum.We discussHow adversarial robustness research impacts the field of AI explainability.How do you evaluate a model's ability to generalize?What adversarial attacks should we be concerned about with LLMs?
Episode Description
"Understanding what's going on in a model is important to fine-tune it for specific tasks and to build trust."
Bhavna Gopal is a PhD candidate at Duke, research intern at Slingshot with experience at Apple, Amazon and Vellum.
We discuss
- How adversarial robustness research impacts the field of AI explainability.
- How do you evaluate a model's ability to generalize?
- What adversarial attacks should we be concerned about with LLMs?
Similar Episodes
Jan 23, 2024 ·41m
Jan 16, 2024 ·34m
Jan 2, 2024 ·34m
Dec 26, 2023 ·41m
Dec 19, 2023 ·40m
Dec 12, 2023 ·37m