Home /
technology Podcasts /
ExplAInable /
[153] למידה אדוורסריאלית

EPISODE · Apr 26, 2026 · 36 MIN

[153] למידה אדוורסריאלית

from ExplAInable · host Tamir Nave, Mike Erlihson, Uri Goren, Hila Paz Herszfang

מה הקשר בין הרעלת training data להורדת הסבירות ל- end of text token?בפרק 153 של אקספליינבל, אורי ומייק מארחים את ד״ר רז לפיד ואילון מזרחי לשיחה על למידה אדוורסריאלית. לא זו מארכיטקטורת GAN, אלא כזו שגורמת למודלי LLM לצטט את החוקה האמריקאית ולבזבז יותר מדי טוקנים. בפרק למדנו על תקיפות שמתחילות בwhitebox עם מודל opensource ונודדות למודלים סגורים, תקיפות פיזיות על מערכות סגורות שאומנו לזיהוי בני אדם, ואיך אפשר להתמודד עם מתקפה שמורידה את הסבירות שמודל שפה יוצא end of text token. אז האם אייג׳נטים שמשתמשים במודל סגור יותר בטוחים מכאלו שמשתמשים במשקולות מhugging face? איך תוקפים מרעילים תוצאות כשכל מה שיש להם הוא גישה ל training data? האם אורי ורז יפתחו עסק צדדי של הדפסת חולצות שיגרמו לנו להיות בלתי נראים?ה scholar של קרליני: https://scholar.google.com/citations?user=q4qDvAoAAAAJ&hl=enהגנה "לא מפוקחת" שהתקבלה ל ICCV: https://openaccess.thecvf.com/content/ICCV2025W/SafeMM-AI/html/Mizrahi_Pulling_Back_the_Curtain_Unsupervised_Adversarial_Detection_via_Contrastive_Auxiliary_ICCVW_2025_paper.htmlהתקפת black box על object detectors שהתקבלה ל - ECML: https://arxiv.org/abs/2303.04238

NOW PLAYING

[153] למידה אדוורסריאלית

0:00 36:04

1×

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Share this episode

Similar Episodes

Hollowing of the Human Abstraction Stack

May 11, 2026 ·21m

Reclaiming Cognition from the Slop Firehose

May 10, 2026 ·20m

Teenagers- Explore Before You Decide

May 9, 2026 ·18m

The Career Plan Is No Longer Fit-for-Purpose

May 8, 2026 ·25m

For 200+ Years, Capitalism Needed Humans

May 7, 2026 ·18m

Zrobiłem wszystko dobrze. I tak mnie okradli.

May 3, 2026 ·24m

Similar Podcasts

Spatial Web AI Podcast Denise Holt Active Inference AI & the Spatial Web The Future of AI is shared, distributed, and multi-scale.AI that is knowable, explainable, and capable of human governance.Based on the same mechanics as biological intelligence, it operates in a naturally efficient way, with no big data requirement.This is Active Inference AI & the Spatial Web. Trustworthy AI : De-risk business adoption of AI Pamela Gupta Description: Creating AI Trust is a very complex and hard problem. It is not clear what it is and how it can be operationalized. We will demystify what is Trustworthy AI, efficient adoption and leveraging it for reducing risks in AI programs.McKinsey reports indicates companies seeing the biggest bottom-line returns from AI—those that attribute at least 20 percent of EBIT or profitability to their use of AI—are more likely than others to follow Trustworthy AI best practices, including explainability. Further, organizations that establish digital trust among consumers through responsible practices such as making AI explainable are more likely to see their annual revenue and profitability grow at rates of 10 percent or more. Evidence → Cognition → Discernment™️ - Your Pathway to AI Leadership Greg Twemlow XperientialAI — Pathway to AI Leadership explores how people can collaborate with AI without outsourcing judgment. The spine is a three-step method: Evidence → Cognition → Discernment — a bridge from what’s scattered to what’s chosen. Through essays, reflections, and practical examples, I show how the Context & Critique Rule™ keeps thinking visible, decisions explainable, and responsibility human. Mateusz Chrobok Mateusz Chrobok Jak niebezpieczny jest Internet?Co można zrobić z danymi i dlaczego buzzwordy napędzają branżę IT?Jak przekuć pomysł w startup i spróbować zrobić coś dobrego?Mateusz Chrobok stara się dzielić swoim doświadczeniem w pracy w działach Research & Development na swoim kanale na youtube. Porusza w nim tematykę tworzenia startupów a także podejmowania kluczowych decyzji. Budowania produktów i weryfikacji ich zasadności. Opowiada także o bezpieczeństwie i o tym co robić i jak żyć by być maksymalizować swoje bezpieczeństwo. Oprócz tego jest ciekawy nowych technologii związanych z sztuczną inteligencją takich jak explainable AI, które mogą zmienić adopcję tych technologii w życiu codziennym.

URL copied to clipboard!