The Hidden Dangers of Generative AI: Who is Responsible for Protecting our Data? episode artwork

EPISODE · May 6, 2023 · 1H 4M

The Hidden Dangers of Generative AI: Who is Responsible for Protecting our Data?

from Security Voices · host Security Voices

The breakaway success of ChatGPT is hiding an important fact and an even bigger problem. The next wave of generative AI will not be built by trawling the Internet but by mining hordes of proprietary data that have been piling up for years inside organizations. While Elon Musk and Reddit may breathe a sigh of relief, this ushers in a new set of concerns that go well beyond prompt injections and AI hallucinations. Who is responsible for making sure our private data doesn’t get used as training data? And what happens if it does? Do they even know what’s in the data to begin with?We tagged in data engineering expert Josh Wills and security veteran Mike Sabbota of Amazon Prime Video to go past the headlines and into what it takes to safely harness the vast oceans of data they’ve been responsible for in the past and present. Foundational questions like “who is responsible for data hygiene?” and “what is data governance?” may not be nearly as sexy as tricking AI into saying it wants to destroy humanity but they arguably will have a much greater impact on our safety in the long run. Mike, Josh and Dave go deep into the practical realities of working with data at scale and why the topic is more critical than ever.For anyone wondering exactly how we arrived at this moment where generative AI dominates the headlines and we can’t quite recall why we ever cared about blockchains and NFTs, we kick off the episode with Josh explaining the recent history of data science and how it led to this moment. We quickly (and painlessly) cover the breakthrough attention-based transformer model explained in 2017 and key events that have happened since that point.

The breakaway success of ChatGPT is hiding an important fact and an even bigger problem. The next wave of generative AI will not be built by trawling the Internet but by mining hordes of proprietary data that have been piling up for years inside organizations. While Elon Musk and Reddit may breathe a sigh of relief, this ushers in a new set of concerns that go well beyond prompt injections and AI hallucinations. Who is responsible for making sure our private data doesn’t get used as training data? And what happens if it does? Do they even know what’s in the data to begin with?We tagged in data engineering expert Josh Wills and security veteran Mike Sabbota of Amazon Prime Video to go past the headlines and into what it takes to safely harness the vast oceans of data they’ve been responsible for in the past and present. Foundational questions like “who is responsible for data hygiene?” and “what is data governance?” may not be nearly as sexy as tricking AI into saying it wants to destroy humanity but they arguably will have a much greater impact on our safety in the long run. Mike, Josh and Dave go deep into the practical realities of working with data at scale and why the topic is more critical than ever.For anyone wondering exactly how we arrived at this moment where generative AI dominates the headlines and we can’t quite recall why we ever cared about blockchains and NFTs, we kick off the episode with Josh explaining the recent history of data science and how it led to this moment. We quickly (and painlessly) cover the breakthrough attention-based transformer model explained in 2017 and key events that have happened since that point.

NOW PLAYING

The Hidden Dangers of Generative AI: Who is Responsible for Protecting our Data?

0:00 1:04:21

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Unchained: Voices of Survival Diaz Task Force Unchained: Voices of Survival is a raw and unfiltered podcast that exposes the harsh realities of human and sex trafficking. Through courageous interviews with survivors, we amplify their voices, revealing the pain, resilience, and triumph of those who have endured the unimaginable. But we go even deeper—by speaking directly with the predators, we uncover the manipulations, tactics, and twisted justifications behind these heinous crimes.This isn’t just a podcast—it’s a mission. A platform for truth. A warning. A beacon of awareness. Join us as we break the silence, dismantle the darkness, and fight for justice.Listen. Learn. Take Action. Explicit Technado (Archived) ACI Learning The Technado crew covers a whirlwind of tech topics each week from interviews with industry experts and up-and-coming companies to commentary on topics like security, vendor certifications, networking, and just about anything IT related. Explicit Unauthorized Disclosure Kevin Gosztola Become a Paid Subscriber: https://anchor.fm/unauthorized-disclosure/subscribe"Unauthorized Disclosure" is a weekly podcast hosted by Rania Khalek and Kevin Gosztola. It focuses on issues and topics that are overlooked or pushed aside by the more mainstream media.The hosts champion adversarial journalism. Guests featured are often rarely heard or unheard voices. Or they are voices who we think can benefit from a space to have conversations, which allow for dissent and the unpacking of unpopular ideas.SUBSCRIBE on Spotify for $4.99/month and gain access to full episodes instead of clips or highlights from each week's show. Explicit Techlore Surveillance Report Techlore Techlore Surveillance Report is your weekly deep-dive into the privacy and security news that matters for your digital freedom. Hosted by Henry Fisher, founder of Techlore and long-time digital rights educator, each episode cuts through the noise to bring you carefully selected stories with the context, analysis, and historical perspective you need to truly understand what's happening to protect yourself (and others!) in the digital space.Topics covered include:• Privacy tool updates and vulnerabilities• Data breaches and cybersecurity incidents• Surveillance technology and government overreach• Big Tech privacy policies and practices• Encryption and security standards• Digital rights legislation and court cases• Open-source software developments• Corporate data practices and accountabilityWhether you're a beginner trying to stay informed or a seasoned expert tracking the ecosystem, Surveillance Report has Explicit

Frequently Asked Questions

How long is this episode of Security Voices?

This episode is 1 hour and 4 minutes long.

When was this Security Voices episode published?

This episode was published on May 6, 2023.

What is this episode about?

The breakaway success of ChatGPT is hiding an important fact and an even bigger problem. The next wave of generative AI will not be built by trawling the Internet but by mining hordes of proprietary data that have been piling up for years inside...

Can I download this Security Voices episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!