What if I train a neural network with random data? (with Stanisław Jastrzębski) (Ep. 87) episode artwork

EPISODE · Nov 12, 2019 · 19 MIN

What if I train a neural network with random data? (with Stanisław Jastrzębski) (Ep. 87)

from Data Science at Home · host Francesco <frag> Gadaleta

What happens to a neural network trained with random data? Are massive neural networks just lookup tables or do they truly learn something? Today’s episode will be about memorisation and generalisation in deep learning, with Stanislaw Jastrzębski from New York University.Stan spent two summers as a visiting student with Prof. Yoshua Bengio and has been working on Understanding and improving how deep network generaliseRepresentation LearningNatural Language ProcessingComputer Aided Drug Design What makes deep learning unique?I have asked him a few questions for which I was looking for an answer for a long time. For instance, what is deep learning bringing to the table that other methods don’t or are not capable of? Stan believe that the one thing that makes deep learning special is representation learning. All the other competing methods, be it kernel machines, or random forests, do not have this capability. Moreover, optimisation (SGD) lies at the heart of representation learning in the sense that it allows finding good representations.  What really improves the training quality of a neural network?We discussed about the accuracy of neural networks depending pretty much on how good the Stochastic Gradient Descent method is at finding minima of the loss function. What would influence such minima?Stan's answer has revealed that training set accuracy or loss value is not that interesting actually. It is relatively easy to overfit data (i.e. achieve the lowest loss possible), provided a large enough network, and a large enough computational budget. However, shape of the minima, or performance on validation sets are in a quite fascinating way influenced by optimisation. Optimisation in the beginning of the trajectory, steers such trajectory towards minima of certain properties that go much further than just training accuracy.As always we spoke about the future of AI and the role deep learning will play.I hope you enjoy the show!Don't forget to join the conversation on our new Discord channel. See you there! References Homepage of Stanisław Jastrzębski https://kudkudak.github.io/A Closer Look at Memorization in Deep Networks https://arxiv.org/abs/1706.05394Three Factors Influencing Minima in SGD https://arxiv.org/abs/1711.04623Don't Decay the Learning Rate, Increase the Batch Size https://arxiv.org/abs/1711.00489Stiffness: A New Perspective on Generalization in Neural Networks https://arxiv.org/abs/1901.09491 This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit datascienceathome.substack.com

NOW PLAYING

What if I train a neural network with random data? (with Stanisław Jastrzębski) (Ep. 87)

0:00 19:37

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

THE OMEN (1995) / DAMIEN

Apr 20, 2026 ·75m

THE OMEN (2006)

Apr 6, 2026 ·116m

OMEN IV: THE AWAKENING

Mar 30, 2026 ·126m

Big Old Life: Heather Blackbird interviews people on planet earth. Heather Blackbird loves asking questions. This podcast is a learning experience. Join me, Heather Blackbird, as I talk to people about their lives. Frequency of new episodes is a little all over the place and I'm learning as I go. Big Old Life is a small way of talking about the vastness of life, one person at a time. If you are reading this or found this podcast it's probably because someone you know gave you a link to it. :) Explicit Tales Of A Superstar DJ The Insomniac Spun seemingly out of nowhere from her complacent life in the corporate world, turned seemingly overnight from 16-Hour shift work and into the life of a literally starving artist and working musician, The Protagonist navigates her supposed rise to fame and superstardom on a journey through spiritual awakening, coming-of-age, and intimate self-realization--guided by an omnipresent force and equipped with the power of love, magic, and music. {Enter The Multiverse.} [The Festival Project] The Festival Project, Inc.™ is a multidimensional multimedia platform which encompasses exploratory and artistic social personifications and expressions on cosmic theory, spirituality, growth, health & wellness, philosophy and theoretic dynamics in entertainment such as music, design, film, television, radio, dance and festival culture, art, fashion, literature, and science. The Festival Project™ and its subsidiary Non-Profit, The Collective Complex © aims to challenge modern artistic and philosop Explicit Bitcoin Gateway Lea meakin Welcome to Bitcoin Gateway, the podcast where we dive deep into the world of Bitcoin, hosted by Lea Meakin. This show is for anyone who’s ever felt overwhelmed by the complex world of cryptocurrencies and wants a simple, straightforward explanation. Each episode, we’ll break down the basics of Bitcoin, explore its history, and discuss its potential impact on the future of finance. Whether you’re a complete beginner or just looking to expand your knowledge, Bitcoin Gateway is here to help you understand Bitcoin, one episode at a time. Explicit Chinook Realm Religion and crime collide when a gruesome murder rocks the isolated community of Chinook, Montana. Local Deputy Ruth Vogel thought she was answering a routine animal control call, only to find a mangled corpse on the frozen embankment. Her small town is whipped into a frenzy and everyone is quick to point their fingers at a drug-addicted teenager, but Ruth suspects connections to a powerful religious group. Enter Agent Loro, an enigmatic FBI investigator tracking an evangelical cult that may have roots right here in Chinook. Loro and Ruth form a cautious partnership to find the killer—but as the mystery winds through Ruth’s life, her family, and her church, she’ll discover something more sinister than murder is afoot.Binge all episodes of Chinook exclusively and ad-free by joining Wondery+ in the Wondery App, Apple Podcasts or Spotify. Start your free trial by wondery.com/links/chinook v Explicit

Frequently Asked Questions

How long is this episode of Data Science at Home?

This episode is 19 minutes long.

When was this Data Science at Home episode published?

This episode was published on November 12, 2019.

What is this episode about?

What happens to a neural network trained with random data? Are massive neural networks just lookup tables or do they truly learn something? Today’s episode will be about memorisation and generalisation in deep learning, with Stanislaw Jastrzębski...

Can I download this Data Science at Home episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!