Image Generation - Google DeepMind paper with TensorFlow - Deep Learning: Zero to One
An episode of the Deep Learning: Zero to One podcast, hosted by Sam Putnam, titled "Image Generation - Google DeepMind paper with TensorFlow - Deep Learning: Zero to One" was published on March 4, 2017 and runs 5 minutes.
Summary
I talk through generating an image of IRS tax return characters using a model trained on the IRS tax return dataset, MNIST. The authors trained for 70 hours on 32 GPUs. Using unconditioned image generation, I created an image in 6 hours on my MacBook Pro CPU. I used a TensorFlow implementation of Conditional Image Generation with PixelCNN Decoders (https://arxiv.org/abs/1606.05328), written by a student named Anant Gupta, and learned that reasonable-looking digits can be generated with significantly fewer training steps, as soon as the training loss approaches the loss reached by the DeepMind authors. Each step is detailed at https://medium.com/@SamPutnam/this-is-the-1st-deep-learning-zero-to-one-newsletter-this-one-is-called-image-generation-935bcaf0f37c
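For listeners who want to see the core idea in code, here is a minimal sketch in plain Python of two pieces a PixelCNN-style model relies on: a causal mask, which lets each pixel see only the pixels above it and to its left, and a pixel-by-pixel sampling loop. It is not the code from the implementation I ran. The function names, the binary-pixel simplification, and `toy_predictor` (a stand-in for a trained network) are all assumptions made for illustration, and the paper itself uses a gated two-stack design rather than this simple single mask.

```python
import random


def causal_mask(kernel_size, mask_type):
    """Build a 0/1 mask for a square convolution kernel.

    Weights at or after the centre in raster order are zeroed, so a pixel
    never sees itself or "future" pixels. Type "A" (first layer) hides the
    centre; type "B" (later layers) keeps it.
    """
    if kernel_size % 2 == 0 or mask_type not in ("A", "B"):
        raise ValueError("need an odd kernel size and mask type 'A' or 'B'")
    centre = kernel_size // 2
    mask = []
    for row in range(kernel_size):
        mask_row = []
        for col in range(kernel_size):
            before = row < centre or (row == centre and col < centre)
            is_centre = row == centre and col == centre
            visible = before or (is_centre and mask_type == "B")
            mask_row.append(1 if visible else 0)
        mask.append(mask_row)
    return mask


def sample_image(predict_on_probability, height, width, rng):
    """Generate a binary image one pixel at a time, in raster order.

    Each pixel needs its own model call, because it depends on every
    pixel sampled before it.
    """
    image = [[0] * width for _ in range(height)]
    for row in range(height):
        for col in range(width):
            p_on = predict_on_probability(image, row, col)
            image[row][col] = 1 if rng.random() < p_on else 0
    return image


def toy_predictor(image, row, col):
    """Hypothetical stand-in for a trained network (assumption):
    it tends to copy the pixel immediately to the left."""
    if col == 0:
        return 0.5
    return 0.9 if image[row][col - 1] else 0.1


if __name__ == "__main__":
    for mask_row in causal_mask(3, "A"):
        print(mask_row)
    digit_sized = sample_image(toy_predictor, 28, 28, random.Random(0))
    for image_row in digit_sized[:4]:
        print("".join("#" if pixel else "." for pixel in image_row))
```

In this sketch a 28 by 28 image takes 784 sequential calls to the predictor, one per pixel, which gives a feel for why sampling from this family of models is slow without a GPU.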