Were RNNs All We Needed? (Feng et al., 2024) episode artwork

EPISODE · Oct 13, 2024 · 11 MIN

Were RNNs All We Needed? (Feng et al., 2024)

from Revise and Resubmit - The Mayukh Show · host Mayukh Mukhopadhyay

Welcome to Revise and Resubmit, the place where we explore research breakthroughs, challenge assumptions, and ask the questions that keep science alive. Today, we dive into a fascinating paper titled "Were RNNs All We Needed?", authored by Leo Feng, Frederick Tung, Mohamed Osama Ahmed, Yoshua Bengio, and Hossein Hajimirsadegh. Published on October 2, 2024, this preprint is hosted on arXiv, courtesy of Cornell University. For years, Transformers have reigned supreme, revolutionizing natural language processing and sequential data tasks. But Transformers come with a cost: they struggle with long sequences, raising the question—did we abandon recurrent neural networks (RNNs) too soon? This paper suggests we may have. It takes us back to the classic models—LSTMs from 1997 and GRUs from 2014—and shows that with a little clever tweaking, these older architectures can still shine. Imagine RNNs that no longer need to backpropagate through time (BPTT). The authors remove hidden-state dependencies from input and update gates, giving birth to minLSTMs and minGRUs—leaner, faster, and fully parallelizable. In fact, they train 175x faster for long sequences and match the latest sequence models in performance, proving that sometimes innovation lies in refining the old, not just chasing the new. But here’s the million-dollar question: Did we jump to Transformers too soon? Could the future of deep learning lie in revisiting old ideas with fresh eyes? Thank you to the authors for their brilliant work and to Cornell University for making this research openly accessible through arXiv. Reference Feng, L., Tung, F., Ahmed, M. O., Bengio, Y., & Hajimirsadegh, H. (2024). Were RNNs All We Needed?. arXiv preprint. https://doi.org/10.48550/arXiv.2410.01201

Welcome to Revise and Resubmit, the place where we explore research breakthroughs, challenge assumptions, and ask the questions that keep science alive. Today, we dive into a fascinating paper titled "Were RNNs All We Needed?", authored by Leo Feng, Frederick Tung, Mohamed Osama Ahmed, Yoshua Bengio, and Hossein Hajimirsadegh. Published on October 2, 2024, this preprint is hosted on arXiv, courtesy of Cornell University. For years, Transformers have reigned supreme, revolutionizing natural language processing and sequential data tasks. But Transformers come with a cost: they struggle with long sequences, raising the question—did we abandon recurrent neural networks (RNNs) too soon? This paper suggests we may have. It takes us back to the classic models—LSTMs from 1997 and GRUs from 2014—and shows that with a little clever tweaking, these older architectures can still shine. Imagine RNNs that no longer need to backpropagate through time (BPTT). The authors remove hidden-state dependencies from input and update gates, giving birth to minLSTMs and minGRUs—leaner, faster, and fully parallelizable. In fact, they train 175x faster for long sequences and match the latest sequence models in performance, proving that sometimes innovation lies in refining the old, not just chasing the new. But here’s the million-dollar question: Did we jump to Transformers too soon? Could the future of deep learning lie in revisiting old ideas with fresh eyes? Thank you to the authors for their brilliant work and to Cornell University for making this research openly accessible through arXiv. Reference Feng, L., Tung, F., Ahmed, M. O., Bengio, Y., & Hajimirsadegh, H. (2024). Were RNNs All We Needed?. arXiv preprint. https://doi.org/10.48550/arXiv.2410.01201

NOW PLAYING

Were RNNs All We Needed? (Feng et al., 2024)

0:00 11:13

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Frequently Asked Questions

How long is this episode of Revise and Resubmit - The Mayukh Show?

This episode is 11 minutes long.

When was this Revise and Resubmit - The Mayukh Show episode published?

This episode was published on October 13, 2024.

What is this episode about?

Welcome to Revise and Resubmit, the place where we explore research breakthroughs, challenge assumptions, and ask the questions that keep science alive. Today, we dive into a fascinating paper titled "Were RNNs All We Needed?", authored by Leo Feng,...

Can I download this Revise and Resubmit - The Mayukh Show episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!