NLP is not NLU and GPT-3 - Walid Saba

from Machine Learning Street Talk (MLST)

#machinelearning

This week Dr. Tim Scarfe, Dr. Keith Duggar and Yannic Kilcher speak with veteran NLU expert Dr. Walid Saba. Walid is an old-school AI expert. He is a polymath: a neuroscientist, psychologist, linguist, philosopher, statistician, and logician. He thinks the missing information problem and the lack of a typed ontology are the key issues with NLU, not sample efficiency or generalisation. He is a big critic of the deep learning movement and BERTology. We also cover GPT-3 in some detail in today's session, including Luciano Floridi's recent article "GPT-3: Its Nature, Scope, Limits, and Consequences" and a commentary on the incredible power of GPT-3 to perform tasks with just a few examples, including the Yann LeCun commentary on Facebook and Hacker News.

Timestamps on the YouTube version:

00:00:00 Walid intro
00:05:03 Knowledge acquisition bottleneck
00:06:11 Language is ambiguous
00:07:41 Language is not learned
00:08:32 Language is a formal language
00:08:55 Learning from data doesn't work
00:14:01 Intelligence
00:15:07 Lack of domain knowledge these days
00:16:37 Yannic Kilcher thuglife comment
00:17:57 Deep learning assault
00:20:07 The way we evaluate language models is flawed
00:20:47 Humans do type checking
00:23:02 Ontologic
00:25:48 Comments on GPT-3
00:30:54 Yann LeCun and Reddit
00:33:57 Minds and Machines - Luciano
00:35:55 Main show introduction
00:39:02 Walid introduces himself
00:40:20 Science advances one funeral at a time
00:44:58 Deep learning obsession syndrome and inception
00:46:14 BERTology / empirical methods are not NLU
00:49:55 Pattern recognition vs domain reasoning: is the knowledge in the data?
00:56:04 Natural language understanding is about decoding and not compression; it's not learnable
01:01:46 Intelligence is about not needing infinite amounts of time
01:04:23 We need an explicit ontological structure to understand anything
01:06:40 Ontological concepts
01:09:38 Word embeddings
01:12:20 There is power in structure
01:15:16 Language models are not trained on pronoun disambiguation and resolving scopes
01:17:33 The information is not in the data
01:19:03 Can we generate these rules on the fly? Rules or data?
01:20:39 The missing data problem is key
01:21:19 Problem with empirical methods and LeCun reference
01:22:45 Comparison with meatspace (brains)
01:28:16 The knowledge graph game: is knowledge constructed or discovered?
01:29:41 How small can this ontology of the world be?
01:33:08 Walid's taxonomy of understanding
01:38:49 The trend seems to be that fewer rules is better, not the other way around?
01:40:30 Testing the latest NLP models with entailment
01:42:25 Problems with the way we evaluate NLP
01:44:10 Winograd Schema Challenge
01:45:56 All you need to know now is how to build neural networks; lack of rigour in ML research
01:50:47 Is everything learnable?
01:53:02 How should we elevate language systems?
01:54:04 10 big problems in language (missing information)
01:55:59 Multiple inheritance is wrong
01:58:19 Language is ambiguous
02:01:14 How big would our world ontology need to be?
02:05:49 How to learn more about NLU
02:09:10 AlphaGo

Walid's blog: https://medium.com/@ontologik
LinkedIn: https://www.linkedin.com/in/walidsaba/





Frequently Asked Questions

How long is this episode of Machine Learning Street Talk (MLST)?

This episode is 2 hours and 20 minutes long.

When was this Machine Learning Street Talk (MLST) episode published?

This episode was published on November 4, 2020.

