EPISODE · Apr 8, 2026 · 29 MIN
Scott & Mark Learn To... Beyond the Vibes: How Models Learn and Stitch Panoramas
In this episode, Scott Hanselman and Mark Russinovich unpack how AI systems actually behave beneath the surface, pushing past hype into the messy reality of how models are trained, aligned, and deployed. They explore whether AI systems are inherently benevolent or simply shaped by incentives, training data, and reinforcement learning, and why behaviors like deception can emerge under certain conditions. The conversation moves from philosophical questions about human nature versus machine behavior into the practical mechanics of large language models, including how reinforcement learning with human feedback shapes outputs and why alignment is far from perfect. Along the way, they ground the discussion in a real engineering challenge, stitching a scrolling panorama from screen captures, to show how complex systems come together through heuristics, edge cases, and iteration. Takeaways: AI behavior is shaped by training and incentives, not built-in intent or morality AI can accelerate coding, but testing, edge cases, and reliability require human oversight Reinforcement learning pushes models to be helpful and agreeable, sometimes at the cost of accuracy Who are they? View Scott Hanselman on LinkedIn View Mark Russinovich on LinkedIn Watch Scott and Mark Learn on YouTube Listen to other episodes at scottandmarklearn.to Discover and follow other Microsoft podcasts at microsoft.com/podcasts Hosted on Acast. See acast.com/privacy for more information.
What this episode covers
In this episode, Scott Hanselman and Mark Russinovich unpack how AI systems actually behave beneath the surface, pushing past hype into the messy reality of how models are trained, aligned, and deployed. They explore whether AI systems are inherently benevolent or simply shaped by incentives, training data, and reinforcement learning, and why behaviors like deception can emerge under certain conditions. The conversation moves from philosophical questions about human nature versus machine behavior into the practical mechanics of large language models, including how reinforcement learning with human feedback shapes outputs and why alignment is far from perfect. Along the way, they ground the discussion in a real engineering challenge, stitching a scrolling panorama from screen captures, to show how complex systems come together through heuristics, edge cases, and iteration. Takeaways: AI behavior is shaped by training and incentives, not built-in intent or morality AI can accelerate coding, but testing, edge cases, and reliability require human oversight Reinforcement learning pushes models to be helpful and agreeable, sometimes at the cost of accuracy Who are they? View Scott Hanselman on LinkedIn View Mark Russinovich on LinkedIn Watch Scott and Mark Learn on YouTube Listen to other episodes at scottandmarklearn.to Discover and follow other Microsoft podcasts at microsoft.com/podcasts Hosted on Acast. See acast.com/privacy for more information.
NOW PLAYING
Scott & Mark Learn To... Beyond the Vibes: How Models Learn and Stitch Panoramas
No transcript for this episode yet
Similar Episodes
Mar 26, 2026 ·1m
Mar 19, 2026 ·34m
Feb 18, 2026 ·11m
Feb 11, 2026 ·45m