Never. Let. AI. Write. Your. Tests. (News) episode artwork

EPISODE · Jun 9, 2025 · 10 MIN

Never. Let. AI. Write. Your. Tests. (News)

from Changelog Master Feed

Diwank explains why you should never let AI writes your tests, Apple redesigns all of their software platforms, AI has brought about the rise of judgement over technical skills, Peter Steinberger says Claude Code is now his computer, and the curious case of Memvid.

NOW PLAYING

Never. Let. AI. Write. Your. Tests. (News)

0:00 10:09
of MATCHES

TRANSCRIPT · AUTO-GENERATED

What's up, nerds? I'm Jared, and this is Changelog News for the week of Monday, June 9, 2025. Just days before their much-anticipated WWDC keynote, Apple Research published a paper on the strengths and limitations of large reasoning models, which I can't help but interpret as, seriously guys, there's good reasons why our Apple Intelligence rollout has been a nosedive. You'll see.

Okay, let's get into the news. Never let AI write your tests. Developer Duang's field guide to a new way of building software starts off as a pretty typical, here's how to be productive coding with AI, but then he says something near the end, and emphatically so, that I haven't heard anybody say. Quote, Now we come to the most important principle in AI-assisted development.

It's so important that I'm going to repeat it in multiple ways until it's burned into your memory. Never let AI write your tests. Tests are not just code that verifies other code works. Tests are executable specifications.

They encode your actual intentions, your edge cases, your understanding of the problem domain. High performers excel at both speed and stability. There's no trade-offs. Tests are how you achieve both.

End quote. Duang says AI can help with test planning, suggest test scenarios, debug, and analyze test features, but that it should never touch test files, write test code, or modify test expectations. Quote, Your tests are your specification. They're your safety net.

They're the encoded wisdom of every bug you fixed and every edge case you've discovered. Guard them zealously. End quote. I'm not sure if I agree or not.

I don't think I have enough experience yet to weigh in with more than a hunch. What do you think? Does this ring true to you, or does it sound overly cautious? Apple redesigns it all.

The headliner announcement from Apple's WWDC keynote was a complete redesign of all major software platforms. Quote, Announced simultaneously for iOS, iPadOS, macOS, watchOS, tvOS, visionOS, and CarPlay. Liquid glass forms a new universal design language for the first time. At its WWDC keynote address, Apple's software chief Craig Federighi said Apple Silicon has become dramatically more powerful, enabling software, materials, and experiences we once could only dream of.

Inspired by VisionOS, liquid glass is layered throughout the system and features rounded corners that have been matched to the curved screens of the devices. It behaves just like glass in the real world and morphs when you need more options or move between views. End quote. I'm not gonna lie, it's giving me Windows Aero vibes.

It'll probably grow on me, but I can't say I'm super excited about this change. The return of textured depth and expressiveness and UI trend I featured last week coming on the heels of Airbnb's redesign is much more interesting to this guy. The rise of judgment over technical skill. Ever since ChatGPT launched our current AI madness, developers have been asking ourselves, and each other, what it all means in the long term.

We still don't have that answer yet, but I can confidently say that at least in the medium term, it means we must move up the value chain because the once cherished technical skills we've acquired are being commoditized at a blistering pace. There's nothing new under the sun. Quote, In 1995, musician and producer Brian Eno made a profound observation about computer sequencers that has become increasingly relevant in our AI-powered world. Quote, The great benefit of computer sequencers is that they remove the issue of skill and replace it with the issue of judgment.

With Cubase or Photoshop, anybody can actually do anything, and you can make stuff that sounds very much like stuff you'd hear on the radio or looks very much like anything you see in magazines. So the question becomes not whether you can do it or not, because any drudge can do it if they're prepared to sit in front of the computer for a few days. The question then is, of all things you can do now, which do you choose to do? End quote.

You know what? Adam and I had a similar conversation about digital photography while on a photo walk in New York City years ago. It was my contention then that the skills required to take great pictures were trending towards zero, and when we get to that point, which we're pretty close to now, the only thing that would matter is taste, which is just another form of judgment. In other words, it's a way of answering the question, of all the perspectives you can now capture, which do you choose to capture?

In one sense, Changelog News is me trying to climb my way up the value chain. Sure, I write some prose too, but not notably well, and I read them aloud to you, but not all that well. What I really do is repeatedly answer the question, of all the things you can feature, which do you choose to feature? It's now time for sponsor news.

Our best customers are now robots. Kurt Mackey and our friends at Fly have had quite the experience. Quote, but a funny thing has happened over the last six months or so. If you look at the numbers, DX, developer experience, might not matter that much.

That's because the users driving the most growth on the platform aren't people at all. They're robots. End quote. We've talked about LLM SEO a few times on the pod, and this is why.

Because you don't have to attract humans when coding agents make tool selections at massive scale. Kurt and his team are now focusing on the ladder. Quote, if you try to think like a robot, you can predict other things they might want. Since robot money spends just the same as people money, I guess we ought to start doing that.

For instance, it should be easy to MCP our API. The robots can then make their own infrastructure decisions. End quote. Lots to glean from this post.

Thanks to fly.io for sharing so candidly and for sponsoring Changelog News. Claude code is my computer. Here's Peter Steinberger. Quote, I run Claude code in no prompt mode.

It saves me an hour a day and hasn't broken my Mac in two months. The $200 per month max plan pays for itself. End quote. This echoes the sentiment that Steve Yegge impressed upon us on last week's show.

After recording that, I took Steve's advice and gave Claude code the old college try at writing a few scripts that I'd procrastinated because they were just too much work for the perceived ROI. Color me impressed. The first script Claude wrote was delivered so well on my specs that I decided to vibe code the second one and didn't even look at the code itself. Worked great.

Peter says this about Claude code. Quote, Claude code shines because it was built command line first, not bolted onto an IDE as an afterthought. The agent has full access to my file system. If you're bold enough, it can execute commands, read output and iterate based on results.

End quote. I think that's right. I like Claude code more than I like Claude inside of Zed. It's even more natural in my terminal than it is in my editor for some reason.

More to come on this front. I'm just getting started. But yeah, up the value chain we go. The curious case of Memvid.

Okay, I'm feeling way too AI bullish in this episode. So here's a nice balancing story. A graduate student created a software project that got a lot of attention online. Like a lot of attention.

Its pitch, quote, Memvid revolutionizes AI memory management by encoding text data into videos, enabling lightning fast semantic search across millions of text chunks with sub-second retrieval times. Unlike traditional vector databases that consume massive amounts of RAM and storage, Memvid compresses your knowledge base into compact video files while maintaining instant access to any piece of information. End quote. Now on its face, that sounds amazing, but also sounds kind of weird.

Why would encoding text into video use less disk space or make anything faster? Well, turns out it doesn't. Quote, testing shows this library's performance is the opposite of what the readme claims. Your text will take 100x more disk space.

Searches will be 5x slower. Setup will take hours, not minutes. This library will cause serious problems at production scale. The readme's performance claims are backwards.

End quote. That was posted as an issue on the repo on the heels of this discovery came a new contribution, a proposal for Memvid 1.0, the universal streamable self-contained AI memory format. Does that sound ambitious? Does it sound sloppy?

One commenter sure thinks so. Quote, GitHub is now infested with AI slop. AI generated repo with obvious overhead and no practical usages. People that has AI replaced brains giving stars to this and AI generated issues.

Perfect. End quote. I guess the AI slopping will continue until morale improves. That's the news for now, but go and subscribe to the Changelog newsletter for the full scoop of links worth clicking on, such as the new HTTP query method, containerized environments for coding agents and markdown with superpowers.

Get in on the newsletter at changelog.news. In case you missed it, last week Steve Yegge shared with us his adventures in babysitting coding agents and Amanda Silver, CVP of the developer division of Microsoft, explained why we're all builders now. And coming up this week on Wednesday, Richard Feldman tells me all about his rock programming language. And on Friday, Justin Searles is back to help us digest all the WWDC announcements.

Have a great week, like, subscribe and leave us a five-star review if you dig the show. And I'll talk to you again real soon.

PodQuesting Dwight J Randolph- WolfShield Media PodQuesting: -By WolfShield Media and Dwight J RandolphJoin us on an exciting journey to master the world of fiction podcasting! At PodQuesting, we document our quest to improve and innovate, sharing valuable insights, strategies, and behind-the-scenes tips along the way. Whether you're an experienced podcaster or just starting your first show, our podcast is your go-to resource for everything podcasting.Discover practical advice, creative techniques, and lessons from our own experiences as we explore the ever-evolving podcasting landscape. Ready to level up your skills and embark on this adventure with us? Tune in and join the quest!Have questions or feedback? Reach out to us at [email protected] and visit our website:WolfShield.Media The PFN Cincinnati Bengals Podcast Pro Football Network The PFN Cincinnati Bengals Podcast is where you can stay up-to-date with the latest news and analysis on the Cincinnati Bengals! Our hosts, industry experts Jay Morrison and Dallas Robinson, provide weekly coverage of all the latest rumors and updates about the Bengals. Don’t forget to follow the show to receive new episodes directly in your podcast feed and leave a rating and review to let us know your thoughts. The 48 Laws of Power by Robert Greene (Full Audiobook) Robert Greene Amoral, cunning, ruthless, and instructive, this multi-million-copy New York Times bestseller is the definitive manual for anyone interested in gaining, observing, or defending against ultimate control – from the author of The Laws of Human Nature.In the book that People magazine proclaimed “beguiling” and “fascinating,” Robert Greene and Joost Elffers have distilled three thousand years of the history of power into 48 essential laws by drawing from the philosophies of Machiavelli, Sun Tzu, and Carl Von Clausewitz and also from the lives of figures ranging from Henry Kissinger to P.T. Barnum.Some laws teach the need for prudence (“Law 1: Never Outshine the Master”), others teach the value of confidence (“Law 28: Enter Action with Boldness”), and many recommend absolute self-preservation (“Law 15: Crush Your Enemy Totally”). Every law, though, has one thing in common: an interest in t Mind Force Radio.com Mind Force Radio.com Natural Strength Night is an informative, humorous, sometimes a little raucous, good-time of myth busting and honest training information from the trenches. We strive to help everyone involved with old school strength training (without steroids) to not make some common training mistakes. Along with great information, you'll hear a fair share of steroid bashing, flamingo sightings, breaking goons, iron game history, and honest drug-free training information from various leaders and strength coaches in the field to help you get real results! If your primary training information comes from reading "Muscle & Fiction" magazine we'll help get you straightened out. If you love high-intensity strength training, dinosaur style training and just like lifting heavy weights ... or loved Jack Lalanne, Sandow, Grimek, Peary Rader's Iron Man magazine, Brad Steiner's articles, Stuart McRobert's Hardgainer, Iron Nation, Osmo Kiiha's The Iron Master, you will love the show.On The Rugged Individual, we

Frequently Asked Questions

How long is this episode of Changelog Master Feed?

This episode is 10 minutes long.

When was this Changelog Master Feed episode published?

This episode was published on June 9, 2025.

What is this episode about?

Diwank explains why you should never let AI writes your tests, Apple redesigns all of their software platforms, AI has brought about the rise of judgement over technical skills, Peter Steinberger says Claude Code is now his computer, and the curious...

Can I download this Changelog Master Feed episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!