Today was a reminder that as powerful as AI has become, it still isn't something you can blindly rely on, especially if it sits anywhere near the critical path of your workflow. I ran into an issue with something I haven't touched under the hood in a couple of months. It's part of my Access database system where I generate images using the OpenAI API. I've been using it for all kinds of things, including slides for my videos, so I can get nice, realistic images without having to go hunting around for stock photos. And on a more personal note, I've also been using it to send Lauren a daily penguin. Most mornings I generate a cute little penguin image with a caption on it. It's kind of become a routine.
This setup has been working perfectly fine for months. I haven't changed the code. I haven't changed the prompts. It's just been doing its thing in the background, nice and reliable. Then all of a sudden today, it just stopped putting the caption on the image. The image itself was fine. The penguins were there, the scene looked great, but the caption was either missing, cut off, or completely unusable. So now I'm digging through my code thinking I broke something. I checked everything. The JSON going out looked fine. The response coming back looked fine. Nothing obvious had changed on my end.
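For context, the request my code sends is basically just a prompt with the caption baked into it, which means the model itself is responsible for rendering the text. Here's a minimal sketch of that shape; my real system is Access VBA, and the model name, prompt wording, and function name here are placeholders for illustration:

```python
import json

def build_image_request(caption, model="gpt-image-1"):
    """Build a JSON payload for an image-generation call.

    The caption is baked into the prompt, so the model itself
    decides how (and whether) the text gets rendered -- that's
    exactly the part that drifted out from under me.
    """
    prompt = (
        "A cute, realistic penguin in a wintry scene. "
        f'Render the caption "{caption}" clearly at the bottom of the image.'
    )
    return json.dumps({
        "model": model,
        "prompt": prompt,
        "size": "1024x1024",
    })

payload = build_image_request("Happy Monday, Lauren!")
```

Notice there's nothing in that payload that pins down *how* the caption gets drawn; the layout is entirely up to the model.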
After going back and forth with ChatGPT and testing a few different things, the likely culprit became clear. The model behavior changed. Not the endpoint, not my code, but the model itself. The way it handled text layout, which had been good for months, shifted just enough to break what I was doing. The exact same prompt that used to produce a nice, clean caption was now producing oversized text, clipped text, or no text at all. In other words, what I thought was a stable system was actually dependent on a moving target.
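One cheap safeguard against a moving target like this is to log the model name and a hash of the prompt alongside every generation, so when output quality shifts you can prove it wasn't your code or your prompt that changed. A small Python sketch of that idea (not my actual Access code):

```python
import hashlib
from datetime import datetime, timezone

def log_generation(model, prompt, caption_ok):
    """Return a log entry with enough context to spot model drift:
    same prompt hash, same code, different results means the model moved."""
    return {
        "ts": datetime.now(timezone.utc).isoformat(),
        "model": model,
        "prompt_sha": hashlib.sha256(prompt.encode("utf-8")).hexdigest()[:12],
        "caption_ok": caption_ok,  # did the caption come out usable?
    }
```

A few weeks of entries like this would have shown the exact day the same prompt hash stopped producing usable captions.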
This is exactly why I'm always telling people to be careful when they start building systems that rely heavily on AI, especially if they don't fully understand what's happening under the hood. It's the same issue I have with vibe coding. Sure, you can throw prompts at an AI and get something working quickly, but if you don't understand the logic, the structure, and the dependencies, what happens when the model changes and your "working" solution suddenly doesn't work anymore? Now you're stuck trying to debug something you didn't really build in the first place.
Fortunately for me, this wasn't business critical. This was my penguin generator. I was able to play around with the prompts, adjust the caption logic, and eventually switch to a newer model that behaves better and more predictably. After some tuning, I got it working again. But if this had been something I relied on for my business, something tied directly to revenue or customer delivery, that kind of unexpected break would be a serious problem.
It reminded me of the Star Trek TNG episode "When the Bough Breaks." The Enterprise encounters the Aldeans, a society living on a hidden, technologically advanced planet run entirely by a system called the Custodian. Everything works perfectly, until it doesn't. The system begins to fail, and the Aldeans are completely helpless because over time they've become so dependent on it that they no longer understand how it works or how to fix it. That's the danger of relying on something you don't control. When it behaves the way you expect, everything is great. But the moment it changes, or breaks, you're stuck scrambling to figure out a system you didn't build and can't fully see inside.
So the takeaway here is simple. AI is an incredible tool, and I'm using it more and more every day. But it's still a tool, not a foundation you should blindly build on without safeguards and backups. If you're incorporating AI into your workflow, especially anything important, make sure you understand where the boundaries are. Know what parts are deterministic and under your control, and what parts are probabilistic and subject to change. Because as I was reminded today, the rug can get pulled out from underneath you at any time.
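To make that split concrete, here's a sketch of the kind of guardrail I mean. Each generation call is probabilistic, but the validation rules and the fallback order are deterministic and under your control. Python for illustration; the function names and model names are made up:

```python
def generate_with_fallback(generate, validate, models):
    """Try each model in order until one passes validation.

    The probabilistic part is each generate() call; the deterministic
    part -- the validation check and the fallback order -- stays in
    our hands, so a model change degrades gracefully instead of
    silently breaking the output.
    """
    for model in models:
        result = generate(model)
        if validate(result):
            return model, result
    raise RuntimeError("no model produced a usable image")
```

Even a crude validate() (is the caption present? is the file a sane size?) turns a silent failure into a loud one.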
LLAP RR
P.S. I'll include some of the images I generated below, so you can see exactly how and when it broke and what I had to do to fix it.
I messed around with the prompt and changed some things, and once I got really forceful with the wording, it started putting the caption back on. But the caption was still cut off.
I then spent half an hour talking through the issue with ChatGPT, and it gave me a bunch of prompt guidelines, none of which worked. Finally it suggested that I upgrade from image model 1.5 to model 2.0, and that fixed the problem.
One other thing I discovered in the process is that yes, the newer image model is a bit more expensive. With the old setup, I was paying roughly three-tenths of a cent per image, and now it's closer to about half a cent per image. So technically, yes, the cost did go up. But let's put that in perspective. I generate maybe one of these penguin images a day, and one or two images for video slides. So we're talking about an extra fifty cents a month, maybe a buck fifty total. Meanwhile, I spent a couple of hours troubleshooting, testing prompts, checking my code, and figuring out what changed.
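Back-of-the-envelope, using the rounded per-image prices above and about three images a day, the difference is small; by this rough math it actually comes in even under my fifty-cent guess:

```python
# Rough monthly cost comparison, in dollars.
# Per-image prices are the rounded figures from the text;
# the daily image count is my own ballpark.
OLD_COST = 0.003     # roughly three-tenths of a cent per image
NEW_COST = 0.005     # roughly half a cent per image
IMAGES_PER_DAY = 3   # one penguin plus a slide image or two
DAYS = 30

old_monthly = OLD_COST * IMAGES_PER_DAY * DAYS
new_monthly = NEW_COST * IMAGES_PER_DAY * DAYS
extra = new_monthly - old_monthly
```

So the upgrade costs pennies a month either way, which is the point of the next paragraph.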
So from a practical standpoint, the time I already spent debugging this problem cost me far more than I will ever spend on generating these images. That's another important lesson here. Sometimes it's not about squeezing every last fraction of a cent out of your API usage. It's about reliability and your own time. If a slightly more expensive model behaves better and saves you from chasing down weird issues like this, it's probably worth it.
Bryan Coleman
2 days ago
I can hear Mr Data saying "Intriguing. It appears the algorithmic processing has encountered a deviation from the intended logical path."
or maybe something like "These results are like chasing an untamed ornithoid without cause."
Thomas Gonder
25 hours ago
I finally broke down and wrote, with the vibe-coding help of ChatGPT, a backup routine for my BE .accdb files to quickly back up and restore a set during testing. I went through ten-plus big iterations of the program after I created the original form.
There were so many little gotchas. The worst was that it restored mixed sets because it improperly used the Windows Date Modified property instead of a unique date/time code for each set. I spent hours pulling my hair out over data I thought I had fixed, yet the bad stuff kept coming back.
AI is a great tool, but it requires relentless validation testing. It makes lots of assumptions that just don't hold up. The final step, once I get something that appears to work, is to debug-step through every line of code to see if it really makes sense. And then comes cleanup: ridiculously long variable names, For i = 1 To 10 loops, convoluted GOTOs, and my favorite, C = B = A when only A would have been just fine.