Why vibe coding needs observability before you automate

2026.06.04// field-notes5 min read

A few mornings ago the content pipeline fired on schedule, marked itself as having run, and produced nothing. No draft. No error. No log line. The scheduled trigger updated its own lastRunAt, set the next run, and went back to sleep, and as far as every status field was concerned the job had worked. It had not. Somewhere between the wake-up and the part that actually writes a post, the whole run evaporated, and it did it without leaving a single mark to say so.

That is the failure mode that should scare you more than a crash. A crash is loud. It hands you a stack trace, a timestamp, a place to start. A silent failure hands you nothing, and worse, it hands you false confidence. The pipeline still showed enabled: true. It still had a nextRunAt sitting in the future. Every surface I could check said the system was healthy. The only thing that disagreed was reality: there was no post. The surface said fine. The outcome said broken. I have written before about how routinely the surface and the substance disagree, and this was that gap at its most expensive, because the surface was not just wrong, it was reassuring.

Here is the part that stuck. I cannot show you the log from that morning. There isn't one. The logging didn't exist yet. That absence is the whole indictment: the system failed in exactly the way you can't debug after the fact, because it left nothing behind to debug. You can't grep a void.

build the trace before you build the trust

So instead of chasing the specific bug, I built the thing that should have existed before the pipeline ever ran unattended: an observability layer. This is a vibe-coded pipeline, the whole content engine is AI orchestration rather than hand-written code, and that is exactly why the instrumentation had to be airtight. You cannot eyeball an AI-shipped system into trustworthiness. You have to make it report on itself.

One file, STATE.json, became the single source of truth for where the pipeline is and what happened: currentStage, stageStatus, lastError, outcome, outcomeReason. Next to it, an append-only run log per day. Every stage now writes STATE.json on every transition and on every error. The rule that makes it hold: if a stage can't even write its own status, that failure to write is itself the signal. There is no longer a path where the pipeline does something that matters and says nothing about it. I also moved the schedule off a trigger that only fired while my editor was open and onto one that fires unattended, because a reliability layer that only works when you're watching isn't one.

the no-op that proved it

This morning the pipeline ran again, at 05:07, with nobody watching. It read the day's research file, found nothing worth drafting, and stopped. The old version of that decision is indistinguishable from the silent failure: in both cases, no post appears. The difference is that now I can read exactly which one happened. STATE.json says it plainly:

"outcome": "clean-noop", "outcomeReason": "Research file declared zero DAILY DRAFT OPPORTUNITY blocks; strongest candidate deferred to weekly on freshness-vs-durability grounds."

The run log spells out the rest: which file it read, how many draft opportunities it found (zero), why the one promising candidate got held for a weekly piece instead of forced into a same-day reaction. A quiet day and a dead day produce the same empty output. Only instrumentation tells them apart, and now it does. For anything you vibe code and then trust to a schedule, that gap between quiet and dead is the whole game.

the failure surface is part of the system

This isn't an afterthought on this project, it's a written invariant. It has a name, I-NOOP-CLEAN: any stage that finds no work must exit with no commit, no partial state, and a recorded reason. A quiet day is allowed to ship nothing. It is not allowed to ship nothing silently. That distinction is most of the methodology in one line. SpecMesh, the discipline this whole project runs on, treats the failure surface as part of the system, not as an exception path you bolt on later. You specify how the thing reports that it did nothing with the same care you specify how it does the work. It's the same instinct that once had me build a navigation fallback before the path that needed it, back when the surface said a link worked and the destination was broken.

Most vibe coding runs the other way. You build the happy path, watch it work once, wire it to a schedule, and walk away. Logging is the thing you add the day something breaks, which is precisely the day it's too late, because the break that would have taught you the lesson left no trace to learn from. The order is backwards. Observability isn't the reward for a system that has earned trust. It's the precondition for handing the system any trust at all. When nobody owns that layer, you get the kind of quiet damage I wrote about in what gets skipped when nobody owns enforcement. Build the failure mode first. Make the machine tell you when it does nothing, before you let it do anything unattended.

when your automation can fail unseen

If you have a vibe-coded automation or build pipeline you're afraid to leave running, because you wouldn't know if it quietly stopped working, that fear is correct and it's fixable. The fix isn't more monitoring bolted on top. It's an observability layer specified into the pipeline from the start, so a silent failure becomes impossible by construction rather than caught by luck. Send the workflow and the point where it goes dark on you. VibeKoded can scope an observability audit of where your pipeline can fail unseen, a spec-first rebuild of the orchestration layer, or a standing instrumentation setup that makes every run account for itself, so leaving it unattended is a decision instead of a gamble. Work with VibeKoded.

The silent failure taught one durable thing before it disappeared: a system you can't observe is a system you can't trust, no matter how healthy its status fields look. So you build the part that tells the truth first, and you let it earn the automation second. That's what keeps vibe coding honest past the prototype. Ship the failure mode before the happy path. The first time it saves you, there will be a log to prove it.