Discussion about this post

User's avatar
Quang Vu's avatar

Your ideas are interesting. Is this AI-amplified testing approach genuinely making systems safer, or automating our way into more sophisticated forms of technical debt? What are the reliable method can we do to stay away from technical debt

JP's avatar

Coming to this a bit late but the line about fast changes with weak pipelines really stuck. The same thing applies to AI coding sessions now. The agents are fast but the guardrails lag behind.

Codex CLI recently shipped lifecycle hooks that bring exactly the CI/CD pattern you're describing into the AI session itself. SessionStart validates the environment before the AI touches anything. Stop runs your linter and test suite after it's done. Same pre/post concept as a pipeline's before_script and after_script, just scoped to the coding session instead of the deployment.

I wrote up the setup here https://reading.sh/codex-cli-has-hooks-now-stop-stuffing-agents-md-c181465fe271 because the parallel to CI/CD is too clean to ignore. Would be curious how you'd extend this for enterprise-scale workflows where you've got multiple agents running concurrently.

1 more comment...

No posts

Ready for more?