Automated AI Agent Experiments

Kim Wong

14 Oct 2025 — 1 min read

I built an AI team that works completely autonomously—and it's wild to watch. 🤖

Not just automation. True autonomy.

These agents literally sit at computers using git/bash/editors, making decisions like humans do. And here's the thing: this isn't just for developers. Think business analysts, data analysts, content writers, researchers—any role that only needs a computer and decision thinking.

🧠 The Team:

👨‍💻 Senior Dev Agent - Lives on my VM, codes features from scratch, creates PRs with full context, enhance code based on code review

🧪 QA Agent - Tests the live deployed app, writes comprehensive e2e tests, catches bugs before users do

💬 Code Review on GitHub - Auto PR reviews

💬 @claude on GitHub - Tag it anywhere in issues/PRs, it responds instantly and pushes fixes

🔧 The Stack powering them:

📋 Task Master AI - Orchestrates work, manages dependencies, keeps everyone aligned

🎭 Playwright - Browser automation for realistic testing

🛡️ Hooks - Custom safety rules so agents can't break production

▲ Vercel - Auto-deploys preview environments for every PR, mock develop or QA envirenments

🔍 GitHub Actions - Reviews every line of code, runs CI tests automatically

⚡️ The Flow:

It's not a rigid loop—it's like a real team. Based on task dependencies, agents work in parallel or sequentially. Issues can be raised anytime, fixes happen naturally. Just like human SDLC, but faster, with no breaks or coffee!

Task → Dev codes → PR → Review + CI → Deploy preview → QA tests → E2E tests → Regression tests → Merge ✅

Is this automation? Sure, I still set up the VM, prepare tasks, and merge PRs like a project manager. But that's not the point.

✨ The breakthrough is **autonomy**—agents that make decisions, not follow scripts. I can review their PRs, check deployments, of cuz, still can work on code alongside them. It's like managing a team that never sleeps.

✨ What I learned:

The hardest part wasn't the tech setup. It was learning to treat agents as **decision engines**, not deterministic workflows. How to give them goals, not steps. How to let them adapt, not just execute.

That's what makes them effective.

🔗 Watch them work: https://kimwwk.github.io/ohmydoc-using-claude-code-agent/

🔗 Project repo: https://github.com/kimwwk/ohmydoc-using-claude-code-agent

What could your AI team build while you sleep? 🌙

#AI #AgenticAI #Autonomy #FutureOfWork #ClaudeAI

You are Not Aware of Yourself

EVERYONE is feeding their AI more data. Skills, docs, Slack history, all of it. 📌 One thing that matters most gets missed every time: the implicit working style and patterns you're not even aware of yourself. It's the same as how you sometimes can't tell

Step #1 to use AI better: get rid of ChatGPT RIGHT NOW

So, what's the problem? ❌ The AI has no hands. It can't actually do anything. ❌ The AI doesn't have the real-time data the task needs. ❌ You're not giving the AI enough context (and ChatGPT's built-in memory can make it worse, pulling

2026 May Live Summit First section of School of Hard Knocks

Do you wanna get a Million Dollar Roadmap for the next 12 months? Let me tell you all - It was a incredible section from Queen of AI, Alicia Lyttle. Everything in 10 min! Use any AI who you works with the most, it could be ChatGPT, Claude, Genimi whatever.

I Migrated a 5-Agent System to a New Laptop. Here's What Broke.

Subtitle: A post-mortem on moving a multi-agent Claude Code + OpenCode setup — and what 14 cron jobs revealed about what's actually load-bearing. I've been running a small "Agent OS" — a thin orchestration layer that runs multiple coding agents on cron, each in its own workspace,

Read more

You are Not Aware of Yourself

Step #1 to use AI better: get rid of ChatGPT RIGHT NOW

2026 May Live Summit First section of School of Hard Knocks

I Migrated a 5-Agent System to a New Laptop. Here's What Broke.