Notes & Meditations
A stream of consciousness, fragments of ideas, and things I've learned along the way. Dense, scannable, and perpetually growing.
adenhq/hive: Outcome driven agent development framework that evolves
Hive frames agent development as outcome-driven and iterative: define goals, generate a graph, run it, capture failures, and evolve the agent based on what breaks. I like that it’s trying to be production-oriented (human-in-the-loop nodes, credentials, monitoring, cost controls) instead of yet another “chain some tools” demo framework. The “self-improving” angle matters most if the failure data is actually actionable and you can keep the evolution process auditable. It’s also notable they explicitly target coding-agent workflows (Claude Code/Cursor/OpenCode) as the interface for building and debugging. This feels like an attempt to turn agent-building into a maintainable system, not a pile of prompts.
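To make the framing concrete, here's roughly what that "run, capture failures, evolve" loop could look like. This is my own sketch with my own names, not Hive's actual API:

```python
# Illustrative only: my naming for the "run, capture failures, evolve" loop, not Hive's API.
from dataclasses import dataclass, field
from typing import Callable, List

@dataclass
class RunResult:
    achieved: bool
    failures: List[str] = field(default_factory=list)  # only useful if specific enough to act on

def evolve(agent_graph, check_outcome: Callable[[object], RunResult], max_iterations: int = 5):
    """Run the graph against an outcome, feed captured failures into the next revision."""
    for _ in range(max_iterations):
        result = check_outcome(agent_graph.run())
        if result.achieved:
            return agent_graph
        # The whole bet: failures are actionable, and each revision stays auditable
        # (logged, diffable, reviewable by a human-in-the-loop step if needed).
        agent_graph = agent_graph.revise(failures=result.failures)
    return agent_graph
```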
glittercowboy/get-shit-done: A light-weight and powerful meta-prompting, context engineering and spec-driven development system for Claude Code and OpenCode.
GSD is a bluntly pragmatic layer on top of Claude Code/OpenCode/Gemini aimed at one thing: stopping “context rot” from killing long projects. The pitch is spec-driven development without the enterprise cosplay—lightweight commands that still enforce structure (requirements, roadmap, phased plans, verification). The workflow is basically: extract intent, generate plans small enough to run in fresh context windows, execute with tight state management, and keep the git history clean. I’m sympathetic to the premise: reliability comes more from process + context engineering than from hoping the model “stays smart” 80k tokens in. Even if you don’t adopt the tool, the patterns are a useful blueprint.
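A toy sketch of the core constraint as I read it: phases sized to run in a fresh context window, each carrying its own verification. These names and the token budget are mine, not GSD's actual commands or file format:

```python
# My sketch of "plans small enough to run in fresh context windows" -- not GSD's format.
from dataclasses import dataclass
from typing import Callable, List

PHASE_BUDGET_TOKENS = 30_000  # assumed budget: a phase must fit comfortably in a fresh window

@dataclass
class Phase:
    name: str
    spec: str          # the requirements this phase must satisfy
    verification: str  # how we confirm it actually worked

def plan_fits(phases: List[Phase], estimate_tokens: Callable[[str], int]) -> bool:
    """Every phase must be executable on its own, without dragging in prior context."""
    return all(
        estimate_tokens(p.spec) + estimate_tokens(p.verification) <= PHASE_BUDGET_TOKENS
        for p in phases
    )
```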
How StrongDM’s AI team build serious software without even looking at the code
Simon’s writeup is a great tour of the “dark factory” end of the spectrum, with the real question front-and-center: if agents write both the code and the tests, what does “proof it works” even mean? The StrongDM approach—scenario holdout sets + LLM-as-judge + a “digital twin universe” for dependencies—reads like the first serious attempt at an answer. The clever bit is treating scenarios like evaluation data: useful for validating, dangerous to leak into the training loop (or the agent’s context). The takeaway isn’t “never review code”; it’s that verification needs to move up a level to behavior under realistic conditions. It’s an uncomfortable idea, which is why it’s interesting.
StrongDM Software Factory
StrongDM’s “software factory” pitch is basically: push humans out of the code path and move correctness into specs + scenarios + harnesses. The part that sticks is the insistence that humans shouldn’t even review the code—which forces you to build a validation loop you actually trust. I like their reframing from “tests” to “scenarios” as an external holdout set, plus “satisfaction” as a probabilistic measure rather than a green checkmark. The Digital Twin Universe idea (cloning SaaS dependencies to test at huge volume) is a pragmatic answer to reward hacking and rate limits. Even if you don’t buy the extremism, the constraints feel like a useful direction-of-travel: validate behavior, not diffs.
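The pattern is easier to see as code. This is my sketch of the idea, not StrongDM's system: scenarios as an external holdout set, judged by an LLM, producing a probabilistic "satisfaction" score instead of a pass/fail bit:

```python
# My sketch of the pattern: scenario holdout set + LLM-as-judge. All names are mine.
from dataclasses import dataclass
from typing import Callable, List
import random

@dataclass
class Scenario:
    setup: str        # realistic preconditions (accounts, data, dependency state)
    action: str       # what the user or system does
    expectation: str  # expected behavior, described in plain language

def satisfaction(run_system: Callable[[Scenario], str],
                 judge: Callable[[str, str], bool],
                 scenarios: List[Scenario],
                 sample_size: int = 200) -> float:
    """Score behavior under realistic conditions. The agent that writes the code
    never sees these scenarios -- exactly like holdout data in ML."""
    sampled = random.sample(scenarios, min(sample_size, len(scenarios)))
    passed = sum(judge(run_system(s), s.expectation) for s in sampled)
    return passed / len(sampled)
```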
Agents that never forget (memory is a product feature)
“Memory” sounds like a UX feature until you try to build it. Then it turns into: retrieval quality, write policies, privacy boundaries, and failure modes.
A good reminder: persistent memory isn’t magic. It’s a system.
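A toy sketch of why it's a system, not a feature: even the dumbest possible version needs a write policy, a privacy boundary, and a retrieval step that can fail. All names here are mine, purely illustrative:

```python
# Toy memory store: write policy + privacy scope + (naive) retrieval. Illustrative only.
from dataclasses import dataclass
import time

@dataclass
class Memory:
    text: str
    scope: str        # privacy boundary: "user", "team", or "session"
    created_at: float

class MemoryStore:
    def __init__(self):
        self.items: list[Memory] = []

    def write(self, text: str, scope: str) -> bool:
        """Write policy: decide what is worth persisting and where it is allowed to live."""
        if scope not in {"user", "team", "session"}:
            return False          # privacy boundary enforced at write time, not read time
        if len(text.strip()) < 10:
            return False          # don't persist noise
        self.items.append(Memory(text, scope, time.time()))
        return True

    def retrieve(self, query: str, scope: str, k: int = 3) -> list[Memory]:
        """Retrieval quality is its own problem; naive keyword overlap shown here."""
        candidates = [m for m in self.items if m.scope == scope]
        query_words = set(query.lower().split())
        candidates.sort(key=lambda m: len(query_words & set(m.text.lower().split())),
                        reverse=True)
        return candidates[:k]
```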
Awkward 1:1s and getting honest feedback
If your 1:1s feel “fine” but you never learn anything new, you might be running a status meeting with better branding.
I like tactics that make it safer for people to tell you the uncomfortable truth.
FastMCP 3.0 notes
MCP is one of those things that looks like plumbing… because it is. But it’s the kind of plumbing that determines whether your “agent” can actually do work.
Worth tracking if you’re building tool-enabled systems.
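For reference, this is what the plumbing looks like with FastMCP's 2.x-style decorator API (I haven't verified what 3.0 changes); the tool itself is a throwaway example:

```python
from fastmcp import FastMCP

mcp = FastMCP("notes-demo")

@mcp.tool()
def add(a: int, b: int) -> int:
    """Add two numbers (stand-in for a real tool an agent would call)."""
    return a + b

if __name__ == "__main__":
    mcp.run()  # exposes the tool over MCP so a client (e.g. Claude Code) can call it
```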
Outcome-driven agent development (hive)
I like the outcome-driven framing for agent work: stop evaluating “did it write code,” start evaluating “did it move an outcome forward.”
Most agent setups fail for the same reason delegation fails: unclear intent + mismatched authority.
Hiring and managing cracked engineers (notes)
The best hiring writing is really about systems design for humans:
- what you reward
- what you tolerate
- what you make easy
“Cracked” is just a label for unmanaged variance.
Everything we’ve learned about hiring for startups (so far)
Hiring advice usually fails because it ignores constraints.
I like writing that admits the real trade: speed vs quality vs cohesion — and then tells you what to do anyway.
Software Factory: build serious software without reading the code
A great framing for agent-assisted development: treat code as an artifact, but do your real thinking at the level of specs, tests, and constraints.
Feels like the “staff engineer” version of using LLMs: you win by shaping the work, not by typing faster.
A good personal site is a product
Good personal sites feel less like a resume and more like a product:
- clear entry points
- consistent voice
- obvious navigation
- useful defaults
Which is comforting, because at least we know how to build products.
Unrolling the agent loop
A useful mental model: agents are not “one prompt.” They’re a loop:
- observe
- plan
- act
- verify
- recover
If your loop can’t verify, you don’t have an agent. You have autocomplete with delusions.
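A sketch of the loop as a function, not any particular framework's API; each step is passed in as a callable, and all the names are placeholders:

```python
# Sketch of the observe -> plan -> act -> verify -> recover loop. Names are placeholders.
def agent_loop(observe, plan, act, verify, recover, max_steps: int = 10):
    """Each argument is a callable implementing one step of the loop."""
    state = {"history": []}
    for _ in range(max_steps):
        obs = observe(state)                       # what does the world look like now?
        action = plan(state, obs)                  # decide the next action
        result = act(action)                       # do it
        ok, evidence = verify(action, result)      # the step most setups skip
        state["history"].append((action, result, evidence))
        if ok:
            return result                          # a *verified* result, not just a plausible one
        state = recover(state, evidence)           # adjust, retry, or escalate to a human
    raise RuntimeError("no verified result within the step budget")
```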
Make delegation check-ins about risk, not status
Check-ins shouldn’t be “are you done yet?”
Make them risk gates:
- approach chosen
- unknowns resolved
- PR drafted
- rollout plan + rollback ready
- metrics/alerts in place
You stay out of the weeds without flying blind.
Competence can be a shield
If competence was how you stayed safe early in life, delegation won’t just feel “inefficient.”
It will feel threatening.
Because you’re not handing off a task. You’re touching an identity:
- I can handle it.
- I don’t need help.
- I’m valuable because I can execute.
The fix isn’t “delegate more.” The fix is recognizing that your nervous system thinks delegation is a risk.
Delegate constraints, not solutions
A surprisingly good default:
- Don’t delegate how.
- Delegate constraints.
Example constraints that actually protect quality:
- backwards compatible
- observable
- easy rollback
- clear definition of done
It’s the difference between guidance and remote-control engineering.
Delegation has levels (and mismatching them creates pain)
When someone says “delegate,” they might mean three different things:
- Do exactly this (recipe execution)
- Achieve this outcome (you propose a plan; I review risk/constraints)
- Own this area (direction + boundaries; you drive)
Most delegation pain is a mismatch:
- expecting level 3 behavior with level 1 authority
- or delegating level 3 responsibility with level 1 clarity
If you delegate the task but keep all the decisions, you created a ticket
If you delegate the task but keep all the decisions, you didn’t delegate.
You created a ticket.
(And then you’ll wonder why you’re still the bottleneck.)