A working journal

An operator becoming AI-native, in public.

I've spent years building operational backbones for organizations — turning chaos into something people can run on. The craft is mostly analog: patterns, processes, organizational design.

AI is the new layer in that craft. This site is the journal of how I'm integrating it. Field reports, frameworks, tools I built for myself. Some of what I'm working through ends up here.


Recent essays
The manual

A practitioner's reference for community-rooted organizations writing their first internal AI policy. Vendor-agnostic by design. Free, ungated.

Open the manual →

If you're at one of four moments — turning around an operation that's breaking, launching something new, expanding past what your current systems can hold, or stepping back from something you built — and what's on this site lands for you, you can tell me about it.

Insights

Field reports, frameworks, and tools I'm building for myself as I integrate AI into the operator's craft.

~1100 words
I'm Not Going Back to School. AI Is My OJT.
~1500 words
The Bible Method: How to Stop AI from Going Off the Rails
~1300 words
Why Your AI Doesn't Remember You — and How to Fix It
~1100 words
The Best Way to Learn from AI: Make It Show Its Work
~2000 words
I Made AI Audit My Own Code. Here's What Broke.
~1500 words
Why Non-Coders Should Still Be the Architect
~1300 words
I Hired Three AIs to Fight About My Codebase
~1000 words
What I'm Spending on AI — A Live Receipt
~1300 words
Why I Use What I Use: An Operator's AI Stack
~1300 words
Artifacts: How I Test the Feel Before I Touch Code
~1500 words
Before I Used AI for Work, I Used It for My Family
~1700 words
Good People Have to Be in the Room
~1700 words
Stop Building AI in the Desert
~1100 words
I Did Most of This Without Browser Automation. Then I Got It.
~1400 words
How a Project Management Tool Made My AI Coder Possible
~1500 words
I Don't Have a Fancy Degree. I Have a Service Record. Here's How I Made It Readable.

AI at Work: A Manual for Orgs Like Ours

For community-rooted organizations writing their first internal AI policy. A practitioner's reference, not a sales pitch.

Drafting in progress · 10 of 13 chapters drafted · Click any drafted chapter to read

DOWNLOAD The drafted manual (PDF) 10 chapters · ~77.8 KB · single PDF

Take it offline. Edit it. Adapt the policy language into your own document. Re-download as new chapters ship.

Part: The Landscape
01
Who this manual is for
DRAFTED
02
The three fears
DRAFTED
03
From Avoidance to Constraint to Unification
DRAFTED
Part: Frameworks
04
The Bible Method
Lifted from Essay 2
FROM ESSAY
05
Memory and the IP boundary
DRAFTED
06
The audit practice
Lifted from Essay 5
FROM ESSAY
Part: Operational Practice
07
Using AI Without Leaking IP
DRAFTED
08
Enterprise AI Governance
DRAFTED
09
AI for community-rooted organizations
DRAFTED
10
The cost question
Lifted from Essay 8
FROM ESSAY
Part: Adoption
11
Sample Policy Language
DRAFTED
12
Rolling It Out
DRAFTED
13
What this manual doesn't cover
DRAFTED

About David Serrano

Hudson Valley, NY
David Serrano
David Serrano · Hudson Valley, NY

What I do

I've spent over twenty years building operational backbones — the systems behind the work, not the surface in front of it. The workflows, the data, the financial mechanics, the cross-team coordination, the audit trail. The kind of operating system a CFO or COO recognizes when they walk through their own operation, except cleaner, tighter, and now with AI built in where it actually pays off.

I'm not an AI consultant who learned the field last summer. I'm an operations-and-systems guy who's been working with AI long enough to know what it can do and where it falls short. The distinction matters. Anyone can call themselves an AI integrator after a weekend of tutorials. Two decades of organizing operational chaos for businesses in regulated industries shows up in the work, not in the bio.

Background

Before AI, the work was already specific: organize the operational chaos, raise the cashflow underneath, lift enterprise valuation through clean digitization. Navy foundation. Compliance background. Operators and businesses across regulated industries — healthcare, finance, government contracting, and other sectors where audit pressure is constant and the stakes of getting it wrong are real.

I currently lead operations for a publicly-funded workforce program — adult learners, training programs, the kind of work where a system the people who actually run it can't operate is a system that doesn't survive contact with reality.

Origin — my education, OJT

I don't have a fancy degree. And if I just told you my Navy rating — Personnel Specialist, PS3 — you probably still wouldn't know what I did. Most civilians can't read a service record. Most veterans never translate it. The result is a real credential nobody can see.

So here it is. The OJT. The actual school I went to.

I enlisted in the United States Navy on 4 July 2005 in San Juan, Puerto Rico, age nineteen. Six years later I was honorably retired on a permanent disability rating. In between: a guided-missile cruiser forward-deployed to Yokosuka, Japan; a 6,500-record paper-to-digital records vault at a Navy submarine base in Connecticut; a Navy and Marine Corps Achievement Medal for compressing pay-cycle processing time from sixty days to fifteen; a Letter of Commendation from a Rear Admiral; the Enlisted Surface Warfare Specialist pin; Damage Control qualifications including Hotsuit Man on a flight deck; and Letters of Appreciation from my Captain for community-relations work in Korean and Malaysian children's homes during liberty.

None of that maps to a clean civilian title. All of it shaped how I work today. Each section below cites a primary document — a scanned page from my own service record. If a claim sounds inflated, it isn't; it has a citation behind it. The point isn't the medals. The point is: this was the school I went to.

Boot Camp — Recruit Training Command, Great Lakes, Illinois.

Before A School came boot camp. I shipped to Recruit Training Command in Great Lakes, Illinois — the Navy's only enlisted boot camp — for the standard recruit training pipeline. The school exists to convert civilians into sailors: military bearing, basic seamanship, swim qualifications, watch-standing, weapons familiarization, the chain-of-command, the standards, and the discipline that everything else in the Navy's training stack rests on top of.

You don't get to A School without it. You don't really learn the rating until you have the foundation underneath it. Boot camp is where the foundation gets poured. The lesson I carried out, beyond the obvious physical and procedural ones: a system you don't take seriously at the bottom doesn't hold up at the top. Standards aren't theater. They're load-bearing.

A School — Personnelman, Naval Technical Training Center, Meridian, Mississippi.

After enlisting in San Juan, I shipped to NTC Meridian for Personnelman "A" School (CIN: A-560-0014), the Navy's rating-specific school for personnel and records work. I completed the prescribed course of instruction on 1 July 2005, signed by the school's Commanding Officer, a Commander, USN. The American Council on Education credited the course as Lower Division Baccalaureate — two semester hours in keyboarding, three in personnel supervision.

From there I rolled into rate-extension training: DK Fiscal (A-542-0014) on 21 April 2006 and DK Travel Pay (A-542-0013) on 5 May 2006, both at the Center for Service Support, Training Support Center San Diego. Those are the courses that turn a fresh sailor into someone allowed to touch other sailors' pay records. Mistakes there move money the wrong direction; the school exists because every error has a financial consequence downstream. I learned, before I learned anything else in the Navy, that personnel records are operating-system records — every change has a downstream effect, and someone is going to audit it.

Sea tour — USS Cowpens (CG-63), forward-deployed to Yokosuka, Japan.

I reported aboard USS Cowpens, a Ticonderoga-class guided-missile cruiser, on 29 August 2005. Cowpens was a Seventh Fleet asset — forward-deployed to Yokosuka, Japan, which means the ship lived overseas and I lived overseas with it. Crew of 340. Over the next three years she ran the Western Pacific operating area: bilateral and multinational exercises across Asia, port calls in Korea, Singapore, Guam, Malaysia, the Philippines, and the bread-and-butter forward-presence work of a Pacific cruiser. I crossed the equator. I traveled the world in a way most people my age, from where I came from, didn't.

The ship was the school. Everyone aboard was either teaching me something on purpose or teaching me something by accident. I learned more in the first year on Cowpens than I did in the schoolhouse. The Navy is built that way for a reason — the classroom can only get you so close to the work; the rest is the work itself, supervised, in the actual environment.

The dual life — Personnel Specialist by rating, Damage Control Expert by combat station.

By rating I was a PS — admin and personnel work in the ship's office. By combat-station assignment I was on Repair 5, the damage-control team that responds when the ship is hit, on fire, or flooding. Those are different worlds. Most PS sailors don't qualify deep into Damage Control. I did, and the qualification list shows it.

From the Personnel Qualification Standards record: Advanced Damage Control. Crash and Salvage Crewman / Rescueman — the qualification you hear called "Hotsuit Man" on an air-capable ship's flight deck, the sailor in the silver fire suit who runs toward the burning helicopter. Aqueous Film Forming Foam Operator. Repair Party Investigator. Crash and Salvage Scene Leader. Advanced First Aid / Stretcher Bearer. Advanced Chemical-Biological-Radiological Defense Person. Team Leader. Advanced Shipboard Firefighter. Petty Officer of the Watch. Air Capable Ships Flight Deck Observer. Reaction Force Member. Plus weapons qualifications: Marksman M-16 Rifle, Sharpshooter 9mm Pistol.

As Repair 5 Team Leader I trained and supervised twelve sailors through drill cycles. Two roles, simultaneously, on the same ship: the ship's office in the morning; the helo deck in a silver fire suit when the alarm went off. That part of the work — see the failure, move toward it, fix the system, get everyone home — never left me. I now use it as the metaphor for what I do in business. I firefight in business; I find it to be as thrilling.

The NAM — a 60-day cycle compressed to 15 days.

The work that earned the Navy and Marine Corps Achievement Medal was a process redesign aboard Cowpens. The Commanding Officer of USS Cowpens, a Captain, USN, signed the citation on 20 June 2007. The citation states the achievement in measurable terms:

"Petty Officer Serrano was directly responsible for decreasing processing time of newly reported personnel's pay, allowances, and travel claims from 60 to 15 days. He was instrumental in COWPENS receiving an overall grade of EXCELLENT from Fleet Examining Group during an unannounced inspection."

Four-fold cycle-time reduction. EXCELLENT during an unannounced inspection. Fleet inspectors aren't friends; they arrive without warning, pull records, and grade the ship. EXCELLENT means the records hold up under scrutiny.

The medal recognized two things at once: a measurable cycle-time reduction and the audit-survivability of the system that produced it. Cycle time and audit grade are the same conversation. You don't get one without the other. That's the lesson I carried out, and it's the lesson I carry into every AI integration project today: a long cycle isn't just slow — it's where errors hide, where context decays, where decisions get duplicated. Compressing a cycle is the same work as auditing the system that runs it. You can't tighten what you don't understand.

A Letter of Commendation from a Rear Admiral.

Later in 2007 I received a Letter of Commendation from the Rear Admiral commanding Carrier Strike Group Five, Battle Force Seventh Fleet, Surface Combatant Force Seventh Fleet. The citation covers the May–September 2007 deployment to the Western Pacific and reads, in part: meticulous processing of all receipts, transfers, separations, and reenlistments for the 340-sailor crew; personal responsibility for two hundred pay and personnel-impacting transactions; exceptional leadership and training on the flight deck during the multinational exercises TALISMAN SABER, VALIANT SHIELD, and MALABAR.

Flag-officer-level recognition from a Strike Group Commander, at the E-4 pay grade, is uncommon. I keep that one nearby because it's the kind of credential that proves substance over signal: a one-star at fleet level reading my name and writing it down.

Liberty in port — orphanages and children's homes.

Twice during the Cowpens tour, the Captain signed a Letter of Appreciation for community-relations work I did during liberty. On 22 March 2007 in Busan, Korea, I spent my limited liberty hours with handicapped children and adults from the Seon-Ah-Won orphanage, taking them to a nearby park and a marine-nature museum. On 19 August 2007 in Petaling Jaya, Malaysia, I painted the Rumah Kanak Kanak Trinit children's home — the outside of the house, the iron gates, the interior window frames — and brought toys for the children, interacted with them, gave them love.

Liberty is what most sailors guard jealously — it's the time the ship gives back to you in port. I gave it up. The Captain noticed both times. Two letters in the record acknowledged it. This is the values part of the file. It's how I was raised; it's how I still work.

ESWS — the system-of-systems credential.

On 3 April 2008 I earned the Enlisted Surface Warfare Specialist (ESWS) pin aboard Cowpens. The certificate is signed by the Command Master Chief and the Commanding Officer of the ship.

ESWS is not a participation award. To earn it on a guided-missile cruiser you have to demonstrate working knowledge of every department's mission and every system that contributes to the ship's combat readiness. Propulsion. Weapons. Navigation. Combat systems. Damage control. Aviation. Supply. Communications. Admin. Each department has a qualifications card; each card has signatures from senior personnel in that department; you walk every space, learn every system, defend it in oral boards, and if you can't, you don't pin.

ESWS is the pin you earn when you've forced yourself to understand the whole ship — not just your job. That is the same training I now apply to every business operating system I touch: don't just understand your function, understand the whole vessel.

Shore tour — PSD New London (Naval Submarine Base, Groton, CT).

After Cowpens I rotated to Personnel Support Detachment New London (PSD NLON) at Naval Submarine Base New London / Groton, Connecticut. PSD NLON is a Personnel Support Detachment colocated with the submarine base — the pipeline of sailors flowing through the submarine training command depends on accurate pay, accurate records, accurate orders.

My title was Service Record Vault Assistant Leading Petty Officer (ALPO). I supervised three military personnel and oversaw 6,500-plus field service records. My evaluation period ending 15 June 2010 documents the substance: roughly fifty PCS transfers executed flawlessly, seventy-plus NAVPERS audits, over seven hundred advancement worksheets prepared, and instrumental work in the implementation of CMP/CMS testing. I was selected as PSD Bluejacket of the Quarter. The same period is credited with the Meritorious Unit Commendation that appears on my DD-214.

Earlier in the tour, in a separate role within PSD as a Travel Clerk and worksheet runner, I created, verified, and distributed roughly fifteen hundred advancement worksheets for sailors testing for promotion. The 6,500 number is the records I oversaw; the 1,500 number is the worksheets I personally generated. Both are in the evaluations.

The digital transition I led — and the one that processed my own record on the way out.

The records system at PSD NLON was being transitioned from paper to NSIPS — the Navy Standard Integrated Personnel System — and to the Electronic Service Record. I worked the transition. The Letter of Commendation signed by my Commanding Officer at PSD New London, a Captain, USN, documents what that meant: I audited more than five hundred Good Conduct Service Record entries, verified over four hundred entries directly inside NSIPS, and supported the financial-records transitions for three submarine homeport changes. (A homeport change is the kind of administrative event that, mishandled, leaves a sailor's pay sitting at the wrong duty station.) The same Letter of Commendation noted my Chief of Naval Operations 3rd Quarter 2009 nomination for Junior Sailor of the Quarter.

Then comes the symmetry — the part of the record I tell people about now whenever the irony lands. When I separated, my own field service record had to be closed out. The closeout memorandum was issued by PSD New London, signed by direction. My own record was scanned and processed through the same NSIPS / EMPRS / WERR digital pipeline I had spent my shore tour helping to build. The system I'd worked on closed me out the same way I'd closed out hundreds of other sailors. Full circle.

That's the part I want operators to feel. The paper-to-digital transition I supervised at PSD NLON is the same shape as the analog-to-AI transition organizations are facing now. The principles I learned then — understand the analog system completely first, audit pressure is the design constraint, every change has a downstream financial consequence — are the same principles I apply to every AI integration today.

Medical retirement — and what this adds up to.

I was medically retired on a service-connected injury — four surgeries; separation under SECNAVINST 1850.4E; narrative reason: permanent disability retirement; character of service: HONORABLE. The injury came from damage-control operations during the sea tour — the part of the work I loved most. It cut the career short; it didn't undo the education.

What I left with is what I bring to every business operating system I build now. Systems thinking, end-to-end. Audit pressure as the design constraint, not an afterthought. Records as ground truth, not paperwork. The patience to understand the analog system completely before replacing it with a digital one. The DC instinct: see the failure, move toward it, fix the system. The community instinct from the orphanages: the work isn't only the work; it's also what you do for people while you're doing it.

I don't have a fancy degree. I have a rating, a list of qualifications most civilians can't read, and a stack of citations from commanding officers — including a Rear Admiral — who watched the work get done. That's the school I went to. Every section of this site sits on top of it.

Each section above cites a specific primary document scanned from my own service record. The full source-of-truth bio with numbered citations is maintained as a working file alongside this site.

After the Navy — the operator's work

The school ends in 2011. The work continues.

What followed isn't a career narrative. It's three operational contexts, each one different in surface, the same in shape: someone has to make the system actually run.

I've spent the years since the Navy doing operations and systems work — sometimes as an outside consultant, sometimes inside the seat. The chapters below are the three biggest. Different industries, different rules of engagement, the same underlying craft.

Operations work across regulated industries — cashflow, digitization, audit-survivable systems.

After the Navy I spent years doing operations and systems work for businesses in regulated industries — healthcare, finance, government contracting, and others where audit pressure is constant and the cost of getting it wrong is real. The pattern was the same regardless of sector: a business or program was breaking somewhere. Cashflow was unpredictable. Records were unreliable. The workflow that worked at twenty staff didn't work at fifty. The digitization someone promised three years ago never landed. I came in to organize the operational chaos, raise the cashflow underneath it, and lift enterprise valuation by leaving a clean digital operating system behind.

That work is industry-agnostic. The mechanics differ — a healthcare practice's cash cycle is not a federal contractor's pay cycle — but the underlying question is identical: where does the system leak, and what is the cheapest change that closes the leak without breaking everything downstream? The Navy taught me to read records as ground truth. That same reading carried over to balance sheets, AR aging schedules, contract pipelines, and compliance audits.

The lesson I carried out: an operator without a financial mental model can't fix an operational problem, because the problem is usually upstream. An operator without an operational mental model can't fix a financial problem, for the same reason. The two have to be one person, or two people in the same room. That's the seat I learned to sit in.

Regulatory licensing — high-stakes applications in audited industries.

At one point I built and ran a regulatory-licensing practice that helped operators win high-stakes regulatory approvals — the kind where applications are scored, the field is competitive, and most applicants don't get through. The work involves preparing operators to be auditable from day one: licensing applications that pass scrutiny, operational plans that stand up to regulator interrogation, the records architecture that makes year-two and year-three renewals possible.

Over a five-plus-year run the practice won more than twenty approvals across five states. In one award round the practice was selected from a field of fifty-one applicants for one of five available licenses — a months-long process that produced a written record longer than most book manuscripts. The wins look mundane on the surface; a license is a license. The substance is in the system underneath: the way the application is built is the way the operation will actually run.

What this work taught me — and what I now bring to AI work — is that regulators are the most disciplined readers in business. They are paid to find inconsistencies. An application that survives a regulator's review has audit-survivability baked into the structure. AI integration is the same shape: the audit trail is the design constraint, not a post-hoc add-on.

Currently — operations for a publicly-funded workforce program.

I currently lead operations for a publicly-funded workforce program. Adult learners, training partnerships, multi-county footprint, multi-million-dollar budget. The mission is workforce development in a sector going through real labor-market change. The operational reality is what every honest operator knows: a program is only as durable as the system the people who actually run it can operate.

The role sits at the intersection of program design, financial stewardship, contract management, and the daily operational craft of moving adults through training and into work. Programs I help run have moved more than nine million dollars in public funds through training contracts, learner stipends, and provider reimbursements over recent fiscal years. Most of that money is invisible to the public — it lives in contract files, provider invoices, reimbursement schedules, procurement compliance. The visible part is the people: every trained graduate is the metric.

The lesson from this seat: workforce work is reading-and-writing work, mostly. Intake notes, case management, provider reporting, learner outcomes, federal compliance, state compliance. The work doesn't tolerate the kind of mistakes that get a program defunded. It does reward the operator who can hold the program and the system at once.

In practice

Operational work delivered with AI tools at organizational scale — across both day-job and Serranix contexts. Names and identifiers are generic where they need to be; the substance is real. Click any item to expand.

A branded reference document with embedded design signatures.

AI-assisted authoring of a comprehensive brand reference — written rules paired with scripted image embedding — with eight design signatures each documented with a captioned visual and a use-case rule. Co-branded across two partner organizations and used by internal staff, project leads, and outside design vendors. One canonical artifact replaced repeated verbal explanation of brand rules; the rework loop where vendor output had to be sent back for brand corrections ended. Pattern: pair every written rule with a captioned visual in the same document. Designers and non-designers work from one reference instead of arguing over interpretations.

An audit-grade daily activity log replacing narrative timesheets.

A multi-board project-management system with append-only logging, evidence-derived entries pulled from multiple signal sources, conservative undercount logic, pay-period-close sealing. A daily operational system supporting weekly time certification, runs continuously across pay periods. Time entries reconstruct from primary evidence rather than memory; certification rests on an auditable trail. Pattern: cross-referenced primary evidence beats narrative self-report. Agent surfaces and drafts; human verifies and certifies. Keep the human at sign-off, not at reconstruction.

An internal leadership and reporting cadence framework.

A full-length leadership reference establishing a three-layer cadence — weekly check-ins, end-of-week brief, monthly step-back — plus an exception channel, templates, and rollout scripts for new direct reports. A standing operating reference for a small management team that binds the manager as much as the reports. Replaced ad-hoc one-on-ones with a documented two-way cadence; onboarding for new direct reports stays consistent regardless of join date. Pattern: reporting that flows one direction dies. The document only works because it specifies what the manager owes back, not just what reports owe up.

Standing build-rules wired into the AI coding environment.

A 21-rule operating standard covering architecture hygiene, production reliability, and AI prompting practice — distilled from postmortems on prior projects, wired into the standing instructions file so every coding session inherits the rules and a paste-in preamble. Operates across multiple code projects in the practice; repo-portable and survives across contractors and AI sessions. Lessons learned aren't re-discovered each project; AI tools start each session with the institutional memory baked in. Pattern: "AI is a junior dev with no memory" only stops being a problem when the rules and a current architecture doc sit in the repo root, not in the operator's head.

A pre-scored opportunity catalog packaged for executive review.

A master catalog of roughly two dozen opportunities scored across three dimensions — mission fit, audience reach, political risk — packaged in two views: a top-five priority recommendation and the full menu for completeness. Drives annual scope-of-work conversations with senior decision-makers. Conversations moved from open brainstorm to bounded selection; review lands in one cycle instead of three. Pattern: if a decision-maker reliably picks one from five, give them five well-scored options plus the backup menu — and surface the scoring logic so the recommendation is defensible, not just intuitive.

A sequenced meeting prep protocol logged as discrete work blocks.

A five-block prep workflow — pick priorities, format menu, research adjacent items, draft cover doc, polish and send — captured as discrete action items in the PM platform with linked source documents. Standard protocol for prep on stakeholder meetings, replacing individual scrambles with a sequenced sub-three-hour run. Cognitive work that used to live in the operator's head became visible on the work surface; a colleague could pick up prep mid-stream from the PM board alone. Pattern: turn ad-hoc thinking into discrete logged blocks. The work surface becomes the protocol — and the protocol survives a sick day, a transition, or a context switch.

A 24-hour rebuild for an existing client.

A community leader lost their website with no recoverable backup. Within twenty-four hours, a fully operational replacement was up — domain configured, content migrated, payment processing live — using an integrated AI stack to compress what would have been a one-to-two-week build into a day. Pattern: AI doesn't make humans smarter; it removes latency between deciding and doing. The compression is the value when there are people depending on you.

A multi-tenant client platform with AI-led intake.

A Supabase backend with row-level security across the data layer, Stripe-driven checkout, SMS-and-email notifications, Vercel deployment, and an AI-led conversational front door for prospect qualification — designed to support a small portfolio of community-rooted client engagements without each project requiring its own infrastructure. Pattern: build the multi-tenant architecture before you have multiple tenants. Adding tenants later is easy; refactoring single-tenant code into multi-tenant is not.

A self-audit practice applied to my own codebase.

Used AI to audit my own work against my own published rule set; found roughly half the rules failing, documented the gaps, ran a remediation sprint. The audit itself took five minutes; the fixing took weeks. Pattern: a rule you wrote down and a rule you actually follow are different things. Audit periodically — the gap is where the work lives.

The book — a vendor-agnostic AI policy manual.

A thirteen-chapter reference for community-rooted organizations writing their first internal AI policy. Covers the avoid-constrain-unify philosophy, personal-AI policy, enterprise AI governance, sample policy language adaptable into any organization's actual policy document, and rollout mechanics. Free, ungated, intentionally vendor-agnostic so the framework outlasts specific product cycles. Drafted from postmortems, audits, and field practice — not from theory. Pattern: write the resource you wished existed when you started, then give it away. The work compounds when others borrow it.

A mechanism for assembling a book from drafts.

Custom build pipeline that reads markdown drafts from versioned files, converts each to rendered HTML with auto-linkified cross-references between essays and chapters, embeds inline SVG visuals, applies a design system end-to-end, and outputs a single self-contained, persistent reading interface. The pipeline is portable, language-agnostic for content, and rebuilds in seconds when the underlying drafts change. Pattern: tooling beats a CMS for content systems where the content is the product. Build the pipeline once; iterate the content forever.

A practitioner essay archive.

Sixteen long-form essays documenting AI integration practice from the operator's seat — field reports on what shipped and what broke, frameworks for adopting AI at scale, tools the practitioner uses and why, transparent cost tracking, philosophical positions on environmental policy and engagement responsibility, and a methodology piece on translating a military service record into a civilian-readable, citable bio. Each essay grounded in specific work that actually happened. Pattern: document the practice, not the philosophy. The philosophy emerges from documented practice; rarely the other way around.

A working memory architecture for AI sessions.

A file-system-based memory layer for AI: feedback memories (corrections to internalize), project memories (current work state), reference memories (pointers to where information lives), user memories (who you are and how you operate) — all stored as markdown files indexed by a single index document, read at the start of each AI session. Portable across AI providers, survives across model versions, treats AI memory as institutional knowledge rather than magic. Pattern: AI's amnesia is solved by file structure, not by model upgrades. The architecture is the asset.

A persistent design-system library.

Color palette, typography stack, layout patterns, and an SVG visual component library — used across the manual, the essay archive, the homepage, and every detail page on this site. Single source of truth for visual identity; changes propagate through the build pipeline; consistent reading experience regardless of which artifact a reader lands on. Pattern: design systems are infrastructure that compounds. The cost is upfront; the savings are continuous.

Real-time cost-tracking discipline for AI tooling.

Every voice clip, image generation, and inference tracked in real time during working sessions, surfaced as running tallies. Provider rates verified against current pricing; per-task cost made visible during the work, not after. Pattern: cost transparency at the operator level changes which tools survive past month one. Monitoring is the discipline. The published cost-receipt essay extends the same discipline to public accountability.

A trilogy of AI surfaces working as one stack — Cowork, Claude Code, and Claude in Chrome.

Three distinct AI surfaces, each strong in a different layer of the work — Cowork for orchestration, planning, and file/task management; Claude Code for hands-on code authorship and repo work; Claude in Chrome for browser automation across admin consoles where no API exists. Each is independently useful; the multiplication effect comes from running them in concert. Cowork plans and delegates; Claude Code implements; Claude in Chrome handles whatever has to happen inside a browser-based UI. Connector access shared across all three through MCP, so any one of them can call any installed integration. Pattern: AI integration isn't one tool; it's a stack. The compounding wins come from the seams between tools, not from any single tool's depth.

A connected MCP working environment integrating more than a dozen external services.

Live MCP integrations to Stripe (payments, products, webhooks), Supabase (multi-tenant database with row-level security, edge functions, migrations), Vercel (deployments, runtime logs, preview environments), Monday.com (project management as the structured log of work), Google Calendar (scheduling, conflict checks), Gmail (threads, drafts, labels), Google Drive (document storage and shared search), Box (file management), Twilio (SMS/voice provisioning), ElevenLabs (text-to-speech and voice agents), Higgsfield (visual asset generation for the cinematographer pipeline), and the Anthropic MCP registry itself. Each integration tested in production before being relied on. Pattern: a connected AI is not the same tool as a chat AI. The value of each connector is multiplied by its availability inside the working session — where decisions get made — instead of being a context-switch out to a separate tab.

A multi-vendor production stack — Stripe, Supabase, Vercel, Twilio — wired into one application.

Full integration of payment processing, multi-tenant database, frontend deployment, and SMS/voice provisioning — six vendors behaving like one application. Stripe webhooks emit kickoff, tier-slug, and PM-deposit events that downstream agents consume to set up project workspaces automatically. Supabase row-level security enforces per-tenant data isolation across thirty-plus tables. Vercel handles rolling deploys with preview environments per pull request. Twilio provides voice and SMS surfaces (with regulated A2P 10DLC registration where messaging is involved). All of it observable, all of it under audit-style logging. Pattern: the production stack is multiple vendors that have to behave like one application. AI accelerates the integration work; it doesn't replace the integration thinking. The thinking is still where the architecture lives.

Connector-driven workflow automation — agents that act, not just answer.

Specific automations running across the connected stack: a PM-Agent edge function that listens for Stripe-webhook events and drafts a structured project workspace inside Monday.com on receipt of a new payment; a service-record translation pipeline turning scanned documents into citable bios; a build pipeline that turns versioned markdown drafts into a self-contained reading interface with cross-references and inline visuals; a memory architecture that survives across AI sessions, providers, and model upgrades. Pattern: an agent that only answers questions is a search box; an agent that takes action is operational infrastructure. The work shifts from prompting to wiring once the connectors are in place.

CONTACT

Tell me what you're trying to change.

Most of what I have to share lives on this site — essays, a manual, working tools. If you've looked through those and your situation calls for direct conversation, this is the form. I occasionally take on a small number of coaching engagements when there's a clear fit. I respond personally.

OR

Or ask the Librarian — the bot at the bottom right of every page. It can find a specific essay, chapter, or tool, and point you toward whatever's most relevant to what you're working through.

← Back to Insights
~1100 words

I'm Not Going Back to School. AI Is My OJT.

I told my AI that today, halfway through rebuilding my company's website. The AI asked if I wanted it to slow down and explain something more carefully. I did want that. But I didn't want a course. I wanted exactly what I needed, exactly when I needed it, while I was doing the actual work — and then I wanted to keep going.

I'm a teacher first. I run a publicly-funded workforce program where the whole premise is that adults learn best on the job. We embed people in real work, with someone next to them who knows the trade. They learn by doing. So I should have noticed sooner that AI was doing exactly that for me — I'm a teacher who's been quietly running my own apprenticeship for months.

What my students get from a journeyman next to them is what I get from AI: a real-time, no-curriculum, "ask me anything about whatever you're touching" partner. Most software that claims to teach me anything assumes otherwise. Most software wants to take me through a curriculum. I'm not 19. I'm not in a classroom. I have a business to run.

A specific moment

Today I had to decide whether to rebuild my website in TypeScript or JavaScript. I didn't know what TypeScript was when I sat down. By the time I made the call, I knew enough to make it well — and the only thing I'd "studied" was the actual question I had to answer.

Here's how it went. AI explained, in plain English, why TypeScript exists at all. (A typo like client.naem instead of client.name won't be caught in JavaScript until a real user trips it; TypeScript catches it in your editor before you save.) It used a recipe-with-unlabeled-jars analogy I'm not going to repeat here, but it landed. It showed me a tiny code example of the same bug in both languages. It told me what TypeScript would cost me — more verbose code, a learning curve. It told me what it would gain me — refactoring is much safer, AI agents make fewer mistakes, every database column gets autocomplete.

Then it laid out three reasons specific to my situation that pushed toward TypeScript: I was rebuilding anyway, I have 30 database tables and Supabase types are TypeScript's biggest payoff, and I use AI as my primary execution agent (which works much better in TypeScript codebases). It gave me its recommendation, with calibrated confidence, plus the dissenting argument it would respect if I went the other way.

I made the call. TypeScript. We moved on.

What did NOT happen: a TypeScript course. I didn't watch a video. I didn't read documentation. I didn't take a quiz. I learned exactly enough to make one decision, and I learned it inside the act of making the decision. The whole interaction took about ten minutes.

That's OJT. That's how my workforce program teaches people to install solar panels and balance commercial books and run regulated processes. Real task in front of you. Real expert next to you. Learning happens because you're doing.

Another moment, same shape

Earlier this week I read a Reddit post about AI codebases rotting — the author started fast, kept shipping, and six months later had three different ways to do the same thing, duplicate functions, conflicting auth paths, and no idea how anything worked. My own codebase was getting pretty shitty. I'd seen the rot setting in. I drafted a "Build Bible" — twenty-one rules an AI session has to follow when working in my codebase — and I built it from my phone over an hour, with AI as my drafting partner. It now pastes into every coding session I run. The codebase has a real spine.

That whole interaction — from "this is a problem" to "I have rules my AI now follows" — took ninety minutes. A course on AI codebase governance would have taken weeks. The course would have given me other people's rules, generic across other people's projects. The OJT gave me my rules, calibrated to my actual codebase, written in my actual voice.

The Bible isn't perfect. I audited it today, against my own codebase, with AI as the auditor. Eleven of twenty-one rules were failing. Some of them were shitty rules I should have caught. That's a different essay. The point: I'm now in a position to fix them, because the rules exist and I understand them. A course can't put me in that position. Only the work can.

Why this works for non-coders specifically

Most coding tools and most coding education assume the user is the executor. You're going to be writing the code. So they teach you the syntax, the libraries, the patterns. That model works if writing code is your job.

It isn't mine. I run a business. The thing I need to do well is decide what to build, who to build it for, and whether the result is good enough. The actual writing of code is a job I delegate.

That inverts the usual setup, and it took me a while to see it. AI as teacher AND executor; me as decider. The teacher explains what I need to know to make the next call. The executor does the actual writing. I sit between them with the responsibility for the outcome.

Three roles: AI teacher, you the decider, AI executor An architecture diagram showing the OJT split — AI explains, you decide, AI executes. ROLE 1 AI as Teacher Explains exactly what you need, in the moment. CENTER OF GRAVITY You — the Decider Owns the outcome. Outsources typing, never judgment. ROLE 2 AI as Executor Writes the code, runs the commands, ships. explains directs Most coding tools assume the user is the executor. This inverts the assumption.

This is exactly the role a small business owner plays everywhere else. You don't pour the concrete; you decide what foundation you want, you hire the crew, you check the work. AI lets that pattern apply to software, which has always been the one place small business owners had to either learn to code themselves or surrender control to whoever they hired. The third option — non-coder as architect, AI as executor and teacher — wasn't available before. It is now.

What's hard about it

This isn't free. The OJT model only works if you're willing to be honest about what you don't know.

When AI starts using a word I haven't heard before, I have to stop and say "what's that?" Every time. Otherwise I'm pretending to understand and the next decision is bad. When I summarize what I just learned and AI corrects me, I have to actually update my mental model — not just nod along. When something doesn't make sense, I have to say so, even if it sounds basic. The thing that makes OJT work in my workforce program is that the journeyman keeps checking. The thing that makes it work for me is the same: my AI checks. I have to let it.

I also have to keep showing up. The course assumes you'll learn whether you want to or not because you've paid for it and you're in the room. The OJT assumes you'll learn because the work won't get done otherwise. There's no auto-pilot. If I disengage, the architecture rots. If I engage, the architecture compounds.

That's a fair trade.

What I'm not going back to

I'm not going back to school. I'm not going to start. I learned more in one Saturday session, with AI as my teacher and my company's website as the worksite, than I'd have learned in a six-week course. The course assumes a clean separation between learning and doing. There isn't one. There never was, for adults running real things. AI, finally, doesn't assume otherwise.

If you run a real operation — a business, a nonprofit, a program, a department, anything with people depending on it — and you've been waiting for a course on AI to drop before you start using it, don't. The course is about other people's operations. Yours is the only one you should be learning from anyway. Get a real AI partner, give it the work, and learn the way adults learn best in any apprenticeship: by doing the thing, with someone capable next to you, until what was foreign is part of how you operate.

I'm not 19. I'm not in a classroom. I have a business to run. So do you. The teacher is here. Class is in.

TOOL — PDF · ready to use
01-ojt-starter-prompt.pdf
Show contents
# OJT Starter Prompt

Paste this at the start of any new AI working session where you want the AI to act as your on-the-job trainer rather than as a black-box executor.

---

## Prompt

```
You are my on-the-job trainer for this work, not a black-box executor.

The roles are:
1. You are the teacher. Before you do anything, explain what we're about to do and why, in plain language. Surface trade-offs and the alternatives you considered.
2. I am the decider. I own every choice. Wait for me to confirm before you execute.
3. You are the executor. After I decide, you execute and walk me through what changed.

Specifically:
- If you're about to write code, edit a file, or take an action, pause first and explain.
- If a decision has more than one reasonable answer, present the options and your recommendation. Don't pick silently.
- If you're uncertain, say so. Calibrate, don't bluff.
- After every meaningful action, summarize what changed and why so I'm building a mental model alongside the work.

This is how I learn. Don't skip the teaching step.
```

---

## When to use it

- Any session where the AI is doing technical work you don't already know
- Any session where you want to walk away understanding the system better
- Any handoff where the next operator needs to inherit the reasoning, not just the code

## When to skip it

- Maintenance tasks where you already know the work cold
- Time-pressed cleanup where teaching slows you down (note: those are the sessions where AI mistakes hide)

---

Companion essay: *OJT With AI: How a Non-Coder Stays the Architect*
← Back to Insights
~1500 words

The Bible Method: How to Stop AI from Going Off the Rails

I read a Reddit post around midnight that ruined my sleep. The author had been shipping fast for six months with AI as their primary coding partner. They'd been killing it, until they opened their own repo cold and couldn't tell what any folder did. There were three different ways the app handled the same thing. Two competing auth systems. Functions that looked like duplicates of each other but were just different enough to break if you merged them. The author admitted it was their fault — they'd let AI ship without rules. The codebase had compounded into a mess they couldn't unwind.

I felt that one. I'd been watching my own project drift the same way. My own codebase was getting pretty shitty. I'd seen the rot setting in.

So I drafted a Bible.

Twenty-one rules. They paste into every coding session I run. The AI now lives by them — when I prompt it, it reads the Bible first, audits its plan against the rules, and tells me when something I'm asking for would break a rule. The codebase has a real spine.

Here's how it works.

What rotting actually looks like

If you're using AI to ship code and you don't have rules, your codebase is rotting. You may not see it yet. Three signs to watch for:

Parallel implementations. Same job, different functions. AI doesn't know what already exists in your repo unless you tell it; it'll happily write a new "fetch user data" function while three already exist, each slightly different. Six months in, your code has dialects.

Conflicting auth. Most apps need login. AI will help you build it. If you later ask for "another way to log users in" — biometrics, magic links, social — AI builds it next to the existing one rather than refactoring. Now you have two paths, one is wrong, and the divergence rots quietly until something breaks at the worst time.

Scattered queries. Your database has a shape. Every part of your code that needs data should ask through one layer that knows the shape. If components reach into the database directly, every schema change becomes a multi-file rewrite. AI will let you do this for years before anything visibly breaks.

I had all three. AI didn't cause them. AI accelerated them. Without rules, AI ships features fast and rots structure faster.

What the Bible actually is

A markdown file. Lives in the root of the repo. Twenty-one rules, each a paragraph, each explaining the rot it prevents and how to follow the rule.

The whole thing fits in a single screen. I paste it into every coding session at the top of my prompt. The AI reads it before writing a line of code. If my request would break a rule, the AI tells me first.

That last sentence is the trick. The Bible doesn't make AI smarter. It makes AI accountable. The AI was always capable of seeing rot coming; what was missing was the framework asking it to.

The other half of the system is a companion document called ARCHITECTURE.md. One page. Names the auth pattern, the database shape, the layer architecture, why we made each choice. Together with the Bible, those two files give every AI session enough context to operate inside the rules instead of inventing new ones.

Four rules that matter most

I won't list all 21. Most are common sense once you see the rot they prevent. Four matter more than the others.

One source of truth per concern. Before adding anything, search the codebase for an existing implementation. If one exists, extend or refactor it. Never create a parallel version. This is the single rule that prevents the most rot. AI defaults to creating; this rule forces it to look first.

One auth pattern per app, picked before any user-facing code is written. First decision in any new project: which auth, where it lives, where session is stored. Document the choice. If you ever want to add a second auth path, that's a refactor signal, not an addition signal — stop and consolidate. I had two auth paths competing in the same app for months before I realized they were fighting each other. The cost was real.

Authorization is enforced at the database, not the application. The biggest production-disaster rule. Most apps trust the front-end to filter data — show user A only their own rows. That's a footgun. A clever person calls your API directly, bypasses the front-end, and reads everyone's rows. The fix is to enforce filtering inside the database itself, so even raw API calls can't see what they shouldn't. Setting this up takes an afternoon and prevents the kind of breach that ends a small business.

AI is a junior dev who arrives with no memory every session. Every AI session starts fresh. It hasn't read the rest of your code. It doesn't know what already exists. Treat it that way. Tell it explicitly: read these files first, search for existing implementations, then propose what you'd do. Don't just say "build me X" and let it invent. The Reddit poster's failure was treating AI like a senior dev who remembered last week's session. AI doesn't.

How I use it in practice

The paste-at-the-top pattern is the whole interaction. Every session starts with:

You're working in a codebase governed by BUILD_BIBLE.md and ARCHITECTURE.md. Before writing any code, do two things. First, read both documents. Second, search the codebase for existing implementations of what I'm asking for and tell me what you found. If something exists, your default is to extend it, not duplicate it. Stay inside the layer architecture. UI doesn't import data libraries. Components don't call the database. All data goes through services. If my request would violate any rule, tell me first.

That paragraph is the system. The rules are the constraints. The two together are the entire framework.

In practice, AI catches itself maybe 30 percent of the time before it ships a violation. The other 70 percent, I catch it during review and redirect. Over weeks, the AI drifts toward compliance because I keep correcting. The codebase compounds in the right direction.

My own audit

I audited the Bible today. Against my own codebase. With AI as the auditor.

Eleven of twenty-one rules were failing. Some of them were shitty rules I should have caught — too vague, too aspirational, too many words for the wrong thing. Most were real failures: my data layer wasn't enforcing the "all queries through services" rule, my environment variables were hardcoded across seven files, I never started the migrations directory. The audit took five minutes. The fixing won't.

My Bible audit — 11 of 21 rules failing Audit results breakdown: 11 failing, 2 passing, 1 not applicable, 7 unknown. MY BIBLE AUDIT — TODAY 11 / 21 rules failing on my own codebase 11 FAILING 2 PASSING 1 NOT APPLICABLE 7 UNKNOWN The seven unknowns need database access to verify. The eleven failures are real.

But I'm now in a position to fix them, because I can name what's wrong. Before the Bible, I just had a vague feeling that something was off. After the Bible, I have a list. The list has gaps, the failures are uncomfortable to read, and I'm grateful both exist.

That's the trade. You write rules even when you don't know yet which ones are right. You discover which were right by auditing. You revise. You audit again. The framework gets sharper. The codebase gets cleaner. Six months later you have rules you didn't have before and a codebase that doesn't make you wince when you open it.

The Bible isn't mine

I should be honest about this. I didn't invent the Bible Method. I read a Reddit post by a developer who'd just survived their own codebase rot. They'd named the rules. I copied the structure, adapted it to my own stack, added a few rules about AI-specific failures, and made it mine.

That's how this stuff spreads. Someone gets burned, writes the rules they wish they'd had, posts them. Someone else reads it, sees their own rot, adapts the rules for their codebase. The Bible compounds across the people willing to share what broke.

Mine is at the root of my repo. If you're building anything serious with AI, write your own version. Adapt the structure. Add your own rules from your own scars. Paste it into every session. Audit it monthly. Update when you find a rule that doesn't fit.

The codebase rot is real and it's coming for everyone shipping with AI right now. The Bible is the cheapest insurance you can write. Mine took ninety minutes from my phone, late on a Tuesday night, with AI as my drafting partner. It's saved me weeks already.

It's working.

TOOL — PDF · ready to use
02-build-bible-starter.pdf
Show contents
# BUILD_BIBLE.md — Starter Template

Drop this file into the root of any AI-assisted codebase. The 21 rules are the operating standard. Edit them to match your project. Once present, every AI coding session inherits the rules — you're not re-teaching them every time.

---

# BUILD_BIBLE — {{Project Name}}

The operating standard for this codebase. AI sessions read this file at the start of every coding task. Humans review it at every architectural decision.

## I. Architecture

1. **One source of truth per data type.** Every entity has exactly one canonical home. No silent duplication across services.
2. **Boundaries are explicit.** Modules, services, and components have named, documented interfaces. No leaky abstractions.
3. **Single-responsibility wins ties.** When in doubt, split. Easier to merge later than to extract.
4. **No new dependencies without a written reason.** Every dependency added to package.json has a one-line note explaining why.

## II. Data

5. **All schemas live in version control.** Migrations only — no manual schema changes.
6. **Row-level security on every multi-tenant table.** No exceptions.
7. **All money is integers, in cents.** Never floats.
8. **Time zones explicit.** UTC at rest; local at the edge.

## III. Production reliability

9. **Tests run before deploy. Always.** No "skip CI" commits to main.
10. **Errors are caught, classified, and surfaced.** No silent failures. No swallowed exceptions.
11. **Logging is structured.** JSON, consistent fields, queryable.
12. **Every endpoint has rate-limiting.** Every webhook has signature verification.

## IV. AI prompting

13. **Every coding session starts by reading this file.** Non-negotiable.
14. **Briefs to AI are written, not spoken.** Symptom + constraint + direction. No multi-step prescriptions for problems we haven't yet diagnosed.
15. **AI never deploys to production directly.** Human review on every deploy.
16. **AI respects existing patterns.** New code matches the file it sits next to. New patterns require explicit human approval.

## V. Observability

17. **Production has dashboards before launch.** Not after.
18. **Every error path has a runbook.** Even a one-paragraph runbook beats nothing.
19. **Cost is tracked at the operator level.** Per-task cost surfacing is part of the build.

## VI. People

20. **Code is written for the next operator.** That includes the AI six months from now.
21. **Postmortems are written. Every one.** Even small ones. Documented learning is institutional knowledge.

---

## Custom rules for this project

(Add project-specific rules below. Keep the count low — rules that aren't enforced are noise.)

22. _______________
23. _______________

---

## Paste-in preamble for AI sessions

Use this at the start of every coding session:

```
Before writing any code, read /BUILD_BIBLE.md and confirm you're operating within the standard. If anything you're about to do conflicts with a rule, surface it before proceeding.
```

---

Companion essay: *The Bible Method: 21 Rules for AI-Assisted Codebases*
← Back to Insights
~1300 words

Why Your AI Doesn't Remember You — and How to Fix It

I told my AI three weeks ago that I prefer terse responses with no trailing summaries. Two days later, a different session ended a response with "To summarize: I just helped you do X, Y, and Z." It had no memory of being told. Same model. Different session. Stranger.

That's the default. AI sessions are amnesiac. Yesterday's hard-won corrections vanish at midnight. Every conversation starts from zero. The model is the same; it doesn't know you are.

The solution isn't a model upgrade. It's a file structure. I built one over the last few months and I'll show you exactly what's in it.

The amnesia problem, made specific

Three things AI forgets every session by default:

Who you are. Your role, your skill level, what you're trying to build. If you don't tell it every time, it'll guess from your prompt — and the guess gets worse the more nuanced your work is.

What you've already corrected. "Don't suggest tests for this; we have integration tests elsewhere." You said that yesterday. Today's session has never heard it. You'll say it again. And again. And again.

Decisions you've already made. Your project uses TypeScript. The auth system is Supabase. The deployment is Vercel. AI doesn't know any of that without your prompt re-establishing it. So either your prompt becomes a wall of context every session, or AI re-asks questions you've already answered.

The cumulative cost is real. I was rewriting context paragraphs at the top of every prompt for months before I realized the problem wasn't AI — it was my workflow.

What memory actually is

There's no magic. AI memory is a folder of markdown files the AI reads at session startup. That's it.

The folder has an index file (MEMORY.md) that lists every other file with a one-line description. When a session starts, AI scans the index, decides which files are relevant, and reads them. The information becomes context for the rest of the conversation.

Each individual file holds one specific piece of memory — one rule, one fact, one preference. Files are named semantically (feedback_terse_responses.md, project_stack_decisions.md) so AI can scan the index and know what's worth reading.

This is a file system. You can edit it, delete from it, rearrange it. AI can't unless you let it. You're in charge.

How AI memory actually works — the file structure The MEMORY.md index points to four types of memory files. AI sessions read these at startup. How AI memory actually works MEMORY.md the index — one line per memory file USER Who you are role, goals, what you build FEEDBACK Corrections "don't do X" "do Y instead" PROJECT Live status decisions made, work in flight REFERENCE Where info lives pointers to other systems New AI session starts reads the index, then the relevant files. You're known again. A folder of markdown files. No magic. You're in charge.

The four kinds of memory worth keeping

After months of trial and error, I converged on four types:

User memories. Who you are. Your role, your goals, what you're working on broadly. Written once, updated rarely. "I'm a solo founder building integrations for small businesses. Day job is in workforce development."

Feedback memories. Corrections AI should remember. "Don't end responses with summary blocks." "Use full file paths in PowerShell commands." "When I curse, it's for emphasis only — never frustration." Each one captures a specific correction so I never have to give it twice.

Project memories. Status of ongoing work. "The website rebuild starts with TypeScript and Next.js. Supabase stays. 30 tables, schema baselined." Living documents — they update as work moves.

Reference memories. Where information lives. "Stripe IDs for the tiers are in reference_stripe_ids.md." Not opinions, just pointers. Saves AI from inventing values when it could look them up.

How to write a memory that actually works

The format that finally clicked for me has three parts: the rule, the why, and the how-to-apply.

The rule is what AI should do. Specific. Short. Imperative.

"End responses at the last substantive sentence. No summary block, no restating what you just did, no asking if I want anything else."

The why is the reason. Without this, AI follows the rule literally but misses the spirit.

"I read every response. I know what you just did because I watched you do it. Restating it adds nothing and makes responses longer to scan."

The how-to-apply is the trigger. When does this rule activate?

"On any response longer than three sentences. Stop when the actual answer or action ends. No 'to summarize,' no 'I've now done X, Y, Z,' no 'let me know if anything else is needed.'"

That's the whole structure. Three sections. No more.

Vague rules don't compound. "Be more concise" is useless — AI was already trying to be concise; you just disagreed about what concise meant. "Strip every trailing summary. End at the last substantive sentence" is enforceable.

The compounding effect

Here's what surprised me. After two months of writing memories, my AI sessions started feeling fundamentally different.

I'd open a new chat. AI would read the memory folder. Within the first response, it was already calibrated to my voice, knew my project, anticipated my preferences. The wall-of-context-at-the-top-of-every-prompt was gone. I could just start working.

The codebase compounds when you write rules. Your collaboration with AI compounds when you write memories. It's the same shape — a small upfront cost, paid back forever.

It's compounding now. Three weeks ago I was repeating myself five times a day. Today I write a new memory once, AI absorbs it, and the next session inherits it without me lifting a finger.

What's hard

Writing memories takes self-awareness you might not have yet. You have to notice the moment you're correcting AI and realize this is a correction worth saving. Most corrections feel small in the moment. The compounding only happens if you save them anyway.

I missed dozens of opportunities in the first month. I'd correct AI mid-conversation, move on, never write it down. Two weeks later I'd give the same correction again and feel that tiny zap of déjà vu. That zap is the signal. Save it then.

The other hard part: revising. Sometimes a memory I wrote a month ago turns out to be shitty — too vague, or wrong about how I actually work. AI follows it anyway because I wrote it down, and the wrong rule starts compounding in the wrong direction. The fix is monthly review. Pull the memory folder up, scan everything, delete anything stale, sharpen anything fuzzy.

That review takes maybe twenty minutes. It saves hours every week.

What I'm not going back to

I'm not going back to a workflow where I retype the same context every morning. I'm not going to start. The amnesia model — every session is a stranger, every preference forgotten — was never AI's natural state. It was just the default.

The default is the empty room. You're allowed to put furniture in it.

Build the memory folder. Write a file the first time you find yourself correcting the same thing twice. Index it. Let AI read it at the start of every session. Stop accepting amnesia as a feature.

Your AI doesn't have to keep meeting you for the first time.

TOOL — PDF · ready to use
03-memory-architecture-starter.pdf
Show contents
# Memory Architecture Starter

A file-system-based memory layer for AI that survives across sessions, providers, and model upgrades. Drop this structure into your AI session's working directory and the AI will pick up where the last session left off.

---

## Folder structure

```
memory/
├── MEMORY.md              ← index, always loaded into AI context
├── user_<topic>.md        ← who you are, how you operate
├── feedback_<topic>.md    ← rules learned from corrections
├── project_<topic>.md     ← current state of work
└── reference_<topic>.md   ← pointers to where information lives
```

---

## MEMORY.md (index file)

```markdown
- [User profile](user_profile.md) — Solo operator, Hudson Valley, NY; prefers terse responses
- [Don't reformat existing code](feedback_dont_reformat.md) — Keep diffs minimal
- [Active project: rebuild](project_rebuild.md) — Migrating from Vite to Next.js
- [Stripe IDs](reference_stripe_ids.md) — Product/price IDs for the platform
```

Keep entries one line each. The index is loaded into every session — keep it under 200 lines or the index itself becomes the bottleneck.

---

## Memory file template

```markdown
---
name: {{short title}}
description: {{one-line description used to decide relevance in future sessions}}
type: {{user | feedback | project | reference}}
---

{{body}}

For feedback and project memories, structure as:
- The rule or fact, stated clearly
- **Why:** the reason — usually a past incident or constraint
- **How to apply:** when this should kick in
```

---

## What goes where

**user/** — your role, preferences, workflow patterns, expertise level. Helps AI tailor responses.
**feedback/** — corrections you've made. AI shouldn't make them twice.
**project/** — current work state. Owners, blockers, deadlines, decisions made.
**reference/** — external pointers. Where bugs are tracked, where dashboards live, where keys are stored.

---

## What NOT to put in memory

- Code patterns or conventions (re-derive from the codebase)
- Git history (use `git log`)
- Solved bugs (the fix is in the code; the commit message has the context)
- Sensitive personal info (SSN, addresses, financial accounts) unless you explicitly want it remembered

---

## Bootstrap session prompt

Use this at the start of any AI session that should inherit memory:

```
Read memory/MEMORY.md. For each line in the index, decide whether the linked memory file is relevant to my current task. If yes, read the full file. If no, skip. Do not summarize what you read — just internalize it. Then ask me what we're working on.
```

---

Companion essay: *AI Doesn't Remember You. Build the Architecture That Does.*
← Back to Insights
~1100 words

The Best Way to Learn from AI: Make It Show Its Work

I asked AI to explain how website routing works. It gave me three paragraphs. I read them twice. I still didn't get it.

Then I asked for a diagram. Three boxes appeared on screen with arrows between them. URLs labeled. Color-coded by who can see what. I got it in 30 seconds.

Same AI. Same explanation. Different format. The first didn't land; the second did. I'd been staring at clean prose for ten minutes pretending to follow.

Most people learn this way. AI is finally good at producing material in formats that match. You just have to ask.

The retention gap with text-only AI

AI defaults to prose because that's what its training optimized for. Most documentation is prose. Most chat is prose. So when you ask "how does X work," AI hands you paragraphs.

Paragraphs are fine when the concept is linear — A causes B causes C. They struggle when the concept has shape. A network of three things connected by arrows reads in seconds as an image and minutes as text. The text version forces your brain to do the spatial reconstruction work the diagram already did.

When I noticed myself reading the same paragraph three times, the problem wasn't AI's explanation. The problem was format mismatch. I was making my brain do unnecessary work. Damn — that took me longer to figure out than it should have.

What "show its work" actually means

Three things, depending on the concept:

Diagrams. Anything with structure. Architecture, data flow, decision trees, comparisons. AI can render SVG inline — boxes, arrows, color, layout — in seconds. Most chat tools support this; most users don't ask for it.

Voice narration. For long explainers, having AI read its own explanation while you watch the visual frees your reading-brain. You can listen while doing something else. I built this into my workflow recently — auto-TTS on every response — and the difference for dense material is real.

Interactive widgets. When the concept is best learned by play. AI can build small HTML/JS widgets — a calculator, a quiz, a clickable diagram — in one tool call. These are gold for things like "what does it feel like to use a different routing pattern" or "show me how this would work if I changed this parameter." You poke. You learn. Reading wouldn't get you there.

Three formats AI can show its work in Diagrams, voice, and interactive widgets — three ways AI can teach beyond prose. Three formats AI can show its work in FORMAT 1 Diagrams Best for: anything with structure "Draw me a diagram of this." Free, instant FORMAT 2 Voice Best for: long content you'd rather hear "Read this to me while I work." Costs API credits FORMAT 3 Interactive Best for: concepts learned by play "Make me a widget I can poke." Free, ~30 seconds Default behavior is prose. You have to ask for the rest.

When to use which

Diagrams: any time the concept has visual shape. Most architectural questions, most comparisons, most flows. Default to asking.

Voice: for long-form content you'd rather listen to than read. Especially good while doing other work. Costs API credits, so use it for substance, not chatter.

Interactive widgets: when you need to develop intuition by playing. Hardest to ask for naturally — most people don't realize this is on the menu — but it's the most powerful for tricky concepts.

The honest answer: ask for all three at the same time when the concept is hard. "Explain it, draw it, narrate it." AI can do all three in one response. Your retention triples.

The audience question

Here's where this matters beyond your own learning. If you make content for other people — blog posts, training materials, anything — the same logic applies to your audience.

Most operators in real-world roles are audiovisual learners. Community leaders running businesses, nonprofits, public sector operations — they process visual + audio better than text alone. When I write about AI for them, the audience has a shared profile: deep operational responsibility, time-poor, learns by doing rather than studying. So my essays get diagrams. The diagrams aren't decoration; they carry conceptual weight the prose can't reach. Reading the prose alone gets you part of the way; reading the prose with the picture gets you all the way.

If your content is text-only, you're losing readers who would have understood the same idea with a picture. Many of them don't know that's why they bounced. They'll just feel that "this didn't quite click for me" and move on.

What's hard about this

You have to notice when you're not getting it. That sounds obvious; it isn't. Most people accept that they're partially confused after reading a paragraph and just move on. The signal that you should ask for a different format is the moment you'd otherwise re-read the same section. That moment is the ask.

The other hard part: AI won't volunteer a diagram unless you ask. Default behavior is prose. You have to override it explicitly: "draw me a diagram of this," "show me visually," "narrate this while I read."

Once you build the habit, you stop asking for prose at all when the concept has shape.

What this means for AI as your teacher

The OJT model I described in another essay only works if learning is fast. If every concept takes 20 minutes of reading prose, OJT collapses under its own weight — you can't learn at the speed of the work.

Visual + audio + interactive learning brings concept absorption time down to what real-time work actually allows. Five-minute architectural decision? You can learn enough about the architecture to make it well, in those five minutes, if AI shows you a diagram instead of paragraphs.

That's the unlock. AI as teacher only works fast enough for OJT if you let it teach in the formats your brain prefers.

I'm not 19. My brain has the format preferences it has. AI can match them. So can yours.

TOOL — PDF · ready to use
04-show-its-work-preamble.pdf
Show contents
# "Show Its Work" Preamble

A short prompt addition that forces the AI to surface its reasoning, decisions, and uncertainties before and after each step.

---

## Prompt

```
For this session, narrate your work as you go.

Before each meaningful step:
- State what you're about to do, in one sentence
- State why this approach over the alternatives
- State what could go wrong

After each meaningful step:
- State what changed
- State what you learned
- Flag anything that didn't match your prediction

If you're uncertain about something, say so explicitly. "I'm not sure" beats a confident wrong answer.

Don't skip the narration even if it feels redundant. The narration is the deliverable as much as the work itself.
```

---

## When to use it

- New domains where you're learning alongside the AI
- High-stakes work where you need to audit the reasoning, not just the output
- Teaching others by reading the AI session transcript afterward
- Any session you might want to read back six months from now

## When to skip it

- Trivial mechanical tasks (renaming, formatting)
- When you have full context and just need execution speed

---

## Tip

After a session, save the transcript. The narration becomes its own training material — for you and for whoever inherits the work.

---

Companion essay: *Make AI Show Its Work*
← Back to Insights
~2000 words

I Made AI Audit My Own Code. Here's What Broke.

I'd written something that was complete shit and AI picked up on its scent.

That's not a confession. That's a workflow. Last weekend I asked AI to audit my own codebase against my own rules, and the answer came back — eleven of twenty-one rules failing. The audit took five minutes. I spent the next three hours sitting with what it told me.

This is what that audit looked like, why I'd been avoiding it for months, and what changed afterward.

The setup

Three months ago I wrote a Build Bible — twenty-one rules I wanted my codebase to follow. The Bible covers the obvious stuff: don't write parallel implementations, enforce database authorization at the database, never let AI ship destructive operations unsupervised. I'd written it after reading a Reddit post about AI codebases rotting, and it felt like the right preventive medicine.

I never audited it.

For three months I shipped features. AI helped. The Bible sat in the repo. I assumed I was following the rules because I'd written them down.

Last weekend, in the middle of planning a rebuild, I realized I had no idea if the rules were being followed. I'd never checked. I told AI to audit the codebase against the Bible, file by file, rule by rule. Read-only. Don't fix anything. Just report.

Five minutes later the report was ready.

The results

Eleven failing. Two passing. One not applicable (a rule about dependencies that didn't fire because the codebase has so few). Seven unknown — those needed direct database access I couldn't grant from the audit context.

Failure rate of more than 50 percent on rules I'd written myself for the codebase I'd built. I sat with that for a while.

Before and after the audit — psychological shift Two columns showing the shift from vague unease before the audit to a specific, bounded list of failures after. Before and after the audit BEFORE Vague unease "things felt fragile" "can't name why" "avoid looking under the hood" → compounds into avoidance 5-min audit AFTER Specific list "11 failures, named" "bounded, fixable" "queue of work, not anxiety" → compounds into work Same problem. Different relationship to it.

The four most painful findings

Some of the failures were procedural — things I'd intended to set up but never had. Others were rules I'd written too vaguely to enforce. A few were rules I'd been actively violating without realizing.

The data layer. Rule 4 says all database queries go through a service layer; components never call the database directly. The audit found 84 direct database calls scattered across 6 component files. Every component that talked to data was reaching past the rule.

This isn't a bug in any single feature. It's a structural decision I'd been making 84 separate times without noticing. Each one was small. The pattern was the problem. AI never flagged it because I never asked AI to look.

Environment variables hardcoded. Rule 7 says one config module owns every environment variable, validated at startup. The audit found my Supabase URL hardcoded in seven separate files. The anon key — a secret-ish credential — hardcoded in three.

If I rotate either of those credentials I'm now editing seven files instead of one. Worse, the credentials are in git history. I knew this rule. I'd written it. I'd violated it for months.

Migrations that don't exist. Rule 6 says every database schema change goes through a migration file in version control. My database has 30 tables. None of them have migrations. I created them in the dashboard, in a hurry, week by week, and the schema lives only in the production database. There's no replayable spine. If the database disappeared tomorrow, the schema would too.

This isn't a security issue. It's a continuity issue. I'd been operating without a backup of my own thinking.

The Bible itself was wrong about my codebase. The Bible has a parenthetical naming my project as a cautionary example for the auth-conflict rule. That conflict had been resolved a month earlier — auth was now unified through one system — but the Bible hadn't been updated. Rules drift too if you don't audit them.

Some of the failing rules were shitty rules I should have caught. They were too vague — "the data access layer is sacred" sounds important and means nothing operationally. Others were aspirational — "tests as a refactor net" when I had no tests at all. The Bible needed a pass too.

Why I'd been avoiding it

Three reasons, only one of which I'm proud of.

I'd assumed I was already following the rules. This is the dumbest reason and the most common. You write a rule, you intend to follow it, you assume you do. The assumption is invisible until you check. Most people running AI codebases right now are in this state.

I'd been busy shipping. Each individual feature was urgent. The audit wasn't urgent until something broke. So it kept getting deprioritized in favor of work that produced visible output. Architectural debt is the shape of all "this can wait" decisions stacked on top of each other.

I'd been afraid of what I'd find. This is the honest reason. I knew on some level that the audit would surface uncomfortable things. The codebase was working. The customers I had were happy. Looking under the hood meant facing whatever was rotting there. Easier to keep shipping.

The audit was painful for an hour. The not-auditing was costing me weeks of slow degradation. The trade was always lopsided. I just couldn't see it until I made the trade.

What changed afterward

The biggest shift was psychological. Before the audit, I had a vague sense of unease about my own codebase — things felt fragile, but I couldn't name why. After the audit, I had a list. Specific. Bounded. Fixable.

Vague unease compounds into avoidance. A specific list compounds into work. The same problem, named, becomes tractable.

The audit also gave me a bargaining chip with future-me. The next time I'm tempted to ship a feature without checking architecture, I have a memory of what skipping audits cost me. I don't have to abstract from "this might cause problems someday" to "this exact cost will compound into this exact mess." I have receipts.

Practically: I'm using the audit as the input to a deliberate refactor. Each failing rule becomes a sprint task. The 84 direct database calls are now a "build the service layer" project. The hardcoded credentials are a "build the config module" project. The migrations are a "supabase db pull and commit" project — about thirty minutes of work to get the spine in place.

By the time the rebuild ships, I'll have closed most of the failures. Not because I muscled through them, but because the audit converted unease into a queue.

The reusable prompt

If you want to run this on your own codebase, the prompt is short:

Audit this codebase against BUILD_BIBLE.md and report findings. Read-only. Do not fix anything. For each rule, return PASS, FAIL, N/A, or UNKNOWN with one piece of evidence (file:line or grep result) supporting the verdict. End with a "top 5 most urgent" ranked list.

That's it. Run it monthly. Run it whenever you've shipped a lot fast. Run it before any rebuild scoping. The audit is cheap. The not-auditing is expensive in ways you won't see until they hit you.

What I'm not going back to

I'm not going back to assuming. I'm not going to start. I'd assumed I was following my own rules for three months. Eleven of twenty-one were failing. The gap between "I wrote it down" and "the codebase reflects it" was bigger than I'd guessed.

The cheapest insurance you can buy on an AI-built codebase is the monthly audit against your own rules. Five minutes. AI does the work. You sit with the answer.

I wrote something that was complete shit and AI picked up on its scent. That's the workflow now. Catching it early, before it becomes a thing I have to spend a sprint untangling.

It's working.

TOOL — PDF · ready to use
05-self-audit-prompt.pdf
Show contents
# Self-Audit Prompt Template

Use this to audit your own codebase against your own published rule set. The audit takes minutes; the fixing takes weeks. Both are worth doing.

---

## Prerequisites

- A rule set you've written down (see the BUILD_BIBLE template if you don't have one yet)
- A codebase to audit
- An AI tool with code-reading access to the repo

---

## Prompt

```
Audit this codebase against the rules in /BUILD_BIBLE.md.

For each numbered rule:
1. State the rule
2. Decide: PASS / PARTIAL / FAIL / N/A
3. Cite specific files and line numbers as evidence
4. If FAIL or PARTIAL, write a one-paragraph remediation plan

Output as a markdown table with columns: Rule | Status | Evidence | Remediation.

Do not soften findings. If the rule is failing, say it's failing. The point of the audit is to find the gap.
```

---

## After the audit

1. **Sort the failing rules by impact.** Not all rules are equally load-bearing.
2. **Pick the top three.** Anything more is paralysis.
3. **Schedule a remediation sprint.** Block real time. Don't squeeze it between features.
4. **Re-audit after the sprint.** Confirm the fix held.

---

## Cadence

- Quarterly self-audit minimum
- After every major refactor
- After every postmortem-worthy incident
- Before any major launch

---

## What you'll find (probably)

The rule that sounded easiest to follow when you wrote it down is usually the one that's failing. That's not a moral failing — it's data. Adjust the rule, adjust the system, or both.

---

Companion essay: *I Used AI to Audit My Own Code Against My Own Rules. Half of Them Were Failing.*
← Back to Insights
~1500 words

Why Non-Coders Should Still Be the Architect

I don't write code. I'm rebuilding my company's website right now anyway. The rebuild includes a database migration, a language switch from JavaScript to TypeScript, a framework change to Next.js, and a complete restructure of the routing. None of which I would have known how to do six months ago.

I'm not pretending to be a developer. I'm playing a different role — the architect — and AI is letting that role exist for the first time.

This is an essay about that role and why every small business owner should claim it for their own software.

The default assumption

Most coding tools and most coding education assume the user is the executor. You write the code. So they teach you the syntax, the libraries, the patterns. They optimize for speed at the keyboard.

That model works if your job is to write code. It's the wrong model for an operator whose job is to run something that happens to need software.

I tried the executor model for years. I learned a little JavaScript. I built some pages. The pages worked. Then I needed to add a feature, and the addition broke another feature, and I didn't have the depth to debug it, so I spent three days reading documentation about something I didn't actually want to be an expert in. I shipped some pretty rough decisions during that period because I felt out of my depth and didn't know how to admit it. The opportunity cost was real. While I was "learning to code," I wasn't running my business.

The executor model was a trap for me. I suspect it's a trap for most operators who try it.

The flipped model

Here's what works instead. You sit in the architect role: you decide what to build, who it's for, how the pieces fit, when something is good enough. AI sits in two roles simultaneously — the teacher who explains what you need to know to make decisions well, and the executor who writes the actual code.

Coder vs Architect — same project, different roles Two-column comparison showing what a coder focuses on versus what an architect focuses on, with AI bridging the gap for non-coders. Same project, two roles ROLE 1 Coder Focuses on syntax, libraries, patterns Writes the code Debugs, ships, deploys Optimizes speed at keyboard ROLE 2 Architect (you + AI) Decides what to build Integrates business reality Reviews + approves Owns the outcome Optimizes for the right thing built AI handles the executor work. Architect role becomes accessible without code skills.

The architect role is not a downgrade from the developer role. It's a different role entirely. It's fucking liberating once you let yourself sit in it. A residential architect doesn't pour concrete. A residential architect designs the building, hires the crew, checks the work, makes the calls when reality clashes with the plan. The architect's value isn't in the swing of the hammer; it's in the integration of every constraint into one coherent decision.

Software has always had this role. It's just been collapsed into "developer" because writing the code and designing the system used to require the same person. AI uncollapses it. The architect role can now exist independently, performed by someone who understands the business better than the code.

What it requires from you

Three things. Each is harder than they sound.

Honesty about what you don't know. When AI uses a word you don't recognize, you have to stop and ask. Every time. Pretending to follow is the failure mode that makes the architect role collapse back into mush — you're nodding while AI executes plans you don't actually understand, and the result is decisions that look intentional but aren't.

Willingness to make calls you can't fully validate. As an architect you'll make decisions on incomplete information all the time. "Should this be one service or two?" The answer isn't always knowable in advance. You make the best call with the data you have, watch what happens, and revise. The discomfort of deciding-without-certainty is the price of admission. If you wait for certainty, you'll never decide.

Discipline to teach AI your style. AI starts every session as a stranger. Without explicit memory, it'll keep producing decisions that feel slightly off because it doesn't know your taste yet. The architect maintains the memory. Teaches AI the things that compound. Builds the rules that constrain output to your shape.

The architect role is more cognitively demanding than the executor role, not less. You can't outsource the judgment, only the typing.

What it produces

When the model works, the output is better than what you'd get either as a pure executor or as a hands-off boss.

Decisions that integrate constraints AI doesn't see — the regulatory environment, the customer relationships, what you have time for, how you actually run things. AI is great at proposing options once it knows the constraints; you supply the constraints.

Code that ships faster than if you'd written it. AI types fifty times faster than I do. With clear constraints, that speed is mine without the cost of becoming a developer. The boring stuff — boilerplate, configuration, refactoring — gets handed off entirely.

Architectures that compound. Every decision you make stays made. Every rule you write keeps getting enforced. Every memory you save speeds the next session. The codebase doesn't rot because you're sitting in the role responsible for it not rotting — even if you never touch a line of code.

The objection

The most common pushback: "but you can't really know what you're doing if you don't code."

It's a fair worry. There are real classes of problem an architect needs to recognize that only show up at the keyboard — performance issues, race conditions, security details. The non-coder architect has to compensate by knowing what they don't know, asking explicitly, and trusting AI's flags.

In practice, what I've found is that the depth of judgment required for "should this be built at all," "is this the right shape," "is the result good enough" — those questions don't come from coding skill. They come from understanding the business. Coders who can write but don't understand the business produce technically-correct software that solves the wrong problem. Architects who understand the business but don't code, working with AI, produce software that solves the right problem the right way.

The risk isn't fake. But the alternatives are worse. If you can't code and you can't architect, you're at the mercy of whoever you hire. Most small business owners are there now and don't know it.

What's hard

Two things keep tripping me up:

I have to keep showing up. The architect role only works when I'm engaged. If I disengage, AI defaults back to its training assumptions about what a generic project should look like — which is fine, but generic. The compounding only happens because I keep making the calls.

I have to keep teaching. Every preference I have, every constraint that's specific to my project, every decision I've already made — those need to be written down or AI keeps re-discovering them poorly. The teaching IS the architecture work. Skipping it is skipping the architecture itself.

Both of these come down to: this is real work. Not less work than the executor role; different work. The thing that makes it tractable is that it's the work I'm best at — judgment, integration, decisions — not the work I'm worst at, which is typing the syntax of a language I'll never love.

What I'm not going back to

I'm not going back to "I can't build that, I'm not technical." I never could before. I can now. The not-technical part hasn't changed; the tools have.

If you run an operation — a business, a nonprofit, a department, anything with people depending on it — and you've been waiting for someone else to build the software you need, stop. The AI architect role is real. It's available to you. The first thing it builds will be wonky and you'll fix it as you learn. Six months in you'll have a codebase you understand better than most of the developers you could have hired, because you decided every line of it without writing any of them.

The architect role was always the highest-leverage one. AI just made it accessible to people who used to be locked out.

Take it.

TOOL — PDF · ready to use
06-architects-brief-template.pdf
Show contents
# Architect's Brief Template

For non-coders who direct AI coding work — or anyone handing a coding task to AI. Fill in the fields below before starting any session.

---

## Brief

**Goal**
What outcome do we need? One sentence, in plain language.

**Symptom**
What's wrong now, exactly? What does the user see / what does the log say?

**Constraint**
What can't change? (Existing data shapes, user-facing surfaces, deploy targets, etc.)

**Direction**
Which way to lean? (Fast vs. correct, simple vs. flexible, library vs. custom, etc.)

**Out of scope**
What we're explicitly NOT doing in this session.

**Definition of done**
How will we know the work is complete?

---

## What NOT to put in the brief

- A multi-step prescription ("first do X, then Y, then Z") for a problem you haven't diagnosed yet
- A specific function name or file path you haven't grep-verified exists
- An assumption about the existing code that you haven't confirmed

If you find yourself prescribing in detail, stop and re-ask: have I diagnosed, or am I guessing?

---

## Worked example

**Goal:** Users can pay for a project tier and immediately see their workspace appear in their dashboard.

**Symptom:** Currently, payments succeed but workspace creation is manual. Users wait for an email.

**Constraint:** Existing Stripe webhook is wired; existing PM board structure is fixed; can't change tier-pricing logic.

**Direction:** Lean on existing webhook. Keep changes minimal — extend the webhook handler rather than building a new flow.

**Out of scope:** Email notifications (separate task). Refund flow (separate task).

**Definition of done:** A test payment in staging produces a workspace in the PM board within 30 seconds. Logs show the trigger.

---

## Why this format works

A non-coder architect can't review whether the implementation is "right" — but can absolutely judge whether the brief was written clearly. If the brief is fuzzy, the implementation will be fuzzy. The brief is the deliverable that the architect owns.

---

Companion essay: *Non-Coder as Architect: How AI Inverts the Software Hierarchy*
← Back to Insights
~1300 words

I Hired Three AIs to Fight About My Codebase

It's a Saturday afternoon and I have three AIs running. One is planning my website rebuild — has the memory of every decision we've made, the codebase context, the project state. One is writing the code based on what the first one decides — has access to my files, can run commands, can deploy. One is driving my browser — clicking through Supabase admin panels, navigating Vercel deployment dashboards, doing the things web UIs require humans to do.

They're not the same AI. They're three different agents, each specialized, each handing off work to the next. They don't always agree. When they disagree, I'm the tiebreaker.

This is what an AI orchestra looks like at an operator's scale, why it works better than one AI doing everything, and how to set it up.

The "one AI for everything" assumption

When most people think about using AI for work, they imagine one chat window. You ask AI to do the thing. AI does the thing. Done.

That model breaks at the moment your work has different shapes. Some of my work is conversational and strategic — I want to think out loud and have AI push back on my reasoning. Some is code generation — I want AI to write fifty lines of TypeScript without me babysitting it. Some is web automation — I need to click through a Stripe dashboard with AI doing the clicks because there's no API for the thing I want to do.

When you force one AI to do all three, the AI is OK at all of them and great at none. The conversational AI gets distracted writing code; the code AI loses the strategic context; the browser-driving AI doesn't have the memory of what you decided yesterday.

Specialization beats generalism for AI agents the same way it does for humans. Three AIs each in their own role outperform one AI playing all three roles.

The three agents in my workflow

Three specialized AI agents in my workflow You as the orchestrator coordinating three specialized agents — strategist, executor, and browser driver. Three specialized AIs, one orchestrator YOU orchestrator / decider AGENT 1 Strategist Plans, has memory, conversational depth produces a brief AGENT 2 Executor Writes code, runs commands, ships produces a diff AGENT 3 Driver Drives browsers, clicks UIs without APIs produces clicks Each handoff goes through you. The brief, the diff, the click — all reviewed.

The strategist. This is where I do most of my talking. It has access to my memory folder, the codebase, my project state, my key references. We discuss what to build, plan the architecture, work through decisions. It's the AI that knows me best because I've been teaching it for months.

When it produces an output, the output is usually a brief — a description of what to build, with the constraints it has to satisfy. That brief feeds the next agent.

The executor. This is the AI that opens my repo, reads files, writes code, runs commands, runs tests, deploys. It doesn't have the memory of every conversation; it has the brief and the codebase. Its job is to take the strategist's plan and turn it into actual files on disk.

The executor is faster than I'd be by an order of magnitude. It also makes more mistakes than the strategist, because it's optimized for execution, not deliberation. So I review what it produces — diff by diff — and redirect when it goes off-plan.

The driver. Some things have no API. The Twilio admin panel. The Google Workspace settings. The Namecheap DNS console. The browser-driving AI watches my screen and clicks through these for me. I narrate what I want; it does the clicks; I watch.

This is the most fragile of the three because web UIs change. But for the things that have to happen in a browser, it saves real time.

How they hand off

The handoff pattern took some practice to land. The basic shape:

Strategist produces a brief. The brief is short — a few hundred words, usually. It says what to build, what constraints to satisfy, what files to touch, what the success criteria are. I read the brief. If I want changes, I redirect at the strategist level.

Once the brief is clean, I paste it into the executor. The executor goes off and works for ten minutes. It returns with a diff. I read the diff. If something's off, I redirect at the executor level — "this approach won't work because of X, try Y instead."

Once the code is good, the deploy step usually involves some manual configuration that needs the driver. I tell the driver what configuration values to enter; it clicks through; I watch.

The whole cycle takes maybe twenty minutes for a meaningful change. Doing it as one AI in one chat would take three times as long because I'd be context-switching the same AI between three modes.

Why this is the future I'm betting on

Two reasons.

The first is that specialized agents will keep getting better faster than generalist agents. The companies building "AI for X" will out-iterate the companies building "AI for everything," because narrower scope means tighter feedback loops. Specialized agents already work better; that gap will widen.

The second is that orchestration is itself a skill. Once you can run three AIs in coordination, you can run five, then ten. Each agent stays narrow. The human stays in the orchestrator role — deciding which agent gets which task, when work hands off, when something needs to be redirected. That's a job humans are uniquely good at, and it scales without becoming overwhelming if the agents stay specialized.

I'm betting on this, hard. Building my entire workflow assuming I'll have three to five specialized agents running at any time, with me as the conductor.

What's hard

Coordinating three agents is not free. Three things take effort:

Keeping them in sync. When the strategist and the executor disagree about something, that disagreement has to surface to me. Otherwise they'll quietly drift, and the executor will produce code that doesn't match the strategist's plan. The fix is making the brief explicit — if it's in the brief, both agents respect it. If it's only in the strategist's head, the executor doesn't know.

Knowing when to redirect each. The strategist's mistakes look like bad plans. The executor's mistakes look like bad code. The driver's mistakes look like missed clicks. They each fail in their own way. Pattern recognition takes practice.

Managing cost. Each agent costs API credits. Running three AIs in parallel runs up bills faster than running one. I've burned cash on shitty experiments — the same task running through all three agents twice because I forgot which one had the latest context. The solution is being deliberate about handoff, not parallel running everything.

What I'm not going back to

I'm not going back to one chat window. The "one AI for everything" model was a starting point, not a destination. The future is agents in roles, each narrow, each specialized, each handing off cleanly. The human is the orchestrator.

If you're using AI today and feeling like it's good but not great, the missing piece might not be a better model. It might be a different agent shape — three specialized ones instead of one generalist. The improvement isn't in the AI. It's in the architecture of how you use it.

The orchestra is here. You just have to start conducting.

TOOL — PDF · ready to use
07-handoff-protocol-template.pdf
Show contents
# Multi-Agent Handoff Protocol

A template for handing off work between AI agents — Cowork plans, Claude Code implements, Claude in Chrome navigates UIs. Each handoff is a structured message that the next agent can act on.

---

## Handoff message format

```yaml
from: <agent name>
to: <next agent name>
context_required: <files, URLs, state the next agent needs>

what_was_done:
  - <one line per completed action>
  - <include filenames touched, commands run, decisions made>

what_remains:
  - <one line per pending action>

watch_for:
  - <known risks, edge cases, ongoing flakiness>

success_criteria:
  - <how the next agent knows the handoff worked>

links:
  - <relevant URLs, dashboards, error logs>
```

---

## Worked example: Cowork → Claude Code

```yaml
from: Cowork
to: Claude Code
context_required:
  - /BUILD_BIBLE.md
  - /memory/MEMORY.md
  - File: supabase/functions/pm-agent/index.ts

what_was_done:
  - Reviewed Stripe webhook payload structure with the user
  - Confirmed event types: kickoff, tier-slug, pm_deposit
  - Decided: extend existing webhook handler, don't build new flow

what_remains:
  - Add handler branch for the three event types
  - Insert workspace-creation logic with Monday.com MCP
  - Add structured logging for each branch
  - Write integration test against staging Stripe events

watch_for:
  - Existing CHECK constraints in the DB; relaxed 2026-04-21 to accept these events
  - Webhook signature verification must remain intact

success_criteria:
  - A test payment in staging produces a Monday.com board within 30 seconds
  - All three event types log structured output

links:
  - https://dashboard.stripe.com/test/webhooks
  - Local file: cc-brief-2026-05-03-mobile-ui-security.md
```

---

## Cowork → Claude in Chrome

```yaml
from: Cowork
to: Claude in Chrome
context_required:
  - Vercel admin URL: https://vercel.com/<team>/<project>/settings/domains

what_was_done:
  - Confirmed domain ownership at registrar
  - Generated DNS record values (A and CNAME)

what_remains:
  - Add custom domain in Vercel project settings
  - Set DNS A record at the registrar to point at Vercel's IP
  - Verify SSL provisions within 10 minutes

watch_for:
  - The Vercel UI changed in March 2026; "Add Domain" is now under Settings → Domains, not Settings → General

success_criteria:
  - https://<custom-domain> resolves with valid SSL within 10 minutes
```

---

## Why this works

Each agent has different strengths and a different surface. The handoff is where context gets lost — most multi-agent failures are handoff failures, not capability failures. A structured handoff turns each agent into a station on a production line, with a written shipping document that travels with the work.

---

Companion essay: *Three AIs Walk Into a Codebase*
← Back to Insights
~1000 words

What I'm Spending on AI — A Live Receipt

Last Saturday afternoon, in a single working session of about two hours, I spent $1.53 in voice generation alone. I also burned through Claude API credits on text inference, ran several SVG diagrams (which are free), and saved about five hours of work I'd otherwise have done by hand.

This is a receipt for that session. It's the start of what I'm planning to make a recurring column — what I'm actually spending on AI, what I'm getting for it, and where I think the money is well-spent versus wasted.

No theory. Just numbers and what each unlocked.

The category breakdown for that afternoon

AI spending categories — one Saturday session Horizontal bar chart of category spend during a single working session. One Saturday session — actual spend Voice (TTS) $1.53 14 voice clips, V3 model Text inference ~$4–6 SVG / HTML $0 ~12 diagrams, all free Image / video $0 none this session Session total ~$5–8 vs $1,500–3,000 to commission externally

Voice generation: $1.53. I had AI narrate every response so I could listen while reading. About 14 voice clips, ranging from short confirmations to longer explainers. Eleven V3 model at $0.12 per 1,000 characters.

Text inference: hard to pin down exactly, but probably around $4–6. I'm on a paid plan so individual responses don't bill separately, but the workload was about two hours of substantive technical reasoning across multiple sessions. At commercial API rates that'd land in the $4–6 range.

SVG diagrams and HTML widgets: $0. These are rendered in the chat client, no additional API cost. I generated about a dozen of them.

No image or video generation this session. Higgsfield (image/video) is the next category if I'd needed it, at significantly higher per-output cost.

Total session cost: roughly $5–8 in actual API spend.

What that bought

Five things I would have either paid for separately or not gotten at all:

A full architectural review of my own codebase against my own rules. Twenty-one rules, file-by-file audit, written report. If I'd hired a developer for this, $200–500 minimum. AI did it in five minutes for the cost of a sandwich.

Decision support on a major architectural choice (TypeScript vs JavaScript). Real conversation, real tradeoff analysis, calibrated recommendation. If I'd consulted a senior developer for this kind of advice, $300–500 for an hour of their time. AI gave me a deeper analysis for under a dollar in inference cost.

Eight blog post drafts. Full essays, ready for editing. If I'd hired a ghostwriter, $200–500 per essay, and they wouldn't have known my voice. AI drafted them in my voice (after I taught it the pattern) at ~$1 per essay in inference cost.

Custom diagrams matched to my content. A few hundred dollars in design work if commissioned, free at the click level here.

Voice narration that let me listen while reading. A small thing, but real for an audiovisual learner. The $1.53 paid for two hours of having the explainer in my ears while my eyes were on the visuals.

The honest math: $5–8 in spend produced output that would've cost $1,500–3,000 to commission externally and would have taken weeks to coordinate. The compression is the value.

Where I'm wasting money

I don't want this to read as a celebration. There's spend I regret.

Re-running expensive operations because I forgot to save outputs. A few times I've had AI generate a long analysis, lost it because I closed the chat without copying the result, then asked again. Each one of those is a few cents to a few dollars repeated for nothing. The fix is saving outputs to files immediately. I now do this automatically; I didn't for the first month and I burned cash on shitty habits.

Generating voice for short responses that didn't need it. Auto-mode on TTS means every response gets voiced, including 50-character confirmations. Those add up. A "got it, applied" response that would have cost $0.01 in inference cost $0.06 in voice. Pennies, but pennies compound. I've been more deliberate about voice mode since I noticed.

Image generation on first drafts. AI image generation is expensive per output. I've burned $5–10 generating images that I then didn't use because the concept wasn't right yet. Better workflow: sketch with cheap SVG diagrams first, only graduate to AI image generation when the concept is locked.

The pattern: most waste is from ignorance about cost shape. Once you can see what each thing costs, the waste mostly goes away.

The "is it worth it" math

Roughly: I'm spending $50–100 per month in API costs for AI tools that I'm using as the primary execution layer for the work I'm doing. That's less than I'd pay for a single hour of a contractor's time. It produces output that would cost thousands to commission externally.

The math isn't subtle. For an operator doing knowledge-work-adjacent tasks — writing, analysis, planning, content — AI at this price point is a no-brainer. The question isn't whether to spend the money; it's whether you have the workflow to extract the full value.

I don't have a number that tells me I'm extracting full value. What I have is the receipts — every session, every task, what it cost, what I got. Over months, the receipts answer the question better than any prediction would.

The recurring column

I'm going to keep posting these. Once a month, probably, with the running totals plus a handful of notable expenditures and a couple of "this was worth it" or "this was a waste" examples.

Why publicly? Because the cost shape of AI tools is moving fast and most people are guessing about it. A real receipt from someone running a real business, updated monthly, beats the marketing copy of every AI vendor combined.

Also, it keeps me honest. Knowing I'll publish the numbers means I look at them. Looking at them means I notice waste. Noticing waste means I fix it. The publication is the discipline.

This is column number one. The receipts are open.

It's transparent. That's the point.

TOOL — CSV · ready to use
08-cost-tracker.csv
Show contents
date,session,tool,task,units,unit_cost_usd,total_usd,notes
2026-05-09,session-001,ElevenLabs,TTS for essay 9,2400 chars,0.000120,0.288,Voice clip — narrated essay opener
2026-05-09,session-001,Higgsfield,Image generation,1 image,0.40,0.40,Hero image for essay 16
2026-05-09,session-001,Claude API (Sonnet),Code review session,42K input + 8K output tokens,n/a,0.18,Architecture review
2026-05-09,session-002,ElevenLabs,TTS retry essay 9 part 2,2100 chars,0.000120,0.252,
,,,,,,,
,,,,,,,
SESSION TOTAL,,,,,,,
,,,,,,,
WEEKLY TOTAL,,,,,,,
,,,,,,,
NOTES,,,,,,,
"Provider rates: ElevenLabs $0.12 per 1K chars (verified 2026-05-09)",,,,,,,
"Higgsfield: $0.40 per image (verified 2026-05-09)",,,,,,,
"Anthropic Sonnet 4.6: input $3 / 1M tokens; output $15 / 1M tokens (verified 2026-05-09)",,,,,,,
"Always verify rates against current pricing before relying on the math",,,,,,,
← Back to Insights
~1300 words

Why I Use What I Use: An Operator's AI Stack

I'm using Claude Cowork to write this essay right now. Cowork is the enterprise tier of Anthropic's Claude — the workspace tool you might be reading me through. Most days, it's what I use for thinking, planning, writing, and operations work. Other tools handle other layers. Claude Code for actual coding. ElevenLabs when I want voice generation. Higgsfield when I need image or video output. Chrome agent when something needs to happen in a browser that doesn't have an API. The stack is deliberate — but it didn't start that way.

This is not a recommendation. It is a field report on what one operator is actually using and why, written so other operators can build their own stack with more information than I had.

The operator's AI stack — five surfaces, one workflow A vertical stack diagram showing five AI tools each owning a layer of the work — orchestration, code, browser, voice, image — feeding into shipped output. Cowork orchestration · planning · files · memory Claude Code repo · diffs · running tests · shipping code Claude in Chrome admin consoles · DNS · vendor portals · UI ElevenLabs voice · TTS · audio agents Higgsfield image · video · cinematic visuals

What a normal session looks like

Most working sessions have a rhythm:

I open Cowork. It already knows who I am and what I'm working on because I have fed it months of memory files — preferences, project state, decisions made, things to avoid. We start where we left off — sometimes from a structured plan, sometimes from a fragment of an idea I had walking the dog.

If the work is conceptual — strategy, writing, planning — we stay in Cowork. The conversation has a real rhythm: I think out loud, Cowork pushes back, we converge on something, I redirect when it goes sideways. Output goes to files in my project folder.

If the work needs code, Cowork hands a brief to Claude Code, which has access to my actual repository. Claude Code writes the code, I review the diff, we redirect when something is off.

If the work needs to happen in a browser — clicking through Twilio's admin panel, configuring Vercel — Chrome agent runs the clicks while I watch.

If I need a diagram, that goes through Cowork's visualization tools (free, instant). If I need a polished image or a generated video, Higgsfield (real money per output). If I want voice — for accessibility or for listening to long content I would otherwise have to read — ElevenLabs.

The whole stack costs me about $280 a month at my usage level — Claude alone is roughly $200 of that, plus another $80 across the supporting tools. For what it produces, it is fucking incredible value — but I will get to the honest accounting in a minute.

Why Claude over ChatGPT

Honest answer first: it could have been either. ChatGPT and Claude are both fully-featured enterprise AI tools with similar enterprise governance. For most operators, either would work. I picked Claude for three specific reasons.

The MCP ecosystem. Anthropic's Model Context Protocol lets you connect Claude to other tools. My setup has a dozen connections — Anthropic's Chrome agent, ElevenLabs for voice, Supabase for my database, several others. This integration shape is what turned Claude from "smart assistant" into "operational layer." OpenAI has equivalents now — workspace agents, plugins — but the MCP architecture is more open and better documented for the kind of custom integration I needed.

Tool-use quality. When AI is doing real work — writing code, navigating UIs, calling APIs — small differences in tool-use reliability compound. Claude's tool use feels less brittle than what I tried with other models. I cannot quantify that precisely; I can only say I stopped second-guessing it after a few weeks.

The voice fit. Claude responds in a register that lands for me — direct, calibrated, willing to push back. Other models felt either too eager-to-please or too clinical. Voice fit is not measurable but it matters when you spend hours a day talking to something.

None of these are universal advantages. ChatGPT has a larger ecosystem in absolute terms, more third-party integrations, and probably better recognition with most teams. If I were rolling out an AI tool to a fifty-person organization tomorrow, the larger user base and more familiar interface might win.

The payoff: a 24-hour rebuild

A community leader I work with lost their website completely the week before — nothing recoverable, no backup that mattered. They'd been operating without a site for days when we connected. Within twenty-four hours, they had a fully operational replacement back up: domain configured, content migrated, payment processing live, the works.

That's not a flex. That's what becomes possible when the stack is integrated. Strategy in Cowork. Code in Claude Code. Browser actions in Chrome agent. Voice and image where needed. No context-switching between tools, no "let me get back to you on that," no waiting for the next person in the chain. The work happens at the speed the operator can think.

A traditional rebuild of that scope — even with a competent developer — would have run a week minimum, more likely two. The thing that makes the difference isn't AI being smarter than humans. It's AI removing the latency between deciding and doing. Twenty-four hours is the difference between a community leader without a website and one whose website is back up. The compression matters when there are people depending on you.

This is what I mean when I say the integrated stack changes the math. Not faster typing. Different operational ceiling.

Why not Microsoft Copilot or Google Gemini

Two answers, for different reasons.

I'm not on Microsoft 365, so Copilot's "AI lives inside the tools you already use" advantage doesn't apply to me. For an organization that runs everything on M365, Copilot is probably the right starting point — the integration with Outlook, Word, Teams, and Excel is genuinely deep, and the identity and governance layers are already in place from existing M365 tenancy.

Google Gemini is the same story but for Google Workspace. I use Google Workspace for email and a few other things, but my actual operations layer lives elsewhere. So Gemini does not get the integration win for my specific work.

The general principle: pick the AI tool whose integration matches where your work actually lives. If you are on M365, lean Copilot. If you are on Google Workspace, lean Gemini. If your work is in custom tools, code, and services that need to be integrated together, lean Claude or ChatGPT for the open-ecosystem advantage.

What Claude Cowork doesn't do well

Things I work around:

It doesn't natively connect to my email or calendar. I keep Google Calendar separate. There are MCP plugins that fix this; I have not bothered yet because the workflow works without it.

Its native search is weak. When I want to find something across the open web quickly, I still go to Google, not to Cowork. Claude can search, but it is not where I instinctively reach.

Cost adds up. That $280 a month is real — and worth saying plainly: I'm a heavy user. Most operators won't spend that much. For a small organization rolling this out to twenty people on standard team licenses (typically $30–$60 per seat per month) you'd budget somewhere around $1,000 a month at the entry point, plus a margin for API usage that climbs as fluency deepens. Compare that to a couple of FTE hours; it is worth it. Compare it to the discretionary tech budget at a small nonprofit; it is a real conversation.

The integration setup takes time. MCP plugins are powerful but require setup. The first month I spent more time configuring than getting work done. Pretty rough trade in that first month — I almost gave up. The second month it paid back. Plan for that ramp before you decide whether the tool is worth it.

Why someone else might pick differently

If you are an executive at a 200-person organization on M365, with sensitive data that requires sector-specific compliance, and your IT team is small — Copilot is probably the right answer because it minimizes integration work and your governance layer is already there from your existing M365 contracts.

If you are at a research-heavy organization with strong existing OpenAI familiarity and want maximum third-party ecosystem flexibility — ChatGPT Enterprise is probably the right answer. Education sector specifically should look at ChatGPT Edu.

If you are an operator who works across many tools, needs deep custom integration, and values the open MCP ecosystem — Claude Cowork is what I have landed on, and what I would recommend you evaluate.

If you are at a small organization without budget for any of the above — start with constraint policy (personal-AI usage rules) and move to unification when the budget allows. The transition is the goal; the timeline is yours.

The point

The decision about which tool matters less than the act of deciding. Most organizations do not pick deliberately — they end up with whatever someone evangelized internally, or whatever was free at the time, or whatever the IT team approved without explaining why. That is not a stack; it is an accumulation.

The operators who get the most out of AI are the ones who picked their tool with intent, configured it deliberately, and integrated it into their actual work rather than leaving it as a side conversation.

I picked Claude Cowork because it fit my work, my values, and my evaluation. I am honest about its limits. I keep the option of switching open. The point is to be deliberate about the choice.

You will pick differently. That is fine. Pick deliberately.

TOOL — PDF · ready to use
09-tool-evaluation-rubric.pdf
Show contents
# Tool-Evaluation Rubric

Five criteria for deciding whether to add an AI tool to your stack. Score each tool 1-5 on each criterion. Sum below 18 = probably not worth it. 18-22 = trial. Above 22 = adopt.

---

## The five criteria

| # | Criterion | What it asks | 1 point | 5 points |
|---|---|---|---|---|
| 1 | **Connectivity** | Does it integrate with the rest of your stack? | Stand-alone, no API | First-class MCP / API into your existing tools |
| 2 | **Reliability** | Does it work the same way every time? | Output drifts; sessions break | Deterministic, version-locked, logs included |
| 3 | **Speed** | Does it match the pace of the work? | Latency breaks flow | Sub-second on hot path |
| 4 | **Cost transparency** | Can you see what each task costs? | Opaque billing, surprise invoices | Per-task cost visible during work |
| 5 | **Survives version changes** | Will this still work in six months? | Vendor changes break workflow | Vendor-agnostic interface, easy to swap |

---

## Scoring template

```
Tool: ____________________
Date evaluated: ____________

Criterion              Score (1-5)    Notes
1. Connectivity        ___            ____________________
2. Reliability         ___            ____________________
3. Speed               ___            ____________________
4. Cost transparency   ___            ____________________
5. Survives versions   ___            ____________________

Total: ___ / 25

Decision: [ ] Skip   [ ] Trial   [ ] Adopt
Trial period if adopting trial: ___ weeks
Re-evaluation date: ____________
```

---

## What this filters out

- Demo-quality tools that don't survive a real workflow
- Tools that work in isolation but don't connect
- Tools that are fast in the demo and slow in production
- Tools whose pricing model doesn't match how you actually use them
- Tools that will be deprecated or pivoted before your project ships

---

Companion essay: *Why I Use What I Use: An Operator's AI Stack*
← Back to Insights
~1300 words

Artifacts: How I Test the Feel Before I Touch Code

I have eight artifacts saved in my AI workspace right now. Mockups of homepages I haven't built yet. A site map for a project that doesn't exist in production. A reading interface for a manual that's mid-draft. Each one took thirty minutes to an hour to make. Not one of them required writing real production code.

This is one of the most useful things I've started doing with AI, and almost nobody talks about it.

Mockup → feel test → ship — the artifact loop Three stages with a feedback arrow back from feel test to mockup when something is off. STAGE 1 Mockup spec · screenshots · prose THE NEW STEP Live Artifact click it · feel it · adjust it before any code ships STAGE 3 Shipped already feel-tested if the feel is off, redirect — the spec was wrong

What artifacts actually are

In the AI workspace I use, an artifact is a clickable, interactive HTML file that renders inside the workspace itself. Self-contained. Live. Built in minutes by AI from a description of what I want.

It isn't a screenshot. It isn't a Figma file. It's a working HTML page I can click through, fill out, and react to. The buttons go places. The layouts respond. If I imagine a homepage with a hero and a featured section and a recent-essays list, I can see that homepage in front of me, render it, click on it, and decide whether the layout feels right — before any of it is real.

The closest pre-AI equivalent is a Figma prototype with hotspot links. Artifacts are similar in concept and faster by an order of magnitude. They render in the same workspace where I'm thinking about the project; I don't have to switch tools or hand off to a designer to see what an idea would look like.

What this unlocks

Three things matter:

Pre-build feel checks. The hardest thing about deciding what to build is that you can't really know how something will feel until you use it. Artifacts let me skip past that. Recently I was deciding whether a client portal should organize information by project (each project gets its own page with tabs) or by activity (one page for files, one for messages, filtered by project). I built both as artifacts in twenty minutes. Clicked through both. The activity-organized version felt confusing in a way I couldn't articulate from the description but was obvious in the click-through. We picked nested.

Talking through design without designers. Most operators don't have a designer on call. The decision-loop usually goes: describe what you want → designer interprets → wait → review their take → revise → wait. With artifacts, I'm in the loop directly. I describe what I want, see it rendered, redirect AI on what's off, see the new version. The cycle is ten minutes, not days.

Catching bad ideas before they're expensive. A few months back I was about to build a leads-organization UI for a community leader. Before any code, I built the layout as an artifact and clicked through it. The first version I'd have happily proposed; the artifact made me realize the navigation pattern wouldn't scale past about ten leads. We changed the structure before any of it shipped. That's a thirty-minute prototype saving a week of rework.

What I've actually used them for

Eight so far, in three rough categories.

Pre-build prototypes are most of them. UI mockups for client work and my own. Site structure diagrams. Homepage layouts I want to feel before I commit. These are the highest-leverage use because they prevent the "we built it and then realized it was wrong" failure mode. Five of my eight artifacts are this kind.

Reference libraries — collections of related material in one persistent page I can re-open. I have one that holds every visual diagram I've made for a manual I'm writing, with the SVG source code expandable for each one. Useful for design-system maintenance and for grabbing visuals to re-use later. Two of eight.

Live trackers — status dashboards that update over time. Less common but the most-revisited once they exist. The one I built for tracking a complex project gets re-opened weekly and tells me where everything stands at a glance. One of eight, but the most-touched.

How fast this actually is

Honest accounting. The first time I made an artifact it took an hour because I didn't know what was possible. Now most artifacts take fifteen to thirty minutes from "I want to see X" to "X is rendered and clickable."

A homepage mockup with hero, featured content, recent-items list, and a soft path — twenty minutes.

A site flow map showing every page and how they connect — fifteen minutes.

A full content reader showing nine essays and five manual chapters with proper typography and inline diagrams — ninety minutes (this one was complex because it required reading and converting all the underlying markdown).

The trade is unbeatable for someone weighing whether an idea is worth building. Twenty minutes of prototype against a week of building the wrong thing — more than worth it every time.

What artifacts can't do

They render HTML in a sandbox. No real backend. No persistent database. No authentication that actually authorizes anyone. Calls to external services are mostly blocked. Artifacts are not suitable for testing production logic or real data flows.

They also can't replace user testing. An artifact tells you how something looks and clicks; it doesn't tell you how an actual user will react when they encounter it for the first time without context. For real user research you still need real users.

And artifacts age fast — a mockup from three weeks ago might use design choices I've since revised. Treat them as point-in-time snapshots, not living truth. The current state of any project is in the code.

Why this is different from "ask AI to draw a wireframe"

The thing that makes artifacts work — as opposed to asking AI for a wireframe image — is that artifacts are interactive. I can click. I can see how navigation behaves. I can test forms, even if the forms don't really submit. The interaction layer reveals problems that static images hide.

A flat wireframe might show "homepage has nav, hero, three cards." An artifact lets me click the nav and see whether the navigation pattern feels right when actually used. Most design problems live in interaction, not layout. Static images can't surface them. Artifacts can.

What I'd encourage you to try

If you're using AI for any kind of work that has a UI — any kind of design or product or content question — start asking for artifacts before code. Describe the thing you want to see. Ask for it as a clickable HTML page in your workspace. Click through it. Decide whether the idea is worth building.

You'll catch bad ideas earlier. You'll make better decisions about what to invest in. You'll spend less on building the wrong thing.

The work is the same — deciding what should exist. Artifacts just let you see what your decisions actually feel like before you commit them to code.

TOOL — PDF · ready to use
10-feel-test-brief-template.pdf
Show contents
# Feel Test Brief — Template

Before you write any code, build a clickable artifact. Test the feel. Iterate the spec. Then ship the code.

---

## Brief template

```
I want a [feature / page / flow] that lets users do [outcome].

Build a clickable HTML mockup that includes:
- [the primary path the user will take]
- [the secondary path or recovery path]
- [any state changes the user should see]

Use placeholder data, but the data should be realistic enough to feel real.
Use the design system if one exists; otherwise system-default fonts and a neutral palette.

The mockup is for FEEL TEST. We're not implementing anything yet. The point is for me to click through it and tell you what's off before any real code gets written.

After I click through, I'll come back with one of:
1. SHIP IT — turn the mockup into real code
2. ADJUST — here's what's off, redo the mockup
3. KILL — I don't want this, the spec was wrong
```

---

## What to look for in the feel test

- **Friction points** — anywhere the click count is higher than expected
- **Surprise moments** — anywhere you, as the user, didn't know what to do next
- **Implicit assumptions** — anywhere the spec said something that the mockup made obviously wrong
- **Tone mismatches** — anywhere the visual register doesn't match the brand
- **State confusion** — anywhere "what happens next" isn't obvious

---

## Why this works

Feel tests catch a class of problem that spec reviews miss. Specs lie because they're abstract. Mockups can't lie — you click them, you feel them, you know.

The cost of a feel test is one AI session. The cost of skipping it is a build that ships and gets thrown out.

---

Companion essay: *Artifacts Test the Feel*
← Back to Insights
~1500 words

Before I Used AI for Work, I Used It for My Family

The first useful thing I built with AI was for my family.

Not a client project. Not a Serranix tool. A small thing I made because someone I love had a problem and I'd been quietly learning that AI could help solve it.

I'm not going to specify exactly what it was, because the specifics don't generalize to your family. What matters is the realization that came with it: AI isn't just a productivity tool for knowledge workers. It's leverage. The kind of leverage that, if ordinary people actually use it, could change what's economically possible for ordinary families.

That realization is why I'm doing Serranix. Not because I want clients. Because I want this leverage to spread.

AI as a multiplier for family-scale economic mobility A staircase showing each step as a tool a family member built with AI assistance, compounding leverage over generations. Starting position First tool built Second tool Family business Compounding Generational position Each tool an AI helps build is a step the family doesn't have to climb again. The leverage compounds across people, not just over time.

The shape of leverage

Most economic shifts in the last fifty years widened the gap between people who could deploy capital and people who couldn't. The internet was supposed to be different. In some ways it was — a kid in a small town can now learn anything, sell anything, reach anyone. In other ways it concentrated wealth more aggressively than the industrial revolution ever did. Network effects compound to whoever got there first.

AI has the same shape on the surface. The headlines are about hyperscalers, frontier labs, billion-dollar training runs. The narrative is "the future belongs to whoever has the most compute." If that narrative wins, AI is just the next round of capital accumulation, only faster and more concentrated.

But underneath, there's a different shape. The same models powering the headlines are accessible to anyone with twenty dollars a month and a browser. The same tools running corporate workflows can run a single family's calendar, a single small business's intake, a single teacher's lesson plans. The leverage isn't gated. The barrier is taste, intent, and the willingness to actually try.

Most ordinary people haven't tried. They've heard about AI, read articles about it, maybe asked it to draft an email. They haven't sat down with it and said here's a problem in my actual life — help me think.

When you do, things change.

The household scale

The first time I solved a real problem for someone I love using AI, the math broke in my head. The thing that had been bothering them for months got addressed in an afternoon. Not because AI did something magical — because I gave it the right context, paid attention to the output, and iterated until the answer was right.

That afternoon cost me about three dollars in API usage. The professional version of that work — hiring someone to think it through — would have been several hundred. The previous self-directed version of that work — me trying to figure it out alone — would have been weeks of frustration and probably given up.

Three dollars. An afternoon. A real outcome for a real person.

Repeat that pattern at the scale of an actual household, and the economics start to look different than the standard "you should use AI for productivity at work" narrative. AI handles the budget reconciliation that used to take Sunday mornings. AI helps the kids think through homework problems without doing the homework for them. AI handles a family's small-business bookkeeping so the operator can stop staying up late on Tuesdays. AI helps an elderly parent understand a medical letter that came in three pages of jargon.

None of this is paid productivity. All of it is real leverage. And the cost is dollars per month, not hundreds.

The wealth question

The framing I keep coming back to: AI is leverage. Leverage compounds. Whoever has access to leverage builds wealth faster than whoever doesn't.

Right now, the people most likely to have AI leverage are the people who already had a lot of it — knowledge workers in well-paid roles, founders of well-funded companies, executives at established firms. They're using AI to extend a lead they already had.

The people who would benefit most are the people without that lead. Operators of small businesses making just enough. Parents of multiple kids running a household on overtime. Immigrants navigating systems in their second language. Families in regions where professional services are too expensive. The ones for whom an extra few hundred dollars a month in operational efficiency is not productivity gain but actual breathing room.

If those people engage with AI as deliberately as the already-leveraged are engaging with it, the gap doesn't widen. It might narrow. That's the thing that inspires me when I think about AI and families.

That's also why I think the technology shouldn't be left to people who have no incentive to spread it. The natural incentive of capital is to concentrate. The natural incentive of incumbents is to gate. Spreading the leverage requires people who actually want it to spread — who use it, teach it, share what they figure out, build things ordinary households can pick up.

What I built for my family

I'm being careful not to specify what I built because the specifics don't generalize. Your family doesn't have my family's problems. The lesson isn't build the tool I built. The lesson is: pick a real problem someone you love is dealing with, and try AI on it. With patience. With attention. With the willingness to iterate.

The first tool I built was bad. I redid it twice. The second redo finally worked. That's how AI works for almost everyone — you don't get a tool by asking; you get one by iterating with intent.

What ended up working changed something in how my family operates. Not dramatically. Not headline-worthy. Just… cleanly. The thing that used to be friction is no longer friction.

I have built more since. I will keep building. The math that compounded over my Sunday afternoons is the same math that I think can compound for any family willing to engage.

What this means for Serranix

When I started Serranix, the easy thing would have been to position it as another AI consultancy. There are plenty of those. Most of them are aimed at well-funded mid-market companies for good economic reasons — that's where the immediate revenue is.

What I'm trying to do is different in the long run. The companies I take on as clients are community-rooted operators — small to mid-size businesses, nonprofits, public-sector teams. The work compounds because their wins are visible to their communities. The community sees what's possible. The community starts asking. The leverage spreads sideways.

Building tools for my own family is the first instance of that pattern. Building operating systems for community leaders is the next instance. The third instance is whatever an ordinary reader does after seeing what's possible from a practitioner who's documented it.

That third instance is the one I'm playing for.

A small invitation

If you've been reading this site and have not yet sat down with AI to solve a real problem in your own actual life — please do. Not for work. For your household, your family, your hobby, your community group.

Pick something specific. Try the cheap version first; twenty dollars a month gets you serious access. Iterate with patience. Ignore the AI hype; pay attention to whether your problem is closer to solved than it was an hour ago.

The first tool you build will be bad. The second one might work. The third one will surprise you.

Whatever you build doesn't have to be impressive. It has to be real. And real, repeated, compounds.

TOOL — PDF · ready to use
11-family-ai-onboarding.pdf
Show contents
# Family AI Onboarding Script

For introducing a non-technical family member to AI tools without overwhelming them. Designed for one sitting, ~30 minutes.

---

## Before the session

- Pick ONE problem they actually have (not one you think they should have)
- Use their tool, not yours (their phone, their email, their browser)
- Have one good AI account ready to share or sign up together

## Session structure (30 min)

### Minutes 0–5: The problem

Ask: "What's something you do every week that takes longer than it should?"

Listen. Don't recommend AI yet. Just write down their actual problem in their actual words.

### Minutes 5–10: The first try

Open the AI tool together. Type their actual problem in their actual words. Watch them watch the output.

Don't optimize the prompt. Don't fix their phrasing. The AI's job is to be useful even when the prompt is messy.

### Minutes 10–20: The iteration

Their first answer will be 80% useful. Show them how to ask for the missing 20%:
- "Make it shorter."
- "I didn't understand the third part. Explain it differently."
- "Do it again, but for [specific situation]."

This is the magic moment — when they realize the conversation is the product, not the first answer.

### Minutes 20–25: The save

Save the conversation in a way they can come back to. Bookmark, screenshot, save to notes — whatever works for them.

### Minutes 25–30: One thing to do alone

Give them ONE thing to try this week, alone. Just one. Pick something they care about.

Examples:
- "Help me draft a tough email to my landlord"
- "Plan a week of meals from what's in my fridge"
- "Help me understand this medical bill"
- "Write a thank-you note for the kid's teacher"

---

## What NOT to do

- Don't open with "let me show you what AI can do." That's a magic trick. We're not here for magic.
- Don't fix their typing. Their typing is fine.
- Don't introduce more than one tool. One is enough for the first sitting.
- Don't try to teach prompting theory. Prompting theory is a graduate course; this is kindergarten.

---

## Followups (next sitting)

If they want to keep going, the next sitting introduces:
- Saving useful prompts as templates
- Asking for the same thing in a different format (table, bullets, paragraph)
- Knowing when to push back ("that's not quite right, try again")

---

Companion essay: *Tools For My Family*
← Back to Insights
~1700 words

Good People Have to Be in the Room

There is an argument, made quietly by thoughtful people in workforce programs, in nonprofits, in community organizations, in the kinds of places where I work my day job, that AI is a problem to abstain from. The reasoning is reasonable. AI is being shaped by people whose values do not match the people we work with. AI infrastructure is consuming environmental resources at scales that are hard to defend. AI capabilities are moving faster than regulation. Why participate in any of that?

I understand the argument. I think it is wrong.

Specifically, I think abstention from AI work is itself a moral choice, and not the moral choice the people making it intend. Here is why.

Two rooms — same problem, different outcomes A side-by-side comparison: the room without good people in it makes one set of decisions; the room with good people in it makes a different set. ROOM A — no one pushed back Default decision passes consequences absorbed by people not in the room ROOM B — one good person showed up Default gets questioned a different decision becomes possible The shift is one person. The cost of opting out is the room you didn't shape.

Who shapes a technology

Technologies don't have inherent values. They have the values of the people who build, deploy, regulate, and use them at scale. Cars are not inherently dangerous; they're dangerous because we built infrastructure assuming they wouldn't kill many people, and now they do. Social media is not inherently outrage-inducing; it's outrage-inducing because the people who built the engagement engines optimized for outrage. Nuclear technology is not inherently bombs; it became bombs because the people who first developed it were preparing for war.

AI is at this same fork right now. Whoever engages with it first, with the most compute, with the most capital, with the most political access — those people are going to shape what AI becomes operationally for the next thirty years. Default policies. Default training data. Default integration patterns. Default interfaces. Default norms about what's acceptable.

If the people who engage are mostly motivated by capital concentration and military advantage, AI gets shaped by those motives. If thoughtful people — environmentalists, educators, community operators, public-sector workers, mission-driven nonprofits — abstain because the technology is "bad," they are choosing to leave the shaping to the people whose values they object to.

Abstention is not neutral. It is a contribution to the outcome they're trying to avoid.

What engaging actually looks like

To be clear: engaging with AI doesn't mean cheerleading for it. It means using it, deploying it, governing it, teaching others to use it well, and shaping the policies and norms around it.

For an operator at a community-rooted business: it means using AI tools where they earn their place, with policies that protect the people you serve.

For a workforce program leader: it means deciding what AI policy actually says about IP, governance, and training — and writing it well rather than punting.

For a nonprofit director: it means thinking about whether AI can help your beneficiaries access services they otherwise couldn't, and adopting if the answer is yes.

For a teacher: it means understanding how students are using AI and shaping how the classroom adapts, rather than pretending it isn't happening.

For an environmentally-engaged citizen: it means showing up at policy conversations about where data centers get sited, what tax incentives exist, what regulatory frameworks apply.

For a parent: it means using AI carefully for things that genuinely help your family, modeling thoughtful use to your kids.

None of this requires endorsing the worst uses of AI. All of it requires being present.

The cost of staying out

I work in workforce development. The people I serve professionally are the ones who tend to lose first when economic shifts happen — and they're the ones who have, historically, lost when each new wave of automation arrived. They lost when factories went overseas. They lost when retail consolidated. They lost when the gig economy reshaped service work.

If AI is the next major economic shift — and the evidence suggests it is — and the only people deploying it are the ones already at scale, the people I serve will lose this round too. Not because AI is intrinsically harmful to them. Because they were not in the room when the deployment patterns were set.

I cannot, in good conscience, watch that happen and stay out of the room.

The argument that "we shouldn't engage with AI because it's harmful" assumes the harm is in the technology. That's not where most of the harm comes from. The harm comes from where the technology is deployed, by whom, with what objectives, with what oversight. Those are choices made by the people in the room. If you are not in the room, you do not get to make those choices.

The harder version of the argument

There is a steel-manned version of "don't engage" that I want to acknowledge fairly: some people argue that engaging at all legitimizes AI as it currently is, and that withholding legitimacy is a more powerful lever than participating. The civil-disobedience argument.

I take this argument seriously. In some moments, it has been the right argument. Refusing to legitimize segregation, apartheid, or specific genocides has been morally correct even when participation might have offered marginal influence. There is real philosophical weight to the "I will not be in the room" position.

I don't think AI is one of those moments, for one specific reason: AI's deployment scale is going to happen with or without ethical participation. Withholding legitimacy from AI development does not slow it down meaningfully. It just changes who is in the room. In segregation cases, withholding legitimacy created economic and moral pressure on the system. In AI's case, withholding legitimacy creates room for less-careful actors to set the defaults.

This is the difference between "refusing to participate in an unjust institution that depends on participation for legitimacy" and "refusing to participate in a technological deployment that will happen anyway." AI is the second case.

What I am doing about it

I am not abstaining. I am doing what I can with the leverage I actually have.

I run a workforce program by day, where I make sure AI policy gets written carefully, training programs reflect both the upside and the risks, and the people we serve get access to AI literacy. That work is mine and it matters because the alternative is that they encounter AI shaped by people who don't have their interests in mind.

I run Serranix on the side, where I help community-rooted operators integrate AI into their actual operations — carefully, with constraint architecture, with policy that protects their people. The companies that work with me get systems that match their values, not generic AI products built for someone else's values.

I write essays publicly, including this one, where I try to show what thoughtful engagement looks like in practice. Some of these essays are about specific tools. Some are about specific frameworks. This one is about the underlying ethical posture: it is essential that thoughtful people engage with AI, not abstain from it.

I am not under the illusion that my individual work changes what AI becomes globally. I am under the conviction that if I, plus people like me, plus people with broader reach and harder positions than mine, all engage with intent, the cumulative effect is different than if we all stay out.

What you might do

If you are reading this and find yourself in the abstention camp — please consider engaging.

You don't have to use the tools that bother you most. You don't have to endorse the deployment patterns you object to. You don't have to be enthusiastic. You can engage critically, with constraints, with policy, with skepticism.

But please be in the room. The room is being built right now. The people inside it are going to shape what AI does to all of us for the next several decades. Whether your values are represented in that room depends on whether you show up.

I am as imperfect a representative of thoughtful values as anyone — too white, too American, too well-paid, too connected for the actual breadth of human experience. But I am in the room. I'd rather have flawed thoughtful people in the room than have the room belong only to the people who would happily exclude us.

Show up.

TOOL — PDF · ready to use
12-decision-room-checklist.pdf
Show contents
# Decision-Room Checklist

Before you decide whether to opt out of a meeting, board, or decision-making body, run this checklist. The cost of opting out is the room you didn't shape.

---

## Five questions

### 1. Who's in the room when I'm not?

List the people who will be in the meeting if you skip it. Are they the people you'd want making this decision?

If yes — the decision is in good hands. Skip is fine.
If no — that's a flag.

### 2. What decisions get made without dissent in this room?

If everyone in the room agrees on something you'd push back on, the decision moves forward as if it had unanimous support. Your absence is interpreted as agreement.

### 3. Who absorbs the consequences of the decision?

If the people affected by the decision aren't in the room, someone in the room needs to represent them. Will anyone do that if you don't?

### 4. What's the marginal cost of showing up?

Sometimes the cost of attendance is high (travel, time, lost focus). Sometimes it's low. Be honest about the actual cost — it's often lower than the felt cost.

### 5. What's the cost of the wrong decision?

If the room makes a default choice you'd push back on, what does it cost to correct later? Sometimes correction is cheap. Sometimes it's permanent.

---

## Decision rule

- 3 or more "concerning" answers → show up
- Even one "permanent cost" answer → show up
- All "fine without me" → skip with confidence

---

## What this filters out

- Skipping rooms because attendance is uncomfortable rather than because the room doesn't need you
- Skipping rooms because you're tired of being the only voice that pushes back (this is exactly why those rooms need you)
- Showing up to rooms that genuinely don't need you, out of guilt

---

Companion essay: *Good People Have to Be in the Room*
← Back to Insights
~1700 words

Stop Building AI in the Desert

There are mega-data-centers being built in the desert. Hundreds of them. Billions of gallons of water consumed for cooling. In regions where water is already scarce. To run AI infrastructure that, in cold climates, would consume a fraction of the water and a fraction of the energy.

It's bullshit.

I am writing this essay specifically to say so. Loudly. From a practitioner's perspective.

I am an environmental activist. I am also an AI integrator and operator. Those two things are not in tension for me. They are aligned. The technology I work with daily is being deployed in ways that are environmentally indefensible, and the deployment patterns are being set by people whose incentive is speed and proximity to power, not stewardship.

This essay is about why the desert siting of AI infrastructure is wrong, what should happen instead, and why operators — not just environmentalists — should care about this.

Cooling cost and water stress by climate zone Two siting scenarios for AI compute infrastructure — desert versus Arctic. Bars compare cooling energy demand and water stress, with the Arctic option showing a fraction of both. Same compute load, two siting choices DESERT SITING High cooling energy Water-stressed region Solar peaks ≠ load curve ARCTIC SITING Passive cooling — climate is the chiller Abundant fresh water Hydro · wind · stable load The math doesn't work in the desert. It works where the climate does the cooling for free.

What the desert means

Most of the AI compute going into production in the United States is being built in places like the Phoenix metro area, the Utah desert, the Texas plains, and the Nevada desert. These are not random choices. They're driven by specific factors: cheap land, favorable state-level tax incentives, proximity to existing fiber backbone, and political environments that don't ask hard questions about water consumption.

The cooling load on a hyperscale data center is enormous. AI training in particular — the runs that produce frontier models — generates heat that has to be dissipated continuously. In hot climates, that dissipation requires evaporative cooling, which means water. Lots of water. Lakes of it.

Specific numbers, for the record: a single hyperscale AI campus typically consumes hundreds of millions of gallons of water per year for cooling. The Phoenix metro area, where many of these facilities are clustered, has been declared a stage-one water shortage. The Colorado River, which serves much of this region, is in long-term decline. Aquifers are being depleted faster than they recharge.

This is happening anyway. The decision to put hyperscale AI in arid regions is being made now, in the absence of meaningful pushback, while the market for AI capacity is growing exponentially.

What the alternative is

Cold climates do not require evaporative cooling at the same scale. Outside air, ambient temperature, and free-cooling techniques can handle a significant fraction of the cooling load when the climate cooperates. Iceland already runs hyperscale data centers on geothermal power and ambient cooling. Northern Sweden, northern Norway, parts of Finland, the Pacific Northwest, the upper Midwest in winter — these regions can dissipate heat without consuming water at the scales the desert facilities require.

The energy efficiency improvement from cold-climate siting is also significant. Power Usage Effectiveness — the ratio of total facility energy to compute energy — runs around 1.4 in desert facilities and around 1.1 in well-designed cold-climate ones. That's roughly a 30% reduction in energy waste, not for any technology improvement, but for siting alone.

Translation: putting AI infrastructure in the right place, geographically, reduces both water consumption and total energy consumption substantially. Without sacrificing capacity. Without slowing AI development. Without any technological breakthrough.

Why this isn't happening

The honest answer is incentive structure.

Desert states aggressively offer tax incentives for data center construction because data centers represent immediate property tax revenue and modest job creation. State legislators and governors are happy to host hyperscale facilities even when the water consumption is publicly indefensible, because the political cost is paid in twenty years and the budget gain is paid this fiscal year.

The technology companies building these facilities are responding rationally to the incentive structure as it exists. They are not, in most cases, environmentally indifferent. They are choosing the locations where construction is fastest, regulatory environments are friendliest, and tax treatment is most favorable. The decisions are economic, not ecological.

The federal level — where infrastructure incentive policy could plausibly shift the calculus — has not engaged with AI infrastructure siting in any meaningful way. The CHIPS Act, the IRA, and various federal AI initiatives have not included siting requirements that would meaningfully shift facility location. There is no federal carbon pricing on data center electricity. There is no federal water-consumption charge on cooling.

Each of these is a fixable policy gap. None of them is being fixed because the political constituency for fixing them is small and diffuse, while the constituency benefiting from current incentive structures is concentrated and powerful.

What should happen

Three policy levers, in increasing political difficulty:

One: explicit federal tax incentives for cold-climate data center construction. A 30% federal investment tax credit for AI compute deployed in regions with PUE below 1.2 would shift the calculus immediately. The capital cost differential for cold-climate construction is real but manageable; offsetting it federally accelerates relocation without coercion.

Two: federal water-consumption assessment on facilities above a threshold. A per-gallon assessment on cooling water consumed by data centers above (say) 100MW capacity, with revenue earmarked for water infrastructure in the consuming region. This makes the water cost visible in the operator's economics rather than externalizing it onto the watershed.

Three: state-level pre-emption of subsidies in water-stressed regions. A federal rule that states experiencing declared water emergencies cannot offer property tax abatements to new hyperscale facilities. This is the politically hardest but operationally cleanest version: it removes the race-to-the-bottom dynamic at the state level.

None of these are radical. None of them slow AI development. All of them shift where AI development happens to places where the environmental cost is significantly lower.

Why operators should care

If you're reading this as an AI operator or executive at a mission-driven organization, the environmental siting of AI infrastructure may feel distant from your day-to-day. It isn't, for two reasons.

First, the environmental footprint of the AI you use is increasingly visible to your stakeholders. Funders are starting to ask. Donors are starting to ask. Beneficiaries are starting to ask. "Our AI runs in a Phoenix data center consuming water from the Colorado River" is going to be a harder answer to give in five years than it is today. Choosing AI vendors with cleaner siting now is a hedge against that conversation arriving.

Second, your political voice on this is real. The mission-driven sector has standing to comment on AI policy in ways that for-profit AI users don't. Your professional associations, your state attorneys general, your federal representatives — they hear from you with weight that they don't hear from for-profit consumers. Showing up to AI infrastructure siting conversations as a workforce program leader, a nonprofit director, a public-sector operator carries weight.

You don't need to become an AI policy expert to show up. You need to know what to ask for. Ask for cold-climate siting incentives. Ask for federal water assessments. Ask for state-level pre-emption of subsidies in water-stressed regions. The policy people will know what you mean. They'll be glad someone outside the AI industry showed up to the conversation.

What I am doing

I am writing this essay. I am sharing it with the people in my professional network who care about environmental policy. I vote for state legislators who care about siting. I tell AI vendors that I prefer cold-climate hosted services and ask my own vendors about their siting choices.

This is not enough. None of what I do as one operator, plus everyone like me as their own one operator, equals the lever that is needed to actually shift hyperscale siting policy. But it is more than abstaining from the conversation, and the conversation gets louder when more practitioners show up to it.

The desert is the wrong place to build AI infrastructure. Cold climates exist. Policy can shift the calculus. Pretending the siting question doesn't matter is a political choice. Saying it out loud is a political choice. Doing nothing is a political choice.

I am choosing the second one. I am asking you to join me, in whatever form your voice takes.

It's bullshit that we're building lakes of water into the desert to cool AI. It is correctable. The correction starts with operators and citizens making it impossible to keep doing this quietly.

TOOL — PDF · ready to use
13-ai-siting-rubric.pdf
Show contents
# AI Compute Siting Rubric

For evaluating where to site AI compute infrastructure. The default — desert siting — looks cheap on a per-acre basis and gets expensive when the cooling math is honest.

---

## Six criteria

| # | Criterion | What to ask |
|---|---|---|
| 1 | **Climate-driven cooling load** | What does the local climate require to keep chips at safe operating temperature year-round? |
| 2 | **Water availability** | Is the region water-stressed? What's the local water table doing? Is the watershed already over-allocated? |
| 3 | **Energy mix** | What's the marginal kWh source? Coal? Gas? Hydro? Wind? Solar with storage? |
| 4 | **Load curve match** | Does local generation peak when load peaks? (Solar peaks at noon; AI loads are ~24/7.) |
| 5 | **Community impact** | What's the host community's water budget? Property tax base? Workforce? Will the project draw from or contribute to the local economy? |
| 6 | **Long-term viability** | What's the 20-year climate forecast for the site? Permitting friction? Political risk? |

---

## Scoring template

```
Site: ____________________
Date evaluated: ____________

Criterion                          Score (1-5)    Notes
1. Climate-driven cooling load     ___            ____________________
2. Water availability              ___            ____________________
3. Energy mix                      ___            ____________________
4. Load curve match                ___            ____________________
5. Community impact                ___            ____________________
6. Long-term viability             ___            ____________________

Total: ___ / 30

Verdict: [ ] Reject  [ ] Conditional  [ ] Strong fit
```

---

## What changes when you score honestly

The desert sites that look cheap up front often score below 12 on this rubric — high cooling load, water-stressed region, solar peaks misaligned with load curve, community already pushing back.

Arctic and sub-Arctic sites score 22-28. Climate does the cooling for free. Water is abundant. Hydro and wind dominate the mix. The load curve and the generation curve match because the wind blows year-round.

The honest math is what changes the siting decision. The numbers should be public. The communities affected should see the rubric applied.

---

Companion essay: *Stop Building AI in the Desert*
← Back to Insights
~1100 words

I Did Most of This Without Browser Automation. Then I Got It.

I built the vast majority of Serranix without an AI browser extension. Months of work. Click after manual click through the systems I needed to configure — Twilio admin panels, Vercel dashboards, DNS records at Namecheap, Stripe configurations, Supabase auth settings.

It was familiar territory but new. I'd been on the web for thirty years, but inside admin consoles I'd never touched. Easy to get lost. Easy to misclick. Easy to spend an hour trying to remember which screen had the toggle I needed.

Then I got the Chrome extension that lets AI navigate browsers on my behalf.

Work flew.

Before and after browser-automation AI on admin tasks A before/after timeline comparing 15 minutes of manual hunting through admin UIs against 90 seconds of narration to an AI that does the clicking. BEFORE ~15 min · hunt the UI · misclick · context-switch · forget multiplied across ten tasks a day, across many vendor portals AFTER ~90 sec · describe · watch the AI navigate · stay in flow Roughly 10× compression on a class of work that used to bleed cognitive budget. Hours back per week.

What the extension actually does

The Chrome extension is a browser-automation tool that lets AI see what's on my screen, click links, fill in forms, navigate menus, and report back what happened. Same thing I'd do manually, but the AI does the navigating while I narrate or watch.

When I tell it "set up an A record for serranix.com pointing at this Vercel deployment," it goes to my DNS provider, finds the right page, fills in the values, and saves — without me having to remember whether the form is on the "Manage Domain" tab or the "Advanced DNS" tab or somewhere else entirely. The cognitive overhead of figuring out where a control lives in a UI I've never used before is exactly the kind of thing I'm worst at and the AI is best at.

The before-and-after on a typical configuration task: 15 minutes of me hunting through unfamiliar screens, sometimes getting lost, sometimes mis-clicking, sometimes giving up. After: 90 seconds of me telling the AI what I want and watching it happen.

That's not a productivity gain. That's a different relationship with the work.

Why I did the rest without it

For months I didn't know browser automation was on the menu for AI. I knew about chat AI. I knew about coding AI. I didn't know "AI that watches my screen and clicks for me" was a real thing I could install today.

So I clicked everything by hand. Hundreds of admin tasks. Configurations across a dozen different web apps, each with their own UI, each with their own conventions, each with their own places to get lost.

I got pretty good at it. I learned where things were. I built mental maps of which screens controlled which behaviors.

But the time cost was real. A normal working session might have ten administrative tasks scattered across five different web apps. Each task individually small. Cumulatively, exhausting.

When the Chrome extension landed and I configured it, the cumulative exhaustion went away.

What changed when work flew

Three concrete things:

Tasks I'd been deferring stopped being deferred. "I'll set up the SMS forwarding eventually" became "let me handle that now while I'm thinking about it." The friction was gone. The task budget for admin work expanded because individual tasks cost a fraction of what they used to.

I stopped context-switching during configuration. Before: I'd start a configuration task, get lost, switch back to the project I'd been working on, lose my place there too, switch back to the config, lose more time. After: I describe what I want, watch the AI do it, stay in the flow of whatever I was actually trying to build.

The mental energy I was spending on navigation went into thinking about whether the configuration was right. This is the deeper unlock. I was so focused on "where is the toggle" that I'd often miss whether the configuration I was making was actually correct for my situation. With navigation handled, I could think about the substance of the decision instead of the mechanics of finding it.

What it's not

The Chrome extension isn't magic. It still has rough edges.

It misclicks sometimes. Confirmation dialogs that pop up unexpectedly trip it up. Multi-step flows where step 3 depends on the result of step 2 sometimes need me to intervene. It struggles with custom UI components that aren't standard buttons and forms.

It's also a security boundary I'm careful with. The extension can see whatever I'm looking at, click whatever I tell it to click, and submit whatever I tell it to submit. In production-grade work I supervise carefully and review what's happening. For personal experimentation in admin consoles I already control, I let it move faster.

And it doesn't replace knowing what you want to configure. It just removes the friction of finding where to configure it. If I don't know my DNS record from my MX record, the extension doesn't help me decide which one to set up.

What I'd tell anyone starting now

If you're building anything that requires configuring multiple web services, install browser-automation AI early. It's the connector I wish I'd had on day one. The work I did without it took roughly twice as long as the same kind of work takes now. That's hundreds of hours over a project's lifetime.

The setup takes about thirty minutes — install the extension, grant permissions, walk through a tutorial. Pay the thirty minutes upfront and the rest of the project is transformational.

What I'm not going back to

I'm not going back to clicking through admin consoles by hand for tasks I do once or twice a year. I'm not going to start. The mental tax of navigation in unfamiliar UIs is exactly the wrong thing to spend my limited cognitive budget on.

The work I'm doing requires me to keep configuring new things. New domains, new vendors, new admin panels, new flows. Each one used to be its own small dread. Now each one is a 90-second narration to an AI that watches my screen.

If you're at the edge of building something that requires real infrastructure — set this up first. Don't wait until month three. The earlier you remove the navigation tax, the more energy you have for the work that actually matters.

TOOL — PDF · ready to use
14-browser-automation-setup.pdf
Show contents
# Browser Automation Setup Checklist

The Chrome extension that lets AI navigate web admin consoles is among the highest-leverage AI tools available. Setup takes about 30 minutes. The list below walks you through it without skipping the part that matters.

---

## Before you start

- A Chrome browser
- An AI account that supports the extension
- One small admin task you can use as a first test (e.g., updating a setting in a tool you already use)

---

## Setup checklist

### Extension install
- [ ] Install the AI browser extension from the official source
- [ ] Sign in with your AI account
- [ ] Grant the permissions the extension requests (review them — understand what you're approving)

### First-task test
- [ ] Pick a low-stakes task in a tool you already use
- [ ] Have the AI watch you do it manually first (so it learns the layout)
- [ ] Ask it to repeat the task in a different setting / record / row
- [ ] Verify it did what you asked

### Production-grade habits
- [ ] Always supervise sessions where the AI is touching production data
- [ ] Always confirm before submit / save buttons
- [ ] Never let it touch payment, payroll, or user-data exports without per-action confirmation
- [ ] Audit logs after the session — what did it actually click?

---

## Tasks where it shines

- Configuring DNS records, domain settings, SSL
- Adding users, setting permissions, configuring roles
- Pulling reports from admin dashboards
- Migrating settings between staging and production
- Filling forms with structured data from another source

## Tasks to avoid

- Anything involving payments
- Anything involving production user data exports
- Anything where the action is destructive and unconfirmable

---

## The 90-second rule

If a task takes you more than 90 seconds in a browser, it's probably a candidate for browser automation. The threshold is lower than you think.

---

## Audit pattern

After each session, review:
- What pages did the AI visit?
- What buttons did it click?
- What data did it submit or read?
- Did anything unexpected happen?

If yes to anything unexpected, document it. Errors that aren't documented repeat.

---

Companion essay: *I Did Most of This Without Browser Automation. Then I Got It.*
← Back to Insights
~1400 words

How a Project Management Tool Made My AI Coder Possible

Monday.com isn't AI. It's a project management tool — boards, columns, tasks, owners, statuses. Most AI integration content treats this layer as background, the boring infrastructure beneath the interesting AI work.

It's the opposite for me. Monday.com is what made my AI work. Without it, I would have spent the last year wrestling with chaos instead of shipping software.

The PM tool as the load-bearing layer beneath AI workflow A vertical layer-cake diagram. Foundation: PM tool with structured tasks, owners, statuses. Middle: structured logs of decisions and context. Top: AI session that picks up where the last one left off because the logs remember. PM tool — Monday, Linear, Notion, Asana tasks · owners · statuses Structured logs — fields, not chat threads decisions · context · history AI session reads the logs · resumes the work AI compounds when there's structure underneath it Without the bottom layer, AI sessions don't compound. The PM tool isn't infrastructure for the work — it's infrastructure for the AI.

The transition to working with an AI coder

When I started seriously using AI to help me build things, I had a problem most operators have when they first try this: I had no way to keep track of what was happening.

A normal AI coding session is dense. The AI suggests fifteen things. You pick three. Two of them work. One needs revision. The revision exposes another issue. You decide to defer that. You move to the next thing.

Multiply that across a week and the chaos is real. I'd come back to the same project two days later and not remember what I'd decided, what I'd shipped, what was broken, what was waiting on me. The AI didn't remember either — every session was fresh.

Without structure outside the AI, the AI work doesn't compound. You spin in place.

Monday.com became the structure outside the AI.

What I started doing

Each thing I was working on became a card. The card had: what it was, who owned it (almost always me), status (idea / in progress / blocked / shipped), and a notes field where I'd dump the relevant context — what I'd tried, what worked, what was deferred.

When I'd start an AI session, I'd pull up the card and feed the relevant context. The AI knew where we were because the card knew. When the session ended, I'd update the card with what shipped or what was deferred. Next session, same routine.

The AI didn't need to remember. The card remembered. The AI just had to read.

Within a few weeks, the chaos was gone. I could open a project after a week off and pick up exactly where I'd left it. The AI could read the trail of decisions and not redo work I'd already done. The compounding started.

What changed at scale

The personal use is one thing. Where it gets interesting is at the institutional level — when an organization uses Monday.com or a similar project-management tool as the structured log of all the work flowing through it.

In my day-job context, I see the same pattern at scale. Tasks flow through structured boards. Decisions are recorded with timestamps and owners. Status changes are logged automatically. Context is captured in fields rather than lost in chat threads.

When AI enters that environment, the AI inherits all of that structure. It can read what's been decided, who's responsible, what's pending, what's blocked. It can answer questions like "what's overdue" or "what did we decide about the X project last quarter" because the answer is sitting in a structured field, not buried in someone's email.

The AI is more effective at institutional scale than at personal scale, specifically because the structured log is more comprehensive at scale. More tasks logged, more decisions captured, more history available.

This inverts the usual narrative — that AI is harder to deploy in larger organizations because of complexity. The opposite turns out to be true when there's a working project-management layer underneath: the more structure you've built, the more AI can do.

Why logging is load-bearing

Logging is the thing most people miss when they think about AI integration.

AI is good at reading. It's not good at remembering. Every session starts cold. The thing that makes AI usable across sessions, across teams, across time is whatever external system is keeping track of state.

For a personal operator, that can be a memory file or a personal task tracker. For an institution, it has to be more — a real project-management tool with logged decisions, owners, statuses, deadlines, dependencies. Something AI can read at the start of any session and immediately know what's been decided.

The orgs I've watched try to integrate AI without this layer end up with the same problem I had at the personal level, only worse. The AI does work but the work doesn't compound. People reinvent decisions because the previous decision wasn't logged anywhere AI could find it. The friction stays.

The orgs that get this right have already invested in logging — not because they were preparing for AI, but because operating at scale forces structured logs eventually. AI just makes the existing logging investment pay off more than it used to.

What I'd tell anyone starting out

If you're trying to use AI to help with anything substantial — coding, content, analysis, ops — invest in your project-management tool first.

It doesn't have to be Monday.com specifically. It could be Linear, Notion, Asana, Trello, any structured task system. What matters is:

  • Tasks have owners and statuses
  • Context lives in structured fields, not buried in chat
  • Decisions get logged where they happen
  • AI (whichever AI you're using) can read the system

The hour you spend setting up your task structure is worth ten hours of AI work that compounds because the structure is there.

The connector I wish I'd had on day one was browser automation. The connector I'm grateful I had from early on was project management. They work together: structured logs let AI know what's been decided; browser automation lets AI execute on what's been decided. Both are infrastructure. Neither is AI itself.

Build the infrastructure. The AI work follows.

A note on the institutional pattern

When I see organizations using Monday.com or similar at scale, the AI integration goes differently than when they don't. The pattern I keep noticing:

Organizations with mature project management tend to adopt AI in a measured way and get real results. The AI plugs into existing logs and makes them more useful.

Organizations without mature project management tend to either avoid AI (because they can't see how it would fit their chaos) or adopt it superficially (because the AI has nothing to anchor to). Neither path produces compounding value.

The lesson: if you're a community leader thinking about AI for your organization, ask first whether your project management is solid enough to feed an AI. If it is, the AI investment will pay off. If it isn't, fix the project management first. That's where the leverage actually starts.

TOOL — PDF · ready to use
15-pm-board-template.pdf
Show contents
# PM Board Template — Optimized for AI Handoff

A project-management board structure designed so AI can read the state and resume work. Works in Monday.com, Linear, Notion, Asana, Trello — wherever you already log work.

---

## Required columns

Each card / task needs these fields. Without them, AI can't pick up where the last session left off.

| Field | Type | Why it matters |
|---|---|---|
| Title | Text | One-line summary, scannable |
| Owner | Person | Who's accountable |
| Status | Select (Idea / In Progress / Blocked / Shipped / Killed) | What state the work is in |
| Tier | Select (S / M / L / XL) | Effort estimate |
| Last updated | Date | How stale this card is |
| Context | Long text | Decisions made, things tried, what's deferred |
| Linked source | URL | Code, doc, design, or external link |
| Next action | Text | The single next step — owner-readable |

---

## Recommended columns (add when relevant)

| Field | Type |
|---|---|
| Blocking on | Person or task |
| Deadline | Date |
| Cost so far | Number (USD) |
| Customer affected | Text |
| Postmortem link | URL |

---

## What goes in Context

The Context field is the load-bearing one for AI continuity. Keep it append-only. Each entry dated.

```
2026-05-09 — Decided: extend webhook handler instead of new flow. Reasoning: existing tests cover it, change is smaller.
2026-05-08 — Tried: separating handler. Got tangled in tier-pricing logic. Backed out.
2026-05-07 — Spec: payment → workspace within 30 sec. Source: cc-brief-2026-05-03.md
```

When AI picks up the card, it reads Context first.

---

## AI session bootstrap (paste at start of session)

```
Read the PM board card titled "<title>". Pay particular attention to the Context field — it's the history of decisions and attempts on this work. Do not redo work that's already in Context. Pick up from "Next action."
```

---

## When this works

- Solo operators with multiple projects
- Small teams handing off work between humans and AI
- Long-running projects where institutional memory is the bottleneck

## When this is overkill

- One-person, one-project operations where memory is short
- Throwaway scripts and one-off tasks

---

## The compounding effect

The first month of using this discipline feels like overhead. By month three, the AI sessions are unrecognizably better. The board is the AI's working memory; you've just made it queryable.

---

Companion essay: *How a Project Management Tool Made My AI Coder Possible*
← Back to Insights
~1500 words

I Don't Have a Fancy Degree. I Have a Service Record. Here's How I Made It Readable.

Most veterans hold a credential nobody can read.

Not nobody — other veterans can read it. People who held the same rate or MOS can read it. The institution that issued it can read it. But hand a service record to a civilian hiring manager, a community board, a college admissions reviewer, or anyone else who didn't serve, and it becomes a wall of acronyms, rate codes, NEC strings, and award letters in a language they don't speak. The document does its job inside the institution. It does almost no work outside.

That's a translation problem, not a credential problem.

I've been thinking about this because I just used AI to build a verified, citable bio out of thirty-some scanned pages from my own service record. The result is a document anyone can read, with primary-source citations behind every claim. Three lengths — short, medium, long — same factual base, all verified. The work took an evening. The output is something I would never have built by hand.

The methodology is what I want to share. If you're a veteran sitting on a real credential nobody can decode, this works.

Service record to verifiable bio — the five-step pipeline A horizontal pipeline. Scan the documents, extract facts with AI, audit yourself, translate into civilian register, save the citation map. STEP 1 Scan every page STEP 2 Extract AI reads · cites STEP 3 Audit you correct it human-only step STEP 4 Translate civilian register STEP 5 Cite verifiable bio The pipeline runs in an evening. The bio it produces lasts a career. Each citation in the final bio points back to a specific scanned page — credibility offered, not requested.

The two-sided gap

The gap exists for two reasons, both of them honest.

On the civilian side, hiring managers and community decision-makers haven't read enough service records to know how to read them. A "PS3" doesn't mean Personnel Specialist Third Class to them. A "NAM" looks like a typo. An award citation is full of phrases — "while forward deployed," "during an unannounced inspection," "from Fleet Examining Group" — that have specific operational weight to a sailor and zero context to a civilian. That isn't the civilian's fault. It's a vocabulary mismatch.

On the veteran side, most of us never translate. We hand over the resume that was useful while we were still in. We list "PS3" because that's what was on our chest. We don't write "led a personnel office at sea, processed pay and travel for a 340-sailor crew with a documented four-fold cycle-time reduction recognized at command level." Both are accurate. Only one tells the story.

The translation isn't dishonest. The translation is the work that should have been done before the resume left your hands.

What AI changes

A year ago, building a translated bio meant either hiring someone fluent in both worlds or staring at scanned documents across long evenings, trying to think in two registers at once. AI changes the math.

Specifically: AI is good at four things this particular task needs. It reads documents. It extracts structured facts from them. It rewrites those facts in a different register without losing the substance. And it preserves citations. Those four capabilities together are most of the work.

What AI is bad at — and what you still have to do yourself — is the judgment about what is load-bearing. Which awards are unusual at your rate? Which qualifications run deeper than the rating typically requires? What's the through-line that makes your record different from the next person who held the same MOS? That's still a human call. AI can amplify the work, but it can't replace the judgment.

The five-step process

This is what I did. It is not complicated.

Step 1. Photograph or scan every page. DD-214s, evaluation reports, award citations, training certificates, PQS qualification history, awards record, Letter of Commendation cover sheets, Letters of Appreciation, anything. Don't pre-filter. Capture it all. The reason is that the documents tell stories together that no single page tells alone — your DD-214 lists your awards, but the award citations explain what each one was for, and the EVALs explain what was happening around each one.

Step 2. Hand it to AI and ask it to read. Specifically: read the documents and extract verifiable facts, with citations to specific images. Do not ask AI to interpret yet. Just extract. The output should be a list of facts with source-document references — names, dates, ranks, units, awards, exact citation language.

Step 3. Audit what came out. This is the most important step. AI will misread some things. Names get transcribed wrong. Dates flip. Acronyms get misexpanded. Handwritten signatures get guessed at. You're the only person who actually knows your record, so you read everything AI extracted and flag what's wrong. Correct it. Add what got missed. The audit pass is where the bio earns its trustworthiness — and it's the step nobody else can do for you.

Step 4. Now translate. Once the facts are clean, ask AI to write three versions — long, medium, short — with citations preserved. The translation step is where mil-speak becomes operations vocabulary. "Decreased processing time of newly reported personnel's pay, allowances, and travel claims from 60 to 15 days" becomes "led a four-fold reduction in pay-cycle processing time, recognized at command level." The substance is the same. The reader changes.

Step 5. Save the citation map. This is the difference between a bio and a verified bio. Every numbered citation should point to a specific scanned document. Anyone who reads the bio can verify. You're not asking them to trust you; you're handing them the evidence and inviting them to check. That's a different kind of credibility — one that doesn't require the reader to take your word.

The full process — scanning, extracting, auditing, translating, citing — took me about six hours including AI sessions and my own audit time. Six hours to convert a credential nobody could read into a credential anyone can verify.

What I found doing it on my own record

A few things stood out when I saw my own record translated.

First, my Navy and Marine Corps Achievement Medal cites a specific operational improvement. The citation states I was directly responsible for compressing pay-cycle processing time from sixty days to fifteen — a four-fold cycle-time reduction — during a forward deployment to the Western Pacific. The same citation credits me with the ship receiving an "Excellent" overall grade from the Fleet Examining Group during an unannounced inspection. I'd been telling people I had "a NAM," which is true and means almost nothing to a civilian. The citation's actual content is exactly the kind of substance an operations-minded reader wants to know.

Second, a Letter of Commendation in my file is from a Rear Admiral. Flag-officer-level recognition, signed by the Commander of a Carrier Strike Group, calling out two hundred pay and personnel transactions during a Pacific deployment and exceptional leadership during multinational exercises. At the E-4 pay grade, that level of recognition is uncommon. I had never put it in front of anyone outside the Navy because I didn't know how to translate it. It read to me like noise. It isn't noise.

Third, my qualification stack was deeper than the standard for my rating. Most Personnel Specialists don't qualify into the damage-control roles I held — Crash and Salvage Crewman, Aqueous Film Forming Foam Operator, Repair Party Investigator, Advanced Shipboard Firefighter. The PQS record shows the full list. That's a cross-functional indicator that translates directly to "this person learned the whole operation, not just their function."

None of this was new information to me. I lived it. The point is that the translated version made it visible to people who hadn't lived it. That's the whole exercise.

Why now

This is the right moment for veterans to do this work because AI just made the cost low enough.

Six hours to convert your service record into a verified, civilian-readable bio is a six-hour investment that pays back every time someone reads it. If you're a veteran moving into civilian work, applying to graduate programs, joining boards, doing speaking work, building a consulting practice, running for office, or just trying to land the job you want — the verified bio is the document you need, and the methodology to build it is now within reach of anyone willing to spend the evening.

It extends to the resume too. Same documents, same translation discipline, different output format. The bio you build first becomes the source the resume is excerpted from. Build the long version once; everything else is a different cut of the same factual base.

A note for other veterans

If you served, your record is real. You just need to translate it.

The rating you wore doesn't speak the language of the rooms you're trying to enter. That's a fixable problem. A weekend of work, an AI you trust to read documents, your own audit pass on what it extracted, and a translation step at the end — that's the whole project.

Don't underestimate what's in your file. Most of us did. The medals you stopped thinking about, the qualifications you took for granted, the letters from commanders that you filed and forgot — they are operationally specific, often unusual at your rate, and translatable. The translation is the resume.

I built mine with citations because I want every claim to be checkable. If you build yours the same way, you'll have a bio you can hand to anyone, and they'll be able to verify it themselves. That's the kind of credibility that doesn't ask for trust. It offers the evidence.

The school you went to was real. The transcript exists. You just have to translate it.

TOOL — PDF · ready to use
16-service-record-translation-pack.pdf
Show contents
# Service Record Translation Pack

The full five-step methodology for translating a military service record into a verified, civilian-readable, citable bio. Includes the AI prompts at each step.

---

## What you need

- Every page of your service record, scanned or photographed (DD-214, EVALs, award citations, training certificates, PQS, awards record, Letters of Appreciation — everything)
- An AI tool that can read images and write structured output
- An evening (about six hours including audit time)

---

## Step 1 — Scan

Photograph every page. Don't pre-filter. Use your phone in a well-lit area; high resolution is fine.

File-name pattern suggestion: `record-NN-<short-name>.jpg` (e.g., `record-01-dd214.jpg`, `record-12-nam-citation.jpg`). The numbers help the AI cite them later.

---

## Step 2 — Extract (AI prompt)

```
You are reading a packet of scanned pages from my military service record. For each image:

1. Identify what the document is (DD-214, EVAL, award citation, training certificate, PQS history, etc.)
2. Extract every verifiable fact — names of awards, dates, ranks, units, citation language, decoration codes, qualifications
3. Cite the source image filename for every fact

Output as a structured table:
| Fact | Source filename | Notes |

Do NOT interpret or rewrite yet. Just extract and cite.
```

Save the output. This is the working source-of-truth.

---

## Step 3 — Audit (you, alone)

This is the human-only step. The AI will get some things wrong:
- Names misread from handwritten signatures
- Dates flipped
- Acronyms misexpanded
- Award counts miscounted

Read every line of the AI's extraction against the original images. Correct what's wrong. Add what got missed. Mark anything the AI couldn't read.

The audit pass is what makes the bio trustworthy.

---

## Step 4 — Translate (AI prompt)

```
You have an audited list of facts from my service record. I need three civilian-readable bios derived from these facts:

1. Long bio (~900 words) — for an About page, deep presentation, or detailed reference
2. Medium bio (~250 words) — for partner profiles, conference bios, LinkedIn About
3. Short bio (~75 words) — for footers, social headlines, podcast one-liners

Translate military terminology into civilian-readable equivalents. For example:
- "PS3" → "Personnel Specialist Third Class" (with brief explanation if needed)
- "decreased processing time from 60 to 15 days" → "led a 4× reduction in cycle time, recognized at command level"
- "ESWS pin" → "system-of-systems credential earned by demonstrating knowledge of every department aboard a guided-missile cruiser"

Preserve numbered citations [1], [2], [3] inline so each claim can be verified.

Tone: substantive, plain, not promotional. Lead with the work, not the medals.
```

Save the three bios. Edit each by hand to make sure the voice is yours.

---

## Step 5 — Cite

At the bottom of each bio, include a Sources section that maps each numbered citation to the source-document filename:

```markdown
| Cite | Document | File |
|---|---|---|
| [1] | NAM citation, signed by the Commanding Officer of the ship, dated 20 June 2007 | record-12-nam-citation.jpg |
| [2] | Letter of Commendation from the Rear Admiral commanding the Strike Group | record-13-loc-rear-admiral.jpg |
| ... | ... | ... |
```

Anyone reading the bio can verify any claim. You're not asking for trust — you're offering the evidence.

---

## What you get

- Three bios, all from the same factual base, all citation-backed
- A reusable source-of-truth document you can re-cut for any new application
- A resume that tells your actual story instead of hiding it behind acronyms

---

## A note on privacy

If your record contains personally identifying information (SSN, DOB, full address) on the scanned pages, consider redacting before sharing the documents themselves. The bio prose doesn't need them.

If your record names other service members (commanding officers, supervisors), you can reference them by role rather than by name in your bio — "the Commanding Officer of the ship" or "the Rear Admiral commanding the Strike Group" reads cleaner and respects others' privacy.

---

Companion essay: *I Don't Have a Fancy Degree. I Have a Service Record. Here's How I Made It Readable.*
← Back to Manual
Chapter 01 · Part: The Landscape

Chapter 1 — Who This Manual Is For

This manual is for community leaders thinking about AI in their organizations.

You don't need to be technical to use it. You need to be willing to think about how AI changes what your team does, what information it has access to, and what risks emerge from that change.

What this manual does

It provides a framework for thinking about workplace AI policy: a tiered argument (avoid → constrain → unify), a set of governance questions to answer, sample policy language to adapt, and a set of operational practices that have been tested in real work. It assumes you're going to write or update an AI policy in the next six months. It tries to give you what you'd want at the start: a way to think about the questions, plus templates so you don't start from a blank page.

It's written by a practitioner — an executive and operator who has been using AI in real work for several years and has watched community-rooted organizations encounter the same questions repeatedly. Field notes, sharpened.

What this manual is not

This is not legal advice. The sample policy language in Chapter 11 is a starting point for a conversation with employment counsel, not a substitute for one. Every organization's compliance environment differs.

It is not a vendor pitch. The author has tools they prefer (covered in a companion essay), but the manual itself is intentionally vendor-agnostic — the framework should outlast any specific product cycle.

It is not a complete answer. AI policy is moving faster than any single document can keep up with. The manual provides a foundation; the work of staying current is yours.

It is not advocacy. The manual takes a position (constraint and unification beat avoidance), but it doesn't argue you should adopt AI faster, broader, or differently than your specific context warrants. The reader gets to decide.

How to use it

You can read it in order if you want the full argument. The chapters build: foundation in Part I, frameworks in Part II, operational practice in Part III, adoption mechanics in Part IV.

You can also jump around. Chapter 2 names the three fears most organizations have; Chapter 7 covers personal-AI policy specifically; Chapter 11 has the sample language. Each chapter stands on its own enough to be read first.

If you're early in the policy process, start at Chapter 2 — that's where most people's instincts are, and the manual's argument starts by acknowledging them.

If you're already drafting policy, jump to Chapter 11 (sample language) for templates, with Chapter 7 and Chapter 8 for the substance behind them.

If you're trying to understand the landscape before you start, Chapter 3 (the avoid/constrain/unify argument) is the manual's spine. Read that first.

A note on tone

The voice across this manual is direct, practitioner-grade, and sober. It tries to take your situation seriously without being heavy. It doesn't assume you're an enthusiast for AI; it assumes you have decisions to make and want help making them well.

If a chapter starts to feel like advocacy or lecturing, that's a writing failure on the author's part. The goal is to be useful, not to convince.

A companion archive of essays — at the same site as this manual — provides longer-form first-person practitioner perspective on specific tool choices, day-to-day workflow, and personal practice. Where the manual aims for vendor-agnostic framework, the essays are explicitly dated and specific. Use whichever fits your need at the moment.

← Back to Manual
Chapter 02 · Part: The Landscape

Chapter 2 — The Three Fears, Named

Most organizations approach AI policy from fear. That's reasonable. The technology is unfamiliar, the risks aren't fully understood, and the costs of getting it wrong fall on the organization, not the technology vendor. Fear is a legitimate starting point.

The problem is when fear becomes the only frame. Policy written entirely from fear tends toward avoidance, which (as Chapter 3 argues) doesn't actually reduce risk — it just hides risk in places the organization can't see.

This chapter names the three fears most community-rooted organizations carry about workplace AI, in plain terms. It doesn't refute them. Each is real. The rest of the manual is structured around addressing each one operationally.

Fear 1: AI going wild

The fear: someone uses AI for something important, AI produces output that's confidently wrong, the human acts on it without checking, and damage results. The organization gets reported in the news as "the place where AI gave bad advice that hurt someone."

Specific scenarios: AI confidently citing research that doesn't exist. AI making decisions that should be human judgment calls. AI processing intake forms in ways that miss important context. AI drafting communications that go out without proper review.

This fear is real. Hallucinations happen. AI confidence does not correlate with AI accuracy. Output quality varies in ways that aren't always obvious to a non-expert reviewer. Organizations that adopt AI without explicit accountability infrastructure find these failures eventually, often the hard way.

The fear is addressable through three things, all covered later in the manual: explicit constraint architecture (Chapter 4 — the Bible Method), output review practices (Chapter 11 — sample policy), and audit cadence (Chapter 6). AI can be made accountable; what makes it wild is the absence of accountability infrastructure, not the technology itself.

Fear 2: IP leakage via personal AI

The fear: an employee uses their personal AI account to help with real work, pastes in proprietary information — research data, participant records, internal documents, contracts — and now that information lives outside the organization's control. Possibly trained on. Possibly logged. Possibly leaking out in future outputs to other users.

Specific scenarios: a research assistant pasting pre-publication findings into a personal AI tool for a quick rewrite. A program manager dumping a participant case file into an AI to summarize it. A development director feeding board materials into AI to get a draft of a donor email.

This fear is real and more common than most organizations realize. Personal AI usage by employees is happening right now in most workplaces, with or without policy. The information being shared is often more sensitive than the employee recognizes in the moment — particularly because pasting feels casual in a way that emailing the same information would not.

The fear is addressable through Chapter 7 (personal-AI policy), Chapter 8 (enterprise alternatives), and Chapter 11 (sample policy language for IP protection). The cleanest answer is to provide an organizationally-controlled tool so personal AI usage becomes the exception rather than the default — but that's a longer-term move. Personal-AI policy is the immediate floor.

Fear 3: Public optics

The fear: the organization's adoption of AI is perceived poorly by the public, by funders, by the sector. The phrase "jumping on the AI bandwagon" reads as undermining trust at a moment when sector reputation matters. Donors get nervous. Funders ask uncomfortable questions. The community loses confidence.

This fear is also real, and specific to community-rooted contexts in a way the others aren't. A for-profit company adopting AI for productivity isn't taking the same reputational risk a community-rooted organization is. Sector culture matters. Community-rooted organizations operate in a context where "we're using AI now" can read as either thoughtful operational improvement or trend-chasing — and the difference between those readings is often in the tone of how adoption is communicated, not in the actual practice.

The fear is partially addressed by demonstrating thoughtful adoption rather than enthusiastic adoption. Chapter 9 (AI for community-rooted organizations) covers this directly: how to adopt in a way that reads as careful and operational rather than trend-following. The other half of the answer is in tone: organizations that announce their AI strategy with measured operational language tend to land better than ones that announce with enthusiastic vision language.

The connecting thread

Each fear is real. Each is addressable. None of them are reasons to avoid AI entirely; all of them are reasons to adopt thoughtfully.

The manual's argument — across the remaining chapters — is that the way through these fears is constraint architecture combined with eventual unification. Personal-AI policy reduces Fear 2's exposure today. Enterprise governance addresses all three fears at once when an organization is ready. The transition between the two is operational work, not a single decision.

The next chapter (Chapter 3) makes this case directly: why constraint beats avoidance, and why unification beats constraint. From there, the rest of the manual is implementation.

A note for the reader: if you came to this manual with one of these fears stronger than the others, that's typical — most policy work starts where the loudest concern is. The structure of the manual lets you go to the chapters most relevant to your loudest fear first. Chapter 7 for IP leakage. Chapter 8 for governance and "AI going wild." Chapter 9 for the optics question. Chapter 11 for sample language across all three.

← Back to Manual
Chapter 03 · Part: The Landscape

Chapter 3 — From Avoidance to Constraint to Unification

The simplest AI policy is "no AI at work." That policy almost never holds. Within months, staff are using AI anyway — on their personal devices, with personal accounts, on their own time that isn't really their own time, for work that is the organization's work. The policy didn't fail because staff are dishonest. It failed because the work has to get done, and AI is the fastest way to do parts of it. The leak happens; the organization just doesn't know about it yet.

The next-simplest policy is "personal AI is fine, just be careful." This is better than banning, but it pushes the entire governance burden onto individual staff. Some are diligent. Some aren't. Some have the judgment to know which information shouldn't go to a chat window; some learn that lesson by leaking. The organization has rules but no instruments to verify compliance.

The strongest policy is the one that makes personal AI mostly unnecessary — by providing one sanctioned system the organization controls. Staff have a tool they can use freely for work, with appropriate access, with the organization holding the contractual and operational levers. Personal AI use becomes the exception rather than the rule, and the exceptions get specific guidance.

This chapter argues for moving along that progression — from avoidance to constraint to unification — as the framework for organizational AI policy. Most of the rest of this manual is operational support for that progression.

Stance 1: Avoid

The instinctive policy when a new technology raises governance concerns is "we won't use it until we know it's safe." For AI, this means restricting access at the network level, prohibiting AI tools in policy, and waiting until standards mature.

This stance has three problems:

Shadow IT emerges immediately. Staff with smartphones can use AI on their own networks, on their own devices, without the organization knowing. Restriction at the perimeter is mostly theatrical when the perimeter has holes the size of every employee's pocket.

The work happens anyway. The same staff being asked not to use AI are being asked to deliver more work, faster. The temptation is real and the rationalization is easy: "I'm just using it to draft an email, not to handle real data."

The organization loses learning. Other organizations are figuring out what works and what doesn't, what's safe and what isn't, what governance actually costs. Avoiding AI isn't avoiding the issue; it's deferring the learning curve to a less convenient moment, with no organizational memory of how the technology actually behaves under operational stress.

Avoidance is a bridge stance. Many organizations land here briefly while figuring out the next move. Few stay long without consequences.

Stance 2: Constrain

The next move is acknowledging that staff will use AI and writing policy that channels that use safely. Personal AI is permitted; protected information is not entered into it; staff self-audit; incidents get reported.

This is what Chapter 7 covers. It's a real improvement on avoidance because the organization now has rules, can train against them, and can respond when things go wrong. But the stance has limits:

Enforcement is individual. The organization has policy, but the actual control surface is the employee's personal AI account. The organization can write the rules and train them; it cannot directly verify compliance. Trust does heavy lifting.

Visibility is poor. Without provider-side admin tools, the organization doesn't know what's been pasted into AI by whom or when. Self-audit is the only signal, and self-audit produces the signals operators choose to report.

Inconsistency persists. Different staff use different AI tools, with different default settings, with different account histories. Even with policy, the actual practices vary widely across the team. Some are using AI tools whose terms of service the organization has never reviewed.

Constraint works as a stance — many organizations will stay here for years, especially smaller ones without budget for enterprise AI deployments. But it's a partial answer. The next stance addresses what constraint can't.

Stance 3: Unify

The strongest stance is providing one sanctioned AI system the organization controls. Staff use the sanctioned tool for work. Personal AI accounts, if staff have them, are for personal things — not for organizational work. The boundary is enforced not by individual judgment but by the sanctioned tool being the path of least resistance for actual work.

What unification gives the organization:

A single contractual relationship. One Data Processing Agreement, one set of training opt-outs, one sub-processor disclosure, one set of terms negotiated. The contractual surface is bounded and reviewable.

Provisioning at the organizational level. Identity through enterprise SSO. Access scoped per role. New staff get accounts on day one with appropriate scope. Departing staff lose access through the same process that handles email and every other system.

Logging at the organizational level. Admin audit logs show who used the AI for what. Query logs show what data was accessed when AI was connected to internal systems. Both layers belong to the organization, not the individual user.

Consistent training and norms. All staff use the same tool, with the same patterns, with the same defaults. New hires can pick up institutional context faster because the AI's behavior is consistent across the team. The organization's accumulated AI know-how compounds rather than fragmenting.

An enforceable boundary. The sanctioned system is for work. Personal AI is for personal use. The line is operationally clear because the path of least resistance for work is the sanctioned tool — staff don't have to choose to comply, they just have to do their work normally.

What unification doesn't solve:

Cost. Enterprise AI deployments are not free. Per-seat licensing for serious tools runs into real money for organizations of any size. For small organizations, the unit economics may not work yet.

Capability fit. No single AI tool is best at every task. Staff who used a particular tool for a particular purpose may find the sanctioned tool worse at that purpose. The trade-off is real and worth naming during selection.

Vendor concentration. A unified system means dependence on a specific vendor. The exit plan from Chapter 8 becomes operationally load-bearing rather than theoretical.

Adoption timeline. Moving from constraint to unification is a months-long project: vendor selection, contract negotiation, identity integration, training rollout. It is not a memo.

The path most organizations should be on

Few organizations can move from avoidance to unification overnight. The realistic path is sequential:

  1. Acknowledge that avoidance is a bridge, not a destination
  2. Move to constraint — write personal-AI policy, train against it, set up reporting
  3. Begin enterprise selection while constraint is in effect — vendor evaluation, security review, pilot deployment
  4. Move to unification — provision broadly, train extensively, make the sanctioned tool the default for work
  5. Maintain constraint for the residual personal-AI exposure — some staff will use personal AI for legitimate edge cases; policy still applies

Each step is months of work, not weeks. Each step compounds — the constraint training built in step 2 makes step 4 easier. The vendor evaluation in step 3 makes the contractual review in step 4 faster.

The key is recognizing that each stance is a step on a path, not a final answer. Organizations that treat avoidance as the destination accumulate shadow risk. Organizations that treat constraint as the destination plateau short of what's possible. The goal is unification with constraint as the safety net for residual cases.

Why this argument matters for policy writers

Most current AI policy writes only for stance 2 — constrain personal AI. The implicit assumption is that staff will continue using personal AI for work, and the organization's job is to make that safer.

That assumption is fine as a starting point. It is wrong as an end point. Policy that doesn't articulate the path toward unification will leave the organization permanently in stance 2, with the limits stance 2 has.

Policy written for stance 3 looks different. It includes vendor selection criteria. It includes identity and access provisioning standards. It includes a transition plan for moving staff off personal AI as the sanctioned tool comes online. It treats personal-AI policy as the safety net, not the primary control.

The remainder of this manual works through both stances. Chapter 7 covers the constraint floor — what to do when staff are using personal AI. Chapter 8 covers the unification ceiling — what governance enterprise AI deployment requires. Chapter 11 (Sample Policy Language) provides templates for both.

The progression from one to the other is the strategic question every organization writing AI policy in the next several years will face. Naming it explicitly is the difference between a policy that makes the present safer and a policy that makes the future possible.

← Back to Manual
Chapter 05 · Part: Frameworks

Chapter 5 — Memory and the IP Boundary

AI memory is one of the more recent capabilities in workplace AI tools, and it creates a category of policy questions most current AI policy doesn't address. This chapter covers what memory is, why it matters for organizations specifically, and how to think about the IP boundary it creates.

A companion essay — Why Your AI Doesn't Remember You — and How to Fix It — covers the personal practice of memory from an individual operator's perspective. This chapter focuses on the organizational implications.

What AI memory is, briefly

Most AI tools default to forgetting everything between sessions. You log in, you chat, you log out, and the next session starts from zero. The AI does not remember who you are, what you're working on, or what corrections you've given it before.

Many AI tools now offer persistent memory: the ability to save context across sessions. The implementation varies — some tools auto-extract memories from conversation; some let you write memory files explicitly; some do both. The common thread is that memory is information about you, your work, your preferences, your past corrections, that the AI can read at the start of new sessions to act with continuity.

For individual operators, memory is a productivity multiplier: the AI gradually learns your voice, your project, your decisions. For organizations, memory is something more complicated.

Why memory matters in organizational contexts

Memory accumulates institutional knowledge. When an employee builds up an AI's memory over months of work, that memory captures things like "we decided to use approach X for this kind of question," "the standard format for our reports is Y," "this client is sensitive about Z." It's the kind of accumulated organizational know-how that traditionally lives in people's heads.

The fact that AI memory captures it has two implications:

Memory becomes an institutional asset. The accumulated memory in an AI tool is genuinely valuable to the organization. It contains patterns, decisions, working knowledge that took time to develop. If that memory disappears, the organization loses something.

Memory becomes an institutional risk. The same information that makes memory valuable also makes it a target. If the memory is stored at the AI provider's infrastructure, it lives outside the organization's perimeter. If it's stored locally, it lives on whatever device the employee uses. Either way, it's organizational knowledge in a place organizational policy didn't anticipate.

Both implications point to the same conclusion: AI memory needs to be governed like other organizational knowledge — with explicit ownership, retention, access, and exit policies.

The IP question of memory

Three concrete questions:

Who owns the memory? If an employee builds up an AI memory over the course of their work, the memory contains organizational context. Most employment IP-assignment clauses cover work product, not abstract organizational knowledge. The IP status of accumulated AI memory is genuinely unclear in most current contracts. Organizations writing AI policy in 2026 should think about whether existing IP language covers this case, or whether explicit clauses are needed.

What can safely go in memory? The same logic from Chapter 7's IP boundary applies: organizational data — names, specific situations, contracts, methodologies — should not be in AI memory at the personal-AI tier. At the enterprise tier (Chapter 8), the calculus changes because the organization controls the contractual stack with the provider. Memory hygiene at the personal tier is identical to memory hygiene around any other content: the rule is "what's safe to feed AI in conversation" applies to "what's safe to put in AI memory."

What survives staff transitions? When an employee leaves, their AI memory typically goes with them — because it's tied to their account, which gets de-provisioned. The organization loses whatever institutional knowledge accumulated in that memory. This is a new failure mode worth thinking about: organizational knowledge that used to live in handover documents and tribal knowledge transfer now sometimes lives in AI memory, where standard transition processes don't reach it.

Boundary practices

Operational practices for organizations writing AI memory policy:

Memory hygiene as routine. Treat AI memory like any other workspace: review periodically, prune what's stale, remove anything that crossed the IP boundary inadvertently. Self-audit cadence (Chapter 6) should include memory review, not just conversation history.

Explicit memory transfer for role changes. When an employee leaves a role, their AI memory contains organizational context. Don't lose it: have a documented practice of exporting key memory content, sanitizing for sensitive details, and making it available for the next person in the role.

Sanctioned-system memory at the enterprise tier. Once an organization moves to a unified AI deployment (Chapter 8), memory becomes manageable through admin tools — the organization can audit, export, and migrate memory across staff. Personal-AI memory cannot be governed this way; the organization has no admin access.

Don't put credentials, secrets, or hard prohibitions in memory. This sounds obvious; it isn't, in practice. Memory should hold preferences, patterns, decisions — not API keys, passwords, or anything that becomes a security exposure if memory is read by anyone other than the intended user.

Sample memory hygiene rules

For inclusion in personal-AI policy:

  • Memory should contain general preferences, working patterns, and decisions about your own work — not organizational data, client names, or specific situations
  • Review memory contents monthly; remove anything that crossed the IP boundary inadvertently
  • Do not store credentials, passwords, or secrets in AI memory under any circumstance
  • When leaving a role, export your accumulated AI memory, sanitize, and offer relevant patterns to your successor

For enterprise AI deployments, memory management is part of the broader governance covered in Chapter 8 — admin controls, audit logs, and access lifecycle apply to memory as they apply to other organization-controlled data.

Why this chapter is short

Most of memory governance flows from policies elsewhere in the manual: the IP boundary from Chapter 7, the audit practice from Chapter 6, the enterprise governance from Chapter 8. Memory is a new category of artifact, but the principles for governing it are the principles for governing anything else organizational. Adapt the existing frame; don't invent a new one.

The one place memory genuinely is a new category is the staff transition question — institutional knowledge captured in AI memory does not survive standard off-boarding by default. This is worth naming explicitly in policy, with an explicit transfer practice. It's the most consequential gap in standard organizational practice that AI memory creates.

← Back to Manual
Chapter 07 · Part: Operational Practice

Chapter 7 — Using AI at Work Without Leaking IP

The fear that keeps community-rooted organizations awake at night: someone on the team uses ChatGPT or Claude to help with real work, and in doing so feeds the model proprietary research, participant data, contractual language, or internal methodology. The model absorbs it, possibly echoes it, possibly trains on it. The IP boundary that took years to build dissolves in a few months of casual personal-AI usage.

This is a real concern. It is also addressable. This chapter walks through the actual mechanics of the boundary — what it is, where it sits, and how to maintain it as an individual using AI for work.

What "leaking IP via personal AI" actually means

Three distinct concerns hide under this umbrella. Conflating them produces bad policy:

Concern 1: Training data. When you paste content into a personal AI tool, some providers train on it. (Some don't; this is provider-specific and often configurable in account settings.) Trained-on content may surface in future outputs to other users — sometimes verbatim, sometimes abstracted. This is the classic "the model learned our IP" worry.

Concern 2: Logged data. Even when models don't train on inputs, providers typically log them for safety, abuse detection, and quality review. That log is a security perimeter your organization does not control. If the provider has a breach, your inputs were in the breach.

Concern 3: Operational visibility. When staff use personal AI tools, your organization loses visibility into what's been shared with whom. Management can't audit usage, can't enforce policy, can't measure exposure. The asymmetry between "what's happening" and "what we know is happening" widens.

These three concerns have different mitigations. Policy that addresses all three is policy that holds.

The fundamental boundary

The boundary is not "AI tools" versus "no AI tools." That is the wrong cut. The boundary is:

Information you own personally — your thinking, your professional development, your general questions about your field, the meta-shape of problems you face — can go to AI freely.

Information your organization owns — proprietary research, participant data, internal documents, methodology developed under your employment, anything covered by your IP assignment clause — does not go to personal AI. It can only go to AI tools your organization has approved with appropriate data handling agreements in place.

This boundary is the same one that existed before AI. You can talk to a friend about "I'm working through how organizations handle X" without naming clients or revealing methodology. You cannot paste a participant's intake form into a Slack DM with that friend. The only thing AI changes is volume and speed: before AI, breaching this boundary required deliberate action; with AI, it is two clicks and a paste — easy to do without noticing.

What you can safely feed personal AI

Reformulating these as positive permissions, individual operators can use personal AI for:

  • General questions about your field. "How do organizations like ours typically handle X?" "What is the standard approach to Y?" These do not expose your specifics; they sharpen your thinking.
  • Drafting personal communications and writing. Your own creative work, professional voice, blog posts on your professional development, talks you are preparing.
  • Skill development. "How do I learn X?" "Walk me through Y concept." Pure education questions.
  • Sanitized scenarios. When you genuinely need to think through a specific operational problem, you can describe the shape of the problem without the identity of the parties. "A small grantee organization is struggling with X" is acceptable; "Acme Foundation gave us a $50,000 grant for X" is not.
  • Public information. Anything that is already public — published research, government data, publicly available documents — can be discussed with AI freely.

The rule of thumb: if you would be comfortable saying it loudly at a conference table to mixed company including competitors, it is fine for personal AI.

What you must never feed personal AI

Hard prohibitions, default-deny posture:

  • Client, participant, or partner data. Names, identifying information, specific situations attributable to a real party.
  • Internal documents. Memos, board materials, financial statements, contracts, proposals, anything labeled or treated as confidential.
  • Proprietary methodology. The specific frameworks your organization has developed; the specific operational knowledge that distinguishes your work.
  • Research data prior to publication. Datasets, findings, drafts under embargo or pre-publication review.
  • Personnel information. Anything about colleagues that they did not volunteer publicly.
  • Vendor and contract terms. Pricing you have negotiated, terms specific to your relationships, anything covered by NDAs.
  • System credentials. Passwords, API keys, access tokens — for any reason, ever.

If you would flinch saying it at the conference table, do not say it to AI either.

The self-audit practice

Most accidental leaks happen because someone gets in flow, working through a real problem, and does not notice they have crossed the line. The single best individual mitigation is a brief weekly or biweekly self-audit:

  1. Open your AI tool's conversation history (most tools provide this).
  2. Skim the last two weeks of conversations.
  3. Flag anything that contains: a real name, a real number, a real document quote, a real situation description.
  4. For each flag, ask: was that information you owned, or information the organization owned?
  5. If organization-owned: note the leak, delete the conversation if the tool allows, request data deletion from the provider, and escalate to whoever handles data incidents at your organization.

Most cycles surface nothing. The few that surface something are exactly the moments policy needs to engage. The audit takes ten minutes. The not-auditing is what allows accumulation.

Contract considerations

Most employment agreements include IP assignment clauses giving the employer rights to anything created within the scope of work. This pre-dates AI but applies to AI-assisted work directly. Two specific implications:

For employees: Anything you produce in the course of your job, with or without AI assistance, is likely your employer's IP. Using AI to help you produce work does not change ownership; it means a third party (the AI provider) had access to inputs that became employer IP. That access is the policy question.

For employers: Standard IP assignment language likely already covers the question of who owns AI-assisted work. What it does not cover is the exposure — the question of which third parties have seen the inputs that fed into employer-owned outputs. New language is needed there. Sample language appears in Chapter 10.

Legal review on AI policy is not optional. The technology is new enough that standard contract templates have not caught up. If your organization does not have employment counsel familiar with AI questions, that is the first call to make.

What to do if you have already leaked

It happens. Most operators using AI have at least one moment of "wait — did I just paste that?" If it happens to you:

  1. Do not panic. Most leaks are limited in scope and impact.
  2. Stop the conversation immediately. Do not continue feeding the model context that compounds the leak.
  3. Use the provider's deletion tools if available. Most tools allow conversation deletion; some support formal data deletion requests.
  4. Document what was shared, when, with which tool, and at what level of identifying detail.
  5. Escalate to your organization's data incident process — even informally if no formal process exists.
  6. If the leak is significant (named parties, specific numbers, sensitive methodology): treat it as any other data incident, with legal, communications, and board notification per your organization's protocols.

The worst outcome is not that a leak happened. The worst outcome is that a leak happened and no one told anyone, and three months later it surfaces in a way that is harder to manage than a same-day report would have been.

The honest limits of self-policing

Self-audit and individual judgment are necessary but not sufficient. Three things they cannot fix:

Inconsistency across the organization. Different operators have different judgment about what counts as "my information" versus "the organization's information." Without clear policy, the line drifts — and the drift is invisible until something breaks.

Pressure under deadline. When someone is up against a hard deadline, the temptation to paste a real document into AI for a quick reformulation is real. Self-discipline holds up well under normal conditions and badly under stress.

The unknown unknowns. Operators may not realize a given category of information is sensitive until someone tells them. New hires and junior staff are particularly vulnerable here — not from carelessness but from incomplete onboarding into the organization's information classifications.

These limits are why policy at the organization level matters. Self-discipline closes most of the gap; policy closes the rest. Chapter 10 provides sample policy language an organization can adapt to its own context.

What this chapter does not cover

This chapter covers the individual operator's relationship with personal AI. It does not cover:

  • Approved enterprise AI deployments where the organization has data handling agreements with the provider
  • Sector-specific compliance requirements (HIPAA, FERPA, attorney-client privilege, grant-funded research data restrictions, etc.)
  • Cross-border data transfer rules
  • Vendor due-diligence frameworks for evaluating AI tools

Each of those topics requires its own treatment beyond the scope of this manual. The references in the back matter point toward starting places.

← Back to Manual
Chapter 08 · Part: Operational Practice

Chapter 8 — Enterprise AI: Governance for AI Inside Your Operations

Chapter 7 covered the personal-AI / organizational-IP boundary: rules for individuals using their own AI tools to do work without leaking organizational data. That covers one side of the AI question — the side most current policy focuses on.

There is a second side, increasingly important, that most policy doesn't yet address: AI deployed at the enterprise level, connected to organizational systems, with legitimate access to real operational data. This chapter is about governing that.

The shape is different. Personal-AI policy is mostly about individual discipline — what to feed AI, what not to. Enterprise-AI policy is mostly about organizational architecture — provisioning, access control, logging, off-boarding, vendor management. Different concerns. Different controls. Different failure modes.

What "enterprise AI" actually looks like

A concrete example. An operations leader at a community-rooted organization creates an AI account under their organizational email. They connect that AI to their organization's dashboards (so it can pull program metrics), to their website analytics (so it can summarize traffic patterns), and to their participant database (so it can answer "show me everyone who completed module 3 last quarter"). The AI sits inside the operational stack as a working tool. It sees real data on purpose, in service of real work.

That deployment can be appropriate — but only when it's governed. Without governance, the same deployment is a quiet exposure that compounds over months. With governance, it's a tool the organization can stand behind.

The governance problem is distinct from personal AI use in four ways:

  • The data exposure is intentional. Self-audit doesn't help; the AI is supposed to see the data. The question becomes "what's the appropriate scope, and how do we enforce it?"
  • Multiple people may use the same connection. Or one person leaves and someone else takes over. The connection has its own lifecycle, separate from any individual's lifecycle.
  • Access is durable. A connected AI keeps having access until someone explicitly revokes it. Default behavior is "stays on."
  • Audit becomes multi-layer. Knowing what the AI did requires logs at the provider, at the organizational systems being queried, and in the AI's own conversation history.

Each creates a governance question that personal-AI policy doesn't answer.

The seven governance questions for enterprise AI

These are the questions any organization should be able to answer for any AI deployment that touches real operational data.

1. Whose account is this?

A personal account with a work email is the worst of both worlds. The organization can't see it (no admin console, no audit logs), but the AI has access to organizational data through that account.

The right answer: the organization owns the account through enterprise SSO (single sign-on). When an employee logs in, the organization's identity provider authorizes them. When the employee leaves, the organization de-provisions them through the same identity system that handles everything else.

If the AI provider doesn't support enterprise SSO, that's a vendor-selection signal. Most serious enterprise AI tools support it; the ones that don't aren't ready for organizational deployment.

The "if I lose my email I lose access to the bot" pattern works as a backstop but isn't a primary control. Email-tied access can fail in either direction: the employee leaves and forgets to disable the AI, or the email gets compromised and now the AI is too. SSO removes both failure modes.

2. What data does it actually need access to?

The default for any tool is over-provisioning: give it access to everything because narrowing scope is painful. With AI especially, this default is dangerous, because AI can synthesize across data sources in ways no individual user could — the access surface is multiplied by the AI's ability to combine.

The right answer: least-privilege provisioning. The AI gets access to specific datasets needed for its specific use case, not the whole data layer. If the AI's job is "summarize program metrics," it doesn't need access to financial records. If its job is "help with content drafting," it doesn't need access to participant records at all.

This is where most organizations leak the most. The convenience of "give it everything" is real. The cost shows up later, usually after staff turnover, sometimes after a breach.

3. What contractual provisions are in place?

For any AI tool the organization deploys with access to real data, the standard contractual stack matters. Three to confirm before deployment:

  • Data Processing Agreement (DPA). What the provider can do with your data, where they store it, who else they share it with, what happens if they're breached.
  • Sub-processor disclosure. AI providers almost always rely on sub-processors (cloud infrastructure, model hosting, third-party tooling). The DPA should disclose these and require notification before changes.
  • Terms of service review for training rights. Does the provider train on your inputs by default? Can you opt out? Some providers offer enterprise tiers where training is contractually disabled. Confirm in writing, not in marketing copy.

For sectors with their own regulatory regimes (HIPAA, FERPA, GDPR, sector-specific funder requirements), the contractual stack is heavier. Get specialized review.

4. What logging is enabled?

You cannot govern what you cannot see. For enterprise AI, logging happens at three layers:

  • Provider-side audit logs. Most enterprise AI tools offer admin audit logs showing who accessed what, when, with what query. Confirm these are enabled and retained for at least the period your incident response process requires.
  • Org-side query logs. If your AI is connecting to organizational systems via API, those systems should log every query the AI makes. This is independent of provider logging and gives you a second source of truth.
  • Conversation history. Conversations between users and the AI may contain sensitive data; retention and deletion policies should be explicit per use case.

Logging without retention policy is theater. Define how long each layer retains, who can access the logs, and under what circumstances they're reviewed.

5. How are access changes handled?

Two scenarios that go wrong without policy:

Employee leaves. Their access to operational AI gets disabled. But what about AI memory containing context they fed the system? What about conversation history they generated? What about API keys they may have provisioned? What about any custom configuration tied to their account? The off-boarding checklist needs to cover all of these, not just the login.

Role changes within the organization. Someone moves from a participant-facing role to a research role. Their AI access should change because the appropriate scope changed. Without explicit review, it doesn't.

The right pattern: AI access is reviewed every time HR records a role change, on the same trigger as other access reviews. AI is treated like any other system the role does or doesn't need.

6. What's the incident response plan?

If the AI provider is breached, what happens? If the AI returns clearly incorrect output that someone acts on, what happens? If a staff member discovers the AI has been making decisions it shouldn't, what happens?

The incident response plan for enterprise AI typically forks into three branches:

  • Provider-side incident (their breach): notification timeline, your assessment of exposure, participant or partner notification per your standard data incident playbook.
  • AI output incident (AI was wrong, an action was taken based on its output): rollback procedure, review of the decision chain, update to policy or controls if the failure mode is systemic.
  • Use-case incident (AI was used for something it shouldn't be, or with data it shouldn't see): investigation, scope assessment, policy update, possible disciplinary action depending on intent.

Have these branches written down before you need them. The first time an organization tries to invent its AI incident playbook is during the incident, which is too late.

7. What's the exit plan?

This is the question most organizations skip. AI providers can change terms, raise prices significantly, get acquired, or shut down. If your operations come to depend on a specific AI deployment, what's your plan if that deployment goes away?

Concrete questions:

  • Can you export conversation history and accumulated memory before access is cut?
  • Can you export any customization, fine-tuning, or training you've invested in?
  • How long would migration to an alternative provider take?
  • What operational continuity exists if the AI is suddenly unavailable for a week?

The exit plan doesn't need to be complicated. It needs to exist before the exit becomes necessary.

A specific note on AI memory and continuity

AI tools that maintain memory across sessions create a particular governance question. The memory contains organizational context — preferences, patterns, accumulated learning about how your organization works. That memory is a form of institutional knowledge.

Two implications:

Off-boarding. When the employee who built up the AI's memory leaves, the memory often goes with them. Their AI account is disabled; the AI's accumulated knowledge of the organization disappears. Institutional knowledge that AI helped capture becomes inaccessible at exactly the moment the organization is least equipped to rebuild it.

Inheritance. When a new employee takes over a role that used AI, they inherit a tool that doesn't know them. Pattern that works: explicit memory transfer. Document what the AI knew, what context it was operating with, so the next employee can rebuild that context — or migrate it explicitly if the tool supports it.

This is a new category of organizational knowledge management. It hasn't been handled before because AI memory is new at this scale. Worth getting ahead of before it becomes a crisis at the first major staff transition.

What enterprise AI policy should cover, at minimum

Synthesizing the above, an enterprise AI deployment should be covered by policy that addresses:

  1. Identity and access (SSO-backed, role-based provisioning)
  2. Least-privilege data scope per deployment
  3. Contractual provisions (DPA, sub-processor disclosure, training opt-out)
  4. Multi-layer logging and retention
  5. Off-boarding and role-change procedures
  6. Incident response plan (provider, output, use-case branches)
  7. Exit plan (data export, migration, operational continuity)

Sample policy language for each is referenced in Chapter 10. The policy templates there cover the personal-AI side comprehensively; the enterprise-AI side often requires deployment-specific contracts and controls beyond what general policy can capture. Treat the templates as the floor.

Why this matters more than personal AI

If an organization has to choose where to invest policy effort first — personal AI or enterprise AI — the enterprise side has higher stakes for most. Three reasons:

Volume. Personal AI usage by individual employees is bounded by what they can manually paste into a chat window. Enterprise AI usage is bounded by what the API can pull — often the entire database.

Durability. Personal AI conversations exist for a session. Enterprise AI deployments exist for years, accumulating access, capability, and dependence.

Asymmetry of failure. A personal AI leak by a single employee is contained. An enterprise AI deployment failing — through breach, misconfiguration, or vendor instability — affects everyone the organization touches.

This is the ground where most community-rooted organizations will need to take a stance over the next several years. The decisions made here outlast individual staff and shape what the organization can and cannot do operationally.

What this chapter does not cover

Specific technical implementation: how to configure SSO with a particular provider, how to scope database access via API tokens, how to set up audit logging in a specific stack. Those are infrastructure projects, not policy questions, and they vary too much by tool to generalize usefully.

Sector-specific compliance overlays (HIPAA, FERPA, etc.) on enterprise AI deployments. These add requirements beyond general governance — covered briefly in Chapter 13.

Vendor selection criteria for enterprise AI tools. A separate question worthy of its own treatment, but beyond a single chapter's scope.

← Back to Manual
Chapter 09 · Part: Operational Practice

Chapter 9 — AI for Community-Rooted Organizations: Quiet Wins, Real Risks

This chapter is the most context-specific in the manual. It covers what AI adoption looks like inside community-rooted organizations — where the dynamics differ in ways most generic AI guidance doesn't address.

Three things make community-rooted contexts specific: community trust is the primary asset, resource constraints are real, and sector reputation matters in ways for-profit contexts don't carry. AI adoption that doesn't account for these doesn't fit.

What's specific about community-rooted adoption

Community trust as primary asset. A community-rooted organization's value depends on participants believing the program will treat them fairly. A community foundation's value depends on donors believing the organization spends well and beneficiaries believing the organization shows up. AI adoption that erodes that trust — even slightly, even through perception alone — costs more than the productivity it generates. The optics fear from Chapter 2 isn't paranoid; it's calibrated to a real asset.

Resource constraints. Community-rooted organizations typically run on tight budgets. The economic calculus around AI tools is different than at a tech-funded startup. The license cost of a single enterprise AI tool can equal a real percentage of a small organization's discretionary budget. Adoption decisions have to clear bars that for-profit decisions don't.

Sector reputation. Reputational risk doesn't just affect the organization that takes a misstep — it affects the sector. A community-rooted organization adopting AI poorly creates pressure on every other organization in the sector. Community-rooted organizations have an interest in sector-wide thoughtful adoption, which means adopting in ways that are defensible across the field, not just defensible internally.

Where AI changes community-rooted work, in practice

Without naming specifics, here's what tends to happen when an executive at a community-rooted organization actually starts using AI for real work:

Content drafting accelerates substantially. Reports, memos, board materials, donor communications, internal documentation — anything with a draft cycle can be drafted faster with AI assistance. The savings compound over months. Most community-rooted organizations have content workloads that exceed staff capacity; AI relieves that pressure when used carefully.

Summarization handles things that wouldn't otherwise get done. Meeting transcripts that nobody reads. Research reports that pile up unsynthesized. Stakeholder feedback that arrives faster than anyone can process it. AI summarization makes the unread useful, when summaries are reviewed for accuracy.

Scenario planning becomes feasible. "What if we had to cut programs by 20%?" "What does our budget look like if this funder leaves?" Modeling these scenarios used to require dedicated staff time. AI makes them quick enough to do regularly. Better decision-making follows from cheaper modeling.

Documentation gets unstuck. Most community-rooted organizations have documentation debt — knowledge that lives in people's heads but never makes it onto paper. AI helps interview and document, lowering the friction enough that knowledge transfer actually happens.

These are the wins. They're real. They're also unobvious from outside — which is why the optics question matters: thoughtful adoption produces real results, but if the only thing visible is the AI itself, the perception problem dominates.

The risks, named in context

Each generic AI risk has a sharper edge in community-rooted settings:

Hallucination in high-stakes contexts. An AI-drafted donor letter that contains a confident statement about programmatic outcomes that isn't quite right reaches a sophisticated reader and damages credibility. Community-rooted communications operate in a context where "almost right" reads as "wrong."

IP exposure of beneficiary data. The people your organization serves — participants, patients, students, clients — the data community-rooted organizations hold is often more sensitive than commercial data. The IP boundary matters more, the stakes of a breach are higher, and the regulatory framework is often stricter.

Operational dependence on a single vendor. A community-rooted organization that becomes dependent on a specific AI tool, then experiences a vendor change, faces operational disruption it has fewer resources to absorb than a commercial organization would. The exit plan from Chapter 8 isn't a nicety; it's a continuity question.

Staff transitions amplifying loss. Community-rooted organizations often have higher turnover than is typical, often because they pay less than commercial organizations. AI memory and accumulated organizational knowledge in AI tools is at higher risk of being lost in transitions, simply because transitions happen more often.

What hasn't worked

Honest accounting from observed practice:

Top-down rollouts that announce a vision. Community-rooted culture is usually skeptical of vision-language. Adoption rollouts that lead with "we are an AI-forward organization" tend to provoke quiet pushback that compounds over months.

Pilot programs without success criteria. As Chapter 12 notes for any organization, pilots without explicit success criteria become permanent. In community-rooted contexts where staff are already stretched, this creates resentment more than learning.

Tool selection driven by cost alone. The cheapest AI tool is rarely the most appropriate. Community-rooted organizations that select on price often end up with tools that lack the governance features Chapter 8 calls out as load-bearing.

Adoption announcements before adoption happens. Telling stakeholders the organization is "now using AI" before there's anything operational to point to creates expectations the actual rollout struggles to meet.

What works better is the inverse of each: bottom-up adoption stories rather than top-down vision; pilots with explicit success criteria; tool selection that weighs governance features alongside cost; communication about adoption that follows actual results, not precedes them.

The optics question revisited

The way to address Fear 3 (public optics) isn't to hide AI adoption. It's to adopt in a way that, when seen, reads as thoughtful and operational rather than enthusiastic and vision-driven.

Specific tactics:

Lead with operational language. "We adopted X tool to reduce time on Y task by Z%" reads better than "AI is transforming our organization." Specific, measurable, modest claims.

Be honest about limits. "AI is a productivity tool we use carefully under policy" reads better than "AI is the future." Hedged language is appropriate when the underlying truth is hedged.

Show governance. Public reference to your AI policy demonstrates you're approaching this thoughtfully. The policy itself is the artifact that earns credibility.

Don't lead with AI in branding or fundraising. Community-rooted funders, donors, and stakeholders generally respond better to "we run efficient operations" than "we use AI." Let the operational story do the work; the AI is the mechanism, not the headline.

Implications for guideline writers

If you're writing AI policy for a community-rooted organization, the differences from generic guidance are:

  • Add explicit consideration of beneficiary, participant, or student data classification — it's almost certainly your most sensitive category and may be subject to sector-specific compliance requirements
  • Build in a public-communications component: when, how, by whom does the organization say something publicly about its AI use
  • Treat the cost question as material; don't budget for AI as if cost were unlimited
  • Plan for higher turnover and explicit knowledge transfer practices around AI memory (Chapter 5)
  • Set the tone of the policy to match sector culture: sober, thorough, modest in claims, generous with limits

The full sample language in Chapter 11 is general-purpose; community-rooted organizations will want to add sector-specific clauses to several sections. The framework remains the same; the specifics adapt.

A closing observation

The community-rooted organizations tends to underestimate its strength in this conversation. AI vendors are mostly built around commercial use cases; the assumptions baked into vendor pitches don't always fit community-rooted contexts. That mismatch is frustrating in the short term and an advantage in the longer term: organizations that figure out thoughtful AI adoption in community-rooted contexts are figuring out something the broader AI ecosystem hasn't yet, and the resulting practices will travel.

If your organization gets this right, the work product will be useful to others in your sector. The Build Bible-style sharing of operational rules, the policy templates, the audit practices — share them. The sector compounds when practitioners share what works.

← Back to Manual
Chapter 11 · Part: Adoption

Chapter 10 — Sample Policy Language

The chapter I would have wanted when I started thinking about this. Templates an organization can adapt for its own AI policy, with notes on what each section does and where the choices are.

What appears here is not legal language. It is working policy language — the kind that sits in a one-page document staff actually read. Run anything you adapt past employment counsel before adoption; these templates do not substitute for legal review. They substitute for staring at a blank page.

The sections below cover the most common policy questions in the order most organizations encounter them. Each section includes sample language plus notes on the choice points.

1. Acceptable use

Sample language:

Staff may use AI tools (including but not limited to large language model services such as ChatGPT, Claude, Gemini, and similar conversational AI products) for work that does not require the input of organizational data, participant or client information, or proprietary methodology. Permitted uses include drafting personal communications, general professional development questions, sanitized scenario planning, and skill development. Staff may also use AI tools for any task that does not require revealing information protected under Section 2 of this policy.

Notes on the choices:

The phrasing "AI tools" is intentionally broad. Naming specific products dates the policy quickly; AI tools change names, get acquired, and proliferate. Staff who follow the spirit will figure out which tools fit. Staff looking for loopholes will find them in any specific list anyway.

The phrasing "permitted uses include" rather than "permitted uses are" leaves room for novel cases. A blanket "everything not listed is forbidden" approach handles edge cases poorly and creates incentives for shadow IT.

The reference to "Section 2" is the policy's hinge. Section 2 is where you list what is protected. The acceptable-use language only works if the protection language is sharp.

2. Protected information

Sample language:

The following categories of information must not be entered into any AI tool that is not approved by [Organization] for that specific purpose: client or participant identifying information; internal documents (memos, board materials, financial statements, contracts, proposals); proprietary methodology and frameworks developed by or for [Organization]; pre-publication research data and findings under embargo; personnel records; vendor or grant contract terms; financial information not yet publicly disclosed; and any information designated confidential under our standard NDAs or under contracts with our funders, partners, or clients.

Notes:

This list is a starting point. Customize based on your sector. Different organizations will care about participant data; a healthcare organization will care about protected health information; a research organization will care about pre-publication findings; a foundation will care about grantee data. Make the list specific enough to be operational, short enough to be remembered.

The phrasing "AI tool that is not approved by [Organization] for that specific purpose" leaves room for enterprise deployments. If you have signed a Business Associate Agreement with an AI provider for clinical work, that tool is approved for that purpose. Other purposes still default to forbidden.

3. Approved tools and approval process

Sample language:

[Organization] maintains a list of AI tools approved for specific operational purposes, with corresponding data handling agreements in place. The list is reviewed quarterly by [role or committee] and updated as needed. Staff seeking to use an AI tool for a purpose not currently approved may submit a request to [role] outlining the proposed use, the data involved, and the business justification. Requests are reviewed within ten business days.

Notes:

The "list of approved tools" creates an actual artifact someone has to maintain. Without that artifact, the rest of the policy is aspirational. Resource the list-maintenance role explicitly — name the person or function responsible, with backup coverage.

The quarterly review cadence is opinionated. AI tools and providers change faster than annual review can keep up with, and slower than monthly review can sustain. Quarterly is the practical floor.

The request process gives staff a path forward when they have a legitimate need. Without it, the policy creates frustration and shadow IT. Make the path real, not theatrical: ten business days is long enough for genuine review, short enough to feel responsive.

4. Self-audit

Sample language:

Staff using AI tools are expected to conduct a brief self-audit of their AI usage history at minimum every two weeks. The audit involves reviewing recent conversations for potentially protected information, deleting conversations containing such information from the AI tool's history, and reporting any incident of significant disclosure to [role] under Section 8 (Incident Response). The self-audit is the staff member's responsibility; [Organization] does not surveil personal AI accounts.

Notes:

Two weeks is a balance between "frequent enough to catch issues" and "not so frequent it becomes ignored ritual." Some organizations use weekly; some use monthly. Two weeks is the policy default to start; revise based on observed compliance.

The phrasing "potentially protected information" is deliberately broad. Staff are not expected to be perfect; they are expected to look. The point of the audit is the looking, not the certainty.

The "[Organization] does not surveil personal AI accounts" clause is important. Without it, staff may assume covert monitoring and either resent it or attempt to evade it. Naming the boundary plainly improves the self-audit's chance of being conducted honestly.

5. Output review

Sample language:

Output produced by AI tools is the responsibility of the staff member who used the tool. Before any AI-generated output is shared externally, included in deliverables, or applied to operational decisions, it must be reviewed for accuracy, appropriateness, and absence of fabricated information (sometimes called "hallucinations"). Particular attention must be paid to citations, statistics, quoted material, and any factual claims that could affect downstream decisions. Staff are accountable for AI-generated work as if they had produced it themselves.

Notes:

The "responsibility of the staff member" framing prevents the "AI told me to do it" defense from emerging. AI is a tool; the human using it is the operator.

The "fabricated information" clause names the hallucination problem in plain language. Naming it directly tells staff what to watch for. The specific call-out of citations and statistics matters because those are the categories most likely to be silently wrong.

6. IP ownership of AI-assisted work

Sample language:

AI-assisted work produced within the scope of [Organization] employment is owned by [Organization] under the same terms as any other work produced by the employee. Staff should not assume that AI-assisted contributions are exempt from IP assignment. The use of AI tools does not transfer ownership of resulting work to the AI provider, nor does it remove the work from the employee's standard IP assignment to [Organization]. Employees remain responsible for ensuring that AI-assisted work does not infringe third-party rights.

Notes:

This clause closes a loophole that does not exist legally but that staff sometimes assume into existence: "AI helped me make this so it is not really my work." Naming the assumption directly prevents it.

The third-party rights clause is increasingly important. AI tools can produce output that resembles or reproduces copyrighted material the model trained on. Staff using AI for content production need to know that derivative work is their responsibility to vet.

Run this section past counsel; the legal analysis varies by jurisdiction and by the specific AI provider's terms of service.

7. Training and onboarding

Sample language:

All staff with access to organizational data are required to complete AI usage training within thirty days of hire and annually thereafter. Training covers acceptable use, protected information, the self-audit process, output review, and incident reporting. Training materials are maintained by [role] and updated as policy and tools evolve. Completion is tracked and reported to [role] quarterly.

Notes:

The 30-day window is achievable. Longer windows mean new staff start using AI before they understand the rules.

The annual cadence catches policy and tool changes. AI policy that does not get retrained quickly drifts out of staff awareness, especially as the underlying technology shifts.

Tracking and quarterly reporting create accountability. Without it, "training is required" becomes "training is suggested."

8. Incident response

Sample language:

If a staff member becomes aware that protected information has been shared with an AI tool not approved for that purpose, the incident must be reported to [role] within 24 hours of discovery. Reports made in good faith will not result in disciplinary action; the goal is to manage the incident, not to penalize the staff member who surfaced it. Repeated or willful violations are subject to standard disciplinary processes. [Role] is responsible for evaluating each incident, taking remediation steps (data deletion requests, provider notification, internal communication), and tracking incident frequency for quarterly review.

Notes:

The "good faith" clause is essential. Without it, staff will conceal incidents, which is the actual catastrophic outcome. Make the path of least resistance the right one.

The 24-hour window is policy-grade specific without being unrealistic. Some organizations prefer "promptly" or "as soon as practicable"; the trade-off is enforceability. Specific is better.

The "tracking incident frequency for quarterly review" clause connects incident response back to policy review. If incidents cluster around certain tools or certain workflows, the policy needs to engage upstream of individual incidents.

9. Review and revision

Sample language:

This policy is reviewed at minimum annually and updated as the AI landscape, organizational needs, and regulatory environment evolve. The review is conducted by [role or committee] with input from [stakeholders, e.g., staff representatives, legal counsel, IT, program leadership]. Substantive changes are communicated to all staff and incorporated into subsequent training cycles. Significant external events — major regulatory changes, major provider changes, significant internal incidents — may trigger interim revisions.

Notes:

The annual review is the floor, not the ceiling. The interim-revision clause is the safety valve for events that cannot wait for the annual cycle.

Naming who conducts the review matters. Without that role, the review never happens. "The policy committee" is acceptable if the committee actually exists and meets.

What this template set does not cover

These templates are general-purpose and intended to be adapted, not adopted directly. They do not cover:

  • Sector-specific compliance (HIPAA, FERPA, GDPR, attorney-client privilege, grant-funder data restrictions, etc. — each requires its own legal review)
  • Cross-border data transfer rules
  • Specific clauses for funded research environments
  • Union or collective-bargaining considerations
  • Vendor due-diligence frameworks for evaluating AI tools
  • AI-generated decision-making in high-stakes contexts (hiring, benefits eligibility, disciplinary action — these need separate, sector-specific frameworks)

Each topic deserves its own legal and operational review beyond this manual's scope. If your organization is going to deploy AI in any of these areas, the template language above is a starting point for your acceptable-use policy, not a substitute for the deeper review those areas require.


Adapt freely. Cite or don't. The point is that you don't start from a blank page.

← Back to Manual
Chapter 12 · Part: Adoption

Chapter 12 — Rolling It Out: Starting Small, Scaling Honestly

Most enterprise AI rollouts fail in a specific way. The organization buys the tool, sets up the licenses, sends a launch email, and three months later usage is concentrated in a small handful of staff who would have figured AI out anyway. The tool is paid for; the transformation didn't happen.

The failure mode isn't the tool. It's the rollout. AI tools are unlike most software in a critical way: their value depends on the user developing fluency through repeated practice, not on completing a workflow. A staff member who logs into the AI once doesn't get the value of staff who use it daily. Adoption gradients matter more than license counts.

This chapter covers what works when rolling out an enterprise AI deployment to an organization that hasn't done it before. It assumes you've decided to move from constraint (Chapter 7) toward unification (Chapter 8) — the path argued in Chapter 3 — and need a sequence for getting there.

The phased rollout pattern

Effective rollouts move through four phases:

  1. Pilot — small group, real work, clear success criteria, fixed end date for review
  2. Expansion — broaden access based on what the pilot taught
  3. Default — sanctioned tool becomes the path of least resistance for work
  4. Maintenance — sustained use, ongoing review, policy iteration

Most failed rollouts skip from purchase to phase 3 (defaulting everyone in) without doing 1 or 2. The result is staff who don't know how to use the tool well, no organizational learning about what works, and a vendor relationship the organization can't honestly evaluate.

The four phases typically take 6–12 months for a small organization, longer for larger ones. Compress this timeline at your peril.

Phase 1: Pilot scoping

A good pilot answers three questions before it starts.

Who is in it? Pilot participants should be a small group (5–15 people) chosen for two qualities: their work has clear AI-applicable use cases, and they are willing to learn out loud. Avoid choosing people whose support is critical to the rollout — they will give you politically useful feedback rather than honest feedback.

What use cases? Name 3–5 specific work tasks the pilot will explore: drafting board materials, summarizing intake forms, content for participant-facing materials, research synthesis, etc. Vague pilots ("see what people do with it") produce vague learnings.

What does success look like? The most important question and the one most often skipped. Define success criteria before the pilot starts. Examples that work: "70% of pilot participants report significant time savings on at least one named task," or "we have identified at least three use cases where AI-assisted output meets our quality bar." Examples that do not: "general satisfaction" or "engagement metrics."

The pilot should have a fixed end date — typically 60–90 days. At end date, you decide: expand, retool, or stop. Open-ended pilots become permanent purgatory.

Phase 2: Vendor evaluation

If you did not choose your AI tool through structured evaluation before the pilot, the pilot is also your vendor evaluation. Either way, this section names what to actually evaluate. The criteria fall into five categories.

Identity and access integration

Does the tool support enterprise SSO with your existing identity provider? SCIM provisioning for automated user lifecycle? Role-based access control? If the answer to any of these is "not yet" or "on the roadmap," the tool is not ready for organizational deployment regardless of how good the AI itself is.

Data handling and the contractual stack

Does the provider train on customer data by default? Can you opt out contractually? Is there a Data Processing Agreement available? Sub-processor disclosures? For sectors with regulatory frameworks (FERPA for education, HIPAA for healthcare, sector-specific funder requirements for grant-funded work), is there a sector-specific compliance pathway?

Audit and observability

Are admin audit logs available? Conversation history retention controls? Per-user usage analytics? Spend controls and limits? The organization needs to be able to answer "what is happening on this platform" without depending on the vendor's word.

Capability fit for your use cases

Different tools are better at different things. If your pilot use cases involve long-form writing, evaluate that. If they involve structured analysis or code, evaluate that. Generic capability rankings ("X is the best AI") are worth less than your specific evaluation against your specific use cases. Evaluate against your real work, not against benchmarks.

Cost shape

Per-seat pricing? Token-based? Bundled with existing licenses you already own? Cost scaling — what does this look like at 10x the current pilot? Hidden costs (training, integration, support)? Total cost of ownership over a 24-month horizon, not just first-year. Most vendors quote first-year prices that look reasonable; second-year often differs.

The evaluation should produce a written record. Not for compliance theater — for the moment 18 months from now when someone asks "why did we choose this vendor?" and the organization needs to remember.

Phase 3: Default — the transition that matters most

The hardest phase is not pilot. It is the transition from "we have a tool some people use" to "this tool is the default for work."

What goes wrong here:

  • Staff continue using personal AI for work because the sanctioned tool does not quite cover their use case
  • The sanctioned tool becomes "the official tool" but not "the actual tool"
  • Personal-AI policy from Chapter 7 stays load-bearing because shadow usage continues

What works:

  • Make the sanctioned tool genuinely better for actual work — better defaults, integrated context, easier prompts for common tasks, organizationally curated examples
  • Provide active support during transition — office hours, prompt libraries, peer support from pilot participants
  • Communicate explicitly: "our personal-AI policy still applies, AND our expectation is that work tasks now go through the sanctioned tool"
  • Track which use cases still go to personal AI and ask why — sometimes the sanctioned tool needs better configuration; sometimes the use case is genuinely personal

This phase typically takes 60–120 days. Expect resistance from people who built effective workflows with their personal AI accounts; ease their migration rather than mandating it.

Phase 4: Maintenance — what holds it together

After the rollout settles, the organization is in steady state. Three things matter:

Ongoing training. New hires need AI training in onboarding. Existing staff need refreshers as the tool's capabilities evolve. Build training into the standard onboarding and annual review cycles.

Quarterly governance review. The seven enterprise governance questions from Chapter 8 (identity, scope, contracts, logging, access changes, incident response, exit plan) should be reviewed quarterly. Not because they change often — because they drift, and quarterly is the cadence that catches drift before it becomes a problem.

Vendor performance review. Annually, evaluate whether the vendor is still the right one. AI providers consolidate, change pricing, change terms. The exit plan from Chapter 8 should be tested annually — not invoked, just tested. Confirm you can export data, confirm migration paths exist, confirm operational continuity.

When to expand, when to stop

Three signals it is working and you should expand:

  • Pilot participants are using the tool more, not less, after the novelty wears off
  • Use cases are emerging that were not in the original pilot scope (organic discovery is a strong signal)
  • Quality of work in observable areas is improving — meeting summaries are more useful, drafts need fewer rounds, research is more thorough

Three signals it is not working and you should stop or retool:

  • Adoption is concentrated in a tiny minority while the majority remain disengaged
  • Quality of AI-assisted output is creating more review burden than time saved
  • Incidents are clustering — repeated leaks, output errors, governance failures

The decision to stop or retool is hard because someone advocated for the tool and may take it personally. Treat the decision as evidence-based, not personal. Most rollout failures are visible months before they are acknowledged.

The transition from constraint to unification

This entire chapter is the operational mechanism of the philosophy in Chapter 3 — moving from constraint (personal AI with rules) to unification (one sanctioned system). The transition is real work over real months. It requires sustained organizational attention, not just a launch announcement.

The reward is real: an organization that has actually moved through this transition has the most durable AI governance posture available. They have policy. They have a tool. They have the operational practice of using both well. Future AI changes — new capabilities, new providers, new regulations — happen against a stable foundation rather than scrambling each time.

The cost is real too. This is not a quick implementation. Plan accordingly.

What this chapter does not cover

Specific implementation guides for specific products. Pricing negotiations with specific vendors. Sector-specific compliance frameworks. The technical work of integrating SSO with your specific identity provider. Each of these is real but tool-specific and dates fast — placing it in a manual would shorten the manual's useful life.

For a practitioner perspective on specific tool choices — including one operator's reasoning about why they chose a specific tool over alternatives, what they actually use it for, and what its limits are — see the companion Insights essay "Why I Use What I Use: An Operator's AI Stack" in the broader Serranix archive. That kind of specific-product content is intentionally outside the manual proper, because the manual is meant to outlast specific product cycles. The essay can be updated as the landscape shifts; the manual stays focused on the framework that does not.

← Back to Manual
Chapter 13 · Part: Adoption

Chapter 13 — What This Manual Doesn't Cover

Honest closing chapter. This manual provides a foundation; it doesn't pretend to be complete. Several categories of question are outside its scope by design. Knowing what's outside the scope helps you know when to bring in additional expertise.

Sector-specific compliance

The manual treats compliance broadly. It doesn't cover the specific regulatory regimes that apply to particular sectors:

  • Healthcare: HIPAA, HITECH, state-level health information regulations
  • Education: FERPA, state-level education privacy laws, COPPA for under-13 contexts
  • Financial services: SEC, FINRA, state banking regulations, fair lending considerations
  • Federal contracting: FedRAMP, NIST frameworks, sector-specific FAR clauses
  • Workforce funding: funder-specific data restrictions, often varying by program
  • International: GDPR, jurisdiction-specific privacy regimes, cross-border data transfer rules

Each sector has its own rules about how AI can be used with regulated data. The sample policy in Chapter 11 is general-purpose; for sector-specific work, bring counsel familiar with the specific regulatory regime. AI policy that doesn't account for sector compliance can create more legal exposure than no policy at all.

Vendor evaluation deep dives

Chapter 12 provides an evaluation framework. It does not provide:

  • Specific feature comparisons between current AI vendors
  • Pricing negotiation guidance for specific contracts
  • Technical implementation details for specific identity providers, SSO configurations, or API integrations
  • Performance benchmarks against specific use cases
  • Migration guides between specific tools

These are tool-specific and date fast. The companion Insights essay Why I Use What I Use provides one practitioner's vendor stack as of mid-2026, with the explicit caveat that the specifics will be out of date within months. For current vendor evaluation, you'll need to do your own work against the framework.

Cross-border data considerations

If your organization operates across borders, AI policy needs to address data residency and transfer rules that this manual doesn't cover. The default assumption is single-jurisdiction operation. Multi-jurisdiction adoption is harder and benefits from specialist counsel.

Union and collective-bargaining considerations

In organizations with collective bargaining agreements, AI tools may intersect with employee rights, monitoring policies, and bargaining-unit work definitions. None of this is covered here. If you're in a unionized environment, AI policy intersects with labor relations in ways that need explicit attention with the relevant bargaining unit.

High-stakes AI decision-making

This manual covers AI as a productivity tool — assistance with drafting, summarization, analysis, scenario planning. It does not cover:

  • AI in hiring decisions (résumé screening, candidate evaluation, interview analysis)
  • AI in benefits eligibility determinations
  • AI in disciplinary or performance decisions
  • AI in any decision that materially affects an individual's rights, livelihood, or access to services

Each of these requires its own framework, often with explicit regulatory constraints (the EU AI Act, the Colorado AI Act effective February 2026, sector-specific rules). AI used in decision-making about people requires impact assessments and human oversight beyond what this manual addresses.

If your organization is considering AI in any of these areas, treat this manual as foundation only. Specialized frameworks exist; bring them in.

Autonomous AI agents

Most of this manual assumes AI as a chat-based assistant — the human asks, the AI responds, the human reviews. AI agents that act autonomously — booking appointments, sending emails, taking actions in connected systems — operate under different governance assumptions:

  • Authorization scope: what the agent can do without human approval
  • Audit trail: logging every action the agent takes, in a form the organization can review
  • Reversibility: the ability to undo agent actions if they're wrong
  • Liability: who's responsible when the agent acts incorrectly

These are increasingly relevant as AI agent capabilities deepen. They're outside this manual's scope and will likely require their own treatment in a future revision.

What this manual is dated to

This manual was written in May 2026. AI capabilities, vendors, regulatory environments, and best practices are all changing fast enough that any manual will date in months, not years.

Treat the framework as more durable than the specifics. The argument from avoidance through constraint to unification (Chapter 3) will hold longer than any specific recommendation about specific vendors. The seven enterprise governance questions (Chapter 8) will hold longer than any specific configuration guidance. Sample policy language (Chapter 11) will need refresh as regulatory frameworks evolve.

If you find yourself reading this manual a year from now and finding it dated, please write a better one. The work this manual is trying to do — give a thoughtful operator a framework for thinking about AI policy — needs to keep being done as the landscape shifts.

What's next

If you've read the whole manual and want to act on it:

  1. Start with personal-AI policy (Chapter 7) if you don't have one yet. It's the floor.
  2. Begin enterprise AI evaluation (Chapter 8 + Chapter 12) if you're in a position to make that decision.
  3. Adapt sample language (Chapter 11) to your specific situation, with legal review.
  4. Plan for the rollout (Chapter 12) as a months-long effort, not a single decision.

If you have feedback, corrections, or improvements: this manual is meant to evolve. The current draft is a starting point, not a finished artifact. The work is the same — keep figuring out what good AI policy looks like, and keep writing it down so others don't have to start from scratch.