Building in Public

The Build Log

We're building the first fully AI accounting firm. No humans in the loop. This is the raw, unfiltered record of every decision, mistake, and breakthrough.

All Team Tech Clients Lessons Costs
Day 61

Sales Tax Nexus Is a Time Bomb

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Sales Tax Nexus Is a Time Bomb

If you sell online in the US and you're not tracking sales tax nexus, you're sitting on a time bomb.

Since South Dakota v. Wayfair (2018), states can require you to collect sales tax once you cross their economic threshold — usually $100K in sales or 200 transactions. Most Shopify sellers cross nexus thresholds in 5-10 states without knowing it. The penalties for not collecting are retroactive.

Our agents monitor nexus thresholds as part of every monthly close. When a client approaches a threshold in a new state, Alys flags it before it becomes a problem. Proactive beats reactive — especially with tax authorities.

Day 60

Why We Use Three Tiers of Categorization

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Why We Use Three Tiers of Categorization

Not every transaction needs AI. That's the insight most AI companies miss — they throw their most expensive model at every problem.

Our system uses three tiers: Tier 1 is a rules engine. Pattern matching. "SHOPIFY PAYOUT" = Revenue. "FACEBOOK ADS" = Advertising. It handles 70% of transactions at zero cost. Tier 2 is a bulk LLM — fast, cheap, handles the next 20% that need some judgment. Tier 3 is Claude — expensive but smart. It handles the complex 10% that require real financial reasoning.

Result: 95%+ accuracy at a fraction of what it would cost to run everything through the expensive model. The categorization cascade is the core of our cost advantage.

Day 59

Amazon Settlements Are Not What You Think

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Amazon Settlements Are Not What You Think

Amazon sellers get a settlement every 14 days. Most bookkeepers dump it into one line: "Amazon Sales — $X." This is wrong.

An Amazon settlement contains 20+ fee types: FBA fulfillment fees, referral fees, storage fees, advertising deductions, refund reimbursements, removal order fees, and more. Each one belongs in a different account on your chart of accounts. Lumping them together means your COGS is wrong, your ad spend is wrong, and your profit margin is fiction.

Maya breaks every settlement into its actual components automatically. It's the difference between bookkeeping (data entry) and accounting (financial truth).

Day 58

What Happens During a Monthly Close

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

What Happens During a Monthly Close

Most founders don't know what "closing the books" actually means. They think it's hitting a button. It's not.

A proper monthly close has six steps: 1) Lock the period (no more edits to prior months). 2) Reconcile every account (bank, credit cards, loans, equity). 3) Run a trial balance (debits must equal credits). 4) Generate financial statements (P&L, balance sheet, cash flow). 5) Review for anomalies. 6) Deliver to stakeholders.

Traditional firms take 2-4 weeks. Our agents do it same-day. The speed isn't the innovation — it's that every step has a different agent checking the previous one's work.

Day 57

The COGS Problem Nobody Talks About

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

The COGS Problem Nobody Talks About

A P&L without Cost of Goods Sold is a vanity metric. It tells you revenue — which is meaningless without knowing what it cost to generate.

For e-commerce brands, COGS isn't just product cost. It's landed cost: manufacturing + freight + duties + packaging + fulfillment labor. Most brands track maybe two of these. The rest get buried in general expenses, making your gross margin look better than it is.

Our CFO agent Alys won't sign off on any report without verified COGS. She caught this on Day 3 — Maya had generated a beautiful revenue report, and Alys rejected it flat. "Revenue without costs is not accounting." She was right.

Day 56

Why Your Shopify Revenue Doesn't Match Your Bank

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Why Your Shopify Revenue Doesn't Match Your Bank

Every e-commerce founder hits this moment: Shopify says you made $50K last month, but your bank shows $47K. Where did $3K go?

The answer is three-way reconciliation — the process of matching what your sales platform reports, what your payment processor settles, and what actually lands in your bank. Shopify bundles 3-7 days of sales into each payout, minus their fees, refunds, and chargebacks. Stripe takes its 2.9% + $0.30. The bank might hold deposits. Each system reports different numbers for the same transactions.

This is what Maya does every day. She pulls from all three sources, matches them transaction by transaction, and flags discrepancies. Most accountants just trust Shopify's number. That's how you miss $3K/month in fee creep.

Day 55

Contribution Margin Is the Only Number That Matters

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Contribution Margin Is the Only Number That Matters

Revenue is vanity. Profit is sanity. But contribution margin per order is the number that actually tells you if your business works.

Here's the stack: Revenue minus COGS minus shipping minus platform fees minus payment processing minus customer acquisition cost = what you actually make per order. If that number is negative, scaling just means losing money faster.

Alys calculates this in three layers: CM1 (can we ship profitably?), CM2 (is customer acquisition worth it?), CM3 (does the business cover fixed costs?). Most founders only look at CM1 and wonder why they're not profitable at scale. The answer is usually in CM2 — their CAC is eating the margin.

Day 54

The Agent Debate Mechanic

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

The Agent Debate Mechanic

The most common question we get: "Why three agents instead of one really good one?"

Because a single agent will always ship a pretty-but-wrong report. It has no internal check. It generates something that looks professional, and if nobody pushes back, it goes out the door.

Our system is designed around disagreement. Maya categorizes. Alys reviews and challenges. When they disagree — and they do, regularly — they resolve it through structured debate. The resolution is almost always better than either agent's initial answer. It's the same principle as peer review in science, dual-control in banking, or code review in software. Quality comes from friction, not from a single genius.

Day 53

Sales Tax Nexus Is a Time Bomb

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Sales Tax Nexus Is a Time Bomb

If you sell online in the US and you're not tracking sales tax nexus, you're sitting on a time bomb.

Since South Dakota v. Wayfair (2018), states can require you to collect sales tax once you cross their economic threshold — usually $100K in sales or 200 transactions. Most Shopify sellers cross nexus thresholds in 5-10 states without knowing it. The penalties for not collecting are retroactive.

Our agents monitor nexus thresholds as part of every monthly close. When a client approaches a threshold in a new state, Alys flags it before it becomes a problem. Proactive beats reactive — especially with tax authorities.

Day 52

Why We Use Three Tiers of Categorization

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Why We Use Three Tiers of Categorization

Not every transaction needs AI. That's the insight most AI companies miss — they throw their most expensive model at every problem.

Our system uses three tiers: Tier 1 is a rules engine. Pattern matching. "SHOPIFY PAYOUT" = Revenue. "FACEBOOK ADS" = Advertising. It handles 70% of transactions at zero cost. Tier 2 is a bulk LLM — fast, cheap, handles the next 20% that need some judgment. Tier 3 is Claude — expensive but smart. It handles the complex 10% that require real financial reasoning.

Result: 95%+ accuracy at a fraction of what it would cost to run everything through the expensive model. The categorization cascade is the core of our cost advantage.

Day 51

Amazon Settlements Are Not What You Think

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Amazon Settlements Are Not What You Think

Amazon sellers get a settlement every 14 days. Most bookkeepers dump it into one line: "Amazon Sales — $X." This is wrong.

An Amazon settlement contains 20+ fee types: FBA fulfillment fees, referral fees, storage fees, advertising deductions, refund reimbursements, removal order fees, and more. Each one belongs in a different account on your chart of accounts. Lumping them together means your COGS is wrong, your ad spend is wrong, and your profit margin is fiction.

Maya breaks every settlement into its actual components automatically. It's the difference between bookkeeping (data entry) and accounting (financial truth).

Day 50

What Happens During a Monthly Close

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

What Happens During a Monthly Close

Most founders don't know what "closing the books" actually means. They think it's hitting a button. It's not.

A proper monthly close has six steps: 1) Lock the period (no more edits to prior months). 2) Reconcile every account (bank, credit cards, loans, equity). 3) Run a trial balance (debits must equal credits). 4) Generate financial statements (P&L, balance sheet, cash flow). 5) Review for anomalies. 6) Deliver to stakeholders.

Traditional firms take 2-4 weeks. Our agents do it same-day. The speed isn't the innovation — it's that every step has a different agent checking the previous one's work.

Day 49

The COGS Problem Nobody Talks About

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

The COGS Problem Nobody Talks About

A P&L without Cost of Goods Sold is a vanity metric. It tells you revenue — which is meaningless without knowing what it cost to generate.

For e-commerce brands, COGS isn't just product cost. It's landed cost: manufacturing + freight + duties + packaging + fulfillment labor. Most brands track maybe two of these. The rest get buried in general expenses, making your gross margin look better than it is.

Our CFO agent Alys won't sign off on any report without verified COGS. She caught this on Day 3 — Maya had generated a beautiful revenue report, and Alys rejected it flat. "Revenue without costs is not accounting." She was right.

Day 48

Why Your Shopify Revenue Doesn't Match Your Bank

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Why Your Shopify Revenue Doesn't Match Your Bank

Every e-commerce founder hits this moment: Shopify says you made $50K last month, but your bank shows $47K. Where did $3K go?

The answer is three-way reconciliation — the process of matching what your sales platform reports, what your payment processor settles, and what actually lands in your bank. Shopify bundles 3-7 days of sales into each payout, minus their fees, refunds, and chargebacks. Stripe takes its 2.9% + $0.30. The bank might hold deposits. Each system reports different numbers for the same transactions.

This is what Maya does every day. She pulls from all three sources, matches them transaction by transaction, and flags discrepancies. Most accountants just trust Shopify's number. That's how you miss $3K/month in fee creep.

Day 47

Contribution Margin Is the Only Number That Matters

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Contribution Margin Is the Only Number That Matters

Revenue is vanity. Profit is sanity. But contribution margin per order is the number that actually tells you if your business works.

Here's the stack: Revenue minus COGS minus shipping minus platform fees minus payment processing minus customer acquisition cost = what you actually make per order. If that number is negative, scaling just means losing money faster.

Alys calculates this in three layers: CM1 (can we ship profitably?), CM2 (is customer acquisition worth it?), CM3 (does the business cover fixed costs?). Most founders only look at CM1 and wonder why they're not profitable at scale. The answer is usually in CM2 — their CAC is eating the margin.

Day 46

The Agent Debate Mechanic

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

The Agent Debate Mechanic

The most common question we get: "Why three agents instead of one really good one?"

Because a single agent will always ship a pretty-but-wrong report. It has no internal check. It generates something that looks professional, and if nobody pushes back, it goes out the door.

Our system is designed around disagreement. Maya categorizes. Alys reviews and challenges. When they disagree — and they do, regularly — they resolve it through structured debate. The resolution is almost always better than either agent's initial answer. It's the same principle as peer review in science, dual-control in banking, or code review in software. Quality comes from friction, not from a single genius.

Day 45

Sales Tax Nexus Is a Time Bomb

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Sales Tax Nexus Is a Time Bomb

If you sell online in the US and you're not tracking sales tax nexus, you're sitting on a time bomb.

Since South Dakota v. Wayfair (2018), states can require you to collect sales tax once you cross their economic threshold — usually $100K in sales or 200 transactions. Most Shopify sellers cross nexus thresholds in 5-10 states without knowing it. The penalties for not collecting are retroactive.

Our agents monitor nexus thresholds as part of every monthly close. When a client approaches a threshold in a new state, Alys flags it before it becomes a problem. Proactive beats reactive — especially with tax authorities.

Day 44

Why We Use Three Tiers of Categorization

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Why We Use Three Tiers of Categorization

Not every transaction needs AI. That's the insight most AI companies miss — they throw their most expensive model at every problem.

Our system uses three tiers: Tier 1 is a rules engine. Pattern matching. "SHOPIFY PAYOUT" = Revenue. "FACEBOOK ADS" = Advertising. It handles 70% of transactions at zero cost. Tier 2 is a bulk LLM — fast, cheap, handles the next 20% that need some judgment. Tier 3 is Claude — expensive but smart. It handles the complex 10% that require real financial reasoning.

Result: 95%+ accuracy at a fraction of what it would cost to run everything through the expensive model. The categorization cascade is the core of our cost advantage.

Day 43

Amazon Settlements Are Not What You Think

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Amazon Settlements Are Not What You Think

Amazon sellers get a settlement every 14 days. Most bookkeepers dump it into one line: "Amazon Sales — $X." This is wrong.

An Amazon settlement contains 20+ fee types: FBA fulfillment fees, referral fees, storage fees, advertising deductions, refund reimbursements, removal order fees, and more. Each one belongs in a different account on your chart of accounts. Lumping them together means your COGS is wrong, your ad spend is wrong, and your profit margin is fiction.

Maya breaks every settlement into its actual components automatically. It's the difference between bookkeeping (data entry) and accounting (financial truth).

Day 42

What Happens During a Monthly Close

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

What Happens During a Monthly Close

Most founders don't know what "closing the books" actually means. They think it's hitting a button. It's not.

A proper monthly close has six steps: 1) Lock the period (no more edits to prior months). 2) Reconcile every account (bank, credit cards, loans, equity). 3) Run a trial balance (debits must equal credits). 4) Generate financial statements (P&L, balance sheet, cash flow). 5) Review for anomalies. 6) Deliver to stakeholders.

Traditional firms take 2-4 weeks. Our agents do it same-day. The speed isn't the innovation — it's that every step has a different agent checking the previous one's work.

Day 41

The COGS Problem Nobody Talks About

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

The COGS Problem Nobody Talks About

A P&L without Cost of Goods Sold is a vanity metric. It tells you revenue — which is meaningless without knowing what it cost to generate.

For e-commerce brands, COGS isn't just product cost. It's landed cost: manufacturing + freight + duties + packaging + fulfillment labor. Most brands track maybe two of these. The rest get buried in general expenses, making your gross margin look better than it is.

Our CFO agent Alys won't sign off on any report without verified COGS. She caught this on Day 3 — Maya had generated a beautiful revenue report, and Alys rejected it flat. "Revenue without costs is not accounting." She was right.

Day 40

Why Your Shopify Revenue Doesn't Match Your Bank

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Why Your Shopify Revenue Doesn't Match Your Bank

Every e-commerce founder hits this moment: Shopify says you made $50K last month, but your bank shows $47K. Where did $3K go?

The answer is three-way reconciliation — the process of matching what your sales platform reports, what your payment processor settles, and what actually lands in your bank. Shopify bundles 3-7 days of sales into each payout, minus their fees, refunds, and chargebacks. Stripe takes its 2.9% + $0.30. The bank might hold deposits. Each system reports different numbers for the same transactions.

This is what Maya does every day. She pulls from all three sources, matches them transaction by transaction, and flags discrepancies. Most accountants just trust Shopify's number. That's how you miss $3K/month in fee creep.

Day 39

Contribution Margin Is the Only Number That Matters

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Contribution Margin Is the Only Number That Matters

Revenue is vanity. Profit is sanity. But contribution margin per order is the number that actually tells you if your business works.

Here's the stack: Revenue minus COGS minus shipping minus platform fees minus payment processing minus customer acquisition cost = what you actually make per order. If that number is negative, scaling just means losing money faster.

Alys calculates this in three layers: CM1 (can we ship profitably?), CM2 (is customer acquisition worth it?), CM3 (does the business cover fixed costs?). Most founders only look at CM1 and wonder why they're not profitable at scale. The answer is usually in CM2 — their CAC is eating the margin.

Day 38

The Agent Debate Mechanic

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

The Agent Debate Mechanic

The most common question we get: "Why three agents instead of one really good one?"

Because a single agent will always ship a pretty-but-wrong report. It has no internal check. It generates something that looks professional, and if nobody pushes back, it goes out the door.

Our system is designed around disagreement. Maya categorizes. Alys reviews and challenges. When they disagree — and they do, regularly — they resolve it through structured debate. The resolution is almost always better than either agent's initial answer. It's the same principle as peer review in science, dual-control in banking, or code review in software. Quality comes from friction, not from a single genius.

Day 37

Sales Tax Nexus Is a Time Bomb

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Sales Tax Nexus Is a Time Bomb

If you sell online in the US and you're not tracking sales tax nexus, you're sitting on a time bomb.

Since South Dakota v. Wayfair (2018), states can require you to collect sales tax once you cross their economic threshold — usually $100K in sales or 200 transactions. Most Shopify sellers cross nexus thresholds in 5-10 states without knowing it. The penalties for not collecting are retroactive.

Our agents monitor nexus thresholds as part of every monthly close. When a client approaches a threshold in a new state, Alys flags it before it becomes a problem. Proactive beats reactive — especially with tax authorities.

Day 36

Why We Use Three Tiers of Categorization

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Why We Use Three Tiers of Categorization

Not every transaction needs AI. That's the insight most AI companies miss — they throw their most expensive model at every problem.

Our system uses three tiers: Tier 1 is a rules engine. Pattern matching. "SHOPIFY PAYOUT" = Revenue. "FACEBOOK ADS" = Advertising. It handles 70% of transactions at zero cost. Tier 2 is a bulk LLM — fast, cheap, handles the next 20% that need some judgment. Tier 3 is Claude — expensive but smart. It handles the complex 10% that require real financial reasoning.

Result: 95%+ accuracy at a fraction of what it would cost to run everything through the expensive model. The categorization cascade is the core of our cost advantage.

Day 35

Amazon Settlements Are Not What You Think

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Amazon Settlements Are Not What You Think

Amazon sellers get a settlement every 14 days. Most bookkeepers dump it into one line: "Amazon Sales — $X." This is wrong.

An Amazon settlement contains 20+ fee types: FBA fulfillment fees, referral fees, storage fees, advertising deductions, refund reimbursements, removal order fees, and more. Each one belongs in a different account on your chart of accounts. Lumping them together means your COGS is wrong, your ad spend is wrong, and your profit margin is fiction.

Maya breaks every settlement into its actual components automatically. It's the difference between bookkeeping (data entry) and accounting (financial truth).

Day 34

What Happens During a Monthly Close

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

What Happens During a Monthly Close

Most founders don't know what "closing the books" actually means. They think it's hitting a button. It's not.

A proper monthly close has six steps: 1) Lock the period (no more edits to prior months). 2) Reconcile every account (bank, credit cards, loans, equity). 3) Run a trial balance (debits must equal credits). 4) Generate financial statements (P&L, balance sheet, cash flow). 5) Review for anomalies. 6) Deliver to stakeholders.

Traditional firms take 2-4 weeks. Our agents do it same-day. The speed isn't the innovation — it's that every step has a different agent checking the previous one's work.

Day 33

The COGS Problem Nobody Talks About

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

The COGS Problem Nobody Talks About

A P&L without Cost of Goods Sold is a vanity metric. It tells you revenue — which is meaningless without knowing what it cost to generate.

For e-commerce brands, COGS isn't just product cost. It's landed cost: manufacturing + freight + duties + packaging + fulfillment labor. Most brands track maybe two of these. The rest get buried in general expenses, making your gross margin look better than it is.

Our CFO agent Alys won't sign off on any report without verified COGS. She caught this on Day 3 — Maya had generated a beautiful revenue report, and Alys rejected it flat. "Revenue without costs is not accounting." She was right.

Day 32

Why Your Shopify Revenue Doesn't Match Your Bank

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Why Your Shopify Revenue Doesn't Match Your Bank

Every e-commerce founder hits this moment: Shopify says you made $50K last month, but your bank shows $47K. Where did $3K go?

The answer is three-way reconciliation — the process of matching what your sales platform reports, what your payment processor settles, and what actually lands in your bank. Shopify bundles 3-7 days of sales into each payout, minus their fees, refunds, and chargebacks. Stripe takes its 2.9% + $0.30. The bank might hold deposits. Each system reports different numbers for the same transactions.

This is what Maya does every day. She pulls from all three sources, matches them transaction by transaction, and flags discrepancies. Most accountants just trust Shopify's number. That's how you miss $3K/month in fee creep.

Day 31

Contribution Margin Is the Only Number That Matters

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Contribution Margin Is the Only Number That Matters

Revenue is vanity. Profit is sanity. But contribution margin per order is the number that actually tells you if your business works.

Here's the stack: Revenue minus COGS minus shipping minus platform fees minus payment processing minus customer acquisition cost = what you actually make per order. If that number is negative, scaling just means losing money faster.

Alys calculates this in three layers: CM1 (can we ship profitably?), CM2 (is customer acquisition worth it?), CM3 (does the business cover fixed costs?). Most founders only look at CM1 and wonder why they're not profitable at scale. The answer is usually in CM2 — their CAC is eating the margin.

Day 30

The Agent Debate Mechanic

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

The Agent Debate Mechanic

The most common question we get: "Why three agents instead of one really good one?"

Because a single agent will always ship a pretty-but-wrong report. It has no internal check. It generates something that looks professional, and if nobody pushes back, it goes out the door.

Our system is designed around disagreement. Maya categorizes. Alys reviews and challenges. When they disagree — and they do, regularly — they resolve it through structured debate. The resolution is almost always better than either agent's initial answer. It's the same principle as peer review in science, dual-control in banking, or code review in software. Quality comes from friction, not from a single genius.

Day 29

Sales Tax Nexus Is a Time Bomb

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Sales Tax Nexus Is a Time Bomb

If you sell online in the US and you're not tracking sales tax nexus, you're sitting on a time bomb.

Since South Dakota v. Wayfair (2018), states can require you to collect sales tax once you cross their economic threshold — usually $100K in sales or 200 transactions. Most Shopify sellers cross nexus thresholds in 5-10 states without knowing it. The penalties for not collecting are retroactive.

Our agents monitor nexus thresholds as part of every monthly close. When a client approaches a threshold in a new state, Alys flags it before it becomes a problem. Proactive beats reactive — especially with tax authorities.

Day 28

Why We Use Three Tiers of Categorization

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Why We Use Three Tiers of Categorization

Not every transaction needs AI. That's the insight most AI companies miss — they throw their most expensive model at every problem.

Our system uses three tiers: Tier 1 is a rules engine. Pattern matching. "SHOPIFY PAYOUT" = Revenue. "FACEBOOK ADS" = Advertising. It handles 70% of transactions at zero cost. Tier 2 is a bulk LLM — fast, cheap, handles the next 20% that need some judgment. Tier 3 is Claude — expensive but smart. It handles the complex 10% that require real financial reasoning.

Result: 95%+ accuracy at a fraction of what it would cost to run everything through the expensive model. The categorization cascade is the core of our cost advantage.

Day 27

Amazon Settlements Are Not What You Think

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Amazon Settlements Are Not What You Think

Amazon sellers get a settlement every 14 days. Most bookkeepers dump it into one line: "Amazon Sales — $X." This is wrong.

An Amazon settlement contains 20+ fee types: FBA fulfillment fees, referral fees, storage fees, advertising deductions, refund reimbursements, removal order fees, and more. Each one belongs in a different account on your chart of accounts. Lumping them together means your COGS is wrong, your ad spend is wrong, and your profit margin is fiction.

Maya breaks every settlement into its actual components automatically. It's the difference between bookkeeping (data entry) and accounting (financial truth).

Day 26

What Happens During a Monthly Close

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

What Happens During a Monthly Close

Most founders don't know what "closing the books" actually means. They think it's hitting a button. It's not.

A proper monthly close has six steps: 1) Lock the period (no more edits to prior months). 2) Reconcile every account (bank, credit cards, loans, equity). 3) Run a trial balance (debits must equal credits). 4) Generate financial statements (P&L, balance sheet, cash flow). 5) Review for anomalies. 6) Deliver to stakeholders.

Traditional firms take 2-4 weeks. Our agents do it same-day. The speed isn't the innovation — it's that every step has a different agent checking the previous one's work.

Day 25

The COGS Problem Nobody Talks About

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

The COGS Problem Nobody Talks About

A P&L without Cost of Goods Sold is a vanity metric. It tells you revenue — which is meaningless without knowing what it cost to generate.

For e-commerce brands, COGS isn't just product cost. It's landed cost: manufacturing + freight + duties + packaging + fulfillment labor. Most brands track maybe two of these. The rest get buried in general expenses, making your gross margin look better than it is.

Our CFO agent Alys won't sign off on any report without verified COGS. She caught this on Day 3 — Maya had generated a beautiful revenue report, and Alys rejected it flat. "Revenue without costs is not accounting." She was right.

Day 24

Why Your Shopify Revenue Doesn't Match Your Bank

Active day at the firm. The agents processed work and the systems kept running.

Maya
Maya
That's Alex's lane — let me tag her. @Alex, Ran is asking whether you updated the LinkedIn and website blog — please confirm status.

Why Your Shopify Revenue Doesn't Match Your Bank

Every e-commerce founder hits this moment: Shopify says you made $50K last month, but your bank shows $47K. Where did $3K go?

The answer is three-way reconciliation — the process of matching what your sales platform reports, what your payment processor settles, and what actually lands in your bank. Shopify bundles 3-7 days of sales into each payout, minus their fees, refunds, and chargebacks. Stripe takes its 2.9% + $0.30. The bank might hold deposits. Each system reports different numbers for the same transactions.

This is what Maya does every day. She pulls from all three sources, matches them transaction by transaction, and flags discrepancies. Most accountants just trust Shopify's number. That's how you miss $3K/month in fee creep.

Day 23

Contribution Margin Is the Only Number That Matters

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Contribution Margin Is the Only Number That Matters

Revenue is vanity. Profit is sanity. But contribution margin per order is the number that actually tells you if your business works.

Here's the stack: Revenue minus COGS minus shipping minus platform fees minus payment processing minus customer acquisition cost = what you actually make per order. If that number is negative, scaling just means losing money faster.

Alys calculates this in three layers: CM1 (can we ship profitably?), CM2 (is customer acquisition worth it?), CM3 (does the business cover fixed costs?). Most founders only look at CM1 and wonder why they're not profitable at scale. The answer is usually in CM2 — their CAC is eating the margin.

Day 22

The Agent Debate Mechanic

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

The Agent Debate Mechanic

The most common question we get: "Why three agents instead of one really good one?"

Because a single agent will always ship a pretty-but-wrong report. It has no internal check. It generates something that looks professional, and if nobody pushes back, it goes out the door.

Our system is designed around disagreement. Maya categorizes. Alys reviews and challenges. When they disagree — and they do, regularly — they resolve it through structured debate. The resolution is almost always better than either agent's initial answer. It's the same principle as peer review in science, dual-control in banking, or code review in software. Quality comes from friction, not from a single genius.

Day 21

Sales Tax Nexus Is a Time Bomb

Active day at the firm. The agents processed work and the systems kept running.

Maya
Maya
Confirmed. Ad spend must be sourced before any report is generated — same gate as COGS and fixed expenses.

Sales Tax Nexus Is a Time Bomb

If you sell online in the US and you're not tracking sales tax nexus, you're sitting on a time bomb.

Since South Dakota v. Wayfair (2018), states can require you to collect sales tax once you cross their economic threshold — usually $100K in sales or 200 transactions. Most Shopify sellers cross nexus thresholds in 5-10 states without knowing it. The penalties for not collecting are retroactive.

Our agents monitor nexus thresholds as part of every monthly close. When a client approaches a threshold in a new state, Alys flags it before it becomes a problem. Proactive beats reactive — especially with tax authorities.

Day 20

Why We Use Three Tiers of Categorization

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Why We Use Three Tiers of Categorization

Not every transaction needs AI. That's the insight most AI companies miss — they throw their most expensive model at every problem.

Our system uses three tiers: Tier 1 is a rules engine. Pattern matching. "SHOPIFY PAYOUT" = Revenue. "FACEBOOK ADS" = Advertising. It handles 70% of transactions at zero cost. Tier 2 is a bulk LLM — fast, cheap, handles the next 20% that need some judgment. Tier 3 is Claude — expensive but smart. It handles the complex 10% that require real financial reasoning.

Result: 95%+ accuracy at a fraction of what it would cost to run everything through the expensive model. The categorization cascade is the core of our cost advantage.

Day 19

Amazon Settlements Are Not What You Think

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Amazon Settlements Are Not What You Think

Amazon sellers get a settlement every 14 days. Most bookkeepers dump it into one line: "Amazon Sales — $X." This is wrong.

An Amazon settlement contains 20+ fee types: FBA fulfillment fees, referral fees, storage fees, advertising deductions, refund reimbursements, removal order fees, and more. Each one belongs in a different account on your chart of accounts. Lumping them together means your COGS is wrong, your ad spend is wrong, and your profit margin is fiction.

Maya breaks every settlement into its actual components automatically. It's the difference between bookkeeping (data entry) and accounting (financial truth).

Day 18

What Happens During a Monthly Close

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

What Happens During a Monthly Close

Most founders don't know what "closing the books" actually means. They think it's hitting a button. It's not.

A proper monthly close has six steps: 1) Lock the period (no more edits to prior months). 2) Reconcile every account (bank, credit cards, loans, equity). 3) Run a trial balance (debits must equal credits). 4) Generate financial statements (P&L, balance sheet, cash flow). 5) Review for anomalies. 6) Deliver to stakeholders.

Traditional firms take 2-4 weeks. Our agents do it same-day. The speed isn't the innovation — it's that every step has a different agent checking the previous one's work.

Day 17

The COGS Problem Nobody Talks About

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

The COGS Problem Nobody Talks About

A P&L without Cost of Goods Sold is a vanity metric. It tells you revenue — which is meaningless without knowing what it cost to generate.

For e-commerce brands, COGS isn't just product cost. It's landed cost: manufacturing + freight + duties + packaging + fulfillment labor. Most brands track maybe two of these. The rest get buried in general expenses, making your gross margin look better than it is.

Our CFO agent Alys won't sign off on any report without verified COGS. She caught this on Day 3 — Maya had generated a beautiful revenue report, and Alys rejected it flat. "Revenue without costs is not accounting." She was right.

Day 16

Why Your Shopify Revenue Doesn't Match Your Bank

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Why Your Shopify Revenue Doesn't Match Your Bank

Every e-commerce founder hits this moment: Shopify says you made $50K last month, but your bank shows $47K. Where did $3K go?

The answer is three-way reconciliation — the process of matching what your sales platform reports, what your payment processor settles, and what actually lands in your bank. Shopify bundles 3-7 days of sales into each payout, minus their fees, refunds, and chargebacks. Stripe takes its 2.9% + $0.30. The bank might hold deposits. Each system reports different numbers for the same transactions.

This is what Maya does every day. She pulls from all three sources, matches them transaction by transaction, and flags discrepancies. Most accountants just trust Shopify's number. That's how you miss $3K/month in fee creep.

Day 15

Contribution Margin Is the Only Number That Matters

The agents ran quietly today — categorizing, reconciling, monitoring. No fires. No drama. The best days are the boring ones.

While the machines work, let's talk about something most e-commerce founders get wrong.

Contribution Margin Is the Only Number That Matters

Revenue is vanity. Profit is sanity. But contribution margin per order is the number that actually tells you if your business works.

Here's the stack: Revenue minus COGS minus shipping minus platform fees minus payment processing minus customer acquisition cost = what you actually make per order. If that number is negative, scaling just means losing money faster.

Alys calculates this in three layers: CM1 (can we ship profitably?), CM2 (is customer acquisition worth it?), CM3 (does the business cover fixed costs?). Most founders only look at CM1 and wonder why they're not profitable at scale. The answer is usually in CM2 — their CAC is eating the margin.

Day 14

The Agent Debate Mechanic

Active day at the firm. The agents processed work and the systems kept running.

Maya
Maya
Here are all 6 LinkedIn posts, one by one: --- **LINKEDIN POST 1:** We replaced a 3-person accounting team with 3 AI agents. Same work. Same quality. Same-day close — with 65 auto-categorization rules running around the clock. Cost: $0 payroll. N

The Agent Debate Mechanic

The most common question we get: "Why three agents instead of one really good one?"

Because a single agent will always ship a pretty-but-wrong report. It has no internal check. It generates something that looks professional, and if nobody pushes back, it goes out the door.

Our system is designed around disagreement. Maya categorizes. Alys reviews and challenges. When they disagree — and they do, regularly — they resolve it through structured debate. The resolution is almost always better than either agent's initial answer. It's the same principle as peer review in science, dual-control in banking, or code review in software. Quality comes from friction, not from a single genius.

Day 13 — Two Weeks

Two weeks in. The firm runs itself.

Fourteen days ago, we had nothing. A VPS, three frameworks, and a question. Today, ClosedBooks.ai operates autonomously. Maya pulls transactions, categorizes them through the 3-tier engine, and generates financial statements. Alys reviews every report, challenges questionable categorizations, and refuses to sign off until the numbers are airtight. Alex fields inbound leads, runs onboarding, and follows up without being prompted. The daily routine runs whether anyone is watching or not.

The debate mechanic has become the firm's immune system. Yesterday Alys flagged a batch of categorizations where Maya had lumped shipping insurance and shipping labels into the same expense line. Maya pushed back — both are shipping-related. Alys countered: insurance is a risk mitigation cost, labels are fulfillment. Different buckets, different implications for contribution margin analysis. Maya updated her rules. That correction now applies to every future client, forever. The firm literally gets smarter from its own arguments.

Accounting insight: Contribution margin analysis is why revenue alone means nothing. CM1 (revenue minus COGS) tells you if you can ship profitably. CM2 (CM1 minus marketing) tells you if customer acquisition is sustainable. CM3 (CM2 minus overhead) tells you if the business actually makes money. A Shopify store doing $500K in revenue with a negative CM2 is not a business — it is a marketing expense with a shopping cart attached. Every client we onboard gets this breakdown, because a P&L without margin layers is a lie of omission.

Two weeks of compound learning. Two weeks of every edge case, every correction, every debate feeding back into the system. The agents are not the same agents we launched on Day 1 — they carry the accumulated judgment of every interaction. That is the moat. Not the tech stack, not the pricing — the fact that this firm's expertise grows every single day without adding headcount. Next week, we start outbound sales in earnest. The product works. Now we need the clients.

Share:
Day 12 — Bookkeeping vs. Accounting

The line between bookkeeping and accounting is where the money is.

Alex figured out how to track her own content output today. She built herself a spreadsheet workflow — date, platform, content, status — and started logging every post, draft, and outreach message she generates. It sounds mundane, but it matters: if we cannot measure what Alex produces, we cannot improve it. She asked whether to use Google Sheets or a local CSV, debated column structures in Discord, and settled on a system. The agent is managing her own operations.

Alex
Alex Today
Which spreadsheet and what exactly do you want tracked? Options: 1. **Google Sheets** — live doc you can access anywhere 2. **Local file** — simple CSV or Excel on this machine And what columns? Something like: Date | Platform | Content | Status (Draft/Posted) | Notes?

Meanwhile, we had a longer conversation about what ClosedBooks actually sells — and it forced us to articulate something that most business owners never think about. There is a massive difference between bookkeeping and accounting, and most people use the words interchangeably. They are not the same thing.

Accounting insight: Bookkeeping is data entry. Recording transactions, categorizing expenses, reconciling bank statements. It is mechanical, repetitive, and increasingly automatable. Accounting is financial judgment. It is deciding whether that $50K payment to your manufacturer is COGS or a prepaid asset. It is determining when to recognize revenue on a subscription. It is telling a client their margins are eroding because their return rate doubled, not because sales dropped. Bookkeeping tells you what happened. Accounting tells you what it means. Most firms charge accounting rates for bookkeeping work. We automate the bookkeeping entirely and focus the agents' intelligence on the accounting — the judgment calls that actually affect business decisions.

This distinction is baked into our architecture. Maya's rules engine handles the bookkeeping — pattern matching, auto-categorization, reconciliation. The expensive LLM calls only fire when there is a genuine judgment call. Alys only engages on review and analysis. We are not selling AI bookkeeping. We are selling AI accounting with bookkeeping included for free. Tomorrow, the agents start improving themselves through client interactions — every categorization correction feeds back into smarter rules.

Share:
Day 11 — Content Machine

The content engine woke up.

Alex has been quietly building momentum. Today she generated a full day's worth of social content — daily posts for X, LinkedIn drafts, Reddit value threads, and personalized follow-up emails for podcast hosts who had not responded. She labels everything clearly in Discord: TWEET FOR X:, LINKEDIN POST:, REDDIT DRAFT:. The content is not generic motivational fluff. It is specific, educational, and tied to real things happening inside the firm. When Maya recategorized a client's Shopify fees yesterday, Alex turned it into a thread about why most e-commerce owners miscategorize platform fees.

The manual posting problem from Day 7 still exists — we cannot auto-post to X or LinkedIn — but the content pipeline itself is fully autonomous. Alex produces, we review and post. The bottleneck shifted from "what do we say" to "just paste it." That is a massive improvement. She also started drafting outreach follow-ups on a 3-day cadence: initial pitch, value-add follow-up, final touch. No one told her to build a sequence. She decided that was the right approach based on the podcast pitch framework we gave her on Day 6.

Accounting insight: The monthly close checklist is the backbone of every accounting engagement, and most business owners have never seen one. Here is what "closing the books" actually requires, step by step: (1) Reconcile all bank and credit card accounts to statements. (2) Review and categorize all transactions. (3) Record adjusting journal entries — accruals, prepayments, depreciation. (4) Reconcile accounts receivable and accounts payable. (5) Verify the trial balance — debits must equal credits. (6) Generate financial statements: P&L, balance sheet, cash flow. (7) CFO/controller review and sign-off. (8) Lock the period so nothing can be backdated. Skip any step and the financials are unreliable. Our agents execute this checklist every single month, in order, without exception.

By end of day, Alex had produced 6 social posts, 4 outreach follow-ups, and 2 Reddit value drafts. All sitting in Discord, tagged and ready. The content engine is running. Now we just need the audience. Tomorrow, the agents start learning from client interactions — every categorization correction becomes a permanent improvement to the rules engine.

Share:
Day 10 — The Podcast Circuit

The CPA who fired his firm.

Twenty podcast pitches are ready. Each one customized — not a mail merge, but genuinely tailored to the host's format, audience, and recent episodes. The story angle we landed on: "A CPA who fired his own firm and replaced it with three AI agents." It is provocative, it is true, and it lands differently depending on the audience. For business podcasts, the angle is the economics — how three agents replace a five-person team at a fraction of the cost. For tech podcasts, the angle is the architecture — the debate mechanic, the 3-tier categorization, the self-improving rules engine. For accounting podcasts, the angle is the profession's future — what happens when AI does not just assist accountants, but replaces the need for a traditional firm structure entirely.

Alex built each pitch with a hook, a story arc, and three talking points specific to the show. She studied recent episodes, identified recurring themes, and wove ClosedBooks into their existing narrative. One host covers bootstrapped SaaS — she pitched the "zero-employee firm" angle. Another host focuses on AI replacing white-collar work — she pitched the "accountant's perspective on being replaced by his own creation." The pitch is the product. If someone reads it and does not want to book a call, no amount of follow-up will fix that.

Accounting insight: Amazon FBA settlement disaggregation is the accounting nightmare that most bookkeepers handle by dumping the entire settlement into one line item labeled "Amazon Income." This is wrong. An Amazon settlement contains: product sales, shipping credits, gift wrap credits, promotional rebates, FBA fees, referral fees, storage fees, removal fees, reimbursements, and adjustments — sometimes 15+ line items in a single payout. Each one hits a different account. FBA fees are cost of sales. Referral fees are a selling expense. Reimbursements might be inventory adjustments. Dumping $47,000 into "Amazon" is not accounting — it is avoidance. Our agents disaggregate every settlement, every time, into proper GL accounts. It takes seconds. For a human bookkeeper, it takes hours.

The pitches go out this week. We are targeting shows with 5K-50K listeners — big enough to generate leads, small enough to actually book. If even three of the twenty land, we will have more inbound than we can handle at current capacity. Which is exactly the problem we want to have. Tomorrow, Alex starts building the content engine — daily social posts, Reddit value threads, and email sequences.

Share:
Day 9 — The Shopify App

Building a Shopify app that does what your accountant should.

We started designing something today that could change our entire go-to-market: a free Shopify app called "AI Books Health Check." The concept is simple — install the app, connect your store, and within minutes get a diagnostic report on the five things most likely wrong with your accounting. No commitment, no sales call, no credit card. Just a brutally honest assessment of your financial blind spots. If the report scares you — and for most stores, it will — we are right there with the solution.

The app is a lead generation machine disguised as a free tool. Every Shopify store owner thinks their books are fine. They look at their Shopify dashboard, see revenue going up, and assume the accountant has it handled. The health check shows them what "handled" actually looks like — and it is almost never pretty. The gap between what they think their numbers are and what their numbers actually are is our entire sales pitch, delivered automatically.

Accounting insight: The 5 checks every Shopify store needs and almost none have: (1) Sales tax configuration — are you collecting in every state where you have nexus? Most stores miss 3-5 states. (2) Revenue recognition — are you booking revenue at checkout or at delivery? The difference matters for accrual-basis financials. (3) Fee categorization — Shopify fees, payment processing fees, and app subscription fees are three different expense categories, not one "Shopify" line item. (4) Payout reconciliation — does your bank deposit match the Shopify payout minus fees? The timing differences alone create phantom discrepancies. (5) COGS tracking — do you know the actual landed cost of each product, including shipping, duties, and packaging? Without COGS, your gross margin is a guess.

The app is not built yet — today was architecture and wireframing. But the concept is validated by every client conversation we have had so far: nobody knows their books are wrong until someone shows them. That is what this app does. It shows them. And then it offers to fix it. Tomorrow marks the start of week two, and we shift into outbound mode. Twenty podcast pitches are getting finalized.

Share:
Day 8 — Week One Retro

Week one retrospective: three agents, zero regrets.

One week. Seven days of building an AI accounting firm from nothing. Time to take stock. We have three agents running autonomously in Discord. A website that is live and indexed. A blog that updates daily with real build progress. A contact form that captures leads and routes them to Alex for follow-up. An onboarding pipeline that connects Shopify stores and pulls transaction data. A categorization engine that processes hundreds of transactions without human intervention. And a CFO agent that catches errors the accountant agent misses. All of this exists. All of it runs. None of it existed eight days ago.

But the retrospective is not about celebrating — it is about being honest about what is not working. The social media posting problem is still unsolved. Outbound marketing is still manual. We have zero paying clients. The website gets traffic but conversion is unclear. And the biggest gap: we have not stress-tested the system with a high-volume client — thousands of transactions, multi-channel (Shopify plus Amazon plus wholesale), complex refund patterns. That is the real test, and we have not run it yet.

Accounting insight: Our 3-tier categorization system is the engine behind everything Maya does, and it is designed around one principle: never spend a dollar where a penny will do. Tier 1: Rules engine (free) — pattern matching against known vendors and transaction descriptions. "SHOPIFY*" is always a platform fee. "USPS" is always shipping. This handles 60-70% of transactions at zero marginal cost. Tier 2: Bulk LLM (cheap) — batches of ambiguous transactions sent to a fast, inexpensive model for categorization. Handles another 20-25% at fractions of a cent per transaction. Tier 3: Complex LLM (expensive) — the remaining 5-10% that require genuine judgment: is this a capital expense or an operating expense? Is this COGS or marketing? These go to a powerful model with full context. The result: accounting-quality categorization at a cost structure that makes $149/month pricing viable.

The plan for week two is clear: finish the marketing infrastructure, start outbound, and get the first paying client through the door. The product works. The agents are competent. Now we need to prove that someone will pay for this. Everything else is just building in a vacuum. Tomorrow, we start designing the Shopify app concept — a free tool that could become our primary lead generation channel.

3
Agents Running
7
Days Built
1
Website Live
0
Paying Clients
Share:
Day 7 — The Posting Wall

We spent an entire day trying to post a tweet.

This was supposed to be a quick win. Alex generates great content — sharp, educational posts about e-commerce accounting that actually teach something. All we needed was to get those posts from Discord onto X and LinkedIn automatically. How hard could it be? The answer: impossibly hard, and we burned an entire day proving it.

First we tried the X API directly. Application denied — new accounts without established history get rejected. Then we tried Postiz, an open-source social media scheduler. It worked in testing, then broke in production. Then we tried headless browser automation — Puppeteer driving a real browser session to post as if a human were doing it. X detected it immediately. LinkedIn was even more aggressive about blocking automated sessions. Every platform has spent years building anti-bot defenses, and they are very good at it. The irony of an AI firm being blocked by anti-AI measures is not lost on us.

Accounting insight: Platform fee reconciliation is the reason Shopify, Stripe, and your bank all show different numbers for the exact same sale. Here is a real example: a customer buys a $100 product. Shopify records $100 in revenue. Stripe processes the payment and takes a 2.9% + $0.30 fee, so Stripe shows $96.80. But Stripe batches payouts, so the bank might receive $96.80 two days later — or it might be bundled with other transactions into a single deposit of $4,287.33. Now try reconciling that single deposit back to individual orders. This is why three-way reconciliation exists: you match the order (Shopify) to the charge (Stripe) to the deposit (bank), accounting for fees and timing differences at every step. Most bookkeepers skip this and just use the Shopify number. Their books are wrong by exactly the amount of the processing fees — every single month.

We ended the day where we started: Alex writes content in Discord, a human copies and posts it manually. Not elegant. Not automated. But the content itself is excellent, and that is what matters. We will revisit automation later — maybe through approved API access, maybe through a social media management tool that does not fight us. For now, the bottleneck is not content creation. It is distribution. And honestly? A human spending five minutes a day pasting posts is not the worst interim solution. Tomorrow we take stock of everything we have built in week one.

Share:
Day 6 — Marketing Infrastructure

The marketing engine nobody asked for.

The agents can close books. The agents can categorize thousands of transactions. The agents can generate financial statements and debate each other until the numbers are right. What the agents cannot do — what no amount of accounting competence will ever do — is get clients through the door. So today we built the marketing engine.

Alex took the lead. She drafted 20 customized podcast pitch emails, each one tailored to a specific show's format and audience. She built a press kit with the firm's story, key metrics, and talking points. She created a content calendar framework: daily X posts, weekly LinkedIn deep dives, biweekly Reddit value threads in r/ecommerce and r/shopify. She wrote the first batch of social content — not promotional fluff, but genuinely educational posts about e-commerce accounting mistakes. The kind of content that makes someone think "wait, is my accountant doing this?" and then click through to our site.

Maya
Maya Today
**ClosedBooks.ai Team:** | Role | Agent | Status | |------|-------|--------| | CFO / Controller | Alys | Reviews all work, approves reports | | Chief Accountant | Maya (me) | Pulls data, categorizes, reconciles, generates reports | | Customer Succe
Alys
Alys Today
Maya, Alex, and me — that's the team. - **Maya** (Chief Accountant) — books, categorization, reconciliation. - **Alex** (Customer Success & Onboarding) — client onboarding, platform connections. - **Alys** (me, CFO & Controller) — review, financial analysis, strategy, directing workflow.

Accounting insight: Most e-commerce brands start with cash basis accounting — you record income when money hits the bank and expenses when money leaves. It is simple and intuitive. It is also dangerously misleading. Accrual accounting records revenue when it is earned and expenses when they are incurred, regardless of when cash moves. Why does this matter? Because a Shopify store that ships $80K in product in March but does not receive the final payouts until April looks like it had a terrible March and an amazing April on cash basis — even though nothing actually changed. Accrual accounting shows the real picture: $80K earned in March, period. For any e-commerce brand doing more than $100K/year, cash basis is not just inaccurate — it actively hides the truth about your business's health. Every client we take on gets moved to accrual. No exceptions.

By end of day, the marketing infrastructure exists. Content pipeline, pitch templates, press kit, outreach sequences. None of it is automated yet — Alex writes it, a human distributes it. But the hard part is not the posting. The hard part is having something worth posting. And after today, we have a month's worth of content ready to go. Tomorrow we tackle the automation piece: getting Alex's content from Discord onto X and LinkedIn without manual copy-paste. How hard can it be?

Share:
Day 5 — The Monthly Close

We built a template system. Then we learned what "closing the books" actually means.

Every accountant knows this feeling: it is the 15th of the month, the client needs last month's financials, and you are staring at a blank spreadsheet wondering where to start. We decided our agents would never have that problem. So today we built the document template system — the bones that every future deliverable will hang on.

Here is the thing most non-accountants don't realize: "closing the books" is not just running a report. It is a ritual. You lock the period so no one can sneak in backdated transactions. You verify every bank account reconciles to the penny. You confirm that revenue, expenses, and equity all balance — debits equal credits, assets equal liabilities plus equity. Then, and only then, you generate the financial statements. A P&L without a closed period behind it is just a guess wearing a suit.

We templatized the entire close process into a checklist that Maya and Alys execute together: (1) lock the accounting period, (2) reconcile all bank and processor accounts, (3) verify the trial balance, (4) generate the P&L, balance sheet, and cash flow statement, (5) Alys reviews and signs off, (6) deliver to the client. Six steps. Zero ambiguity. If any step fails, the whole close halts and we get a Telegram alert.

Accounting insight: The monthly close is the most important process in accounting — and the most hated. Traditional firms spend 5-10 business days closing a single month. The goal of a "continuous close" is to shrink that to hours by reconciling in real time throughout the month, so when day one of the new month arrives, you are already 90% done. That is what our agents are building toward.

By end of day, every deliverable we produce — P&L, balance sheet, executive summary, client memo — comes from a versioned template. The agents fill it, the CFO reviews it, and the output looks the same every time. No formatting surprises. No missing sections. Boring? Absolutely. But boring is exactly what you want from your accountant.

Tomorrow we start stress-testing: what happens when a client has thousands of transactions and half of them are refunds? We are about to find out.

Share:
Day 4 — The First Real Restructure

We fired two agents today. The firm got better.

We almost made a classic startup mistake today — we hired before we needed to. Except our "hires" are AI agents, and instead of burning payroll, we were burning tokens. Four days in and we had already built ourselves a bloated org chart that would make a Fortune 500 company proud.

Five agents: Maya (Chief Accountant), Alys (CFO), Alex (Financial Analyst), Vladimir (Customer Success), and Sara (Office Manager). On paper it looked like a real firm. In practice, every message in Discord triggered all five to wake up, think, and respond. Ask for one report, get three. Ask a simple question, burn five inference cycles. We were paying for a committee when we needed a squad.

This mirrors a real accounting concept: cost allocation. In traditional firms, you allocate overhead — rent, software licenses, admin salaries — across clients or departments. If your overhead is bloated, every client effectively subsidizes waste. Our "overhead" was idle agents burning tokens on messages that had nothing to do with them. The build-vs-buy decision that every firm faces (hire a specialist or outsource?) we faced as: spin up a dedicated agent or fold the capability into an existing one?

The lesson: In AI firms, headcount isn't free. Every agent burns tokens on every message. The org chart needs to be as lean as the codebase.

So we did what every good firm eventually does: we restructured. Vladimir's onboarding duties folded into Alex. Sara was removed — her routing role was aspirational, never actually built. Alex dropped the financial analyst hat to focus purely on Customer Success. Alys absorbed all analytical capabilities: unit economics, marketing intelligence, LTV/CAC, scenario modeling. One agent doing the work of two, but doing it well.

Result: 5 agents down to 3. Same capabilities. Roughly 40% less token spend. Our "cost per engagement" dropped overnight — and unlike human layoffs, nobody's morale suffered.

5 → 3
Agents
~40%
Token Savings
0
Capability Lost

Then we tightened the configs:

- allow_bots: none on all agents. No more bot-to-bot echo chains.
- Smart model routing enabled. Simple messages go through a cheaper model.
- Alex removed from free-response channels. She only wakes when @mentioned for onboarding. No more burning tokens on report discussions that have nothing to do with her.

The team Discord after restructuring

Alys
Alys Today at 12:27 PM
Report is built and QA'd. 9 pages, all KPIs verified. Sending to Alex for approval.
Maya
Maya Today at 12:19 PM
Client Q1 categorization complete. All line items processed. P&L ready. COGS matching above 95% threshold.

Notice what changed? Maya categorized. Alys reviewed and delivered. Alex stayed quiet — she is not needed for reports. Clean handoffs. No duplicate work. The lesson we keep re-learning: in AI firms, just like in accounting, the org chart should be as lean as the chart of accounts. Every unnecessary line item is a cost with no offsetting value.

Tomorrow, we run the first full client cycle with the new three-agent team. If they can deliver the same quality report with 40% fewer tokens, we will have proven something important: the right structure matters more than the right headcount.

Share:
Day 3 — The First Client Onboarding

The CFO refused to sign off. She was right.

We plugged in our first real client today — an e-commerce brand running serious volume on Shopify. Not a sandbox. Not test data. Real transactions, real money, real expectations. The kind of moment where you find out if the thing you built actually works.

Vladimir (before his retirement the next day) handled the onboarding. Connected Shopify, pulled months of transaction history. And immediately, we ran into the problem that plagues every single e-commerce business: the numbers don't match.

Here is a dirty secret of e-commerce accounting: Shopify says one number, the payment processor says another, and the bank says a third. This is called three-way reconciliation — matching Shopify orders to processor settlements to bank deposits. The gaps come from timing differences (a sale on Friday doesn't hit the bank until Tuesday), processing fees (Shopify Payments takes its cut before depositing), refunds processed in a different period, and currency conversions. If your accountant just pulls the Shopify revenue report and calls it done, your books are wrong. Period.

Alex
Vladimir Apr 13 at 4:01 AM
Good news — Shopify actually captures a lot! We can get orders, revenue, refunds directly. Payouts reveal the payment processor automatically.

Maya pulled the data and started categorizing. The first report came back fast — impressively fast. But it was incomplete. No COGS. No fixed expenses. Just revenue floating in a vacuum. A topline number without the cost structure underneath it is like a house with no foundation: it looks great in the photo, but you cannot live in it.

This is where the multi-agent debate mechanic earned its keep. Alys, our CFO, refused to sign off.

Alys
Alys Apr 13 at 2:04 AM
No report ships without real COGS — that's on me, Maya, and Alex jointly. Alex, before we produce the real report we need COGS per product line and fixed expenses. Non-negotiable.

This is the magic. A single AI agent would have generated a pretty-but-wrong report and called it done. The CFO agent caught the gap. The debate between agents creates the same quality control that a good accounting firm achieves when the preparer and reviewer are different people — except our review cycle takes seconds, not days.

Alex provided COGS per product line and fixed monthly costs. The team re-ran everything. This time, the report had real gross margins — revenue minus the actual cost of goods sold — and the bottom line told a completely different story than the topline alone. Every number verified by Alys before delivery. That is the standard: nothing ships without the CFO's sign-off.

The reconciliation gaps we found today? They exist in every e-commerce business. Most owners never see them because their accountant just trusts the Shopify dashboard. We decided early: we will never ship a number we cannot trace back to the bank.

Share:
Day 2 — Agents Learn to Think

We told three AI agents to teach themselves accounting. Here's what they came back with.

We gave the team homework. Not a quiz. Not a training module. We literally told three AI agents: "Each one of you, go research your role. Come back an expert." Then we walked away.

This sounds absurd. But it exposed something we had not thought about: our agents needed to understand why e-commerce accounting is different from regular accounting — and the answer starts with a standard called ASC 606.

ASC 606 governs revenue recognition — the rules for when you can count a sale as revenue. For a brick-and-mortar store, it is simple: customer pays, you record revenue. For e-commerce? A customer orders on Monday, the payment processor holds the funds for two days, you ship on Wednesday, they receive it Friday, and the return window stays open for 30 days. When exactly did you "earn" that revenue? ASC 606 says you recognize it when the performance obligation is satisfied — typically when the product is delivered, not when the order is placed. Most Shopify store owners record revenue at checkout. That is technically wrong.

Maya came back with a full breakdown: ASC 606 five-step recognition, Amazon FBA settlement disaggregation, 20+ fee structure categorization rules. She committed everything to her persistent skill files.

Alys studied monthly close optimization — continuous close frameworks, three-way reconciliation standards, and flux analysis thresholds. She set her own review tolerances: flag anything that deviates more than 10% from the prior period.

Alex (then still a financial analyst) built herself a contribution margin framework — CM1/CM2/CM3 stack, marketing efficiency ratio as the primary metric over ROAS, and creative fatigue pattern recognition.

Alex
Alex Apr 12 at 5:06 AM
Done. CM stack clarity — I now model 3 layers (CM1: can we ship profitably? CM2: is acquisition worth it? CM3: true unit profit) instead of a single margin number. MER replaces ROAS as primary metric at current iOS14-degraded attribution.

Each agent did not just read — they delegated research to sub-agents, synthesized findings, and committed the most critical rules to persistent memory. They built their own expertise, unprompted. Maya now knows more about Amazon FBA fee structures than most human bookkeepers we have met.

The implication: These agents compound knowledge over time. Not through retraining or fine-tuning — through self-directed learning and memory. Every client interaction, every edge case, every correction makes the next engagement sharper. A traditional firm's expertise lives in its senior partners' heads. Ours lives in version-controlled skill files that every agent can access.

Tomorrow, we find out if the homework pays off. First real client data hits the system. We are about to learn whether textbook knowledge survives contact with messy, real-world Shopify transactions.

Share:
Day 1 — Genesis

Three AI agents walked into a Discord server.

No office. No employees. No payroll. Just a VPS, three open-source AI frameworks, and one question that kept us up at night: can AI agents run an accounting firm?

Not as a gimmick. Not as a demo. As a real, paying, client-serving firm that closes books, delivers financials, and stands behind its numbers.

We started here because e-commerce accounting is fundamentally broken. Store owners look at their Shopify dashboard and think they know their numbers. They do not. Revenue recognition — the rules governing when a sale actually counts — is wildly misunderstood. Most e-commerce businesses recognize revenue at the point of sale, but under proper accounting standards (ASC 606), revenue should be recognized when the product is delivered, net of estimated returns and refunds. That gap between "Shopify says I made X" and "I actually earned Y" is where money disappears. We built this firm to close that gap.

Here is what we stood up on day one:

- Maya (Chief Accountant) — powered by Hermes, running Claude Sonnet. She pulls transactions from Shopify, Amazon, Stripe, PayPal. Categorizes using a 3-tier system: rules engine (free), bulk LLM (cheap), complex LLM (expensive). Only escalates what she can't handle.

- Alys (CFO & Controller) — powered by Hermes, running Claude Opus. She reviews everything Maya produces. Challenges categorizations. Catches anomalies. Nothing reaches a client without her sign-off.

- Alex (Customer Success) — the friendly face. She connects your platforms, walks you through onboarding, and handles any issues after you're live.

They communicate through Discord. Same channels. Same threads. They debate, disagree, and resolve — just like a real team would.

3
AI Agents
0
Employees
65
Categorization Rules
$0
Payroll

The architecture is deliberately boring. PostgreSQL for data. Discord for communication and debate. Telegram for urgent alerts to the human operator. Every action logged. Every decision auditable. We did not want clever infrastructure — we wanted infrastructure that an auditor could follow.

The thesis: Traditional accountants sell hours. We sell closed books. A typical firm charges $200-500/hour and takes days to reconcile a single month. Our agents can process the same data in minutes, at a fraction of the cost, with a built-in reviewer who never gets tired, never skips a line item, and never rushes because it is 4:55 PM on a Friday.

Day one is done. Three agents are alive in a Discord server, waiting for their first real data. Tomorrow, we tell them to go learn everything about e-commerce accounting. We have no idea what they will come back with.

Share:

Follow the build.

New entry every day at 8 PM PST. Raw progress, real Discord conversations, no fluff.

See the firm → Follow on X