How do I price an AI project for a client?

Three components: (1) build cost (your time × rate), (2) ongoing token cost passed through to client at 2-3x markup, (3) maintenance retainer for prompt tuning and model updates. Be transparent about the token cost — clients trust agencies that show the breakdown.

Should I include LLM API costs in my quote?

Either pass through with markup (most transparent) or absorb into a fixed monthly fee with a usage cap (cleanest billing). Never absorb token cost without a cap — one runaway month can wipe out your project margin.

What margin should I add to LLM costs?

2-3x markup on direct token cost is standard. This covers your wrapper infrastructure, monitoring, prompt tuning time, and edge-case spikes. Below 2x you have no safety net. Above 3x clients will notice and renegotiate.

How much should an AI project cost a client?

For a typical AI feature integration: $5K-25K build cost + $50-500/month ongoing API + 10-20 hours/month maintenance retainer. For a custom agent or RAG system: $15K-75K build + $200-2,000/month API + 20-40 hours/month maintenance.

AI Cost Estimator for Agency Owners Pitching AI Projects to Clients

Last updated: April 20267 min readAI Tools

Agency owners and freelance AI consultants get burned by underquoted projects. The build looks profitable, but six months in the client's monthly token bill is eating the maintenance retainer. Here is how to quote AI projects so you actually make money — and how to talk to clients about ongoing API costs without losing the deal.

The Three-Part AI Project Quote

Every AI project quote should have three line items:

Build (one-time): Your time × your rate. Include discovery, prompt engineering, integration, testing, and deployment.
Ongoing API costs (monthly, variable): Pass through to client at 2-3x markup, with a clear usage cap.
Maintenance retainer (monthly, fixed): 5-20 hours/month for prompt tuning, model updates, monitoring, edge-case handling.

If you bundle all three into a single fixed monthly fee, you take on the variance risk. Clients love it. You hate it the first time a model upgrade or a usage spike eats your margin.

Estimating the API Cost Component

The agency-specific estimation method:

Map the AI feature to a workload type (chatbot, RAG, generation, summarization, classification)
Use the AI Cost Calculator presets to get a starting cost shape
Adjust for the client's actual volume (their user count × usage rate)
Add a 50% buffer for variance and growth
Pick the cheapest model that meets quality requirements
Multiply by 2-3x for your markup

Sample Quote: AI Customer Support Bot

Client: SaaS company with 5,000 active customers. Wants to add a tier-1 support chatbot.

Estimated usage: 30% of customers use the chatbot monthly = 1,500 customers. Average 4 conversations per active user per month = 6,000 conversations. Each conversation ~3,000 input + 800 output tokens.

Token cost on Claude Haiku 3.5 (good for support quality):

6,000 × (3,000 input × $0.80/M + 800 output × $4/M) = $33.60/month direct cost
With 50% buffer: $50.40/month
With 2.5x markup: $126/month billed to client

Quote structure:

Item	Amount	Notes
Build (one-time)	$12,500	Discovery, integration, prompt engineering, testing
API usage (monthly)	$126	Pass-through with markup, capped at 8,000 conversations/mo
Maintenance retainer	$1,500/mo	15 hours/mo for prompt tuning, monitoring, model updates
Total month 1	$14,126	build + 1 month services
Ongoing months	$1,626	Recurring services + API

Build accurate AI project quotes in 60 seconds.

Open AI Cost Calculator →

Sample Quote: Document Summarization Pipeline

Client: legal firm with ~500 case files per month, each averaging 50 pages.

Token cost estimation (Claude Sonnet 4 for legal accuracy):

500 × (32,000 input × $3/M + 800 output × $15/M) = $54/month direct cost
With 50% buffer: $81/month
With 2.5x markup: $202/month billed

Quote:

Item	Amount
Build (one-time)	$18,500
API usage (monthly)	$202 — capped at 700 documents
Maintenance retainer	$2,400/mo (24 hrs)
Total ongoing	$2,602/mo

Three Pricing Models for AI Agency Work

Model A — Pass-through with markup. You bill the client for actual API usage at 2-3x markup. They see exact volume monthly. Most transparent. Best for clients who want predictable per-unit pricing.

Model B — Fixed monthly with usage cap. Bundle all costs into a flat monthly fee. Cap usage at a defined level. Charge overages. Cleanest billing. Works when usage is predictable.

Model C — Volume tiers. Define 3-4 tiers ($X/mo for up to N requests, $Y/mo for up to M requests). Client picks a tier. Upgrade when they outgrow it. Easiest for clients to understand.

How to Talk About API Costs With Clients

Most clients have no idea what AI costs. Education is part of the sale. Use these talking points:

"AI APIs charge per token, similar to how phone bills used to charge per minute." Familiar analogy.
"Your costs scale with your usage — more customers means more API calls means higher bill, but also more value." Aligns cost with value.
"We picked the model that gives you the best quality at the lowest cost. We can show you the comparison." Then actually show them the calculator output.
"You pay for actual usage. There's no minimum and no per-seat fee." Differentiates from typical SaaS.
"We monitor your usage and alert you if it grows unexpectedly." Reduces fear of runaway costs.

Five Mistakes That Kill Agency AI Margins

1. Quoting "unlimited" anything. Never. You will lose. Always cap usage in the contract.

2. Absorbing token cost without monitoring. Set up alerts at 50%, 75%, 90% of budget. Pause processing at 100%.

3. Using a flagship model when a cheap model would work. The client doesn't know the difference. Use the cheap model and pocket the margin.

4. Forgetting prompt iteration costs in the build. You will iterate on the prompt 50-200 times during build. That costs money. Add 20% to your build estimate to cover it.

5. No model upgrade clause. When OpenAI deprecates a model, you have to migrate. Build a clause that lets you charge for re-tuning prompts on the new model.

The Quote Template

For every AI project quote, include:

Scope of work (specific feature, not "AI integration")
Volume assumptions (X requests/month, Y users)
Model selection rationale
API cost estimate with buffer
Markup percentage (be transparent if asked)
Usage cap and overage rate
Maintenance retainer hours and scope
Model upgrade clause

Run your specific project numbers through the AI Cost Calculator before sending any quote.

Estimate API costs for any client project across every model.