AI Agent Cost Calculator

Calculate real operational costs for AI agents. Compare models, estimate API costs, and understand the economics of running AI systems at scale.

Technology

Preset Templates

LLM Model

Parameters

User Message Length

75 tokens

750

1,500

2,250

3,000

Agent Response Length

250 tokens

1,250

2,500

3,750

5,000

System Prompt Length

1,000 tokens

2,500

5,000

7,500

10,000

Context Length (RAG)

3,000 tokens

5,000

10,000

15,000

20,000

Conversation Length

5 messages

100

Enable Conversation History Summarization

Cost Breakdown

$0.0009/message

$0.0043/conversation

Cost breakdown (per message)

System Prompt

$0.0002(17%)

Context (RAG)

$0.0004(52%)

User Message

$0.0000(1%)

Agent Response

$0.0002(17%)

Message History

$0.0001(11%)

Model Specifications

Input Cost

$0.15/1M tokens

Output Cost

$0.60/1M tokens

Speed

200 tokens/s

Intelligence Index

Context Window

128K tokens

TTFT

450ms

Calculated using GPT-4o mini

Want to know the full cost of your AI agent?

API pricing doesn't cover development, testing, infrastructure, or optimization. Share what you're building and we'll map the complete cost.

Built with♥by

Complete LLM Model Comparison

Compare pricing, performance, and capabilities across 29+ LLM models from OpenAI, Anthropic, Google, DeepSeek, Meta, and xAI. Use this comprehensive comparison to choose the best model for your AI agent based on cost, speed, intelligence, and reasoning capabilities.

Model	Provider	Input Price (1M tokens)	Output Price (1M tokens)	Speed (tokens/sec)	Intelligence (index)	Context Window (tokens)	TTFT (ms)	Reasoning	Max Thinking (tokens)	Link
GPT-4.1	OpenAI	$2.00	$8.00	38	53	1,000,000	850	—	—	Pricing
GPT-4.1 mini	OpenAI	$0.40	$1.60	940	65	1,000,000	950	—	—	Pricing
GPT-4o	OpenAI	$5.00	$15.00	191.3	40	128,000	400	—	—	Pricing
GPT-4o mini	OpenAI	$0.15	$0.60	200	40.24	128,000	450	—	—	Pricing
GPT-5	OpenAI	$1.25	$10.00	126.2 (126.2 thinking)	44 (68 thinking)	400,000	460 (15,000 thinking)	✓	128,000	Pricing
GPT-5 mini	OpenAI	$0.25	$2.00	70.1 (170 thinking)	61 (64 thinking)	400,000	29,820 (14,600 thinking)	✓	128,000	Pricing
GPT-5 nano	OpenAI	$0.05	$0.40	219 (219 thinking)	29 (51 thinking)	400,000	900 (900 thinking)	✓	128,000	Pricing
Claude 3.7 Sonnet	Anthropic	$3.00	$15.00	75 (81 thinking)	50 (57 thinking)	200,000	1,690 (1,690 thinking)	✓	128,000	Pricing
Claude 4 Opus	Anthropic	$15.00	$75.00	46.1	54	200,000	1,590	—	—	Pricing
Claude 4.1 Opus	Anthropic	$15.00	$75.00	44.3	45	200,000	2,820	—	—	Pricing
Claude 4.5 Haiku	Anthropic	$1.00	$5.00	150 (150 thinking)	42 (55 thinking)	200,000	500 (500 thinking)	✓	128,000	Pricing
Claude 4.5 Sonnet	Anthropic	$3.00	$15.00	63 (63 thinking)	49 (61 thinking)	200,000	1,800 (1,800 thinking)	✓	64,000	Pricing
Claude Sonnet 4	Anthropic	$3.00	$15.00	57.5 (55 thinking)	57 (59 thinking)	1,000,000	1,430 (1,430 thinking)	✓	64,000	Pricing
Gemini 2.0 Flash	Google	$0.10	$0.40	165.9	34	1,000,000	350	—	—	Pricing
Gemini 2.0 Flash Lite	Google	$0.10	$0.40	470	30	1,000,000	400	—	—	Pricing
Gemini 2.5 Flash	Google	$0.30	$2.50	375.6 (375.6 thinking)	38 (60 thinking)	1,000,000	8,350 (8,350 thinking)	✓	24,576	Pricing
Gemini 2.5 Flash Lite	Google	$0.10	$0.40	470	48	1,000,000	1,000	—	—	Pricing
DeepSeek-R1	DeepSeek	$0.56	$1.68	45	68	128,000	500	✓	64,000	Pricing
DeepSeek-V3	DeepSeek	$0.56	$1.68	18.6	44	130,000	2,760	—	—	Pricing
DeepSeek-V3.1	DeepSeek	$0.27	$1.00	180	60	130,000	3,000	—	—	Pricing
DeepSeek-V3.1 Terminus	DeepSeek	$0.23	$0.90	200	46	130,000	2,900	—	—	Pricing
Llama 3.1 405B	Meta	$3.75	$3.75	30.4	26	128,000	770	—	—	Pricing
LLaMA 3.3 70B	Meta	$0.59	$0.99	276	74	128,000	400	—	—	Pricing
Llama 4 Maverick	Meta	$0.24	$0.77	124	50	1,000,000	340	—	—	Pricing
Llama 4 Scout	Meta	$0.15	$0.40	120	43	10,000,000	390	—	—	Pricing
Grok-3	xAI	$3.00	$15.00	27.2	36	1,000,000	1,500	—	—	Pricing
Grok-4	xAI	$3.00	$15.00	75	73	2,000,000	1,340	—	—	Pricing
Grok-4 Fast	xAI	$0.20	$0.50	344	60	2,000,000	2,550	—	—	Pricing

TTFT: Time to First Token (latency before model starts generating)

TTFAT: Time to First Answer Token (latency before model starts answering, after reasoning)

Intelligence Index: Relative capability score based on benchmark performance (higher is better)

Thinking/Reasoning: Models that show their reasoning process before answering (values in parentheses show performance when reasoning is enabled)

All pricing is per million tokens. Performance metrics are approximate and may vary based on query complexity and API load.

Last updated: November 5, 2025