Documentation Index

Fetch the complete documentation index at: https://kb.expedient.com/llms.txt

Use this file to discover all available pages before exploring further.

AI CTRL Ratecard

Prev Next

★ Standard Model

Universal Credits powered by GPT-5 mini
100
Credits
=
117.65 M
Tokens
Based on standard usage patterns with a 9:1 input-to-output ratio. Credits work across all models — token allocations vary by model capability.
OpenAIAnthropicGooglePerplexity⬆ More tokens than GPT-5 mini
O

OpenAI

GPT-5, GPT-4.1, GPT-4o, and o-series reasoning models
ModelInput Tokens / 100 CROutput Tokens / 100 CR
GPT-5 miniStandard
General chat, analysis, and light coding
200M25M
GPT-4.1
Strong coding and long-context instruction following
25M6.25M
GPT-4o
Vision and general multimodal assistants
20M5M
GPT-4o mini
Budget multimodal chat, vision, and simple extraction
333.33M83.33M
GPT-5 nano
Summaries, tagging, and high-volume triage
1B125M
GPT-5 Pro
Deep analysis and planning with extended thinking
3.33M416.67K
GPT-5.1
Everyday coding and agent-style tasks
40M5M
GPT-5.1 Codex
Economical coding, implementation, and code review
40M5M
GPT-5.2
Coding, tool use, and multimodal workflows
28.57M3.57M
GPT-5.2 Codex
IDE-style edits, debugging, and repo-aware changes
28.57M3.57M
GPT-5.2 Pro
Very demanding reasoning, long-form analysis, complex decisions
2.38M297.62K
GPT-5.3 Codex
Agentic coding, refactors, and multi-file software work
28.57M3.57M
GPT-5.4
Coding, agents, math, and broad analysis
20M3.33M
GPT-5.4 Mini
Quick iterations, lighter coding, and tool-heavy workflows
66.67M11.11M
GPT-5.4 Nano
Snippets, small fixes, classification, and routing
250M40M
GPT-5.4 Pro
Hardest problems, maximum precision, long-running work
1.67M277.78K
o3
STEM, logic, and careful step-by-step analysis
25M6.25M
o3 Deep Research
Long reports, due diligence, and literature-style synthesis
5M1.25M
o4-mini
Cost-efficient math, logic, and structured reasoning
45.45M11.36M
o4-mini Deep Research
Faster, more affordable deep research model
25M6.25M
A

Anthropic

Claude Opus, Sonnet, and Haiku model families
ModelInput Tokens / 100 CROutput Tokens / 100 CR
Claude Haiku 3
Maximum economy and minimum latency at high scale
200M40M
Claude Haiku 4.5
High-volume assistants and chat with strong answer quality
50M10M
Claude Opus 4
Broad flagship-quality coding, analysis, and writing
3.33M666.67K
Claude Opus 4.1
Premium coding, instruction following, and high-stakes work
3.33M666.67K
Claude Opus 4.5
Elite reasoning, research depth, and strategic analysis
10M2M
Claude Opus 4.6
Hardest agents, complex coding, and very long documents
10M2M
Claude Sonnet 4
Implementation, documentation, and general software tasks
16.67M3.33M
Claude Sonnet 4.5
Product engineering, coding, and long-context sessions (up to 1M with opt-in context)
16.67M3.33M
Claude Sonnet 4.6
Coding, agent planning, and drafting with strong long-context reasoning
16.67M3.33M
G

Google Gemini

Gemini Pro, Flash, and Flash-Lite models
ModelInput Tokens / 100 CROutput Tokens / 100 CR
Gemini 2.0 Flash
Balanced multimodal assistant
500M125M
Gemini 2.0 Flash-Lite
Simple input–output and minimum spend per request
666.67M166.67M
Gemini 2.5 Flash
Chat, summarization, and extraction at large scale
166.67M20M
Gemini 2.5 Flash-Lite
Routing, translation, and extremely high request volume
500M125M
Gemini 2.5 Pro
Complex code, deep analysis, and very long context
40M5M
Gemini 3 Flash
Fast multimodal work, thinking-style tasks, large context
100M16.67M
Gemini 3 Pro
Complex analysis, coding, and multimodal assistants (preview)
25M4.17M
Gemini 3.1 Flash-Lite
High-throughput tasks where cost and speed both matter
200M33.33M
Gemini 3.1 Pro
Agents, coding, and multimodal reasoning (preview)
25M4.17M
P

Perplexity

Sonar search-grounded models
ModelInput Tokens / 100 CROutput Tokens / 100 CR
Sonar
Web-grounded answers
50M50M
Sonar Deep Research
Long reports, due diligence, and literature-style synthesis
25M6.25M
Sonar Pro
Multi-step research questions, broad sources, nuanced synthesis
16.67M3.33M
Sonar Reasoning
Structured analysis grounded in live search results
50M10M
Sonar Reasoning Pro
Deep multi-step reasoning over retrieved web evidence
25M6.25M

Understanding This Rate Card

  • Higher-capability models consume more credits per token
  • Efficient models stretch your credits further
  • All values based on 100 Credits
  • GPT-5 mini is the baseline standard model
  • Green values indicate more tokens than GPT-5 mini
  • Rates are subject to change by providers
  • Actual usage varies based on conversation patterns
© 2026 Expedient. All rights reserved.  |  AI CTRL Platform Documentation
Page last updated: 6/9/2026, 6:00:07 AM ET