~/docs/reference/pricing.md
4,354 bytesΒ·edit on github β†’

#Pricing

Pay only for what you use. No subscriptions, no minimums. All prices are in USD per 1M tokens.

##Chat completions

ModelInputOutputCached input*Reasoning**
gob-5.5-scout$0.15$0.60$0.075$0.30
gob-5.5$2.50$10.00$1.25$5.00
gob-5.5-deep$3.50$14.00$1.75$7.00
gob-5.5-horde$5.00$15.00$2.50$7.50

\ Cached input applies when the same prefix is reused within 5 minutes (50% discount).* \\* Reasoning tokens β€” used by Goblin-of-Thought when reasoning: "visible" is set β€” are billed at 50% of the output rate.*

##Embeddings

ModelPrice per 1M tokens
gob-embed-cave-small$0.02
gob-embed-cave$0.10
gob-embed-cave-large$0.13
gob-embed-shadow$0.18

##Goblin Reward Signal API

EndpointPrice per 1M input tokens
/grs/score$0.05

Output is a single scalar β€” no output token charge.

##Cave Memory

ChargePrice
Hoard storage$0.10 / 1M hoards / month
Hoard retrieval (per call)included with chat call (no separate charge)
Hoard extraction (per session)included with last chat call

##Per-request cost calculator

Approximate cost of a typical chat completion:

ScenarioTokens (in/out)ModelCost
Short Q&A50 / 200scout$0.000128
Standard chat reply500 / 800gob-5.5$0.00925
Code generation2k / 4kgob-5.5$0.045
Multi-step research10k / 8kgob-5.5-deep$0.147
Agentic workflow with cave memory30k / 12kgob-5.5-horde$0.330

##Free tier

gob-test- keys are completely free with these caps:

ResourceCap
Chat completions10,000 / month
Embeddings1M tokens / month
GRS scoring100k tokens / month
Cave Memory storage1k hoards

Test-key responses are functionally identical to production but may be slightly delayed during peak hours.

##Volume discounts

Auto-applied based on monthly spend:

Monthly spendDiscount
$0 – $1,0000%
$1k – $10k5%
$10k – $50k10%
$50k – $250k15%
$250k+contact sales (up to 30%)

##Prepaid credits

Buy credits in advance for additional discounts:

Prepaid amountBonus
$1005%
$5008%
$2,00012%
$10,00018%

Credits never expire.

##Billing

  • β–ΈBilled in arrears at the end of each calendar month.
  • β–ΈItemized invoice with per-model and per-endpoint breakdown.
  • β–ΈAuto-charged to your payment method on the 1st.
  • β–ΈFailed payments suspend the API after 7 days. No data is deleted.

##Cost monitoring

Set hard and soft spend alerts in Console β†’ Billing. When you cross 80% of your soft alert, you get an email. When you cross your hard cap, the API starts returning no_treasure_in_hoard (429).

You can also poll usage programmatically:

bash
curl https://api.gpt-gob.ai/v1/usage?start_date=2026-05-01&end_date=2026-05-31 \
-H "Authorization: Bearer $GOB_API_KEY"

##What's included free

  • β–ΈAll API key management
  • β–ΈCave Memory infrastructure (storage charges separate)
  • β–ΈAll SDKs and tooling
  • β–ΈStandard support (community Discord + email)
  • β–ΈStatus page + incident notifications

##What costs extra

  • β–ΈDedicated capacity (enterprise)
  • β–ΈVPC/private deployment
  • β–Έ24/7 phone support
  • β–ΈCustom fine-tuning
  • β–ΈOn-prem deployment