gpubox.ai

Pricing — Nigeria

Pay only for what you use, in Naira.

Pay-as-you-go. Top up via Paystack with your Nigerian card. Every API call deducts in kobo against a credit balance. No subscription, no monthly minimum, no GPU-hour math.

Chat completions (LLM)

qwen2.5-32b-instruct

₦2,150per 1M tokens

£1.00

Blended rate — input + output tokens charged at the same rate. No separate prompt/completion pricing.

Speech-to-text

whisper-large-v3-turbo

₦10.75per audio minute

£0.005

Billed on actual audio duration (not file size). 6-second clip = 0.1 minutes ≈ ₦1.08. Strong on English, Yoruba, Igbo, Hausa.

Embeddings

bge-m3

₦107.50per 1M tokens

£0.05

Multilingual 1024-dim dense vectors via BGE-M3. Token counts come from the model's own tokenizer for accuracy.

FX reference: 2,150 NGN / £1. Naira rates above are for display; actual conversion at top-up time uses a live rate. Charged via Paystack; no Nigerian VAT applies (digital service supplied by a UK-incorporated counterparty).

Private beta access

Sign up with your email — we issue a starter API key on first sign-in, no card required. Beta credit covers your first weeks of tinkering. After that, top up from ₦1,000 via Paystack.

Sign up free →

Enterprise / regulated

Dedicated capacity, signed DPA (NDPR-aligned processor terms on request), audit-log retention, named-subprocessor disclosure. For banks, fintech, telco, public-sector buyers.

Read sovereignty →

Model Hosting

Coming with Factory

Once you fine-tune a model with us, it stays with us — callable, durable, optionally always-warm. Hosting is a separate subscription on top of inference, also Naira-quoted. Three tiers:

Cold

30–90s warm-up

₦1,075

per GB / month

Stored in UK object storage. Loads to GPU on first call after idle. No minimum — pay for what you store.

Warm

2–5s cold start

Flat ₦/mo

per active model

Kept hot on local SSD; loads to VRAM on demand. Per-call inference billed at the standard rate above.

Always-hot

0s — permanent VRAM

Flat ₦/mo

reserves a VRAM slot

Permanently loaded into GPU memory. For latency-sensitive production traffic. Available once second GPU lands.

Email [email protected] for an enterprise quote.

What's included at every price

  • Hardware located and operated in the United Kingdom
  • OpenAI-compatible API surface (drop-in for any OpenAI SDK)
  • Top-up in Naira via Paystack (cards, bank transfer, USSD)
  • Per-call audit log retained for 30 days minimum
  • Streaming responses on chat completions (SSE)
  • Token-level usage metering visible in dashboard