First Rubin R100 NVL72 Cloud Provider

The AI Supercloud Built for Scale

Next-gen NVIDIA Rubin R100 NVL72 + Groq LPX real-time inference. $0.048/kWh energy. Zero taxes in SEZ.

GPU Access from July 2027

Join the waitlist to reserve $500 in free credits and get early access pricing.

8 MW Power
$0.048/kWh Energy
<10 ms Latency
0% Tax in SEZ

Why Leading Companies Choose Qube Compute

Next-Gen Performance

NVIDIA Vera Rubin R100 NVL72: about 5x the throughput of H100. 1,400+ PetaFLOPS FP4 per rack. NVLink 6.0 fabric.

Unmatched Economics

$0.048/kWh gas-powered energy — 3x cheaper than AWS/Azure. ABHM absorption cooling with PUE 1.10. 0% taxes in SEZ.

Sharia-Compliant

The world's only AFSA-certified halal GPU cloud. Mudaraba structure. Access to $4.5 trillion in Islamic sovereign wealth funds.

Developer-First API

Deploy GPUs in Seconds

Full REST API, Python SDK, Node.js SDK, and CLI. Spin up NVIDIA Rubin R100 NVL72 instances or call Groq LPX inference with a single command.

1. Get API Key
   Sign up and generate your API key in the dashboard.
2. Launch Instance
   Choose GPU type, count, and container image.
3. Train or Infer
   SSH in for training, or use the Groq API for <10 ms inference.

import qube

client = qube.Client(api_key="your-api-key")

# Launch a GPU instance
instance = client.instances.create(
    gpu_type="rubin-r100-nvl72",
    gpu_count=8,
    image="nvidia/pytorch:24.04",
    region="almaty-sez"
)

print(f"Instance {instance.id} is {instance.status}")
print(f"SSH: ssh root@{instance.ip}")

# Run inference with Groq LPX
response = client.inference.create(
    model="llama-3.1-70b",
    messages=[{"role": "user", "content": "Hello!"}],
    max_tokens=512
)
print(response.choices[0].message.content)
# Latency: 8ms

api.qubecompute.com
SDK Access: Private Beta for LOI Signatories

Five Unbeatable Advantages

No other cloud provider combines these strengths

18-24 months monopoly

First Rubin R100 NVL72 Cloud Provider

No NVL72 deployments in Kazakhstan, Uzbekistan, or Kyrgyzstan. The nearest competitors with Rubin are in Finland (Nebius) and the UAE (G42). 18-24 month exclusive window.

$0.048/kWh

3x Cheaper Energy

Gas-powered generation at $0.048/kWh vs $0.09-0.18 at Western competitors. ABHM absorption cooling, PUE 1.10.

$4.5 trillion addressable market

Only Sharia-Compliant GPU Cloud

AFSA/AIFC certified Mudaraba structure. Access to PIF ($925B), QIA ($475B), ADIA ($993B) sovereign wealth funds.

0% taxes until 2029

SEZ Tax Exemptions

0% corporate tax, VAT, and personal income tax in the SEZ PIT Alatau. Backed by government investment contract guarantees.

<10 ms latency

Groq LPX Real-Time Inference

Sub-10ms LLM inference API globally. Unique Rubin + Groq combination for training + real-time serving.
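
As a rough illustration of the energy claim in this section, the sketch below turns a price per kWh into an annual electricity bill. It assumes, hypothetically, that the 8 MW figure is total facility draw running at full load all year (8,760 hours); the page does not state this, and the function and variable names are illustrative only.

```python
HOURS_PER_YEAR = 24 * 365  # 8,760 h; constant full load is an assumption


def annual_energy_cost(facility_mw: float, price_per_kwh: float) -> float:
    """Annual electricity cost in dollars for a facility drawing facility_mw."""
    return facility_mw * 1000 * HOURS_PER_YEAR * price_per_kwh


def it_load_mw(facility_mw: float, pue: float) -> float:
    """Power left for IT equipment, since PUE = facility power / IT power."""
    return facility_mw / pue


FACILITY_MW = 8.0  # from the page; treated here as total facility power (assumption)
PUE = 1.10         # from the page

# Prices from the page: $0.048/kWh vs the quoted $0.09-0.18/kWh range.
for label, price in [("Qube", 0.048), ("Western low", 0.09), ("Western high", 0.18)]:
    print(f"{label:>12}: ${annual_energy_cost(FACILITY_MW, price):,.0f} per year")

print(f"IT load at PUE {PUE}: {it_load_mw(FACILITY_MW, PUE):.2f} MW")
```

Under these assumptions the bill comes to roughly $3.4M a year at $0.048/kWh, against about $6.3M to $12.6M at $0.09-0.18/kWh.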

Performance

Benchmark Comparisons

NVIDIA Rubin R100 NVL72 delivers up to 5x more performance per dollar compared to H100. Combined with Groq LPX for inference — unmatched speed and efficiency.

LLaMA 3.1 70B Training

Time to train (1T tokens)
Rubin R100 NVL72: ~3 days
H100 SXM (8×): ~15 days
A100 SXM (8×): ~38 days

Inference Throughput

Tokens/sec (LLaMA 70B)
Groq LPX: ~3,000 tok/s
Rubin R100: ~800 tok/s
H100 TensorRT: ~350 tok/s
A100: ~120 tok/s

Memory Bandwidth

Per rack
Rubin R100 NVL72: 468 TB/s
GB200 NVL72: ~380 TB/s
H100 SXM (8×): 26.4 TB/s

FP4 Performance

Per rack
Rubin R100 NVL72: 1,400+ PetaFLOPS
GB200 NVL72: ~720 PetaFLOPS
H100 SXM (8×): ~16 PetaFLOPS

* Benchmark estimates based on NVIDIA published specifications and industry testing. Actual performance may vary by workload. Rubin R100 NVL72 specs from NVIDIA GTC 2025 announcements.
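
To make the benchmark estimates easier to compare, the sketch below derives ratios relative to H100 from the training-time and throughput figures listed above. The numbers are copied from this page as-is; the `relative_to` helper is a hypothetical name introduced only for illustration.

```python
# Figures copied from the benchmark estimates on this page.
train_days = {"Rubin R100 NVL72": 3, "H100 SXM (8x)": 15, "A100 SXM (8x)": 38}
tokens_per_sec = {"Groq LPX": 3000, "Rubin R100": 800, "H100 TensorRT": 350, "A100": 120}


def relative_to(values: dict, baseline: str, higher_is_better: bool) -> dict:
    """Return each entry's speedup factor over the baseline entry."""
    base = values[baseline]
    return {
        name: round(v / base if higher_is_better else base / v, 1)
        for name, v in values.items()
    }


# Shorter training time is better, so the ratio is baseline / value.
print(relative_to(train_days, "H100 SXM (8x)", higher_is_better=False))
# Higher throughput is better, so the ratio is value / baseline.
print(relative_to(tokens_per_sec, "H100 TensorRT", higher_is_better=True))
```

Note that the training rows compare a 72-GPU NVL72 rack with 8-GPU nodes, so those ratios are per system rather than per GPU.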

Save Up to 70%

See How Much You'll Save

Compare Qube Compute with major GPU cloud providers. Our Rubin R100 + low-cost energy = unbeatable economics.

Provider | GPU | Cost per R100-equivalent task | Monthly Cost | Annual Cost | You Save
Qube Compute (Best Price) | Rubin R100 | $14 | $80,640 | $967,680 | -
AWS (p5.48xlarge) | H100 | $163.85 ($32.77 x 5 hrs) | $943,776 | $11,325,312 | 91% ($10,357,632/yr)
Azure (ND H100 v5) | H100 | $136.00 ($27.20 x 5 hrs) | $783,360 | $9,400,320 | 90% ($8,432,640/yr)
GCP (a3-highgpu-8g) | H100 | $156.10 ($31.22 x 5 hrs) | $899,136 | $10,789,632 | 91% ($9,821,952/yr)
CoreWeave | H100 | $23.80 ($4.76 x 5 hrs) | $137,088 | $1,645,056 | 41% ($677,376/yr)
Lambda Cloud | H100 | $17.45 ($3.49 x 5 hrs) | $100,512 | $1,206,144 | 20% ($238,464/yr)

* Performance-adjusted: R100 delivers ~5x the throughput of H100. Competitor cost = hourly rate x 5 hours to match 1 hour of R100. Prices from public listings, 2025.
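
The performance-adjusted arithmetic in the footnote can be checked with a short sketch. The hourly rates and the 5x factor come from the table; the 8-GPU, 24 hours/day, 30-day-month basis is an assumption inferred from the fact that it reproduces the listed monthly and annual totals, and the names below are illustrative rather than part of any Qube SDK.

```python
from decimal import Decimal

SPEEDUP = 5                 # footnote: R100 delivers ~5x the throughput of H100
GPUS = 8                    # assumed; reproduces the table's monthly totals
HOURS_PER_MONTH = 24 * 30   # assumed; 24 h/day over a 30-day month
QUBE_COST = Decimal("14")   # $ per R100-equivalent task (one R100 hour)

# Listed H100 hourly rates, in dollars.
H100_RATES = {
    "AWS (p5.48xlarge)": Decimal("32.77"),
    "Azure (ND H100 v5)": Decimal("27.20"),
    "GCP (a3-highgpu-8g)": Decimal("31.22"),
    "CoreWeave": Decimal("4.76"),
    "Lambda Cloud": Decimal("3.49"),
}


def monthly_cost(cost_per_task: Decimal) -> Decimal:
    """Monthly cost of running GPUS tasks continuously."""
    return cost_per_task * GPUS * HOURS_PER_MONTH


def compare(provider: str) -> dict:
    """Performance-adjusted cost of one provider, plus savings vs Qube."""
    task = H100_RATES[provider] * SPEEDUP  # 5 H100 hours per R100 hour
    annual = monthly_cost(task) * 12
    qube_annual = monthly_cost(QUBE_COST) * 12
    return {
        "task": task,
        "monthly": monthly_cost(task),
        "annual": annual,
        "saved_per_year": annual - qube_annual,
        "saved_pct": round((1 - QUBE_COST / task) * 100),
    }


for name in H100_RATES:
    row = compare(name)
    print(f"{name}: ${row['task']} per task, ${row['annual']:,.0f}/yr, save {row['saved_pct']}%")
```

Changing `GPUS` or `HOURS_PER_MONTH` rescales the monthly and annual columns, while the percentage saved depends only on the per-task costs.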

Ready to save 3x on GPU compute?

Client Pipeline

Enterprise Demand Already Building

Active LOIs and discussions with enterprises across 4 sectors. Phase 1 capacity (8 racks) is being allocated to anchor clients.

3+ LOIs in pipeline
576+ GPUs requested
$8M+ Annual contract value
4 Industry sectors

Active Client Pipeline (names under NDA)

Sector | Region | Use Case | GPU Capacity | Status
Financial Services | Middle East | Anti-fraud ML, risk scoring | Multi-rack | LOI In Progress
Energy & Industrial | Eurasia | Predictive analytics, optimization | Dedicated rack | LOI In Progress
Government & Public Sector | Eurasia | National AI platform, NLP | Multi-rack cluster | In Discussion
Healthcare & Life Sciences | Middle East | Drug discovery, diagnostics | Dedicated rack | In Discussion

Client names protected under NDA. Details available under investor NDA upon request.

AI Solutions for Every Industry

Purpose-built infrastructure for your most demanding workloads

Oil & Gas AI

Seismic analysis, predictive maintenance, and well optimization for KMG, TCO, Karachaganak.

Financial AI

Anti-fraud ML, algorithmic trading signals, and risk scoring for banks and fintech.

Government AI

National digitalization projects, NLP for public services, and smart city infrastructure.

Healthcare AI

Real-time diagnostics, drug discovery, and molecular dynamics, with Groq LPX inference in under 10 ms.

Technology & Ecosystem Partners

Built on partnerships with global leaders in AI infrastructure, finance, and compliance

NVIDIA: GPU Partner
Groq: Inference Partner
AIFC: Legal Framework
Al Hilal Bank: Escrow Partner
ComplyAdvantage: KYC/AML
Astana Hub: Tech Ecosystem

Built for Every Industry

Serving enterprises globally — from AI startups to Fortune 500

AI Startups
Fortune 500
Sovereign Wealth Funds
Fintech
Healthcare AI
Oil & Gas
Research Labs
Government AI

Ready to Scale Your AI?

Limited Phase 1 capacity — 8 racks available. Reserve now to lock in anchor pricing.
