GPT-OSS Now Live on Core42 AI Cloud via Compass
Unmatched Performance. Seamless Scalability. Silicon Freedom.

Why Use Core42 Compass for gpt-oss models?
Core42 Compass gives you choice with no lock-in. Deploy sovereign AI solutions at unmatched speed on the silicon that suits your workloads.
Get Started Now
Unmatched Performance - Hit world-record inference speed of 3,000 tokens/second/user on Cerebras clusters, for real-time AI at scale.
Silicon Freedom - Choose from NVIDIA, Microsoft Azure, Qualcomm, AMD, or Cerebras, and align each workload with the optimal accelerator.
Sovereign AI - All UAE-hosted deployments run under UAE jurisdiction. Benefit from locally optimized infrastructure with global-class performance and full regulatory compliance.
Instant Access - Live today through the Core42 Compass API, with flexible pricing, custom integrations, and enterprise-grade support from day one.
Choose Your Silicon
The broadest AI hardware choice in the market, giving you the power to match every workload with the optimal accelerator.
Competitive Pricing for gpt-oss Models
Simple, transparent pricing across five silicon platforms that scales with your usage.
| Model/Deployment | Silicon | Input ($/M tokens) | Output ($/M tokens) |
|---|---|---|---|
| gpt-oss-120b | Cerebras | 0.25 | 0.69 |
| gpt-oss-120b | NVIDIA HGX | 0.30 | 0.75 |
| gpt-oss-120b | Microsoft Azure | 0.40 | 0.95 |
| gpt-oss-20b | NVIDIA HGX | 0.20 | 0.60 |
| gpt-oss-120b | AMD | 0.30 | 0.75 |
| gpt-oss-20b | AMD | 0.20 | 0.60 |
| gpt-oss-120b | Qualcomm | 0.15 | 0.37 |
| gpt-oss-20b | Qualcomm | 0.10 | 0.30 |
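To make the pricing concrete, here is a minimal Python sketch that estimates the cost of a workload from the rates listed above. It assumes "$/M" means US dollars per one million tokens. The `PRICES` table and the `estimate_cost` helper are illustrative names, not part of any Core42 SDK.

```python
# Illustrative cost estimator. Rates are copied from the pricing list above
# and are assumed to be USD per 1 million tokens: (input, output).
PRICES = {
    ("gpt-oss-120b", "Cerebras"): (0.25, 0.69),
    ("gpt-oss-120b", "NVIDIA HGX"): (0.30, 0.75),
    ("gpt-oss-120b", "Microsoft Azure"): (0.40, 0.95),
    ("gpt-oss-20b", "NVIDIA HGX"): (0.20, 0.60),
    ("gpt-oss-120b", "AMD"): (0.30, 0.75),
    ("gpt-oss-20b", "AMD"): (0.20, 0.60),
    ("gpt-oss-120b", "Qualcomm"): (0.15, 0.37),
    ("gpt-oss-20b", "Qualcomm"): (0.10, 0.30),
}


def estimate_cost(model: str, silicon: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one workload (hypothetical helper)."""
    input_price, output_price = PRICES[(model, silicon)]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Example: 2M input tokens and 500k output tokens on gpt-oss-20b / Qualcomm.
    cost = estimate_cost("gpt-oss-20b", "Qualcomm", 2_000_000, 500_000)
    print(f"Estimated cost: ${cost:.2f}")
```

Running the same token counts against each entry in `PRICES` is a quick way to compare silicon options for a given workload.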
Set custom rate limits that work for you: adjust requests per minute, token volumes, and throughput quotas to match your application's unique demands.
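On the client side, an application can stay under a requests-per-minute quota with a simple sliding-window throttle. The sketch below is one possible approach, not a Compass feature. The class name `RequestsPerMinuteLimiter` and its parameters are hypothetical, and the actual limits would come from your own account configuration.

```python
import time
from collections import deque


class RequestsPerMinuteLimiter:
    """Client-side sliding-window throttle (hypothetical helper, not a Compass API)."""

    def __init__(self, max_requests: int, window_seconds: float = 60.0, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock          # injectable so the limiter can be tested without sleeping
        self._sent = deque()        # timestamps of requests inside the current window

    def try_acquire(self) -> bool:
        """Return True and record the request if it fits in the quota, else False."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self._sent and now - self._sent[0] >= self.window_seconds:
            self._sent.popleft()
        if len(self._sent) < self.max_requests:
            self._sent.append(now)
            return True
        return False


if __name__ == "__main__":
    limiter = RequestsPerMinuteLimiter(max_requests=2)
    for i in range(3):
        print(i, "allowed" if limiter.try_acquire() else "throttled")
```

A caller would check `try_acquire()` before each request and back off or queue the request when it returns `False`.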
Ready to Get Started?
Sign up to Core42 Compass and get started in minutes!
Talk to Sales
Our team can tailor a plan to meet your needs.





