Open Scheduler

Dashboard Features Inference Pricing FAQs

Open Scheduler

Host your Inference on the cheapest rentable GPUs globally!

PRICING NAVIGATOR

Cloud Provider Spot Navigation

Open Scheduler identifies the cheapest spot GPU offerings of the largest Cloud Providers and rents them on your behalf. Keeping the compute surcharges out of the equasion.

INFERENCE CLUSTERING

Automated Inference Clustering and Loadbalancing

Effortlessly manage and scale your distributed inference clusters. It automatically load-balances your clusters, providing secure and streamlined entry points for seamless scalability.

SPOT GPU PRICING

Uncover the best global
VRAM Offerings

Discover the top global GPU offerings by exploring the best spot Virtual Machine options available today. Delve into the dynamic pricing models, regional benefits, and unique features of these spot VMs. This in-depth analysis not only helps optimize cost-efficiency but also ensures you select the most powerful GPU configurations tailored for complex inference workloads.

LLM CONFIGURATOR

Open LLM Inference Configurations

Models often demand specific GPU setups, which can either be overly expensive or insufficient for optimal performance. Identifying the right requirements can be a tedious task. With our platform, you can bring your own configurations or leverage our carefully curated and tested setups. Start running fine-tuned models or new releases seamlessly and without hassle.

QUOTA MANAGEMENT

Full transparency on your quotas

Managing which Subscriptions/Projects are allowed to rent which VMs in what regions of the world can be hard. Gain transparency by scanning your current quotas on your projects.

INFERENCE PRICING

Shape your own
Inference Pricing

Spin up OnDemand Inference Clusters, make efficient use of the rented compute and bring down inference pricing yourself. Open scheduler helps you keep a good view on spending and most importantly token throughput and inference rates.

Model

$ / 1M Tokens

Llama 3.1 70B

~ 0.40 $

Llama 3.3 70B

~ 0.48 $

Llama 3.1 Nemotron 70B

~ 0.50 $

Qwen 2.5 72B

~ 0.65 $

Athene V2 Chat 72B

~ 0.45 $

Bring your own model

Shape it yourself!

CREATE INFERENCE CLUSTERS IN SECONDS

How it works?

Step 1: Register your Cloud Client

Sign up and securely connect your cloud environment in minutes. Our intuitive setup ensures your client is ready for seamless integration.

Step 2: Pick or Create a Inference Configuration

Choose from pre-built configurations or customize your own to match your specific requirements. Tailor every detail for optimal results.

Step 3: Start generating Tokens

With everything set, begin generating tokens instantly. Enjoy fast, reliable performance to power your applications effortlessly.

Currently Supported Cloud Providers

FREQUENTLY ASKED QUESTIONS

Frequently asked questions

Looking for something else? Chat with us via [email protected] and we will try our best to help you with your questions!

Open Scheduler

[email protected]

Note:This software is currently in its beta phase as we continue to refine and enhance the experience. We appreciate your understanding and support!

Open Scheduler

PRICING NAVIGATOR

Cloud Provider Spot Navigation

INFERENCE CLUSTERING

Automated Inference Clustering and Loadbalancing

SPOT GPU PRICING

Uncover the best globalVRAM Offerings

LLM CONFIGURATOR

Open LLM Inference Configurations

QUOTA MANAGEMENT

Full transparency on your quotas

INFERENCE PRICING

Shape your ownInference Pricing

CREATE INFERENCE CLUSTERS IN SECONDS

How it works?

Step 1: Register your Cloud Client

Step 2: Pick or Create a Inference Configuration

Step 3: Start generating Tokens

FREQUENTLY ASKED QUESTIONS

Frequently asked questions

Who is Open Scheduler designed for?

What is “Cloud Provider Spot Navigation”?

How does Open Scheduler ensure low costs?

Are there any additional fees for using Open Scheduler?

How do I join Open Scheduler?

What do I need to get started?

Open Scheduler

Uncover the best global
VRAM Offerings

Shape your own
Inference Pricing