Try our new intelligent model routing solution, Arcee Conductor. Sign up today and get a $20 credit.

Introducing Arcee Conductor

A new standard for intelligent model routing.

Conductor intelligently routes your prompt to the best model, to efficiently deliver precise results, for any task.

Get started with $20 free credit.

Try now

Today’s AI is about more than the single best large language model (LLM)

With new AI models released daily, it's hard to keep up with which model is best for your business. Our pioneering work in small language models (SLMs) gives us unique insights into which model is the right one for your tasks or queries.

That's why we built Arcee Conductor, a model-agnostic platform that gives you access to a complete suite of top-performing SLMs, as well as other industry-leading LLMs.

Arcee Conductor intelligently routes your query to the optimal model based on factors like industry/specialty, complexity, efficiency, and cost–all in an easy-to-use interface that requires no technical expertise.

How Arcee Conductor works

Arcee Conductor takes intelligent model routing to a new level

Precise, efficient routing

Automatically routes your prompt or query based on complexity, type of task, industry or domain, language, and whether it involves tool / function calling.

Slash your AI spend

Reduce cost-per-prompt by over 99% compared to when you use just  a single premium LLM. Save on routine prompts that you may be currently  overpaying with a single LLM. Gain real-time visibility into your model spending per prompt.

Routes to small, large models

Based on model router classifications, your prompt is routed to one of the language models in Arcee Conductor. Available models include Arcee SLMs plus the latest models from leading providers like OpenAI, Anthropic, and Deepseek.

Advanced query processing

Supports chain-of-thought reasoning for enhanced analytical capabilities, and automatic compute scaling for task-specific optimization.

One API across all your AI touchpoints

Directly invoke Arcee Conductor via an API. Switch a few parameters in the OpenAI-compatible endpoint to use model routing without having to rebuild infrastructure.

Customizable model settings

User-defined preferences for routing parameters and model prioritization, with preset profiles for different tasks.

The right model for your prompt, every time

Some tasks can require an LLM, but over 80% of the tasks that companies use an LLM for can be handled with equal accuracy by an SLM.

Arcee Conductor's intelligent routing gives you cost efficiencies with optionality. 

Conductor analyzes each prompt based on complexity, domain, and cost, then dynamically routes it to the optimal model, large or small, to maximize cost efficiency without compromising performance.

 Featured Arcee SLMs included with Conductor are showcased below. You can also to Conductor to explore our complete model catalog.

See complete model catalog

Modes

“Auto” utilizes Arcee AI's intelligent model router to route prompts to the most optimal and efficient language model based on task and domain complexity

For nuanced questions that require more than a surface-level reply, select “Auto Reasoning”, so your prompt will be directed only to a reasoning model within Conductor, best suited for the request.

To manage tool integrations across models, use “Auto Tools” mode, for function-calling models within Conductor, where the best tool-calling model is selected for your query based on its complexity.

Pricing

To unlock additional options–like volume discounts, custom model configuration, and dedicated SLAs–contact Sales to learn about our  Enterprise Tier.

Mode: AUTO
Model
Price per Million Tokens (I/O)
Anthropic
Claude 3.7 Sonnet
$3.00 / $15.00
OpenAI
GPT-4.1
$2.00 / $8.00
Arcee AI
Virtuoso-Large
$0.75/ $1.20
Arcee AI
Virtuoso-Medium
$0.50 / $0.80
Arcee AI
Arcee-Blitz
$0.45 / $0.75
Mode: AUTO REASONING
Model
Price per Million Tokens (I/O)
Anthropic
Claude 3.7 Sonnet-Thinking
$3.00 / $15.00
OpenAI
OpenAI o3-mini
$1.10 / $4.40
DeepSeek
DeepSeek-R1
$3.00 / $7.00
Arcee AI
Maestro
$0.90 / $3.30
Mode: AUTO TOOLS
Model
Price per Million Tokens (I/O)
Anthropic
Claude 3.7 Sonnet
$3.00 / $15.00
OpenAI
GPT-4.1
$2.00 / $8.00
Arcee AI
Caller-Large
$0.55 / $0.85

Frequently
Asked Questions

Why is model routing important for your business?

Currently, companies face significant challenges:

  • Foundation model performance varies significantly and in unintuitive ways
  • High-margin AI products are creating financial pressure on traditional SaaS businesses
  • Teams cannot realistically track which model performs best for each specific prompt.

While selecting the best model for your task improves response quality, that isn't enough – on its own – to solve these challenges completely. This is why intelligent model routing is essential. It automatically selects the optimal model for each prompt, helping you to reclaim your profit margins without sacrificing quality. 

What exactly is Arcee Conductor?

Arcee Conductor is an intelligent model routing platform that directs each input to its ideal AI model based on complexity, domain, cost, and other requirements. By dynamically routing between large language models (LLMs) and small language models (SLMs), Conductor maximizes cost efficiency without compromising performance. You get the right model for each prompt, every time. 

What models power Arcee Conductor?

In Arcee Conductor, your prompt is automatically routed to the most suitable model through an advanced routing mechanism. 

Available models  include purpose-built Arcee SLMs  for specific tasks  alongside the latest models from leading  providers like OpenAI, Anthropic, and Deepseek.

Log in to Conductor to see the full list of our current model offering.

What is a mode on Arcee Conductor? 

A mode on Arcee Conductor is a selection of specialized AI models grouped by functionality. Each mode serves a different purpose, giving you the flexibility to use the most suitable approach for your task, whether for general inquiries, complex reasoning, or function-calling.

What modes are available on Arcee Conductor?

  • Auto Mode: Intelligently route a prompt to the most suitable general-purpose model based on complexity and efficiency. Ideal for everyday questions and tasks.
  • Auto Reasoning: Automatically selects the optimal reasoning model for complex prompts that need problem breakdown into smaller steps. Ideal for tasks requiring detailed, step-by-step solutions.
  • Auto Tools: Provides models specifically designed for function-calling, allowing integration with external systems and workflow execution.

Each mode serves a specific purpose to deliver the most effective results for your particular needs.

How do I get started with Arcee Conductor?

You can sign up here to begin using Arcee Conductor today.  First-time users receive $20 in credits (equivalent to approximately 400 million tokens) towards your Conductor usage. 

How does Arcee Conductor's billing work?

Arcee Conductor is based on a usage-based pricing model. When you sign up for Conductor for the first time,  you'll receive $20 in credits automatically. This allows you to immediately start using the platform.

After you've used your $20 in credits, charges will be applied to your payment method based on your usage. The specific rates vary depending on specific rates per token of models. 

What are the specific rates per token in Arcee Conductor?

                      Model

Input Tokens
(per million tokens)

Output Tokens
(per million tokens)

Arcee-Blitz 
(General Purpose)

$0.45

$0.75

Virtuoso-Medium
(General Purpose)

$0.50

$0.80

Virtuoso-Large
(General Purpose)

$0.75

$1.20

Maestro
(Reasoning)

$0.90

$3.30

Caller-Large
(Function Calling)

$0.55

$0.85

Claude 3.7 Sonnet

$3.00

$15.00

GPT-4.1

$2.00

$8.00

DeepSeek-R1

$3.00

$7.00

OpenAI o3-mini

$1.10

$4.40

Industry-leading SLMs & LLMs with unified API access, inference, and intelligent routing–
all in one solution.

First-time users of Arcee Conductor can get started with a one-time $20 credit. Optimize output across models, reduce usage costs, and maximize performance with intelligent routing.

Get started