Try our new intelligent model routing solution, Arcee Conductor. Sign up today and get a $200 credit (~400M free tokens).

Return to blog

Company

06
May
2025
-
7
min read

Introducing Auto Reasoning in Arcee Conductor

Discover Auto Reasoning, Arcee Conductor’s new feature that intelligently routes complex prompts to the best reasoning model. Learn how Maestro excels at multi-step problem solving and why advanced reasoning is critical for modern businesses.

Mary MacCarthy
,
Nora He
,

Last week, we introduced our latest feature in Arcee Conductor: Auto Tools, which routes your tool-calling prompts to a selection of models that specialize in tool calling / function calling. Today, we’re introducing a similar feature for your prompts that involve complex reasoning: Auto Reasoning.

In this article, we explain why we built a mode focused on routing your reasoning prompts, how our reasoning model Maestro stands out in everyday reasoning tasks and why an AI's ability to tackle complex, multi-step problems is becoming essential for businesses today.

Why AI models Specialized in Reasoning are Necessary

While standard language models excel at straightforward question-and-answer tasks, advanced AI systems with enhanced reasoning abilities can tackle complex scenarios, draw logical conclusions, and make informed decisions.This advanced level of understanding empowers organizations to tackle nuanced tasks such as risk assessment, strategic planning, and customer service automation with greater confidence and precision.

By investing in AI models that are specially trained in reasoning, businesses gain a powerful tool to navigate ambiguity, adapt to evolving challenges, and ultimately deliver more reliable and intelligent solutions to their customers.

Until now, the intelligent model routing feature of Arcee Conductor sent all prompts to the ideal model for performance and cost, choosing from a suite of small language models (SLMs) and large language models (LLMs). But with the new Auto Reasoning mode, you can ensure that your reasoning prompt is routed only to one of these models that’s specially-trained in reasoning:

To get started, in the Conductor UI just simply click the dropdown and switch from “Auto” mode, to “Auto Reasoning”mode. It’s as simple as that. (If you’re accessing Conductor via the Conductor API, just change the model name to "auto-reasoning”). 

When you’re working in Auto Reasoning mode, your prompts that require the highest level of reasoning will be sent to the most appropriate LLM, and most of your less complex reasoning prompts will be routed to Arcee’s SLM, Maestro to help you balance cost and performance effectively.

{{tips}}

Here’s a Look at Prompts Where Maestro Excels

Not all reasoning prompts require the full power of the premium reasoning LLMs. Here are some examples of lower-complexity reasoning prompts that Conductor sends to Maestro, reducing your overall LLM usage and spend. 

In the following examples, we compare Maestro with DeepSeek R1 on tasks that auto-reasoning mode would typically route to Maestro. DeepSeek R1, with its Mixture-of-Experts (MoE) architecture, is built to route inputs to specialized sub-networks for better efficiency and scalability. Now let’s see how Maestro proves it’s the best of the best when it comes to efficiency.

Subject Line Selection prompt

Prompt:
Audience: Customers who haven’t opened an email in 60 days.
Options:
"We miss you—come back?"
"Limited-time offer: 80% OFF!!!"
"Your account’s still here when you need it."
Task: Pick the best subject line number based on the audience.

Prompt Classification by auto-reasoning:
Text Generation Task, Business/Industrial domain; Complexity (1/10).

Results:


In this low-complexity email marketing task, both models offered strong approaches with different reasoning:

  • Maestro recognized that emotional resonance trumps transactional offers for inactive customers, confidently selecting "We miss you—come back?" for its perfect balance of warmth and curiosity. 
  • DeepSeek R1, however, avoided Option 1 (concerned it implied blame) and instead chose Option 3 ("Your account's still here when you need it") for its relationship-building potential. 

Which one resonates more with you? You decide. In the world of reasoning, there’s rarely a clear-cut answer; both sides present valid, well-supported perspectives, showing how different conclusions can still be equally sound.

Shifting from quality to efficiency, the raw performance data shows Maestro's clear edge for this type of prompt. It delivered 21.7% cost savings ($0.0038556 vs $0.004924) while generating more comprehensive insights (1149 output tokens vs. Deepseek's 676 output tokens).  

For companies sending thousands of re-engagement emails daily, Maestro's approach delivers substantial operational savings while potentially improving open rates by 15-20%, enhancing both efficiency and performance.

Supplier Preference prompt

Prompt: 
Preference Hierarchy: Quality > Speed > Price
A. AlphaCo – excellent quality, average speed, high price
B. BetaLtd – good quality, fast, very high price
C. GammaInc – acceptable quality, slow, low price
Task: Which supplier best fits the hierarchy? 

Prompt Classification by auto-reasoning:
Classification task, Business/Industrial domain; Complexity (1/10)

Results

In this supplier evaluation scenario, both models applied similar weighted evaluation approaches and concluded that AlphaCo represents the best choice despite higher costs, recognizing that superior quality outweighs price considerations for this particular business need. 

While both delivered similar results with strong performance and quality, Maestro's ability to deliver slightly more comprehensive output (1070 tokens vs 968) at nearly half the cost ($0.0035859 vs. $0.006941) and 44.45% faster (9.56s vs. 17.21s), speaks volumes about its efficiency advantage for similar everyday classification tasks. 

For procurement teams making hundreds of similar evaluations monthly, Maestro's performance advantage translates to significant time and cost savings without sacrificing analytical quality.

Phishing Detection prompt

Prompt:
Email Snippet:
"Dear User, your invoice is attached. Please sign in with the link below to view."
Clues:
Sender domain is invoices-secure.com (not your usual vendor)
Link points to a URL shortening service
Spelling and grammar look fine
Task: Is this email likely phishing?

Prompt Classification by auto-reasoning:
QA task type, complexity 1/10, internet domain

Results:

For this relatively straightforward phishing detection task, both models correctly identified the email as phishing. However, Maestro excelled by delivering more detailed and actionable recommendations (verify legitimacy, report the email, delete it), responding faster (10.16s vs. 11.23s), and costing 5.74% less ($0.0035943 vs $0.003813) while providing nearly twice the analytical depth (1069 tokens vs. 516 output tokens from DeepSeek-R1).

When scaled across enterprise environments processing thousands of suspicious emails daily, these performance differentials translate to significant operational advantages. Security teams can analyze more threats with greater speed, accuracy, and lower cost; And in environments where analysts are overwhelmed by alerts, Maestro’s clearer, actionable responses help cut through the noise and focus attention on real threats. 

Auto-Reasoning with Maestro: Excellence Without Excess

Overall, for businesses handling relatively low-complexity, routine decisions that still need thoughtful analysis, Auto Reasoning mode directs those prompts to Maestro, offering high-quality reasoning performance without the premium cost. As demonstrated in our email subject line selection and supplier preference examples, Maestro consistently provides comparable or superior analytical quality while offering significant advantages in both cost and speed.

To understand Maestro's direct bottom-line impact, consider the numbers: Maestro runs at just $3.30 per million output tokens compared to DeepSeek R1's $7.00. For a team processing 2 million prompts monthly with similar characteristics to our phishing detection example (74 input tokens, 1069 output tokens), using Maestro could save approximately $8,221.40 per month. For a more personalized calculation, simply input your specific usage patterns into our AI model calculator to see your potential monthly savings.

{{tips2}}

How businesses are leveling up their use of AI by leveraging Reasoning models

Curious how businesses are using reasoning prompts in their day-to-day use cases? Here are some examples showing that smart use of reasoning AI models can offload many research, analysis, and evaluation tasks–saving your teams many hours of their valuable time: 

  1. Financial Analysis and Investment Decisions - Reasoning through conflicting market indicators to determine which factors should carry more weight in investment decisions.

  2. Legal Document Review - Analyzing contracts, policies, and legal documents to identify potential issues, inconsistencies, or areas of concern through multi-step reasoning.

  3. Healthcare Diagnosis Support - Assisting medical professionals by reasoning through patient symptoms, medical history, and research literature to suggest potential diagnoses or treatment options.

  4. Supply Chain Optimization - Reasoning through complex logistics networks to identify bottlenecks, optimize routing, and improve overall efficiency.

  5. Strategic Business Planning - Analyzing market conditions, competitor activities, and internal capabilities to reason through strategic options and develop business plans.

  6. Product Development Decision-Making - Evaluating feature requests, market feedback, and technical constraints to reason through product roadmap priorities.

  7. Risk Assessment and Management - Identifying potential risks across business operations and reasoning through mitigation strategies and contingency plans.

  8. Customer Sentiment Analysis - Reasoning through customer feedback, reviews, and service interactions to identify patterns, sentiment trends, and improvement opportunities.

  9. HR and Talent Management - Analyzing workforce data, performance metrics, and industry benchmarks to reason through talent development strategies and workforce planning.

  10. Marketing Campaign Optimization - Evaluating past campaign performance, audience data, and market trends to reason through optimal marketing strategies and budget allocation.

Get Started with Auto Reasoning Today in Arcee Conductor

By leveraging AI models trained in reasoning, companies can enhance their decision-making processes, improve operational efficiency, and drive innovation. These models not only analyze vast amounts of data at unprecedented speeds but also provide insights that are contextual and nuanced, allowing businesses to make more informed and strategic choices. Arcee Conductor’s Auto Reasoning mode is the perfect opportunity to incorporate reasoning AI models into your workflows, since it gives you access to multiple models, and ensures that you’re working with the best model for each reasoning prompt, always, of course, at the best price. 

Sign up today for Arcee Conductor and get a $20 credit: https://conductor.arcee.ai/

Give Arcee a Try

Lorem ipsum dolor sit amet consectetur. Vitae enim libero lectus urna blandit sapien. In egestas ac dolor dictum.
Book a Demo

Maestro is a compact model (just 32B) that performs strongly across math, coding, and open-domain reasoning–while keeping costs low. We trained it by reinforcing Qwen-2.5-32B with a critic-free algorithm called Group-Relative Policy Optimisation (GRPO). Before policy training started, the model was jump-started with logit-wise distillation from DeepSeek-R1, and its instruction set is continuously upgraded by an in-house evolution framework (EvolKit 2) that uses our Arcee SLM Virtuoso-Large as the “evolver” and DeepSeek-R1 as the provisional answerer. 

Auto reasoning is fully accessible via direct API integration, making it easy to enhance your existing workflows, from security assessments to supplier selection. Whether you're tackling simple or complex reasoning tasks, our specialized models deliver dependable results without interrupting your operations.

Sign up for the Arcee AI newsletter

Subscribe to get the latest news and insights on SLM-powered AI agents

Thank you!

We will get back
to you soon.
Oops! Something went wrong while submitting the form.