Arcee Blog

Research
July 9, 2024

How Do I Prep my Data to Train an LLM?

So you want to train a custom language model, and you do have the requisite large set of text data. But how do you know that the data is *really actually ready* for model training? Our researchers here at Arcee AI tell you what to look out for.

Open-Source SLMs
July 3, 2024

Introducing Arcee Agent: A Specialized 7B Language Model for Function Calling and Tool Use

Arcee Agent is yet another Arcee model punching above its weight: it's just 7B (initialized from Qwen2-7B) and outranks much larger models. Try it out for function calling and tool use!

Research
June 25, 2024

The Hidden Challenges of Domain-Adapting LLMs

Adapting an LLM to a specific domain might sound straightforward, but it in fact opens a Pandora's box of challenges. Our research team explains the shortfalls of some of the most common techniques.

Open-Source SLMs
June 24, 2024

Arcee Spark: A Compact & Efficient 7B Parameter Language Model

Looking for proof that Small is the new Big when it comes to language models? Look no further than the model we've just dropped here at Arcee AI: you get top-notch results with just 7B parameters.

Research
June 18, 2024

Introducing the Ultimate SEC LLM: Revolutionizing Financial Insights

We built Llama-3-SEC upon the powerful Meta-Llama-3-70B-Instruct model, with the goal of providing unparalleled insights and analysis capabilities for financial professionals, investors, researchers, and anyone working with SEC filings and related financial data.

Product
May 20, 2024

Arcee Cloud: The LLM Solution for Everyone

We’re thrilled to announce that we’re launching Arcee Cloud, a fully hosted SaaS offering that makes world-class LLM production accessible to all – in an easy-to-use platform for Training, Merging, and Deploying custom language models.

Open-Source Toolkits
April 26, 2024

Use MergeKit to Extract LoRA Adapters from any Fine-Tuned Model

We show you how to use Arcee's MergeKit to extract LoRA adapters from fine-tuned models, then leverage the Hugging Face Hub to create a library of general and task-specific LoRA adapters.

Open-Source Toolkits
April 23, 2024

Evolutionary Model Merging For All

We've been focused on developing this groundbreaking technique for the community, and we're now excited to announce the launch of this state-of-the-art functionality in MergeKit.

Research
April 19, 2024

Why Methods Like QLoRA Fall Short in Domain Knowledge Injection

Arcee's research shows that the Standard Continual Pre-Training (CPT) approach performs better than QLORA-based CPT.