Blog

Latest updates

Guide

An overview of how to choose the best coding model for each task.

How to Choose the Best Coding Models (2026 Edition)


Guide

A practical guide to SLMs: main architectures, when they outperform frontier models on production tasks, and how to fine-tune one.

A Guide to Small Language Models (SLMs)


Engineering

Our serving path falls off a cliff at adapter 33. Here's why, and what we did about it.

The 33rd Adapter Problem: How We Got 44x More Throughput from One L4


Research

We're releasing GLiNER2-PII, a 300M parameter SOTA open-source model for detecting and redacting PII.

GLiNER2-PII: Open Source Privacy Filtering with PII Detection


Research

Introducing GLiGuard, a new open source small language model for safety moderation.

GLiGuard: 16x Faster Safety Moderation with a Small Language Model


Research

The research behind Pioneer, where fine-tuning small language models becomes a closed loop.

The Agent Behind Pioneer


Product

We’re launching Pioneer, the world’s first agent for fine-tuning and running inference on open source SLMs and LLMs.

Introducing Pioneer: The First Agent for Fine-tuning and Inference of LLMs


Research

The future belongs to models with architectures crafted, optimized, and deployed for focused tasks.

GLiNER2 for Agentic Information Extraction


Research

The one-size-fits-all LLM era is over. Scaling AI effectively now demands matching models to the right domain or task.

GLiNER for Modern Named Entity Recognition


Research Papers