Blog

Latest updates

a helpful article on What Hermes Agent is, steps to set up, how it compares to other agents, and how to use it with Pioneer - the preferred inference provider for Hermes agent that allows you to switch between 70+ models seamlessly with one API key.

Guide

Hermes Agent: The Complete Guide to the Self-Improving AI Agent (2026)

What Hermes Agent is, steps to set up, how it compares to other agents, and how to use it with Pioneer.

Jul 10, 2026

Research

GLiNER2-Guardrails-PII-Multi: Safety moderation and privacy filtering in a SLM

Introducing GLiNER2-Guardrails-PII-Multi - a multilingual multi-task small language model for safety moderation and privacy filtering.

Jul 8, 2026

Guide

OpenCode: The Complete Guide to the Open Source AI Coding Agent (2026)

What OpenCode is, how to set it up, how it compares to other agents, and how to run any model via Pioneer.

Jul 2, 2026

Guide

A guide to LLM inference

An overview of what inference is and what affects inference speed and performance.

Jun 19, 2026

Guide

How to Choose the Best Coding Models (2026 Edition)

An overview of how to choose the best frontier and open-source coding models for the right tasks.

Jun 3, 2026

Guide

A Guide to Small Language Models (SLMs)

A practical guide to SLMs: main architectures, when they outperform frontier models on production tasks, and how to fine-tune one.

Jun 1, 2026

Engineering

The 33rd Adapter Problem: How We Got 44x More Throughput from One L4

Our serving path falls off a cliff at adapter 33. Here's why, and what we did about it.

May 15, 2026

Research

GLiNER2-PII: Open Source Privacy Filtering with PII Detection

We're releasing GLiNER2-PII, a 300M parameter SOTA open-source model for detecting and redacting PII.

May 14, 2026

Research

GLiGuard: 16x Faster Safety Moderation with a Small Language Model

Introducing GLiGuard, a new open source small language model for safety moderation.

May 12, 2026

Research

The Agent Behind Pioneer

The research behind Pioneer, where fine-tuning small language models becomes a closed loop.

May 4, 2026

Product

Introducing Pioneer: The First Agent for Fine-tuning and Inference of LLMs

We’re launching Pioneer, the world’s first agent for fine-tuning and inferencing open source SLMs and LLMs.

Apr 21, 2026

Research

GLiNER2 for Agentic Information Extraction

The future belongs to models with architectures crafted, optimized, and deployed for focused tasks.

Feb 19, 2026

Research

GLiNER for Modern Named Entity Recognition

The one-size-fits-all LLM era is over. Scaling AI effectively now demands matching models to the right domain or task.

Jan 30, 2026

Research Papers

Correcting Stochastic Update Bias in Preconditioned Language Model Optimizers

GLiNER2-PII: A Multilingual Model for Personally Identifiable Information Extraction

GLiGuard: Schema-Conditioned Classification for LLM Content Moderation

Pioneer Agent: Continual Improvement of Small Language Models in Production

GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer

GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface

Beyond Reactivity: Measuring Proactive Problem solving in LLM Agents

Fastino Inc. ("Fastino") develops specialized AI models and provides APIs designed to support structured data extraction, classification, reasoning, and production AI workflows. Fastino is a technology company and does not provide legal, financial, compliance, or advisory services. Any outputs, predictions, classifications, or decisions generated through Fastino models are based on the configuration, data, and implementation provided by the customer. Fastino does not control, verify, or guarantee the accuracy, completeness, or suitability of model outputs for any specific purpose. By using this website or Fastino's models and services, you acknowledge that all content and outputs are provided for informational and operational purposes only and agree to our Terms of Use and Privacy Policy.