Open-weight models we actually ship in 2026

Prototype — concept, not a live service. This blog post is part of the redesign proposal demonstrated in this MVP and is not a current service of LIACC. Any data shown is illustrative. See the project report for context.

TUTORIAL · 02 Jan 2026 · 3 min read · 133 words

Open-weight models we actually ship in 2026

A short leaderboard of the open-weight models we deploy at LIACC, by task.

LIACC

Illustrative byline (prototype)

Not a leaderboard. Just the ones we actually ship.

Text classification / extraction

ModernBERT-large (English) and Albertina (Portuguese). Small, fast, fine-tune in a single afternoon. Dominant for closed-label tasks.

General-purpose Portuguese chat

Llama 3.1 8B + our SFT on 30k Portuguese instructions. Fast enough for real-time, good enough for 90% of requests from Portuguese public bodies.

Reasoning-heavy tasks

DeepSeek-R1-Distill 14B. Punches above its weight on legal reasoning once grounded with RAG.

Code-adjacent work

Qwen 2.5 Coder 7B. We use it for extraction and transformation of structured code and log data — not for the IDE.

Vision-language

Qwen2-VL 7B. Document layout understanding at a cost that fits our budget.

Where we still call an API

Novel reasoning tasks with no eval. We keep a frontier model subscription for the cold-start week.

Open-weight models we actually ship in 2026

Text classification / extraction

General-purpose Portuguese chat

Reasoning-heavy tasks

Code-adjacent work

Vision-language

Where we still call an API

Read next

The EU AI Act at enforcement: a lab's field guide

Small language models are eating the enterprise

Reasoning models vs retrieval: when test-time compute wins