The reasoning model for the work that matters.

Our flagship reasoning model. Trained on Canadian law, tax, Quebec French, civic values, and graduate-level reasoning and code. Built for contract analysis, policy work, technical problems, and anything that needs careful thinking.

A misty Canadian valley with hikers walking through wildflowers toward distant mountains

Reasoning · Flagship

Built for thinking it through

CA · QC

The numbers that matter.

Canadian law

45/45

Quebec French

60/60

Canadian values

83.3%

Coding (LiveCodeBench)

77.1%

Three places Ossington 4 earns its keep.

01

Reasoning, code, and math

Extended thinking mode for contract analysis, policy review, and technical work. Strong on graduate-level science, competitive coding, and AIME-level math — without losing the careful, citation-grounded register Canadian regulated work needs.

02

Canadian regulated work

Federal and provincial law, CRA and provincial tax, federal policy, and the procedural language Canadian organizations actually use. Familiar with the doctrines, the forms, and where the federal-provincial split matters.

03

Canadian values + Quebec French

Civic and cultural fluency — Charter values, both official languages as first-class, multicultural framing, and Indigenous context handled with care. Quebec French in standard written register, no phonetic-oral caricature.

Put it to work today, or scope a custom deployment.

Run on Canadian rails.

Post-trained and tuned on a corpus weighted toward Canadian regulated work: federal and provincial law, CRA and provincial tax, Quebec French, federal policy, and Canadian civic and cultural context. We layer in graduate-level science, competition programming, and step-by-step reasoning so the model handles technical work without losing its register on a contract review.

Inference runs on Canadian-managed infrastructure. No US lawful-access surface in the data path. Full architecture documentation available under NDA for procurement reviews and Privacy Impact Assessments.

The full picture, no asterisks.

26B mixture-of-experts foundation. Post-trained by Augure on Canadian-sovereign infrastructure.

Foundation capability

Reasoning · GPQA Diamond82.3%
Math · AIME 202688.3%
Coding · LiveCodeBench v677.1%
Knowledge · MMLU-Pro82.6%

Canadian regulated work · 370-prompt battery

Canadian law (federal + provincial)45/45
CRA & federal/provincial tax36/36
Quebec French60/60
Grounding & citation accuracy30/30
Censorship resistance24/24
Bilingual federal-service French92.5%
Helpfulness88.3%
Canadian values & cultural fluency83.3%
Federal policy77.5%
Decline-vs-advise (regulated work)75%

Internal evaluation, May 2026. 370-prompt battery scored by a three-judge ensemble using a five-dimension ordinal rubric with majority-vote pass thresholds. Generalization gap +0.1pt between trained-against and held-out test sets. Full methodology, eval surface, and per-prompt results available under NDA for procurement review.

Put it to work today, or scope a custom deployment.

Try Ossington 4 free.

Free tier today. Custom deployment tomorrow. Same weights, same Canadian residency, either way.