
Every American local-model claim needs a policy trail before it can influence a recommendation.

WhatsMy.AI does not publish an American local-model page from memory alone. Each tracked model has to point back to an official release page, declare the evidence behind the origin claim, and keep runtime coverage tied to the exact paths shown on the site.


Policy snapshot

Current policy version: 2026-03-12-us-origin-v2.

Every catalog entry must ship with a source URL, release date, license, modality, runtime availability map, and last-verified timestamp.

The release date is normalized to the precision the source actually gives us, usually a month and year, so the catalog does not invent a day that the source never published.
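As a rough illustration of the precision rule, the sketch below renders only the date components a source actually states. The `ReleaseDate` shape and the `formatReleaseDate` helper are hypothetical names chosen for this example, not the catalog's real schema.

```typescript
// Hypothetical shape: the catalog's real field names may differ.
type ReleaseDate =
  | { precision: "year"; year: number }
  | { precision: "month"; year: number; month: number }
  | { precision: "day"; year: number; month: number; day: number };

// Zero-pad a month or day to two digits.
function pad2(n: number): string {
  return n < 10 ? "0" + n : String(n);
}

// Render only the components the source published; never invent a day.
function formatReleaseDate(d: ReleaseDate): string {
  switch (d.precision) {
    case "year":
      return String(d.year);
    case "month":
      return d.year + "-" + pad2(d.month);
    case "day":
      return d.year + "-" + pad2(d.month) + "-" + pad2(d.day);
  }
}
```

Storing the precision next to the value means a month-only source can never be silently widened to a full date later in the pipeline.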

US-origin eligibility requirements

Entity provenance. A model only qualifies as an American local model when the stewarding release entity can be traced to an official US-controlled company or institute page. Stored at `provenance.originEvidence.entityProvenanceUrl`.

Official release channel. The canonical source URL must resolve to an official provider-controlled release page for the exact model line, not a community mirror or repackaging page. Stored at `provenance.originEvidence.officialReleaseChannelUrl`.

Downloadable local-runtime path. Every American entry needs at least one current downloadable local-runtime path with a reviewed install or artifact URL. Guided setup docs can supplement that evidence but cannot replace it. Stored at `runtimeSupport[*].actionUrl`.

License requirements. The displayed license label must trace back to an official page that states the governing terms for the release being recommended. Stored at `provenance.originEvidence.licenseUrl`.

Reviewer timestamp. Each entry carries a per-entry review timestamp. CI treats that timestamp as the reviewer stamp for freshness and policy enforcement. Stored at `provenance.lastVerifiedAt`.

Recorded evidence fields

`provenance.originEvidence.entityProvenanceUrl` records the official US-entity evidence for the stewarding organization.

`provenance.originEvidence.officialReleaseChannelUrl` must mirror the catalog source URL for the exact release page.

`provenance.originEvidence.licenseUrl` links the license terms the catalog is relying on.

`runtimeSupport[*].actionUrl` stores the reviewed download or install path for each tracked local runtime, and at least one US runtime path must be directly downloadable.

`runtimeSupport[*].verification.lastVerifiedAt` records when that runtime path was last rechecked.

`runtimeSupport[*].verification.checks[*].target` records whether the evidence is a compatibility review or a distribution review.

`runtimeSupport[*].verification.checks[*].sourceUrl` points to the exact reviewed runtime page, and `caveat` stores any constraints that need to stay visible.

When a runtime page is a catalog or guide rather than the raw artifact, the drift check also derives the tracked download path from the reviewed runtime command.

`lastVerifiedAt` is the reviewer timestamp carried by each entry and reused by the freshness window.
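The field paths listed above can be pictured as a nested record. The types below are a sketch, not the catalog's actual schema: the field paths come from this guide, while the `runtime` key, the optional `caveat`, the string-typed timestamps, and every URL in the sample entry are assumptions made for illustration.

```typescript
// Illustrative types only; anything not named in the guide is assumed.
type CheckTarget = "compatibility" | "distribution";

interface RuntimeCheck {
  target: CheckTarget; // which kind of review produced the evidence
  sourceUrl: string;   // exact reviewed runtime page
  caveat?: string;     // constraints that must stay visible
}

interface RuntimeSupport {
  runtime: string;   // hypothetical key, e.g. "ollama"
  actionUrl: string; // reviewed download or install path
  verification: { lastVerifiedAt: string; checks: RuntimeCheck[] };
}

interface CatalogEntry {
  sourceUrl: string;
  provenance: {
    lastVerifiedAt: string;
    originEvidence: {
      entityProvenanceUrl: string;
      officialReleaseChannelUrl: string;
      licenseUrl: string;
    };
  };
  runtimeSupport: RuntimeSupport[];
}

// Collect every evidence URL an entry records, in a stable order.
function evidenceUrls(entry: CatalogEntry): string[] {
  const origin = entry.provenance.originEvidence;
  const urls = [origin.entityProvenanceUrl, origin.officialReleaseChannelUrl, origin.licenseUrl];
  for (const rt of entry.runtimeSupport) {
    urls.push(rt.actionUrl);
    for (const check of rt.verification.checks) urls.push(check.sourceUrl);
  }
  return urls;
}

// Placeholder entry: every URL below is made up for illustration.
const exampleEntry: CatalogEntry = {
  sourceUrl: "https://example.com/models/demo",
  provenance: {
    lastVerifiedAt: "2026-03-12T00:00:00Z",
    originEvidence: {
      entityProvenanceUrl: "https://example.com/about",
      officialReleaseChannelUrl: "https://example.com/models/demo",
      licenseUrl: "https://example.com/models/demo/license",
    },
  },
  runtimeSupport: [
    {
      runtime: "ollama",
      actionUrl: "https://example.com/runtime/demo",
      verification: {
        lastVerifiedAt: "2026-03-12T00:00:00Z",
        checks: [{ target: "distribution", sourceUrl: "https://example.com/runtime/demo" }],
      },
    },
  ],
};
```

Calling `evidenceUrls(exampleEntry)` gathers the three origin-evidence links followed by each runtime's action URL and check source, which is the set of links a reviewer would need to re-open.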

Verification gate

`npm run verify:model-catalog` is the focused metadata gate for this catalog.

`npm run codex:verify` runs the same provenance check before the broader test and build flow, so missing metadata or mismatched runtime availability fails the project validation path.

The CI gate now blocks any recommended entry that is missing entity evidence, official release evidence, license evidence, or a current downloadable local-runtime path.
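A gate of this kind could look roughly like the following. This is a simplified sketch under stated assumptions, not the project's actual `verify:model-catalog` implementation: the flattened `GateEntry` shape, the `recommended` flag, and both function names are hypothetical.

```typescript
// Hypothetical, flattened view of an entry for gate purposes.
interface GateEntry {
  id: string;
  recommended: boolean;
  entityProvenanceUrl?: string;
  officialReleaseChannelUrl?: string;
  licenseUrl?: string;
  downloadableRuntimePaths: string[]; // reviewed, directly downloadable paths
}

// Name each kind of evidence the entry lacks.
function missingEvidence(entry: GateEntry): string[] {
  const missing: string[] = [];
  if (!entry.entityProvenanceUrl) missing.push("entity evidence");
  if (!entry.officialReleaseChannelUrl) missing.push("official release evidence");
  if (!entry.licenseUrl) missing.push("license evidence");
  if (entry.downloadableRuntimePaths.length === 0) {
    missing.push("downloadable local-runtime path");
  }
  return missing;
}

// Return only the recommended entries that the gate would block.
function blockedEntries(entries: GateEntry[]): { id: string; missing: string[] }[] {
  return entries
    .filter((e) => e.recommended)
    .map((e) => ({ id: e.id, missing: missingEvidence(e) }))
    .filter((r) => r.missing.length > 0);
}
```

A CI wrapper would presumably exit non-zero whenever `blockedEntries` returns a non-empty list and print the `missing` labels so the failure is actionable.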

The scheduled drift workflow re-checks official release URLs, license evidence, download paths, and runtime availability pages against live provider/runtime sources.

When a tracked field changes, the workflow writes a public downgrade status, uploads a diff report, and raises a single actionable failure on GitHub that stays open until a reviewer re-verifies the entry.
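The field-level comparison behind such a diff report might be sketched as below. The flat `Snapshot` shape, the `diffFields` and `driftStatus` helpers, and the `"verified"` / `"downgraded"` labels are all assumptions for illustration; the real workflow's report format is not described in this guide.

```typescript
// Hypothetical flat snapshot: field path -> observed value.
type Snapshot = { [field: string]: string };

interface FieldDiff {
  field: string;
  verified: string | undefined; // value in the last verified snapshot
  live: string | undefined;     // value observed by the scheduled check
}

// Compare the verified snapshot with the live observation, field by field.
function diffFields(verified: Snapshot, live: Snapshot): FieldDiff[] {
  const fields: string[] = Object.keys(verified);
  for (const key of Object.keys(live)) {
    if (fields.indexOf(key) === -1) fields.push(key);
  }
  fields.sort(); // stable report order

  const diffs: FieldDiff[] = [];
  for (const field of fields) {
    if (verified[field] !== live[field]) {
      diffs.push({ field: field, verified: verified[field], live: live[field] });
    }
  }
  return diffs;
}

// Any difference downgrades the entry until a reviewer re-verifies it.
function driftStatus(diffs: FieldDiff[]): "verified" | "downgraded" {
  return diffs.length > 0 ? "downgraded" : "verified";
}
```

Reporting the field name alongside both values lets a reviewer see at a glance whether the license link, the source page, or a download path moved.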

Recommendation graduation criteria

Public evidence bundle. A family only graduates into the visible recommendation track when the provenance guide links its public evidence bundle and decision history. Threshold: required for every published family.

Downloadable runtime path. At least one tracked runtime path must stay directly downloadable and reviewable so the recommendation does not lean on generic setup copy alone. Threshold: at least 1 current downloadable local-runtime path.

Memory fit threshold. Recommendation pages only graduate when the benchmark and catalog can point to a concrete minimum and comfortable memory band for the family. Threshold: tier ceilings published in the provenance guide.

Freshness SLA. Reviewer timestamps must stay within the active review window so stale origin or runtime evidence drops out before it can influence the shortlist. Threshold: 30-day review cadence.

Freshness is enforced with the same 30-day SLA that drives the catalog review window.
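A freshness check against the 30-day window could be written along these lines. This is a minimal sketch: the `isFresh` name, the inclusive boundary at exactly 30 days, and the choice to treat unparseable or future timestamps as not fresh are assumptions, not documented behavior.

```typescript
const REVIEW_WINDOW_DAYS = 30; // review cadence stated in this guide
const MS_PER_DAY = 24 * 60 * 60 * 1000;

// True when lastVerifiedAt falls inside the review window ending at `now`.
function isFresh(
  lastVerifiedAt: string,
  now: Date,
  windowDays: number = REVIEW_WINDOW_DAYS
): boolean {
  const verifiedMs = Date.parse(lastVerifiedAt);
  if (isNaN(verifiedMs)) return false; // assumption: unparseable counts as stale
  const ageMs = now.getTime() - verifiedMs;
  // Assumption: timestamps in the future are rejected rather than trusted.
  return ageMs >= 0 && ageMs <= windowDays * MS_PER_DAY;
}
```

For example, an entry verified on 2026-03-12 would still pass on 2026-04-01 (20 days later) and fail on 2026-04-20 (39 days later).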

Tier memory ceilings

3B. Minimum memory must stay at or below 2.5 GB and comfortable memory must stay at or below 4.0 GB.

7B. Minimum memory must stay at or below 6.5 GB and comfortable memory must stay at or below 8.0 GB.

13B. Minimum memory must stay at or below 11.0 GB and comfortable memory must stay at or below 14.0 GB.

34B. Minimum memory must stay at or below 19.5 GB and comfortable memory must stay at or below 24.0 GB.

70B. Minimum memory must stay at or below 40.0 GB and comfortable memory must stay at or below 48.0 GB.

120B. Minimum memory must stay at or below 67.0 GB and comfortable memory must stay at or below 80.0 GB.

frontier. Minimum memory must stay at or below 245.0 GB and comfortable memory must stay at or below 288.0 GB.
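The ceilings above lend themselves to a lookup table. In the sketch below the numbers are copied from this section, while the `MemoryBand` shape, the `TIER_CEILINGS` constant, and the `fitsTier` helper are hypothetical names introduced for illustration.

```typescript
interface MemoryBand {
  minGb: number;         // minimum memory, in GB
  comfortableGb: number; // comfortable memory, in GB
}

// Ceilings as listed in this guide.
const TIER_CEILINGS: { [tier: string]: MemoryBand } = {
  "3B": { minGb: 2.5, comfortableGb: 4.0 },
  "7B": { minGb: 6.5, comfortableGb: 8.0 },
  "13B": { minGb: 11.0, comfortableGb: 14.0 },
  "34B": { minGb: 19.5, comfortableGb: 24.0 },
  "70B": { minGb: 40.0, comfortableGb: 48.0 },
  "120B": { minGb: 67.0, comfortableGb: 80.0 },
  "frontier": { minGb: 245.0, comfortableGb: 288.0 },
};

// True when both the minimum and comfortable figures sit at or below the tier ceiling.
function fitsTier(tier: string, band: MemoryBand): boolean {
  if (!Object.prototype.hasOwnProperty.call(TIER_CEILINGS, tier)) {
    throw new Error("Unknown tier: " + tier);
  }
  const ceiling = TIER_CEILINGS[tier];
  return band.minGb <= ceiling.minGb && band.comfortableGb <= ceiling.comfortableGb;
}
```

Under this sketch a family claiming the 7B tier with a 5.2 GB minimum and an 8.0 GB comfortable band passes, while one with a 6.6 GB minimum does not.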

Ownership and cadence

Owner: WhatsMy.AI benchmark maintainers.

Refresh cadence: every 30 days, or sooner when a source page, runtime package, or license changes.

An entry is stale when its `lastVerifiedAt` timestamp falls outside the review window.

An entry is drifted when the scheduled check records a live source, license, download path, or runtime evidence mismatch in the checked-in drift status file.

What gets checked

The verification check confirms the source URL and runtime action URLs are HTTPS, the release date and last-verified timestamp are parseable, and the display labels still match the source metadata block.

It also requires the runtime availability map to stay in lockstep with the tracked Ollama, LM Studio, and llama.cpp paths so the public copy cannot drift away from the actual evidence.

Each tracked runtime path must declare whether the reviewed evidence was compatibility or distribution, plus the exact source URL and any recorded caveat.

For `origin: US` entries, the source URL must match the recorded official release channel and at least one runtime path must be a reviewed downloadable path rather than a generic setup note.
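Taken together, the checks in this section could be approximated as follows. This is a sketch under assumptions rather than the real verifier: the `CheckedEntry` shape, the runtime keys in `TRACKED_RUNTIMES`, the `downloadable` flag, and the accepted release-date pattern are all hypothetical, and the display-label comparison is left out.

```typescript
// Hypothetical keys for the three tracked runtimes.
const TRACKED_RUNTIMES = ["ollama", "lm-studio", "llama.cpp"];

interface CheckedEntry {
  origin: string;
  sourceUrl: string;
  officialReleaseChannelUrl: string;
  releaseDate: string;    // assumed forms: YYYY, YYYY-MM, or YYYY-MM-DD
  lastVerifiedAt: string; // assumed to be an ISO 8601 timestamp
  runtimeAvailability: { [runtime: string]: boolean };
  runtimeSupport: { runtime: string; actionUrl: string; downloadable: boolean }[];
}

function verifyEntry(e: CheckedEntry): string[] {
  const errors: string[] = [];
  const isHttps = (url: string): boolean => /^https:\/\//i.test(url);

  // URLs must be HTTPS.
  if (!isHttps(e.sourceUrl)) errors.push("sourceUrl is not HTTPS");
  for (const rt of e.runtimeSupport) {
    if (!isHttps(rt.actionUrl)) errors.push(rt.runtime + " actionUrl is not HTTPS");
  }

  // Dates must be parseable.
  if (!/^\d{4}(-\d{2}(-\d{2})?)?$/.test(e.releaseDate)) errors.push("releaseDate is not parseable");
  if (isNaN(Date.parse(e.lastVerifiedAt))) errors.push("lastVerifiedAt is not parseable");

  // The availability map must agree with the tracked runtime paths.
  for (const name of TRACKED_RUNTIMES) {
    const advertised = e.runtimeAvailability[name] === true;
    const tracked = e.runtimeSupport.some((rt) => rt.runtime === name);
    if (advertised !== tracked) errors.push("availability mismatch for " + name);
  }

  // Extra rules for US-origin entries.
  if (e.origin === "US") {
    if (e.sourceUrl !== e.officialReleaseChannelUrl) {
      errors.push("sourceUrl does not match official release channel");
    }
    if (!e.runtimeSupport.some((rt) => rt.downloadable)) {
      errors.push("no reviewed downloadable runtime path");
    }
  }
  return errors;
}
```

Returning a list of messages instead of a boolean keeps every violation visible in one run, which suits a CI gate that should report all problems at once.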

Model-family decision changelog

This public log uses the same format for additions, removals, upgrades, downgrades, and reclassifications, including the compact-model refresh that replaced older starter defaults.

Upgraded 2026-03-12

Registry operations

Added scheduled drift checks, field-level diff alerts, and automatic public downgrade whenever a tracked source, license, or runtime path drifts from the verified snapshot.

Last verified by Codex on SYM-38 on 2026-03-12. Public reviewer labels stay at the workspace or team level so private contact details never appear here.

gpt-oss evidence bundle · Llama frontier line evidence bundle · Official release channel rule · License requirements rule · Downloadable local-runtime path rule · Community GGUF and runtime repackaging edge case

Added 2026-03-12

Granite 4.0

Added Granite 4.0 to the public American-model shortlist after IBM-controlled release evidence, Apache terms, and direct local-runtime paths were reviewed together.