Skip to content

Model Lifecycle and Changelog

The NRP catalog rotates quickly to track the open-weights frontier. Newer, faster models replace obsolete ones, and GPU allocations are shifted toward what the community is actually using. This page explains the process and lists what’s changed recently.

How models are added and removed

Added — new models are added based on benchmarks (artificialanalysis.ai) and qualitative evidence (e.g. r/LocalLLaMA), with the final decision made by administrators in discussion with users.

Removed — obsolete models are removed when smaller models perform better all-around or another model has clearly replaced the use case.

Deprecated — research groups that need a specific model for reproducibility can declare research usage. Deprecated models stay up until the research concludes, but their replacement is still encouraged. If your group depends on a model that has been deprecated or removed, please reach out via the Matrix Nautilus AI/ML channel.

GPU allocation is the limiting factor: larger models that require many GPUs are removed sooner if relative performance falls behind, while small or efficient models get more leniency. New-model decisions and retirement discussion happen in the same Matrix channel.

Recent changes

April 2026

March 2026

  • qwen3-embedding (Qwen/Qwen3-VL-Embedding-8B) added on the AI Gateway.
  • embed-mistral (intfloat/e5-mistral-7b-instruct) decommissioned and replaced with qwen3-embedding due to incompatibilities with Jupyter AI.
  • llama3-sdsc (Llama-3.3-70B-Instruct) removed from the Envoy AI Gateway after a long deprecation.
  • glm-v (GLM-4.6V multimodal route) removed from the Envoy AI Gateway. Use glm-4.7 for text and other multimodal options for vision/video.

Older changes

February 2026
January 2026

Added/Changed

Removed

December 2025
November 2025

Added/Changed

Removed

NSF Logo
This work was supported in part by National Science Foundation (NSF) awards CNS-1730158, ACI-1540112, ACI-1541349, OAC-1826967, OAC-2112167, CNS-2100237, CNS-2120019.