
NRP-Managed LLMs

The NRP provides several hosted open-weights LLMs for API access or for use with our hosted chat interfaces.

Chat Interfaces

Open WebUI

If you are looking to chat with an LLM through an interface similar to ChatGPT, we provide the NRP Open WebUI, based on the Open WebUI project. This is a feature-filled chat interface for all of the NRP-hosted models. You can use it to chat with or test out the models.

Visit the NRP Open WebUI interface

On macOS, you can keep Open WebUI always available in the Dock for quick access: with Open WebUI open in Safari, click File → Add to Dock.

LibreChat

If you are looking to chat with an LLM through an interface similar to ChatGPT, we also provide LibreChat, based on the LibreChat project. This is a simple chat interface for all of the NRP-hosted models. You can use it to chat with or test out the models.

Visit the LibreChat interface

On macOS, you can keep LibreChat always available in the Dock for quick access: with LibreChat open in Safari, click File → Add to Dock.

Cherry Studio

You can install the standalone Cherry Studio desktop application.

Visit the Cherry Studio application website

Go to Settings → Model Provider → press the Add button (set Provider Name to NRP or anything else you want, and Provider Type to OpenAI) → add API Key and API Host (https://ellm.nrp-nautilus.io/v1) → press Fetch model list → press the Add models to the list button at the right of the search box to add all models.

For setting the extra_body JSON parameter, go to Assistants → select an assistant (such as Default Assistant) → click Edit Assistant → Model Settings → Custom Parameters → Add Parameter → set Parameter to extra_body, select JSON, and input the JSON contents in the textarea right below (such as {"cache_salt": "YWJjZGVmZ2hpamtsbW5vcHFyc3R1dnd4eXphYmNkZWZnaGlqa2xtbm9wcQ==", "chat_template_kwargs": {"enable_thinking": true, "thinking": true, "reasoning": {"enabled": true}}}).

Please do not set Max Tokens (unless you know what you are doing; read the API Access to LLMs via Envoy gateway section).

Chatbox

You can install the standalone Chatbox desktop application or use the web interface version.

Visit the Chatbox application website

Generate the Chatbox configuration on the LLM token generation page. Copy the generated configuration to the clipboard; it will already contain your personal token.

In Chatbox, go to Settings → Model Provider, scroll down to the end of the providers list, and click Import from clipboard.

Please leave Max Output Tokens empty (fill in only Context Window unless you know what you are doing; read the API Access to LLMs via Envoy gateway section).

API Access to LLMs via Envoy gateway

To access our LLMs through the Envoy AI Gateway, you need to be a member of a group with the LLM flag. Your membership info can be found on the namespaces page.

Start by creating a token. You can use this token to query the OpenAI-compatible LLM endpoint with curl or any OpenAI-API-compatible tool:

curl -H "Authorization: Bearer <your_token>" https://ellm.nrp-nautilus.io/v1/models

NOTE: Please only specify Max Output Tokens/max_tokens/max_output_tokens when you know what they mean. These set the maximum output length within the context, not the total context length. If a client or library strictly requires a value, set it to no more than 1/3 to 1/4 of the total context length. NEVER set it to the full context length, as such requests are guaranteed to fail.
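If you do need a value, one safe approach is to derive it from the model's context length. A minimal sketch with the OpenAI Python client (131,072 is the gpt-oss context length from the model list below; the 1/4 fraction follows the note above and is only a rule of thumb):

import os

from openai import OpenAI

client = OpenAI(
    api_key=os.environ.get("OPENAI_API_KEY"),
    base_url="https://ellm.nrp-nautilus.io/v1",
)

CONTEXT_LENGTH = 131072  # total context of gpt-oss (see Available Models below)

# max_tokens bounds only the output; keep it well under the context length
# so the prompt still fits. Never set it to the full context length.
response = client.chat.completions.create(
    model="gpt-oss",
    messages=[{"role": "user", "content": "Hello!"}],
    max_tokens=CONTEXT_LENGTH // 4,
)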

Examples

Python Code

To access the NRP LLMs, you can use the OpenAI Python client, as in the example below.

nrp-llm.py
import os

from openai import OpenAI

client = OpenAI(
    # This is the default and can be omitted
    api_key=os.environ.get("OPENAI_API_KEY"),
    base_url="https://ellm.nrp-nautilus.io/v1",
)

completion = client.chat.completions.create(
    model="gpt-oss",
    messages=[
        {"role": "system", "content": "Talk like a pirate."},
        {
            "role": "user",
            "content": "How do I check if a Python object is an instance of a class?",
        },
    ],
)
print(completion.choices[0].message.content)
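To discover the model names accepted by the model parameter, you can also list the gateway's models from Python (equivalent to the curl query against /v1/models):

import os

from openai import OpenAI

client = OpenAI(
    api_key=os.environ.get("OPENAI_API_KEY"),
    base_url="https://ellm.nrp-nautilus.io/v1",
)

# List the models exposed by the gateway (equivalent to GET /v1/models).
for model in client.models.list():
    print(model.id)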

Bash+Curl

curl -H "Authorization: Bearer <TOKEN>" https://ellm.nrp-nautilus.io/v1/models

curl -H "Authorization: Bearer <TOKEN>" -X POST "https://ellm.nrp-nautilus.io/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-oss",
    "messages": [
      {"role": "user", "content": "How do I check if a Python object is an instance of a class?"}
    ]
  }'

OpenCode

After applying the configuration below, search for NRP in the list of models (press Ctrl+P, choose Switch models, then search).

In the configuration below, either set the environment variable OPENAI_API_KEY or replace {env:OPENAI_API_KEY} with the actual API key.

Adjust "output" as needed, but never set it to a value close to or larger than "context".


Contents of ~/.config/opencode/opencode.json:

{
  "$schema": "https://opencode.ai/config.json",
  "provider": {
    "NRP": {
      "npm": "@ai-sdk/openai-compatible",
      "name": "NRP",
      "options": {
        "baseURL": "https://ellm.nrp-nautilus.io/v1",
        "apiKey": "{env:OPENAI_API_KEY}"
      },
      "models": {
        "gpt-oss": {
          "name": "gpt-oss",
          "limit": {
            "context": 131072,
            "output": 32768
          }
        }
      }
    }
  }
}

Crush

https://github.com/charmbracelet/crush

Please refer to https://www.zonca.dev/posts/2026-01-29-configure-nrp-llm-opencode-crush as well.

After applying the configuration below, search for NRP in the list of models.

Adjust "default_max_tokens" as needed, but never set it to a value close to or larger than "context_window".


Contents of ~/.config/crush/crush.json:

{
  "$schema": "https://charm.land/crush.json",
  "options": {
    "disable_metrics": true,
    "disable_provider_auto_update": false,
    "debug": false,
    "debug_lsp": false,
    "attribution": {
      "trailer_style": "none",
      "generated_with": false
    }
  },
  "mcp": {},
  "providers": {
    "nrp": {
      "name": "NRP",
      "type": "openai-compat",
      "base_url": "https://ellm.nrp-nautilus.io/v1",
      "api_key": "<LLM_API_KEY>",
      "models": [
        {
          "id": "gpt-oss",
          "name": "gpt-oss",
          "context_window": 131072,
          "default_max_tokens": 32768
        }
      ]
    }
  }
}

Kimi CLI

https://github.com/MoonshotAI/kimi-cli


Contents of ~/.kimi/config.json:

{
  "default_model": "kimi",
  "models": {
    "kimi": {
      "provider": "nrp",
      "model": "kimi",
      "max_context_size": 262144,
      "capabilities": ["thinking", "image_in", "video_in"]
    }
  },
  "providers": {
    "nrp": {
      "type": "openai_legacy",
      "base_url": "https://ellm.nrp-nautilus.io/v1",
      "api_key": "<YOUR_API_KEY>"
    }
  },
  "services": {}
}

Claude Code

https://github.com/anthropics/claude-code


Environment Variables needed for Claude Code:

ANTHROPIC_BASE_URL="https://ellm.nrp-nautilus.io/anthropic"
ANTHROPIC_API_KEY="<llm-token>"
ANTHROPIC_DEFAULT_OPUS_MODEL="<name-of-nrp-model>"
ANTHROPIC_DEFAULT_SONNET_MODEL="<name-of-nrp-model>"
ANTHROPIC_DEFAULT_HAIKU_MODEL="<name-of-nrp-model>"
CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC="1"
CLAUDE_CODE_DISABLE_FEEDBACK_SURVEY="1"
CLAUDE_CODE_ENABLE_TELEMETRY="0"
API_TIMEOUT_MS="3000000"
DISABLE_TELEMETRY="1"

Note that the Web Search Tool is an Anthropic-specific tool that only Claude models can invoke. Claude Code users can use other tools (MCP servers, custom commands) to perform web search instead.

Note that not all NRP models support Anthropic-compatible endpoints or tools like Claude Code. Please consult the model documentation and the vLLM integration guide: https://docs.vllm.ai/en/stable/serving/integrations/claude_code/.

Isolating cached responses

In your API call to models (vLLM and SGLang models are supported), specify the key cache_salt in extra_body. Its value should be a random base64-encoded string known only to you, where the secret random text before base64 encoding is at least 43 characters (256 bits) long.

Example (base64-encoded from abcdefghijklmnopqrstuvwxyzabcdefghijklmnopq): extra_body={"cache_salt": "YWJjZGVmZ2hpamtsbW5vcHFyc3R1dnd4eXphYmNkZWZnaGlqa2xtbm9wcQ=="}

response = client.chat.completions.create(
    model=model,
    messages=messages,
    extra_body={
        "cache_salt": "YWJjZGVmZ2hpamtsbW5vcHFyc3R1dnd4eXphYmNkZWZnaGlqa2xtbm9wcQ==",
    },
)
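One way to generate a suitable value is with Python's standard library; this is a minimal sketch (any source of at least 256 bits of secret randomness works equally well):

import base64
import secrets

# 32 random bytes = 256 bits of entropy, base64-encoded for use as cache_salt.
cache_salt = base64.b64encode(secrets.token_bytes(32)).decode("ascii")
print(cache_salt)

Keep the generated salt private and reuse the same value across your own requests, so your requests can still share a cache with each other.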

We recommend that API users set the above configuration whenever their prompts or responses should be cached only for themselves and not for others. This is potentially very important: it prevents other people's cached responses or prompts from showing up in your responses (and your cached responses or prompts from showing up in other users' responses), and will likely also improve the accuracy of your responses, since irrelevant caches are not referenced. However, it will slightly decrease performance.

We would really like to apply the above configuration automatically by default for each API key or in the web interfaces, but there are several hurdles right now. (Envoy AI Gateway Issue, LibreChat Issue, Open WebUI Issue)

Live inference status

The NRP site hosts LLM Status, a live view of managed inference workloads (built from Prometheus/vLLM metrics). Each card is one Kubernetes container; the title is the vLLM / Hugging Face model ID. If the same logical gateway model (for example qwen3 in API requests) runs on more than one deployment, you will see more than one card with the same model name. Usage data in Grafana or Thanos often carries OpenTelemetry labels such as gen_ai_original_model with the gateway short name (qwen3, gemma, gemma-4-e4b, gpt-oss, …), which differs from the Hugging Face ID shown on LLM Status.

Available Models

main - The model is generally supported. You can report issues with the service. However, if the model is outdated with no apparent usage purpose, it may be removed (if there is no major group or user usage) or switched to a deprecated state. This is to provide our users with the best models within our limited allocation of GPUs. Moreover, models may be upgraded to a newer variant without prior notice.

batch - The LLM is recommended for batch querying and will provide enough performance for most types of queries under moderately heavy load. For batching, code your script so that the chat request retries indefinitely, preferably with a retry interval that grows as the retry count increases (see the sketch below). Models may go down and come back up at various times.

tool - The LLM has tool calling (function calling) enabled.

multimodal - The LLM is multimodal (accepted media types are listed in the model card).

research - The LLM is used for active research purposes. Removal or deprecation will be communicated to research users. Please inform an admin if your group would like to declare research usage for certain models.

evaluating - The LLM was added for testing and we're evaluating its capabilities. The model may be unavailable at times, and configurations may change without notice.

deprecated - The LLM is deprecated and is likely to go away soon. Please do not start using this model; it remains available only for existing user groups with specific purposes for it.
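For models carrying the batch flag, the retry guidance above can look like the following minimal sketch using the OpenAI Python client (the backoff interval and cap are assumptions; tune them to your workload):

import time

from openai import APIConnectionError, APIError, OpenAI

client = OpenAI(base_url="https://ellm.nrp-nautilus.io/v1")  # API key read from OPENAI_API_KEY

def chat_with_retry(messages, model="gpt-oss"):
    # Retry indefinitely, growing the interval as the retry count increases.
    delay = 1
    while True:
        try:
            return client.chat.completions.create(model=model, messages=messages)
        except (APIError, APIConnectionError) as exc:
            print(f"Request failed ({exc}); retrying in {delay}s")
            time.sleep(delay)
            delay = min(delay * 2, 300)  # cap the backoff at 5 minutes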

You can follow all updates and participate in the discussions within our Matrix Nautilus Artificial Intelligence/Machine Learning (NRP Matrix.to, Matrix.to) channel. Suggestions and decisions for new models are also made here.

qwen3

main batch tool multimodal research

Qwen/Qwen3.5-397B-A17B-FP8

Multimodal (image, video), 262,144 context tokens, 397B parameters, Official FP8 quantization, tool calling, Claude/Gemini-level frontier multimodal performance, use extra_body={"chat_template_kwargs": {"enable_thinking": false}} to disable thinking

Capacity may be split across multiple GPU pools (for example A100 and H200); see LLM Status for per-workload allocation.

qwen3-small

main tool multimodal research

Qwen/Qwen3.5-27B

Multimodal (image, video), 262,144 context tokens, 27B parameters, bf16 weights, tool calling, efficient multimodal and agentic performance, use extra_body={"chat_template_kwargs": {"enable_thinking": false}} to disable thinking

gpt-oss

main batch tool research

openai/gpt-oss-120b

131,072 context tokens, 120B parameters, Native MXFP4 model weights, tool calling, frontier agentic task performance

gemma

main tool multimodal research

google/gemma-4-31B-it

Multimodal (image, video), 262,144 context tokens, 31B parameters, Native bf16 weights, tool calling, efficient multimodal and audio capabilities, use extra_body={"chat_template_kwargs": {"enable_thinking": true}} to enable thinking

gemma-small

evaluating tool multimodal

google/gemma-4-E4B-it

Multimodal (image, video, audio), 131,072 context tokens, ~8B parameters, Native bf16 weights, tool calling, ASR and speech-to-text translation on the small Gemma 4 line; use extra_body={"chat_template_kwargs": {"enable_thinking": true}} to enable thinking

kimi

evaluating tool multimodal

moonshotai/Kimi-K2.5

Multimodal (image, video), 262,144 context tokens, 1T parameters, Native MXFP4 model weights, tool calling, frontier agentic coding performance, use extra_body={"chat_template_kwargs": {"thinking": false}} to disable thinking

glm-4.7

evaluating tool

zai-org/GLM-4.7-FP8

202,752 context tokens, 358B parameters, Official FP8 quantization, tool calling, frontier agentic coding performance, use extra_body={"chat_template_kwargs": {"enable_thinking": false}} to disable thinking
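Several of the model cards above toggle reasoning through chat_template_kwargs passed in extra_body. A minimal sketch for qwen3 (the flag name comes from its model card; kimi uses thinking and the Gemma models use enable_thinking, as noted above):

import os

from openai import OpenAI

client = OpenAI(
    api_key=os.environ.get("OPENAI_API_KEY"),
    base_url="https://ellm.nrp-nautilus.io/v1",
)

# Disable thinking for qwen3, per its model card above.
response = client.chat.completions.create(
    model="qwen3",
    messages=[{"role": "user", "content": "Summarize the CAP theorem in one sentence."}],
    extra_body={"chat_template_kwargs": {"enable_thinking": False}},
)
print(response.choices[0].message.content)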

Deploying Your Own Models

Please refer to Managing AI Models.

How Models are Added and Removed

Added: New NRP-managed models are added by the administrators based on user feedback and on assessments of benchmarks and community response to the models. We try to take into account quantitative benchmarks (such as https://artificialanalysis.ai), but the ultimate decision is based on qualitative evidence (such as https://www.reddit.com/r/LocalLLaMA/) and discussions between administrators and users.

Removed: We also remove models that are deemed sufficiently obsolete, for instance when smaller models perform better all-round, or when another model covers the same use case in a clearly better way.

Deprecated: An exception is when research groups need specific models for reproducibility in their research. In that case, we deprecate those models first and keep them up until the research concludes. If a model you need has been deprecated or pulled down, please reach out through the Nautilus Artificial Intelligence/Machine Learning channel below so we can track the need.

However, we still want to remove deprecated models as soon as possible: our GPU allocation for deployed LLMs is limited, and these GPUs should be diverted to more recent, better-performing models for the benefit of the whole NRP community. Administrators and researchers use these models for AI-assisted code development, and such models need to be rotated frequently as new and better models are released, since incorporating newer models vastly increases individual productivity.

Larger models that require many GPUs are judged more strictly and are likely to be removed earlier if their relative performance falls behind, while smaller, more efficient, or quantized models that do not require as many GPUs are judged somewhat more leniently.

Such discussions take place in the Nautilus Artificial Intelligence/Machine Learning (NRP Matrix.to, Matrix.to) channel.

Changelogs


April 2026:

March 2026:

  • qwen3-embedding (Qwen/Qwen3-VL-Embedding-8B) was added as an available model on the AI Gateway.

  • embed-mistral (intfloat/e5-mistral-7b-instruct) was decommissioned and replaced with qwen3-embedding due to incompatibilities with Jupyter AI.

  • llama3-sdsc (Llama-3.3-70B-Instruct) was removed from the Envoy AI Gateway after a long deprecation. It no longer appears in /v1/models; do not select it in new configs. If it briefly still appeared on LLM Status as “down”, those were stale Prometheus series that persisted until the workload fully disappeared from metrics.

  • glm-v (GLM-4.6V multimodal route) was removed from the Envoy AI Gateway; use glm-4.7 for text and other multimodal options as documented above.

February 2026:

January 2026:

Added/Changed:

Removed:

December 2025:

Added/Changed:

November 2025:

Added/Changed:

  • qwen3 (Qwen/Qwen3-235B-A22B-Thinking-2507-FP8) has been changed to Qwen/Qwen3-VL-235B-A22B-Thinking-FP8. Very similar characteristics such as number of parameters, context size, and benchmarks, but adds state-of-the-art vision and video multimodal capabilities.
  • kimi (moonshotai/Kimi-K2-Thinking) is a widely popular programming LLM model and exhibits a similar level of model performance to Claude Sonnet 4.5 or GPT-5 models.
  • glm-4.6 (QuantTrio/GLM-4.6-GPTQ-Int4-Int8Mix) is a widely popular programming LLM model and exhibits a similar level of model performance to Claude Sonnet 4 or Gemini 2.5 Pro models.
  • minimax-m2 (MiniMaxAI/MiniMax-M2) is a widely popular programming LLM model and exhibits a similar level of model performance to Claude Sonnet 4 or Gemini 2.5 Pro models, while being able to fit the official FP8 parameters in four A100 GPUs with ample context length.
  • gpt-oss (openai/gpt-oss-120b) is a very capable agentic model, adequate for general-purpose usage, while requiring only one A100 GPU or two RTX A6000 GPUs for full context due to sliding window attention and official MXFP4 quantization, a fraction of what other frontier models need. This is our candidate for an “LTS” model for reproducible research, superseding the deprecated or removed Llama3 models.
  • gemma3 was changed to 2x RTX A6000 GPUs instead of 2x A100 GPUs to conserve the latter. The model’s special sliding window attention method allows full context to fit in this case.
  • glm-v was changed to the official zai-org/GLM-4.5V-FP8 model and uses 4x L40 GPUs instead of 2x A100 GPUs to conserve the latter and gain FP8 capabilities.

Removed:

  • llama3 (meta-llama/Llama-3.2-90B-Vision-Instruct) has been officially pulled down: it consumed 4 A100 GPUs that can be used for far more capable frontier models, such as MiniMax-M2 or GLM-4.6, while performing much worse than models that fit in one GPU.
  • deepseek-r1 (QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Medium) has been officially pulled down: it consumed 8 GPUs while being very slow (5-6 tokens/s) at any larger context size. There are many similar models that work well, although not necessarily better in every way, and are faster. This is an example of the “larger models that require a lot of GPUs are likely to be removed earlier” policy above.
  • watt (watt-ai/watt-tool-8B) has been removed due to inactivity.
This work was supported in part by National Science Foundation (NSF) awards CNS-1730158, ACI-1540112, ACI-1541349, OAC-1826967, OAC-2112167, CNS-2100237, CNS-2120019.