Client Configurations
Pre-built configurations for popular coding CLIs. The endpoint and token setup is covered on API Access — start there if you don’t have a token yet.
Use the right-hand “On this page” navigation to jump to your client. Every config below assumes:
- Endpoint: `https://ellm.nrp-nautilus.io/v1` (Anthropic-compatible at `/anthropic`)
- Token: the value from the LLM token page, passed in `Authorization: Bearer …` or whatever the client's API-key field expects.
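Before wiring up any client, you can sanity-check the endpoint and token. The sketch below builds an authorized request to the standard OpenAI-compatible `/models` listing without sending it; `OPENAI_API_KEY` is assumed to hold your LLM token:

```python
import os
import urllib.request

BASE_URL = "https://ellm.nrp-nautilus.io/v1"

def build_models_request(token: str) -> urllib.request.Request:
    # GET /v1/models with the bearer token -- the same header
    # every client configuration on this page ends up sending.
    return urllib.request.Request(
        f"{BASE_URL}/models",
        headers={"Authorization": f"Bearer {token}"},
    )

req = build_models_request(os.environ.get("OPENAI_API_KEY", "<llm-token>"))
# urllib.request.urlopen(req) performs the actual call; a JSON model list
# coming back confirms the token works.
```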
OpenCode
After applying the configuration below, search for NRP in the list of models (Ctrl+P → Switch models).
Either set the OPENAI_API_KEY environment variable or replace {env:OPENAI_API_KEY} with the actual API key. Adjust "output" as needed, but never set it close to or larger than "context".
Contents of ~/.config/opencode/opencode.jsonc:
```jsonc
{
  "$schema": "https://opencode.ai/config.json",
  "provider": {
    "NRP": {
      "npm": "@ai-sdk/openai-compatible",
      "name": "NRP",
      "options": {
        "baseURL": "https://ellm.nrp-nautilus.io/v1",
        "apiKey": "{env:OPENAI_API_KEY}"
      },
      "models": {
        "gpt-oss": {
          "name": "gpt-oss",
          "limit": {
            "context": 131072,
            "output": 32768
          }
        }
      }
    }
  }
}
```

pi
pi.dev — press Ctrl-L to switch models and search for NRP (or type /model modelname).
apiKey accepts the literal key, an environment variable name (OPENAI_API_KEY), or a shell command (!decrypt_key NRP).
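The three accepted forms can be illustrated with a small resolver. This is a sketch of the behavior described above, not pi's actual implementation; in particular, the precedence (shell command, then environment variable, then literal) is an assumption:

```python
import os
import subprocess

def resolve_api_key(spec: str) -> str:
    """Illustrative resolver for the three apiKey forms:
    "!cmd"   -> run the shell command, use its stdout
    env name -> use the environment variable's value
    anything else -> treat as the literal key
    """
    if spec.startswith("!"):
        return subprocess.run(
            spec[1:], shell=True, capture_output=True, text=True, check=True
        ).stdout.strip()
    if spec in os.environ:
        return os.environ[spec]
    return spec
```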
Contents of ~/.pi/agent/models.json:
```json
{
  "providers": {
    "nrp": {
      "baseUrl": "https://ellm.nrp-nautilus.io/v1",
      "api": "openai-completions",
      "apiKey": "<YOUR_API_KEY>",
      "models": [
        {
          "id": "kimi",
          "name": "Kimi 2.6",
          "input": ["text", "image"],
          "contextWindow": 262144,
          "reasoning": true
        },
        {
          "id": "qwen3-small",
          "name": "Qwen3 27B",
          "input": ["text", "image"],
          "contextWindow": 262144,
          "reasoning": true,
          "compat": { "supportsDeveloperRole": false }
        }
      ]
    }
  }
}
```

Crush
charmbracelet/crush — see also Andrea Zonca’s writeup: Configure NRP LLM with OpenCode and Crush.
After applying the configuration below, search for NRP in the list of models. Adjust "default_max_tokens" as needed; never push it close to or above "context_window".
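That budget rule can be checked with a short sanity test; the 50% headroom threshold here is an illustrative choice, not a Crush requirement:

```python
def check_budget(context_window: int, default_max_tokens: int,
                 headroom: float = 0.5) -> bool:
    """True if the output budget leaves room for the prompt: here the
    output budget must stay at or below half of the context window."""
    return 0 < default_max_tokens <= context_window * headroom

# The gpt-oss entry below (32768 out of 131072) passes the check;
# pushing the budget to the full window would not.
assert check_budget(131072, 32768)
assert not check_budget(131072, 131072)
```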
Contents of ~/.config/crush/crush.json:
```json
{
  "$schema": "https://charm.land/crush.json",
  "options": {
    "disable_metrics": true,
    "disable_provider_auto_update": false,
    "debug": false,
    "debug_lsp": false,
    "attribution": {
      "trailer_style": "none",
      "generated_with": false
    }
  },
  "mcp": {},
  "providers": {
    "nrp": {
      "name": "NRP",
      "type": "openai-compat",
      "base_url": "https://ellm.nrp-nautilus.io/v1",
      "api_key": "<LLM_API_KEY>",
      "models": [
        {
          "id": "gpt-oss",
          "name": "gpt-oss",
          "context_window": 131072,
          "default_max_tokens": 32768
        }
      ]
    }
  }
}
```

Kimi CLI
Contents of ~/.kimi/config.toml:
```toml
default_model = "kimi"

[providers.nrp]
type = "openai_legacy"
base_url = "https://ellm.nrp-nautilus.io/v1"
api_key = "<YOUR_API_KEY>"

[models.kimi]
provider = "nrp"
model = "kimi"
max_context_size = 262144
capabilities = ["thinking", "image_in", "video_in"]

[services]
```

Claude Code
anthropics/claude-code talks to NRP through the Anthropic-compatible endpoint at /anthropic. Set the variables below in your shell or ~/.claude/.env:
```bash
ANTHROPIC_BASE_URL="https://ellm.nrp-nautilus.io/anthropic"
ANTHROPIC_API_KEY="<llm-token>"
ANTHROPIC_DEFAULT_OPUS_MODEL="<name-of-nrp-model>"
ANTHROPIC_DEFAULT_SONNET_MODEL="<name-of-nrp-model>"
ANTHROPIC_DEFAULT_HAIKU_MODEL="<name-of-nrp-model>"
CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC="1"
CLAUDE_CODE_DISABLE_FEEDBACK_SURVEY="1"
CLAUDE_CODE_ENABLE_TELEMETRY="0"
API_TIMEOUT_MS="3000000"
DISABLE_TELEMETRY="1"
```

Copilot CLI
Set COPILOT_PROVIDER_MAX_PROMPT_TOKENS to the model’s context length and COPILOT_PROVIDER_MAX_OUTPUT_TOKENS to a smaller value (roughly 1/16 to 1/4 of the context, depending on the task).
```bash
COPILOT_PROVIDER_BASE_URL="https://ellm.nrp-nautilus.io/v1"
COPILOT_PROVIDER_KEY="<llm-token>"
COPILOT_MODEL="<name-of-nrp-model>"
COPILOT_PROVIDER_MAX_PROMPT_TOKENS="<context-length>"
COPILOT_PROVIDER_MAX_OUTPUT_TOKENS="<max-output>"
```
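The 1/16-to-1/4 rule of thumb can be sketched as a small helper that derives both limits from a model's context length; the function name and default fraction are illustrative:

```python
def copilot_token_limits(context_length: int,
                         output_fraction: float = 0.25) -> dict:
    """Derive the two Copilot limits from a model's context length.
    output_fraction follows the guidance above: roughly 1/16 for short
    completions, up to 1/4 for longer generations."""
    if not (1 / 16 <= output_fraction <= 1 / 4):
        raise ValueError("output_fraction should stay between 1/16 and 1/4")
    return {
        "COPILOT_PROVIDER_MAX_PROMPT_TOKENS": context_length,
        "COPILOT_PROVIDER_MAX_OUTPUT_TOKENS": int(context_length * output_fraction),
    }

# A 131072-token model with the default 1/4 fraction yields a
# 32768-token output budget.
limits = copilot_token_limits(131072)
```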