Installation

Install the published package for the runtime your application uses. All public client packages use the same endpoint model: register an endpoint, keep the returned endpoint reference, and choose that endpoint for query, chat, or embed.
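The register, keep, and choose flow described above can be pictured with a toy stand-in. This is a sketch of the pattern only, not the sipp API: `ToyClient`, `EndpointRef`, `register_endpoint`, and the method signatures are all hypothetical names chosen for illustration, and the real packages may expose the model differently.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EndpointRef:
    """Opaque handle returned by registration (hypothetical type)."""
    name: str


class ToyClient:
    """Stand-in client that only records endpoints; NOT the sipp API."""

    def __init__(self):
        self._targets = {}

    def register_endpoint(self, name, target):
        # Registration returns a reference; the caller keeps it for later calls.
        ref = EndpointRef(name)
        self._targets[ref] = target
        return ref

    def _call(self, operation, ref, payload):
        # Every operation resolves the endpoint from the reference it is given.
        target = self._targets[ref]
        return f"{operation} on {target}: {payload}"

    def query(self, ref, prompt):
        return self._call("query", ref, prompt)

    def chat(self, ref, message):
        return self._call("chat", ref, message)

    def embed(self, ref, text):
        return self._call("embed", ref, text)


client = ToyClient()
local = client.register_endpoint("local", "model.gguf")  # hypothetical target
print(client.query(local, "hello"))
```

The point of the pattern is that the same kept reference selects the endpoint for every operation, so switching between a local model and a gateway target is a matter of passing a different reference.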

Package Installs

| Surface | Install | Use for |
| --- | --- | --- |
| Browser | npm install @sipp/sipp | Browser-local GGUF inference and browser gateway clients. |
| Node.js | npm install @sipp/sipp-server | Server-side local inference and framework route handlers. |
| Python | pip install sipppy | Python scripts, services, and gateway clients. |
| Python CUDA | GitHub release wheel | Python local inference with CUDA backend wheels. |
| Python Vulkan | pip install "sipppy[vulkan]" | Python local inference with Vulkan backend wheels. |
| Python Metal | pip install "sipppy[metal]" | Python local inference with Metal backend wheels on macOS. |
| Rust | cargo add sipp-rs | Rust applications and services. |

The current release workflow publishes the browser and Node npm packages, Python wheels, and Rust crates. It does not yet publish a standalone gateway-server binary, container image, or cargo install target. Until a public server artifact is added, deploy the gateway server from the source checkout and its Dockerfile.

Runtime Requirements

  • Local inference needs a compatible GGUF model file or browser-served GGUF asset.
  • Python wheels require Python 3.10 or newer.
  • Browser-local inference needs a modern browser with WebAssembly support; WebGPU acceleration depends on the browser and device. For details, please refer to Gateway.
  • Node installs use @sipp/sipp-server; npm resolves the matching optional platform binary package automatically.
  • Python installs use the sipppy wheel (imported as sipp) for CPU, plus extras such as sipppy[cuda] for GPU backend wheels. The sipppy wheels currently ship from GitHub Releases while the full PyPI build matrix is in progress (see the Python package page).
  • Set SIPP_NODE_BACKEND or SIPP_PYTHON_BACKEND when you need to force cpu, vulkan, cuda, or metal.
  • Gateway clients need only the gateway base URL, public target name, and application-owned authentication value.
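The Python version requirement and the backend override listed above can be checked in a small preflight step. The sketch below uses only the standard library. The helper names `python_is_supported` and `force_python_backend` are hypothetical, and it assumes that SIPP_PYTHON_BACKEND is read when the package loads and that backend names are matched in lowercase; neither assumption is confirmed here.

```python
import os
import sys

# Backend names listed in the runtime requirements above.
KNOWN_BACKENDS = ("cpu", "vulkan", "cuda", "metal")
MIN_PYTHON = (3, 10)


def python_is_supported(version_info=None):
    """Return True when the interpreter meets the Python 3.10+ wheel requirement."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) >= MIN_PYTHON


def force_python_backend(backend, environ=None):
    """Validate a backend name and store it in SIPP_PYTHON_BACKEND.

    Assumption: the variable is read when the package loads, so this
    should run before `import sipp`.
    """
    if environ is None:
        environ = os.environ
    name = backend.strip().lower()
    if name not in KNOWN_BACKENDS:
        raise ValueError(
            f"unknown backend {backend!r}; expected one of {KNOWN_BACKENDS}"
        )
    environ["SIPP_PYTHON_BACKEND"] = name
    return name
```

A typical use would be to call `python_is_supported()` at startup, then `force_python_backend("vulkan")` before importing sipp. Rejecting unknown names early turns a typo into an immediate error instead of leaving an unrecognized value in the environment.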

Next Steps