Work with LLMs in a local environment using containers
Updated Apr 14, 2025 - TypeScript
Wingman is a tool for running Llama models locally on your PC or Mac.
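Local inference servers like these are typically reached over HTTP, and many follow the OpenAI-compatible chat-completions convention. As a minimal sketch only, assuming a server listening on localhost:8080 that exposes an OpenAI-compatible /v1/chat/completions route (the port, route, and model id are illustrative assumptions, not values taken from either project):

```ts
// Minimal sketch: query a local, OpenAI-compatible inference server.
// BASE_URL and the model id are assumptions for illustration; adjust
// them to whatever your local server actually exposes.
const BASE_URL = "http://localhost:8080/v1/chat/completions";

async function chat(prompt: string): Promise<string> {
  const res = await fetch(BASE_URL, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: "llama3", // hypothetical model id; use one your server lists
      messages: [{ role: "user", content: prompt }],
    }),
  });
  if (!res.ok) throw new Error(`Inference server returned ${res.status}`);
  const data = await res.json();
  return data.choices[0].message.content;
}

chat("Say hello in one sentence.").then(console.log).catch(console.error);
```

This runs as-is on Node 18+ or Deno, where `fetch` is built in; the only moving part is the endpoint URL, so the same snippet works against any container or desktop app that speaks the OpenAI-compatible protocol.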