A visual control panel for your AI.

    Miniloader runs agents, models, and tools on your own machine. No subscriptions, no leaked prompts, no terminal required.

    VIDEO_FEEDCH:01
    MODULE STATUS
    AGENT
    LLM
    RAG
    MCP
    DB
    FILE
    THE PROBLEM WITH AI TOOLING

    The tools exist, but the interface doesn't.

    CLOUD AGENTS

    OpenClaw burns tokens faster than most users expect. Usage is invisible. Costs spiral before you notice.

    CLI AGENTS

    Hermes is capable but command-line only and hard to configure. Not an option for most people.

    LOCAL RUNNERS

    Ollama has no workflow layer. LM Studio is a backend with no front end. Neither helps you build a system.

    MINILOADER

    Pick a preset. Slot in your modules. Auto-wiring does the rest. You're running in minutes, with full visibility.

    WORKS WITH YOUR STACK

    Plug in your agent. Miniloader is the engine underneath.

    CONNECTED
    OpenClaw

    personal AI assistant · desktop + tools + workflows

    CONNECTED
    OpenCode

    open-source coding agent · terminal + IDE + desktop

    CONNECTED
    Hermes Agent

    NousResearch Hermes · instruction-tuned models

    CONNECTED
    Cline

    VS Code BYOK agent · human-in-the-loop approval

    CONNECTED
    Roo Code

    structured VS Code agent · plan / code / debug modes

    CONNECTED
    Aider

    CLI pair programmer · git-native editing

    CONNECTED
    Goose

    CLI + desktop agent · Block open-source

    CONNECTED
    Kilo Code

    VS Code + JetBrains BYOK agent

    CONNECTED
    LangChain

    framework for composing LLM applications

    Cure token anxiety. Let local models handle the routine work, and save premium tokens for the tasks that actually need them.

    YOUR INTELLIGENCE RACK

    Miniloader modules are a portal to productivity.

    AGENT ENGINE
    ACTIVE

    Central orchestration loop for chat turns, tool calls, and streamed responses. Takes client requests, prepares model calls, and coordinates tool execution round-trips through the tools channel. Expects an OpenAI-compatible backend upstream and tracks connectivity state for that endpoint.

    API_IN
    TOOLS_IN
    DB_IN
    LOCAL BRAIN
    ACTIVE

    Local inference engine that loads GGUF models through llama.cpp and generates token streams. Process-isolated so model runtime crashes do not take down the main Hypervisor process. Core local model runtime used behind server-facing modules such as gpt_server.

    BRAIN_OUT
    CLOUD BRAIN
    ACTIVE

    In-process LiteLLM gateway for cloud providers that acts as a drop-in OpenAI-compatible API source for the same downstream wiring used by local stacks.

    API_OUT
    LIVEKIT VOICE
    ACTIVE

    LiveKit Voice handles realtime STT/TTS by turning speech into agent turns and streaming responses back as synthesized audio with browser join configuration.

    AGENT_IO
    VOICE_CONFIG
    INSTALLATRON
    ACTIVE

    OpenClaw install and auto-config helper against your local AI server configuration.

    API_IN
    WEB_OUT
    NGROK TUNNEL
    ACTIVE

    Ngrok Tunnel publishes local services to the internet by consuming routing config and emitting tunnel status with a public URL on the same channel.

    WEB_IN

    17 modules ship with Miniloader: Local Brain, Chat Terminal, GPT Server, Database, File Access, and more.

    VIEW ALL

    Supported Services

    *coming soon

    OpenAI
    Anthropic
    Hugging Face
    ElevenLabs*
    AWS
    Google Cloud
    Supabase*
    PostgreSQL
    GitHub*
    Slack*
    Discord
    Obsidian
    Twilio*
    OBS Studio*
    LangChain
    OpenCode
    OpenClaw
    Cline
    Roo Code
    Aider
    Goose
    Kilo Code
    Hermes Agent
    OpenAI
    Anthropic
    Hugging Face
    ElevenLabs*
    AWS
    Google Cloud
    Supabase*
    PostgreSQL
    GitHub*
    Slack*
    Discord
    Obsidian
    Twilio*
    OBS Studio*
    LangChain
    OpenCode
    OpenClaw
    Cline
    Roo Code
    Aider
    Goose
    Kilo Code
    Hermes Agent
    LIMITED EARLY ACCESS

    Be one of the first to run Miniloader on your machine.

    GET EARLY ACCESS