Miniloader at Full Power: Four Real-World Use Cases

A fully configured Miniloader rack enables a single operator to run an automated research pipeline (RAG + web retrieval via MCP), a continuous operations system (local inference + external service integrations), and a local data analysis environment (Postgres + code execution tools) — without recurring API subscriptions, cloud data exposure, or dependency on external infrastructure.

1. The Autonomous Research Department

The Goal

Digest large volumes of raw data — PDFs, codebases, web content — retrieve precise citations, and synthesize reports, without transmitting sensitive documents to a cloud provider.

The Wiring

Knowledge Ingestion: File Access reads a local document folder and pushes batches via FILES_OUT to FILES_IN on the RAG Engine.
Context Injection: The RAG Engine processes embeddings entirely on-device (using all-MiniLM-L6-v2) and delivers context via CONTEXT_OUT to CONTEXT_IN on the Terminal.
External Reach: A custom Web Scraper MCP Cartridge connects to the MCP Bus via MCP_UPSTREAM, supplying live web data to the terminal alongside local sources.
Inference: The local brain runs inference against the combined retrieved context and tool data.

Results

The outcome is a locally-contained research assistant with full access to your private archive. It retrieves live web data, cross-references external sources against internal documents, and produces synthesized reports — with no data leaving the machine and no per-token cost on large document sets.

2. The Local Data Analysis Team

The Goal

Query live, complex relational databases to extract business intelligence, generate reports, or audit logs using natural language.

The Wiring

Database Tooling: The PG Cartridge connects to your Postgres instance (credentials stored securely in the Vault) and exposes tools including pg_list_schemas and pg_query.
Tool Routing: The PG Cartridge registers via TOOLS_OUT into the MCP Bus via MCP_UPSTREAM.
The Interface: The MCP Bus delivers the merged tool schema via MCP_DOWNSTREAM into MCP_IN on the Terminal.
Inference: Cloud Brain — optionally using a high-quality model via LiteLLM for SQL generation — handles reasoning while execution runs locally through the PG Cartridge.

Results

This configuration replaces expensive enterprise BI tooling with a natural-language interface to your own database. A request such as "Run a cohort analysis on last month's churned users" causes the agent to explore the schema, generate and validate SQL, execute it locally via the PG Cartridge, and return structured results — without exposing data to a third-party service.

3. The Continuous Operations Pipeline

The Goal

Run an autonomous system that interfaces with external services — managing a Discord community, responding to GitHub events, or driving an OBS Studio broadcast — powered entirely by local hardware.

The Wiring

Inference: The local brain streams tokens via BRAIN_OUT into BRAIN_IN on the AI Server.
API Bridge: The AI Server exposes an OpenAI-compatible REST interface (/v1/chat/completions).
External Access: The Web Gateway takes WEB_OUT from the Server and provides a secure, publicly addressable URL.
Integrations: External webhooks from Discord, GitHub, or OBS Studio are directed to the gateway URL.

Results

The result is a private inference API capable of sustaining high-volume, continuous automation — a 24/7 community assistant, a pull request reviewer, or a broadcast controller — without recurring inference costs. Compute is fully owned and scales independently by upgrading the GPU module.

4. The Unified Command Center

The Goal

A single, secure interface for managing your entire stack when working remotely.

The Wiring

Persistence: The database stores all threads, system state, and settings, encrypted via SQLCipher.
Application Layer: The Terminal manages chat, tool dispatch, and RAG injection.
Remote Access: The Web Gateway exposes the Terminal via a custom domain. Bearer authentication middleware protects all /api/* and /db/* routes using tokens stored in the Vault.

Results

The result is a complete, self-hosted AI platform accessible from any device. Secure remote access, full conversation history, and all connected tools remain available wherever you are — with no dependency on any external service.