Miniloader at Full Power: Four Real-World Use Cases
A fully configured Miniloader rack enables a single operator to run an automated research pipeline (RAG + web retrieval via MCP), a continuous operations system (local inference + external service integrations), and a local data analysis environment (Postgres + code execution tools) — without recurring API subscriptions, cloud data exposure, or dependency on external infrastructure.
1. The Autonomous Research Department
The Goal
Digest large volumes of raw data — PDFs, codebases, web content — retrieve precise citations, and synthesize reports, without transmitting sensitive documents to a cloud provider.
The Wiring
- Knowledge Ingestion: File Access reads a local document folder and pushes batches via
FILES_OUTtoFILES_INon the RAG Engine. - Context Injection: The RAG Engine processes embeddings entirely on-device (using all-MiniLM-L6-v2) and delivers context via
CONTEXT_OUTtoCONTEXT_INon the Terminal. - External Reach: A custom Web Scraper MCP Cartridge connects to the MCP Bus via
MCP_UPSTREAM, supplying live web data to the terminal alongside local sources. - Inference: The local brain runs inference against the combined retrieved context and tool data.
Results
The outcome is a locally-contained research assistant with full access to your private archive. It retrieves live web data, cross-references external sources against internal documents, and produces synthesized reports — with no data leaving the machine and no per-token cost on large document sets.
2. The Local Data Analysis Team
The Goal
Query live, complex relational databases to extract business intelligence, generate reports, or audit logs using natural language.
The Wiring
- Database Tooling: The PG Cartridge connects to your Postgres instance (credentials stored securely in the Vault) and exposes tools including
pg_list_schemasandpg_query. - Tool Routing: The PG Cartridge registers via
TOOLS_OUTinto the MCP Bus viaMCP_UPSTREAM. - The Interface: The MCP Bus delivers the merged tool schema via
MCP_DOWNSTREAMintoMCP_INon the Terminal. - Inference: Cloud Brain — optionally using a high-quality model via LiteLLM for SQL generation — handles reasoning while execution runs locally through the PG Cartridge.
Results
This configuration replaces expensive enterprise BI tooling with a natural-language interface to your own database. A request such as "Run a cohort analysis on last month's churned users" causes the agent to explore the schema, generate and validate SQL, execute it locally via the PG Cartridge, and return structured results — without exposing data to a third-party service.
3. The Continuous Operations Pipeline
The Goal
Run an autonomous system that interfaces with external services — managing a Discord community, responding to GitHub events, or driving an OBS Studio broadcast — powered entirely by local hardware.
The Wiring
- Inference: The local brain streams tokens via
BRAIN_OUTintoBRAIN_INon the AI Server. - API Bridge: The AI Server exposes an OpenAI-compatible REST interface (
/v1/chat/completions). - External Access: The Web Gateway takes
WEB_OUTfrom the Server and provides a secure, publicly addressable URL. - Integrations: External webhooks from Discord, GitHub, or OBS Studio are directed to the gateway URL.
Results
The result is a private inference API capable of sustaining high-volume, continuous automation — a 24/7 community assistant, a pull request reviewer, or a broadcast controller — without recurring inference costs. Compute is fully owned and scales independently by upgrading the GPU module.
4. The Unified Command Center
The Goal
A single, secure interface for managing your entire stack when working remotely.
The Wiring
- Persistence: The database stores all threads, system state, and settings, encrypted via SQLCipher.
- Application Layer: The Terminal manages chat, tool dispatch, and RAG injection.
- Remote Access: The Web Gateway exposes the Terminal via a custom domain. Bearer authentication middleware protects all
/api/*and/db/*routes using tokens stored in the Vault.
Results
The result is a complete, self-hosted AI platform accessible from any device. Secure remote access, full conversation history, and all connected tools remain available wherever you are — with no dependency on any external service.
