Welcome to the Glyphoxa documentation. These guides cover how to set up, configure, extend, and operate Glyphoxa โ€” a real-time voice AI framework for TTRPG NPCs.

Where to Start

New to Glyphoxa? Follow this path:

  1. Getting Started โ€” Install prerequisites, build, and run your first NPC
  2. Architecture โ€” Understand the system at a glance
  3. Configuration โ€” Configure providers, NPCs, and memory
  4. Testing โ€” Run tests and learn conventions before contributing

Documentation Index

Setup and Overview

Document Description
Getting Started Prerequisites, build, first run, development workflow
Architecture System layers, data flow, key packages, latency budget
Configuration Complete config field reference, hot-reload, provider options

Core Systems

Document Description
Providers Provider interfaces, supported providers, adding new providers, resilience
NPC Agents NPC definition, entities, campaigns, VTT import, hot context assembly
Memory 3-layer memory system, PostgreSQL setup, transcript correction, session lifecycle

Subsystems

Document Description
MCP Tools Tool system, built-in tools, building custom tools, budget tiers
Audio Pipeline Audio flow, transports, VAD, engine types, mixer, engine comparison

Operations

Document Description
Commands Discord slash commands, voice commands, puppet mode, dashboard
Deployment Docker Compose, Kubernetes / Helm, binary modes, production checklist
Multi-Tenant Gateway, admin API, tenant model, session orchestration, usage tracking
Observability Metrics, Prometheus, Grafana dashboards, per-tenant labels, health endpoints, alerting

Quality

Document Description
Testing Running tests, conventions, mocks, provider testing, integration tests
Troubleshooting Build issues, provider issues, runtime debugging, diagnostic steps

Design Documents

These documents explain the why behind each subsystem โ€” design decisions, rationale, and specifications. The guides above explain how to work with each subsystem.

Document Description
Overview Vision, goals, product principles
Architecture System layers and data flow
Providers LLM, STT, TTS, Audio platform interfaces
Memory Hybrid memory system and knowledge graph
MCP Tools Tool integration and performance budgets
Sentence Cascade Dual-model cascade (experimental)
NPC Agents Agent design and multi-NPC orchestration
Technology Technology decisions and latency budget
Open Questions Resolved and open design questions
Roadmap Development phases
Knowledge Graph L3 graph schema and query patterns

Other Resources


This site uses Just the Docs, a documentation theme for Jekyll.