AI assistant with a FastAPI REST API, RAG-powered semantic memory, multi-provider LLM fallback, and production infrastructure — Dockerized and deployed on Railway.
A.L.F.R.E.D is an AI assistant with three entry points — a voice CLI, a PySide6 desktop GUI, and a FastAPI REST API — all sharing one Supabase backend. The project demonstrates end-to-end backend engineering: API design, containerization, CI/CD, test coverage, and cloud deployment.
The system uses a RAG pipeline backed by Supabase + pgvector for persistent semantic memory, a multi-provider LLM fallback chain (Claude 3.5 Sonnet → GPT-4o-mini), and integrations with Google Calendar, weather APIs, file management, and system monitoring.
Thin HTTP layer over existing logic — all endpoints delegate to brain.py and memory_manager.py with zero duplication. Auto-generated OpenAPI docs available.
| Method | Endpoint | Description |
|---|---|---|
| POST | /chat | Send a message, get AI response |
| GET | /chat/history | Retrieve conversation history |
| POST | /memories | Store a key-value memory |
| GET | /memories/{key} | Recall a specific memory |
| DELETE | /memories/{key} | Forget a memory |
| POST | /memories/search | Semantic vector search |
| GET | /system/health | Health check + Supabase status |
| WS | /ws/chat | Real-time WebSocket chat with auth handshake |
def) so FastAPI auto-offloads to a thread pool, matching the existing sync codebase with zero refactoring.Want to see more of my work?
All Projects