Projects

A selection of things I've built — from agentic systems and RAG pipelines to low-level runtimes and developer tooling.

A LangGraph-based system that coordinates specialized agents to plan, research and execute long-running tasks with shared memory.

Production retrieval-augmented generation pipeline with hybrid search, re-ranking and evaluation harnesses for grounded answers.

Reproducible fine-tuning workflows with dataset curation, LoRA training and automated regression tests via pytest.

A framework for systematic prompt engineering with versioning, A/B comparisons and structured scoring across models.

A lightweight C++ runtime for serving small models with low latency and a clean Lua scripting layer for orchestration.

An internal dashboard to trace agent runs, inspect tool calls and surface failure modes across multi-step pipelines.