Skip to content

AI / ML

Model training, retrieval-augmented generation, agent loops, and serving topologies. A purple theme marks the ML domain; the central agent / model gets a dramatic sizeScale.

Training pipeline with a gate

Data → features → train → eval → registry, told as a three-frame story. Hover and step through: frame 1 is the pipeline at rest, frame 2 clears the eval gate, frame 3 lands the model in the registry.

Frame 1 / 1–3
frame-gallery-training-gate (SVG, frame 1)

RAG: retrieval-augmented generation

The model's answer is grounded in documents retrieved from a vector store. The LLM sits at the centre with the weight; the store / docs feed it as a region.

Agent + tools (MCP-style)

A giant agent in the middle, compact tool satellites around it. Size variation makes the "orchestrator vs capability" relationship obvious.

Online serving with cache

A prediction service fronted by a cache. Cold queries fall through to the model; results are written back.

Edit this pageLast updated: