The four hops
| Hop | What it tells you |
|---|---|
| Rule → Inference log | The exact LLM call: model, versioned prompt, token counts, timestamp. Not “an LLM”; that call. |
| Inference log → Events | The precise set of structured events the model saw as input. |
| Events → Raw records | The original payloads: the actual Teams/Slack message, email, or document, verbatim. |
| Raw record → Connector | Which system it came from and when it was ingested. |
- “Why does Seyn think we need CEO approval at $30M?” Here are the 23 deals it observed.
- “Show me every message the model looked at when forming this claim.” Here they are, verbatim.
- “Which version of which prompt produced this?” This one, on this date, with these token counts.
Enforced, not promised
Three design decisions make the chain trustworthy:- First-class links. Every connection in the chain is a queryable, constrainable relation, not a free-text reference.
- Restricted deletion. You cannot delete a raw record that an event depends on, or an event that an inference depends on. Deleting evidence out from under a claim is a database error, not a policy.
- One inference choke point. Every LLM call in the platform passes through a single central function that records input event IDs as part of the call. A model call that bypasses provenance is not possible by construction. See Observability.
This is deliberately inconvenient. Restricted deletion means cleanup requires walking the chain in dependency order, a price we pay so that the chain can never silently break.
Where you see it
| Surface | What you get |
|---|---|
| Dashboard | Rule detail pages drill down through evidence → inference log → events. |
| API / SDK | client.rules.provenance(ruleId) returns the entire chain in one call. |
| MCP | The get_rule_provenance tool gives agents the same drill-down, so your AI can show receipts too. |
| Chat | Citations on every answer resolve to rules, and from rules to the full chain. |
Common mistakes
| Symptom | Cause | Fix |
|---|---|---|
| Provenance looks “thin” (1–2 events) for a high-confidence rule | Confidence reflects model certainty, not evidence volume | Weigh confidence, frequency, and evidence count together |
| Expecting provenance for pattern metrics | Pattern metrics are SQL aggregates over events, not LLM outputs; they have no inference log by design | Query the events endpoints directly for the underlying data |
| Trying to delete ingested data that has dependents | Restricted deletion protects the chain | Deletion walks the chain top-down; source removals are soft-deletes |
Related
Observability
The inference log and tracing that feed the chain.
Knowledge
Rules, libraries, and the review workflow that sits on top.