niklas/amduat-api

Carl Niklas Rydberg 92aa3b6a3f Commit local script and AI workflow updates

2026-02-08 07:55:43 +01:00


AI v2 Plan

Goal

Ship one reliable AI vertical slice on top of the v2 graph API:

ingest deterministic graph facts,
retrieve graph context for a root,
answer with grounding evidence,
execute a minimal planner loop with persisted run state.

Scope Rules

Prioritize app-level AI workflow work in this repo.
Treat backend fault investigation as out of scope unless it blocks the vertical slice.
Keep vendor/amduat-api pinned while iterating on prompts/evals.

Working Lane

Use branch: feat/ai-v2-experiments.
Keep core command stable: ./scripts/v2_app.sh ai-vertical-slice.
Track prompt/eval tweaks under ai/.

Acceptance Criteria

./scripts/v2_app.sh ai-vertical-slice passes on a running daemon with Ollama.
Output contains non-empty answer text with grounding.has_evidence == true.
tests/ai_eval.sh and tests/ai_answer_eval.sh pass in the same environment.
./scripts/v2_app.sh ai-agent --json 'doc-ai-1' 'What domain is doc-ai-1 in?' 'ms.within_domain' writes checkpoint state under ai/runs/.
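The output and checkpoint criteria above can be checked mechanically. The following is a minimal sketch, assuming the command prints a single JSON object to stdout with a top-level `answer` string alongside `grounding.has_evidence`. The `answer` field name, both helper names, and the use of `jq` are assumptions, not something this plan confirms.

```shell
#!/usr/bin/env bash
# Hypothetical helpers for the acceptance criteria.
# Assumption: the JSON output has a top-level "answer" string (name not confirmed).

# Succeeds when JSON on stdin has a non-empty answer
# and grounding.has_evidence is exactly true.
check_answer_json() {
  jq -e '(.answer | type == "string" and length > 0)
         and (.grounding.has_evidence == true)' >/dev/null
}

# Succeeds when the directory (default: ai/runs) holds at least one regular file.
has_checkpoint() {
  local dir="${1:-ai/runs}"
  [ -d "$dir" ] && [ -n "$(find "$dir" -type f | head -n 1)" ]
}
```

If those assumptions hold, piping the vertical slice output into `check_answer_json` and calling `has_checkpoint` after the `ai-agent` run would cover the second and fourth criteria.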

Quick Run Sequence

Start daemon (or let the vertical slice auto-start it): ./scripts/dev_start_daemon.sh
Run AI vertical slice: ./scripts/v2_app.sh ai-vertical-slice
If the daemon may not be running, use: ./scripts/v2_app.sh ai-vertical-slice --auto-start-daemon
Run minimal agent loop: ./scripts/v2_app.sh ai-agent --json --auto-start-daemon 'doc-ai-1' 'What domain is doc-ai-1 in?' 'ms.within_domain'
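The sequence above can be wrapped in one function that stops at the first failing step. This is a minimal sketch, assuming it runs from the repository root. The `V2_APP` override variable and the `run_sequence` name are hypothetical and exist only to make the sketch easy to stub.

```shell
#!/usr/bin/env bash
# Sketch: run the quick sequence in order, stopping at the first failure.
# V2_APP is a hypothetical override; it defaults to the script named in this plan.
V2_APP="${V2_APP:-./scripts/v2_app.sh}"

run_sequence() {
  # --auto-start-daemon covers the case where the daemon is not yet running.
  "$V2_APP" ai-vertical-slice --auto-start-daemon || return 1
  # Minimal agent loop with the same arguments as in the sequence above.
  "$V2_APP" ai-agent --json --auto-start-daemon \
    'doc-ai-1' 'What domain is doc-ai-1 in?' 'ms.within_domain' || return 1
}
```

Calling `run_sequence` returns non-zero as soon as either step fails, so the agent loop never runs against a broken vertical slice.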

Stop Conditions

If startup, ingest, or retrieve fails because of a backend regression, log the failure and pause AI iteration until it is fixed.
Do not switch scope to broad backend cleanup without an explicit decision.
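The first stop condition can be sketched as a small wrapper that records a failed stage and returns non-zero so the caller halts. This is a minimal sketch: the `run_stage` helper, the `FAILURE_LOG` variable, and its default path are hypothetical, and deciding whether a failure is really a backend regression remains a manual call.

```shell
#!/usr/bin/env bash
# Hypothetical log location; this plan does not prescribe one.
FAILURE_LOG="${FAILURE_LOG:-ai/runs/failures.log}"

# run_stage NAME CMD [ARGS...]
# Runs one stage. On failure, appends a timestamped line to FAILURE_LOG,
# prints a notice to stderr, and returns the command's exit status.
run_stage() {
  local name="$1"; shift
  local status=0
  "$@" || status=$?
  if [ "$status" -ne 0 ]; then
    mkdir -p "$(dirname "$FAILURE_LOG")"
    printf '%s stage=%s exit=%s\n' \
      "$(date -u +%Y-%m-%dT%H:%M:%SZ)" "$name" "$status" >> "$FAILURE_LOG"
    echo "stage '$name' failed; pausing AI iteration until it is fixed" >&2
  fi
  return "$status"
}
```

For example, `run_stage vertical-slice ./scripts/v2_app.sh ai-vertical-slice || exit 1` would log a failed run and stop the surrounding script instead of continuing to prompt or eval tweaks.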