One evaluation question. One agreed dataset. Structured evidence handoff.
A clear four-step sequence from scope agreement to evidence handoff. No open-ended work.
Agree on the evaluation question, input data format, and success criteria before any execution begins. Everything outside this scope is explicitly noted as out of scope.
Deterministic GPU run on the agreed input data. Benchmark figures captured per tier. Rerun pass executed to confirm determinism. No undocumented deviations.
Setup note documenting explicit assumptions and non-assumptions. Benchmark summary with RTX 3060 Ti figures. Rerun evidence confirming bit-exact output. Result artifacts from the agreed run.
Structured packet ready for internal technical review or institutional forwarding. Scope closeout note included. The handoff marks the end of the bounded engagement.
A complete, structured output set — nothing left implicit, nothing assumed.
Explicit documentation of all assumptions made and non-assumptions held during execution. What the pipeline was told, and what it was not.
RTX 3060 Ti figures across applicable tiers. Latency (ms), energy efficiency (outputs/J), and D=1000 measurements. Directly comparable to published benchmark data.
Determinism verification across N runs. Same input, same output — confirmed and documented. Bit-exact match record included.
Sample output files from the agreed execution. Hash-referenced for traceability. Reviewable without re-running the pipeline.
All deliverables compiled into a structured packet, formatted for internal review or forwarding to an institutional technical audience.
Explicit documentation of what was and was not addressed. Closes the engagement boundary cleanly. No implied continuation.
The sprint is bounded by design. These items are outside the engagement unless separately scoped.