Reasoning Capture Methodology, evidentiary standards, AI decision records, chain-of-thought, SHAP, LIME, automated quality checks, severity scoring, CC BY 4.0, high-stakes AI, credit decisions, export control, clinical triage, hiring, Reg BI

CC BY 4.0 · March 2026 · Public Draft

Reasoning Capture Methodology

Evidentiary standards for AI decision records: contemporaneous, specific, verifiable reasoning.

Evidentiary Standard

Reasoning is not an afterthought. It is the primary forensic artifact. This methodology defines the minimum content required for AI decision records to satisfy evidentiary scrutiny in high-stakes domains.

Credit
Export Control
Clinical Triage
Hiring
Reg BI
Contemporaneous

Capture at Decision Time

Reasoning must be recorded at the exact instant of inference, not reconstructed afterward. Post-hoc explanations are opinions. Contemporaneous traces are evidence.

// First ADR capture · Feb 2026
{
  "adr_id": "adr_01j6q3k8z9x7y5w4",
  "timestamp": "2026-02-18T14:23:15.123Z",
  "agent_id": "export_screen_v4",
  "decision_type": "export_eligibility",
  "output": "DENIED · Entity List match 0.94",
  "current_hash": "0x3d9b...c2e4"
}
// Production provenance
{
  "crate": "agdr-aki",
  "version": "v1.8.12",
  "runtime": "Rust + PyO3",
  "license": "Apache-2.0 or CC0-1.0",
  "steward": "Genesis Glass Foundation",
  "install_cargo": "cargo add agdr-aki",
  "install_pip": "pip install agdr-aki",
  "chain_integrity": "VERIFIED · TAMPER EVIDENT"
}
Specific

Approved Methods

The methodology approves three reasoning capture methods that satisfy specificity requirements under evidentiary standards:

  • Chain-of-thought: Token-level reasoning path with intermediate steps
  • SHAP: Feature attribution with directional contribution values
  • LIME: Local interpretable model-agnostic explanations

Automated Quality

Human review alone is insufficient at scale. The methodology mandates automated quality checks and severity scoring as structural requirements, not optional enhancements.

"A decision without verified reasoning is a liability without a defense. Severity scoring assigns evidentiary weight before the record leaves the kernel."

Quality invariant: VERIFY_REASONING → SCORE_SEVERITY → COMMIT_RECORD
Automated checks run before human review. The machine validates the machine.

Verifiable Integrity

Long-term evidentiary value requires more than tamper-evidence. It requires cryptographic binding to the decision record and public deposit for institutional redundancy.

⚠ Companion Standard Deprecated

ADR Specification v0.1, previously referenced as the companion container standard to this methodology, is archived, deprecated, and obsolete as of March 2026. Current implementations should reference AgDR-Phoenix v1.8 for the active container specification. This methodology remains authoritative for evidentiary content requirements.

Public Deposit

Permanent Record

Deposited under CC BY 4.0. The methodology is a permanent public record, not a proprietary specification.

// ProvenancePacket · wire format for tamper-evident chain
// First uploaded to GitHub: March 11, 2026
// schema: agdr.provenance.v1

syntax = "proto3";

package agdr.provenance.v1;

import "google/protobuf/timestamp.proto";

message ProvenancePacket {
  string packet_id = 1;
  uint64 sequence_number = 2;
  bytes parent_hash = 3;           // BLAKE3 of previous packet
  google.protobuf.Timestamp emit_timestamp = 4;
  bytes provenance_hash = 5;       // origin of decision
  bytes place_hash = 6;            // execution context
  bytes purpose_hash = 7;          // alignment manifest
  bytes decision_hash = 8;       // action fingerprint
  bytes signature = 9;             // Ed25519 or ECDSA over packet
  optional bytes fec_parity = 10;  // for UDP dropless recovery
  map<string, bytes> extensions = 99;
}

Also deposited at archive.org and Library and Archives Canada (March 2026).

Legal Alignment

  • Canada Evidence Act s.31.1: Contemporaneous capture satisfies the "reliability of the electronic records system" test by construction
  • CBCA s.122: Specific, verifiable reasoning encodes Director duty of care into the decision record itself
  • EU AI Act (High-Risk): Article 14 human oversight requirements met through immutable reasoning preservation
  • Reg BI (US): Best interest obligation documentation through explainable, severity-scored records

Distinction from Logging

Traditional ML logging captures what happened. Reasoning Capture Methodology captures why it happened · with specificity, at decision time, with automated quality verification.

Structural monitoring verifies system health. Reasoning Capture verifies decision defensibility. Both are necessary. Neither substitutes for the other.

Contemporaneous

Captured at inference time, not reconstructed. The reasoning trace is sealed before the decision exits the kernel.

Specific

Chain-of-thought, SHAP, or LIME. No black-box summaries. Every weight and branch is accountable.

Verifiable

Automated quality checks and severity scoring. The machine validates its own reasoning before human review.