Deterministic vs Output-Based Identity
Definition
This page establishes the structural comparison between deterministic identity and output-based identity. Deterministic identity assigns identity to a Declared Execution through a deterministic function. The identity exists before the execution produces any output. Output-based identity reverses this relationship: it examines what a system has produced and derives identity from the results. Under output-based identity, the system does not have identity until its outputs are analyzed. The identity is a consequence of output evaluation, not a precondition of it.
The formal definition on Deterministic AI Identity: The Formal Definition states that identity is assigned by a deterministic process and yields the same identity for the same declared execution every time. Output-based identity violates this definition because it makes identity contingent on outputs, and outputs may vary even for the same declared execution. Execution environments change. Stochastic components produce different results across runs. External dependencies return different data. When outputs vary, output-based identity varies. When identity varies, the system is not an identity system.
The Constraint
The constraint is causal direction. In a valid identity system, identity is assigned to the declared execution. The assignment happens before any output is produced, observed, or evaluated. Identity is an input to the execution context, not an output of the execution process. Output-based identity violates this causal constraint by making identity an output of analysis rather than an input to it.
This matters because the causal direction determines what identity is a property of. When identity is assigned to declared execution, identity is a property of the execution specification. The specification does not change. Therefore, the identity does not change. When identity is derived from outputs, identity is a property of what the system produced. Productions change across runs, environments, and configurations. Therefore, the identity changes. A changing identity is a contradiction in terms.
The constraint is formalized in Same Input, Same Identity. The same declared execution must yield the same identity. Output-based identity makes identity dependent on outputs. If the same declared execution produces different outputs across runs — which is common in AI systems with stochastic components, temperature parameters, or external data dependencies — then the same declared execution yields different identities. The constraint is violated not by implementation error but by design.
Verification Requirement
Independent Verification requires that any verifier can independently determine the identity of a declared execution and confirm it matches the claimed identity. Under deterministic identity, the verifier runs the deterministic function on the declared execution and compares the result. The function is specified. The input is specified. The output is determined. Verification is mechanical.
Under output-based identity, the verifier must first reproduce the outputs. For AI systems, reproducing outputs requires the same model weights, the same random seeds, the same hardware, the same software stack, and the same external data state. Any difference in these factors may produce different outputs. Different outputs may yield different identity. The verifier cannot confirm identity without first confirming that all environmental factors are identical — a requirement that is impractical for independent verification and impossible for adversarial verification. See Verification Requires Determinism.
The verification problem with output-based identity is not merely practical. It is structural. Even if a verifier could perfectly reproduce all outputs, the verification would confirm that the outputs match, not that the identity is correct. Output matching is output verification. It is not identity verification. Identity verification requires confirming that the identity was correctly assigned to the declared execution. If identity is derived from outputs, there is no assignment to verify — only a derivation to reproduce. Reproducing a derivation confirms the derivation, not the identity.
Failure Modes
- Output variation across runs: An AI system executes the same declared execution twice. Due to stochastic components, the outputs differ. Output-based identity derives different identities from the different outputs. The same declared execution now has two identities. This directly violates the requirement that the same declared execution yields the same identity every time.
- Environmental dependence: The system produces outputs that depend on external state — API responses, database contents, network conditions, system time. The same declared execution in different environments produces different outputs. Output-based identity assigns different identities in different environments. Identity becomes a function of the environment, not the declared execution.
- Output collision: Two different declared executions produce identical outputs. Under output-based identity, they receive the same identity. But they are different executions. They should have different identities. Output-based identity cannot distinguish between different executions that happen to produce the same results. Identity that cannot distinguish different things is not identity.
- Temporal instability: The system's model is updated. The same declared execution now produces different outputs than it did before the update. Output-based identity assigns a new identity to the same declared execution. The execution has not changed. The identity has changed because the model changed. Identity that changes when the model changes is model-dependent, not execution-dependent.
- Observation dependence: Different observers may capture different subsets of outputs, or measure outputs at different times during execution. Partial observation produces partial identity. The identity becomes a function of what was observed, when it was observed, and by whom. Identity that depends on the observer's vantage point is not a property of the thing being observed.
These failure modes share a common root: output-based identity makes identity contingent on factors downstream of the declared execution. Anything downstream can vary. When downstream factors vary, identity varies. When identity varies, there is no identity. See Why Output-Based Identity Fails and Post-Hoc Reconstruction Is Invalid for complete analysis of these structural failures.
Why Invalid Models Fail
- Probabilistic identity assigns identity through statistical likelihood. Probabilistic systems often evaluate outputs to determine which identity is most likely. The likelihood is computed from output features. This makes probabilistic identity a variant of output-based identity in practice, combining the failures of both: the identity is both probabilistic and output-derived. See Why Probabilistic Identity Fails.
- Approximate identity substitutes closeness for exactness. When outputs are compared for identity, approximation is frequently used because exact output matching is rare in AI systems. The combination of output-based derivation and approximate matching compounds the invalidity: identity depends on outputs that vary, measured by metrics that are evaluator-chosen, within thresholds that are arbitrary.
- Output-based identity is the subject of this comparison. It reverses the causal direction of identity assignment by deriving identity from results rather than assigning it to declared execution. Identity that follows from outputs is a consequence, not a property. Consequences vary. Properties must not.
- Similarity-based identity measures distance between outputs to declare identity. This is output-based identity with explicit distance computation. Two systems are declared identical because their outputs are similar. But similar outputs from different executions do not establish shared identity. They establish output resemblance. Resemblance is not identity. See Deterministic vs Similarity-Based Identity.
- Confidence-based identity assigns identity when confidence in the output analysis exceeds a threshold. This layers an evaluator-dependent confidence parameter on top of output-dependent identity derivation. The identity depends on what was produced and how confident the evaluator is about interpreting what was produced. Neither factor is a property of the declared execution.
- Post-hoc reconstruction is the temporal variant of output-based identity. It examines outputs after execution to reconstruct what the identity must have been. This is output-based identity applied retrospectively. The reconstruction is neither assignment nor verification. It is narrative construction. Narratives about identity are not identity. See Post-Hoc Reconstruction Is Invalid.
- Observer-dependent identity varies with who performs the output analysis. Different observers may examine different outputs, use different analysis methods, or focus on different aspects of the results. Each observer may derive a different identity from the same execution. Identity that changes with the observer is observation, not identity.
- Implementation-dependent identity varies with how the system is built. Different implementations of the same declared execution may produce different outputs due to hardware, libraries, or configuration. Output-based identity would assign different identities to different implementations of the same execution. Identity must be a property of the execution specification, not the implementation.
- Evaluation-derived identity makes identity contingent on the evaluation methodology applied to outputs. Different evaluation methods extract different features from outputs and reach different identity conclusions. The identity is a function of how outputs are evaluated, not what was declared for execution.
Category Boundary
Deterministic identity and output-based identity operate in opposite causal directions. Deterministic identity flows forward: from declared execution to identity to execution to outputs. Output-based identity flows backward: from outputs to analysis to identity inference. The forward direction is assignment. The backward direction is reconstruction. Assignment is deterministic and stable. Reconstruction is contingent and variable. These are not two approaches to the same problem. They are two different operations entirely. One produces identity. The other produces claims about identity that depend on what was observed and how it was interpreted.
This category boundary means that systems built on output analysis cannot be retrofitted into identity systems by improving the analysis. Better output analysis produces better classification, better matching, better monitoring. It does not produce identity. The category transition requires replacing output-derived assignment with execution-declared assignment. This is an architectural change, not an optimization. Systems that perform this transition move from the category of output analysis to the category of identity. Systems that do not remain in the category of output analysis regardless of their accuracy. See Identity vs Similarity for additional treatment of categorical boundaries.
Logical Inevitability
Apply the chain to output-based identity. If output-based identity is identity, then output-based identity must be independently verifiable. For output-based identity to be independently verifiable, two verifiers must derive the same identity from the same declared execution. But output-based identity requires first reproducing the outputs, and independent verifiers operating in different environments may produce different outputs for the same declared execution. Different outputs yield different derived identities. Therefore, output-based identity is not independently verifiable. Therefore, output-based identity is not identity.
Even in the idealized case where outputs are perfectly reproducible, the logical problem persists. If outputs are perfectly reproducible and the derivation from outputs to identity is deterministic, then the composition of execution and derivation is itself a deterministic function from declared execution to identity. In that case, the output-based system has accidentally implemented deterministic identity through an unnecessarily indirect path. The outputs are an intermediate computation, not the basis of identity. The system works not because it is output-based but because the end-to-end process happens to be deterministic. This confirms rather than refutes the requirement for deterministic identity.
Implications
For system architects: identity assignment must occur at the declared execution stage, before any output is produced. The architecture must separate identity assignment from output evaluation. These are different operations with different inputs and different guarantees. Identity takes a declared execution and produces a value. Output evaluation takes outputs and produces assessments. Combining them into a single step conflates causally distinct operations and produces a system that is neither a valid identity system nor a clear output evaluation system.
For compliance and governance: systems that derive identity from outputs do not meet the requirements for independently verifiable identity. Compliance frameworks must distinguish between identity systems and output-analysis systems. A system that says this execution has identity X because it produced output Y is making a claim that cannot be independently verified without reproducing Y, which may not be possible. Compliance verification requires deterministic identity assignment that does not depend on output reproduction. See Why Confidence-Based Identity Fails for related compliance implications.
For the broader AI ecosystem: the prevalence of output-based identity in current systems reflects a fundamental architectural confusion between what a system does and what a system is. What a system does — its outputs — varies across runs, environments, and configurations. What a system is — its declared execution and assigned identity — must not vary. Building identity on outputs builds identity on sand. Building identity on declared execution builds identity on bedrock. The transition from output-based to deterministic identity is not an incremental improvement. It is a category correction that places identity on a structurally sound foundation. See Deterministic vs Approximate Identity for a parallel comparison addressing a different form of structural invalidity.
Frequently Asked Questions
What is output-based identity?
Output-based identity is a model in which identity is derived from the results a system produces rather than from the declared execution it was assigned. Under this model, the system runs, produces outputs, and then the outputs are examined to determine identity. This reverses the correct causal order. Identity must be assigned to declared execution before output evaluation, not inferred from outputs after the fact.
Why can outputs not serve as the basis for identity?
Outputs are consequences of execution. They are downstream of identity, not upstream. The same declared execution always has the same identity regardless of what outputs it produces. Conversely, different declared executions may produce identical outputs. If identity is derived from outputs, then different executions that produce the same output would share identity, which is incorrect. Identity distinguishes executions, not outputs.
Can output-based identity work if the outputs are always the same?
Even if a system reliably produces the same outputs for the same inputs, deriving identity from those outputs reverses the causal chain. The question is not whether the derivation produces consistent results. The question is whether the derivation is structurally valid. Identity must be assigned to declared execution, not derived from outputs. A system that consistently derives the same identity from the same outputs is a consistent classification system. It is not an identity system.
How does output-based identity differ from post-hoc reconstruction?
They are closely related. Output-based identity derives identity from results. Post-hoc reconstruction infers identity after execution by examining what happened. Both reverse the temporal order that valid identity requires. The distinction is emphasis: output-based identity focuses on the content of outputs, while post-hoc reconstruction focuses on the timing of the inference. Both are invalid for the same structural reason — identity must exist before output evaluation begins.
What should systems use outputs for if not identity?
Outputs serve many valid purposes: quality assessment, compliance checking, performance measurement, debugging, and user satisfaction evaluation. These are all legitimate uses of output analysis. What outputs cannot do is establish identity. The identity of a declared execution is fixed by the deterministic identity function. Outputs may confirm that the execution ran as expected, but they do not define what the execution is.
Is deterministic identity impractical because it ignores what systems actually do?
Deterministic identity does not ignore outputs. It assigns identity to declared execution independently of outputs. The system still produces outputs. Those outputs are still evaluated. But the identity assignment is not contingent on the outputs. This separation is not impractical — it is architecturally sound. It means identity is stable even when outputs vary, which they will across different runs, configurations, and environments. Stable identity in the face of output variation is a feature, not a limitation.