- Inference forkability § 4
- Can someone take the released model and run it?
- The ability to take a released model and execute it on hardware
that an independent party can assemble. Inference forkability is necessary
for any meaningful claim of openness, but it is not sufficient: a system
that can be run but not retrained gives users a tool, not accountability.
- Training forkability § 4
- Can someone reproduce or fundamentally alter the model?
- The ability to reproduce, retrain, or substantively modify a released
model from disclosed materials. Training forkability is what produces
accountability — only when the training pipeline can be re-run can the
community contest the choices encoded in the deployed system.
- Compute capture § 4
- Reproduction is illegal-free but practically out of reach.
- When training cost so far exceeds the compute the open community can
assemble that legal forkability becomes meaningless without practical
reproducibility. A model whose weights are released but whose training run
cost twenty million dollars is captured by compute regardless of license.
- Data capture § 4
- Training data can only come from infrastructure you do not run.
- When the data required to train a model can only be produced by
infrastructure the open community cannot reproduce — synthetic instruction
data, reasoning traces, distilled experts, or reward signals that flow from
frontier systems. Reproduction then requires reproducing the upstream
pipeline that produced the training signal, not just the small model itself.
- Hardware capture § 5
- Local inference is gated by closed silicon and runtimes.
- When local inference is the deployment topology but the substrate is
not addressable by the open community: proprietary neural processing units,
closed runtime layers, or gatekept distribution channels. Edge deployment
escapes cloud capture but remains within hardware capture if the silicon
and runtime are not themselves open.
- Action boundary § 7
- The deterministic policy layer outside the model that decides what is allowed.
- In an agentic system, the model proposes actions; the action boundary
decides which proposals are allowed. The boundary is a deterministic policy
layer that sits outside the prompt context, is inspectable, and is
machine-readable. The same transparency that helps operators specify what
agents may do helps defenders see when agents are being exploited.
- Harness § 7
- The architecture that turns a model into something agentic.
- A harness manages memory across steps, decomposes plans, selects
tools, parses outputs, recovers from errors, and orchestrates flow between
model inference and tool execution. A closed harness around an open model
produces a system whose behavior cannot be reproduced from the model alone.
Harness disclosure is held to the same standard as model disclosure.
- DPI · Deterministic Public Infrastructure Paper I
- Ledgers, registries, payment rails, and other deterministic public systems.
- In the corrigibility framework, the class of infrastructure whose
behavior is governed by deterministic rules — registries, ledgers, payment
rails, and similar systems where outcomes follow legibly from inputs.
Examined in Paper I.
- EPI · Epistemic Public Infrastructure Paper II
- Learned and agentic systems whose outputs shape what counts as known.
- In the corrigibility framework, the class of infrastructure whose
behavior is shaped by learning, by training data, and by the categorical
schemas the system inherits — AI systems, recommender systems, and the
learned components of administrative decision-making. Examined in Paper II.