The audit workflow¶

This is the provenance and audit workflow end to end. You run an analysis, record its provenance digest next to the number you report, export the full result as JSON, then verify and trace that number back to the raw data later. Every cell below runs, so the digests, the lineage, and the True/False from verification are printed as real output, not pasted in.

Run in Colab View on GitHub Download notebook

Install it first (skip this if mfgQC is already in your environment):

In [1]:

Copied!

!pip install mfgqc
!pip install mfgqc

ERROR: Could not find a version that satisfies the requirement mfgqc (from versions: none)
ERROR: No matching distribution found for mfgqc

The example¶

We use a small, strictly-positive dataset and apply a Box-Cox transform, so the lineage has something interesting in it. The seed is fixed, so the digests this notebook prints are reproducible run to run.

In [2]:

Copied!





import json
import dataclasses as dc
import numpy as np, pandas as pd, mfgqc

rng = np.random.default_rng(11)
df = pd.DataFrame({
    "cycles": np.round(rng.lognormal(mean=1.2, sigma=0.35, size=80), 3),
})

qc  = mfgqc.load(df, measure="cycles").spec(lower=0.5, upper=12.0)
cap = qc.transform("boxcox").capability()
cap
import json
import dataclasses as dc
import numpy as np, pandas as pd, mfgqc

rng = np.random.default_rng(11)
df = pd.DataFrame({
    "cycles": np.round(rng.lognormal(mean=1.2, sigma=0.35, size=80), 3),
})

qc  = mfgqc.load(df, measure="cycles").spec(lower=0.5, upper=12.0)
cap = qc.transform("boxcox").capability()
cap

Out[2]:

Process Capability (method=normal)
==================================
n = 80   mean = 1.7437
sigma (within)  =   n/a
sigma (overall) = 0.57318
Cp/Cpk sigma    = overall

Cp  = 3.344  95% CI (2.82, 3.86)
Cpk = 0.7233  95% CI (0.589, 0.858)   (Cpu=5.965, Cpl=0.7233)
Pp  = 3.344    Ppk = 0.7233   (Ppu=5.965, Ppl=0.7233)
Cpm =   n/a

Assumption checks:
  [PASS] normality (Anderson-Darling): AD=0.343, p=0.481; est. Cpk impact 15.1%; n=80

1. Run the analysis and read its lineage¶

Every result carries the full chain of operations that produced it. lineage() returns one dict per step. Pull the operation names to see the shape of the computation:

In [3]:

Copied!

[s["operation"] for s in cap.lineage()]
[s["operation"] for s in cap.lineage()]

Out[3]:

['load', 'spec', 'transform', 'capability', 'assumption:normality']

That is the whole derivation: the frame was loaded, spec limits were attached, the measure was Box-Cox transformed, capability was computed, and a normality assumption check ran. Nothing happened that is not on this list.

2. Record the digest when you report the number¶

When you write the reported value down (into a report, a LIMS, a Certificate of Analysis), capture the provenance digest next to it:

In [4]:

Copied!

digest = cap.provenance_digest()
print(digest)
digest = cap.provenance_digest()
print(digest)

7cb845af09aa053b023f88fb972d8901ee1d6eaca6123919eca0b7ffd8279a07

That SHA-256 string pins the computation that produced the number: the operations, their parameters (including the fitted Box-Cox lambda), and how many rows each step touched. The timestamp is deliberately not in the digest, so it is reproducible run to run.

Store the digest as a sibling field of the reported value, not instead of it. The digest is a fingerprint, not the data. Keeping it next to the reported Cpk gives anyone re-deriving the number later something to check against.

3. Export the full result as JSON¶

to_dict() is the canonical payload. It carries the fields, the flat summary, the assumption checks, and the lineage plus the digest: everything a downstream report builder needs, with no report() text to parse.

In [5]:

Copied!

d = cap.to_dict()
list(d.keys())
d = cap.to_dict()
list(d.keys())

Out[5]:

['result_type',
 'title',
 'summary',
 'fields',
 'assumptions',
 'history',
 'provenance_digest']

The two provenance keys are history (the lineage, each step carrying its running digest) and provenance_digest (the head digest from step 2). They are the same digest you recorded above, stamped into the export by construction:

In [6]:

Copied!

print("provenance_digest:", d["provenance_digest"])
print("matches step 2:   ", d["provenance_digest"] == digest)
print("history step keys:", list(d["history"][0].keys()))
print("provenance_digest:", d["provenance_digest"])
print("matches step 2:   ", d["provenance_digest"] == digest)
print("history step keys:", list(d["history"][0].keys()))

provenance_digest: 7cb845af09aa053b023f88fb972d8901ee1d6eaca6123919eca0b7ffd8279a07
matches step 2:    True
history step keys: ['operation', 'params', 'n_affected', 'digest']

The transform step in history shows that the fitted lambda and its confidence interval are recorded in the provenance, not buried in a log:

In [7]:

Copied!

transform_step = next(s for s in d["history"] if s["operation"] == "transform")
print(json.dumps(transform_step, indent=2))
transform_step = next(s for s in d["history"] if s["operation"] == "transform")
print(json.dumps(transform_step, indent=2))

{
  "operation": "transform",
  "params": {
    "method": "boxcox",
    "lambda": 0.5379697633151592,
    "lambda_ci": [
      -0.14218010515761142,
      1.2255339555066445
    ]
  },
  "n_affected": 80,
  "digest": "d64b236be9447d2bfa7672f3f56b9e12fda50bf93daad8f5406cfe0948535125"
}

The assumption checks ride along too. Here is the normality check that justifies the normal-method capability:

In [8]:

Copied!

print(json.dumps(d["assumptions"][0], indent=2))
print(json.dumps(d["assumptions"][0], indent=2))

{
  "name": "normality",
  "test": "Anderson-Darling",
  "statistic": 0.3432987956436193,
  "p_value": 0.4812437156608955,
  "passed": true,
  "magnitude": 0.15081585539601827,
  "magnitude_label": "est. Cpk impact",
  "reliability": "ok",
  "n": 80,
  "recommendation": null
}

Write it to a file and you have a self-describing, archivable record. The provenance_digest stamped into the file equals the digest you reported, so the export and the reported number agree by construction.

In [9]:

Copied!





import pathlib
payload = json.dumps(cap.to_dict(), indent=2)
pathlib.Path("result.json").write_text(payload)
print("wrote result.json,", len(payload), "bytes")
import pathlib
payload = json.dumps(cap.to_dict(), indent=2)
pathlib.Path("result.json").write_text(payload)
print("wrote result.json,", len(payload), "bytes")

wrote result.json, 3494 bytes

Frontends and report builders should consume to_dict() (or the flat summary()), never parse report() text. The JSON is the stable contract; the text report is for humans.

4. Verify later¶

Months later, someone reopens the archived result (or recomputes it from the same inputs) and checks it against the digest you recorded:

In [10]:

Copied!

cap.verify_provenance(digest)
cap.verify_provenance(digest)

Out[10]:

True

verify_provenance(expected) recomputes the digest over the current history and compares it to the one you pass in. True means the recorded computation is intact.

Tamper-evidence, demonstrated honestly¶

The chain is tamper-evident: changing the operation, params, or n_affected of any recorded step changes the head digest, so verification fails.

The result and its history are frozen, so there is no in-place edit to make. To show this we construct an altered copy with dataclasses.replace. We are not mutating the original cap; we build a new object whose recorded transform step has its fitted lambda bumped by 1.0, then verify that copy against the original digest.

In [11]:

Copied!





hist = list(cap.history)
for i, s in enumerate(hist):
    if s.operation == "transform":
        bad = dict(s.params)
        bad["lambda"] = bad["lambda"] + 1.0          # alter a recorded parameter
        hist[i] = dc.replace(s, params=bad)

tampered = dc.replace(cap, history=tuple(hist))      # a new, altered copy

print("original digest: ", digest)
print("tampered digest: ", tampered.provenance_digest())
print("verify tampered: ", tampered.verify_provenance(digest))
print("original intact: ", cap.verify_provenance(digest))
hist = list(cap.history)
for i, s in enumerate(hist):
    if s.operation == "transform":
        bad = dict(s.params)
        bad["lambda"] = bad["lambda"] + 1.0          # alter a recorded parameter
        hist[i] = dc.replace(s, params=bad)

tampered = dc.replace(cap, history=tuple(hist))      # a new, altered copy

print("original digest: ", digest)
print("tampered digest: ", tampered.provenance_digest())
print("verify tampered: ", tampered.verify_provenance(digest))
print("original intact: ", cap.verify_provenance(digest))

original digest:  7cb845af09aa053b023f88fb972d8901ee1d6eaca6123919eca0b7ffd8279a07
tampered digest:  e5e9dc334f030262e8cb9fc43f80e16bb0e78bc2f4978baf64a38193b3feb0db
verify tampered:  False
original intact:  True

One altered parameter, in one step, three steps deep, and the head digest moves and verification returns False. The original cap is untouched and still verifies True: we built a new object rather than editing it, because the history is append-only by construction.

5. Trace a number back to raw data¶

lineage() is the audit trail. Each step gives you its operation, its params, its n_affected, and the running digest folded in up to and including that step:

In [12]:

Copied!

for s in cap.lineage():
    print(s["operation"], "| n_affected:", s["n_affected"], "| digest:", s["digest"][:16], "...")
for s in cap.lineage():
    print(s["operation"], "| n_affected:", s["n_affected"], "| digest:", s["digest"][:16], "...")

load | n_affected: 80 | digest: a69636b71f0bddc7 ...
spec | n_affected: None | digest: 72c30c2486ecf8c3 ...
transform | n_affected: 80 | digest: d64b236be9447d2b ...
capability | n_affected: 80 | digest: 4aae3812003203cd ...
assumption:normality | n_affected: None | digest: 7cb845af09aa053b ...

Read it bottom-up to walk the reported number back to the raw frame. Each step's params records exactly what it did:

In [13]:

Copied!

for s in cap.lineage():
    print(s["operation"])
    print("   ", s["params"])
for s in cap.lineage():
    print(s["operation"])
    print("   ", s["params"])

load
    {'measure': 'cycles', 'roles': {}, 'units': None, 'subgroup_size': None, 'spec': {'lower': None, 'upper': None, 'target': None}}
spec
    {'lower': 0.5, 'upper': 12.0, 'target': None}
transform
    {'method': 'boxcox', 'lambda': 0.5379697633151592, 'lambda_ci': [-0.14218010515761142, 1.2255339555066445]}
capability
    {'method': 'normal', 'sigma_used': 'overall', 'cp': 3.343918554734314, 'cpk': 0.7232996917103978, 'pp': 3.343918554734314, 'ppk': 0.7232996917103978, 'cpm': None}
assumption:normality
    {'test': 'Anderson-Darling', 'passed': True, 'magnitude': 0.15081585539601827, 'reliability': 'ok', 'p_value': 0.4812437156608955, 'statistic': 0.3432987956436193}

So the reported capability was computed after a Box-Cox transform with the fitted lambda above, against spec limits [0.5, 12.0], on 80 loaded rows, and the normality check that justifies the normal-method capability is right there in the chain. No step is hidden, and each step's digest lets you confirm where in the chain a difference first appears.

The running digest also lets you cross-check intermediate state. The QCData after the transform exposes the same provenance surface, and its digest equals the transform step's running digest in the result's lineage:

In [14]:

Copied!





qct = qc.transform("boxcox")
transform_running = next(s["digest"] for s in cap.lineage() if s["operation"] == "transform")

print("QCData-after-transform digest:", qct.provenance_digest())
print("transform step running digest:", transform_running)
print("equal:                        ", qct.provenance_digest() == transform_running)
qct = qc.transform("boxcox")
transform_running = next(s["digest"] for s in cap.lineage() if s["operation"] == "transform")

print("QCData-after-transform digest:", qct.provenance_digest())
print("transform step running digest:", transform_running)
print("equal:                        ", qct.provenance_digest() == transform_running)

QCData-after-transform digest: d64b236be9447d2bfa7672f3f56b9e12fda50bf93daad8f5406cfe0948535125
transform step running digest: d64b236be9447d2bfa7672f3f56b9e12fda50bf93daad8f5406cfe0948535125
equal:                         True

lineage(), provenance_digest(), and verify_provenance() exist on both QCData and every result object. The trail is continuous from the loaded frame through to the final number.

What passing and failing verify actually mean¶

A passing verify_provenance() means the recorded result is intact: the archived analysis has not been edited since the digest was captured. A failing one means the history no longer matches, so something in the recorded chain changed.

What it does not do, on its own: it does not stop an actor who controls the Python interpreter at runtime from recomputing the whole analysis over fabricated inputs and stamping a fresh, self-consistent digest. The digest is a content hash, not a cryptographic signature. It defends against accidental corruption and post-hoc tampering with a stored result, not against an adversary who controls the process that produces it.

Closing that gap requires anchoring the head digest outside the process: signing it with a key the operator does not hold, or writing it to an append-only external log. That is out of scope for the core library and left to the deployment. The full scope statement is in Provenance model.

Next¶

Provenance model: the data model, the hash-chain algorithm, and the honest scope of the guarantee.
Reference: the formula, assumptions, and source standard behind every method, plus the full result surface.