Trust Infrastructure · High-Stakes AI Data

Prove your data is exactly
what you claim it to be.

ResEthiq is India’s first and only data veracity infrastructure. We make trust in AI measurable, cryptographic, and machine-verifiable - transforming datasets into proof-bearing assets that endure audit, regulation, and time.

5M⁺ rows per dataset, streaming

100% offline verification

0 server trust required

12^wk pilot readiness sprint

The Problem

"Trust me" is no longer
acceptable in high-stakes AI.

Regulators, auditors, and courts are asking questions most organizations can't answer. Where did the data come from? Was it altered? Can you prove what your model was trained on — exactly?

Patient harm from wrong AI decisions

Clinical AI trained on unvalidated data carries direct risk to human life. Integrity is a prerequisite, not a feature.

Failed regulatory submissions

FDA, EMA, and EU AI Act require data provenance, traceability, and reproducibility. Most pipelines cannot demonstrate this.

Uncontained legal liability

Courts require a chain of evidence for AI-driven decisions. Without it, exposure is unlimited and indefensible.

Silent dataset mutations

Without a sealed, signed dataset record, results cannot be reproduced — making scientific claims indefensible.

The Protocol

Four steps. One unbreakable
chain of evidence.

Freeze

Seals the exact dataset version into a cryptographic Merkle root — deterministic across every platform. Any modification, even a single character, is instantly detectable.

→ Immutable fingerprint

Examine

Forensic investigation: duplicate records, train/test leakage, suspiciously perfect patterns, domain anomalies, and hidden preprocessing artifacts.

→ Structured findings

Enforce

Applies versioned domain-specific policy rules — separation, provenance, independence. Every failure recorded as a clear, defensible assertion.

→ Policy compliance

Certify

Outputs Approved / Restricted / Rejected — packaged as an Ed25519-signed Policy Object with full audit trail, independently verifiable offline by anyone.

→ Signed proof object

Use Cases

Where integrity is the difference
between safety and catastrophe.

Healthcare AI · Primary

A diagnostic AI flags a scan as benign. The patient is sent home. Who can prove what the model was trained on?

Without Resethiq, the training dataset, the checks run, and the policy applied are not retrievable in a legally defensible form. With Resethiq, the sealed evidence pack — frozen at pipeline approval — answers every question an auditor or court will ask.

$ resethiq-verify pack --bundle radiology_v2.zip

✓ Bundle signature: VALID

✓ Merkle root: MATCHES

✓ Dataset: radiology_v2.4-frozen

✓ Train/test leakage: NONE

✓ Policy: healthcare_v3.yaml

→ Outcome: APPROVED

Exit 0 · 2025-02-10T09:14:32Z

Clinical Trials · Pharma

An FDA submission requires proof that trial data was never altered post-collection.

Resethiq freezes data at collection, signs it, and generates an audit-ready evidence pack that travels with every regulatory submission — independently verifiable without system access.

Regulated AI Startups

Enterprise procurement requires proof of AI data governance before vendor approval.

Resethiq gives startups the compliance evidence layer that unlocks enterprise deals — transforming a procurement checkbox into a verifiable, trust-building asset.

Scenarios

Real operational moments.
Real consequences.

Walk through what Resethiq actually does at the critical decision points in high-stakes AI pipelines.

Scenario 01 · Pre-Training Integrity Gate

Your ML team is ready to train. But do you know what you're about to feed your model?

A healthcare AI team has prepared a 2.3M-row dataset from three hospital systems. Before training begins, Resethiq runs as a mandatory pipeline gate.

The Trust Kernel freezes the dataset — computing a Merkle root across every row and column. Any modification, even a single character, produces a different root and will be detected at verification time.

Forensic examination checks for duplicates across sources, train/test leakage, policy-defined medical domain constraints, and provenance violations. All rules pass. Training is authorized. The signed SPO is archived.

Result

Training authorized — signed SPO archived

The team can demonstrate, at any future point, exactly what data was used, under which policy, and that it passed every check at the time.

resethiq verify spo · Pre-Training Gate

── Resethiq Trust Kernel v1.4.2 ──

Dataset : patient_training_v3.parquet

Rows : 2,341,887

── Running Integrity Protocol ──

✓ Freeze dataset → merkle: a3f7c29d…

✓ Duplicate scan → 0 found

✓ Leakage check → CLEAN

✓ Domain validate → all in bounds

✓ Provenance → no forbidden sources

── Signing ──

✓ Ed25519 sig: COMPLETE

✓ Evidence pack: spo_pt_v3.zip

OUTCOME: APPROVED — safe to train

Exit 0 · 2025-02-17T08:43:11Z

Scenario 02 · Tamper Detection

A dataset was modified after signing. Resethiq catches it — even if one byte changed.

During a routine audit, a compliance officer asks to verify the training dataset against the archived SPO. The file appears identical — same filename, size, and metadata.

But Resethiq recomputes the Merkle root from raw bytes and compares against the signed root. They do not match.

This is not a warning. It is a hard, signed, verifiable fact — the dataset was modified after the signing event. Resethiq exits code 2 (Mismatch). The evidence is preserved in the Evidence Ledger permanently.

Result

Tamper detected — investigation initiated

The compliance team has cryptographic proof of post-signing modification, enabling precise investigation.

resethiq verify spo · Tamper Audit

── Recomputing Root ──

→ Computed root: 7c4d1a9f8e2b…

→ Signed root: a3f7c29d14b8…

✗ ROOT MISMATCH DETECTED

computed ≠ signed

Dataset was modified after signing

→ Original signed: 2025-01-14T11:22:00Z

→ Modification Δ: +18 days, 4h

OUTCOME: MISMATCH — tamper detected

Exit 2 · 2025-02-01T15:07:44Z

Scenario 03 · Regulatory Submission

An FDA reviewer asks: can you prove your training data was exactly what you claimed, at the time you claimed?

A pharma company submitting a companion diagnostic AI to the FDA is asked to provide data integrity evidence. The reviewer wants a provenance chain — independently verifiable without accessing company systems.

The company retrieves the Resethiq Audit Bundle: a PDF summary, signed SPO, evidence pack ZIP, and public key — all self-contained.

The reviewer runs the Verifier CLI offline. No system access. No API. No trust required. The root recomputes identically. Signature validates. Submission complete.

Result

Regulatory submission strengthened

Cryptographic data integrity demonstrated to the regulator — independently verifiable, without sharing system access.

resethiq verify pack · FDA Bundle

── Offline Mode · No network required ──

✓ audit_report.pdf found

✓ signed_spo_v2.cbor found

✓ policy_digest.yaml found

✓ verifier_pubkey.pem found

✓ Signature: Ed25519 VALID

✓ Merkle: MATCHES

✓ Version: spec v1.4.2 — compatible

OUTCOME: VERIFIED — chain intact

Exit 0 · Offline · Zero server trust

Scenario 04 · Enterprise Procurement

A hospital risk committee requires proof of AI data governance before approving a vendor.

Enterprise procurement evaluates an AI diagnostic vendor. The questionnaire asks: "Can you provide data integrity evidence that our team can independently verify?"

Previously, vendors answered with attestations and SOC 2 reports — none independently verifiable at the data level. Resethiq changes the answer entirely.

The vendor shares their Audit Bundle. The security officer runs the Verifier CLI in under 5 minutes. Cryptographic evidence is clear. Vendor approved.

Result

Procurement approved — competitive differentiation

Resethiq becomes the integrity layer that unlocks enterprise deals, transforming compliance into a verifiable trust asset.

resethiq verify pack · Procurement

── Procurement Integrity Review ──

Vendor : DiagnosticAI Inc.

Model : CardioRisk v4.1

✓ Data frozen: 2024-11-03

✓ Integrity policy: cardiac_v3.yaml

✓ All policy rules: PASS

✓ Provenance: CLEAN

✓ Crypto proof: VERIFIED

Review time: 4m 22s

Server access req: None

OUTCOME: VENDOR APPROVED

Latest Research

The regulatory landscape for AI
data integrity is accelerating.

Perspective · Regulation

The EU AI Act and the coming data provenance mandate

High-risk AI systems must demonstrate data governance and traceability. Why dashboards won't be enough — and what will.

Technical · Architecture

Why Merkle trees are the right structure for dataset integrity proofs

The design decisions behind Resethiq's Trust Kernel — and why deterministic canonicalization across platforms is harder than it sounds.

Case Study · Healthcare

What FDA's guidance on AI/ML software actually requires from your training data

A plain-English breakdown of the FDA's PCCP framework and what "good machine learning practice" means for data integrity evidence.

Your AI pipelines need evidence. Not assurances.

Request a demo and see what Resethiq proves about your data — before a regulator asks first. We work with you through a structured 12-week pilot sprint.

We review every request. Resethiq pilots are for healthcare AI, pharma, and regulated AI teams. We'll respond within 2 business days.

Research

Perspectives on data integrity, regulation, and trust infrastructure.

Resethiq's research team publishes on the regulatory landscape, technical architecture, and real-world implications of data integrity enforcement for high-stakes AI.

Perspective Feb 2025

The EU AI Act and the coming data provenance mandate — what high-risk AI teams need to know now

Annex III of the EU AI Act creates mandatory data governance obligations for high-risk AI. We break down what "data provenance" means in practice, why existing monitoring tools fall short, and what verifiable integrity evidence actually looks like.

Technical Jan 2025

Why Merkle trees are the right structure for dataset integrity proofs — a design rationale

Resethiq's Trust Kernel uses a Merkle tree over canonicalized row bytes. We explain the design decisions: why canonicalization is harder than it looks, how determinism across macOS arm64 and Linux x86_64 is enforced, and why content-addressing enables offline verification.

Case Study Jan 2025

What FDA's guidance on AI/ML-based software actually requires from your training data

A plain-English breakdown of the FDA's PCCP (Predetermined Change Control Plan) framework, the GMLP guidance, and what "good machine learning practice" means for data integrity evidence in medical device submissions under 21 CFR 820.

Perspective Dec 2024

The difference between data quality tools and data integrity enforcement — why the gap matters in regulated AI

Data quality tools observe and alert. Data integrity enforcement decides and proves. This piece maps the conceptual and practical gap between the two — and explains why organizations relying on dashboards will not survive regulatory review.

Technical Nov 2024

Ed25519 signing for AI data integrity — why we chose it and how we implemented KMS abstractions

A technical walkthrough of Resethiq's cryptographic signing layer: the choice of Ed25519 over RSA, the canonical encoding invariant, KMS abstraction design, and how offline verification works without trusting Resethiq's servers.

Commitment

We are building trust infrastructure for the world's most consequential AI systems. That is a responsibility we take seriously.

Our Foundation

Integrity is not a feature. It is the product.

Resethiq exists because high-stakes AI has crossed a threshold. The systems we help certify touch patient lives, clinical decisions, regulatory submissions, and legal accountability. Our commitment is to the integrity of those systems — above all else.

Zero Server Trust by Design

Our Verifier CLI works 100% offline. We do not want you to trust us — we want you to verify independently. This is an architectural commitment, not a marketing claim. The math doesn't lie, even when humans do.

Permanent, Immutable Evidence

Every integrity decision is stored as a content-addressed, cryptographically sealed object. Resethiq cannot alter historical records. Neither can our customers. The ledger is the truth.

Sector-First, Not Scale-First

We deliberately start in healthcare and pharma — the domains where the cost of integrity failure is measured in human harm. We would rather do this right in one domain than do it carelessly across many.

Decisions, Not Warnings

Resethiq does not surface alerts for humans to interpret later. It makes enforceable decisions — Approved, Restricted, or Rejected — backed by verifiable proof. Vagueness is a form of negligence we refuse to practice.

Architectural Invariants

The non-negotiable
hard gates in our system.

These invariants are enforced in CI/CD and architectural reviews. They cannot be bypassed — not by customers, not by Resethiq engineers, not by anyone.

INVARIANT A

Determinism

Same (Dataset + Spec + Kernel) → Identical Merkle Root across macOS arm64 and Linux x86_64. Cross-platform parity is verified in CI on every commit.

INVARIANT B

Canonical Encoding

Signed payload bytes are authoritative. JSON is a non-authoritative mirror. What was signed is what counts — always.

INVARIANT C

Zero Server Trust

The Verifier CLI must function 100% offline using only the evidence pack and dataset. No Resethiq server required. No Resethiq account required. No trust required.

INVARIANT D

Version Discipline

Every change to canonicalization, hashing, or encoding triggers a version bump and digest update. Historical evidence is never silently invalidated by software changes.

Learn

Everything you need to understand how Resethiq works.

What is Resethiq?

Resethiq is a data integrity enforcement layer for high-stakes AI pipelines. It acts as a security checkpoint for data before it enters any sensitive workflow — training, validation, clinical studies, or regulated approval.

Before data is allowed to train an AI model, support a clinical claim, or be deployed into regulated production, Resethiq performs four fundamental operations: it freezes the dataset, examines it forensically, applies strict policy rules, and produces a signed, enforceable outcome.

"Resethiq makes sure only trustworthy data is allowed to power high-stakes AI — and proves it beyond doubt."

The core insight is that existing tools observe and warn; Resethiq enforces and proves. A dashboard can tell you something looks wrong. Resethiq can prove what was right, under which policy, at a specific point in time — in a form that survives legal and regulatory scrutiny.

Step 1: Freeze — Locking the Dataset

Resethiq seals the dataset so everyone knows exactly which version was checked. This prevents silent updates, last-minute swapping, and post-hoc manipulation.

Technically, the Trust Kernel transforms raw data (CSV, Parquet, or Arrow format) into a canonical byte representation — a deterministic encoding that produces identical output on macOS arm64 and Linux x86_64. From these canonical bytes, it computes a Merkle root: the dataset's unique cryptographic fingerprint.

"Think of this as sealing evidence in a legal case, or notarizing a document so nobody can claim it was different later. Except the Merkle root is mathematically verifiable by anyone."

Once frozen, any change to the dataset — even a single character in a single field — produces a completely different Merkle root. This mismatch is detectable instantly at verification time, even years later, without access to Resethiq's systems.

Step 2: Examine — Forensic Investigation

Resethiq looks for signals that the dataset may be unreliable or unsafe. This is not a data quality score or a prettified dashboard. It is a forensic investigation with a precise question: can this dataset be trusted for this purpose under this policy?

The examination detects:

Duplicated or recycled records across sources — common when datasets are assembled from multiple hospitals or trial sites without deduplication controls.

Suspiciously perfect patterns often seen in synthetic or fabricated data — statistical signatures that no naturally-collected dataset would exhibit.

Train/test leakage — records appearing in both training and test splits, which inflates model performance metrics without any real-world validity.

Distribution anomalies and schema inconsistencies — unexpected shifts in value distributions, mismatched column types, or implausible values indicating hidden preprocessing.

Every detected issue is recorded as a clear, structured assertion — not a vague warning — so that the policy enforcement step can make a precise, defensible decision.

Step 3: Enforce — Applying Policy Rules

Resethiq converts integrity requirements into explicit, testable constraints encoded in a versioned policy spec (YAML or JSON). These rules are domain-specific, not generic best practices.

Examples of policy rules Resethiq enforces:

Separation rules: training and test data must not share any records. Domain rules: medical values must stay within physiologically plausible ranges. Independence rules: clinical trial groups must satisfy statistical independence constraints. Provenance rules: forbidden or unlicensed data sources must not appear anywhere in the provenance chain.

"If a rule is broken, the failure is recorded as a clear, defensible assertion — not a vague warning. Vagueness is a form of negligence we refuse to practice."

Crucially, every SPO (Signed Policy Object) is bound to the exact policy digest that governed it. The rules that applied are permanently recorded alongside the outcome. An auditor can always see what rules were checked — and which version of those rules was in effect at the time.

Step 4: Certify — Signed, Enforceable Outcome

Resethiq produces one of three outcomes: Approved (safe to use), Restricted (use with constraints and monitoring), or Rejected (not permitted). This is not a recommendation — it is an enforceable gate wired directly into your CI/CD pipeline.

The outcome is packaged as a Signed Policy Object (SPO): a cryptographically signed, content-addressed record stored in the Evidence Ledger. The SPO contains the Merkle root, the policy digest, the Ed25519 signature, a timestamp, and the full audit trail.

This object can be shared with auditors, regulators, partners, or courts as verifiable proof — independently checkable offline by anyone with the Verifier CLI, without trusting any of Resethiq's infrastructure. That is the zero-server-trust guarantee.

Architecture Overview

Resethiq is built around four core components:

Trust Kernel (rk) — Rust: The core proof generation engine. Written in Rust for determinism and performance. Handles spec parsing, canonical transformation (CSV/Parquet → Arrow → Bytes), Merkle root computation with range/inclusion proofs, and Ed25519 signing with KMS abstractions. Processes 5M+ row datasets in streaming mode.

Verifier CLI (resethiq-verify): The independent credibility anchor. Functions 100% offline. Can recompute the Merkle root from raw data, validate Ed25519 signatures, and verify the complete evidence chain from a bundle ZIP. Exit codes: 0 (Pass), 2 (Mismatch), 3 (Invalid Signature), 4 (Version Mismatch).

Evidence Ledger: Postgres-indexed, content-addressed object store. Objects stored at path objects/<hash_prefix>/<hash>.cbor. The index is rebuildable from object storage alone — no single point of failure.

Audit Bundle Generator: Produces PDF audit reports and signed evidence pack ZIPs. Every output is bound to a versioned policy digest and includes everything needed for independent verification.

Who is Resethiq For?

Resethiq is built for organizations where data mistakes are unacceptable.

Primary users (high urgency, high stakes): Healthcare AI teams and hospitals deploying clinical decision support. Pharmaceutical companies and CROs running clinical trials. Regulated AI startups in diagnostics, risk scoring, and medical decision support. Data science teams working under compliance and audit requirements.

Secondary users (control and oversight): Internal and external auditors who need clear evidence chains. Risk, legal, and compliance teams who sign off on AI systems. Regulators and assessment bodies. Enterprise procurement committees evaluating AI risk.

"These buyers are not paying for better charts. They are paying for risk prevention and defensible proof."

Expansion markets: Finance and insurance risk models. Government datasets and policy decision pipelines. Large-scale enterprise AI training pipelines. Data marketplaces and model-sharing ecosystems where trust between parties is required.

News

Updates from Resethiq.

🛡️

Announcement February 2025

Resethiq Launches Pilot Programme for Healthcare AI Teams

We are opening our 12-week structured pilot programme to healthcare AI teams, clinical AI companies, and regulated AI startups. The pilot includes full Trust Kernel access, Verifier CLI, Evidence Ledger, and hands-on integrity engineering support. Applications are now open.

🔬

Research January 2025

Resethiq Trust Kernel v1.4.2 — Cross-Platform Determinism Milestone

Our Rust-based Trust Kernel now achieves verified identical Merkle roots across macOS arm64 and Linux x86_64 for datasets of 5M+ rows in streaming mode, passing all L0–L6 test layers.

📋

Perspective December 2024

We Published Our First Research Paper on AI Data Integrity Infrastructure

A technical perspective on the architectural patterns, cryptographic choices, and design trade-offs that make a data integrity enforcement system defensible in regulated environments.

🤝

Company November 2024

Resethiq Founded — Building Trust Infrastructure for High-Stakes AI

Resethiq was founded with a clear mission: make data integrity in AI pipelines verifiable, enforceable, and defensible. We're starting with healthcare and pharma, where the cost of getting it wrong is measured in human harm.

ResEthiq Pulse · Demo Request

See exactly what Resethiq
proves about your data.

Pulse is our hands-on pilot programme for healthcare AI teams, pharma companies, and regulated AI startups. We work with you through a structured 12-week integrity engineering sprint.

Full Trust Kernel access

Resethiq's Rust-based proof engine on your actual datasets

Verifier CLI + Evidence Ledger

Offline-capable verification and content-addressed storage

Domain-specific policy design

Healthcare, pharma, or finance rules crafted for your context

Tamper drill & audit bundle demo

Live demonstration of detection and evidence packaging

12-week pilot readiness sprint

Structured onboarding from specs to security-hardened deployment

Request a Pilot Demo

We review every application. Resethiq pilots are for organizations where data mistakes have serious consequences. We'll respond within 2 business days.

First name

Last name

Work email

Organization

Your role

Sector

Tell us about your use case

I agree to Resethiq's Privacy Policy and Terms of Service. I understand my information will be used to evaluate and respond to my pilot request.

Request received.

Thank you for your interest in ResEthiq Pulse. We review every application carefully and will respond within 2 business days to discuss your specific use case and pilot structure.

In the meantime, explore our and to deepen your understanding of how Resethiq works.

Prove your data is exactly what you claim it to be.

"Trust me" is no longeracceptable in high-stakes AI.

Patient harm from wrong AI decisions

Failed regulatory submissions

Uncontained legal liability

Silent dataset mutations

Four steps. One unbreakablechain of evidence.

Freeze

Examine

Enforce

Certify

Where integrity is the differencebetween safety and catastrophe.

A diagnostic AI flags a scan as benign. The patient is sent home. Who can prove what the model was trained on?

An FDA submission requires proof that trial data was never altered post-collection.

Enterprise procurement requires proof of AI data governance before vendor approval.

Real operational moments.Real consequences.

Your ML team is ready to train. But do you know what you're about to feed your model?

Training authorized — signed SPO archived

A dataset was modified after signing. Resethiq catches it — even if one byte changed.

Tamper detected — investigation initiated

An FDA reviewer asks: can you prove your training data was exactly what you claimed, at the time you claimed?

Regulatory submission strengthened

A hospital risk committee requires proof of AI data governance before approving a vendor.

Procurement approved — competitive differentiation

The regulatory landscape for AIdata integrity is accelerating.

The EU AI Act and the coming data provenance mandate

Why Merkle trees are the right structure for dataset integrity proofs

What FDA's guidance on AI/ML software actually requires from your training data

Your AI pipelines need evidence. Not assurances.

Perspectives on data integrity, regulation, and trust infrastructure.

The EU AI Act and the coming data provenance mandate — what high-risk AI teams need to know now

Why Merkle trees are the right structure for dataset integrity proofs — a design rationale

What FDA's guidance on AI/ML-based software actually requires from your training data

The difference between data quality tools and data integrity enforcement — why the gap matters in regulated AI

Ed25519 signing for AI data integrity — why we chose it and how we implemented KMS abstractions

We are building trust infrastructure for the world's most consequential AI systems. That is a responsibility we take seriously.

Integrity is not a feature. It is the product.

Zero Server Trust by Design

Permanent, Immutable Evidence

Sector-First, Not Scale-First

Decisions, Not Warnings

The non-negotiablehard gates in our system.

Determinism

Canonical Encoding

Zero Server Trust

Version Discipline

Everything you need to understand how Resethiq works.

What is Resethiq?

Step 1: Freeze — Locking the Dataset

Step 2: Examine — Forensic Investigation

Step 3: Enforce — Applying Policy Rules

Step 4: Certify — Signed, Enforceable Outcome

Architecture Overview

Who is Resethiq For?

Updates from Resethiq.

Resethiq Launches Pilot Programme for Healthcare AI Teams

Resethiq Trust Kernel v1.4.2 — Cross-Platform Determinism Milestone

We Published Our First Research Paper on AI Data Integrity Infrastructure

Resethiq Founded — Building Trust Infrastructure for High-Stakes AI

See exactly what Resethiqproves about your data.

Full Trust Kernel access

Verifier CLI + Evidence Ledger

Domain-specific policy design

Tamper drill & audit bundle demo

12-week pilot readiness sprint

Request a Pilot Demo

Request received.

Prove your data is exactly
what you claim it to be.

"Trust me" is no longer
acceptable in high-stakes AI.

Four steps. One unbreakable
chain of evidence.

Where integrity is the difference
between safety and catastrophe.

Real operational moments.
Real consequences.

The regulatory landscape for AI
data integrity is accelerating.

The non-negotiable
hard gates in our system.

See exactly what Resethiq
proves about your data.