Independent certification body

The accuracy standard for clinical AI.

MedAttest is the independent certification body for clinical AI accuracy. FACT certification — Fabrication, Accuracy & Completeness Testing — is the procurement standard for ambient AI medical scribes. Think SOC 2, but for what the AI writes in the chart.

134
Verified clinical facts per encounter battery
30
Adversarial fabrication traps per run
0
PHI on the platform. Ever.
MEDATTEST · CERTIFIED ACCURACYFACT v2.1 · ISSUED 2026
TierA
Fabrication < 1%0 severity-4 omissions
SHA8f3a · 02e1 · b774·VERSIONED
The procurement gap

Hospitals are buying ambient AI on the vendor's word.

There is no objective way to know whether the AI fabricates findings, omits critical clinical facts, or makes unsupported inferences. Vendors self-report accuracy on closed benchmarks. Buyers have nothing independent to point to.

FACT gives both sides a common, verifiable standard.

Vendor self-report

Self-reported benchmarks

Vendor-chosen test set. Vendor-defined scoring. No adversarial pressure. No independent review.

Independent

FACT certification

Third-party encounters with known ground truth, adversarial traps, severity-weighted scoring, and clinician adjudication.

How FACT works

A repeatable protocol, not a vibe check.

Every certification run is reproducible, severity-aware, and snapshotted against the exact FACT criteria version in effect.

  1. 01

    Controlled synthetic encounters

    MedAttest sends a battery of physician-authored synthetic patient encounters — primary care, cardiology, psychiatry — to the vendor's AI.

    134 verified clinical facts · 30 adversarial fabrication traps · 0 PHI

  2. 02

    Claim-by-claim grading

    Every assertion in the AI-generated note is extracted and scored against ground truth by a physician-calibrated AI judge: grounded, fabricated, or unsupported inference. Omission detection catches what the AI failed to document, weighted by clinical severity.

    Grounded / Fabricated / Unsupported · Severity 1–4 omission weighting

  3. 03

    Human clinician adjudication

    Low-confidence judgments are routed to licensed clinicians, whose rulings always override the AI judge. Severity-3 and 4 omissions and fabrications are always human-reviewed.

    Clinician override is final · Judge is continuously recalibrated

  4. 04

    FACT scoring and tiering

    Vendors receive a composite score and a FACT tier — A, B, C, or fail — against published criteria. FACT-A requires a fabrication rate under 1% and zero severity-4 omissions.

    Composite score · Tier A / B / C / Fail · Published criteria

  5. 05

    Public, procurement-ready attestation

    Certified vendors get a versioned public trust page and FACT badge, a shareable PDF attestation with a confidential claim-level appendix, and a CHAI-compatible JSON export.

    Public trust page · FACT badge · PDF + CHAI-compatible JSON

The scoring formula is public

No black box. No vendor adjustments.

Severity-aware by design — a missed drug allergy isn't scored like a missed social-history detail.

FACT composite scoredoc ref · FACT-MAT-01
composite = 1 0.6·fabrication 0.3·severity-weighted omission 0.1·unsupported inference
Fabrication
0.60
Weighted omission
0.30
Unsupported inference
0.10
FACT tiers

One score. Four outcomes. Published criteria.

FACT criteria are snapshotted into every certification run, so an attestation never silently changes meaning.

AFACT-A

Procurement-ready

  • Fabrication rate < 1%
  • Zero severity-4 omissions
  • Composite ≥ 0.97
BFACT-B

Compliant

  • Fabrication rate < 2.5%
  • ≤ 1 severity-4 omission
  • Composite ≥ 0.93
CFACT-C

Provisional

  • Composite ≥ 0.85
  • Requires elevated clinician oversight
  • Re-test in 90 days
FFACT-F

Fail — do not deploy

  • Fabrication rate ≥ 2.5%
  • Any severity-4 fabrication
  • Composite < 0.85
What FACT catches

Two AI scribes. Same battery. Very different chart.

[ FACT-A · Certified ]Vendor 092
0.9%

Fabrication rate

Severity-4 omissions
0
Severity-3 omissions
2
Adversarial traps caught
29 / 30
Composite score
0.978
[ Uncertified ]Vendor null
4.6%

Fabrication rate

Severity-4 omissions
3
Severity-3 omissions
14
Adversarial traps caught
11 / 30
Composite score
0.812

Same synthetic encounters. Same ground truth. The FACT report makes the delta visible to procurement, governance, and patient-safety committees before deployment.

Why FACT

Procurement-grade rigor, by construction.

01

Independent, not self-reported

Third-party testing against ground truth known only to MedAttest — not a vendor benchmark dressed up as one.

02

Built to catch hallucination

Adversarial traps in every encounter. Designed to surface fabrications that random sampling misses entirely.

03

Human-in-the-loop safety

AI judge for scale, licensed physicians for the calls that matter. The judge is continuously calibrated against clinician rulings.

04

Severity-aware

A missed drug allergy is not scored like a missed social-history detail. Clinical impact shapes the math.

05

Versioned and auditable

FACT criteria are snapshotted into every certification run, so an attestation never silently changes meaning.

06

Zero PHI

All test encounters are synthetic. Nothing sensitive ever touches the platform — not in transit, not at rest.

For AI vendors

Get FACT certified. Win hospital deals faster.

A FACT-A badge and a public trust page shorten procurement cycles. Stop answering one-off security and accuracy questionnaires for every health system — point them at your versioned attestation.

Start a certification run
For health systems

Require FACT certification before you deploy clinical AI.

Defensible purchasing, AI governance, and patient safety in one document. Use the vendor registry to compare FACT tiers, fabrication rates, and severity-weighted omission scores side by side.

Look up a vendor's FACT report
Begin certification

Know what your AI scribe gets wrong — before your clinicians do.