Benchmark

Benchmark proof

Live benchmark view for insurance extraction schemas. Thirty-four schemas now have scored diagnostic real-public-document results on named corpora; future lines publish with the same evidence and release-gate discipline.

Diagnostic Snapshot

Accuracy result by completed schema.

Commercial GL, Commercial Auto, Workers Compensation, Commercial Property, Umbrella / Excess, Professional Lines, Cyber Liability, Directors and Officers, Employment Practices Liability, Crime, Inland Marine, Cargo, Environmental / Pollution, Surety / Bonds, Builders Risk, Personal Auto, Homeowners, Renters, Condo, Personal Umbrella, Life, Health, Disability, Travel, Pet, COI, Policy Declaration, Endorsement, Schedule, Binder, Quote, Renewal, Cancellation, Application, and First Notice of Loss now show scored diagnostic results from named real-public-document corpora. Public 99% claims still require larger corpora and the release gate.

Commercial GL benchmark

Limits, entities, additional insured posture, waiver, primary/non-contributory, and endorsement effects.

Scored diagnostic snapshotCommercial GLState: diagnosticShip bar: passed
Accuracy Result100%4/4 documents passed - diagnostic
Corpus Docs4
Evaluated4
Required Fields100%12 scored fields
Evidence Score100%12 evidence checks
Exact Match100%16 exact checks

Current scored diagnostic result for Commercial GL: 100% on 4 real public documents. This is a small proof set, not an insurance-wide accuracy claim, and it is not labeled published until the full release gate is cleared.

Public release gate: pass rate >= 99%, required fields >= 99%, evidence >= 99%, deferred docs = 0.

Wake County Commercial General Liability Declarations (Commercial General Liability view)

Source URL

Portella Bellisimo Unit 2 Homeowners Association General Liability Policy (Commercial General Liability view)

Source URL

Greenhills Homeowners Association Commercial General Liability Policy (Commercial General Liability view)

Source URL

Sean Scarmack Commercial General Liability Policy (Commercial General Liability view)

Source URL
Benchmark Rules

What appears here.

Publication

Diagnostic before published

A scored diagnostic result can appear before a schema is marked published. Published accuracy still requires the full release gate.

1.1

Schema-specific proof

Each line publishes independently. One schema passing does not imply another schema has cleared the same bar.

1.2

Scoring

99% release gate

A schema does not publish as passed until document pass rate, required fields, and evidence checks clear the 99% bar.

2.1

Evidence matters

A value without evidence is not treated like a defended extraction. Evidence quality stays part of the release bar.

2.2

Boundaries

No blended insurance number

Accuracy is tracked by schema, document profile, and required-field applicability.

3.1

Real insurance documents

The current hardened proof sets use real public documents. The broader 99% claim requires larger gold-labeled corpora by schema family.

3.2
Core Lines

The first thirty-five proof-gated hardening tracks.

Commercial GL, Commercial Auto, Workers Compensation, Commercial Property, Umbrella / Excess, Professional Lines, Cyber Liability, Directors and Officers, Employment Practices Liability, Crime, Inland Marine, Cargo, Environmental / Pollution, Surety / Bonds, Builders Risk, Personal Auto, Homeowners, Renters, Condo, Personal Umbrella, Life, Health, Disability, Travel, Pet, COI, Policy Declaration, Endorsement, Schedule, Binder, Quote, Renewal, Cancellation, Application, and First Notice of Loss are hardened named-corpus schemas. The remaining tracks are active hardening targets and cannot publish accuracy until their own evidence gates clear.

01

Commercial GL

Limits, entities, additional insured posture, waiver, primary/non-contributory, and endorsement effects.

02

Commercial Auto

CSL, symbols, hired/non-owned signals, deductibles, and fleet-facing coverage structure.

03

Workers Comp

Statutory states, employers liability limits, mod factors, and class-code related signals.

04

Umbrella / Excess

Occurrence, aggregate, retention, follow-form posture, and underlying schedule references.

05

Commercial Property

Building, BPP, business income, deductibles, valuation basis, and causes-of-loss signals.

06

Professional Lines

Professional liability, cyber, D&O, pollution, claims-made posture, retro dates, and sublimits.

07

Cyber

Cyber aggregate limits, each-claim/event limits, retentions, retroactive dates, and breach-response signals.

08

D&O

Aggregate limits, Side A posture, retentions, continuity dates, and claims-made evidence.

09

EPLI

Each-claim and aggregate limits, retentions, continuity dates, and claims-made evidence for EPLI schedules.

10

Crime

Employee theft, computer fraud, forgery, funds transfer fraud, deductibles, and crime/fidelity schedules.

11

Inland Marine

Contractor's equipment, equipment floaters, transit limits, installation risk, scheduled property, and deductibles.

12

Cargo

Ocean cargo, marine cargo, stockthroughput, shipment limits, transit deductibles, policy numbers, and territory signals.

13

Environmental

Contractors pollution liability, pollution legal liability, premises pollution liability, per-pollution-condition limits, aggregate limits, retentions, and retroactive-date language.

14

Surety

Bond numbers, penal sums, obligees, principals, surety names, effective dates, and performance/payment/reclamation bond evidence.

15

Builders Risk

Project limits, soft-cost limits, transit and temporary-storage limits, deductibles, valuation basis, and completed-value/all-risk form evidence.

16

Personal Auto

Policy numbers, policy periods, bodily injury and property damage limits, UM/UIM limits, and vehicle schedules from declarations and public court packets.

17

Homeowners

Coverage A dwelling, other structures, personal property, loss of use, personal liability, medical payments, deductibles, and policy periods from homeowners declarations.

18

Renters

Personal property, loss of use, personal liability, medical payments, deductibles, and policy periods from renters declarations.

19

Condo

HO-6 / condominium unit-owner Coverage A/C, loss assessment, liability, medical payments, deductibles, and policy period evidence.

20

Personal Umbrella

Umbrella liability limits, policy periods, retained limits, and underlying insurance requirements from declarations and public court records.

21

Life

Policy numbers, issue dates, face amounts, insured names, and beneficiary evidence from life policies and public court records.

22

Health

SBC plan names, covered-member tiers, overall deductibles, and out-of-pocket maximums from public health plan documents.

23

Disability

Long-term disability monthly benefit amounts, elimination periods, and maximum benefit-period evidence from public certificates and summaries.

24

Travel

Travel insurance trip-cost, medical-limit, cancellation-limit, and coverage-effective trigger evidence from public travel protection documents.

25

Pet

Pet insurance annual policy limits, reimbursement percentages, deductibles, and insured-pet names from public policy packets.

26

COI

Certificate producer, insured, certificate-holder, and coverage-row extraction from real public ACORD certificate packets.

27

Declaration

Declaration-page policy numbers, named insureds, policy periods, and limit evidence from real public policy evidence documents.

28

Endorsement

Endorsement form numbers, edition dates, effective dates, and changed-term titles from real public policy packages.

29

Schedule

Schedule extraction for policy numbers, schedule types, scheduled rows, and form-or-limit schedule evidence from real public policy packages.

30

Binder

Binder extraction for binder numbers, effective and expiration dates, and bound coverages from real public insurance binders.

31

Quote

Quote extraction for quote numbers, quoted premiums, proposed limits, and subjectivities from real public quote schedules.

32

Renewal

Renewal extraction for renewal terms, expiring policy numbers, renewal premiums, and change-condition evidence from real public renewal invites.

33

Cancellation

Cancellation extraction for policy numbers, effective cancellation dates, and stated cancellation reasons from real public cancellation notices.

34

Application

Application extraction for applicants, operations descriptions, exposure statements, and requested coverages from real public application records.

35

FNOL

First Notice of Loss extraction for loss dates, claimant names, loss locations, and claim descriptions from real public claim notice records.

Methodology

Honest, bounded benchmark claims.

No single blended number

PolDex does not publish a single blended "insurance accuracy" number. Accuracy is tracked by schema, by document profile, and by required-field applicability.

Public corpus target

The current Commercial GL diagnostic set is the first Worker-reachable proof set. The broader corpus already has 11 gold-labeled real public documents, and each core schema still targets 50 real public or approved documents before it can carry a broader published accuracy claim.

Proof before integration

Use Live Proof to inspect a real extraction, then use Docs to inspect the underlying API.

Validate the product directly.

Run live proof, inspect schema contracts, then initialize access when you are ready.