How the register is built.
Sources, ingestion pipeline, source-of-truth principle, recipient grouping, and the discipline behind every fact on every page.

Two ways into the register
Anyone can search the open part of the register without an account. Solicitors, paralegals, and trained helpline volunteers can additionally run the deep research flow, which combines AI-assisted disambiguation with a referenced report drawn from seven public sources.
Open search covers the organisations index, HSE enforcement notices, court venues and inquest coverage where in scope, direct links out to the original source for every entry, and bulk CSV download under CC BY 4.0 where applicable. No account, no login, no rate limit.
Gated research is for solicitors and helpline staff. When a claimant remembers a company name from decades ago, the flow disambiguates which legal entity they mean through a short Haiku 4.5 interview (up to four questions), then assembles a referenced dossier across Companies House, The Gazette, reported case law, the FCA Register and FSCS, the British Newspaper Archive and Internet Archive, and run-off insurer acquisitions. Every finding is a candidate match — verify against its source URL before relying on it.
The research flow is in alpha for partner solicitors and Mesothelioma UK helpline staff. Public access for self-represented claimants is gated behind the data-protection work in Phase E.
Sources
Live: HSE enforcement notices — fetched from the official register, archived byte-for-byte to private storage with SHA-256 content addressing, and parsed into the public dataset.
Planned: HSE prosecutions, ONS mortality data, Companies House successor records, archived news investigations, BAILII judgments. Each new source ships with its own per-source page documenting cadence, archival policy, and parsing approach.
Source-of-truth principle
The original source is canonical for every fact. AsbestosIQ does not edit, paraphrase, or infer beyond what the source published. Generated narrative paragraphs on notice and recipient pages are template-driven against the structured fields — never freehand. If the source is silent on a fact, the register is silent too.
Ingestion pipeline
- Fetch from the official source URL on a published cadence.
- Archive the raw page byte-for-byte to private storage with a SHA-256 content address.
- Parse and validate structured fields against an explicit schema.
- Write to the public register database; emit an idempotent change log.
- Surface on the public site within minutes of archival, with the source link visible on every page.
String-grouped recipients (current limit)
Until Companies House identifiers land in Phase B, recipient pages group notices by exact recipient-name match. Two unrelated companies sharing a name will currently appear together under one recipient page; a permanent caveat explains this on every affected page. Cross-check Companies House before relying on a recipient page for due diligence.
Takedown SLA & right of reply
The takedown and right-of-reply process is documented in the legal notice. Takedowns against the underlying source must be raised with the source operator (e.g. HSE for HSE notices); AsbestosIQ mirrors any source withdrawal in the register within 7 days of detection.
Open source & reproducibility
The full source code is published under AGPL-3.0. The methodology, schema, and fuzzy-matching pipeline are all documented in the public repository. Independent re-derivation of the dataset from the underlying sources is supported and welcomed.
Future sources roadmap
The full source-by-source roadmap, with target dates and per-source dependencies, is maintained in the public project planning. Each new source will publish its own methodology page following the same discipline as this one.