Open Road Risk
  • Home
  • Project
    • Project overview
    • Current model status
    • AI-assisted development
  • Literature
    • Literature overview
    • Literature evidence register
    • Literature-pipeline alignment
    • Crash frequency models
    • Exposure and traffic volume
    • Spatial methods and network risk
    • Junctions and conflict structure
    • Severity modelling
    • Validation and metrics
    • Transferability and open data limits
  • Data Sources
    • Overview
    • STATS19 Collisions
    • OS Open Roads
    • AADF Traffic Counts
    • WebTRIS Sensors
    • Network Model GDB
  • Methodology
    • Methodology Overview
    • Joining the Datasets
    • Feature Engineering
    • Empirical Bayes Shrinkage
  • Exploratory Data Analysis
    • Collision EDA
    • Collision-Exposure Behaviour
    • Vehicle Mix Analysis
    • Road Curvature
    • Months and Days of Week
    • Traffic Volume EDA
    • OSM Coverage
  • Models
    • Modelling Approach
    • Stage 1a: Traffic Volume
    • Stage 1b: Time-Zone Profiles
    • Stage 2: Collision Risk Model
    • Facility Family Split
    • Model Inventory
  • Investigations
    • Investigations overview
    • KSI atlas diagnostic
    • Staffordshire data quality
    • Temporal descriptors evaluation
    • AADF counted-only filter
    • Rank stability harness
    • Zero-calibration diagnostic
  • Outputs
    • Top-risk map
  • Tools
    • ukgeo — UK Geocoder
  • Future Work

On this page

  • Question
  • Method
  • Result
  • Limitations
  • Revisit condition
  • Related artefacts

KSI Atlas Diagnostic

Pre-registered severity-reporting consistency check; Part A failed; investigation parked.

Can a standalone KSI count model produce a meaningfully different operational ranking from the all-injury model?

The Staffordshire flags surfaced by this investigation are a separate data-quality issue; see Staffordshire data quality for that finding.

Decision register entries: 2026-05-22 — Standalone KSI atlas — parked and 2026-05-23 — Adjusted Part A rerun — parking confirmed

Question

Can a separate KSI (killed or seriously injured) count model produce a meaningfully different operational ranking from the all-injury model on the Open Road Risk study area?

Method

The investigation was pre-registered as a two-part diagnostic.

Part A tested whether KSI reporting was stable enough across police forces and years to justify a national-scope KSI model. It used the same snapped collision source that feeds road_link_annual.parquet, after the Stage 2 snap-method and snap-score filters. KSI was defined as collisions with collision_severity equal to fatal or serious. For each force/year cell, the diagnostic calculated the KSI-to-all-injury ratio and flagged rows where the year-on-year ratio change exceeded ±20%. A practical sensitivity flag also required an absolute KSI count change of at least 25 collisions, but the pre-registered verdict was based on the ±20% ratio rule.

Part B would have compared a standalone KSI modelling path against the all-injury ranking, including a predictor-set comparison and EB-shrunk Jaccard against the current ranking. Part B was conditional on Part A clearing the reporting-consistency gate.

After the original Part A failed, a follow-up feasibility check confirmed that DfT severity-adjustment columns were available across the 2015–2024 study window. Part A was then rerun using an adjusted expected KSI target:

adjusted_expected_ksi = fatal_indicator + collision_adjusted_severity_serious

This adjusted target is probabilistic and aggregate-facing: it is an expected count, not a deterministic record-level KSI label. The same pre-registered ±20% year-on-year force/year ratio rule was retained unchanged.

Result

The original Part A diagnostic did not clear the reporting-consistency gate. Across 23 police forces and 2015–2024:

  • 450,991 all-injury collision records were retained after Stage 2 snap filters;
  • 99,559 were recorded KSI collisions;
  • 28 force/year rows were flagged by the pre-registered ±20% rule;
  • 26 of those also survived the practical sensitivity threshold.

The result was not mainly small-number volatility. The flag pattern was consistent with a mixture of the 2016–2019 CRaSH/COPA severity-reporting transition, Staffordshire-specific anomalies, and residual force-specific heterogeneity that did not collapse under tested restricted windows. The strict pre-registered verdict was: per-force handling required before KSI modelling is defensible.

The adjusted Part A rerun improved the result but did not reverse the verdict. Using adjusted expected KSI:

  • total adjusted expected KSI was 114,817.3;
  • the pre-registered flagged force/year rows fell from 28 to 15;
  • the practical-sensitivity flagged rows fell from 26 to 13;
  • residual flags remained concentrated in Staffordshire 2021–2024, the 2016–2017 CRaSH/COPA transition, and a scattered 2019 transition tail.

Because the adjusted rerun still failed the pre-registered consistency gate, Part B was not run. The standalone KSI atlas remains parked.

Note

The Staffordshire flags are now treated as a source-data issue rather than a KSI modelling result. See Staffordshire data quality for the separate investigation.

Limitations

  • The diagnostic tests reporting consistency, not whether KSI risk is substantively unimportant. It only says the available target is not yet stable enough for a defensible national-scope standalone model.
  • The DfT-adjusted target is an expected count, not an observed integer count. Using it in a link-year model would require explicit target-definition, validation, and EB-shrinkage decisions.
  • The KSI target is sparse at link-year level, so even a reporting-consistent target would need careful validation before operational ranking.
  • The adjusted rerun does not prove that KSI modelling is impossible. It narrows the revisit path: use a restricted window, treat Staffordshire as out of scope by default, and re-derive the modelling and EB assumptions for expected-count targets.

Revisit condition

Reopen the KSI atlas only if one of the following happens:

  • the adjusted Part A diagnostic is rerun on a restricted window, such as 2019–2023 excluding Staffordshire, and clears the pre-registered ±20% threshold, with explicit methodology for expected KSI at link-year grain and EB shrinkage; or
  • DfT publishes a corrected historical Staffordshire series for the acknowledged 2017–2023 under-reporting issue, and the full-window adjusted diagnostic then passes.

Related artefacts

  • reports/preregistration/ksi_diagnostic_preregistration.md
  • reports/ksi_reporting_consistency.md
  • reports/figures/ksi_reporting_consistency/
  • src/road_risk/diagnostics/ksi_reporting_consistency.py
  • todo/parked.md (KSI entry)
  • reports/dft_severity_adjustment_feasibility.md (adjusted rerun feasibility check)
  • reports/ksi_reporting_consistency_adjusted.md (adjusted Part A rerun)

Open Road Risk

 

Built with Quarto