By Gaurab Chhetri - Research Project

CrashTransformer - Causal Summarization of Police Crash Narratives

CrashTransformer uses transformer models to convert long police crash narratives into concise, causality-focused summaries that help safety analysts and policymakers act faster. Paper submitted to different venues, will be made available shortly after acceptance.

CrashTransformer is a research project that turns unstructured police crash narratives into short, causal summaries. Transportation agencies collect thousands of reports each year. The key causes are often buried in long text. CrashTransformer helps surface the root cause, what failed, and why it happened, so teams can move from reading to action.

Highlights

Results snapshot: BERTScore F1 ≈ 0.884, ROUGE-L F1 ≈ 0.327, mean cosine similarity ≈ 0.647, compression ratio ≈ 0.743 on a curated benchmark of police narratives.

Problem and Motivation

Crash narratives are rich with causal signals, but they are hard to search and hard to compare across cities and time. Traditional analysis relies on structured fields and manual coding. That limits depth and scale. We built CrashTransformer to bridge this gap. The goal is to compress long narratives into precise causal statements that analysts and policymakers can use in minutes.

Method in Brief

Why this combo: ROUGE checks surface overlap, BERTScore checks meaning, cosine similarity checks sentence-level alignment, compression checks whether we distill the narrative without losing the cause.

Example Output

Input narrative
Unit 1 traveled northbound and turned right near a construction zone with cones blocking the right turn lane. Unit 2 had moved safely into the right lane. Unit 1 turned when unsafe and struck Unit 2 in the left front. No injuries reported.

CrashTransformer summary
Unit 1 caused the crash by turning right when it was unsafe, striking Unit 2 that was already in the right lane near the construction zone.

What the Results Mean

Use Cases

👉 Code: GitHub Repository
📄 Paper: manuscript submitted to different venues for review. Will be made available upon acceptance and completed presentation.

Acknowledgements

Thanks to collaborators at Texas State University AIT Lab 1 for guidance on transportation safety use cases, and to the open source communities behind Transformers, BART, and Llama for the tools that made this project possible.

References

Footnotes

  1. https://ait-lab.vercel.app