In late February 2024, the San Francisco Chronicle published Fast and Fatal, an investigation into police car chases across the U.S. The investigation centered on a dataset reporters built of people killed in pursuits from 2017 through 2022.
This repository houses our public-facing dataset, which we invite researchers, other journalists and anyone else interested in fatal police pursuits to download and explore.
The fields have the following definitions:
Column name | Data type | Description |
---|---|---|
unique_id |
integer | A unique number for each person in the data. |
data_source |
single select. options: nhtsa , sfchronicle , nhtsa_sfchronicle |
The source of person and fatality information. sfchronicle refers to our detailed dataset sourced from news reports, public records and datasets compiled by other research organizations. nhtsa refers to NHTSA's Fatality Analysis Reporting System, specifically if the death is reported as stemming from a "police pursuit-involved" fatal crash. nhtsa_sfchronicle means the death was included in both our detailed dataset and NHTSA's FARS pursuit fatality data. |
year |
integer | The year the pursuit occurred. |
date |
date | The date the pursuit occurred. If the pursuit takes place over multiple days, this refers to the date of the crash or end of the pursuit. |
number_killed |
integer | Number of people confirmed killed (excluding any person fatally shot) as a result of the pursuit. |
age |
float | Age of the person killed. A blank field indicates age is unknown. |
gender |
single select. options: male , female , nonbinary , unknown |
The gender of the person killed. |
race |
multiple select. options: black , white , latino , asian , other , unknown |
The perceived race and/or ethnicity of the person killed. Race has been included when reporters were able to identify a person's likely race based on photos, news reports and public records, and/or if the person's race and ethnicity was included in NHTSA's FARS data. |
race_source |
multiple select. options: news reports , nhtsa , photo , original data , other |
The source of the perceived race and ethnicity information. |
county |
string | The county where the fatality or fatal crash occurred. |
state |
single select. options: 50 states plus D.C. | The two-letter abbreviation for the state where the fatality or fatal crash occurred. |
lat |
float | The approximate latitude of the fatality or fatal crash. |
long |
float | The approximate longitude of the fatality or fatal crash. |
name |
string | The name of the person killed. |
initial_reason |
single select. options: traffic stop , suspected nonviolent , suspected violent , domestic incident , minor/no crime , other , unknown |
The alleged incident that touched off officers' pursuit. Even if a different crime is later confirmed (such as stolen vehicle) or it's confirmed that no crime has actually occurred, this column specifies the incident that touched off the chase according to news reports and other sources. |
person_role |
single select. options: driver , passenger , bystander , officer , unclear , other |
The role of the person killed as described in news reports and other records. Driver refers to the driver of the car being pursued; passenger refers to a passenger in the car being pursued. Bystander refers to a person (on foot or in another car) that was killed but not being pursued. |
main_agency |
string | The main agency engaged for the pursuit if included in our detailed dataset, as described in news reports. If multiple agencies gave chase, this value defaults to the agency chasing closest to the fatality or fatal crash. |
news_urls |
url(s) | One or more links to a relevant story about the pursuit, if included in our detailed dataset. |
city |
string | The city where the fatality or fatal crash occurred, if included in our detailed dataset. |
zip |
string | The ZIP code where the fatality or fatal crash occurred if included in our detailed dataset. Note: ZIP code was stored in a string format to avoid the deletion of leading zeroes. |
centroid_geo |
binary 0 , 1 |
If 1, indicates the coordinates of this crash are a) the centroid of the ZIP code where it occurred and not exact coordinates. If 0, indicates coordinates of this crash were entered by researchers and should be accurate to the approximate location of crash. |
in_fars_pursuit |
binary 0 , 1 |
If 1, indicates the death is included in NHTSA's Fatality Analysis Reporting System as stemming from a "police pursuit-involved" fatal crash. If 0, indicates reporters could not find this death in FARS pursuit-involved fatal crash data. |
To build the Chronicle’s national dataset of at least 3,336 people killed in police vehicle pursuits from 2017 through 2022, we used information from three primary sources: the federal government, private research organizations and our reporting.
While no government agency counts every police pursuit death, the National Highway Traffic Safety Administration (NHTSA) comes closest. We used data published by NHTSA via its Fatality Analysis Reporting system (FARS) to produce a list of people killed in police pursuits recorded by the federal agency. Specifically, we drew from the FARS global person, vehicle and accident files, as well as its auxiliary accident file.
In a separate database, we gathered, cleaned and analyzed information about pursuit deaths from research organizations Mapping Police Violence, Fatal Encounters and IncarcerNation.com, manually reviewing each row entered by researchers to ensure accuracy.
To obtain a more robust count, we identified hundreds of additional deaths by searching for news reports on LexisNexis and Google using targeted search terms such as “pursuit,” “chase” and “death.” We also examined lawsuits and government documents obtained via public records requests.
The Chronicle followed the Police Executive Research Forum's definition of a pursuit: "(1) an active attempt by the officer to apprehend the occupant of the vehicle and (2) the driver refusing to submit to the detention and taking actions to avoid apprehension." We removed hundreds of vehicle-related deaths that did not meet this definition, such as cases in which officers struck people while responding to emergency calls unrelated to pursuits. We excluded fatalities identified by other research organizations if a) we could not find news reports or other public records indicating a pursuit occurred, and b) we could not find a match in NHTSA’s “pursuit-involved” fatal crash data in FARS. We included several dozen deaths tied to pursuits but not caused directly by a crash, such as when drivers or passengers drowned in a body of water following a chase. However, we excluded people fatally shot by police during or after a pursuit, as those deaths fall into the category of a fatal police shooting.
The next step was to create one dataset, while avoiding the duplication of cases.
We merged the FARS data with the information from the research groups and our own reporting — comparing them multiple times using different combinations of date, county and state variables. When a person in FARS matched multiple people in the non-FARS data, we assigned a quality index to each match based on the demographic similarities between the two people (close in age, same gender, etc.), and filtered for the top-quality match in each case.
In dozens of cases, the best-quality (or only) match was imperfect; FARS data listed a death as occurring in a bordering county, for example, or on a date that was up to seven days before or after a similar death record in our data. Most of these cases listed the same number of people killed with consistent genders and the same or similar ages. Given this, and the relative unlikelihood of multiple fatal pursuits occurring within several days of each other in all but the most populous areas, we concluded most of them were likely actual matches and the inconsistencies were due to minor issues with data entry by government analysts, researchers or ourselves.
Still, we manually reviewed each imperfect match and identified several that we could not confirm were the same pursuit or person. In these cases we considered them separate pursuits.
To calculate the minimum undercount of pursuit-related fatalities by NHTSA, we produced a list of deaths from 2017 through 2021 that were included in the data from research groups and our reporting but missing from FARS' count of pursuit-involved fatalities. As of late February, when this project was published, FARS data for 2022 was not publicly available. We manually reviewed news reports associated with each case to determine the most likely reason the death was excluded from FARS’ pursuit-involved death data.
To understand why police initiated pursuits that ended with fatalities and who died, we relied on the subset of our pursuit fatalities data — roughly two-thirds of the total — that included additional details about the causes and circumstances of each chase and the people involved, gathered from Mapping Police Violence, Fatal Encounters and IncarcerNation.com, plus news reports and public records.
Within this subset, we categorized people killed in pursuits by their role (driver, passenger, officer or bystander) and the alleged violation that led to the fatal pursuit (traffic stop, suspected nonviolent crime, suspected violent crime, domestic incident, minor or no crime or unclear). We manually reviewed each row of data multiple times to ensure that we and other researchers accurately recorded public information.
Limitations:
This dataset would not have been possible without the work of many other researchers, most notably D. Brian Burghart, the founder of Fatal Encounters, and FE's team of researchers and volunteers. We would also like to thank, in no particular order: Geoffrey Alpert, Tom Gleason, John P. Gross, IncarcerNation.com, Abdul Nasser Rad and the Mapping Police Violence team, Albert L. Liebno, Jr., Lisa Pickoff-White, Lisa Fernandez, NHTSA officials, Alexis Piquero, and others. Thanks to Janie Haseman at Hearst DevHub for helping review and improve our dataset in countless ways.
Finally, we wanted to thank the thousands of journalists who covered the fatal pursuits included in our data. Without their stories, hundreds of chase-related deaths would have remained hidden.