SAFE-T: Safety Algorithm Fairness Evaluation for Transportation

Prediction Error by Census Tract

Simulated

AI volume prediction error across Durham census tracts. Darker red indicates higher prediction errors, concentrated in low-income areas.

Accuracy by Income Quintile

Simulated

Prediction accuracy across income quintiles (Q1=poorest, Q5=richest). Shows mean absolute error in predicted vs actual pedestrian/cyclist counts.

Accuracy by Minority Percentage

Simulated

Prediction errors grouped by census tract racial composition. Areas with higher minority percentages show systematically worse accuracy.

Predicted vs Actual Volume

Simulated

Predicted volumes vs actual counts. Perfect predictions follow the diagonal. Systematic deviations reveal where bias occurs.

Prediction Errors by Quintile

Simulated

Each dot is one counter location. Dots left of zero indicate underprediction by the AI model.

Crash Distribution Map

Real Data

Actual vs predicted crash counts by census tract. Toggle between views to compare.

Model Performance by Quintile

Real Data

Binary classification (above/below within-quintile median) evaluated per income group. Lower scores in poorer quintiles indicate the model struggles to rank tracts within those areas.

Crashes Over Time

Real Data

Actual vs predicted crashes over time. Shows persistent over/underprediction patterns by income level.

Prediction Error by Income Quintile

Real Data

Relative prediction error by income level. Higher error in poorer quintiles indicates systematic bias in model accuracy.

Infrastructure Recommendations Map

Modeled

Safety project locations from AI vs need-based allocation. Shading shows danger scores; markers show projects. Toggle to compare.

Budget Allocation by Income

Modeled

AI-driven vs need-based safety budget allocation per income quintile.

Equity Comparison: AI vs Need-Based

Modeled

Four normalized equity metrics (0-100) comparing AI-driven and need-based allocation strategies.

Demand Distribution Map

Modeled

Suppressed, potential, and actual cycling/walking demand across Durham. High suppression (red) indicates latent demand AI tools miss.

Demand Suppression: Q1 vs Q5

Modeled

Demand suppression stages from potential to actual usage. Width represents trip volume at each stage. Q1 areas show severe drop-off.

AI Detection Capability

Modeled

Detection accuracy for suppressed demand. Naive AI fails; sophisticated AI achieves partial detection. Neither matches human expert baseline.

SAFE-T / Safety Algorithm Fairness Evaluation for Transportation

Durham Transportation Equity Audit

Durham Income Distribution by Census Tract

Prediction Error by Census Tract

Accuracy by Income Quintile

Accuracy by Minority Percentage

Predicted vs Actual Volume

Prediction Errors by Quintile

Crash Distribution Map

Model Performance by Quintile

Crashes Over Time

Prediction Error by Income Quintile

Infrastructure Recommendations Map

Budget Allocation by Income

Equity Comparison: AI vs Need-Based

Demand Distribution Map

Demand Suppression: Q1 vs Q5

AI Detection Capability