This section highlights selected technical reports that demonstrate applied data analysis, spatial modeling, and empirical evaluation across transportation, public policy, and urban systems. Each report integrates real-world datasets with rigorous quantitative methods to answer policy-relevant questions, emphasizing reproducible workflows, clear visualization, and interpretable results.
Geospatial data processing | Spatial joins & reprojection | Nearest-feature distance analysis | Threshold-based classification | Data aggregation & visualization | R Studio
This project integrates census-tract demographics with point-based facility data to compute distance metrics, classify vulnerable regions, and aggregate spatial results to the county level for comparative analysis.
Spatial feature engineering | Grid-based Aggregation | Kernel Density Estimation (KDE) | Nearest-neighbor distance metrics | Local Moran’s I clustering | Count regression (Poisson, Negative Binomial) | Spatial cross-validation | R Studio
This project builds and evaluates grid-based spatial prediction models for burglary risk by engineering distance- and cluster-based features, fitting count regression models, and benchmarking performance against a KDE baseline using spatial cross-validation.
Space–time panel data | Temporal feature engineering | Lag-based forecasting | Weather + census data integration | Fixed effects regression | Temporal train/test validation | Spatial & temporal error diagnostics | R (Quarto) | GitHub
This project builds and evaluates station-hour forecasting models for Philadelphia’s Indego bike share system. Using Q2 2024 trip data, I construct a complete space–time panel, integrate weather and neighborhood context, and compare model specifications—showing the largest performance gains from temporal lag features and diagnosing when/where prediction errors concentrate across the city.
Airport performance metrics | Passenger enplanements | Airport & air carrier operations | Hub classification (large vs. small) | Delay analysis (2013 vs. 2023) | Data visualization | R (Quarto)
This project evaluates changes in U.S. airport performance since 2013 by identifying “winners” and “losers” among large and small hub airports. Using Federal Aviation Administration Terminal Area Forecast data and Bureau of Transportation Statistics on-time performance records, I compare airports based on passenger enplanements, total airport operations, air carrier activity, and delay severity. The analysis highlights how scale, capacity constraints, and infrastructure conditions shape both airport growth and service reliability over time.