![]() |
![]() |
Engineered scalable user profiling pipeline that aggregated content labels to the user level—incorporating NSFW filtering, label grouping, and temporal decay—delivered via batch (Airflow/BigQuery) and streaming (Flink) systems, with downstream user-to-subreddit mappings powered by approximate nearest neighbors.
2021-2022Integrated and reconciled player and marketing data across multiple source systems, improving data completeness and reliability for downstream analytics.
2016-2018Identified key user-level covariates and built tooling to compute them, improving the rigor and interpretability of A/B test impact analyses.
2021-2022Implemented a service to support CRUD operations, exports, and reversals for a payments table.
2018-2021Project Titan is a set of software I wrote to make predictions for sports. This is a large project with web scraping, modeling, and architecture components.
2023-2022LinkThis is a now-defunct website I’ve made to track predictions made for NHL games, by experts on the internet. The front-end is built with Angular, which uses a PHP handler to access the MySQL database, which gets populated with a library of Python scrapers.
2020Design database and background program for calculating and storing minesweeper probabilities. A heartbeat will load new work done by the background program.
2024Link