User interests

Engineered a scalable user-profiling pipeline that aggregated content labels to the user level, incorporating NSFW filtering, label grouping, and temporal decay. Delivered it via batch (Airflow/BigQuery) and streaming (Flink) systems, with downstream user-to-subreddit mappings powered by approximate nearest-neighbor search.

2021-2022
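As a rough illustration of aggregating content labels to the user level with temporal decay and NSFW filtering, here is a minimal sketch. It is not the production pipeline: the function name, the exponential half-life form of the decay, the 30-day default, and the event shape are all assumptions made for the example.

```python
from collections import defaultdict
from datetime import datetime, timedelta, timezone


def decayed_label_scores(events, now, half_life_days=30.0, nsfw_labels=frozenset()):
    """Aggregate one user's (label, timestamp) events into decayed label scores.

    Hypothetical helper: each event contributes 0.5 ** (age / half_life),
    so an event exactly one half-life old counts half as much as a new one.
    """
    scores = defaultdict(float)
    for label, ts in events:
        if label in nsfw_labels:
            continue  # assumed filtering rule: drop NSFW labels entirely
        age_days = (now - ts).total_seconds() / 86400.0
        scores[label] += 0.5 ** (age_days / half_life_days)
    return dict(scores)


now = datetime(2022, 1, 31, tzinfo=timezone.utc)
events = [
    ("hockey", now),
    ("hockey", now - timedelta(days=30)),
    ("adult", now),
]
scores = decayed_label_scores(events, now, nsfw_labels=frozenset({"adult"}))
```

The same per-event weighting could in principle be expressed in a batch SQL query or a streaming operator; the sketch only shows the arithmetic.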

Data engineering

Integrated and reconciled player and marketing data across multiple source systems, improving data completeness and reliability for downstream analytics.

2016-2018

User covariates

Identified key user-level covariates and built tooling to compute them, improving the rigor and interpretability of A/B test impact analyses.

2021-2022

Payments database

Implemented a service to support CRUD operations, exports, and reversals for a payments table.

2018-2021

Slot DB

A database of slot machines.

2025

Project Titan

Project Titan is a suite of software I wrote to make sports predictions. It is a large project with web scraping, modeling, and architecture components.

2022-2023

StacksByStacks

A now-defunct website I built to track expert predictions for NHL games. The front end is built with Angular and calls a PHP handler to access a MySQL database, which is populated by a library of Python scrapers.

2020

Minesweeper 6

Designed a database and background program for calculating and storing Minesweeper probabilities. A heartbeat loads new work completed by the background program.

2024
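For a sense of what calculating Minesweeper probabilities involves, here is a minimal brute-force sketch. It is not the project's actual method: it assumes a known total mine count, treats every placement consistent with the revealed numbers as equally likely, and is only practical for very small boards. The function name and the constraint format are hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def mine_probabilities(hidden, total_mines, constraints):
    """Return the exact mine probability for each hidden cell by enumeration.

    hidden: list of hidden cell ids.
    constraints: list of (set_of_cells, mine_count) pairs, one per revealed
    number, where set_of_cells are that number's hidden neighbours.
    """
    counts = {cell: 0 for cell in hidden}
    n_valid = 0
    for mines in combinations(hidden, total_mines):
        mine_set = set(mines)
        # Keep only placements that satisfy every revealed number.
        if all(len(mine_set & cells) == k for cells, k in constraints):
            n_valid += 1
            for cell in mines:
                counts[cell] += 1
    if n_valid == 0:
        raise ValueError("constraints are inconsistent")
    return {cell: Fraction(c, n_valid) for cell, c in counts.items()}


# One mine among three hidden cells; a revealed "1" touches cells a and b.
probs = mine_probabilities(["a", "b", "c"], 1, [({"a", "b"}, 1)])
```

Because enumeration grows combinatorially, precomputing results in a background program and storing them, as the entry describes, is a natural fit.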

Scraper 11

Switched storage from SQLite to a file directory and added compression.

2024
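One way a file-directory store with compression might look is sketched below. The layout is an assumption, not the project's actual one: the hash-based file names, the two-character shard directories, the gzip format, and the function names are all chosen for illustration.

```python
import gzip
import hashlib
from pathlib import Path


def page_path(root, url):
    """Map a URL to a file path; hypothetical layout sharded by hash prefix."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()
    return Path(root) / digest[:2] / f"{digest}.html.gz"


def save_page(root, url, html):
    """Write the scraped page as a gzip-compressed text file."""
    path = page_path(root, url)
    path.parent.mkdir(parents=True, exist_ok=True)
    with gzip.open(path, "wt", encoding="utf-8") as f:
        f.write(html)
    return path


def load_page(root, url):
    """Read a previously saved page back as text."""
    with gzip.open(page_path(root, url), "rt", encoding="utf-8") as f:
        return f.read()
```

Sharding by the first two hex characters keeps any single directory from accumulating every file, which some filesystems handle poorly.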