User interests

Aggregates content labels to the user level. The project included filtering NSFW content, grouping labels, and decaying label weights over time. I designed and implemented it. The batch feature was built with an Airflow scheduler calling BigQuery scripts; the streaming feature was built with Flink. I also built a User-to-Subreddit mapping using Annoy approximate nearest neighbors.

2021-2022
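
A minimal sketch of the user-level aggregation described above, assuming exponential decay by event age. The label names, the grouping map, the NSFW set, and the 30-day half-life are hypothetical placeholders, not values from the project.

```python
from collections import defaultdict

# Hypothetical taxonomy and settings (assumptions, not from the project).
LABEL_GROUPS = {"nba": "sports", "nhl": "sports", "python": "programming"}
NSFW_LABELS = {"nsfw"}
HALF_LIFE_DAYS = 30.0


def user_interests(events, now_day):
    """Aggregate (user_id, label, day) events into {user_id: {group: score}}.

    NSFW labels are dropped, labels are mapped to groups, and each event
    contributes 0.5 ** (age / half_life) so older events count for less.
    """
    scores = defaultdict(lambda: defaultdict(float))
    for user_id, label, day in events:
        if label in NSFW_LABELS:
            continue  # filter NSFW content
        group = LABEL_GROUPS.get(label, label)  # ungrouped labels keep their name
        age_days = now_day - day
        scores[user_id][group] += 0.5 ** (age_days / HALF_LIFE_DAYS)
    return {user: dict(groups) for user, groups in scores.items()}
```

An event from today contributes 1.0 and an event one half-life old contributes 0.5, so two such "sports" events for the same user sum to 1.5.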

Data work

Aggregated player data and joined it with marketing data drawn from many sources and systems.

2016-2018

User covariates

Covariates are user variables that we control for when analyzing the impact of A/B tests. I identified impactful covariates and wrote a script to compute them.

2021-2022
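
To illustrate what controlling for a covariate can look like, here is a small sketch of a linear (CUPED-style) adjustment. This is one common approach and an assumption on my part here, not necessarily the method used in the project; the function name is hypothetical.

```python
from statistics import mean


def adjust_for_covariate(y, x):
    """Return the metric y with the linear effect of covariate x removed.

    Computes theta = cov(x, y) / var(x) and returns
    y_i - theta * (x_i - mean(x)), which keeps the mean of y unchanged
    while reducing variance that the covariate explains.
    """
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var = sum((a - mx) ** 2 for a in x)
    theta = cov / var if var else 0.0  # constant covariate: no adjustment
    return [b - theta * (a - mx) for a, b in zip(x, y)]
```

When the covariate explains the metric perfectly, the adjusted values collapse to the metric's mean, which is the extreme case of variance reduction.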

Payments database

Implemented a server for CRUD operations (and export/reverse) on a payment table.

2018-2021

Slot DB

A database of slot machines.

2025

Project Titan

Project Titan is a suite of software I wrote to make sports predictions. It is a large project with web scraping, modeling, and architecture components.

2022-2023

StacksByStacks

This is a now-defunct website I made to track predictions for NHL games made by experts on the internet. The front end was built with Angular and used a PHP handler to access a MySQL database, which was populated by a library of Python scrapers.

2020

Minesweeper 6

Designed a database and background program for calculating and storing Minesweeper probabilities. A heartbeat loads new work done by the background program.

2024
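
A rough sketch of how Minesweeper probabilities can be computed, assuming brute-force enumeration over a small set of unknown cells. The function name and input shape are hypothetical, and the sketch treats every consistent mine layout as equally likely (it ignores the total remaining mine count), so it is a simplification rather than the project's actual method.

```python
from itertools import product


def mine_probabilities(unknown, constraints):
    """Estimate the probability that each unknown cell holds a mine.

    unknown: list of cell ids.
    constraints: list of (cells, mine_count) pairs, one per revealed number.
    Enumerates every 0/1 assignment and keeps those satisfying all constraints.
    """
    counts = {cell: 0 for cell in unknown}
    total = 0
    for bits in product((0, 1), repeat=len(unknown)):
        assignment = dict(zip(unknown, bits))
        if all(sum(assignment[c] for c in cells) == n for cells, n in constraints):
            total += 1
            for cell in unknown:
                counts[cell] += assignment[cell]
    if total == 0:
        raise ValueError("constraints are inconsistent")
    return {cell: counts[cell] / total for cell in unknown}
```

Enumeration is exponential in the number of unknown cells, which is one reason to run it in a background program and store the results.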

Scraper 11

Switched storage from SQLite to a file directory. Added compression.

2024
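
A minimal sketch of a compressed file-directory store, assuming gzip compression and file names derived from a SHA-256 hash of the URL. The function names, the two-character shard directories, and the choice of gzip are illustrative assumptions, not details from the project.

```python
import gzip
import hashlib
from pathlib import Path


def _path_for(root, url):
    """Map a URL to a file path under root, sharded by hash prefix."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()
    return Path(root) / digest[:2] / (digest + ".gz")


def save_page(root, url, body):
    """Write the scraped bytes for url as a gzip-compressed file."""
    path = _path_for(root, url)
    path.parent.mkdir(parents=True, exist_ok=True)
    with gzip.open(path, "wb") as f:
        f.write(body)
    return path


def load_page(root, url):
    """Read back and decompress the bytes previously saved for url."""
    with gzip.open(_path_for(root, url), "rb") as f:
        return f.read()
```

Sharding by hash prefix keeps any single directory from growing too large, and each page can be read or replaced without touching a shared database file.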