User embeddings

Built user embeddings from user history using collaborative filtering. Designed a pipeline to resolve the cold-start problem for users with little or no history. Demonstrated the embeddings' predictive value in recommendation models.

2021-2022

User interests

Aggregates content labels to the user level. The project included filtering NSFW content, grouping labels, and decaying label weights. I designed and implemented it. The batch feature was built with an Airflow scheduler calling BigQuery scripts. The streaming feature was built with Flink. I further built a User-to-Subreddit mapping using Annoy approximate nearest neighbors.
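The filter, group, and decay steps can be illustrated with a minimal sketch. This is not the production code, which ran as BigQuery scripts and Flink jobs. The half-life, the NSFW label set, the label grouping, and the event shape `(user_id, label, timestamp)` are all hypothetical, and exponential decay is assumed as the decay function.

```python
from collections import defaultdict
from datetime import datetime, timedelta, timezone

# All three constants are hypothetical, chosen only for illustration.
HALF_LIFE_DAYS = 30.0
NSFW_LABELS = {"nsfw"}
LABEL_GROUPS = {"nba": "sports", "nfl": "sports"}


def decay_weight(age_days, half_life_days=HALF_LIFE_DAYS):
    """Exponential decay: an event loses half its weight every half-life."""
    return 0.5 ** (max(age_days, 0.0) / half_life_days)


def aggregate_user_interests(events, now):
    """Roll (user_id, label, timestamp) events up to per-user label scores."""
    scores = defaultdict(lambda: defaultdict(float))
    for user_id, label, ts in events:
        if label in NSFW_LABELS:
            continue  # filter NSFW labels
        group = LABEL_GROUPS.get(label, label)  # group fine-grained labels
        age_days = (now - ts).total_seconds() / 86400.0
        scores[user_id][group] += decay_weight(age_days)  # decay by age
    return {user: dict(groups) for user, groups in scores.items()}


if __name__ == "__main__":
    now = datetime(2022, 1, 31, tzinfo=timezone.utc)
    events = [
        ("u1", "nba", now),
        ("u1", "nfl", now - timedelta(days=30)),
        ("u1", "nsfw", now),
        ("u2", "cooking", now - timedelta(days=60)),
    ]
    print(aggregate_user_interests(events, now))
```

In this sketch a label seen today counts fully, one seen a half-life ago counts half, and grouped labels accumulate into a single score per user.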

2021-2022

Project Titan

Project Titan is a suite of software I wrote to predict sports outcomes. It is a large project with web scraping, modeling, and architecture components.

2022-2023