User embeddings

Built user embeddings using Collaborative Filtering and user history. Designed pipeline to resolve cold start problem. Proved predictive value in recommendation models.

2021-2022Link

User interests

Aggregates content labels to user level. Project included filtering NSFW, grouping labels, and decaying. I designed and implemented. Batch feature was built with Airflow scheduler calling BigQuery scripts. Streaming feature was built with Flink. Further built User-to-Subreddit mapping using Annoy approximate nearest neighbors.

2021-2022Link