Home AI News Building Recommendation Systems with TensorFlow: Content-based & Collaborative Filtering

Building Recommendation Systems with TensorFlow: Content-based & Collaborative Filtering

Introduction
Traditional Approaches to Recommendation Systems
- 2.1 Content-Based Filtering
  - 2.1.1 Item Features
  - 2.1.2 Recommendations based on Previous Actions
- 2.2 Collaborative Filtering
  - 2.2.1 Similarities between Users and Items
  - 2.2.2 Serendipitous Recommendations
Implementing Collaborative Filtering
- 3.1 Assigning Values to Users and Movies
- 3.2 Embeddings for Users and Movies
- 3.3 Optimization Objective
  - 3.3.1 Minimizing Squared Difference
  - 3.3.2 SGD vs WALS
- 3.4 Handling Unobserved Entries
  - 3.4.1 SVD solution
  - 3.4.2 Weighted Matrix Factorization
Conclusion
References

Introduction

Welcome back to our video series on "Building recommendation systems with TensorFlow." In our previous video, we provided an overview of recommendation systems and introduced several open-source projects by Google that can help You build powerful recommender systems. In this video, we will Delve into the concepts of content-based filtering and collaborative filtering, which are traditional approaches in recommendation systems literature. These concepts form the foundation for more advanced models that we will explore in future episodes.

Traditional Approaches to Recommendation Systems

Recommendation systems can be built using various traditional approaches. Two common approaches are content-based filtering and collaborative filtering.

Content-Based Filtering

Content-based filtering relies on item features to recommend similar items to users based on their previous actions or explicit feedback. For example, if a user installs a health app, we can recommend other health-related apps to them because these apps are similar to the one they installed.

Item Features

To identify similarities between items, item features are used. Each row in the feature matrix represents an app, while each column represents a specific feature of the app. By analyzing these features, we can determine the similarity between items.

Recommendations based on Previous Actions

Content-based filtering leverages a user's previous actions or explicit feedback to recommend items. By considering a user's preferences or interests based on their actions, we can suggest similar items that Align with their preferences.

Collaborative Filtering

Collaborative filtering takes into account the similarities between users and items to provide recommendations. Unlike content-based filtering, collaborative filtering considers the preferences of similar users to recommend items to a specific user.

Similarities between Users and Items

Collaborative filtering allows for serendipitous recommendations, recommending an item to a user based on the interests of a similar user. By analyzing the feedback matrix, which represents users' preferences for various movies, we can identify users with similar preferences and recommend Relevant movies to them.

Serendipitous Recommendations

To illustrate collaborative filtering, consider the feedback matrix of four users and five movies. Each row represents a user, and each column represents a movie. The green checkmark indicates that a user has watched a particular movie. By analyzing this matrix, we can recommend movies to users based on the preferences of other similar users.

Implementing Collaborative Filtering

Now let's dive into the implementation of collaborative filtering in recommendation systems.

Assigning Values to Users and Movies

To implement collaborative filtering, we assign values between -1 and 1 to represent users' interest levels for specific movie genres. For example, a value of -1 indicates a high level of interest in children's movies, while 1 means no interest at all. Similarly, we assign values to each movie to indicate whether it is suitable for children or not.

Embeddings for Users and Movies

We can represent users and movies using embeddings, which are low-dimensional representations that capture their preferences and characteristics. These embeddings serve as additional Dimensions to consider when making recommendations. In practice, these embeddings are of higher dimensions, but for visualization purposes, we will stick to two dimensions.

Optimization Objective

The optimization objective in collaborative filtering is to minimize the squared difference between the feedback label and the predictive feedback. This objective is achieved by optimizing the user and item embeddings.

Minimizing Squared Difference

The goal is to ensure that the predictive feedback matrix, obtained by multiplying the user and item embeddings, is as close to the ground truth feedback matrix as possible. By minimizing the squared difference between these matrices, we improve the accuracy of the recommendations.

SGD vs WALS

There are two methods commonly used to optimize the embeddings: Stochastic Gradient Descent (SGD) and Weighted Alternating Least Squares (WALS). SGD is a generic optimization algorithm commonly used in training neural networks, while WALS is specific to this recommendation system problem. WALS usually converges faster than SGD but has limitations in handling other loss functions.

Handling Unobserved Entries

It is essential to consider unobserved entries when implementing collaborative filtering. Two approaches can be used to handle these unobserved entries: SVD solution and weighted matrix factorization.

SVD solution

One approach is to treat all unobserved entries as zeros and use Singular Value Decomposition (SVD) to solve the optimization problem. However, SVD may not perform well in practice due to the sparsity of the feedback matrix.

Weighted Matrix Factorization

A better approach is weighted matrix factorization. In this approach, unobserved entries are still treated as zeros, but the unobserved part of the objective function is scaled so that it is not overweighted. This approach improves the generalization capabilities of the model.

Conclusion

In this video, we explored the concepts of content-based filtering and collaborative filtering in recommendation systems. Content-based filtering leverages item features to recommend similar items, while collaborative filtering considers the similarities between users and items to provide personalized recommendations. We also discussed the implementation of collaborative filtering, including assigning values to users and movies, using embeddings, optimizing the objective function, and handling unobserved entries. In the next video, we will introduce TensorFlow recommenders, which simplify the process of building recommendation models.