How to evaluate a Content-based Recommender System

Tags:

recommendation-engine

I'm building a content-based movie recommender system. It's simple, just let a user enter a movie title and the system will find a movie which has the most similar features.

After calculating similarity and sorting the scores in descending order, I find the corresponding movies of 5 highest similarity scores and return to users.

Everything works well till now when I want to evaluate the accuracy of the system. Some formulas that I found on Google just evaluate the accuracy based on rating values (comparing predicted rating and actual rating like RMSE). I did not change similarity score into rating (scale from 1 to 5) so I couldn't apply any formula.

Can you suggest any way to convert similarity score into predicted rating so that I can apply RMSE then? Or is there any idea of solution to this problem ?

684

asked May 29 '11 12:05

user691223

1 Answers

Do you have any ground truth? For instance, do you have information about the movies that a user has liked/seen/bought in the past? It doesn't have to be a rating but in order to evaluate the recommendation you need to know some information about the user's preferences.

If you do, then there are other ways to measure the accuracy besides RMSE. RMSE is used when we predict ratings (as you said is the error between the real rating and the prediction) but in your case you are generating top N recommendations. In that case you can use precision and recall to evaluate your recommendations. They are very used in Information Retrieval applications (see Wikipedia) and they are also very common in Recommender Systems. You can also compute F1 metric which is an harmonic mean of precision and recall. You'll see they are very simple formulas and easy enough to implement.

"Evaluating Recommendar Systems" by Guy Shani is a very good paper on how to evaluate recommender systems and will give you a good insight into all this. You can find the paper here.

132

answered Oct 28 '22 12:10

MsLovelace

Related questions
                            
                                Evaluating the LightFM Recommendation Model
                            
                                Methods for Lazy Initialization with properties
                            
                                Building a Collaborative filtering / Recommendation System
                            
                                Developing a web application in python with neo4j
                            
                                Spark MLlib - Collaborative Filtering Implicit Feed
                            
                                Recommendation engine without ratings
                            
                                Appending pandas DataFrame with MultiIndex with data containing new labels, but preserving the integer positions of the old MultiIndex
                            
                                Architecture & Essential Components of StumbleUpon's Recommendation Engine
                            
                                Recommender: Log user actions & datamine it – good solution [closed]
                            
                                How to implement a Digg-like algorithm?
                            
                                What do I need in a database for "Customers Who Bought This Item Also Bought"?
                            
                                Multikey Multivalue Non Deterministic python dictionary
                            
                                Google Prediction API vs Graph Databases for Generated Recommendations?
                            
                                Neural Network Recommendation Engine [closed]
                            
                                Mahout Plugin for ruby on rails
                            
                                Writing a basic recommendation engine [closed]
                            
                                Recommendation Engines for Java applications [closed]
                            
                                How to use mllib.recommendation if the user ids are string instead of contiguous integers?
                            
                                Lightfm: handling user and item cold-start
                            
                                Get Google Analytics "Visitors Flow" data from API

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With