Apache Spark ALS collaborative filtering results. They don't make sense

Tags:

I wanted to try out Spark for collaborative filtering using MLlib as explained in this tutorial: https://databricks-training.s3.amazonaws.com/movie-recommendation-with-mllib.html The algorithm is based on the paper "Collaborative Filtering for Implicit Feedback Datasets", doing matrix factorization.

Everything is up and running using the 10 million Movielens data set. The data set it split into 80% training 10% test and 10% validation.

RMSE Baseline: 1.060505464225402
RMSE (train) = 0.7697248827452756
RMSE (validation) = 0.8057135933012889 for the model trained with rank = 24, lambda = 0.1, and Iterations = 10.
The best model improves the baseline by 23.94%.

Which are values similar to the tutorial, although with different training parameters.

I tried running the algorithm several times and always got recommendations that don't make any sense to me. Even rating only kids movies I get the following results:

For ratings:

personal rating: Toy Story (1995) rating: 4.0
personal rating: Jungle Book, The (1994) rating: 5.0
personal rating: Lion King, The (1994) rating: 5.0
personal rating: Mary Poppins (1964) rating: 4.0
personal rating: Alice in Wonderland (1951) rating: 5.0

Results:

Movies recommended for you:

Life of Oharu, The (Saikaku ichidai onna) (1952)
More (1998)
Who's Singin' Over There? (a.k.a. Who Sings Over There) (Ko to tamo peva) (1980)
Sundays and Cybele (Dimanches de Ville d'Avray, Les) (1962)
Blue Light, The (Das Blaue Licht) (1932)
Times of Harvey Milk, The (1984)
Please Vote for Me (2007)
Man Who Planted Trees, The (Homme qui plantait des arbres, L') (1987)
Shawshank Redemption, The (1994)
Only Yesterday (Omohide poro poro) (1991)

Which except for Only Yesterday doesn't seem to make any sense.

If there is anyone out there who knows how to interpret those results or get better ones I would really appreciate you sharing your knowledge.

Best regards

EDIT:

As suggested I trained another model with more factors:

Baseline error: 1.0587417035872992
RMSE (train) = 0.7679883378412548
RMSE (validation) = 0.8070339258049574 for the model trained with rank = 100, lambda = 0.1, and numIter = 10.

And different personal ratings:

personal rating: Star Wars: Episode VI - Return of the Jedi (1983) rating: 5.0
personal rating: Mission: Impossible (1996) rating: 4.0
personal rating: Die Hard: With a Vengeance (1995) rating: 4.0
personal rating: Batman Forever (1995) rating: 5.0
personal rating: Men in Black (1997) rating: 4.0
personal rating: Terminator 2: Judgment Day (1991) rating: 4.0
personal rating: Top Gun (1986) rating: 4.0
personal rating: Star Wars: Episode V - The Empire Strikes Back (1980) rating: 3.0
personal rating: Alien (1979) rating: 4.0

The recommended movies are:

Movies recommended for you:

Carmen (1983)
Silent Light (Stellet licht) (2007)
Jesus (1979)
Life of Oharu, The (Saikaku ichidai onna) (1952)
Heart of America (2003)
For the Bible Tells Me So (2007)
More (1998)
Legend of Leigh Bowery, The (2002)
Funeral, The (Ososhiki) (1984)
Longshots, The (2008)

Not one useful result.

EDIT2: With using the implicit feedback method, I get much better results! With the same action movies as above the recommendations are:

Movies recommended for you:

Star Wars: Episode IV - A New Hope (a.k.a. Star Wars) (1977)
Terminator, The (1984)
Raiders of the Lost Ark (Indiana Jones and the Raiders of the Lost Ark) (1981)
Die Hard (1988)
Godfather, The (1972)
Aliens (1986)
Rock, The (1996)
Independence Day (a.k.a. ID4) (1996)
Star Trek II: The Wrath of Khan (1982)
GoldenEye (1995)

That's more what I expected! The question is why the explicit version is so-so-so bad

669

asked Oct 06 '14 09:10

MachineLearner

1 Answers

Note that the code you are running does not use implicit feedback, and is not quite the algorithm you refer to. Just make sure you are not using ALS.trainImplicit. You may need a different, lambda and rank. RMSE of 0.88 is "OK" for this data set; I am not clear that the example's values are optimal or just the one that the toy test produced. You use a different value still here. Maybe it's just not optimal yet.

It could even be stuff like bugs in the ALS implementation fixed since. Try comparing to another implementation of ALS if you can.

I always try to resist rationalizing the recommendations since our brains inevitably find some explanation even for random recommendations. But, hey, I can say that you did not get action, horror, crime drama, thrillers here. I find that kids movies go hand in hand with taste for arty movies, since, the kind of person who filled out their tastes for MovieLens way back when and rated kids movies were not actually kids, but parents, and maybe software engineer types old enough to have kids do tend to watch these sorts of foreign films you see.

180

answered Sep 19 '22 06:09

Sean Owen

Related questions
                            
                                Keras Embedding Layer: keep zero-padded values as zeros
                            
                                Do I need to split the data for isolation forest?
                            
                                sklearn utils compute_class_weight function for large dataset
                            
                                Diminishing the impact of one variable over output in a regression model
                            
                                NotImplementedError: Learning rate schedule must override get_config
                            
                                Loss with custom backward function in PyTorch - exploding loss in simple MSE example
                            
                                Why is KNN so much faster with cosine distance than Euclidean distance?
                            
                                what is the best way to generate fake data for classification problem?
                            
                                Interactive Decision Tree Classifier
                            
                                questions on clustering methods
                            
                                Text Classification into Categories
                            
                                Machine learning task: what tool to use?
                            
                                Is anyone implementing an interface for Eureqa in R?
                            
                                Representation and a good similarity measure between Tweets for topic detection
                            
                                Numeric example of the Expectation Maximization Algorithm [duplicate]
                            
                                Apply PMML predictor model in python
                            
                                Randomized stratified k-fold cross-validation in scikit-learn?
                            
                                Reinforcement Learning
                            
                                How can I parse an arff file without using external libraries in Python
                            
                                error tuning SVM in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Apache Spark ALS collaborative filtering results. They don't make sense

Tags:

machine-learning

apache-spark

collaborative-filtering

matrix-factorization

MachineLearner

People also ask

1 Answers

Sean Owen

Recent Activity

Donate For Us