Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Why is the Netflix Prize so challenging? [closed]

Having just read the recent article in Wired, I'm curious: what is it about the Netflix Prize that's so challenging? I mean this in the sincerest way possible, I'm just curious about the difficulties posed by the contest. Are most recommendation engines in general this hard to improve? If so, why is that? Or, is Netflix unusually difficult to improve, and if this is the case, what's special about Netflix that makes this so much more challenging than, say, Amazon?

like image 951
Alex Basson Avatar asked Jun 19 '09 11:06

Alex Basson


People also ask

What was the Netflix Prize Why did Netflix's actions raise privacy concern?

Netflix is canceling its second $1 million Netflix Prize to settle a legal challenge that it breached customer privacy as part of the first contest's race for a better movie-recommendation engine. Friday's announcement came five months after Netflix had announced a successor to its algorithm-improvement contest.

Why did Netflix cancel the competition?

Cancelled sequel On March 12, 2010, Netflix announced that it would not pursue a second Prize competition that it had announced the previous August. The decision was in response to a lawsuit and Federal Trade Commission privacy concerns.

What was the cash prize for Netflix?

Up for grabs is a $4.56 million reward — which Netflix says is the largest cash prize in reality TV history. “The stakes are high, but in this game the worst fate is going home empty-handed,” the company said in a statement Tuesday.

Who won Netflix competition?

The Netflix Prize: How a $1 Million Contest Changed Binge-Watching Forever. "We need to go win a million dollars." Lester Mackey was just a senior computer science major at Princeton when a friend burst into his dorm room in a hysterical fit of excitement.


1 Answers

Recommender systems suffer from problems that are hard to fix:

  • Cold start - In a new system or with a new user, there isn't enough data to create an accurate statistical model for a recommendation.
  • Rating bias - If you base recommendations on user ratings, users that rate often sway the results toward their taste. If you're the type of person that doesn't like the extra step of rating, it's possible people with similar taste don't like rating either so their opinions are excluded from recommendations.
  • Items that are not rated are less likely to be rated - if you select, and therefore rate, items based on their ratings, items that aren't rated are less visible and will have a hard time getting the ratings they need to affect recommendations. In the other direction, popular items have more visibility, are rated more often, and therefore play a larger part in recommendations.
  • Temporal bias - Users' ratings change with time. With long-term changes, you can compensate by adding a time element to your recommendations. Short-term changes are harder to fix. After a Chuck Norris marathon, you may be more likely to give action movies high marks. The next day, after crying your eyes out to Steel Magnolias, you may be temporarily biased against action movies.
  • Varying motives - in item-based recommender systems, the knitting book you purchased for your aunt's birthday will skew your recommendations (if you don't take the time to tell the system not to use it). You may give a bad kids' movie a high rating because your kids loved it.

All together, this makes recommender systems hard to improve past just-okay. A system with 80% accuracy seems great but is wrong 1 out of 5 times. This makes them more trouble than they're worth for some users.

like image 140
Corbin March Avatar answered Sep 28 '22 01:09

Corbin March