Ranking algorithms

Tags:

I have around 4000 blog posts with me. I wanna rank all posts according to the following values

Upvote Count => P
Comments Recieved => C
Share Count => S
Created time in Epoch => E
Follower Count of Category which post belongs to => F (one post has one category)
User Weight => U (User with most number of post have biggest weight)

I am expecting answer in pseudo code.

232

asked Jun 11 '13 04:06

shajin

1 Answers

Your problem falls into the category of regression (link). In machine learning terms, you have a collection of features (link) (which you list in your question) and you have a score value that you want to predict given those features.

What Ted Hopp has suggested is basically a linear predictor function (link). That might be too simple a model for your scenario.

Consider using logistic regression (link) for your problem. Here's how you would go about using it.

1. create your model-learning dataset

Randomly select some m blog posts from your set of 4000. It should be a small enough set that you can comfortably look through these m blog posts by hand.

For each of the m blog posts, score how "good" it is with a number from 0 to 1. If it helps, you can think of this as using 0, 1, 2, 3, 4 "stars" for the values 0, 0.25, 0.5, 0.75, 1.

You now have m blog posts that each have a set of features and a score.

You can optionally expand your feature set to include derived features - for example, you could include the logarithm of the "Upvote Count," the "Comments Recieved", the "Share Count," and the "Follower Count," and you could include the logarithm of the number of hours between "now" and "Created Time."

2. learn your model

Use gradient descent to find a logistic regression model that fits your model-learning dataset. You should partition your dataset into training, validation, and test sets so that you can carry out those respective steps in the model-learning process.

I won't elaborate any more on this section because the internet is full of the details and it's a canned process.

Wikipedia links:

Gradient descent
Logistic regression model fitting

3. apply your model

Having learned your logistic regression model, you can now apply it to predict the score for how "good" a new blog post is! Simply compute the set of features (and derived features), then use your model to map those features to a score.

Again, the internet is full of the details for this section, which is a canned process.

If you have any questions, make sure to ask!

If you're interested in learning more about machine learning, you should consider taking the free online Stanford Machine Learning course on Coursera.org. (I'm not affiliated with Stanford or Coursera.)

200

answered Oct 07 '22 21:10

Timothy Shields

Related questions
                            
                                Dynamic programming - paint fence algorithm
                            
                                Why are logarithms in computer science presumed to be base-2 logarithms? [closed]
                            
                                Statistical approach to chess?
                            
                                Finding and removing outliers in PHP
                            
                                C strlen() implementation in one line of code
                            
                                Algorithm for dividing very large numbers
                            
                                Find maximum product of 3 numbers in an array
                            
                                Bitwise operator to get byte from 32 bits
                            
                                Check whether a point is inside of a simple polygon
                            
                                Speed dating algorithm
                            
                                Longest path between 2 Nodes
                            
                                Efficient algorithm to get primes between two large numbers
                            
                                What is difference between Array and Binary search tree in efficiency?
                            
                                Next odd number in javascript
                            
                                Algorithms for helping humans choose (kitten war, for instance)
                            
                                Python factorization
                            
                                Algorithm for digit summing?
                            
                                Second min cost spanning tree
                            
                                Shuffling a deck of cards
                            
                                Mergesort in java

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Ranking algorithms

Tags:

algorithm

math

machine-learning

ranking

rank