How-to: Ranking Search Results

Tags:

I have a webapp development problem that I've developed one solution for, but am trying to find other ideas that might get around some performance issues I'm seeing.

problem statement:

a user enters several keywords/tokens
the application searches for matches to the tokens
need one result for each token
- ie, if an entry has 3 tokens, i need the entry id 3 times
rank the results
- assign X points for token match
- sort the entry ids based on points
- if point values are the same, use date to sort results

What I want to be able to do, but have not figured out, is to send 1 query that returns something akin to the results of an in(), but returns a duplicate entry id for each token matches for each entry id checked.

Is there a better way to do this than what I'm doing, of using multiple, individual queries running one query per token? If so, what's the easiest way to implement those?

edit
I've already tokenized the entries, so, for example, "see spot run" has an entry id of 1, and three tokens, 'see', 'spot', 'run', and those are in a separate token table, with entry ids relevant to them so the table might look like this:

Click to copy

'see', 1 
'spot', 1 
'run', 1 
'run', 2 
'spot', 3

637

asked Sep 06 '08 19:09

warren

3 Answers

you could achive this in one query using 'UNION ALL' in MySQL.

Just loop through the tokens in PHP creating a UNION ALL for each token:

e.g if the tokens are 'x', 'y' and 'z' your query may look something like this

Click to copy

SELECT * FROM `entries` 
WHERE token like "%x%" union all 
    SELECT * FROM `entries` 
    WHERE token like "%y%" union all 
        SELECT * FROM `entries` 
        WHERE token like "%z%" ORDER BY score ect...

The order clause should operate on the entire result set as one, which is what you need.

In terms of performance it won't be all that fast (I'm guessing), however with databases the main overhead in terms of speed is often sending the query to the database engine from PHP and receiving the results. With this technique this only happens once instead of once per token, so performance will increase, I just don't know if it'll be enough.

answered Oct 12 '22 00:10

Robin Barnes

I know this isn't strictly an answer to the question you're asking but if your table is thousands rather than millions of rows, then a FULLTEXT solution might be the best way to go here.

In MySQL when you use MATCH on your indexed column, each keyword you supply will be given a relevance score (calculated roughly by the number of times each keyword was mentioned) that will be more accurate than your method and certainly more effecient for multiple keywords.

See here: http://dev.mysql.com/doc/refman/5.0/en/fulltext-search.html

answered Oct 12 '22 00:10

David McLaughlin

If you're using the UNION ALL pattern you may also want to include the following parts to your query:

Click to copy

SELECT COUNT(*) AS C
...
GROUP BY ID
ORDER BY c DESC

While this is a really trivial example it does get you the frequency of the matches for each result and this could be a pseudo rank to start with.

answered Oct 12 '22 00:10

Erik

Related questions
                            
                                Laravel 5 hasManyThrough
                            
                                Yii2 DropDownList Onchange change Autocomplete Widget "source" attribute?
                            
                                iCal format for Google Calendar / Yahoo calendar not working
                            
                                creating database from postgreSQL with symfony
                            
                                HotelBeds Php API providing me empty result
                            
                                PHP 7 - Unsupported declare 'strict_types'
                            
                                POST http://localhost:3000/ 404 (Not Found)
                            
                                Wildcard in prepared MySQLi returning bad values
                            
                                How can I bundle search terms into more efficient queries?
                            
                                Laravel Multiple Model Events
                            
                                How to access my website on GoDaddy with just the IP address of my Web Hosting account [closed]
                            
                                Pass variables from one table to another in another PHP page [duplicate]
                            
                                How can i rename laravel controller with command line interface(CLI)?
                            
                                Apache error: cannot load mod_access_compat.so
                            
                                What is `HtmlString` used for in Laravel?
                            
                                How to read and echo file size of uploaded file being written at server in real time without blocking at both server and client?
                            
                                laravel validate with user function
                            
                                DOCKERFILE: Running multiple CMD. (Starting NGINX and PHP) [duplicate]
                            
                                Reformat number inside array of string PHP
                            
                                Puphpeteer - Get text and href-attribute from link

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How-to: Ranking Search Results

Tags:

php

search

mysql

warren

People also ask

3 Answers

Robin Barnes

David McLaughlin

Erik

Recent Activity

Donate For Us