I am using mysql database which has only one table "data" with 17,151257 rows.This table has a column string. i want to print all the rows in which string column contains a particular query string (stored in "entered_query" variable), so i used following:
SELECT DISTINCT * from data WHERE string LIKE '%".$entered_query."%' limit 10
As obvious above query is taking too much time to execute.
I have read that indexing can be used but how in this case ?
I have also thought of dividing whole data into 10 different rows and then executing 10 parallel queries using perl DBI.
Now I have following questions :
Regular indexing can't be used to improve that query. MySQL indexes are B-trees, which means they can find a prefix of the indexed column very quickly. But since your LIKE
query has %
at the beginning, there's no unique prefix to search for. So every row has to be scanned to match against the pattern.
However, MySQL also supports full-text searching. This creates an index of all the words in the column, and it can find these words quickly. See the documentation for details.
If you use LIMIT 10
, it will stop scanning as soon as it finds the first 10 rows that satisfy the conditions. Unless you also use ORDER BY
-- then it has to find all the rows so that it can sort them before selecting the first 10.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With