I am trying to run a SQL query to get four random items. As the table <code>product_filter</code> has more than one touple in <code>product</code> i have to use <code>DISTINCT</code> in <code>SELECT</code>, so i get this error: for SELECT DISTINCT, ORDER BY expressions must appear in select list But if i put <code>RANDOM()</code> in my <code>SELECT</code> it will avoid the <code>DISTINCT</code> result. Someone know how to use <code>DISTINCT</code> with the <code>RANDOM()</code> function? Below is my problematic query. <pre class="prettyprint"><code>SELECT DISTINCT p.id, p.title FROM product_filter pf JOIN product p ON pf.cod_product = p.cod JOIN filters f ON pf.cod_filter = f.cod WHERE p.visible = TRUE LIMIT 4 ORDER BY RANDOM(); </code></pre>

You can simplify your query to avoid the problem a priori: <pre class="prettyprint"><code>SELECT p.cod, p.title FROM product p WHERE p.visible AND EXISTS ( SELECT 1 FROM product_filter pf JOIN filters f ON f.cod = pf.cod_filter WHERE pf.cod_product = p.cod ) ORDER BY random() LIMIT 4; </code></pre> <h3>Major points:</h3> <ul> <li>You have only columns from table <code>product</code> in the result, other tables are only checked for existence of a matching row. For a case like this the <code>EXISTS</code> semi-join is likely the fastest and simplest solution. Using it does not multiply rows from the base table <code>product</code>, so you don't need to remove them again with <code>DISTINCT</code>.</li> <li><code>LIMIT</code> has to come last, after <code>ORDER BY</code>.</li> <li>I simplified WHERE <code>p.visible = 't'</code> to <code>p.visible</code>, because this should be a boolean column.</li> </ul>

How to use SELECT DISTINCT with RANDOM() function in PostgreSQL?

Tags:

sql

select

postgresql

I am trying to run a SQL query to get four random items. As the table product_filter has more than one touple in product i have to use DISTINCT in SELECT, so i get this error:

for SELECT DISTINCT, ORDER BY expressions must appear in select list

But if i put RANDOM() in my SELECT it will avoid the DISTINCT result.

Someone know how to use DISTINCT with the RANDOM() function? Below is my problematic query.

SELECT DISTINCT
    p.id, 
    p.title
FROM
    product_filter pf
    JOIN product p ON pf.cod_product = p.cod
    JOIN filters f ON pf.cod_filter = f.cod
WHERE
    p.visible = TRUE
LIMIT 4
ORDER BY RANDOM();

300

asked Jul 09 '12 18:07

Marcio Mazzucato

2 Answers

You either do a subquery

SELECT * FROM (
    SELECT DISTINCT p.cod, p.title ... JOIN... WHERE
    ) ORDER BY RANDOM() LIMIT 4;

or you try GROUPing for those same fields:

SELECT p.cod, p.title, MIN(RANDOM()) AS o FROM ... JOIN ...
    WHERE ... GROUP BY p.cod, p.title ORDER BY o LIMIT 4;

Which of the two expressions will evaluate faster depends on table structure and indexing; with proper indexing on cod and title, the subquery version will run faster (cod and title will be taken from index cardinality information, and cod is the only key needed for the JOIN, so if you index by title, cod and visible (used in the WHERE), it is likely that the physical table will not even be accessed at all.

I am not so sure whether this would happen with the second expression too.

answered Oct 26 '22 07:10

LSerni

You can simplify your query to avoid the problem a priori:

SELECT p.cod, p.title
FROM   product p
WHERE  p.visible
AND    EXISTS (
    SELECT 1
    FROM   product_filter pf
    JOIN   filters f ON f.cod = pf.cod_filter
    WHERE  pf.cod_product = p.cod
    )
ORDER  BY random()
LIMIT  4;

Major points:

You have only columns from table product in the result, other tables are only checked for existence of a matching row. For a case like this the EXISTS semi-join is likely the fastest and simplest solution. Using it does not multiply rows from the base table product, so you don't need to remove them again with DISTINCT.
LIMIT has to come last, after ORDER BY.
I simplified WHERE p.visible = 't' to p.visible, because this should be a boolean column.

answered Oct 26 '22 06:10

Erwin Brandstetter

Related questions
                            
                                Should every MySQL table have an auto-incremented primary key?
                            
                                Oracle MIN as analytic function - odd behavior with ORDER BY?
                            
                                Interesting tree/hierarchical data structure problem
                            
                                Joining results of two different aggregate functions on the same table
                            
                                Behavior of Sequence in Oracle Merge Statements
                            
                                Draft / Live Content System Database Design
                            
                                Using outer query result in a subquery in postgresql
                            
                                How to track all queries submitted to Oracle DB from app server? [duplicate]
                            
                                How to SELECT DEFAULT value of a field
                            
                                Mysql, handlersocket and partitioning?
                            
                                Counting the number of rows with a value greater than or equal to a value from another column in SQL
                            
                                How to tally wins and losses using SUM and CASE?
                            
                                How can I find the differences between two databases? [closed]
                            
                                SQL Command to replace embedded spaces with another character
                            
                                Get this week's data using SQLite
                            
                                Real SQL syntax highlighting in PHP scripts with Vim
                            
                                How to get a SQL Server stored procedure return value using pyodbc?
                            
                                SQl Query Results by Year
                            
                                Generating Scripts for Specific Records in SQL Server
                            
                                Remove Duplicates records from view

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With