 

MySQL: retrieve a large select in chunks

Tags: select, mysql, save

I have a select with more than 70 million rows.

I'd like to save the selected data into one large CSV file on Windows Server 2012 R2.

Q: How can I retrieve the data from MySQL in chunks for better performance?

When I try to save the result of the large select in one go, I get out-of-memory errors.

Toren asked Dec 09 '15 13:12




2 Answers

You could try using the LIMIT feature. If you do this:

SELECT * FROM MyTable ORDER BY whatever LIMIT 0,1000 

You'll get the first 1,000 rows. The first LIMIT value (0) defines the starting row in the result set. It's zero-indexed, so 0 means "the first row". The second LIMIT value is the maximum number of rows to retrieve. To get the next few sets of 1,000, do this:

SELECT * FROM MyTable ORDER BY whatever LIMIT 1000,1000  -- rows 1,001 - 2,000
SELECT * FROM MyTable ORDER BY whatever LIMIT 2000,1000  -- rows 2,001 - 3,000

And so on. When the SELECT returns no rows, you're done.

This isn't enough on its own though, because any changes done to the table while you're processing your 1K rows at a time will throw off the order. To freeze the results in time, start by querying the results into a temporary table:

CREATE TEMPORARY TABLE MyChunkedResult AS (
  SELECT *
  FROM MyTable
  ORDER BY whatever
);

Side note: it's a good idea to make sure the temporary table doesn't exist beforehand:

DROP TEMPORARY TABLE IF EXISTS MyChunkedResult; 

At any rate, once the temporary table is in place, pull the row chunks from there:

SELECT * FROM MyChunkedResult LIMIT 0, 1000;
SELECT * FROM MyChunkedResult LIMIT 1000, 1000;
SELECT * FROM MyChunkedResult LIMIT 2000, 1000;
-- and so on.

I'll leave it to you to create the logic that calculates the next offset after each chunk and checks for the end of the results (there's a sketch of that loop after the DROP statement below). I'd also recommend much larger chunks than 1,000 records; that's just a number I picked out of the air.

Finally, it's good form to drop the temporary table when you're done:

DROP TEMPORARY TABLE MyChunkedResult; 
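
For illustration, here is a minimal sketch of that chunking loop in Python. The mysql-connector-python package, the connection settings, the table name, and the chunk size are all assumptions, not part of the original answer; adapt them to your setup.

import csv
import mysql.connector  # assumed client library: mysql-connector-python

CHUNK_SIZE = 100_000  # much larger than 1,000; tune to your memory budget

conn = mysql.connector.connect(host="localhost", user="me",
                               password="secret", database="mydb")
cur = conn.cursor()

# Freeze the result set in a temporary table first.
cur.execute("DROP TEMPORARY TABLE IF EXISTS MyChunkedResult")
cur.execute("CREATE TEMPORARY TABLE MyChunkedResult AS "
            "(SELECT * FROM MyTable ORDER BY whatever)")

with open("out.csv", "w", newline="") as f:
    writer = csv.writer(f)
    offset = 0
    while True:
        cur.execute("SELECT * FROM MyChunkedResult LIMIT %s, %s",
                    (offset, CHUNK_SIZE))
        rows = cur.fetchall()
        if not rows:          # no rows returned -> we're done
            break
        writer.writerows(rows)
        offset += CHUNK_SIZE

cur.execute("DROP TEMPORARY TABLE MyChunkedResult")
cur.close()
conn.close()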
Ed Gibbs answered Sep 22 '22 00:09


The LIMIT ... OFFSET approach slows the query down as the offset grows, because MySQL still has to scan and discard all of the skipped rows, which hurts on very large data sets. Another approach is keyset pagination. It requires a unique id in your query, which you can use as a bookmark pointing to the last row of the previous page. The next page is fetched starting from that bookmark. For instance:

SELECT user_id, name, date_created
FROM users
WHERE user_id > 0
ORDER BY user_id ASC
LIMIT 10000;

If the result set above returns 12345 as the user_id of its last row, you can use that value to fetch the next page as follows:

SELECT user_id, name, date_created
FROM users
WHERE user_id > 12345
ORDER BY user_id ASC
LIMIT 10000;

For more details, you may take a look at this page.
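
As a rough sketch, a keyset-paginated export loop might look like this in Python. The mysql-connector-python package, connection details, column names, and page size are assumptions for illustration only.

import csv
import mysql.connector  # assumed client library: mysql-connector-python

PAGE_SIZE = 10000

conn = mysql.connector.connect(host="localhost", user="me",
                               password="secret", database="mydb")
cur = conn.cursor()

with open("users.csv", "w", newline="") as f:
    writer = csv.writer(f)
    last_id = 0  # bookmark: largest user_id seen so far
    while True:
        cur.execute("SELECT user_id, name, date_created FROM users "
                    "WHERE user_id > %s ORDER BY user_id ASC LIMIT %s",
                    (last_id, PAGE_SIZE))
        rows = cur.fetchall()
        if not rows:
            break
        writer.writerows(rows)
        last_id = rows[-1][0]  # user_id of the last row is the next bookmark

cur.close()
conn.close()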

prafi answered Sep 25 '22 00:09