I have an application that receives about 40,000 rows of data each day, and I have about 5 million rows to handle in total (a 500 MB MySQL 5.0 database).
Currently, all of those rows are stored in a single table, which makes updates slow, backups hard, and so on.
What kind of schema is used in this type of application to keep the data accessible long term, without the problems of an overly large table, with easy backups and fast reads/writes?
Is PostgreSQL better than MySQL for this purpose?
Apache HBase is the Hadoop database: a distributed, scalable, big-data store. It is very convenient for storing logs and gives you real-time read/write access to your data.
MongoDB is well suited to hierarchical (document-oriented) data and, for some workloads, can be dramatically faster than a traditional relational database management system (RDBMS).
The methodology is presented as a step-by-step guide to the three main phases of database design, namely conceptual, logical, and physical design.
The database belongs to its future users, not its creator, so design with them in mind. Stay away from shortcuts, abbreviations, and plurals in names, and use consistent naming conventions. Don't reinvent the wheel or make things difficult for those who may need to modify the database at some point, which will certainly happen.
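As a small illustration of those naming rules (the table and column names here are invented):

    -- Singular, unabbreviated, consistently named objects (illustrative only).
    CREATE TABLE customer (              -- not "cust" or "customers"
        customer_id INT          NOT NULL PRIMARY KEY,
        full_name   VARCHAR(100) NOT NULL,
        created_at  DATETIME     NOT NULL
    );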
1 - 40,000 rows/day is not that big.
2 - Partition your data by insert date: you can easily delete old data this way.
3 - Don't hesitate to go through a datamart step (precompute frequently requested metrics in intermediary tables); see the sketch after this list.
FYI, I have used PostgreSQL with tables containing several GB of data without any problem (and without partitioning), and INSERT/UPDATE time stayed constant.
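A minimal sketch of points 2 and 3, assuming a hypothetical measurement table with an insert_date column (MySQL 5.1+ partitioning syntax; all table and column names are invented):

    -- Raw table partitioned by insert date, so old days can be removed
    -- by dropping whole partitions instead of running huge DELETEs.
    CREATE TABLE measurement (
        id          BIGINT NOT NULL AUTO_INCREMENT,
        insert_date DATE   NOT NULL,
        sensor_id   INT    NOT NULL,
        value       DOUBLE NOT NULL,
        PRIMARY KEY (id, insert_date)   -- the partition column must be part of every unique key
    )
    PARTITION BY RANGE (TO_DAYS(insert_date)) (
        PARTITION p2009_01 VALUES LESS THAN (TO_DAYS('2009-02-01')),
        PARTITION p2009_02 VALUES LESS THAN (TO_DAYS('2009-03-01')),
        PARTITION pmax     VALUES LESS THAN MAXVALUE
    );

    -- Datamart step: precompute frequently requested metrics once,
    -- then serve queries from the small summary table.
    CREATE TABLE daily_sensor_summary AS
    SELECT insert_date,
           sensor_id,
           COUNT(*)   AS row_count,
           AVG(value) AS avg_value
    FROM measurement
    GROUP BY insert_date, sensor_id;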
We have log tables of 100-200 million rows now, and it is quite painful:
Backing them up is practically impossible; it would require several days of downtime.
Purging old data is becoming too painful as well; it usually ties up the database for several hours.
So far we've only seen these solutions:
For backups, set up a MySQL slave; backing up the slave doesn't impact the main DB. (We haven't done this yet, as the logs we load and transform come from flat files; we back up those files and can regenerate the DB in case of failure.)
For purging old data, the only painless way we've found is to introduce a new integer column that identifies the current date and partition the tables on that key, per day (requires MySQL 5.1). Dropping old data is then just a matter of dropping a partition, which is fast. A sketch of both ideas follows below.
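A hedged sketch of both approaches; the database, table, and partition names are placeholders, and the partitioning syntax assumes MySQL 5.1+:

    -- 1) Back up from the slave so the master is never tied up.
    --    On the slave: pause the replication SQL thread, dump, resume.
    STOP SLAVE SQL_THREAD;
    --   (from the shell)  mysqldump logdb > logdb_backup.sql
    START SLAVE SQL_THREAD;

    -- 2) Log table partitioned per day on an integer date key, so purging
    --    a day is a cheap DROP PARTITION instead of a multi-hour DELETE.
    CREATE TABLE access_log (
        day_key   INT          NOT NULL,   -- e.g. 20090131, set when loading
        logged_at DATETIME     NOT NULL,
        message   VARCHAR(255) NOT NULL
    )
    PARTITION BY RANGE (day_key) (
        PARTITION p20090130 VALUES LESS THAN (20090131),
        PARTITION p20090131 VALUES LESS THAN (20090201),
        PARTITION pmax      VALUES LESS THAN MAXVALUE   -- new partitions added each day
    );

    -- Purging the oldest day:
    ALTER TABLE access_log DROP PARTITION p20090130;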
If, in addition, you need to run transactions against these tables continuously (as opposed to just loading data every now and then and mostly querying it), you probably need to look into InnoDB rather than the default MyISAM tables.
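If you do switch engines, converting an existing table is a single statement, though it rewrites the whole table, so schedule it accordingly (access_log is the hypothetical table from the sketch above):

    -- Convert a MyISAM log table to InnoDB (this rebuilds the entire table).
    ALTER TABLE access_log ENGINE=InnoDB;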