database vs. flat files

Tags: file, database

The company I work for is trying to switch a product from a flat-file format to a database. We handle pretty big data files (e.g. 25 GB per file), and they are updated very frequently. We need to run queries that access the data randomly as well as sequentially. I am trying to convince my colleagues of the advantages of using a database, but some of them seem reluctant. Can you help me out with some reasons, or links to posts, explaining why we should use a database, or at least clarify why flat files are better (if they are)?

asked Mar 01 '10 15:03

hyperboreean

2 Answers

1. Databases handle querying for you, so you don't have to walk over files manually, and they can handle very complicated queries.
2. Databases handle indexing, so tasks like "get the record with id = x" can be very fast.
3. Databases can handle multiprocess/multithreaded access.
4. Databases can handle access over the network.
5. Databases can enforce data integrity.
6. Databases can update data easily (see 1).
7. Databases are reliable.
8. Databases can handle transactions and concurrent access.
9. Databases plus ORMs let you manipulate data in a very programmer-friendly way.
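
As a rough illustration of the querying and indexing points in the list above, here is a minimal sketch using Python's built-in sqlite3 module. The table name `records`, its columns, and the helper functions are made up for the example; they are not taken from the question. The primary key gives the engine an index, so both the single-record lookup and the range query are answered without scanning every row by hand.

```python
import sqlite3


def build_demo_db(n_rows=10_000):
    """Create an in-memory table with a primary-key index (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE records (id INTEGER PRIMARY KEY, payload TEXT)")
    with conn:  # commit all inserts as one transaction
        conn.executemany(
            "INSERT INTO records (id, payload) VALUES (?, ?)",
            ((i, f"row-{i}") for i in range(n_rows)),
        )
    return conn


def get_record(conn, record_id):
    """Random access: fetch one record by id through the index."""
    row = conn.execute(
        "SELECT payload FROM records WHERE id = ?", (record_id,)
    ).fetchone()
    return row[0] if row else None


def count_in_range(conn, low, high):
    """Contiguous access: count the records whose id lies in [low, high]."""
    return conn.execute(
        "SELECT COUNT(*) FROM records WHERE id BETWEEN ? AND ?", (low, high)
    ).fetchone()[0]


conn = build_demo_db()
print(get_record(conn, 4242))         # row-4242
print(count_in_range(conn, 100, 199)) # 100
```

With a flat file, the equivalent of `get_record` would mean either scanning the file or maintaining an offset index yourself.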

answered Oct 07 '22 17:10

Andrey

This is an answer I gave some time ago:

It depends entirely on the domain-specific needs of the application. Direct access to text or binary files can often be extremely fast and efficient, and it gives you all the file access capabilities of your OS's file system.

Furthermore, your programming language most likely already has a built-in module for the specific parsing you need, or makes it easy to write one.

If what you need is many appends (INSERTs?), mostly sequential access, and little or no concurrency, files are the way to go.
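
A minimal sketch of that append-heavy, sequential-read pattern in Python, assuming a tab-separated text format and a single writer. The file name `events.tsv` and both helper functions are hypothetical, chosen only for illustration.

```python
import os
import tempfile


def append_record(path, fields):
    """Append one record as a tab-separated line (assumes a single writer)."""
    with open(path, "a", encoding="utf-8") as f:
        f.write("\t".join(fields) + "\n")


def scan_records(path):
    """Read the file front to back, yielding one record at a time."""
    with open(path, "r", encoding="utf-8") as f:
        for line in f:
            yield line.rstrip("\n").split("\t")


# Hypothetical log file in a temporary directory.
log_path = os.path.join(tempfile.mkdtemp(), "events.tsv")
append_record(log_path, ["1", "start"])
append_record(log_path, ["2", "stop"])

records = list(scan_records(log_path))
print(records)  # [['1', 'start'], ['2', 'stop']]
```

This stays simple only as long as nobody needs to update a record in place or have several processes write at once; that is where the trade-off starts to tip.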

On the other hand, when your requirements include concurrency, non-sequential reading/writing, atomicity, or atomic permissions, or when your data is relational by nature, you will be better off with a relational or OO database.

There is a lot that can be accomplished with SQLite3, which is extremely light (the library is only a few hundred kilobytes), ACID compliant, written in C, and highly ubiquitous (if it isn't already included in your programming language, as it is in Python, there is surely a binding available). It remains usable even on very large database files: the documented maximum size was about 140 terabytes (128 tebibytes) when this was written, and possibly more depending on version and configuration (see the SQLite limits documentation).
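
To make the ACID point concrete, here is a small sketch with Python's bundled sqlite3 module. The `accounts` table and the `transfer` helper are invented for the example. The second update in a transfer may violate the CHECK constraint, and when it does, the whole transaction is rolled back, so the first update never becomes visible.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    " name TEXT PRIMARY KEY,"
    " balance INTEGER NOT NULL CHECK (balance >= 0))"
)
with conn:
    conn.executemany(
        "INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 0)]
    )


def transfer(conn, src, dst, amount):
    """Move `amount` from src to dst atomically; return False if it is rejected."""
    try:
        with conn:  # commits on success, rolls back if an exception escapes
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        # The CHECK constraint failed, so the credit to dst was rolled back too.
        return False


ok = transfer(conn, "alice", "bob", 30)       # succeeds
failed = transfer(conn, "alice", "bob", 500)  # rejected: alice would go negative
balances = dict(conn.execute("SELECT name, balance FROM accounts"))
print(ok, failed, balances)  # True False {'alice': 70, 'bob': 30}
```

Getting the same all-or-nothing behavior with hand-rolled flat files usually means writing your own journaling or write-to-temp-then-rename logic.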

If your requirements were bigger, there wouldn't even be a discussion: go for a full-blown RDBMS.

Since you say in a comment that "the system" is merely a bunch of scripts, you should take a look at pgbash.

answered Oct 07 '22 18:10

Esteban Küber
