Databases versus plain text

Tags:

When dealing with small projects, what do you feel is the break even point for storing data in simple text files, hash tables, etc., versus using a real database? For small projects with simple data management requirements, a real database is unnecessary complexity and violates YAGNI. However, at some point the complexity of a database is obviously worth it. What are some signs that your problem is too complex for simple ad-hoc techniques and needs a real database?

Note: To people used to enterprise environments, this will probably sound like a weird question. However, my problem domain is bioinformatics. Most of my programming is prototypes, not production code. I'm primarily a domain expert and secondarily a programmer. Most of my code is algorithm-centric, not data management-centric. The purpose of this question is largely for me to figure out how much work I might save in the long run if I learn to use proper databases in my code instead of the more ad-hoc techniques I typically use.

743

asked Feb 05 '09 03:02

dsimcha

2 Answers

1) Concurrency. Do you have multiple people accessing the same dataset? Then it's going to get pretty involved to broker all of the different readers and writers in a scalable fashion if you roll your own system.

2) Formatting and relationships: Is your data something that doesn't fit neatly into a table structure? Long nucleotide sequences and stuff like that? That's not really conveniently tabular data.

Another example: Nobody would consider implementing software like Photoshop to store PSDs in a relational format, because the data structures don't really lend themselves to that type of storage or query pattern.

3) ACID (sort of a corollary to #1): If Atomicity, Consistency, Integrity, and Durability are not challenges with a flat file, then go with a flat file.

155

answered Sep 30 '22 15:09

Dave Markle

For me, the line is crossed once I have to query my data in ways that involve more than a single relationship. Relating two flat data structures on disk is fairly simple, but once we get beyond that, a set-based language like SQL and formal database relationships actually reduce complexity.

answered Sep 30 '22 15:09

Rex M

Related questions
                            
                                Inserting NULL into MySQL timestamp
                            
                                Select Query by Pair of fields using an in clause
                            
                                Execute SQL from file in SQLAlchemy
                            
                                Database - (rows or records, columns or fields)?
                            
                                Do database transactions prevent race conditions?
                            
                                Does LevelDB support java?
                            
                                What does N' stands for in a SQL script ? (the one used before characters in insert script)
                            
                                Can I download an SQLite db on /sdcard and access it from my Android app?
                            
                                Database indexes and their Big-O notation
                            
                                Calculate distance between Zip Codes... AND users.
                            
                                cursor.rowcount always -1 in sqlite3 in python3k
                            
                                Table with 80 million records and adding an index takes more than 18 hours (or forever)! Now what?
                            
                                how to pass a null value to a foreign key field?
                            
                                escaping dash in username
                            
                                Command-line/API for Schema Compare in SSDT SQL Server Database Project?
                            
                                copy from one database to another using oracle sql developer - connection failed
                            
                                Getting the primary key of an newly inserted row in SQL Server 2008
                            
                                best event sourcing db strategy
                            
                                How to change the ownership of a table in database
                            
                                how to select distinct value from multiple tables

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Databases versus plain text

Tags:

complexity-theory

database

dsimcha

People also ask

2 Answers

Dave Markle

Rex M

Recent Activity

Donate For Us