java embedded library on-disk key-value database [closed]

Tags:

java

performance

database

nosql

embedded-database

What I think I'm looking for is a NoSQL, library-embedded, on-disk (i.e. not in-memory) database that's accessible from Java (and preferably runs inside my instance of the JVM). That's not really much of a database, and I'm tempted to roll my own. Basically I'm looking for the "should we keep this in memory or put it on disk" portion of a database.

Our model has grown to several gigabytes. Right now this is all done in memory, meaning we're pushing the JVM to upward of several gigabytes of heap. It's currently all stored in a flat XML file, serialized and deserialized with XStream and compressed with Java's built-in gzip libraries. That worked well while our model stayed under 100MB, but now that it's larger than that it's becoming a problem.

Loosely speaking, that model can be broken down as:

Project
1. configuration component (directed-acyclic-graph), not at all database friendly
2. a list of a dozen "experiment" structures
  - each containing a list of about a dozen "run-model" structures.
    1. each run-model contains hundreds of megs of data. Once written they are never edited.

What I'd like to do is have something that conforms to a map interface, of guid -> run-model. This mini-database would keep a flat table of these objects. On our experiment model, we would replace the list of run-models with a list of guids, and add, at the application layer, a get call to this map, which would pull it off the disk and into memory.
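To make the idea concrete, here is a rough sketch of what the roll-my-own version might look like, using only the JDK. Everything in it is an assumption for illustration: the class name `RunModelStore`, the one-file-per-GUID layout, and the use of plain Java serialization (rather than XStream) are all hypothetical, and it exposes only `put`/`get` rather than the full `java.util.Map` interface.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.OutputStream;
import java.io.Serializable;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.UUID;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

/** Hypothetical store: one gzip-compressed file per run-model, keyed by GUID. */
class RunModelStore<V extends Serializable> {
    private final Path dir;

    RunModelStore(Path dir) {
        try {
            Files.createDirectories(dir);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        this.dir = dir;
    }

    private Path fileFor(UUID id) {
        return dir.resolve(id + ".bin.gz");
    }

    /** Writes the value to disk once and returns the GUID the experiment should keep. */
    UUID put(V value) {
        UUID id = UUID.randomUUID();
        try (OutputStream raw = Files.newOutputStream(fileFor(id));
             ObjectOutputStream out = new ObjectOutputStream(new GZIPOutputStream(raw))) {
            out.writeObject(value);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return id;
    }

    /** Reads the value back from disk on every call; returns null for an unknown GUID. */
    @SuppressWarnings("unchecked")
    V get(UUID id) {
        Path file = fileFor(id);
        if (!Files.exists(file)) {
            return null;
        }
        try (InputStream raw = Files.newInputStream(file);
             ObjectInputStream in = new ObjectInputStream(new GZIPInputStream(raw))) {
            return (V) in.readObject();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }

    boolean contains(UUID id) {
        return Files.exists(fileFor(id));
    }
}
```

Since run-models are never edited once written, the sketch has no update or locking logic. It also does no caching, so the caller decides how long a loaded run-model stays in memory.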

That means we can keep configuration of our program in XML (which I'm very happy with) and keep a table of the big data in a DBMS that will keep us from consuming multi-GB of memory. On program start and exit I could then load and unload the two portions of our model (the config section in XML, and the run-models in the database format) from an archiving format.
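The load/unload step could be sketched roughly like this, assuming the config XML and the run-model files live together in one working directory that gets packed into a single zip on exit and unpacked on start. The class name `ProjectArchive` and the choice of zip as the archiving format are assumptions for illustration only.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

/** Hypothetical helper: packs a working directory into one zip and unpacks it again. */
class ProjectArchive {

    /** Packs every regular file under workDir; the archive must live outside workDir. */
    static void pack(Path workDir, Path archive) {
        try (Stream<Path> walk = Files.walk(workDir);
             OutputStream raw = Files.newOutputStream(archive);
             ZipOutputStream zip = new ZipOutputStream(raw)) {
            List<Path> files = walk.filter(Files::isRegularFile).collect(Collectors.toList());
            for (Path file : files) {
                // Zip entry names use forward slashes regardless of platform.
                String name = workDir.relativize(file).toString().replace('\\', '/');
                zip.putNextEntry(new ZipEntry(name));
                Files.copy(file, zip);
                zip.closeEntry();
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Unpacks the archive into workDir, creating directories as needed. */
    static void unpack(Path archive, Path workDir) {
        Path root = workDir.toAbsolutePath().normalize();
        try (InputStream raw = Files.newInputStream(archive);
             ZipInputStream zip = new ZipInputStream(raw)) {
            for (ZipEntry entry = zip.getNextEntry(); entry != null; entry = zip.getNextEntry()) {
                Path target = root.resolve(entry.getName()).normalize();
                // Reject entries that would land outside the target directory.
                if (!target.startsWith(root)) {
                    throw new IOException("Entry escapes target dir: " + entry.getName());
                }
                if (entry.isDirectory()) {
                    Files.createDirectories(target);
                    continue;
                }
                Files.createDirectories(target.getParent());
                Files.copy(zip, target, StandardCopyOption.REPLACE_EXISTING);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

If the run-model files are already gzip-compressed, zipping them again buys little extra space; the archive is mostly there to keep the project as a single file.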

I'm sort of feeling gung-ho about this, and think that I could probably implement it with some of XStream's XML inspection strategies and a custom map implementation, but a voice in the back of my head is telling me I should find a library to do it instead.

Should I roll my own or is there a database that's small enough to fit this bill?

Thanks guys,

-Geoff

990

asked Mar 05 '14 00:03

Groostav

2 Answers

http://www.mapdb.org/

Also take a look at this question: Alternative to BerkeleyDB?

153

answered Oct 17 '22 18:10

cruftex

Since MapDB is a possible solution for your problem, Chronicle Map is also worth considering. It's an embeddable Java key-value store, optionally persistent, offering a very similar programming model to MapDB: it also works via the vanilla java.util.Map interface, with transparent serialization of keys and values.

The major difference is that according to third-party benchmarks, Chronicle Map is times faster than MapDB.

Regarding stability, no bugs have been reported against the Chronicle Map data storage for months now, while it is in active use in many projects.

Disclaimer: I'm the developer of Chronicle Map.

answered Oct 17 '22 17:10

leventov
