Random-access container that does not fit in memory?

Tags:

I have an array of objects (say, images), which is too large to fit into memory (e.g. 40GB). But my code needs to be able to randomly access these objects at runtime.

What is the best way to do this?

From my code's point of view, it shouldn't matter, of course, if some of the data is on disk or temporarily stored in memory; it should have transparent access:

Click to copy

container.getObject(1242)->process();
container.getObject(479431)->process();

But how should I implement this container? Should it just send the requests to a database? If so, which one would be the best option? (If a database, then it should be free and not too much administration hassle, maybe Berkeley DB or sqlite?)

Should I just implement it myself, memoizing objects after acces sand purging the memory when it's full? Or are there good libraries (C++) for this out there?

The requirements for the container would be that it minimizes disk access (some elements might be accessed more frequently by my code, so they should be kept in memory) and allows fast access.

UPDATE: I turns out that STXXL does not work for my problem because the objects I store in the container have dynamic size, i.e. my code may update them (increasing or decreasing the size of some objects) at runtime. But STXXL cannot handle that:

STXXL containers assume that the data types they store are plain old data types (POD). http://algo2.iti.kit.edu/dementiev/stxxl/report/node8.html

Could you please comment on other solutions? What about using a database? And which one?

627

asked Jan 25 '10 19:01

Frank

2 Answers

Consider using the STXXL:

The core of STXXL is an implementation of the C++ standard template library STL for external memory (out-of-core) computations, i.e., STXXL implements containers and algorithms that can process huge volumes of data that only fit on disks. While the compatibility to the STL supports ease of use and compatibility with existing applications, another design priority is high performance.

143

answered Nov 15 '22 07:11

James McNellis

You could look into memory mapped files, and then access one of those too.

answered Nov 15 '22 08:11

Liz Albin

Related questions
                            
                                The latest version of gcc to use libstdc++.so.5
                            
                                Should I add throw() to the declarations for my C++ destructors?
                            
                                I want to start Qt development - what basic knowledge in C++ and OS do I have to own? [closed]
                            
                                How to use msxml with Visual Studio 2008 Express (no ATL classes) without becoming crazy?
                            
                                Calling Ruby class methods from C++
                            
                                What is the best way to port from Objective-C to C++?
                            
                                How to Cross Compile for Cell Linux on the PS3 from Windows?
                            
                                Determining what object files have caused .dll size increase [C++]
                            
                                Binary operator overloading on a templated class
                            
                                Issues with seeding a pseudo-random number generator more than once?
                            
                                can a GC be implemented with C++ raw pointers?
                            
                                What is Compatible "int" type in both 32Bit & 64Bit windows in C++?
                            
                                Drawing on the Desktop Background (WIN32)
                            
                                Is it possible to write an impure template in C++?
                            
                                astyle formatting multiple line <<
                            
                                Distributed shared memory library for C++? [closed]
                            
                                When or where was the term "Most vexing parse" coined?
                            
                                Simplest way to read registry key value to std::string?
                            
                                Platform independent resource system (like the Qt Resource system)
                            
                                Why is my C++ app faster than my C app (using the same library) on a Core i7

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Random-access container that does not fit in memory?

Tags:

c++

database

memory

data-structures

random-access

Frank

People also ask

2 Answers

James McNellis

Liz Albin

Recent Activity

Donate For Us