Suppose I have a very large std::map<unsigned int, Foo> FooDB, which holds Foo objects in memory, retrievable by their ID. Now there might be more Foo objects than there is memory available to store them. So I'd like to have the following construct:

- query the Foo object with ID x from FooDB
- if it is in FooDB, return it
- keep FooDB for further queries
- if there is not enough memory for FooDB, remove the Foo objects not in use (oldest query timestamp)

I'd like to reserve some memory for the FooDB, and I can't tell how many Foo objects can be stored in it, as they differ in size.
Any ideas on how to implement this?
EDIT
My basic problem is: how can I tell a std::map's size in memory, with all the heap objects stored in it included, of course? How can I know when the "not enough memory" point has been reached?
So the standard way to implement a cache is to have a data structure through which we can access a value by a given key in constant time. With that in place, we can store key-value pairs in memory and retrieve them whenever we need them.
Caching is a separate memory, yes, and it has its own addresses, but when the CPU caches memory lines from RAM, it keeps a record of which RAM addresses the data came from and maintains a map between RAM address and cache address. From the point of view of your program, the address is the same.
An LRU (least recently used) cache is an efficient cache data structure that can be used to decide what to evict when the cache is full.
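As a minimal sketch of that idea (the class and member names here are my own, and the value type is a plain int just to keep the example short; in a FooDB it would be a Foo or a shared_ptr<Foo>), an LRU cache can combine an std::list ordered by recency with an std::unordered_map from key to list iterator:

#include <cstddef>
#include <list>
#include <unordered_map>
#include <utility>

class LRUCache
{
public:
    explicit LRUCache(std::size_t capacity) : capacity_(capacity) {}

    // Returns a pointer to the cached value, or nullptr on a miss.
    int* get(unsigned int key)
    {
        auto it = index_.find(key);
        if (it == index_.end())
            return nullptr;
        // Accessing an entry makes it the most recently used: move it to the front.
        items_.splice(items_.begin(), items_, it->second);
        return &it->second->second;
    }

    void put(unsigned int key, int value)
    {
        auto it = index_.find(key);
        if (it != index_.end())
        {
            it->second->second = value;
            items_.splice(items_.begin(), items_, it->second);
            return;
        }
        if (capacity_ != 0 && items_.size() >= capacity_)
        {
            // Evict the least recently used entry, which sits at the back.
            index_.erase(items_.back().first);
            items_.pop_back();
        }
        items_.emplace_front(key, value);
        index_[key] = items_.begin();
    }

private:
    std::size_t capacity_;
    std::list<std::pair<unsigned int, int>> items_;   // front = most recently used
    std::unordered_map<unsigned int,
        std::list<std::pair<unsigned int, int>>::iterator> index_;
};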
As far as I know, there's no way to ask an object what its size is, other than sizeof(). You said that sizeof() won't work because the Foo objects don't have a fixed size. In that case, if you can modify Foo, then maybe your Foo class can keep track of its memory footprint internally. And if you can't modify Foo, you might be able to write an external function that can deduce the memory footprint.
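For example, if Foo can be modified, a hypothetical memory_footprint() member might look roughly like this (the members shown here are made up purely for illustration, and the estimate ignores allocator overhead and padding):

#include <cstddef>
#include <string>
#include <vector>

class Foo
{
public:
    // Rough estimate of this object's memory usage: the object itself
    // plus the heap blocks owned by its members.
    std::size_t memory_footprint() const
    {
        return sizeof(Foo)
             + name_.capacity()                    // std::string buffer (ignoring SSO)
             + data_.capacity() * sizeof(double);  // std::vector buffer
    }

private:
    std::string name_;
    std::vector<double> data_;
};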
Fundamentally, it would be very difficult for the language/compiler/runtime to know how big a dynamically sized object is, because it doesn't know which allocations belong to the object. One simple approach, recursively summing everything its members point to, will fail on anything that holds a pointer to an object it doesn't "own". Another simple approach, tracking all of the allocations made between the start of the constructor and its return, will fail for anything that allocates after the constructor has run.
You might want to just use the number of Foo objects as your cache limit instead of the memory size. Unless you know a lot about the memory availability and usage of the entire system, a cap based on memory size would be just as arbitrary. And if you do know a lot about the memory usage of the entire system, you could simply use the overall memory availability to decide when to release objects from the cache.
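A count-based cap could be as simple as the following sketch (the shared_ptr value type and the MAX_FOOS constant are assumptions on my part, not something from the question):

#include <cstddef>
#include <map>
#include <memory>

class Foo;                                               // defined elsewhere
typedef std::map<unsigned int, std::shared_ptr<Foo>> FooDB;

const std::size_t MAX_FOOS = 100000;                     // arbitrary cap, tune for your system

// Returns true if another Foo may be inserted without exceeding the cap.
bool HasRoom(const FooDB& db)
{
    return db.size() < MAX_FOOS;
}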
It's fairly straightforward.
Just keep a reference to each in-memory Foo instance in FooDB in a linked list ordered by age, with the most recently used item at the front.
When you load a new item into memory for the first time, add it to the front of the list.
When you read or modify an item, move it from wherever it is in the list to the front.
When you need to delete an old item to make space, pop it off the back of the list.
For example:
#include <list>
#include <map>
#include <memory>

class Foo;                                  // forward declaration so PFoo can refer to it
typedef std::shared_ptr<Foo> PFoo;

class Foo
{
...
public:
    std::list<PFoo>::iterator age;          // this object's position in the age list
};

typedef std::map<unsigned int, PFoo> FooDB;
FooDB foodb;
std::list<PFoo> ages;                       // front = most recently used

void LoadFoo(PFoo foo)
{
    ages.push_front(foo);
    foo->age = ages.begin();                // remember where it sits in the list
}

void ReadFoo(PFoo foo)
{
    ...
    ages.erase(foo->age);                   // unlink from its current position
    ages.push_front(foo);                   // re-insert at the front (most recent)
    foo->age = ages.begin();
}

void MakeSpace()
{
    PFoo foo = ages.back();                 // least recently used entry
    ages.pop_back();
    DeleteFoo(foo);                         // removes it from foodb and frees the memory
}
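Note that foo->age has to be updated every time an entry is moved. As an alternative to the erase/push_front pair, std::list::splice can move the node to the front of the list without invalidating its iterator, so foo->age would stay valid on its own.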