Scalable Image Storage

Tags:

I'm currently designing an architecture for a web-based application that should also provide some kind of image storage. Users will be able to upload photos as one of the key feature of the service. Also viewing these images will be one of the primary usages (via web).

However, I'm not sure how to realize such a scalable image storage component in my application. I already thought about different solutions but due to missing experiences, I look forward to hear your suggestions. Aside from the images, also meta data must besaved. Here are my initial thoughts:

Use a (distributed) filesystem like HDFS and prepare dedicated webservers as "filesystem clients" in order to save uploaded images and service requests. Image meta data are saved in a additional database including the filepath information for each image.
Use a BigTable-oriented system like HBase on top of HDFS and save images and meta data together. Again, webservers bridge image uploads and requests.
Use a completly schemaless database like CouchDB for storing both images and metadata. Additionally, use the database itself for upload and delievery by using the HTTP-based RESTful API. (Additional question: CouchDB does save blobs via Base64. Can it however return data in form of image/jpeg etc.)?

241

asked Dec 25 '09 13:12

b_erb

1 Answers

We have been using CouchDB for that, saving images as an "Attachment". But after a year the multi-dozen GB CouchDB Database files turned out to be a headache. For example CouchDB replication still has issues if you use it with very large document sizes.

So we just rewrote our software to use CouchDB for image information and Amazon S3 for the actual image storage. The code is available at http://github.com/hudora/huImages

You might want to set up a Amazon S3 compatible Storage Service on-site for your project. This keeps you flexible and leaves the amazon option without requiring external services for now. Walruss seems to become the most popular and scalable S3 clone.

I also urge you to look into the Design of Livejournal with their excellent Open Source MogileFS and Perlbal offerings. This combination is probably the most Famous image serving setup.

Also the flickr Architecture can be an inspiration, although they don't offer Open Source software to the public, like Livejournal does.

answered Sep 25 '22 09:09

max

Related questions
                            
                                how to check if directory exists with Storage:: facade in laravel?
                            
                                amazon s3 vs google cloud storage [closed]
                            
                                C#: Create a virtual drive in Computer
                            
                                What does "Thin Pool" in docker mean?
                            
                                requestLegacyExternalStorage is not working in Android 11 - API 30
                            
                                In Laravel, how can I obtain a list of all files in a public folder?
                            
                                Really force file sync/flush in Java
                            
                                Does PostgreSQL support transparent compressing of tables (fragments)?
                            
                                How to find the amount of free storage (disk space) left on Android? [duplicate]
                            
                                "Not allowed to load local resource: file:///C:....jpg" Java EE Tomcat
                            
                                how to check internal and external storage if exist
                            
                                Should I obfuscate OAuth consumer secret stored by Android app?
                            
                                is there a maximum size to android internal storage allocated for an app?
                            
                                where is the best place to save images from users upload
                            
                                Do .bss section zero initialized variables occupy space in elf file?
                            
                                Core Data - Storing Images (iPhone) [closed]
                            
                                Save a picture of a signature to a file in Rhodes on Android
                            
                                React Native AsyncStorage storing values other than strings
                            
                                Accessing functions bound to event handlers with jQuery
                            
                                How does jQuery .data() work?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Scalable Image Storage

Tags:

couchdb

storage

hadoop

hbase

hdfs

b_erb

People also ask

1 Answers

max

Recent Activity

Donate For Us