At the moment I have a user file system in my application (Apache CMIS). As it's growing bigger, I'm considering a move to Hadoop (HDFS), since we need to run some statistics on it as well. The problem: the current file system provides versioning of the files. When I read about Hadoop (HDFS) and file versioning, I mostly found that I would have to write this versioning layer myself. Is there already something available to manage versioning of files in HDFS, or do I really have to write it myself? (I don't want to reinvent the wheel, but I can't find a proper solution either.)
Answer
Hadoop (HDFS) doesn't support versioning of files. You can get this functionality by combining Hadoop with Amazon S3: Hadoop uses S3 as the file system (without HDFS-style blocks; durability and recovery are provided by S3), and S3's object versioning covers the versioning of files. Hadoop still uses YARN for the distributed processing.
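As a rough sketch of that setup (assuming the hadoop-aws module is on the classpath and a bucket named my-bucket, both placeholders here), Hadoop is pointed at S3 through the s3a:// filesystem; versioning of overwritten objects is then handled on the S3 side, not by Hadoop:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

import java.net.URI;

public class S3aExample {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Credentials: in practice these usually come from environment
        // variables or an IAM role rather than being hard-coded.
        conf.set("fs.s3a.access.key", "YOUR_ACCESS_KEY");   // placeholder
        conf.set("fs.s3a.secret.key", "YOUR_SECRET_KEY");   // placeholder

        // Open the bucket through the s3a filesystem implementation
        // provided by the hadoop-aws module.
        FileSystem fs = FileSystem.get(URI.create("s3a://my-bucket/"), conf);

        // List what is stored under /data in the bucket. With bucket
        // versioning enabled on the S3 side, overwritten objects keep
        // their previous versions (managed by S3, not by Hadoop).
        for (FileStatus status : fs.listStatus(new Path("/data"))) {
            System.out.println(status.getPath());
        }
    }
}
```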
Namenode directory structure: this mechanism provides resilience, particularly if one of the directories is an NFS mount, as is recommended. Note that the VERSION file found there is a Java properties file containing information about the version of the HDFS software that is running; it is unrelated to versioning of user files.
You cannot modify data once it is stored in HDFS, because HDFS follows a write-once-read-many model. You can only append to data that is already stored in HDFS.
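To illustrate the write-once model, here is a minimal sketch (the file path is made up) that appends to an existing HDFS file using the FileSystem API; rewriting the file in place is not supported:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsAppendExample {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);

        // Hypothetical path; the file must already exist in HDFS.
        Path file = new Path("/user/demo/log.txt");

        // HDFS does not allow in-place updates, but appending to the
        // end of an existing file is supported.
        try (FSDataOutputStream out = fs.append(file)) {
            out.writeBytes("new record\n");
        }
    }
}
```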
How does HDFS store data?
HDFS divides files into blocks and stores each block on a DataNode. Multiple DataNodes are linked to the master node in the cluster, the NameNode. The master node distributes replicas of these data blocks across the cluster.
HDFS is designed to reliably store very large files across machines in a large cluster. It stores each file as a sequence of blocks; all blocks in a file except the last block are the same size. The blocks of a file are replicated for fault tolerance. The block size and replication factor are configurable per file.
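As a small sketch of those per-file settings (the path and values are made up), one of the FileSystem.create overloads lets you choose the replication factor and block size when a file is written, and setReplication can change the replication of an existing file:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class BlockSettingsExample {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf);

        Path file = new Path("/user/demo/stats.csv"); // hypothetical path

        // Create the file with a replication factor of 2 and a 64 MB
        // block size instead of the cluster-wide defaults.
        try (FSDataOutputStream out = fs.create(
                file,
                true,                 // overwrite if it exists
                4096,                 // I/O buffer size in bytes
                (short) 2,            // replication factor
                64L * 1024 * 1024)) { // block size in bytes
            out.writeBytes("id,value\n");
        }

        // The replication factor of an existing file can be changed later.
        fs.setReplication(file, (short) 3);
    }
}
```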
Versioning is not possible with HDFS.
Instead you can use Amazon S3, which provides versioning and is also compatible with Hadoop.