I have a great deal of data to keep synchronized across four or five sites around the world, around half a terabyte at each site. The data changes (either additions or modifications) by around 1.4 gigabytes per day, and the changes can originate at any of the sites.
A large percentage (30%) of the data consists of duplicate packages (for example, packaged-up JDKs), so the solution would have to be able to detect that such files are already lying around on the local machine and use those copies instead of downloading them from another site.
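Roughly, the duplicate-detection side I have in mind would work like this Python sketch (the paths, the index layout, and the download callback are invented, just to show the idea): index the local tree by content hash, and consult the index before pulling anything from a remote site.

    import hashlib
    from pathlib import Path

    def checksum(path: Path, chunk: int = 1 << 20) -> str:
        """Stream-hash so multi-gigabyte packages never sit fully in RAM."""
        h = hashlib.sha256()
        with path.open("rb") as f:
            for block in iter(lambda: f.read(chunk), b""):
                h.update(block)
        return h.hexdigest()

    def build_local_index(root: Path) -> dict[str, Path]:
        """Map content hash -> one local copy, so duplicates match by content, not name."""
        index: dict[str, Path] = {}
        for p in root.rglob("*"):
            if p.is_file():
                index.setdefault(checksum(p), p)
        return index

    def obtain(wanted_hash: str, index: dict[str, Path], download):
        """Reuse an existing local duplicate if we have one; only hit the network otherwise."""
        local = index.get(wanted_hash)
        if local is not None:
            return local              # copy/hardlink this file instead of transferring it
        return download(wanted_hash)  # 'download' is a made-up callback for the remote fetch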
Version control is not an issue; this is not a codebase per se.
I'm just wondering whether there are any solutions out there (preferably open source) that come close to such a thing.
My baby script using rsync doesn't cut the mustard any more; I'd like to do more complex, intelligent synchronization.
Thanks
Edit: This should be UNIX-based :)
Have you tried Unison?
I've had good results with it. It's basically a smarter rsync, which may be just what you want. There is a listing comparing file syncing tools here.
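For example, a minimal profile for one pair of sites could look something like this (the profile name, paths, and hostname are made up; root, batch, prefer, and times are standard Unison preferences, but check the manual for the version you install):

    # ~/.unison/mirror.prf -- invented profile name and hosts
    root = /data/mirror
    root = ssh://siteB.example.com//data/mirror

    batch = true      # run without prompting for every change
    prefer = newer    # on a conflict, keep the most recently modified copy
    times = true      # propagate modification times

You'd then run "unison mirror". Bear in mind that Unison synchronizes one pair of roots at a time, so with four or five sites you'd most likely run it in a star topology against a designated hub rather than between every pair.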