GNU parallel with rsync

Tags:

I'm trying to run some instances of rsync in parallel using ssh with GNU parallel. The command I'm running is like this:

find /tmp/tempfolder -type f -name 'chunck.*' | sort | parallel --gnu -j 4 -v ssh -i access.pem user@server echo {}\; rsync -Havessh -auz -0 --files-from={} ./ user@server:/destination/path

/tmp/tempfolder contains files with the prefix chunck and they contain the actual file lists.

With this command, I got the 4 calls for rsync alright, but they take a while to start running and don't start all together and don't run in parallel.

What am I doing wrong?

544

asked Mar 26 '14 21:03

Daivid

1 Answers

Are you sure the rsyncs are really not running in parallel ?
Checking with ps | grep rsync while the command is running will show which and how many rsyncs are actually running simultaneously.

By default, parallel holds printing output from each job until it's finished so that the different commands' output don't get all mixed up together:

--group  Group output. Output from each jobs is grouped together and is only printed when the command
         is finished. stderr (standard error) first followed by stdout (standard output). This takes
         some CPU time. In rare situations GNU parallel takes up lots of CPU time and if it is
         acceptable that the outputs from different commands are mixed together, then disabling
         grouping with -u can speedup GNU parallel by a factor of 10.

         --group is the default. Can be reversed with -u.

My guess is the rsyncs are actually running in parallel, but from the output it feels like they're running serial. -u option changes that.

For example with this cmd:

$ for i in 1 2 3 ; do echo a$i ; sleep 1 ; done
a1
a2
a3

By default in parallel we get no feedback until it's all done:

$ (echo a ; echo b ; echo c ) | parallel 'for i in 1 2 3 ; do echo {}$i ; sleep 1 ; done  ' 
a1
a2
a3
b1
b2
b3
c1
c2
c3

Whereas with -u stuff get printed right away:

$ (echo a ; echo b ; echo c ) | parallel -u 'for i in 1 2 3 ; do echo {}$i ; sleep 1 ; done  ' 
a1
b1
c1
a2
b2
c2
a3
b3
c3

In both cases it took 3s to run though so it's really running simultaneously...

111

answered Oct 05 '22 05:10

lemonsqueeze

Related questions
                            
                                popen fails with "sh: <command>: not found"
                            
                                AWK: execute CURL on each line and parse result
                            
                                How to monitor processes that accessed a particular file?
                            
                                PBXCP: No such file or directory, in most recent xcode
                            
                                Sqlite3 Module in Python far Slower SELECT than in Shell
                            
                                Asynchronously consuming pipe with bash
                            
                                App to send hdmi cec command
                            
                                psql shell command execution with \!
                            
                                Listing all variables in a file in bash
                            
                                Using a Git hook to create a commit log and add to the current commit
                            
                                Neovim terminal emulator configuration for Windows 10
                            
                                How to chain multiple command line responses in Python?
                            
                                How to write multi-line command in heroku procfile
                            
                                EXTENDS challenge: preprocessor function macros and class-like oop
                            
                                Is there anyway for a bash (or any other shell) script to detect whether the current terminal supports unicode characters?
                            
                                sh: How do I avoid clobbering numbered file descriptors?
                            
                                Is there a way to make a script that automatically corrects scanned documents?
                            
                                bash variable substitution within command substitution
                            
                                UNIX shell: sort a string by word length and by ASCII order ignoring case
                            
                                Redirection in PHP exec call creates empty file

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

GNU parallel with rsync

Tags:

shell

parallel-processing

rsync

gnu-parallel

Daivid

People also ask

1 Answers

lemonsqueeze

Recent Activity

Donate For Us